Micro LLAMA

This is a tiny implementation of the LLAMA 3 model architecture for didactic purposes. The entire implementation is approximately 180 lines of code, hence the name "micro".

The code uses the smallest LLAMA 3 model, i.e., the 8B-parameter one. This model is still 15GB in size and requires about 30GB of memory to run. By default, the code runs on the CPU, so beware of the memory impact.
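The two figures above can be roughly reproduced with back-of-the-envelope arithmetic. The sketch below is only an illustration and rests on assumptions not stated in this README: that the model has a nominal 8 billion parameters, that the downloaded checkpoint stores 2 bytes per parameter (e.g. bfloat16), and that the weights take 4 bytes per parameter (float32) when run on the CPU. The helper `model_size_gib` is a hypothetical name, not part of micro_llama.py.

```python
# Rough memory estimate for an 8B-parameter model.
# Assumptions (not taken from the repository): nominal parameter count,
# 2-byte weights on disk and 4-byte weights in memory.

def model_size_gib(num_params: float, bytes_per_param: int) -> float:
    """Return the size of the weights in GiB (2**30 bytes)."""
    return num_params * bytes_per_param / 2**30

NUM_PARAMS = 8e9  # nominal "8B" parameter count

checkpoint_gib = model_size_gib(NUM_PARAMS, 2)  # weights stored with 2 bytes each
runtime_gib = model_size_gib(NUM_PARAMS, 4)     # weights held with 4 bytes each

print(f"checkpoint: about {checkpoint_gib:.1f} GiB")
print(f"in memory:  about {runtime_gib:.1f} GiB")
```

Under these assumptions the estimates land close to the 15GB and 30GB quoted above. Activations and other runtime buffers come on top of the weights, so treat the second number as a lower bound.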

Start exploring the code using the notebook micro_llama.ipynb.

The model's code itself is entirely contained in the micro_llama.py file.

Requirements

Use the following commands to create and activate a suitable Conda environment, called micro_llama:

conda env create --file conda-env.yaml --yes
conda activate micro_llama

You can remove the Conda environment as follows:

conda remove -n micro_llama --all --yes

References

This implementation is inspired by:
