This repo hosts a Jupyter Notebook that demonstrates the architecture, training, and inference process of the LLAMA3 language model. It includes sections for model initialization, training, and inference, with detailed explanations of the implementation. The test dataset is the tiny shakespeare dataset, included in the repo.
- Python 3.8 or higher
 - Required libraries:
torchtransformersnumpymatplotlib(optional for visualizations)