mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2026-04-11 14:21:41 +08:00
Converting GPT to Llama
This folder contains code for converting the GPT implementation from chapters 4 and 5 to Meta AI's Llama architecture. The notebooks are listed in the recommended reading order:
- converting-gpt-to-llama2.ipynb: contains code to convert GPT to Llama 2 7B step by step and to load the pretrained weights from Meta AI
- converting-llama2-to-llama3.ipynb: contains code to convert the Llama 2 model to Llama 3, Llama 3.1, and Llama 3.2
- standalone-llama32.ipynb: a standalone notebook implementing Llama 3.2
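
To give a flavor of what such a conversion involves, one well-known difference between GPT-style and Llama-style transformer blocks is that Llama uses RMSNorm where GPT uses LayerNorm. The following is a minimal pure-Python sketch of that difference, not code taken from the notebooks; the function names `layer_norm` and `rms_norm`, the `eps` default, and the omission of LayerNorm's learnable scale and shift are assumptions made for illustration.

```python
import math


def layer_norm(x, eps=1e-5):
    """GPT-style LayerNorm (learnable scale/shift omitted for brevity):
    subtract the mean, then divide by the standard deviation."""
    mean = sum(x) / len(x)
    var = sum((v - mean) ** 2 for v in x) / len(x)
    return [(v - mean) / math.sqrt(var + eps) for v in x]


def rms_norm(x, weight=None, eps=1e-5):
    """Llama-style RMSNorm: divide by the root mean square of the inputs.
    There is no mean subtraction and no bias term, only a per-feature scale."""
    if weight is None:
        weight = [1.0] * len(x)  # hypothetical default: identity scale
    rms = math.sqrt(sum(v * v for v in x) / len(x) + eps)
    return [w * v / rms for v, w in zip(x, weight)]


x = [1.0, 2.0, 3.0, 4.0]
print("LayerNorm:", layer_norm(x))  # centered around zero
print("RMSNorm:  ", rms_norm(x))    # rescaled only, signs preserved
```

In a real model these operations run over the embedding dimension of tensors rather than over Python lists; the list-based version is only meant to make the arithmetic easy to follow.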