rasbt-LLMs-from-scratch/ch05/07_gpt_to_llama
casinca 152a087a37
removing unused RoPE parameters (#590)
* removing unused RoPE parameters

* remove redundant context_length in GQA

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2025-03-31 17:10:39 -05:00
..
tests Auto download DPO dataset if not already available in path (#479) 2025-01-12 12:27:28 -06:00
config.json move access token to config.json 2024-09-23 08:56:16 -05:00
converting-gpt-to-llama2.ipynb Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
converting-llama2-to-llama3.ipynb Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
previous_chapters.py GPT to Llama (#368) 2024-09-23 07:34:06 -05:00
README.md Implement Llama 3.2 (#383) 2024-10-05 07:30:47 -05:00
requirements-extra.txt fixed Llama 2 to 3.2 NBs (#388) 2024-10-06 09:56:55 -05:00
standalone-llama32-mem-opt.ipynb removing unused RoPE parameters (#590) 2025-03-31 17:10:39 -05:00
standalone-llama32.ipynb Uv workflow improvements (#531) 2025-02-16 13:16:51 -06:00

Converting GPT to Llama

This folder contains code for converting the GPT implementation from chapter 4 and 5 to Meta AI's Llama architecture in the following recommended reading order: