rasbt-LLMs-from-scratch/ch05
2025-03-31 16:49:10 -05:00
..
01_main-chapter-code Alt weight loading code via PyTorch (#585) 2025-03-27 20:10:23 -05:00
02_alternative_weight_loading Alt weight loading code via PyTorch (#585) 2025-03-27 20:10:23 -05:00
03_bonus_pretraining_on_gutenberg Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
04_learning_rate_schedulers Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
05_bonus_hparam_tuning Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
06_user_interface Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
07_gpt_to_llama remove redundant context_length in GQA 2025-03-31 16:49:10 -05:00
08_memory_efficient_weight_loading Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
09_extending-tokenizers Add PyPI package (#576) 2025-03-23 19:28:49 -05:00
10_llm-training-speed Adjust comment to save compiled model (#583) 2025-03-27 10:43:45 -05:00
README.md Add readme (#577) 2025-03-23 19:35:12 -05:00

Chapter 5: Pretraining on Unlabeled Data

 

Main Chapter Code

 

Bonus Materials



Link to the video