rasbt-LLMs-from-scratch/ch07
2024-08-06 19:45:11 -05:00
..
01_main-chapter-code minor DPO fixes (#298) 2024-08-05 08:40:46 -05:00
02_dataset-utilities fix typos, add codespell pre-commit hook (#264) 2024-07-16 07:07:04 -05:00
03_model-evaluation Fix 8-billion-parameter spelling 2024-07-28 10:48:56 -05:00
04_preference-tuning-with-dpo add more explanations 2024-08-06 19:45:11 -05:00
05_dataset-generation Fix 8-billion-parameter spelling 2024-07-28 10:48:56 -05:00
README.md Direct Preference Optimization from scratch (#294) 2024-08-04 08:57:36 -05:00

Chapter 7: Finetuning to Follow Instructions

Main Chapter Code

Bonus Materials