rasbt-LLMs-from-scratch/ch07/04_preference-tuning-with-dpo
Sebastian Raschka be5e2a3331
Readability and code quality improvements (#959)
* Consistent dataset naming

* consistent section headers
2026-02-17 18:44:56 -06:00
..
create-preference-data-ollama.ipynb Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
dpo-from-scratch.ipynb Readability and code quality improvements (#959) 2026-02-17 18:44:56 -06:00
instruction-data-with-preference.json Generate preference dataset with Llama 3.1 70B (#289) 2024-07-27 09:44:04 -05:00
previous_chapters.py Make quote style consistent (#891) 2025-10-21 19:42:33 -05:00
README.md Direct Preference Optimization from scratch (#294) 2024-08-04 08:57:36 -05:00

Chapter 7: Finetuning to Follow Instructions