mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2026-04-11 02:11:40 +08:00

History

Sebastian Raschka be5e2a3331 Readability and code quality improvements (#959 ) * Consistent dataset naming * consistent section headers		2026-02-17 18:44:56 -06:00
..
create-preference-data-ollama.ipynb	Switch from urllib to requests to improve reliability (#867 )	2025-10-07 15:22:59 -05:00
dpo-from-scratch.ipynb	Readability and code quality improvements (#959 )	2026-02-17 18:44:56 -06:00
instruction-data-with-preference.json	Generate preference dataset with Llama 3.1 70B (#289 )	2024-07-27 09:44:04 -05:00
previous_chapters.py	Make quote style consistent (#891 )	2025-10-21 19:42:33 -05:00
README.md	Direct Preference Optimization from scratch (#294 )	2024-08-04 08:57:36 -05:00

Chapter 7: Finetuning to Follow Instructions

create-preference-data-ollama.ipynb: A notebook that creates a synthetic dataset for preference finetuning dataset using Llama 3.1 and Ollama
dpo-from-scratch.ipynb: This notebook implements Direct Preference Optimization (DPO) for LLM alignment