rasbt-LLMs-from-scratch/ch04/01_main-chapter-code
2024-05-24 07:20:37 -05:00
..
ch04.ipynb update formatting 2024-05-24 07:20:37 -05:00
exercise-solutions.ipynb update formatting 2024-05-24 07:20:37 -05:00
gpt.py Rename drop_resid to drop_shortcut (#136) 2024-04-28 14:31:27 -05:00
previous_chapters.py Make datesets and loaders compatible with multiprocessing (#118) 2024-04-13 13:57:56 -05:00
README.md flops analysis 2024-05-23 20:35:41 -05:00
tests.py Ch05 supplementary code (#81) 2024-03-19 09:26:26 -05:00

Chapter 4: Implementing a GPT Model from Scratch To Generate Text

  • ch04.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module from the previous chapter, which we import in ch04.ipynb to create the GPT model
  • gpt.py is a standalone Python script file with the code that we implemented thus far, including the GPT model we coded in this chapter