rasbt-LLMs-from-scratch/ch04/01_main-chapter-code
2024-03-09 17:42:31 -06:00
..
figures use smaller number of tokens to emphasize next token prediction goal 2024-02-15 20:09:20 -06:00
ch04.ipynb add dropout for embedding layers 2024-03-04 07:05:06 -06:00
exercise-solutions.ipynb add dropout for embedding layers 2024-03-04 07:05:06 -06:00
gpt.py remove redundant unsqueeze in mask 2024-03-09 17:42:31 -06:00
previous_chapters.py remove redundant unsqueeze in mask 2024-03-09 17:42:31 -06:00
README.md add and update readme files 2024-02-05 06:51:58 -06:00

Chapter 4: Implementing a GPT model from Scratch To Generate Text

  • ch04.ipynb contains all the code as it appears in the chapter
  • previous_chapters.py is a Python module that contains the MultiHeadAttention module from the previous chapter, which we import in ch04.ipynb to create the GPT model
  • gpt.py is a standalone Python script file with the code that we implemented thus far, including the GPT model we coded in this chapter