rasbt-LLMs-from-scratch/ch03
2024-04-18 05:56:23 -05:00
..
01_main-chapter-code Use dim=-1 for consistency (#122) 2024-04-18 05:56:23 -05:00
02_bonus_efficient-multihead-attention cleanup 2024-04-04 07:58:41 -05:00
README.md mha variants 2024-03-06 08:30:32 -06:00

Chapter 3: Coding Attention Mechanisms