rasbt-LLMs-from-scratch

mirror of https://github.com/rasbt/LLMs-from-scratch.git synced 2026-04-11 14:21:41 +08:00

History

Sebastian Raschka e742d8af2c Improve MoE implementation (#841 )		2025-09-22 15:21:06 -05:00
..
__init__.py	Llama 3 KV Cache (#685 )	2025-06-21 10:55:20 -05:00
generate.py	Add defensive context trimming for multiturn (#815 )	2025-09-09 20:19:00 -05:00
gpt2.py	remove redundant next_cache (#817 )	2025-09-11 15:16:08 -05:00
llama3.py	remove redundant next_cache (#817 )	2025-09-11 15:16:08 -05:00
qwen3.py	Improve MoE implementation (#841 )	2025-09-22 15:21:06 -05:00
utils.py	Improve KV cache code for torch.compile (#705 )	2025-06-23 18:08:49 -05:00