rasbt-LLMs-from-scratch/tool_calling_experiment
2026-02-15 14:10:16 +02:00
..
architecture.py Add tool calling experiment files 2026-02-15 13:09:34 +02:00
config.py Fix config: set qkv_bias=True for pretrained checkpoint compatibility 2026-02-15 13:46:29 +02:00
dataset_prep.py Fix dataset_prep: add collation and padding for variable lengths 2026-02-15 13:54:40 +02:00
execution_sandbox.py Add tool calling experiment files 2026-02-15 13:09:34 +02:00
inference_colab.py Add tool calling experiment files 2026-02-15 13:09:34 +02:00
tokenizer_utils.py Add tool calling experiment files 2026-02-15 13:09:34 +02:00
train_colab.py Optimize training: add gradient accumulation & increase batch size 2026-02-15 14:10:16 +02:00