rasbt-LLMs-from-scratch/.github/workflows
Aviral Garg 27d52d6378
Fix MHAEinsum weight dimension bug when d_in != d_out (#857) (#893)
* Fix MHAEinsum weight dimension bug when d_in != d_out (#857)

Previously MHAEinsum initialized weight matrices with shape (d_out, d_in) and used inappropriate einsum notation, causing failures for non-square input-output dimensions. This commit corrects weight initialization to shape (d_in, d_out), updates einsum notation to 'bnd,do->bno', and adds three unit tests to verify parity across different d_in and d_out settings. All tests pass successfully.

* use pytest

* Update .gitignore

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2025-10-31 21:45:31 -05:00
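For readers who want to see what the einsum equation in the commit message above computes, here is a minimal sketch in plain Python. It is not the repository's MHAEinsum code. The helper name `project` and the toy values are hypothetical and exist only for illustration. It assumes the input has shape (b, n, d_in) and the weight has shape (d_in, d_out), so that 'bnd,do->bno' contracts over the shared d axis; in PyTorch the same contraction would be written as torch.einsum("bnd,do->bno", x, W).

```python
# Reference implementation of the contraction 'bnd,do->bno' using nested lists.
# x has shape (b, n, d_in); w has shape (d_in, d_out); the result has shape (b, n, d_out).
# `project` is a hypothetical helper name, not part of the repository.

def project(x, w):
    d_in = len(w)        # rows of w must match the last axis of x
    d_out = len(w[0])
    out = []
    for batch in x:
        rows = []
        for token in batch:
            if len(token) != d_in:
                raise ValueError(
                    f"weight has {d_in} rows but input has {len(token)} features"
                )
            # Sum over the shared index d for every output index o.
            rows.append(
                [sum(token[d] * w[d][o] for d in range(d_in)) for o in range(d_out)]
            )
        out.append(rows)
    return out


# Toy sizes with d_in != d_out, the case the commit message is about.
b, n, d_in, d_out = 2, 3, 4, 5
x = [[[float(i + j + k) for k in range(d_in)] for j in range(n)] for i in range(b)]

# Weight stored as (d_in, d_out): ones on the diagonal, zeros elsewhere.
w = [[1.0 if d == o else 0.0 for o in range(d_out)] for d in range(d_in)]

y = project(x, w)
shape = (len(y), len(y[0]), len(y[0][0]))

# A weight stored as (d_out, d_in) no longer lines up with the d axis of x.
w_transposed = [[0.0] * d_in for _ in range(d_out)]
try:
    project(x, w_transposed)
    mismatch_detected = False
except ValueError:
    mismatch_detected = True
```

With square weights (d_in == d_out) both layouts have matching sizes, which may be why a shape problem of this kind only surfaces once the two dimensions differ.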
basic-tests-latest-python.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-linux-uv.yml Fix MHAEinsum weight dimension bug when d_in != d_out (#857) (#893) 2025-10-31 21:45:31 -05:00
basic-tests-macos-uv.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-old-pytorch.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-pip.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-pixi.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-pytorch-rc.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-windows-uv-pip.yml Switch from urllib to requests to improve reliability (#867) 2025-10-07 15:22:59 -05:00
basic-tests-windows-uv-pip.yml.disabled Simplify the setup in CI (#542) 2025-02-20 11:56:52 -06:00
basic-tests-windows-uv.yml.disabled add HF equivalency tests for standalone nbs (#774) 2025-08-18 18:58:46 -05:00
check-links.yml Qwen3 Coder Flash & MoE from Scratch (#760) 2025-08-01 19:13:17 -05:00
check-spelling-errors.yml Simplify the setup in CI (#542) 2025-02-20 11:56:52 -06:00
pep8-linter.yml Simplify the setup in CI (#542) 2025-02-20 11:56:52 -06:00