Commit Graph

5 Commits

Author SHA1 Message Date
Contributor
ebf357841b fix(train): make NaN fast-fail check explicit 2026-03-11 04:28:08 +00:00
haosenwang1018
b5ba8ac00d fix NaN loss not caught by fast-fail check
`train_loss_f > 100` silently passes on NaN because IEEE 754 NaN
comparisons always return False. When an agent experiment produces
NaN (e.g. from an aggressive LR change), the run wastes the full
5-minute budget instead of failing fast.

`not (x <= 100)` catches both >100 and NaN with no added complexity.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-09 23:51:02 +08:00
Andrej Karpathy
8a5c4869bd bunch of small changes to docs and files, and a teaser figure with a blooper :) 2026-03-07 19:00:04 +00:00
Marcin Bogdanski
17b480aa65 add fallback FA3 kernel for non-Hopper GPUs 2026-03-07 01:31:48 +00:00
Andrej Karpathy
b11d6f283f initial commit 2026-03-06 21:58:52 +00:00