How an AI co-scientist found MaskGXT

Each node is one model the agent wrote, scored on validation METRe. The budget escalates 2h training → 12h training → 30m sampling; the larger seed nodes mark each transition. Drag to pan, scroll to zoom, hover a node for details, and click a node to expand or collapse its subtree.

Legend: status (ran ok, buggy) · size = METRe · path to best model · seed (budget transition) · stage: 2h, 12h, sampling
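The tree described above can be pictured as a simple data structure. The following is a minimal sketch, not the agent's actual code: the `Node` class, the `path_to_best` helper, the node names, and all scores are hypothetical and made up for illustration. Whether a lower or higher METRe is better is not stated here, so the direction is left as a parameter.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    """One model the agent wrote (hypothetical structure)."""
    name: str
    stage: str                      # "2h", "12h", or "sampling"
    score: Optional[float]          # validation METRe; None if the run was buggy
    children: List["Node"] = field(default_factory=list)


def path_to_best(root: Node, lower_is_better: bool = True) -> Tuple[List[str], Optional[float]]:
    """Return the names on the root-to-best-node path and the best score.

    Buggy nodes (score is None) are never selected as best, but the search
    still descends through them. The metric direction is an assumption.
    """
    best_path: List[str] = []
    best_score: Optional[float] = None

    def visit(node: Node, path: List[str]) -> None:
        nonlocal best_path, best_score
        path = path + [node.name]
        if node.score is not None:
            if best_score is None:
                better = True
            elif lower_is_better:
                better = node.score < best_score
            else:
                better = node.score > best_score
            if better:
                best_path, best_score = path, node.score
        for child in node.children:
            visit(child, path)

    visit(root, [])
    return best_path, best_score


# Toy tree with made-up scores, only to exercise the helper.
tree = Node("seed-2h", "2h", 0.9, [
    Node("a", "2h", None),                       # a buggy run
    Node("b", "2h", 0.7, [
        Node("seed-12h", "12h", 0.5),            # seed node at a budget transition
    ]),
])

best_path, best_score = path_to_best(tree)
print(best_path, best_score)
```

Passing `lower_is_better=False` flips the comparison, so the same helper works for either metric direction.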