LLM Speedrunner: Eval for frontier models to reproduce scientific findings

by zerojames on 6/27/2025, 12:34:35 PM with 0 comments