Joined 9/17/2013, 3:36:27 PM has 1674 karma
OpenRISC Emulator running Linux: http://s-macke.github.com/jor1k/
My Github site: https://github.com/s-macke
My personal site: www.simulationcorner.net
Reinforcement Learning Finetunes Small Subnetworks in Large Language Models
The simplest, fastest repository for training/finetuning small-sized VLMs
Gemini 2.5 Pro won Pokémon Blue in 106k moves
SWE-Smith: Scaling Data for Software Engineering Agents
Sam Altman: we added one million users in the last hour
Measuring AI Ability to Complete Long Tasks
FrontierMath Was Funded by OpenAI
Grokking at the Edge of Numerical Stability
LLMs struggle with perception, not reasoning, in ARC-AGI
A Llama 70B finetune that has reflection baked into it's weights