Joined 4/9/2025, 1:33:39 PM has 7 karma
Show HN: Open Operator Evals – real-world benchmarks for LLM web agents