Hacker News Clone

More capable models are better at in-context scheming

by miles on 6/20/2025, 9:28:19 PM with 1 comments

by chiph2o on 6/20/2025, 10:03:00 PM
in-context scheming = alignment red flag
More capability + low clarity on intent = low trust