Joined 3/20/2021, 12:34:48 PM; has 36 karma
Managing LLM application performance through code standards
Catching Claude Cheating
Scanning AI application code for vulnerabilities and performance issues
Show HN: A static scanner for LLM app code
The Model Trust Score: The Framework for Strategic Enterprise AI Model Selection
Evals are not all you need
An AI Cyber Incident in Plain Sight
AI agent using Anthropic's tool calling and the Pandas Python library
Following LLM Manufacturer's Instructions
AI Cybersecurity Lessons from GenAI