Tag
Most LLM features ship on vibes — skim three outputs, merge. Here's the 40-line Python eval harness with JSON fixtures and a judge-model gate you can wire into CI today.
May 06, 2026 AI AgentsOnce your agent calls tools, prompt injection is a live attack path. Here's the exploit in real Python — and the Pydantic fix that belongs in your dispatch layer.
May 05, 2026