Pragma
Python · MIT · Claude Code plugin + CLI · pytest, Vitest, Jest · three layered tiers — AST, coverage, LLM judge.
Watches every test file your AI assistant writes and blocks the
edit when the test is gamed. Three tiers, each catching what the
previous one misses. Tier 1 — fast deterministic
AST classifier (~10 ms): tautologies, mocked-away targets,
swallowed exceptions, conditional assertions, monkeypatched fakes,
vi.mock / jest.mock on default exports.
Tier 2 — runs the test under coverage and asks
whether the target's lines actually executed; if not, it isn't a
test. Tier 3 — a small LLM (DeepSeek by default,
any OpenAI-compatible endpoint, Ollama works) reads test +
production code and decides whether the test verifies behaviour or
just confidently asserts on its own mocks. Production target is
inferred from imports, expected outcome from the test name. Zero
config to start.