Agent Eval Suite Langsmith
Production agent eval suite LangSmith dataset curation + Promptfoo assertion framework +…
Forged from real client work, proof attached. Pick a piece or take the whole system.
Browse the full catalog → Browse ready-made kits → Build your own set →Skill upgrader to Ultra (Pro+) standard
An agent that upgrades existing skills to a higher standard. It measures against a rubric, adds edge cases, anti-patterns, cross-skill awareness and mental models, and promotes recurring patterns into durable rules. It scores every skill against a quality rubric before handing it back, and rejects generic mental-model padding outright.
Prices include 20% VAT. · Forged on real agency work · one-time, no lock-in
Inside the run · no black box
Upgrading a skill is a three-phase loop: score it against a 12-criterion rubric, transform without deleting working content, then re-score until the threshold is genuinely met. Every claim is proven with command output, never impression.
skill-alchemist · core
core active · 5 lines
Upgrading a thin skill to production quality
Adding edge cases and anti-patterns to a skill
Injecting cross-skill awareness and mental models
Auditing a set of skills against a quality rubric
Promoting repeated patterns into rules
Drag time forward. Watch what stays.
Forever
That's what owning means.
ai writing tool: subscription
expired · access lostanalytics suite: subscription
expired · access lostdesign platform: subscription
expired · access lost(nothing left)
Skills that handle the hard cases, not just the happy path
license: perpetualConsistent quality across your whole skill library
license: perpetualKnowledge captured once, reused forever
license: perpetualA measurable bar instead of subjective polish
license: perpetualsubscriptions expire · deeds don't
Pick a piece up. Watch it work.
Skill upgrades to the 7-dimension Ultra standard with skill-specific content
6 parts · one working system · ships instantly by email
Teams maintaining a skill library who want a consistent quality bar.
then this was forged for you.Universal by design: these run in any AI. Delivered in the open Agent Skills + MCP format (native in Claude); ChatGPT, Gemini, Cursor and Copilot adapt the same files their own way.
Yes, that's the core use case. It takes an existing thin skill, measures it against the quality rubric, then adds edge cases, anti-patterns, cross-skill links and skill-specific mental models until it clears the Ultra bar.
Every skill gets scored against the rubric before it ships back. Generic mental-model padding: a bare SOLID or DRY stub: is rejected outright, and each skill needs at least two typed, bidirectional cross-skill connections. The bar is measurable, not subjective polish.
No. Its scope is upgrading skills that already exist; creation is a separate job. What it adds beyond depth is a monthly pattern-promotion pass, where recurring lessons get turned into permanent skill content instead of being relearned.
By email right after purchase: ready to run, downloaded instantly, no setup wait.
A one-time purchase; no subscription or hidden fees. VAT (20%) is included.
As a digital product, it can’t be refunded once downloaded. That’s why we show exactly what’s inside and who it’s for, right here.