Agent loop that critiques and revises its own output for meeting note extraction. Full trace capture via OpenTelemetry + Honeycomb, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for onboarding coordination. Full trace capture via Braintrust, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for SEO keyword research. Full trace capture via LangSmith, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for incident postmortem drafting. Full trace capture via Weights & Biases Weave, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for investor update drafting. Full trace capture via Braintrust, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for marketplace moderation. Full trace capture via Langfuse, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for code PR review. Full trace capture via Weights & Biases Weave, retry budget, and ship criteria.
Agent loop that critiques and revises its own output for customer support triage. Full trace capture via Helicone, retry budget, and ship criteria.
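The critique-and-revise briefs above share one control flow: draft, critique, revise within a retry budget, and ship only when a criterion is met. A minimal sketch of that flow, assuming a single numeric critique score and a threshold as the ship criterion. Every name here (`critique_and_revise`, `Attempt`, `LoopResult`, the stub functions) is hypothetical, and the in-memory `trace` list stands in for whichever tracing backend a given brief names; no vendor API is called.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Attempt:
    draft: str
    score: float
    feedback: str


@dataclass
class LoopResult:
    output: str
    shipped: bool
    trace: List[Attempt] = field(default_factory=list)


def critique_and_revise(
    task: str,
    generate: Callable[[str], str],
    critique: Callable[[str, str], Tuple[float, str]],
    revise: Callable[[str, str, str], str],
    max_retries: int = 3,
    ship_threshold: float = 0.8,
) -> LoopResult:
    """Run one initial draft plus at most `max_retries` revisions."""
    trace: List[Attempt] = []
    draft = generate(task)
    for attempt in range(max_retries + 1):
        score, feedback = critique(task, draft)
        trace.append(Attempt(draft, score, feedback))  # record every attempt
        if score >= ship_threshold:  # assumed ship criterion
            return LoopResult(draft, True, trace)
        if attempt < max_retries:  # budget left: revise and try again
            draft = revise(task, draft, feedback)
    # Budget exhausted: return the last critiqued draft, flagged as not shipped.
    return LoopResult(draft, False, trace)


# Stub model calls (assumptions) so the sketch runs without any LLM.
def stub_generate(task: str) -> str:
    return "Action item: send the deck."


def stub_critique(task: str, draft: str) -> Tuple[float, str]:
    if "owner:" in draft:
        return 1.0, "ok"
    return 0.0, "name an owner"


def stub_revise(task: str, draft: str, feedback: str) -> str:
    return draft + " owner: TBD"


result = critique_and_revise("meeting notes", stub_generate, stub_critique, stub_revise)
failed = critique_and_revise(
    "meeting notes", stub_generate, lambda t, d: (0.0, "no"), stub_revise, max_retries=2
)
```

Returning the unshipped draft together with its trace, rather than raising, keeps the failure path inspectable; a real loop would presumably export each `Attempt` as a span or log record in the chosen tracing tool.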
Implement an entity memory system for a Claude Agent SDK agent handling bug triage from Sentry logs. Vector store: Milvus. Covers write, retrieve, prune, and eval.
Implement an entity memory system for a Mastra agent handling investor update drafting. Vector store: Turbopuffer. Covers write, retrieve, prune, and eval.
Implement an entity memory system for a Pydantic AI agent handling invoice reconciliation. Vector store: Turbopuffer. Covers write, retrieve, prune, and eval.
Implement an entity memory system for a Smolagents agent handling sales lead enrichment. Vector store: Turbopuffer. Covers write, retrieve, prune, and eval.
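The four operations named in the entity memory briefs (write, retrieve, prune, eval) can be sketched without committing to a vector store. The following is an illustration under stated assumptions, not the Milvus or Turbopuffer API: a Python dict stands in for the store, a bag-of-words `Counter` stands in for a real embedding, pruning keeps the newest facts per entity, and eval is a simple hit rate at k. All class and function names are hypothetical.

```python
import math
import time
from collections import Counter
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase bag of words."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


@dataclass
class Fact:
    entity: str
    text: str
    written_at: float
    vector: Counter


class EntityMemory:
    def __init__(self, max_facts_per_entity: int = 3) -> None:
        self.max_facts_per_entity = max_facts_per_entity
        self.facts: Dict[str, List[Fact]] = {}

    def write(self, entity: str, text: str, now: Optional[float] = None) -> None:
        stamp = time.time() if now is None else now
        self.facts.setdefault(entity, []).append(Fact(entity, text, stamp, embed(text)))

    def retrieve(self, query: str, k: int = 2) -> List[Fact]:
        """Return the k facts most similar to the query, best first."""
        q = embed(query)
        everything = [f for facts in self.facts.values() for f in facts]
        return sorted(everything, key=lambda f: cosine(q, f.vector), reverse=True)[:k]

    def prune(self) -> int:
        """Keep only the newest facts per entity; return how many were dropped."""
        removed = 0
        for entity, facts in self.facts.items():
            newest = sorted(facts, key=lambda f: f.written_at, reverse=True)
            removed += max(0, len(newest) - self.max_facts_per_entity)
            self.facts[entity] = newest[: self.max_facts_per_entity]
        return removed


def hit_rate(memory: EntityMemory, cases: List[Tuple[str, str]], k: int = 1) -> float:
    """Eval: fraction of (query, expected_entity) cases found in the top k."""
    hits = sum(
        any(f.entity == expected for f in memory.retrieve(query, k))
        for query, expected in cases
    )
    return hits / len(cases) if cases else 0.0


mem = EntityMemory(max_facts_per_entity=2)
mem.write("acme", "acme invoice 1041 overdue by ten days", now=1)
mem.write("acme", "acme prefers net 30 payment terms", now=2)
mem.write("acme", "acme billing contact is dana", now=3)
mem.write("globex", "globex disputed invoice 2207 amount", now=1)
removed = mem.prune()
top = mem.retrieve("payment terms", k=1)
score = hit_rate(mem, [("payment terms", "acme"), ("disputed amount", "globex")], k=1)
```

Recency-based pruning is only one possible policy; pruning by retrieval frequency or by superseded facts would fit the same interface. With a real vector store, `retrieve` would delegate the similarity search to the store and `prune` would issue deletes against it.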