Token-cost and latency reduction playbook for a funnel analysis prompt running on Claude Haiku 4, judged by tool-call accuracy.
Token-cost and latency reduction playbook for a funnel analysis prompt running on Qwen 2.5 72B, judged by regex match checks.
Token-cost and latency reduction playbook for a funnel analysis prompt running on Command R+, judged by factuality with retrieval.
Token-cost and latency reduction playbook for a funnel analysis prompt running on Claude Haiku 4, judged by BLEU/ROUGE.
Token-cost and latency reduction playbook for a funnel analysis prompt running on DeepSeek-R1, judged by semantic similarity.
Token-cost and latency reduction playbook for a funnel analysis prompt running on Qwen 2.5 72B, judged by human pairwise comparison.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on GPT-4o-mini, judged by G-Eval.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Claude 3.7 Sonnet, judged by Trulens feedback functions.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Claude 4.5 Sonnet, judged by DeepEval metrics.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Claude Haiku 4, judged by DeepEval metrics.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Gemini 2.0 Flash, judged by promptfoo assertions.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on DeepSeek-R1, judged by promptfoo assertions.