Token-cost and latency reduction playbook for a sales lead qualification prompt running on Claude 4 Sonnet, judged by rubric scoring.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Claude Haiku 4, judged by G-Eval.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Mistral Small 3, judged by DeepEval metrics.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on GPT-4.1, judged by embedding distance.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Claude 4 Sonnet, judged by JSON schema validation.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on Gemini 2.0 Flash, judged by regex match checks.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on o1-mini, judged by factuality with retrieval.
Token-cost and latency reduction playbook for a sales lead qualification prompt running on GPT-4.1, judged by exact match.
Token-cost and latency reduction playbook for a resume screening prompt running on Gemini 2.0 Flash, judged by semantic similarity.
Token-cost and latency reduction playbook for a resume screening prompt running on DeepSeek-R1, judged by human pairwise comparison.
Token-cost and latency reduction playbook for a resume screening prompt running on Llama 3.1 405B, judged by rubric scoring.
Token-cost and latency reduction playbook for a resume screening prompt running on Mistral Small 3, judged by rubric scoring.