Category Not Found

1252 prompts

Sort:

Cut mean tokens per request by 30% on math word problems Prompt for Claude 3.7 Sonnet

Token-cost and latency reduction playbook for a math word problems prompt running on Claude 3.7 Sonnet, judged by human pairwise comparison.

Cut cost-per-correct-answer by 30% on math word problems Prompt for Claude 4.5 Sonnet

Token-cost and latency reduction playbook for a math word problems prompt running on Claude 4.5 Sonnet, judged by rubric scoring.

Cut token cost by 30% on math word problems Prompt for Gemini 2.5 Pro

Token-cost and latency reduction playbook for a math word problems prompt running on Gemini 2.5 Pro, judged by G-Eval.

Cut mean tokens per request by 30% on math word problems Prompt for DeepSeek-V3

Token-cost and latency reduction playbook for a math word problems prompt running on DeepSeek-V3, judged by G-Eval.

Cut cost-per-correct-answer by 30% on math word problems Prompt for Llama 3.3 70B

Token-cost and latency reduction playbook for a math word problems prompt running on Llama 3.3 70B, judged by Trulens feedback functions.

Cut token cost by 30% on math word problems Prompt for Mistral Large

Token-cost and latency reduction playbook for a math word problems prompt running on Mistral Large, judged by Trulens feedback functions.

Cut mean tokens per request by 30% on math word problems Prompt for Qwen 2.5 72B

Token-cost and latency reduction playbook for a math word problems prompt running on Qwen 2.5 72B, judged by DeepEval metrics.

Cut cost-per-correct-answer by 30% on math word problems Prompt for o3

Token-cost and latency reduction playbook for a math word problems prompt running on o3, judged by promptfoo assertions.

Cut p95 latency by 30% on math word problems Prompt for Grok 3

Token-cost and latency reduction playbook for a math word problems prompt running on Grok 3, judged by promptfoo assertions.

Cut mean tokens per request by 30% on math word problems Prompt for GPT-4o

Token-cost and latency reduction playbook for a math word problems prompt running on GPT-4o, judged by embedding distance.

Reduce hallucination rate on contract review Prompt via manual grid search over temperature+system

Use manual grid search over temperature+system to optimize a contract review prompt on Claude 4 Sonnet against hallucination rate without regressing safety.

Reduce user satisfaction (CSAT) on customer support routing Prompt via manual grid search over temperature+system

Use manual grid search over temperature+system to optimize a customer support routing prompt on o3-mini against user satisfaction (CSAT) without regressing safety.

💬ChatGPT

352552