Category Not Found

1252 prompts

Cut cost-per-correct-answer by 30% on A/B test interpretation Prompt for o1

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on o1, judged by factuality with retrieval.

Cut p95 latency by 30% on A/B test interpretation Prompt for o3

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on o3, judged by factuality with retrieval.

Cut token cost by 30% on A/B test interpretation Prompt for Grok 3

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on Grok 3, judged by LLM-as-judge.

Cut mean tokens per request by 30% on A/B test interpretation Prompt for GPT-4o

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on GPT-4o, judged by LLM-as-judge.

Cut p95 latency by 30% on A/B test interpretation Prompt for GPT-4o-mini

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on GPT-4o-mini, judged by exact match.

Cut token cost by 30% on A/B test interpretation Prompt for Claude 4 Sonnet

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on Claude 4 Sonnet, judged by BLEU/ROUGE.

Cut mean tokens per request by 30% on A/B test interpretation Prompt for Claude Opus 4.5

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on Claude Opus 4.5, judged by BLEU/ROUGE.

Cut cost-per-correct-answer by 30% on A/B test interpretation Prompt for Gemini 2.5 Pro

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on Gemini 2.5 Pro, judged by semantic similarity.

Cut token cost by 30% on A/B test interpretation Prompt for DeepSeek-V3

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on DeepSeek-V3, judged by semantic similarity.

Cut mean tokens per request by 30% on A/B test interpretation Prompt for Llama 3.3 70B

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on Llama 3.3 70B, judged by human pairwise comparison.

Cut cost-per-correct-answer by 30% on A/B test interpretation Prompt for Mistral Small 3

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on Mistral Small 3, judged by rubric scoring.

Cut token cost by 30% on A/B test interpretation Prompt for o1

Token-cost and latency reduction playbook for an A/B test interpretation prompt running on o1, judged by rubric scoring.
