Token-cost and latency reduction playbook for an A/B test interpretation prompt, covering the following model and evaluation-method pairings:

- DeepSeek-V3, judged by Trulens feedback functions
- Mistral Large, judged by DeepEval metrics
- o3, judged by promptfoo assertions
- Claude 3.7 Sonnet, judged by JSON schema validation
- Llama 3.3 70B, judged by BERTScore
- o3, judged by LLM-as-judge
- Claude 3.7 Sonnet, judged by BLEU/ROUGE
- Llama 3.3 70B, judged by rubric scoring
- Qwen 2.5 72B, judged by G-Eval
- Grok 3, judged by Trulens feedback functions
- Claude Opus 4.5, judged by promptfoo assertions
- Mistral Large, judged by JSON schema validation