Refactor a baseline SQL query writing prompt into a Contrastive Chain-of-Thought version and compare quality on o3-mini.
Refactor a baseline contract review prompt into a Contrastive Chain-of-Thought version and compare quality on Llama 3.1 405B.
Refactor a baseline log anomaly detection prompt into a Contrastive Chain-of-Thought version and compare quality on Gemini 2.5 Pro.
Refactor a baseline sales lead qualification prompt into a Contrastive Chain-of-Thought version and compare quality on Mistral Large.
Refactor a baseline customer support routing prompt into a Contrastive Chain-of-Thought version and compare quality on Grok 3.
Refactor a baseline product requirement drafting prompt into a Contrastive Chain-of-Thought version and compare quality on Claude 3.7 Sonnet.
Refactor a baseline academic grading prompt into a Contrastive Chain-of-Thought version and compare quality on GPT-4o.
Refactor a baseline financial report analysis prompt into a Contrastive Chain-of-Thought version and compare quality on Claude 4.5 Sonnet.
Refactor a baseline academic grading prompt into a Thread-of-Thought version and compare quality on GPT-4.1.
Refactor a baseline SQL query writing prompt into a Thread-of-Thought version and compare quality on o1.
Refactor a baseline contract review prompt into a Thread-of-Thought version and compare quality on Gemini 2.0 Flash.
Refactor a baseline customer support routing prompt into a Thread-of-Thought version and compare quality on Claude 3.5 Sonnet.