Refactor a baseline financial report analysis prompt into a ReAct version and compare quality on DeepSeek-V3.
Refactor a baseline scientific literature review prompt into a ReAct version and compare quality on Claude 3.7 Sonnet.
Refactor a baseline log anomaly detection prompt into a ReAct version and compare quality on o3.
Refactor a baseline product requirement drafting prompt into a ReAct version and compare quality on Llama 3.3 70B.
Refactor a baseline threat modeling prompt into a ReAct version and compare quality on Claude 4 Sonnet.
Refactor a baseline bug root-cause analysis prompt into a ReAct version and compare quality on Grok 3.
Refactor a baseline funnel analysis prompt into a ReAct version and compare quality on Claude 3.7 Sonnet.
Refactor a baseline medical triage prompt into a ReAct version and compare quality on DeepSeek-V3.
Refactor a baseline incident post-mortem prompt into a ReAct version and compare quality on Qwen 2.5 72B.
Scratchpad-style ReAct prompt for a staff data scientist working on threat modeling, tuned for DeepSeek-R1.
Scratchpad-style ReAct prompt for a product manager working on legal brief summarization, tuned for Llama 3.3 70B.
Scratchpad-style ReAct prompt for an SRE working on code generation, tuned for Llama 3.3 70B.
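
As a rough illustration of what the tasks above involve, the sketch below builds a baseline prompt and a scratchpad-style ReAct prompt (a Thought / Action / Observation loop ending in a final answer) from the same role and task. The template wording, the `build_react_prompt` helper, and the `search_logs` tool name are all hypothetical, chosen only for this example; a real prompt would be tuned to the target model, and no model call is made here.

```python
from string import Template

# Hypothetical baseline prompt: one instruction, no reasoning structure.
BASELINE = Template("Role: $role\nTask: $task\nAnswer:")

# Hypothetical scratchpad-style ReAct prompt: the model alternates
# Thought / Action / Observation steps before giving a final answer.
REACT = Template(
    "Role: $role\n"
    "Task: $task\n"
    "Available tools: $tools\n"
    "Work in the scratchpad using this loop:\n"
    "Thought: reason about what to do next\n"
    "Action: one tool call, written as tool_name[input]\n"
    "Observation: the tool result (supplied by the caller)\n"
    "Repeat as needed, then end with:\n"
    "Final Answer: <answer>\n\n"
    "Scratchpad:\n$scratchpad"
)


def build_react_prompt(role, task, tools, steps=()):
    """Render the ReAct prompt; `steps` holds (thought, action, observation) tuples
    from earlier turns so the scratchpad carries the history forward."""
    scratchpad = "".join(
        f"Thought: {thought}\nAction: {action}\nObservation: {observation}\n"
        for thought, action, observation in steps
    )
    return REACT.substitute(
        role=role, task=task, tools=", ".join(tools), scratchpad=scratchpad
    )


baseline_prompt = BASELINE.substitute(role="SRE", task="Find anomalies in the logs")
react_prompt = build_react_prompt(
    role="SRE",
    task="Find anomalies in the logs",
    tools=["search_logs"],  # hypothetical tool name
    steps=[("Check for error spikes", "search_logs[ERROR]", "3 matches")],
)
```

Comparing quality then amounts to sending `baseline_prompt` and `react_prompt` to the same model on the same inputs and scoring the answers with whatever rubric the task calls for.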