Category Not Found

DPO Fine-Tune of Mistral Small 3 for customer support classification

Full fine-tuning recipe: DPO on Mistral Small 3 via Unsloth, targeting AWS g5.12xlarge, with data mix and eval plan.

383936

Free

DPO Fine-Tune of Qwen 2.5 7B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Qwen 2.5 7B via OpenRLHF, targeting AWS g5.12xlarge, with data mix and eval plan.

4659

DPO Fine-Tune of Qwen 2.5 32B for legal clause extraction

Full fine-tuning recipe: DPO on Qwen 2.5 32B via torchtune, targeting AWS g5.12xlarge, with data mix and eval plan.

3601309

DPO Fine-Tune of Qwen 2.5-Coder 7B for customer support classification

Full fine-tuning recipe: DPO on Qwen 2.5-Coder 7B via Unsloth, targeting AWS p4d.24xlarge, with data mix and eval plan.

371546

DPO Fine-Tune of Gemma 2 27B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Gemma 2 27B via OpenRLHF, targeting AWS p4d.24xlarge, with data mix and eval plan.

101729

DPO Fine-Tune of Phi-3.5-mini for legal clause extraction

Full fine-tuning recipe: DPO on Phi-3.5-mini via DeepSpeed, targeting Lambda Labs 8xH100, with data mix and eval plan.

2691194

Free

DPO Fine-Tune of DeepSeek-V3 base for customer support classification

Full fine-tuning recipe: DPO on DeepSeek-V3 base via Hugging Face TRL, targeting Lambda Labs 8xH100, with data mix and eval plan.

334214

DPO Fine-Tune of Mixtral 8x7B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Mixtral 8x7B via Megatron-LM, targeting Lambda Labs 8xH100, with data mix and eval plan.

🟠Claude

301242

DPO Fine-Tune of Yi 1.5 34B for legal clause extraction

Full fine-tuning recipe: DPO on Yi 1.5 34B via DeepSpeed, targeting a single A100 80GB, with data mix and eval plan.

🟠Claude

901097

DPO Fine-Tune of Qwen 2.5-Coder 7B for legal clause extraction

Full fine-tuning recipe: DPO on Qwen 2.5-Coder 7B via LitGPT, targeting 2x A100 80GB, with data mix and eval plan.

🟠Claude

11723

DPO Fine-Tune of Mistral Nemo 12B for legal clause extraction

Full fine-tuning recipe: DPO on Mistral Nemo 12B via OpenRLHF, targeting a single RTX 3090 (24GB), with data mix and eval plan.

🤖Any Model

2441213

DPO Fine-Tune of Qwen 2.5-Coder 7B for function-calling with strict JSON

Full fine-tuning recipe: DPO on Qwen 2.5-Coder 7B via Hugging Face TRL, targeting 2x RTX 4090, with data mix and eval plan.