Category Not Found

KTO Fine-Tune of Llama 3.1 8B for function-calling with strict JSON

Full fine-tuning recipe: KTO on Llama 3.1 8B via Unsloth, targeting 2x RTX 4090, with data mix and eval plan.

🤖Any Model

3801330

KTO Fine-Tune of Llama 3.1 70B for legal clause extraction

Full fine-tuning recipe: KTO on Llama 3.1 70B via OpenRLHF, targeting 2x RTX 4090, with data mix and eval plan.

🤖Any Model

1291408

KTO Fine-Tune of Mistral Nemo 12B for customer support classification

Full fine-tuning recipe: KTO on Mistral Nemo 12B via torchtune, targeting AWS g5.12xlarge, with data mix and eval plan.

306897

KTO Fine-Tune of Qwen 2.5 7B for function-calling with strict JSON

Full fine-tuning recipe: KTO on Qwen 2.5 7B via Unsloth, targeting AWS g5.12xlarge, with data mix and eval plan.

54143

KTO Fine-Tune of Qwen 2.5-Coder 7B for legal clause extraction

Full fine-tuning recipe: KTO on Qwen 2.5-Coder 7B via OpenRLHF, targeting AWS g5.12xlarge, with data mix and eval plan.

318397

KTO Fine-Tune of Gemma 2 9B for customer support classification

Full fine-tuning recipe: KTO on Gemma 2 9B via DeepSpeed, targeting AWS p4d.24xlarge, with data mix and eval plan.

154125

KTO Fine-Tune of Phi-3.5-mini for function-calling with strict JSON

Full fine-tuning recipe: KTO on Phi-3.5-mini via Hugging Face TRL, targeting AWS p4d.24xlarge, with data mix and eval plan.

314687

KTO Fine-Tune of Phi-4 for legal clause extraction

Full fine-tuning recipe: KTO on Phi-4 via Megatron-LM, targeting Lambda Labs 8xH100, with data mix and eval plan.

131350

KTO Fine-Tune of DeepSeek-V3 base for customer support classification

Full fine-tuning recipe: KTO on DeepSeek-V3 base via FSDP, targeting Lambda Labs 8xH100, with data mix and eval plan.

3151201

KTO Fine-Tune of DeepSeek-V3 base for legal clause extraction

Full fine-tuning recipe: KTO on DeepSeek-V3 base via Unsloth, targeting 4x A100 40GB, with data mix and eval plan.

🟠Claude

321934

Build an SFT Dataset for legal clause extraction from internal SME interviews

End-to-end SFT dataset construction: collection, labeling, cleaning, dedup, contamination check for legal clause extraction.

251326

Build an SFT Dataset for legal clause extraction from rejection-sampled from base model

End-to-end SFT dataset construction: collection, labeling, cleaning, dedup, contamination check for legal clause extraction.