Design A/B rollout analysis and drift detection for citation accuracy in a production LLM code-assistant app.
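A minimal sketch of what that monitoring task involves: a two-proportion z-test to compare citation accuracy between the control and treatment arms, plus a rolling-window detector that flags drift once the live accuracy falls well below the rollout baseline. All names, thresholds, and counts below are illustrative assumptions, not part of the prompt.

```python
# Sketch: A/B comparison + drift detection for a per-request binary label
# (1 = citation verified correct, 0 = not). Window size, z-thresholds, and
# the example counts are assumptions for illustration only.
import math
from collections import deque

def two_proportion_ztest(success_a, n_a, success_b, n_b):
    """Two-sided z-test for a difference in citation-accuracy rates between arms."""
    p_a, p_b = success_a / n_a, success_b / n_b
    p_pool = (success_a + success_b) / (n_a + n_b)
    se = math.sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_a - p_b) / se
    p_value = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value
    return z, p_value

class DriftDetector:
    """Flag drift when a rolling window's accuracy drops below the rollout baseline."""
    def __init__(self, baseline_rate, window=500, z_threshold=3.0):
        self.baseline = baseline_rate
        self.window = deque(maxlen=window)
        self.z_threshold = z_threshold

    def update(self, label):
        """Feed one request's label; returns True once a significant drop is seen."""
        self.window.append(label)
        n = len(self.window)
        if n < self.window.maxlen:
            return False  # not enough data yet
        rate = sum(self.window) / n
        se = math.sqrt(self.baseline * (1 - self.baseline) / n)
        return (self.baseline - rate) / se > self.z_threshold  # one-sided drop test

# Example usage: compare arms offline, then monitor the winning arm in production.
z, p = two_proportion_ztest(success_a=1840, n_a=2000, success_b=1790, n_b=2000)
detector = DriftDetector(baseline_rate=0.92)
```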
Full fine-tuning recipe: QLoRA (4-bit) on DeepSeek-V3 base via Unsloth, targeting single A100 80GB, with data mix and eval plan.
Full fine-tuning recipe: QLoRA (4-bit) on DeepSeek-V3 base via LitGPT, targeting 8x H100, with data mix and eval plan.
Full fine-tuning recipe: QLoRA (4-bit) on Gemma 2 27B via OpenRLHF, targeting AWS g5.12xlarge, with data mix and eval plan.
Full fine-tuning recipe: QLoRA (4-bit) on Gemma 2 27B via torchtune, targeting 2x A100 80GB, with data mix and eval plan.
Full fine-tuning recipe: QLoRA (4-bit) on Gemma 2 27B via Megatron-LM, targeting single RTX 3090 (24GB), with data mix and eval plan.
Full fine-tuning recipe: QLoRA (4-bit) on Qwen 2.5 32B via Unsloth, targeting Lambda Labs 8xH100, with data mix and eval plan.
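The QLoRA prompts above share one core setup regardless of framework: load the base model in 4-bit NF4, freeze it, and train low-rank adapters on top. The sketch below shows that common skeleton with Transformers + PEFT + bitsandbytes rather than the specific frameworks named (Unsloth, LitGPT, OpenRLHF, torchtune, Megatron-LM); the model id, rank, and hyperparameters are illustrative assumptions.

```python
# Framework-agnostic QLoRA (4-bit) setup: the part every recipe above has in common.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

model_id = "google/gemma-2-27b"  # stand-in; swap for the target base model

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",          # NF4 quantization, as in the QLoRA paper
    bnb_4bit_use_double_quant=True,
    bnb_4bit_compute_dtype=torch.bfloat16,
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, quantization_config=bnb, device_map="auto"
)
model = prepare_model_for_kbit_training(model)  # gradient checkpointing, layer-norm casting

lora = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05, task_type="CAUSAL_LM",
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj",
                    "gate_proj", "up_proj", "down_proj"],
)
model = get_peft_model(model, lora)
model.print_trainable_parameters()  # only the LoRA adapters train; the base stays 4-bit
```

The per-framework differences in the prompts (Unsloth kernels, torchtune/LitGPT configs, multi-GPU launchers) change how this setup is wired and launched, not the underlying method.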
Full fine-tuning recipe: DPO on Gemma 2 27B via torchtune, targeting Lambda Labs 8xH100, with data mix and eval plan.
Full fine-tuning recipe: DPO on Phi-4 via Unsloth, targeting single A100 80GB, with data mix and eval plan.
Full fine-tuning recipe: DPO on DeepSeek-V3 base via OpenRLHF, targeting single A100 80GB, with data mix and eval plan.
Full fine-tuning recipe: DPO on Mixtral 8x22B via DeepSpeed, targeting single H100 80GB, with data mix and eval plan.
Full fine-tuning recipe: DPO on Yi 1.5 34B via Hugging Face TRL, targeting single H100 80GB, with data mix and eval plan.
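The DPO prompts above all train on preference pairs (prompt, chosen, rejected). A minimal sketch with Hugging Face TRL, the library named in the last recipe, is below; the other frameworks (torchtune, Unsloth, OpenRLHF, DeepSpeed) apply the same objective with their own launchers. The small stand-in model, beta, and the toy dataset are assumptions, and exact keyword names vary across TRL versions.

```python
# Minimal DPO sketch with Hugging Face TRL on a toy preference dataset.
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "Qwen/Qwen2.5-0.5B-Instruct"  # small stand-in so the sketch runs anywhere
model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

# Preference pairs: each row has a prompt, a preferred answer, and a rejected one.
pairs = Dataset.from_dict({
    "prompt":   ["Explain what a LoRA adapter is."],
    "chosen":   ["A LoRA adapter adds small low-rank matrices to frozen weights..."],
    "rejected": ["LoRA is a type of GPU."],
})

args = DPOConfig(
    output_dir="dpo-out",
    beta=0.1,                        # strength of the preference constraint vs. the reference
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    learning_rate=5e-7,
    num_train_epochs=1,
)
trainer = DPOTrainer(
    model=model,
    ref_model=None,                  # TRL keeps a frozen reference copy internally
    args=args,
    train_dataset=pairs,
    processing_class=tokenizer,
)
trainer.train()
```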