empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
A domain-optimized reasoning model built on DeepSeek-R1-Distill-Qwen-32B, refined through a multi-stage pipeline of GPTQ quantization-aware training and QLoRA fine-tuning. Achieves 84% on MedQA — within 4 points of GPT-4o — in a ~20GB package that fits on a single L40/L40s GPU.
README
license: cc-by-4.0 datasets: allenai/c4 language: en metrics: accuracy basemodel: deepseek-ai/DeepSeek-R1-Distill-Qwen-32B pipelinetag: text-generation tags: gptq int4 quantized qlora medical medqa biology chemistry finance legal climate reasoning 4-bit model-index: name: Chaperone-Thinking-LQ-1.0 results: task: type: text-generation name: Medical QA dataset: name: MedQA type: medqa metrics: type: accuracy value: 84.0 task: type: text-generation name: Math Reasoning dataset: name: MATH-500…
Source attribution
- HuggingFace — empirischtech/DeepSeek-R1-Distill-Qwen-32B-gptq-4bit
Related resources
zeroentropy/zerank-1-small-reranker
by zeroentropyIn search enginers, rerankers are crucial for improving the accuracy of your retrieval system.
Fine-tuned version of google/gemma-4-E4B-it across three professional domains — Medical, Legal, and Finance — using QLoRA (4-bit NF4) with Optuna-tuned hyperparameters, trained on Kaggle T4 GPU.
zeroentropy/zerank-2-reranker
by zeroentropyIn search engines, rerankers are crucial for improving the accuracy of your retrieval system.