lastmass/Qwen3_Medical_GRPO
Chinese version
README
base_model: unsloth/Qwen3-4B-Base
tags: text-generation-inference, transformers, unsloth, qwen3, medical
license: apache-2.0
language: en, zh
datasets: FreedomIntelligence/medical-o1-reasoning-SFT, lastmass/medical-o1-reasoning-SFT-keywords

Qwen3_Medical_GRPO
This is a fine-tuned version of unsloth/Qwen3-4B-Base, specializing in the medical domain. A Space demonstrates the lastmass/Qwen3_Medical_GRPO model (Q4_K_M quantized version): Qwen3_Medical_GRPO Space (CPU only, very slow).

Model Introduction
This…
Source attribution
- HuggingFace — lastmass/Qwen3_Medical_GRPO
Related resources
darkknight25/deepseek-16b-medical-GPT
by darkknight25
darkknight25/deepseek-16b-medical-GPT is a fine-tuned version of deepseek-ai/deepseek-16b-moe-chat, optimized for medical question answering, reasoning, and clinical summarization using QLoRA and open-access healthcare datasets.
thelamapi/next-ocr
by thelamapi
Language: Multilingual
Apertus-70B-MeditronFO is a 70B-parameter medical specialist LLM, produced by supervised fine-tuning of Apertus-70B-Instruct on the Fully Open Meditron Corpus.
zeroentropy/zerank-1-small-reranker
by zeroentropy
In search engines, rerankers are crucial for improving the accuracy of your retrieval system.