lastmass/Qwen3_Medical_GRPO

text-generation
Maintenance lightby lastmass7719updated 8 months ago
Python

中文版说明

README

basemodel: unsloth/Qwen3-4B-Base tags: text-generation-inference transformers unsloth qwen3 medical license: apache-2.0 language: en zh datasets: FreedomIntelligence/medical-o1-reasoning-SFT lastmass/medical-o1-reasoning-SFT-keywords 中文版说明 Qwen3MedicalGRPO This is a fine-tuned version of unsloth/Qwen3-4B-Base, specializing in the medical domain. Space demonstrates the lastmass/Qwen3MedicalGRPO model (Q4KM quantized version). Qwen3MedicalGRPO Space(CPU ONLY VERY SLOW) Model Introduction This…

Source attribution

  • HuggingFacelastmass/Qwen3_Medical_GRPO

Related resources

darkknight25/deepseek-16b-medical-GPT is a fine-tuned version of deepseek-ai/deepseek-l6b-moe-chat, optimized for medical question answering, reasoning, and clinical summarization using QLoRA and open-access healthcare datasets.

010 months ago
Python

![Language: Multilingual]()

3.3K2 months ago
Python

Apertus-70B-MeditronFO is a 70B-parameter medical specialist LLM, produced by supervised fine-tuning of Apertus-70B-Instruct on the Fully Open Meditron Corpus.

3976 days ago
Python

HuatuoGPT-o1-7B

5031 year ago

In search enginers, rerankers are crucial for improving the accuracy of your retrieval system.

12.7K2 months ago
Python