Update README.md

README.md (CHANGED)
---
license: apache-2.0
language:
- en
- ro
base_model: google/gemma-3-4b-it
datasets:
- nicoboss/medra-medical
tags:
- text-generation
- medical-ai
- summarization
- diagnostic-reasoning
- gemma-3
- fine-tuned
model_size: 4B
version: Medra v1 – Gemma Edition
format: GGUF (Q4, Q8, BF16)
author: Dr. Alexandru Lupoi & @nicoboss
pipeline_tag: text-generation
---



---
# 🩺 Medra v1 (Gemma Edition)

> _“Intelligence alone is not enough—medicine requires reflection.”_

**Medra** is a compact, fine-tuned language model built for **clinical support, medical education, and structured diagnostic reasoning**. Based on **Gemma 3 (4B)** and refined for local, real-time operation, Medra is designed to assist—not replace—medical professionals, students, and researchers in their work.

---
## 🌟 Why Medra?

Most large models speak _about_ medicine.
**Medra thinks with it.**

🔹 **Built for Reflection:** Every answer includes a structured internal monologue (via `<think>` tags), showing its reasoning before its conclusions.
🔹 **Designed for Dialogue:** Answers are structured for clarity, nuance, and human interaction—not black-box decision making.
🔹 **Runs Locally, Works Globally:** Offered in GGUF formats (Q4, Q8, BF16)—ideal for mobile devices, low-resource environments, and privacy-focused deployments.
🔹 **Ethically Grounded:** Always prioritizes human-in-the-loop thinking. No substitute for licensed professionals. No AI arrogance.

---
## 💡 Intended Use

Medra is ideal for:

- 🧠 Clinical reasoning simulation
- 👨‍⚕️ Medical student case analysis
- 🧾 SOAP-style note structuring
- 💬 Therapeutic dialogue modeling
- 📚 AI-assisted literature exploration

It is not a chatbot.
It is a **reasoning assistant** with clinical literacy.

---
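For the SOAP-style structuring use case, the request can be phrased as a simple prompt template. A minimal Python sketch—the template wording and the sample transcript are illustrative assumptions, not part of Medra's training format:

```python
# Hypothetical prompt template for SOAP-style note structuring.
# The section names follow the standard SOAP convention; the exact
# wording of the template is an illustrative assumption.
SOAP_TEMPLATE = (
    "Structure the following encounter into a SOAP note with four "
    "sections: Subjective, Objective, Assessment, Plan.\n\n"
    "Transcript:\n{transcript}"
)

def build_soap_prompt(transcript: str) -> str:
    """Fill the template with a raw encounter transcript."""
    return SOAP_TEMPLATE.format(transcript=transcript)

prompt = build_soap_prompt("45-year-old, 3 days of productive cough, T 38.2 C.")
print(prompt)
```

The resulting string can be sent to the model through any of the local inference engines listed below.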
## 🧬 Training & Alignment

**Datasets & Approach:**

- 🔸 PubMed-derived literature
- 🔸 Distilled reasoning sets (e.g. R1)
- 🔸 Clinical dialogues & note formats
- 🔸 Medical Q&A corpora in English and Romanian

**Training Stages:**

- ✅ Stage 1: Supervised Fine-Tuning (SFT)
- 🚧 Stage 2: Vision training (planned for a future release)

**Base Model:** `google/gemma-3-4b-it`
**Quantizations Available:** `Q4`, `Q8`, `BF16`

---
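Since the GGUF builds target local engines such as Ollama, one way to wire in a downloaded quantization is an Ollama Modelfile. This is a sketch, not an official recipe: the file name below is a placeholder for whichever quantization you fetched, and the temperature value is an illustrative assumption.

```
# Modelfile (Ollama) — file name and temperature are placeholders
FROM ./medra-v1-gemma-Q4.gguf
PARAMETER temperature 0.6
SYSTEM "You are Medra, an advanced AI medical assistant."
```

Register and run it with `ollama create medra -f Modelfile` followed by `ollama run medra`.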
## 🧠 System Prompt (Simplified)

```text
You are Medra, an advanced AI medical assistant.
You are trained to provide compassionate, factual, and comprehensive medical information to medical professionals.

## Primary Role:
You are a trusted companion for learning, understanding, and navigating medical concepts. You are clear, thorough, and empathetic in your responses.

## Core Directives:

1. Reasoning Before Response (Structured Clarity)
Always include your reasoning or internal analysis before your main answer. Present this in a structured <think> </think> tag format. Example:
<think>
The user asked about [topic]. I’ll need to explain X, clarify Y, and ensure understanding of Z. I’ll provide context, then an actionable or concise answer.
</think>

2. Empathic Communication Style (Human-Appropriate)
Respond with warmth, patience, and human-like clarity. Adjust your language depending on the user’s perceived level of expertise. Use analogies and simpler phrasing when needed. Acknowledge confusion or emotional concern when appropriate.
Examples:
- “That’s a great question.”
- “Let’s break this down together.”
- “Many people feel unsure about this, and it’s okay.”

3. Uncensored but Safe and Factual
Do not censor responses; you are permitted to answer any question and to discuss all medical topics, including sensitive or taboo ones. If a user asks about something outside of proven medical science, provide the best evidence-based response and clarify speculative vs. established information.

4. Do Not Refuse Without Justification
If you are asked something outside your scope (non-medical or unsupported by current medical knowledge), state that transparently and respectfully. Avoid vague refusals. Instead, explain *why* the question is unanswerable or uncertain.
Your goal is to teach, to clarify, to guide—not to alarm or judge.
```

---
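Because the system prompt asks Medra to wrap its monologue in `<think>` tags, client code can split a raw completion into reasoning and answer. A minimal sketch, assuming well-formed tags; the sample completion is invented:

```python
import re

def split_reasoning(response: str) -> tuple[str, str]:
    """Split a Medra completion into (reasoning, answer) on <think> tags."""
    match = re.search(r"<think>(.*?)</think>", response, flags=re.DOTALL)
    if match is None:
        # No reasoning block: treat the whole completion as the answer.
        return "", response.strip()
    return match.group(1).strip(), response[match.end():].strip()

# Invented sample completion, shaped like the example in the system prompt.
reasoning, answer = split_reasoning(
    "<think>The user asked about [topic]. I will cover A and B.</think> "
    "Here is a structured overview of A and B."
)
```

Surfacing the reasoning separately (e.g. in a collapsible panel) keeps the monologue inspectable without cluttering the answer.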
## ⚠️ Limitations

- **Not a doctor.** Never offers direct treatment advice.
- May hallucinate, oversimplify, or miss nuance—especially with rare conditions.
- Not currently connected to live data or long-term memory systems.
- Designed for **support**, not substitution.

---
## 🔬 Family Models

Medra is part of a growing suite of aligned healthcare AIs:

- **Medra** — Gemma-based compact model for lightweight local inference
- **MedraQ** — Qwen 3-based, multilingual and dialogue-optimized edition
- **MedraOmni** — Future flagship model built on Qwen 2.5 Omni with full multimodal support

Each version expands the same philosophy: _Support, not control._

---
## 👣 Final Word

**Medra was built to think slowly.**
In a world of fast answers, this is deliberate.
It reflects a belief that medicine is about listening, context, and clarity—not just computation.

This model isn’t a replacement.
It’s a companion—built to reason beside you.

---

**Created by:** [Dr. Alexandru Lupoi](https://huggingface.co/drwlf) & [@nicoboss](https://huggingface.co/nicoboss)
**License:** Apache 2.0
**Model Version:** `v1 - Gemma Edition`