Add library name tag

This PR ensures the "how to use" button appears on the top right (with a Transformers code snippet).

README.md (CHANGED)
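For reviewers, the updated front matter is easy to sanity-check programmatically. A minimal sketch, assuming PyYAML is available; the metadata string is copied from the new side of the diff:

```python
# Sanity-check the updated README front matter (the "+" side of the diff).
# Requires PyYAML (third-party): pip install pyyaml
import yaml

front_matter = """\
base_model:
- meta-llama/Llama-3.1-405B-Instruct
language:
- en
license: cc-by-nc-4.0
metrics:
- accuracy
pipeline_tag: text-generation
library_name: transformers
"""

meta = yaml.safe_load(front_matter)

# library_name is the key that makes the Hub render the "how to use"
# button with a Transformers code snippet.
print(meta["library_name"])   # transformers
print(meta["base_model"][0])  # meta-llama/Llama-3.1-405B-Instruct
```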
````diff
@@ -1,12 +1,13 @@
 ---
-
+base_model:
+- meta-llama/Llama-3.1-405B-Instruct
 language:
 - en
+license: cc-by-nc-4.0
 metrics:
 - accuracy
-base_model:
-- meta-llama/Llama-3.1-405B-Instruct
 pipeline_tag: text-generation
+library_name: transformers
 ---
 
 # CoALM-405B: The Largest Open-Source Agentic LLM
@@ -48,8 +49,7 @@ It is designed to **push the boundaries** of open-source agentic LLMs, excelling
 - **Largest Open-Source Agentic LLM:** A **405B** parameter model that brings state-of-the-art agentic capabilities to the public domain.
 - **Best Open-Source Performance on BFCL V3:** Outperforms leading proprietary models like **GPT-4o, Gemini, and Claude** in function-calling tasks.
 - **True Zero-Shot Function Calling:** Generalizes to unseen API tasks with **unmatched accuracy**.
-- **Multi-Turn Dialogue Mastery:** Excels at long conversations, **task tracking, and complex reasoning
-- **API Tool Use and Reasoning:** Makes precise API calls, interprets responses, and synthesizes **coherent** multi-step solutions.
+- **Multi-Turn Dialogue Mastery:** Excels at long conversations, **task tracking, and complex reasoning**.\n- **API Tool Use and Reasoning:** Makes precise API calls, interprets responses, and synthesizes **coherent** multi-step solutions.
 - **Fully Open-Source & Reproducible:** Released under **cc-by-nc-4.0**, including model weights, training logs, and datasets.
 
 
@@ -65,8 +65,7 @@ It is designed to **push the boundaries** of open-source agentic LLMs, excelling
 ---
 ## Training Process
 ### Fine-tuning Stages
-1. **TOD Fine-tuning:** Optimized for **dialogue state tracking** (e.g., augmented SNIPS in instruction-tuned format).
-2. **Function Calling Fine-tuning:** Trained to generate **highly accurate API calls** from LA datasets.
+1. **TOD Fine-tuning:** Optimized for **dialogue state tracking** (e.g., augmented SNIPS in instruction-tuned format).\n2. **Function Calling Fine-tuning:** Trained to generate **highly accurate API calls** from LA datasets.
 3. **ReAct-based Fine-tuning:** Enhances multi-turn conversations with structured **thought-action-observation-response reasoning**.
 
 ### Training Hyperparameters
@@ -128,7 +127,4 @@ If you use **CoALM-405B** in your research, please cite:
 }
 ```
 
-For more details, visit [Project Repository](https://github.com/oumi-ai/oumi/tree/main/configs/projects/CALM) or contact **[email protected]**.
-
-
-
+For more details, visit [Project Repository](https://github.com/oumi-ai/oumi/tree/main/configs/projects/CALM) or contact **[email protected]**.
````
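The "ReAct-based Fine-tuning" stage in the diff structures each turn as thought, action, observation, response. A toy sketch of that loop in plain Python; the tag format, tool registry, and `fake_llm` are hypothetical stand-ins, not CoALM's actual prompt format:

```python
# Illustrative thought-action-observation-response loop (ReAct-style).
# fake_llm, the "Thought:/Action:" tags, and the tool registry are all
# hypothetical stand-ins for this sketch, not CoALM's real interface.

def fake_llm(prompt: str) -> str:
    # Stand-in for a model call; always decides to use the calculator tool.
    return 'Thought: I need to compute the sum.\nAction: calculator("2+3")'

TOOLS = {"calculator": lambda expr: str(eval(expr))}  # toy tool (eval is for the sketch only)

def react_turn(user_msg: str) -> str:
    step = fake_llm(user_msg)                 # thought + action
    thought, action = step.split("\nAction: ")
    tool_name, arg = action.split("(", 1)
    arg = arg.rstrip(")").strip('"')
    observation = TOOLS[tool_name](arg)       # execute the tool call
    return f"The result is {observation}."    # synthesized response

print(react_turn("What is 2+3?"))  # The result is 5.
```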