rbehzadan committed on
Commit e85f7c2 · verified · 1 Parent(s): 048da0f

Initial commit

Files changed (4)
  1. .gitattributes +2 -0
  2. README.md +97 -3
  3. ReaderLM-v2-Q4_K_M.gguf +3 -0
  4. ReaderLM-v2-Q8_0.gguf +3 -0
.gitattributes CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
  *.zip filter=lfs diff=lfs merge=lfs -text
  *.zst filter=lfs diff=lfs merge=lfs -text
  *tfevents* filter=lfs diff=lfs merge=lfs -text
+ ReaderLM-v2-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+ ReaderLM-v2-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md CHANGED
@@ -1,3 +1,97 @@
- ---
- license: cc-by-nc-4.0
- ---
+ ---
+ license: cc-by-nc-4.0
+ tags:
+ - llama.cpp
+ - gguf
+ - ReaderLM-v2
+ - html-to-markdown
+ - jina-ai
+ ---
+
+ # ReaderLM-v2 GGUF Quantized Models for llama.cpp
+
+ This repository contains **GGUF quantized versions** of the [ReaderLM-v2](https://huggingface.co/jinaai/ReaderLM-v2) model by [Jina AI](https://jina.ai/). These models are optimized for **llama.cpp**, making them efficient to run on CPUs and GPUs.
+
+ ## Model Information
+
+ ReaderLM-v2 is a **1.5 billion parameter** model designed for **HTML-to-Markdown** and **HTML-to-JSON** conversion. It supports **29 languages** and can handle **up to 512,000 tokens** of combined input and output.
+
+ The model is useful for extracting structured data from web pages and for a range of NLP applications.
+ ## Available Quantized Models
+
+ | Model File | Quantization Type | Size | Description |
+ |---------------------------|------------------|-------|-------------|
+ | `ReaderLM-v2-Q4_K_M.gguf` | Q4_K_M | 986MB | Lower precision; smaller and faster, well suited to CPU-only setups |
+ | `ReaderLM-v2-Q8_0.gguf` | Q8_0 | 1.6GB | Higher precision; better output quality |
+
+ These quantized versions trade off **speed and accuracy**, making them suitable for different hardware setups.
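Either file can be fetched with the `huggingface_hub` CLI (a sketch; `<repo-id>` is a placeholder for this repository's id, which you should substitute):

```shell
pip install -U "huggingface_hub[cli]"
huggingface-cli download <repo-id> ReaderLM-v2-Q4_K_M.gguf --local-dir ./models
```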
+
+ ## Usage
+
+ ### Running the Model with llama.cpp
+
+ 1. **Clone and build llama.cpp**:
+ ```bash
+ git clone https://github.com/ggerganov/llama.cpp.git
+ cd llama.cpp
+ mkdir build && cd build
+ cmake ..
+ make -j$(nproc)
+ ```
+
+ 2. **Run the model** (the binary lands in `bin/` inside the build directory):
+ ```bash
+ ./bin/llama-cli --model ReaderLM-v2-Q4_K_M.gguf --no-conversation --no-display-prompt --temp 0 --prompt '<|im_start|>system
+ Convert the HTML to Markdown.
+ <|im_end|>
+ <|im_start|>user
+ <html><body><h1>Hello, world!</h1></body></html>
+ <|im_end|>
+ <|im_start|>assistant' 2>/dev/null
+ ```
+
+ Replace `ReaderLM-v2-Q4_K_M.gguf` with `ReaderLM-v2-Q8_0.gguf` for better quality at the cost of speed.
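The raw prompt above follows the ChatML template. As a sanity check, the same string can be assembled programmatically (a minimal sketch; `build_chatml_prompt` is a hypothetical helper, not part of llama.cpp):

```python
def build_chatml_prompt(system: str, user: str) -> str:
    """Assemble a ChatML-style prompt like the one passed to llama-cli above."""
    return (
        f"<|im_start|>system\n{system}\n<|im_end|>\n"
        f"<|im_start|>user\n{user}\n<|im_end|>\n"
        f"<|im_start|>assistant"
    )

prompt = build_chatml_prompt(
    "Convert the HTML to Markdown.",
    "<html><body><h1>Hello, world!</h1></body></html>",
)
print(prompt)
```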
+
+ ### Using the Model in Python with llama-cpp-python
+
+ ```bash
+ pip install llama-cpp-python
+ ```
+
+ ```python
+ from llama_cpp import Llama
+
+ model_path = "./models/ReaderLM-v2-Q4_K_M.gguf"
+ llm = Llama(model_path=model_path, chat_format="chatml")
+ output = llm.create_chat_completion(
+     messages=[
+         {"role": "system", "content": "Convert the HTML to Markdown."},
+         {
+             "role": "user",
+             "content": "<html><body><h1>Hello, world!</h1><p>This is a test!</p></body></html>"
+         }
+     ],
+     temperature=0.1,
+ )
+
+ print(output['choices'][0]['message']['content'].strip())
+ ```
+
+ ## Hardware Requirements
+
+ - **Q4_K_M (986MB)**: runs well on CPUs with **8GB of RAM or more**
+ - **Q8_0 (1.6GB)**: **16GB of RAM** is recommended for smooth performance
+
+ For **GPU acceleration**, compile `llama.cpp` with CUDA support.
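A sketch of a CUDA-enabled build follows; note that the CMake flag name has changed across llama.cpp versions (recent releases use `GGML_CUDA`, older ones used `LLAMA_CUBLAS`), so check the flags for your checkout:

```shell
# From the llama.cpp checkout; assumes the CUDA toolkit is installed.
mkdir build && cd build
cmake .. -DGGML_CUDA=ON
make -j$(nproc)
```

At run time, offload layers to the GPU with `-ngl` (e.g. `./bin/llama-cli --model ReaderLM-v2-Q4_K_M.gguf -ngl 99 ...`).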
+
+ ## Credits
+
+ - **Original Model**: [Jina AI - ReaderLM-v2](https://huggingface.co/jinaai/ReaderLM-v2)
+ - **Quantization**: Performed using [llama.cpp](https://github.com/ggerganov/llama.cpp)
+
+ ## License
+
+ This model is released under **Creative Commons Attribution-NonCommercial 4.0 (CC-BY-NC-4.0)**. See the [original model page](https://huggingface.co/jinaai/ReaderLM-v2) for details.
+
+ ---
+ _Last updated: **January 31, 2025**_
+
ReaderLM-v2-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5c19ed3117873c716e25a3556dbdc6e7c99969acfbac26e2273d8eb563244ddf
+ size 986046080
ReaderLM-v2-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0a0b0464ee4f91a2f9ae8294fc01a00e2023c1498d5c4dde2870df532dc0829d
+ size 1646570624