alea-institute
/

charboundary-small

Text Classification

sentence-boundary-detection

paragraph-detection

text-segmentation

document-processing

Model card Files Files and versions

alea-institute commited on Apr 11

Commit

fe26bb0

·

verified ·

1 Parent(s): cc016ca

Update README for small model

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -46,6 +46,10 @@ a fast character-based sentence and paragraph boundary detection system optimize
 ## Usage
 ```python
 from huggingface_hub import hf_hub_download
 from charboundary import TextSegmenter
@@ -53,8 +57,8 @@ from charboundary import TextSegmenter
 # Download the model
 model_path = hf_hub_download(repo_id="alea-institute/charboundary-small", filename="model.pkl")
-# Load the model
-segmenter = TextSegmenter.load(model_path)
 # Use the model
 text = "This is a test sentence. Here's another one!"

 ## Usage
+> **Important:** When loading models from Hugging Face Hub, you must set `trust_model=True` to allow loading custom class types.
+>
+> **Security Note:** The ONNX model variants are recommended in security-sensitive environments as they don't require bypassing skops security measures with `trust_model=True`. See the [ONNX versions](https://huggingface.co/alea-institute/charboundary-small-onnx) for a safer alternative.
 ```python
 from huggingface_hub import hf_hub_download
 from charboundary import TextSegmenter
 # Download the model
 model_path = hf_hub_download(repo_id="alea-institute/charboundary-small", filename="model.pkl")
+# Load the model (trust_model=True is required when loading from external sources)
+segmenter = TextSegmenter.load(model_path, trust_model=True)
 # Use the model
 text = "This is a test sentence. Here's another one!"