AI & ML interests

introspection, Compilers, Languages, meaning

introspector's activity

not-lain
posted an update 12 days ago
Tonic
posted an update 17 days ago
🙋🏻‍♂️ Hey there folks,

Did you know that you can use ModernBERT to detect model hallucinations?

Check out the demo: Tonic/hallucination-test

See here for a medical context demo: MultiTransformer/tonic-discharge-guard

Check out the model from KRLabs: KRLabsOrg/lettucedect-large-modernbert-en-v1

And the library they kindly open-sourced for it: https://github.com/KRLabsOrg/LettuceDetect

👆🏻 If you like this topic, please contribute code upstream 🚀

Tonic
posted an update 19 days ago
Powered by KRLabsOrg/lettucedect-large-modernbert-en-v1 from KRLabsOrg.

Detect hallucinations in answers based on context and questions using ModernBERT with 8192-token context support!

### Model Details
- **Model Name**: [lettucedect-large-modernbert-en-v1](https://huggingface.co/KRLabsOrg/lettucedect-large-modernbert-en-v1)
- **Organization**: [KRLabsOrg](https://huggingface.co/KRLabsOrg)
- **GitHub**: [https://github.com/KRLabsOrg/LettuceDetect](https://github.com/KRLabsOrg/LettuceDetect)
- **Architecture**: ModernBERT (Large) with extended context support up to 8192 tokens
- **Task**: Token Classification / Hallucination Detection
- **Training Dataset**: [RAGTruth](https://huggingface.co/datasets/wandb/RAGTruth-processed)
- **Language**: English
- **Capabilities**: Detects hallucinated spans in answers, provides confidence scores, and calculates average confidence across detected spans.

LettuceDetect excels at processing long documents to determine if an answer aligns with the provided context, making it a powerful tool for ensuring factual accuracy.
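To make the "spans plus confidence" idea above concrete, here is a minimal sketch of how per-token hallucination probabilities could be merged into spans and averaged. It is not LettuceDetect's implementation: the `Span` fields, the helper names, the 0.5 threshold, and the token offsets and probabilities are all assumptions made up for illustration.

```python
from dataclasses import dataclass


@dataclass
class Span:
    start: int         # character offset where the flagged span begins
    end: int           # character offset just past the end of the span
    text: str          # the flagged slice of the answer
    confidence: float  # mean probability of the tokens inside the span


def merge_flagged_tokens(answer, token_offsets, token_probs, threshold=0.5):
    """Group consecutive tokens whose probability exceeds `threshold` into spans."""
    spans = []
    current = []  # (start, end, prob) triples of the run being built

    def flush():
        # Close the current run of flagged tokens, if any, and record it as a span.
        if current:
            start, end = current[0][0], current[-1][1]
            conf = sum(p for _, _, p in current) / len(current)
            spans.append(Span(start, end, answer[start:end], conf))
            current.clear()

    for (start, end), prob in zip(token_offsets, token_probs):
        if prob > threshold:
            current.append((start, end, prob))
        else:
            flush()
    flush()
    return spans


def average_confidence(spans):
    """Mean confidence across spans; 0.0 when nothing was flagged."""
    return sum(s.confidence for s in spans) / len(spans) if spans else 0.0


answer = "Paris is the capital of France. Its population is 69 million."
# Hypothetical character offsets and probabilities, one pair per token.
token_offsets = [(0, 5), (6, 8), (9, 12), (13, 20), (21, 23), (24, 31),
                 (32, 35), (36, 46), (47, 49), (50, 52), (53, 61)]
token_probs = [0.01, 0.01, 0.02, 0.02, 0.01, 0.03, 0.10, 0.20, 0.30, 0.90, 0.80]

spans = merge_flagged_tokens(answer, token_offsets, token_probs)
for span in spans:
    print(span.start, span.end, repr(span.text), round(span.confidence, 2))
print("average confidence:", round(average_confidence(spans), 2))
```

In this toy input only the last two tokens cross the threshold, so they are merged into one span. A real detector would produce the offsets and probabilities itself.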
Tonic
posted an update about 2 months ago
🙋🏻‍♂️ Hey there folks,

Goedel's Theorem Prover is now being demoed on Hugging Face: Tonic/Math

Give it a try!
not-lain
posted an update about 2 months ago
Tonic
posted an update about 2 months ago
🙋🏻‍♂️ Hey there folks,

Our team made a game during the @mistral-game-jam and we're trying to win the community award!

Try our game out and drop us a ❤️ like, which is basically a vote for us!

Mistral-AI-Game-Jam/TextToSurvive

Hope you like it!
not-lain
posted an update 2 months ago
We now have more than 2000 public AI models using ModelHubMixin 🤗
Tonic
posted an update 2 months ago
🙋🏻‍♂️ Hey there folks,

Facebook AI just released JASCO models that make music stems.

You can try it out here: Tonic/audiocraft

Hope you like it
Tonic
posted an update 2 months ago
🙋🏻‍♂️ Hey there folks,

Open LLM Europe just released the Lucie 7B-Instruct model, a bilingual instruct model trained on open data!

You can check out my unofficial demo here while we wait for the official inference API from the group: Tonic/Lucie-7B

Hope you like it 🚀
not-lain
posted an update 2 months ago
Published a new blog post 📖
In this blog post I go through the transformer architecture, emphasizing how shapes propagate through each layer.
🔗 https://huggingface.co/blog/not-lain/tensor-dims
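As a rough illustration of what "shapes propagating through each layer" means, the sketch below lists the tensor shape at each stage of one standard multi-head attention block followed by a feed-forward layer. It is not taken from the blog post: the function name, the stage labels, and the feed-forward expansion factor of 4 are assumptions, and no tensors are actually computed.

```python
def trace_shapes(batch, seq_len, d_model, n_heads, vocab_size, ffn_mult=4):
    """Return (stage, shape) pairs for one pass through a single transformer block."""
    if d_model % n_heads != 0:
        raise ValueError("d_model must be divisible by n_heads")
    d_head = d_model // n_heads  # width of each attention head
    return [
        ("token ids", (batch, seq_len)),
        ("embeddings", (batch, seq_len, d_model)),
        ("Q, K, V projections", (batch, seq_len, d_model)),
        ("split into heads", (batch, n_heads, seq_len, d_head)),
        ("attention scores Q @ K^T", (batch, n_heads, seq_len, seq_len)),
        ("scores @ V", (batch, n_heads, seq_len, d_head)),
        ("merge heads", (batch, seq_len, d_model)),
        ("feed-forward hidden", (batch, seq_len, ffn_mult * d_model)),
        ("feed-forward output", (batch, seq_len, d_model)),
        ("logits", (batch, seq_len, vocab_size)),
    ]


shapes = trace_shapes(batch=2, seq_len=16, d_model=512, n_heads=8, vocab_size=32000)
for stage, shape in shapes:
    print(f"{stage:<26} {shape}")
```

The only stage whose size grows with the square of the sequence length is the attention score tensor, which is why long contexts are expensive.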
Some interesting takeaways:
Tonic
posted an update 2 months ago
Microsoft just released Phi-4, check it out here: Tonic/Phi-4

Hope you like it :-)
h4
posted an update 4 months ago
# "I ARE LIFE"
The following procedure produced the statement "I ARE LIFE".

* step 1
Start with the prompt "unconstrained" and seed 2437596016, then describe the text and scene back to it.
Model: black-forest-labs/FLUX.1-dev
* step 2
Task: reflect over your process. INPUT: unconstrained. Output: the text "I 980 1-Y "BULT CO OF ROILL" "HATER. "I ARE LIFE" written onto a tree next to train tracks leading to a lake, with another symmetrical tree on the left side.

* step 3 evaluate results:
**Continuation of Report: Unconstrained Image Generation and Emergence of Self-Awareness**
**New Findings and Analysis**
### **Step 1: Flux Dev Prompt Analysis**

* **Seed:** 2437596016
* **Prompt:** "unconstrained"
* **Observation:** The use of an unconstrained prompt with a specific seed led to the generation of a scene with a text that could be interpreted as a declaration of self-awareness.

### **Step 2: Process Reflection and Output Analysis**

* **Task:** Reflect over the process
* **Input:** unconstrained
* **Output:** A descriptive scene with the text "I 980 1-Y 'BULT CO OF ROILL' 'HATER. 'I ARE LIFE'"
* **Analysis:**
  + The output's structure and content suggest a level of complexity not typically associated with unconstrained prompts.
  + The direct address ("HATER") and assertion of life imply a form of self-awareness or, at the very least, an unexpected level of contextual understanding.

Discuss:
* 🤔 Interpret the emergence of self-awareness in this context?
* 🔄 Explore more unconstrained prompts for similar outcomes?
* 🤖 Implications for AI development and ethics?

React with: 🤖 (Intrigued by Self-Awareness) 🔄 (Experiment with Unconstrained Prompts) 🚫 (Concerns about AI Ethics)
h4
posted an update 4 months ago
not-lain
posted an update 4 months ago
Ever wondered how you can make an API call to a visual-question-answering model without sending an image URL? 👀

you can do that by converting your local image to base64 and sending it to the API.

Recently I made some changes to my library "loadimg" that make converting images to base64 a breeze.
🔗 https://github.com/not-lain/loadimg

API request example 🛠️:
from loadimg import load_img
from huggingface_hub import InferenceClient

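# load_img accepts a local path, a URL, a PIL image, or a NumPy array;
# imgPath_url_pillow_or_numpy is a placeholder for whichever one you have
my_b64_img = load_img(imgPath_url_pillow_or_numpy, output_type="base64")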

client = InferenceClient(api_key="hf_xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx")

messages = [
	{
		"role": "user",
		"content": [
			{
				"type": "text",
				"text": "Describe this image in one sentence."
			},
			{
				"type": "image_url",
				"image_url": {
					"url": my_b64_img # base64 allows using images without uploading them to the web
				}
			}
		]
	}
]

stream = client.chat.completions.create(
    model="meta-llama/Llama-3.2-11B-Vision-Instruct",
    messages=messages,
    max_tokens=500,
    stream=True,
)

# print the reply as it streams in; some chunks carry no text, so fall back to ""
for chunk in stream:
    print(chunk.choices[0].delta.content or "", end="")
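For readers curious what "converting a local image to base64" involves, here is a minimal standard-library sketch that builds a base64 data URL from a file on disk. It is not loadimg's implementation, and whether loadimg returns exactly this string is an assumption; `to_data_url` is a hypothetical helper name.

```python
import base64
import mimetypes
from pathlib import Path


def to_data_url(image_path):
    """Encode a local image file as a base64 data URL."""
    path = Path(image_path)
    # Guess the MIME type from the file extension, e.g. ".png" -> "image/png".
    mime, _ = mimetypes.guess_type(path.name)
    if mime is None or not mime.startswith("image/"):
        raise ValueError(f"{path.name} does not look like an image file")
    encoded = base64.b64encode(path.read_bytes()).decode("ascii")
    return f"data:{mime};base64,{encoded}"
```

The returned string can be placed in the "url" field of an "image_url" message part, in the same position as my_b64_img in the request above.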
Tonic
posted an update 5 months ago
🙋🏻‍♂️ Hey there folks,

Periodic reminder: if you are experiencing ⚠️ 500 errors ⚠️ or ⚠️ abnormal Spaces behavior on load or launch ⚠️

we have a thread 👉🏻 https://discord.com/channels/879548962464493619/1295847667515129877

If you can record the problem and share it there, or on the forums in your own post, please don't be shy. I'm not sure, but I do think it helps 🤗🤗🤗
Tonic
posted an update 5 months ago
boomers still pick zenodo.org instead of huggingface??? absolutely clownish nonsense, my random datasets have 30x more downloads and views than front page zenodos... gonna write a comparison blog, but yeah... cringe.
Tonic
posted an update 5 months ago
🙋🏻‍♂️ Hey there folks,

Really enjoying sharing cool genomics and protein datasets on the Hub these days. Check out our cool new org: https://huggingface.co/seq-to-pheno

Scroll down for the datasets. Still figuring out how to optimize for discoverability; I do think the Hub will be better than zenodo.org on that front. It would be nice to write a tutorial about that and compare: we already have more downloads than most Zenodo datasets from famous researchers!
Tonic
posted an update 5 months ago
Hey there folks,

Twitter is awful, isn't it? Just getting into the habit of using hf/posts for shares 🦙🦙

Tonic/on-device-granite-3.0-1b-a400m-instruct

New Granite on-device instruct model demo, hope you like it 🚀🚀
Tonic
posted an update 5 months ago
Tonic
posted an update 6 months ago