Nielly

AI & ML interests

None yet

Recent Activity

reacted to openfree's post with 🔥 about 16 hours ago

📚 Multilingual RAG Chatbot with PDF Support Chat naturally with your documents! 🌟 ✨ Key Features: • 🌏 Multilingual Q&A support (English, Korean, etc.) • 📄 Real-time PDF and text file processing • 🔍 Context-aware accurate responses • ⚡ Intuitive Chainlit-powered chat interface 🛠️ Tech Stack: • 💻 Clean, documented open-source code • 🤝 User-friendly Chainlit UI • 📊 Vector database for efficient retrieval • 🔄 Real-time streaming responses 📱 Try it now! → Demo: https://huggingface.co/spaces/openfree/PDF-RAG 🔧 Special Features: • 📊 Support for PDF/text files up to 2MB • 🎯 Precise context understanding • ⚡ Fast response time • 🔒 Secure file handling Full source code available - ready to integrate into your projects! #RAG #NLP #Chatbot #OpenSource #PDFProcessing

upvoted a collection about 24 hours ago

DeepSeek-R1-abliterated

liked a Space 2 days ago

Qwen/Qwen2.5-Max-Demo

View all activity

Organizations

None yet

Nielly's activity

reacted to openfree's post with 🔥 about 16 hours ago

Post

4424

📚 Multilingual RAG Chatbot with PDF Support

Chat naturally with your documents! 🌟

✨ Key Features:
• 🌏 Multilingual Q&A support (English, Korean, etc.)
• 📄 Real-time PDF and text file processing
• 🔍 Context-aware accurate responses
• ⚡ Intuitive Chainlit-powered chat interface

🛠️ Tech Stack:
• 💻 Clean, documented open-source code
• 🤝 User-friendly Chainlit UI
• 📊 Vector database for efficient retrieval
• 🔄 Real-time streaming responses

📱 Try it now!
→ Demo: openfree/PDF-RAG

🔧 Special Features:
• 📊 Support for PDF/text files up to 2MB
• 🎯 Precise context understanding
• ⚡ Fast response time
• 🔒 Secure file handling

Full source code available - ready to integrate into your projects!

#RAG #NLP #Chatbot #OpenSource #PDFProcessing

upvoted a collection about 24 hours ago

DeepSeek-R1-abliterated

Collection

6 items • Updated 1 day ago • 13

liked a Space 2 days ago

Running

237

🐢

Qwen2.5 Max Demo

liked a model 2 days ago

chenguolin/DiffSplat

Updated 2 days ago • 7

upvoted 2 papers 2 days ago

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published 3 days ago • 14

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published 2 days ago • 48

reacted to victor's post with 🚀 2 days ago

Post

2745

Finally, an open-source AI that turns your lyrics into full songs is here—meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot

replied to dylanebert's post 2 days ago

I don’t really think it’s a side project.

upvoted a paper 2 days ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 5 days ago • 41

upvoted a collection 2 days ago

2025 January Papers 🧐

Collection

10 items • Updated 2 days ago • 4

liked 2 models 2 days ago

m-a-p/YuE-s1-7B-anneal-en-cot

Text Generation • Updated about 12 hours ago • 5.27k • 258

Aryanne/YuE-s1-7B-anneal-en-cot-Q6_K-GGUF

Text Generation • Updated 3 days ago • 138 • 5

reacted to nicolay-r's post with 👀 3 days ago

Post

1734

📢 For those who wish to apply DeepSeek-R1 for handling tabular / streaming data using schema of prompts (CoT), the OpenRouter AI hosts API for accessing:
https://openrouter.ai/deepseek/deepseek-r1

The no-string option to quick start with using DeepSeek-R1 includes three steps:
✅ OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
✅ Bulk-chain for infering data: https://github.com/nicolay-r/bulk-chain
✅ Json Schema for Chain-of-Though reasoning (see screenshot 📷 below)

📺 below is a screenshot of how to quick start the demo, in which you can test your schema for LLM responses. It would ask to type all the parameters first for completing the requests (which is text within this example).

📃 To apply it for JSONL/CSV data, you can use --src shell parameter for passing the related file

⏳ As for time, OpenRouter finds me relatively slow with 30~40 seconds per request

Models:
deepseek-ai/DeepSeek-R1

1 reply

liked a dataset 3 days ago

cais/hle

Viewer • Updated 8 days ago • 3k • 1.61k • 133

reacted to lewtun's post with 🔥🚀 3 days ago

Post

9473

We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

🧪 Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

🔥 Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1