Nielly

Nielly
Β·

AI & ML interests

None yet

Recent Activity

Organizations

None yet

Nielly's activity

reacted to openfree's post with πŸ”₯ about 16 hours ago
view post
Post
4424
πŸ“š Multilingual RAG Chatbot with PDF Support

Chat naturally with your documents! 🌟

✨ Key Features:
β€’ 🌏 Multilingual Q&A support (English, Korean, etc.)
β€’ πŸ“„ Real-time PDF and text file processing
β€’ πŸ” Context-aware accurate responses
β€’ ⚑ Intuitive Chainlit-powered chat interface

πŸ› οΈ Tech Stack:
β€’ πŸ’» Clean, documented open-source code
β€’ 🀝 User-friendly Chainlit UI
β€’ πŸ“Š Vector database for efficient retrieval
β€’ πŸ”„ Real-time streaming responses

πŸ“± Try it now!
β†’ Demo: openfree/PDF-RAG

πŸ”§ Special Features:
β€’ πŸ“Š Support for PDF/text files up to 2MB
β€’ 🎯 Precise context understanding
β€’ ⚑ Fast response time
β€’ πŸ”’ Secure file handling

Full source code available - ready to integrate into your projects!

#RAG #NLP #Chatbot #OpenSource #PDFProcessing
reacted to victor's post with πŸš€ 2 days ago
view post
Post
2745
Finally, an open-source AI that turns your lyrics into full songs is hereβ€”meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot
replied to dylanebert's post 2 days ago
view reply

I don’t really think it’s a side project.

reacted to nicolay-r's post with πŸ‘€ 3 days ago
view post
Post
1734
πŸ“’ For those who wish to apply DeepSeek-R1 for handling tabular / streaming data using schema of prompts (CoT), the OpenRouter AI hosts API for accessing:
https://openrouter.ai/deepseek/deepseek-r1

The no-string option to quick start with using DeepSeek-R1 includes three steps:
βœ… OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
βœ… Bulk-chain for infering data: https://github.com/nicolay-r/bulk-chain
βœ… Json Schema for Chain-of-Though reasoning (see screenshot πŸ“· below)

πŸ“Ί below is a screenshot of how to quick start the demo, in which you can test your schema for LLM responses. It would ask to type all the parameters first for completing the requests (which is text within this example).

πŸ“ƒ To apply it for JSONL/CSV data, you can use --src shell parameter for passing the related file

⏳ As for time, OpenRouter finds me relatively slow with 30~40 seconds per request

Models:
deepseek-ai/DeepSeek-R1
  • 1 reply
Β·
reacted to lewtun's post with πŸ”₯πŸš€ 3 days ago
view post
Post
9473
We are reproducing the full DeepSeek R1 data and training pipeline so everybody can use their recipe. Instead of doing it in secret we can do it together in the open!

πŸ§ͺ Step 1: replicate the R1-Distill models by distilling a high-quality reasoning corpus from DeepSeek-R1.

🧠 Step 2: replicate the pure RL pipeline that DeepSeek used to create R1-Zero. This will involve curating new, large-scale datasets for math, reasoning, and code.

πŸ”₯ Step 3: show we can go from base model -> SFT -> RL via multi-stage training.

Follow along: https://github.com/huggingface/open-r1
Β·
reacted to AdinaY's post with πŸš€ 4 days ago
view post
Post
2487
πŸ”₯So many exciting releases coming from the Chinese community this month!
zh-ai-community/2025-january-6786b054f492fb223591269e

LLMs:
✨ Qwen2.5 -1M by Alibaba
Qwen/qwen25-1m-679325716327ec07860530ba
✨ InternLM3-8B-Instruct by Shanghai AI Lab
internlm/internlm3-8b-instruct
✨ MiniMax-Text-01 by MiniMax AI
MiniMaxAI/MiniMax-Text-01
✨ RWKV-7 by BlinkDL -- RNN + Transformer πŸ‘€
BlinkDL/rwkv-7-world
✨ DeepSeek-R1 by DeepSeek -- THE ONE πŸ™Œ
https://huggingface.co/deepseek-ai
✨ Baichuan-M1-14B by Baichuan - Medical 🩺
baichuan-inc/Baichuan-M1-14B-Base
✨ Qwen2.5-Math-PRM by Alibaba - Math πŸ”’
Qwen/Qwen2.5-Math-PRM-7B

Code:
✨ Tare by Bytedance
https://trae.ai

TTS:
✨ T2A-01-HD by MiniMax AI
https://hailuo.ai/audio
✨ LLaSA by HKUST Audio
HKUSTAudio/Llasa-3B

MLLM:
✨ Kimi k1.5 by Moonshot AI
https://kimi.ai
✨ MiniCPM-o-2_6 by OpenBMB
openbmb/MiniCPM-o-2_6
✨ Sa2VA-4B by ByteDance
ByteDance/Sa2VA-4B
✨ VideoLLaMA 3 by Alibaba DAMO
DAMO-NLP-SG/videollama3-678cdda9281a0e32fe79af15
✨ LLaVA-Mini by Chinese Academy of Sciences
ICTNLP/llava-mini-llama-3.1-8b
✨Hunyuan-7B by Tencent
tencent/Hunyuan-7B-Instruct
✨ Hunyuan 3D 2.0 by Tencent
tencent/Hunyuan3D-2
✨MiniMax-VL-01 by MiniMax AI - A non transformer based VLM πŸ‘€
MiniMaxAI/MiniMax-VL-01

Agent:
✨ UI-TARS by Bytedance
bytedance-research/UI-TARS-7B-SFT
✨ GLM-PC by Zhipu AI
https://cogagent.aminer.cn

Dataset:
✨ Fineweb-Edu-Chinese by Opencsg
opencsg/Fineweb-Edu-Chinese-V2.1
✨ Multimodal_textbook by Alibaba
DAMO-NLP-SG/multimodal_textbook
✨ MME-Finance by Hithink AI
Β·