Prithiv Sakthi's picture

Prithiv Sakthi PRO

prithivMLmods

AI & ML interests

computer vision, multimodality, realism engine adapters @starngerzonehf

Recent Activity

Articles

Organizations

Stanford AI's profile picture DataScienceEngineering's profile picture AI FILMS's profile picture Samsung Electronics's profile picture MISATO-dataset's profile picture GEM benchmark's profile picture OpenGVLab's profile picture MusicAI's profile picture BigScience Biomedical Datasets's profile picture OpenVINO Toolkit's profile picture LLMs's profile picture ONNXConfig for all's profile picture Gradio-Themes-Party's profile picture scikit-learn's profile picture lora concepts library's profile picture Open-Source AI Meetup's profile picture Kornia AI's profile picture Universitรฉ Dauphine-PSL's profile picture Platzi Community's profile picture Tune a video concepts library's profile picture Keras Dreambooth Event's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture The Waifu Research Department's profile picture Musika's profile picture Blog-explorers's profile picture OpenSky's profile picture AI Tamil Nadu's profile picture OpenLLM France's profile picture huggingPartyParis's profile picture Team Tonic's profile picture That Time I got Reincarnated as a Hugging Face Organization's profile picture LocalLLaMA's profile picture Major TOM's profile picture MLX Community's profile picture C4AI Community's profile picture M4-ai's profile picture Chinese LLMs on Hugging Face's profile picture ONNX Community's profile picture Dataset Tools's profile picture Nerdy Face's profile picture Stranger Zone's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture

prithivMLmods's activity

reacted to not-lain's post with ๐Ÿค— about 17 hours ago
reacted to hexgrad's post with ๐Ÿš€๐Ÿš€๐Ÿš€๐Ÿš€ 1 day ago
replied to their post 1 day ago
view reply

Iโ€™m not sure, but if context continues to evolve in the future, the real OpenAI will improve more and more compared to paid AI providers. People will increasingly use open-source models for their domain-specific tasks.

posted an update 1 day ago
view post
Post
3014
Deepswipe by
.
.
.
. Deepseek๐Ÿฌ๐Ÿ—ฟ






Everything is now in recovery. ๐Ÿ“‰๐Ÿ“ˆ
ยท
reacted to AdinaY's post with ๐Ÿ”ฅ 2 days ago
reacted to m-ric's post with ๐Ÿ”ฅ 2 days ago
view post
Post
2807
๐—ง๐—ต๐—ฒ ๐—›๐˜‚๐—ฏ ๐˜„๐—ฒ๐—น๐—ฐ๐—ผ๐—บ๐—ฒ๐˜€ ๐—ฒ๐˜…๐˜๐—ฒ๐—ฟ๐—ป๐—ฎ๐—น ๐—ถ๐—ป๐—ณ๐—ฒ๐—ฟ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ฝ๐—ฟ๐—ผ๐˜ƒ๐—ถ๐—ฑ๐—ฒ๐—ฟ๐˜€!

โœ… Hosting our own inference was not enough: now the Hub 4 new inference providers: fal, Replicate, SambaNova Systems, & Together AI.

Check model cards on the Hub: you can now, in 1 click, use inference from various providers (cf video demo)

Their inference can also be used through our Inference API client. There, you can use either your custom provider key, or your HF token, then billing will be handled directly on your HF account, as a way to centralize all expenses.

๐Ÿ’ธ Also, PRO users get 2$ inference credits per month!

Read more in the announcement ๐Ÿ‘‰ https://huggingface.co/blog/inference-providers
  • 1 reply
ยท
replied to fdaudens's post 2 days ago
reacted to victor's post with ๐Ÿš€ 2 days ago
view post
Post
2745
Finally, an open-source AI that turns your lyrics into full songs is hereโ€”meet YuE! Unlike other tools that only create short clips, YuE can make entire songs (up to 5 minutes) with vocals, melody, and instruments all working together. Letsss go!

m-a-p/YuE-s1-7B-anneal-en-cot
  • 1 reply
ยท
reacted to clem's post with โค๏ธ 3 days ago
view post
Post
6654
AI is not a zero-sum game. Open-source AI is the tide that lifts all boats!
reacted to fdaudens's post with โค๏ธ 3 days ago
view post
Post
7543
Yes, DeepSeek R1's release is impressive. But the real story is what happened in just 7 days after:

- Original release: 8 models, 540K downloads. Just the beginning...

- The community turned those open-weight models into +550 NEW models on Hugging Face. Total downloads? 2.5Mโ€”nearly 5X the originals.

The reason? DeepSeek models are open-weight, letting anyone build on top of them. Interesting to note that the community focused on quantized versions for better efficiency & accessibility. They want models that use less memory, run faster, and are more energy-efficient.

When you empower builders, innovation explodes. For everyone. ๐Ÿš€

The most popular community model? @bartowski 's DeepSeek-R1-Distill-Qwen-32B-GGUF version โ€” 1M downloads alone.
ยท
reacted to nicolay-r's post with ๐Ÿ”ฅ 4 days ago
view post
Post
1734
๐Ÿ“ข For those who wish to apply DeepSeek-R1 for handling tabular / streaming data using schema of prompts (CoT), the OpenRouter AI hosts API for accessing:
https://openrouter.ai/deepseek/deepseek-r1

The no-string option to quick start with using DeepSeek-R1 includes three steps:
โœ… OpenRouter provider: https://github.com/nicolay-r/nlp-thirdgate/blob/master/llm/open_router.py
โœ… Bulk-chain for infering data: https://github.com/nicolay-r/bulk-chain
โœ… Json Schema for Chain-of-Though reasoning (see screenshot ๐Ÿ“ท below)

๐Ÿ“บ below is a screenshot of how to quick start the demo, in which you can test your schema for LLM responses. It would ask to type all the parameters first for completing the requests (which is text within this example).

๐Ÿ“ƒ To apply it for JSONL/CSV data, you can use --src shell parameter for passing the related file

โณ As for time, OpenRouter finds me relatively slow with 30~40 seconds per request

Models:
deepseek-ai/DeepSeek-R1
  • 1 reply
ยท
reacted to AdinaY's post with ๐Ÿ”ฅ 6 days ago
reacted to burtenshaw's post with ๐Ÿคฏ 7 days ago
view post
Post
2082
AI was built on side projects!
reacted to AdinaY's post with ๐Ÿ”ฅ 7 days ago
reacted to AdinaY's post with ๐Ÿง  8 days ago
reacted to sharpenb's post with ๐Ÿš€ 8 days ago
reacted to JingzeShi's post with ๐Ÿ”ฅ 9 days ago
view post
Post
1593
๐Ÿคฉwarmup -> stable -> decay leanring rate scheduler:
๐Ÿ˜Žuse the Stable Phase CheckPoints to Continue Training the model on Any New Dataset without spikes of the training!!!
JingzeShi/Doge-20M-checkpoint
JingzeShi/Doge-60M-checkpoint
ยท