Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
4
3
sriram vasudevan
sriramvasu
Follow
Temoxtrading01's profile picture
1 follower
ยท
3 following
AI & ML interests
None yet
Recent Activity
liked
a model
5 days ago
deepseek-ai/DeepSeek-R1
reacted
to
merve
's
post
with ๐
5 days ago
smolagents can see ๐ฅ we just shipped vision support to smolagents ๐ค agentic computers FTW you can now: ๐ป let the agent get images dynamically (e.g. agentic web browser) ๐ pass images at the init of the agent (e.g. chatting with documents, filling forms automatically etc) with few LoC change! ๐คฏ you can use transformers models locally (like Qwen2VL) OR plug-in your favorite multimodal inference provider (gpt-4o, antrophic & co) ๐ค read our blog http://hf.co/blog/smolagents-can-see
reacted
to
chansung
's
post
with ๐
5 days ago
Simple summarization of Evolving Deeper LLM Thinking (Google DeepMind) The process starts by posing a question. 1) The LLM generates initial responses. 2) These generated responses are evaluated according to specific criteria (program-based checker). 3) The LLM critiques the evaluated results. 4) The LLM refines the responses based on the evaluation, critique, and original responses. The refined response is then fed back into step 2). If it meets the criteria, the process ends. Otherwise, the algorithm generates more responses based on the refined ones (with some being discarded, some remaining, and some responses potentially being merged). Through this process, it demonstrated excellent performance in complex scheduling problems (travel planning, meeting scheduling, etc.). It's a viable method for finding highly effective solutions in specific scenarios. However, there are two major drawbacks: ๐ค An excessive number of API calls are required. (While the cost might not be very high, it leads to significant latency.) ๐ค The evaluator is program-based. (This limits its use as a general method. It could potentially be modified/implemented using LLM as Judge, but that would introduce additional API costs for evaluation.) https://arxiv.org/abs/2501.09891
View all activity
Organizations
None yet
sriramvasu
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
5 days ago
deepseek-ai/DeepSeek-R1
Text Generation
โข
Updated
5 days ago
โข
498k
โข
5.4k
liked
a Space
6 months ago
Running
on
Zero
4.14k
๐๏ธ๐จ
FLUX.1 [Schnell]
liked
a model
12 months ago
nvidia/parakeet-tdt-1.1b
Automatic Speech Recognition
โข
Updated
Apr 30, 2024
โข
160k
โข
85