Friedrich Marty

Smorty100

https://gitlab.com/users/Marty_Friedrich/projects

AI & ML interests

I'm most interested in content rerouting between LLM and VLLM agents for automation possibilities. Using a template for each agent, which is then filled in with another agent's inputs, seems really useful.
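A minimal sketch of that template idea, assuming each agent owns a prompt template whose placeholders are filled from another agent's output. The template text, the field names, and the `fill_template` helper are all hypothetical and only illustrate the pattern.

```python
from string import Template

# Hypothetical template owned by a "summarizer" agent; field names are illustrative.
SUMMARIZER_TEMPLATE = Template(
    "You are a summarizer.\n"
    "Image description: $image_description\n"
    "Task: $task"
)


def fill_template(template: Template, upstream_output: dict[str, str]) -> str:
    """Fill one agent's prompt template with fields produced by another agent."""
    # substitute() raises KeyError if the upstream agent omitted a required field.
    return template.substitute(upstream_output)


# Stand-in for what a vision-language agent might return (made-up values).
vlm_output = {
    "image_description": "a bar chart of monthly sales",
    "task": "summarize the trend",
}
prompt = fill_template(SUMMARIZER_TEMPLATE, vlm_output)
print(prompt)
```

Failing loudly on a missing field is a deliberate choice here: a half-filled prompt passed on to the next agent is harder to debug than an early error.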

Recent Activity

liked a model about 24 hours ago

google/gemma-3-27b-it

reacted to pidou's post with 😎 3 days ago

testing post

Organizations

None yet

Smorty100's activity

liked a model about 24 hours ago

google/gemma-3-27b-it

Image-Text-to-Text • Updated 5 days ago • 190k • 669

reacted to pidou's post with 😎 3 days ago

Post

1623

testing post

New activity in Qwen/QwQ-32B-GGUF 6 days ago

Create params

#4 opened 7 days ago by

jklj077

reacted to onekq's post with 👍 7 days ago

Post

3241

QwQ-32B is amazing!

It ranks below o1-preview, but beats DeepSeek v3 and all Gemini models.
onekq-ai/WebApp1K-models-leaderboard

Now we have such a powerful model that can fit into a single GPU, can someone finetune a web app model to push SOTA of my leaderboard? 🤗

1 reply

replied to onekq's post 7 days ago

to me it's a bit weird to see QwQ not get the hype it... should deserve.

it's a crazy good model, can be run LOCALLY with non-business-level GPUs and actually performs supr gud even compared to the huge gigantic V3 model.

Even for smaller businesses, having a completely local and secure LLM solution like this MUST have some value, right?

like - huh? people should be doin backflips, like i am

*flip* *flip* *flip* *flip*

anyway, i go play more with... cloud-hosted free LLMs (codestral 25.01) which probably do collect and train on my data... *sigh*

replied to clem's post 7 days ago

i very much agree.
it really seems like many models just push for that initial completion, which in many cases, can't even occur (like with lookup tools)
some models really do just.... execute like - 12 actions at a time to get them all in in one block.

reacted to clem's post with 👍❤️ 7 days ago

Post

7090

I was chatting with @peakji, one of the cofounders of Manus AI, who told me he was on Hugging Face (very cool!).

He shared an interesting insight, which is that agentic capabilities might be more of an alignment problem than a foundational capability issue. Similar to the difference between GPT-3 and InstructGPT, some open-source foundation models are simply trained to 'answer everything in one response regardless of the complexity of the question' - after all, that's the user preference in chatbot use cases. Just a bit of post-training on agentic trajectories can make an immediate and dramatic difference.

As a thank you to the community, he shared 100 invite codes, first come, first served. Just use “HUGGINGFACE” to get access!

6 replies

liked a model 11 days ago

Qwen/QwQ-32B

Text Generation • Updated 5 days ago • 370k • 2.26k

reacted to WENGSYX's post with 😔 11 days ago

Post

1670

🔬 Exciting Research Breakthrough! 🚀
We've developed new AI research assistant LLMs, trained through RL, that can:
- Generate research ideas from reference literature
- Preview potential research methodologies
- Automatically draft research reports
- Transform experimental results directly into academic papers! 📝

See in -> WestlakeNLP/CycleResearcher-12B

Check out our free demo at http://ai-researcher.cn and experience the future of academic research workflows. 🌐

Proud to share that our work has been accepted as a Poster at ICLR 2025! 🏆 #AIResearch #AcademicInnovation #MachineLearning

reacted to nroggendorff's post with ❤️ 13 days ago

Post

2806

We're using RLHF on diffusion models, right? Just making sure..

4 replies

reacted to singhsidhukuldeep's post with 👍 13 days ago

Post

6753

Exciting New Tool for Knowledge Graph Extraction from Plain Text!

I just came across a groundbreaking new tool called KGGen that's solving a major challenge in the AI world - the scarcity of high-quality knowledge graph data.

KGGen is an open-source Python package that leverages language models to extract knowledge graphs (KGs) from plain text. What makes it special is its innovative approach to clustering related entities, which significantly reduces sparsity in the extracted KGs.

The technical approach is fascinating:

1. KGGen uses a multi-stage process involving an LLM (GPT-4o in their implementation) to extract entities and relations from source text
2. It aggregates graphs across sources to reduce redundancy
3. Most importantly, it applies iterative LM-based clustering to refine the raw graph

The clustering stage is particularly innovative - it identifies which nodes and edges refer to the same underlying entities or concepts. This normalizes variations in tense, plurality, stemming, and capitalization (e.g., "labors" clustered with "labor").
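To make the normalization idea concrete, here is a toy sketch. It is not KGGen's method: the post describes iterative LM-based clustering, while this stand-in uses a crude hand-written rule (lowercasing and stripping a trailing plural "s") purely to show how surface variants can collapse into one node. The function names are hypothetical.

```python
from collections import defaultdict


def normalize(label: str) -> str:
    """Crude canonical key: trim, lowercase, and strip a trailing plural 's'."""
    key = label.strip().lower()
    # Keep short words and words ending in 'ss' (e.g. "class") untouched.
    if len(key) > 3 and key.endswith("s") and not key.endswith("ss"):
        key = key[:-1]
    return key


def cluster_labels(labels: list[str]) -> dict[str, list[str]]:
    """Group entity labels that share the same canonical key."""
    clusters: dict[str, list[str]] = defaultdict(list)
    for label in labels:
        clusters[normalize(label)].append(label)
    return dict(clusters)


clusters = cluster_labels(["labors", "Labor", "labor", "Stanford"])
print(clusters)
# {'labor': ['labors', 'Labor', 'labor'], 'stanford': ['Stanford']}
```

A rule like this breaks quickly on irregular plurals and synonyms, which is presumably why an LM is used for this step in the tool the post describes.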

The researchers from Stanford and University of Toronto also introduced MINE (Measure of Information in Nodes and Edges), the first benchmark for evaluating KG extractors. When tested against existing methods like OpenIE and GraphRAG, KGGen outperformed them by up to 18%.

For anyone working with knowledge graphs, RAG systems, or KG embeddings, this tool addresses the fundamental challenge of data scarcity that's been holding back progress in graph-based foundation models.

The package is available via pip install kg-gen, making it accessible to everyone. This could be a game-changer for knowledge graph applications!

liked a model 13 days ago

GSAI-ML/LLaDA-8B-Instruct

Text Generation • Updated 18 days ago • 30.6k • 218

New activity in huggingchat/chat-ui 18 days ago

Deepseek r1 32b model is reasoning less and often answering without accuracy

#673 opened about 1 month ago by

rishadsojon

New activity in mradermacher/Rombo-LLM-V3.0-Qwen-32b-i1-GGUF 22 days ago

Please make i1 quants of my latest 72b model

#1 opened 25 days ago by

rombodawg

New activity in perplexity-ai/r1-1776 23 days ago

This is not "uncensored". This is just anti-china.

#160 opened 23 days ago by

Smorty100

liked a model 25 days ago

tomg-group-umd/huginn-0125

Text Generation • Updated 21 days ago • 8.35k • 242

reacted to Reality123b's post with 😔 27 days ago

Post

2208

https://huggingface.co/posts/Reality123b/533143502736808
Since many of you upvoted that post, I'm open-sourcing this on 19th February 2025.

I don't know, but, this may be the "smartest AI on earth". im not totally sure.
also, i need some kind of help with the UI coz i suck at that.

updated a Space about 1 month ago

First Agent Template

⚡

Get current time in any timezone
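A tool like the one this Space describes could look roughly like the sketch below. This is an assumption, not the Space's actual code: the function name and return format are hypothetical, and it uses Python's standard-library `zoneinfo` module, which needs time zone data available on the system.

```python
from datetime import datetime
from zoneinfo import ZoneInfo, ZoneInfoNotFoundError


def get_current_time_in_timezone(timezone: str) -> str:
    """Return the current local time in an IANA timezone such as 'Europe/Berlin'."""
    try:
        now = datetime.now(ZoneInfo(timezone))
    except (ZoneInfoNotFoundError, ValueError) as exc:
        # Unknown or malformed timezone names come back as a message, not a crash,
        # so an agent calling this tool can read the error and retry.
        return f"Error: unknown timezone '{timezone}': {exc}"
    return f"The current local time in {timezone} is {now:%Y-%m-%d %H:%M:%S}"


print(get_current_time_in_timezone("Europe/Berlin"))
print(get_current_time_in_timezone("Not/AZone"))
```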