2 25 136

Kostis Gourgoulias

kgourgou

http://kgourgou.me

AI & ML interests

Language modeling, few-shot learning, bayesian inference, information theory, uncertainty quantification.

Recent Activity

liked a model 12 days ago

internlm/Intern-S1-mini

liked a model 20 days ago

LiquidAI/LFM2-350M

liked a model 22 days ago

LiquidAI/LFM2-1.2B

View all activity

Organizations

upvoted 2 articles about 2 months ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

and 5 others •

Feb 4

• 106

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

and 1 other •

Jul 9

• 663

upvoted an article 3 months ago

Article

MCP is at a Tipping Point: Here's Why You Should Care

•

Jun 10

• 17

upvoted a paper 3 months ago

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 132

upvoted a collection 5 months ago

SuperBPE

Collection

SuperBPE tokenizers and models trained with them • 8 items • Updated Apr 10 • 15

upvoted a collection 6 months ago

Hallucination detection

Collection

Trained ModernBERT (base and large) for detection hallucinations in LLM responses. The models are trained as token classifications. • 4 items • Updated May 18 • 17

upvoted 2 papers 7 months ago

An Open Recipe: Adapting Language-Specific LLMs to a Reasoning Model in One Day via Model Merging

Paper • 2502.09056 • Published Feb 13 • 32

Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6 • 23

upvoted an article 8 months ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

•

Jan 15

• 206

upvoted 2 papers about 1 year ago

Can Large Language Models Infer Causation from Correlation?

Paper • 2306.05836 • Published Jun 9, 2023 • 6

TextGrad: Automatic "Differentiation" via Text

Paper • 2406.07496 • Published Jun 11, 2024 • 32

upvoted a collection over 1 year ago

Models and Linearity

Collection

2 items • Updated Jun 14, 2024 • 1

upvoted 2 papers over 1 year ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19, 2024 • 159

Not All Language Model Features Are Linear

Paper • 2405.14860 • Published May 23, 2024 • 42

upvoted 2 articles over 1 year ago

Article

Let's talk about LLM evaluation

•

May 23, 2024

• 184

Article

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

and 3 others •

Apr 22, 2024

• 81

upvoted 2 papers over 1 year ago

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Paper • 2309.15223 • Published Sep 26, 2023 • 21

Resolving Interference When Merging Models

Paper • 2306.01708 • Published Jun 2, 2023 • 15

upvoted a collection over 1 year ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 246

upvoted a paper over 1 year ago

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Paper • 2203.05482 • Published Mar 10, 2022 • 7

Kostis Gourgoulias

AI & ML interests

Recent Activity

Organizations

kgourgou's activity

DABStep: Data Agent Benchmark for Multi-step Reasoning

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

MCP is at a Tipping Point: Here's Why You Should Care

Train 400x faster Static Embedding Models with Sentence Transformers

Let's talk about LLM evaluation

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent