14 32 190

Ken Tsui

kenhktsui

https://kenhktsui.github.io/

AI & ML interests

ML engineer, researcher VLM, LLM benchmark Opinions are my own

Recent Activity

liked a dataset 4 days ago

VITRA-VLA/VITRA-1M

liked a dataset about 2 months ago

Hothan/OlympiadBench

liked a dataset about 2 months ago

mixture-vitae-backup/MixtureVitae-2TT

View all activity

Organizations

liked a dataset 4 days ago

VITRA-VLA/VITRA-1M

Updated Dec 3, 2025 • 4.49k • 16

liked 2 datasets about 2 months ago

Hothan/OlympiadBench

Viewer • Updated Jun 8, 2025 • 8.48k • 3.77k • 36

mixture-vitae-backup/MixtureVitae-2TT

Viewer • Updated 19 days ago • 418k • 283 • 2

upvoted 2 papers 3 months ago

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published Oct 13, 2025 • 165

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6, 2025 • 501

authored a paper 3 months ago

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29, 2025 • 8

upvoted 3 papers 3 months ago

VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model

Paper • 2509.09372 • Published Sep 11, 2025 • 243

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30, 2025 • 539

MixtureVitae: Open Web-Scale Pretraining Dataset With High Quality Instruction and Reasoning Data Built from Permissive-First Text Sources

Paper • 2509.25531 • Published Sep 29, 2025 • 8

liked a dataset 4 months ago

HuggingFaceFW/finepdfs

Viewer • Updated Dec 2, 2025 • 476M • 24.7k • 693

authored a paper 6 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3, 2025 • 9

commented a paper 6 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3, 2025 • 9 •

upvoted a paper 6 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3, 2025 • 9

commented a paper 6 months ago

Self-Correction Bench: Revealing and Addressing the Self-Correction Blind Spot in LLMs

Paper • 2507.02778 • Published Jul 3, 2025 • 9 •

upvoted a paper 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 75

published a dataset 6 months ago

kenhktsui/num_seq_bench

Viewer • Updated Aug 5, 2024 • 2.12k • 6

published an article 6 months ago

Article

NumSeqBench: Benchmarking Inductive Reasoning in Language Models via Number Sequences

Jul 3, 2025

updated 3 models 6 months ago

Ken Tsui

AI & ML interests

Recent Activity

Organizations

kenhktsui's activity

NumSeqBench: Benchmarking Inductive Reasoning in Language Models via Number Sequences