TMLR Group

university

https://bhanml.github.io/group.html

tmlrgroup

tmlr-group

Activity Feed

AI & ML interests

Trustworthy Machine Learning and Reasoning

Recent Activity

resistz new activity about 2 months ago

TMLR-Group-HF/GT-Qwen3-4B-Base-DAPO14k:Improve model card: Add `transformers` library, pipeline tag, paper link, and abstract

resistz new activity about 2 months ago

TMLR-Group-HF/GT-Llama-3.2-3B-Instruct-DAPO14k:Improve model card: Add pipeline tag, library name, and paper link

resistz new activity about 2 months ago

TMLR-Group-HF/Self-Certainty-Qwen3-8B-Base-DAPO14k:Improve model card: Add pipeline tag, library name, and paper link

View all activity

Organization Card

Community About org cards

Trustworthy Machine Learning and Reasoning (TMLR) Group, an online-offline-mixed machine learning research group, locates in different cities, including Hong Kong, Melbourne, Shanghai, Nottingham and Sydney. We share the vision for the future ML technology: building trustworthy learning and reasoning algorithms, theories and systems.

Collections 2

models 66

datasets 5

TMLR-Group-HF/Co-rewarding-RephrasedDAPO-14k

Viewer • Updated Oct 11 • 14.1k • 42

TMLR-Group-HF/Co-rewarding-RephrasedMATH

Viewer • Updated Oct 11 • 7.5k • 64

TMLR-Group-HF/Co-rewarding-RephrasedOpenRS

Viewer • Updated Oct 11 • 7k • 50

TMLR-Group-HF/NoRa

Viewer • Updated May 1 • 185k • 30 • 2

TMLR-Group-HF/counteranimal

Viewer • Updated Apr 21 • 13.3k • 44 • 1

TMLR Group

AI & ML interests

Recent Activity

Collections 2

TMLR-Group-HF/Co-rewarding-RephrasedMATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-3B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-7B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen3-1.7B-Base-MATH

TMLR-Group-HF/NoRa

Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?

TMLR-Group-HF/Co-rewarding-RephrasedMATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-3B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-7B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen3-1.7B-Base-MATH

TMLR-Group-HF/NoRa

Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?

models 66

TMLR-Group-HF/GT-Qwen3-4B-Base-DAPO14k

TMLR-Group-HF/GT-Llama-3.2-3B-Instruct-DAPO14k

TMLR-Group-HF/Self-Certainty-Qwen3-8B-Base-DAPO14k

TMLR-Group-HF/Self-Certainty-Qwen3-4B-Base-DAPO14k

TMLR-Group-HF/Entropy-Qwen3-4B-Base-DAPO14k

TMLR-Group-HF/Entropy-Llama-3.2-3B-Instruct-DAPO14k

TMLR-Group-HF/Majority-Voting-Qwen3-8B-Base-DAPO14k

TMLR-Group-HF/Majority-Voting-Qwen3-4B-Base-DAPO14k

TMLR-Group-HF/Co-rewarding-I-Qwen3-8B-Base-OpenRS

TMLR-Group-HF/Majority-Voting-Llama-3.2-3B-Instruct-DAPO14k

datasets 5

TMLR-Group-HF/Co-rewarding-RephrasedDAPO-14k

TMLR-Group-HF/Co-rewarding-RephrasedMATH

TMLR-Group-HF/Co-rewarding-RephrasedOpenRS

TMLR-Group-HF/NoRa

TMLR-Group-HF/counteranimal

AI & ML interests

Recent Activity

Team members 13

Collections 2

models 66 Sort: Recently updated

datasets 5 Sort: Recently updated

models 66

datasets 5