view article Article Building Enterprise-Ready Text Classifiers in Minutes with Adaptive Learning By codelion • 24 days ago • 12
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published 26 days ago • 167
view article Article Unsupervised Model Improvement via Internal Coherence Maximization: Outperforming Human-Supervised Methods Through Self-Elicitation By codelion • about 1 month ago • 7
GLM-4.5 Collection GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 22 days ago • 224
view article Article Understanding Model Reasoning Through Thought Anchors: A Comparative Study of Qwen3 and DeepSeek-R1 By codelion • Jul 23 • 4
Ellora Collection Ellora: Enhancing LLMs with LoRA - Standardized Recipes for Capability Enhancement • 10 items • Updated about 1 month ago • 2
Internal Coherence Maximization Collection Internal Coherence Maximization (ICM): A Label-Free, Unsupervised Training Framework for LLMs • 7 items • Updated about 1 month ago • 2
Pre-training Dataset Samples Collection A collection of pre-training datasets samples of sizes 10M, 100M and 1B tokens. Ideal for use in quick experimentation and ablations. • 9 items • Updated Jul 7 • 3
view article Article Automated Discovery of High-Performance GPU Kernels with OpenEvolve By codelion • Jun 27 • 21
view article Article Adaptive Classifier: Dynamic Text Classification with Continuous Learning By codelion • Jun 20 • 15
ProtoReasoning: Prototypes as the Foundation for Generalizable Reasoning in LLMs Paper • 2506.15211 • Published Jun 18 • 36
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs Paper • 2506.14245 • Published Jun 17 • 42
Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques Paper • 2506.08060 • Published Jun 9 • 8