-
CRUXEval: A Benchmark for Code Reasoning, Understanding and Execution
Paper ā¢ 2401.03065 ā¢ Published ā¢ 11 -
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper ā¢ 2401.14196 ā¢ Published ā¢ 60 -
WaveCoder: Widespread And Versatile Enhanced Instruction Tuning with Refined Data Generation
Paper ā¢ 2312.14187 ā¢ Published ā¢ 51 -
On the Effectiveness of Large Language Models in Domain-Specific Code Generation
Paper ā¢ 2312.01639 ā¢ Published ā¢ 1
Collections
Discover the best community collections!
Collections including paper arxiv:2401.14196
-
Self-Rewarding Language Models
Paper ā¢ 2401.10020 ā¢ Published ā¢ 147 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper ā¢ 2401.08967 ā¢ Published ā¢ 30 -
Tuning Language Models by Proxy
Paper ā¢ 2401.08565 ā¢ Published ā¢ 23 -
TrustLLM: Trustworthiness in Large Language Models
Paper ā¢ 2401.05561 ā¢ Published ā¢ 69
-
Attention Is All You Need
Paper ā¢ 1706.03762 ā¢ Published ā¢ 53 -
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Paper ā¢ 1810.04805 ā¢ Published ā¢ 17 -
RoBERTa: A Robustly Optimized BERT Pretraining Approach
Paper ā¢ 1907.11692 ā¢ Published ā¢ 7 -
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Paper ā¢ 1910.01108 ā¢ Published ā¢ 14
-
563
OpenAI TTS New
š -
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
Paper ā¢ 2501.12948 ā¢ Published ā¢ 330 -
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence
Paper ā¢ 2401.14196 ā¢ Published ā¢ 60 -
zed-industries/zeta
Updated ā¢ 1.12k ā¢ 204
-
Understanding LLMs: A Comprehensive Overview from Training to Inference
Paper ā¢ 2401.02038 ā¢ Published ā¢ 63 -
Learning To Teach Large Language Models Logical Reasoning
Paper ā¢ 2310.09158 ā¢ Published ā¢ 1 -
ChipNeMo: Domain-Adapted LLMs for Chip Design
Paper ā¢ 2311.00176 ā¢ Published ā¢ 9 -
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct
Paper ā¢ 2308.09583 ā¢ Published ā¢ 7
-
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper ā¢ 2310.17680 ā¢ Published ā¢ 69 -
MFTCoder: Boosting Code LLMs with Multitask Fine-Tuning
Paper ā¢ 2311.02303 ā¢ Published ā¢ 8 -
A Survey on Language Models for Code
Paper ā¢ 2311.07989 ā¢ Published ā¢ 22 -
Magicoder: Source Code Is All You Need
Paper ā¢ 2312.02120 ā¢ Published ā¢ 82