-
Aligning Instruction Tuning with Pre-training
Paper • 2501.09368 • Published -
Parameter-Efficient Fine-Tuning for Large Models: A Comprehensive Survey
Paper • 2403.14608 • Published -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 53
ROHITH VENKATA REDDY
knight7561
AI & ML interests
Deep learning, Autonomous Driving
Recent Activity
updated
a Space
3 days ago
knight7561/groot
new activity
4 days ago
agents-course/First_agent:Max Tokens reached for initial discussion
published
a Space
4 days ago
knight7561/groot
Organizations
Collections
2
Papers dump of LLM Reasoning domain
-
Internal Consistency and Self-Feedback in Large Language Models: A Survey
Paper • 2407.14507 • Published • 46 -
Large Language Models are Zero-Shot Reasoners
Paper • 2205.11916 • Published • 1 -
Let's Verify Step by Step
Paper • 2305.20050 • Published • 10 -
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Paper • 2201.11903 • Published • 11
spaces
2
models
5
knight7561/SmolLM2_python_coder-FT-ORPO
Text Generation
•
Updated
•
13
knight7561/SmolLM2-FT-DPO-python-code
Text Generation
•
Updated
•
13
knight7561/SmolLM2_python_coder
Text Generation
•
Updated
•
57
knight7561/SmolLM2-eli5_precomputed_top_slice
Text Generation
•
Updated
•
27
knight7561/SmolLM2-FT-MyDataset
Text Generation
•
Updated
•
16
datasets
None public yet