Collection related to the paper, "Training a Generally Curious Agent" (Project page: https://paprika-llm.github.io/)
			
	
	Fahim Tajwar
ftajwar
		AI & ML interests
LLMs, RLHF
		Recent Activity
						updated 
								a collection
							
						30 days ago
						
					Self-Rewarding-LLM-Training
						
						updated 
								a collection
							
						30 days ago
						
					Self-Rewarding-LLM-Training
						
						updated
								a dataset
							
						30 days ago
						
					
						
						
						
						ftajwar/evaluation_bitwise_arithmetic-2