Several models trained to compare the differences between methods. Each model comes with a complete description of its hyperparameters and wandb reports.
			
	
G-reen
AI & ML interests: SFT, DPO, ORPO, LLMs, text-generation
Organizations: None yet

Models (31)
G-reen/adamwbone2epoch5_6lr_test • Text Generation • 8B
G-reen/adamwbone2epoch5_6lr_test_adapter
G-reen/adamwlora2epoch5_6lr_test • Text Generation • 8B • 1
G-reen/adamwlora2epoch5_6lr_test_adapter
G-reen/adamwlora2epoch5_6lr • Text Generation • 8B
G-reen/adamwlora2epoch5_6lr_adapter
G-reen/adamwbat2epoch5_6lr • Text Generation • 8B • 1
G-reen/adamwbat2epoch5_6lr_adapter
G-reen/Qwen2.5-Coder-32b-Instruct-Fp8 • 1
G-reen/Mistral-Small-2501-Instruct-Fp8

Datasets (14)
G-reen/Duet-v0.6 • Viewer • 5k • 10
G-reen/reflexion-agi • Viewer • 5k • 22 • 38
G-reen/TheatreLM-v2.1-Characters • Viewer • 5.01k • 41 • 56
G-reen/Duet-v0.5 • Viewer • 5k • 21 • 20
G-reen/deepmindcodecontestssharegpt • Viewer • 13.1k • 7
G-reen/TheatreLM-v2.0-Settings • Viewer • 200 • 6
G-reen/TheatreLM-v2.0-Characters • Viewer • 1k • 3
G-reen/TheatreLM-v2.1-chats-preview • Viewer • 3.94k • 9
G-reen/TheatreLM-v2.0-chats-preview • Viewer • 264 • 8
G-reen/TheatreLM-v1.0-DPO • Viewer • 1 • 2