Commit History
improve llama pad token handling (#475)
		cb9797e
	
		
		unverified
	support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348)
		d2e7f27
	
		
		unverified
	add utils.data.prepare_dataset
		2e22404
	
		
		
	use context manager to run things on rank0 before others (#397)
		fc2d6be
	
		
		unverified
	Attention mask and position id fixes for packing (#285)
		2bb0b78
	
		
		unverified
	experimental llama 2 chat support (#296)
		3392270
	
		
		unverified
	
		Jan Philipp Harries
		
		Jan Philipp Harries
		
	commited on
		
		
 
		 
		 
		