Commit History
support for true batches with multipack (#1230)
00568c1
unverified
winglian
commited on
Multipack simplify for Mixtral (#1142)
6910e6a
unverified
winglian
commited on
optimize calculation of cu_seqlens from position_ids (#1084) [skip ci]
90036eb
unverified
winglian
commited on
Implement fused modules (#747)
15d3a65
unverified
Attention mask and position id fixes for packing (#285)
2bb0b78
unverified
winglian
commited on