Wenjiao Feng's picture

2 2

Wenjiao Feng

NeuronNomad

AI & ML interests

None yet

Organizations

None yet

authored a paper 5 months ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published Apr 7 • 134

authored a paper 11 months ago

TPI-LLM: Serving 70B-scale LLMs Efficiently on Low-resource Edge Devices

Paper • 2410.00531 • Published Oct 1, 2024 • 35