Zheng Zian (Andy)
OrionZheng
AI & ML interests
LLM, Mixture-of-Experts, Data-Centric AI
Recent Activity
new activity (4 days ago): OrionZheng/openmoe-8b: Model source code
liked a dataset (3 months ago): MixEval/MixEval-X
authored a paper (3 months ago): OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models
Organizations
None yet
Papers: 2
Models: 11
OrionZheng/openmoe-34b-200B • Text Generation • Updated • 3 • 11
OrionZheng/openmoe-8b-chat • Text Generation • Updated • 12 • 8
OrionZheng/openmoe-8b • Text Generation • Updated • 11 • 3
OrionZheng/openmoe-8b-1T • Text Generation • Updated • 77 • 2
OrionZheng/openmoe-8b-800B • Text Generation • Updated • 5 • 1
OrionZheng/openmoe-8b-600B • Text Generation • Updated • 5
OrionZheng/openmoe-8b-400B • Text Generation • Updated • 3
OrionZheng/openmoe-8b-200B • Text Generation • Updated • 5 • 2
OrionZheng/openmoe-base • Text Generation • Updated • 186 • 4
OrionZheng/openmoe-8b-890B • Text Generation • Updated • 6
Datasets
None public yet