arxiv:2406.12375
Haoze Wu
WaitHZ
ยท
AI & ML interests
Modular DL, Complex Reasoning
Recent Activity
upvoted
an
article
about 8 hours ago
How to generate text: using different decoding methods for language generation with Transformers
upvoted
an
article
2 days ago
You could have designed state of the art positional encoding
upvoted
a
paper
14 days ago
Sigma: Differential Rescaling of Query, Key and Value for Efficient
Language Models
Organizations
None yet
Papers
1
models
None public yet
datasets
None public yet