arxiv:2402.13991
Szymon Tworkowski
syzymon
AI & ML interests
Language models, theorem proving and much more!
Recent Activity
authored
a paper
about 2 months ago
Magnushammer: A Transformer-based Approach to Premise Selection
authored
a paper
about 2 months ago
Structured Packing in LLM Training Improves Long Context Utilization
authored
a paper
about 2 months ago
Hierarchical Transformers Are More Efficient Language Models
Organizations
None yet
models
5
syzymon/long_llama_code_7b_instruct
Text Generation
•
Updated
•
39
•
11
syzymon/long_llama_code_7b
Text Generation
•
Updated
•
31
•
31
syzymon/long_llama_3b
Text Generation
•
Updated
•
102
•
120
syzymon/long_llama_3b_instruct
Text Generation
•
Updated
•
1.21k
•
25
syzymon/long_llama_3b_v1_1
Text Generation
•
Updated
•
19
•
10
datasets
None public yet