Max Glushko's picture
5 14

Max Glushko

Pelmeshek
ยท

AI & ML interests

NLP, LLM, SLM

Recent Activity

Organizations

None yet

Pelmeshek's activity

upvoted an article 26 days ago
view article
Article

Open-source DeepResearch โ€“ Freeing our search agents

โ€ข 1.17k
reacted to Jaward's post with ๐Ÿ‘€ 3 months ago
view post
Post
3100
nanoBLT: Simplified lightweight implementation of a character-level Byte Latent Transformer model (under 500 lines of code). The model is 2x4x2 (n_layers_encoder, n_layers_latent, n_layers_decoder) layer deep trained on ~1M bytes of tiny Shakespeare with a patch size of 4.

Code: https://github.com/Jaykef/ai-algorithms/blob/main/byte_latent_transformer.ipynb