Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Mariusj G's picture
17 13

Mariusj G

MariusjG
·

AI & ML interests

None yet

Recent Activity

upvoted an article 15 days ago
From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels
upvoted a paper 15 days ago
Prompt Orchestration Markup Language
upvoted a paper 30 days ago
Attention Heads of Large Language Models: A Survey
View all activity

Organizations

None yet

Collections 1

LLM Papers
  • DeBERTa: Decoding-enhanced BERT with Disentangled Attention

    Paper • 2006.03654 • Published Jun 5, 2020 • 3
  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

    Paper • 1810.04805 • Published Oct 11, 2018 • 21
  • RoBERTa: A Robustly Optimized BERT Pretraining Approach

    Paper • 1907.11692 • Published Jul 26, 2019 • 9
  • Language Models are Few-Shot Learners

    Paper • 2005.14165 • Published May 28, 2020 • 16
LLM Papers
  • DeBERTa: Decoding-enhanced BERT with Disentangled Attention

    Paper • 2006.03654 • Published Jun 5, 2020 • 3
  • BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

    Paper • 1810.04805 • Published Oct 11, 2018 • 21
  • RoBERTa: A Robustly Optimized BERT Pretraining Approach

    Paper • 1907.11692 • Published Jul 26, 2019 • 9
  • Language Models are Few-Shot Learners

    Paper • 2005.14165 • Published May 28, 2020 • 16

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs