fangtongen
AI & ML interests
None yet
Recent Activity
liked a model about 1 month ago
xiaolv/ocr-captcha
liked a model about 1 month ago
openai/whisper-small
liked a model about 1 month ago
Systran/faster-whisper-large-v3
Organizations
None yet
Text to Image
- SDXL: Improving Latent Diffusion Models for High-Resolution Image Synthesis
  Paper • 2307.01952 • Published • 92
- eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers
  Paper • 2211.01324 • Published • 4
- SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations
  Paper • 2108.01073 • Published • 9
- High-Resolution Image Synthesis with Latent Diffusion Models
  Paper • 2112.10752 • Published • 17
LLM
- The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only
  Paper • 2306.01116 • Published • 43
- FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
  Paper • 2205.14135 • Published • 15
- RoFormer: Enhanced Transformer with Rotary Position Embedding
  Paper • 2104.09864 • Published • 17
- Language Models are Few-Shot Learners
  Paper • 2005.14165 • Published • 20
models 0
None public yet
datasets 0
None public yet