Foundational Deep Learning - Architecture - a YedsonUQ Collection

Foundational Deep Learning - Architecture

updated 2 days ago

Forgetting Transformer: Softmax Attention with a Forget Gate

Paper • 2503.02130 • Published 8 days ago • 26

L^2M: Mutual Information Scaling Law for Long-Context Language Modeling

Paper • 2503.04725 • Published 6 days ago • 19