Collections
Discover the best community collections!
Collections including paper arxiv:2312.00763
-
Measuring the Effects of Data Parallelism on Neural Network Training
Paper • 1811.03600 • Published • 2 -
Adafactor: Adaptive Learning Rates with Sublinear Memory Cost
Paper • 1804.04235 • Published • 2 -
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Paper • 1905.11946 • Published • 3 -
Yi: Open Foundation Models by 01.AI
Paper • 2403.04652 • Published • 62
-
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment
Paper • 2401.12474 • Published • 36 -
More Agents Is All You Need
Paper • 2402.05120 • Published • 53 -
VideoAgent: Long-form Video Understanding with Large Language Model as Agent
Paper • 2403.10517 • Published • 33 -
Octopus v4: Graph of language models
Paper • 2404.19296 • Published • 117
-
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper • 2312.04474 • Published • 31 -
Training Chain-of-Thought via Latent-Variable Inference
Paper • 2312.02179 • Published • 9 -
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Paper • 2312.01552 • Published • 31 -
AppAgent: Multimodal Agents as Smartphone Users
Paper • 2312.13771 • Published • 53