Deep RL Course - Hugging Face
arhamk/a2c-PandaReachDense-v2 Reinforcement Learning • Updated Oct 11, 2023 • 1
arhamk/ppo-Huggy Reinforcement Learning • Updated Aug 2, 2023 • 12
arhamk/q-FrozenLake-v1-4x4-noSlippery Reinforcement Learning • Updated Aug 2, 2023
arhamk/q-Taxi-v3 Reinforcement Learning • Updated Aug 5, 2023
Recent Models
arhamk/llama2-qlora-sft Updated Oct 11, 2023 • 1
arhamk/llama2-finance-sft Text Generation • Updated Sep 6, 2023