AI & ML interests
None yet
Organizations
guydebruyn/InstructionFollowing_SFT_V1.4
Text Generation
•
0.5B
•
Updated
•
9
guydebruyn/MathReasoning_SFT_V1.3
Text Generation
•
0.5B
•
Updated
•
7
guydebruyn/InstructionFollowing_SFT_V1.3
Text Generation
•
0.5B
•
Updated
•
8
guydebruyn/MathReasoning_DPO_V1.2
Text Generation
•
0.5B
•
Updated
•
7
guydebruyn/MathReasoning_SFT_V1.2
Text Generation
•
0.5B
•
Updated
•
8
guydebruyn/MathReasoning_SFT_V1.1
Text Generation
•
0.5B
•
Updated
•
11
guydebruyn/MathReasoning_SFT_v1.0
Text Generation
•
0.5B
•
Updated
•
10
guydebruyn/InstructionFollowing_DPO_V1.1
Text Generation
•
0.5B
•
Updated
•
7
guydebruyn/InstructionFollowing_SFT_V1.2
Text Generation
•
0.5B
•
Updated
•
6
guydebruyn/InstructionFollowing_SFT_v1.0
Text Generation
•
0.5B
•
Updated
•
9
guydebruyn/bert-finetuned-squad
Question Answering
•
Updated
•
6
Text Generation
•
Updated
•
6
guydebruyn/marian-finetuned-kde4-en-to-fr
Translation
•
Updated
•
15
guydebruyn/distilbert-base-uncased-finetuned-imdb
Fill-Mask
•
Updated
•
9
guydebruyn/bert-finetuned-ner
Token Classification
•
Updated
•
9
guydebruyn/code-search-net-tokenizer
Updated
Fill-Mask
•
Updated
•
6
guydebruyn/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
guydebruyn/ppo-CartPole-v2
Reinforcement Learning
•
Updated
guydebruyn/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
•
2
guydebruyn/Reinforce-Copter3
Reinforcement Learning
•
Updated
guydebruyn/Reinforce-Copter2
Reinforcement Learning
•
Updated
guydebruyn/ppo-PyramidsTraining
Reinforcement Learning
•
Updated
•
15
guydebruyn/ppo-SnowballTarget
Reinforcement Learning
•
Updated
•
14
guydebruyn/Reinforce-Copter
Reinforcement Learning
•
Updated
guydebruyn/Reinforce-PoleCart1
Reinforcement Learning
•
Updated
guydebruyn/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
•
4
Reinforcement Learning
•
Updated
guydebruyn/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
guydebruyn/ppo-LunarLander-v2-2
Reinforcement Learning
•
Updated
•
2