J Jonah Jellynose suspects Spiderman is an AI. Captain Blubber is arrested twice. A phone screen is smashed. What is happening
0:00 Intro
0:30 Basics
1:30 States, Actions and Rewards
2:45 Discount Factor
4:09 Neural Networks
5:59 PPO
7:03 Policy Gradient
9:54 Clamping the Policy
10:34 What the AI Learned
13:05 Just Swinging
White paper on how to create an AI like this from scratch:
https://docs.google.com/document/d/1FZZvz0JMHKWOOVlXnrmeRMoGpyjqa0m6Q0S2qLECDpA/edit?usp=sharing
Download this AI: https://github.com/b2developer/SpidermanPPO
Discord: https://discord.gg/KgMgeQ7EMP
Reddit: https://www.reddit.com/r/b2studios/
Twitch: https://www.twitch.tv/b2studios
Useful Links:
https://huggingface.co/blog/deep-rl-ppo#the-clipped-part-of-the-clipped-surrogate-objective-function
https://fse.studenttheses.ub.rug.nl/25709/1/mAI_2021_BickD.pdf
https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/
0:00 Intro
0:30 Basics
1:30 States, Actions and Rewards
2:45 Discount Factor
4:09 Neural Networks
5:59 PPO
7:03 Policy Gradient
9:54 Clamping the Policy
10:34 What the AI Learned
13:05 Just Swinging
White paper on how to create an AI like this from scratch:
https://docs.google.com/document/d/1FZZvz0JMHKWOOVlXnrmeRMoGpyjqa0m6Q0S2qLECDpA/edit?usp=sharing
Download this AI: https://github.com/b2developer/SpidermanPPO
Discord: https://discord.gg/KgMgeQ7EMP
Reddit: https://www.reddit.com/r/b2studios/
Twitch: https://www.twitch.tv/b2studios
Useful Links:
https://huggingface.co/blog/deep-rl-ppo#the-clipped-part-of-the-clipped-surrogate-objective-function
https://fse.studenttheses.ub.rug.nl/25709/1/mAI_2021_BickD.pdf
https://iclr-blog-track.github.io/2022/03/25/ppo-implementation-details/
- Category
- Artificial Intelligence
- Tags
- spiderman, spiderman ai, piderman ai
Comments