Devin, SIMA, Figure 01, all in 24 hours. What does it mean, and are AI models taking the wheel? I’ll go through 5 relevant papers and 11 articles to get you all the key details, from what exactly Devin accomplished, and didn’t, to DeepMind's new AGI-attempt-in-3D (SIMA) to just how far AI agents have come and what that means for the future of jobs. There’ll also be a guest star … discussing … me?
AI Insiders [Exclusive videos, Discord, Interviews and More]: https://www.patreon.com/AIExplained
Devin: https://www.cognition-labs.com/blog
Devin YT: https://www.youtube.com/watch?v=V_J-xOeCklQ
SWE-bench: https://arxiv.org/pdf/2310.06770.pdf
Cognition Twitter: https://twitter.com/cognition_labs/with_replies
Reality Check: https://twitter.com/bindureddy/status/1768056098995814836
Karpathy Tweet: https://twitter.com/karpathy/status/1767598414945292695
Bloomberg: https://www.bloomberg.com/news/articles/2024-03-12/cognition-ai-is-a-peter-thiel-backed-coding-assistant?
Chollet Prediction: https://twitter.com/fchollet/status/1767935813646716976
https://magic.dev/
SIMA: https://deepmind.google/discover/blog/sima-generalist-ai-agent-for-3d-virtual-environments/
SIMA Paper: https://storage.googleapis.com/deepmind-media/DeepMind.com/Blog/sima-generalist-ai-agent-for-3d-virtual-environments/Scaling%20Instructable%20Agents%20Across%20Many%20Simulated%20Worlds.pdf
MobileAgent: https://arxiv.org/pdf/2401.16158.pdf
OpenAI Agent: https://www.theinformation.com/articles/openai-shifts-ai-battleground-to-software-that-operates-devices-automates-tasks?rc=sy0ihq
Red Dead Redemption AI: https://arxiv.org/pdf/2403.03186.pdf
RT-X: https://deepmind.google/discover/blog/scaling-up-learning-across-many-different-robot-types/
Figure 01 Hz: https://twitter.com/coreylynch/status/1767928771875868677
MasterPlan: https://www.figure.ai/master-plan
Unit Cost: https://www.cnbc.com/2024/02/29/robot-startup-figure-valued-at-2point6-billion-by-bezos-amazon-nvidia.html
MMMU: https://arxiv.org/pdf/2311.16502.pdf
https://github.com/MMMU-Benchmark/MMMU
Jeff Clune Tweet: https://twitter.com/jeffclune/status/1768320487627579466
Semianalysis: https://www.semianalysis.com/p/ai-datacenter-energy-dilemma-race
Huang AGI Quote: https://www.reuters.com/technology/nvidia-ceo-says-ai-could-pass-human-tests-five-years-2024-03-01/
Altman Quote: https://www.marketingaiinstitute.com/blog/sam-altman-ai-agi-marketing?s=09
US Govt Report: https://twitter.com/jeffclune?ref_src=twsrc%5Egoogle%7Ctwcamp%5Eserp%7Ctwgr%5Eauthor
AI Insiders: https://www.patreon.com/AIExplained