INSANE AI news: OpenAI o3-mini, DeepSeek Janus-Pro, Qwen2.5-Max, Riffusion FUZZ, YuE AI music generator, Doubao 1.5 Pro, Google Daily Listen, Tulu 3 #ai #aitools #ainews #agi
Thanks to our sponsor, AI Portrait. Make professional headshots in seconds! https://aiportrait.me/
Sources in order of mention:
https://research.nvidia.com/labs/toronto-ai/DiffusionRenderer/
https://github.com/Alpha-VLLM/Lumina-Image-2.0
https://chenguolin.github.io/projects/DiffSplat/
https://map-yue.github.io/
https://riffusion.com/
https://github.com/deepseek-ai/Janus
https://hailuoai.video/
https://chat.qwenlm.ai/
https://qwenlm.github.io/blog/qwen2.5-max/
https://qwenlm.github.io/blog/qwen2.5-vl/
https://huggingface.co/spaces/wjbmattingly/caracal
https://qwenlm.github.io/blog/qwen2.5-1m/
https://team.doubao.com/zh/special/doubao_1_5_pro
https://labs.google.com/
https://allenai.org/blog/tulu-3-405B
0:00 Intro
0:34 Diffusion Renderer
3:42 Lumina Image 2.0
6:59 DiffSplat 3D models
9:20 YuE AI music generator
16:44 Riffusion FUZZ
20:17 DeepSeek Janus-Pro
22:50 Hailuo Director
24:03 Alibaba Wanx video generator
24:49 AI Portrait
25:42 Qwen2.5-Max
28:50 Qwen2.5-VL open source vision model
32:02 Caracal text recognition
33:40 Qwen2.5-1M huge context window
35:36 Bytedance Doubao 1.5 Pro
36:27 OpenAI o3-mini release
40:27 Google Daily Listen
41:39 Tulu 3 open source AI model
Newsletter: https://aisearch.substack.com/
Find AI tools & jobs: https://ai-search.io/
Support: https://ko-fi.com/aisearch
Here's my equipment, in case you're wondering:
Dell Precision 5690: https://www.dell.com/en-us/dt/ai-technologies/index.htm?utm_source=AISearchTools&utm_medium=youtube&utm_campaign=precisionai#tab0=0
GPU: Nvidia RTX 5000 Ada https://nvda.ws/3zfqGqS
Mouse/Keyboard: ALOGIC Echelon https://bit.ly/alogic-echelon
Mic: Shure SM7B https://amzn.to/3DErjt1
Audio interface: Scarlett Solo https://amzn.to/3qELMeu
Thanks to our sponsor, AI Portrait. Make professional headshots in seconds! https://aiportrait.me/
Sources in order of mention:
https://research.nvidia.com/labs/toronto-ai/DiffusionRenderer/
https://github.com/Alpha-VLLM/Lumina-Image-2.0
https://chenguolin.github.io/projects/DiffSplat/
https://map-yue.github.io/
https://riffusion.com/
https://github.com/deepseek-ai/Janus
https://hailuoai.video/
https://chat.qwenlm.ai/
https://qwenlm.github.io/blog/qwen2.5-max/
https://qwenlm.github.io/blog/qwen2.5-vl/
https://huggingface.co/spaces/wjbmattingly/caracal
https://qwenlm.github.io/blog/qwen2.5-1m/
https://team.doubao.com/zh/special/doubao_1_5_pro
https://labs.google.com/
https://allenai.org/blog/tulu-3-405B
0:00 Intro
0:34 Diffusion Renderer
3:42 Lumina Image 2.0
6:59 DiffSplat 3D models
9:20 YuE AI music generator
16:44 Riffusion FUZZ
20:17 DeepSeek Janus-Pro
22:50 Hailuo Director
24:03 Alibaba Wanx video generator
24:49 AI Portrait
25:42 Qwen2.5-Max
28:50 Qwen2.5-VL open source vision model
32:02 Caracal text recognition
33:40 Qwen2.5-1M huge context window
35:36 Bytedance Doubao 1.5 Pro
36:27 OpenAI o3-mini release
40:27 Google Daily Listen
41:39 Tulu 3 open source AI model
Newsletter: https://aisearch.substack.com/
Find AI tools & jobs: https://ai-search.io/
Support: https://ko-fi.com/aisearch
Here's my equipment, in case you're wondering:
Dell Precision 5690: https://www.dell.com/en-us/dt/ai-technologies/index.htm?utm_source=AISearchTools&utm_medium=youtube&utm_campaign=precisionai#tab0=0
GPU: Nvidia RTX 5000 Ada https://nvda.ws/3zfqGqS
Mouse/Keyboard: ALOGIC Echelon https://bit.ly/alogic-echelon
Mic: Shure SM7B https://amzn.to/3DErjt1
Audio interface: Scarlett Solo https://amzn.to/3qELMeu
- Category
- AI prompts
Comments