Spend just 1 minute daily to get comprehensive updates on AI technology developments, industry trends, and market insights.
The content includes but is not limited to cutting-edge AI news, AI tools, AI painting, open-source projects, and learning tutorials.
Stay tuned to AI Daily for the latest in AI trends. For important information, detailed posts will be made separately.
Here is the latest AI information for July 4.
Cutting-edge News
1. Perplexity releases Pro Search advanced search features.
Provides more complex problem-solving capabilities through multi-step reasoning, significantly enhancing its math and code execution abilities.
Detailed introduction: https://www.perplexity.ai/hub/blog/pro-search-upgraded-for-more-advanced-problem-solving
2. A French AI lab Kyutai will open-source a model comparable to GPT-4o called Moshi.
Moshi, like GPT-4o, is a real-time multimodal model that can listen, speak, and see, and can be interrupted at any time.
From the demo, it appears slightly inferior to GPT-4o but is very close.
Kyutai official website: https://kyutai.org/
Online experience: https://moshi.chat/?queue_id=talktomoshi
Tried it out, currently unusable, possibly due to high user volume. Interested parties can try it. Looking forward to the open-source release.
3. ElevenLabs launches noise reduction feature.
Simply upload audio containing noise to remove the noise and retain only the human voice. Currently free to try.
Use it here: https://elevenlabs.io/voice-isolator
Cutting-edge Technology
1. Kuaishou has come up with something new and fun!
Released an efficient portrait animation framework that can transfer facial expressions from one person to a static person, generating videos with rich expressions.
Supports various styles of images while maintaining character consistency and excellent facial control.
GitHub: https://github.com/KwaiVGI/LivePortrait
Detailed introduction: https://liveportrait.github.io/
Open-source Projects
1. Found a meaningful open-source project on GitHub: "Meet Li Bai".
The project aims to promote and popularize Li Bai's poetry culture by building an AI agent with a Li Bai knowledge graph in a generative dialogue application.
GitHub: https://github.com/BinNong/meet-libai
The ultimate goal is to develop a generative dialogue application to interact with you in real-time, providing a personalized Li Bai poetry appreciation experience.
2. An open-source, free, and powerful image editor: Image ToolBox.
Offers all the image editing functions you can think of, including batch cropping, filters (over 180), text extraction from images, image stitching and overlay, background removal, watermark addition, various format conversions, and more.
GitHub: https://github.com/T8RIN/ImageToolbox
The tool is developed in Kotlin and currently only available for Android.