Spend one minute each day to get the latest AI news.
The content includes but is not limited to cutting-edge AI news, AI tools, AI art, open-source projects, and learning tutorials, etc.
Follow AI Daily to stay up to date with AI trends. For important information, detailed posts will be made separately.
Here is the latest AI information for July 31.
Cutting-edge News
1. OpenAI releases GPT-4o long output version.
Each request can output up to 64K tokens, and API users can access it through the model name "gpt-4o-64k-output-alpha."
In terms of pricing, it costs $6 per million input tokens and $18 per million output tokens.
Detailed announcement: https://openai.com/gpt-4o-long-output/
2. ByteDance enters the AI music track with a product called "Sponge Music."
They directly launched the official website and released a lot of music created by AI. After listening, the effect is very good, with no noise in the vocals, perfectly supporting Chinese, and it feels comparable to Suno!
Official website: https://www.haimian.com
Currently, you can create music for free on the site by simply describing the type of song you want to generate, and the AI will create the lyrics with one click. The generation speed is very fast.
If you're interested, you can try it out.
3. OpenAI announces the rollout of advanced voice features for GPT-4o!
The advanced voice mode of GPT-4o can provide more natural, real-time conversations, allowing you to interrupt at any time and can perceive and respond to your emotions.
However, it is only being pushed to some Plus users, and it will be fully rolled out to all Plus users by fall.
If you are a GPT Plus user, update to the latest version and check if you are the lucky one.
Cutting-edge Technology
1. Meta AI releases Segment Anything Model 2 (SAM 2).
It can perform real-time detection and segmentation of objects in both images and videos. With this technology, moving objects in videos can be quickly replaced!
GitHub: https://github.com/facebookresearch/segment-anything-2
Detailed introduction: https://ai.meta.com/blog/segment-anything-2/
Open-source Projects
1. An AI tool called ai-renamer that automatically batch renames files based on content.
Built on Node.js, it is a command-line tool that uses Ollama's local models (such as Gemma, Llama, etc.) by default to intelligently recognize files, images, or videos in a specified local directory and automatically batch rename them based on their content.
GitHub: https://github.com/ozgrozer/ai-renamer
It also offers a variety of customizable parameters such as language, custom prompts, file name length, models, etc.
Learning Books
1. An open-source and free online English grammar learning book: "Grammar Club."
The book is organized progressively, from simple sentences in the beginner section, through complex sentences in the intermediate section, to simplified subordinate clauses in the advanced section. It is divided into three main sections and further subdivided into twenty-two chapters.
GitHub: https://github.com/llwslc/grammar-club
Online reading: https://llwslc.github.io/grammar-club/content/Introduction.html
Briefly summarize the content:
Part One: Introduces the basic sentence structures of simple sentences and their various components, including nouns, verbs, adjectives, adverbs, etc., and explores infinitive phrases, gerunds, and participles.
Part Two: Discusses compound and complex sentences, emphasizing how to combine multiple simple sentences into complex ones through conjunctions.
Part Three: Introduces simplified subordinate clauses, based on compound and complex sentences, simplifying complex sentences into concise advanced sentence structures, making expressions more accurate and concise.
By reading this book, it is hoped that you can improve your English skills, build your confidence in English, and enhance your interest in reading English.