Spend 1 minute a day to get the latest cutting-edge AI information.
The content covers but is not limited to cutting-edge AI news, AI tools, AI painting, open-source projects, and learning tutorials, etc.
Follow AI Daily to keep up with AI trends. I hope it helps you. Important information will be posted separately for detailed introduction.
Here is the latest AI information for July 27.
Open-source Projects
1. A one-stop, open-source, high-quality data extraction tool: MinerU.
It can convert PDFs, web pages, and multi-format eBooks to Markdown format, not only extracting images and tables but also converting formulas to LaTex.
GitHub: https://github.com/opendatalab/MinerU
Main features:
- Supports multiple front-end model inputs;
- Removes elements such as headers, footers, footnotes, and page numbers;
- Human-readable layout format;
- Retains the structure and format of the original document, including titles, paragraphs, lists, etc.;
- Extracts images and tables and displays them in markdown;
- Converts formulas to LaTex;
- Automatically recognizes and converts garbled PDFs;
- Web page extraction, accurately parsing text, images, tables, and formulas across modalities;
- Extracts eBook literature, supporting formats like epub, mobi, with full adaptation of text and images;
- Language type identification, supporting accurate recognition of 176 languages.
The tool supports installation and use on Windows, Linux, and Mac systems. Those in need might as well give it a try.
Learning Courses
1. The professional course "Machine Learning" taught by Andrew Ng is now open for free!
It mainly includes three courses: "Machine Learning: Regression and Classification," "Advanced Learning Algorithms," and "Unsupervised Learning, Recommenders, Reinforcement Learning."
It covers various fields such as supervised learning, unsupervised learning, neural networks, decision trees, and recommendation systems.
Learning address: https://www.coursera.org/specializations/machine-learning-introduction
Through this course, you can master the basic knowledge and practical skills of machine learning, suitable for beginners and professionals who wish to develop in the field of artificial intelligence.