November 12

Today's featured cutting-edge AI information, welcome to read 👇

💻 Alibaba releases new generation programming model Qwen2.5-Coder, offering 4 parameter scales, 32B version performance comparable to GPT-4, supporting Cursor integration and visual Artifacts.

🤖 OpenHands multi-agent platform turns AI into programmers, capable of executing code modifications, command running, API calls, and other development tasks, providing Docker deployment.

🎙️ Voice-Pro open-source tool integrates speech-to-text, translation, and TTS functions, supporting 100+ languages, providing noise reduction and subtitle export features.

Latest News

1. Alibaba releases latest programming model: Qwen2.5-Coder full series.

It comes with four different parameters: 0.5B, 3B, 14B, and 32B, all of which can be integrated with Cursor and feature the ability to generate visual Artifacts panels similar to Claude.

Detailed introduction: https://qwenlm.github.io/zh/blog/qwen2.5-coder-family/

Model download: https://huggingface.co/collections/Qwen/qwen25-coder-66eaa22e6f99801bf65b0c2f

The 32B parameter model rivals GPT-4 in multiple benchmark test scores, demonstrating powerful and comprehensive coding and mathematical capabilities.

Cutting-edge Technology

1. An AI-driven software development multi-agent platform: OpenHands, claiming to replace "programmers".

Composed of multiple agents forming an "AI programmer," capable of executing various real-world development tasks, including code modification, command execution, web browsing, API calls, and even copying code snippets from StackOverflow.

GitHub: https://github.com/All-Hands-AI/OpenHands

The project provides detailed installation and deployment tutorials, with quick start via Docker. It's somewhat similar to an open-source version of Cursor or Bolt.

Open Source Projects

1. An open-source tool integrating transcription, translation, and text-to-speech: Voice-Pro.

Provides a clean and intuitive visual interface, one-click installation, supports real-time transcription and translation, as well as batch processing mode.

GitHub: https://github.com/abus-aikorea/voice-pro

Main features:

Provides integrated environment for YouTube downloader, noise removal, subtitles, translation, and TTS
Supports speech recognition and text-to-speech in over 100 languages
Supports speech-to-text using Whisper, Faster-Whisper, etc.
Supports TTS voice speed, volume, and pitch adjustment
Provides word-level highlighting and noise reduction features
Supports export of multiple subtitle file formats, such as ass, srt, tmp, ssa, etc.
Supports output of multiple audio formats, such as wav, flac, mp3, etc.

Latest News ​

Cutting-edge Technology ​

Open Source Projects ​

Latest News

Cutting-edge Technology

Open Source Projects