Microsoft accelerates AI race with a series of cheap models

Cát Tiên |

Microsoft announces three new AI models, accelerating multi-modal strategy, directly competing with major competitors in the global artificial intelligence race.

Microsoft has just announced three new platformed artificial intelligence models including MAI-Transcribe-1, MAI-Voice-1 and MAI-Image-2, marking an important step forward in its ambition to build its own multi-mode AI ecosystem.

This is a product of Microsoft AI, a Artificial Intelligence research department led by CEO Mustafa Suleyman, established at the end of 2025.

The launch of these models shows that Microsoft is gradually reducing dependence on partners, while directly competing with big names like OpenAI or Google.

In which, MAI-Transcribe-1 is a voice-to-text model, supporting up to 25 languages and is said to be 2.5 times faster than the current Azure Fast service.

MAI-Voice-1 focuses on creating sound, capable of creating 60 seconds of voice in just one second and allows voice customization according to user needs.

Notably, MAI-Image-2 does not only stop at images but also supports video creation, expanding the application capabilities of AI in content creation.

This model has been tested since March 19 on MAI Playground, Microsoft's new model testing platform – before being uploaded to the Microsoft Foundry ecosystem.

Currently, all three models are available on Microsoft Foundry, while voice-related models are also integrated into MAI Playground for testing and development.

According to Mr. Mustafa Suleyman - CEO of Microsoft AI, Microsoft AI's development philosophy is to put people at the center.

The models are designed to optimize the way people communicate in reality, instead of just focusing on technical performance. He also added that many new models will soon be announced and integrated directly into Microsoft products.

Another notable point is the price strategy. Microsoft said that MAI models are priced lower than many competitors. Specifically, MAI-Transcribe-1 costs from 0.36 USD per hour, MAI-Voice-1 from 22 USD per million characters, and MAI-Image-2 costs from 5 USD per million text input tokens and 33 USD per image output.

In the context of the increasingly competitive large language model market, the cost factor is considered an important advantage to attract businesses and developers.

Despite promoting the development of its own models, Microsoft still affirms its continued close cooperation with OpenAI. The company has invested more than 13 billion USD in this partner and integrated many AI technologies into its product ecosystem.

However, recent adjustments in the cooperation agreement have opened up more space for Microsoft to pursue research on "superintelligence". This shows that the company is pursuing a parallel strategy of both cooperation and technological autonomy.

Cát Tiên
RELATED NEWS

Microsoft upgrades Copilot, increases research power by combining AI tools

|

Microsoft is improving Copilot with better research capabilities thanks to the combination of two artificial intelligence systems of OpenAI and Anthropic.

Microsoft launches Xbox Mode on Windows 11, reveals new generation game console

|

Microsoft introduces Xbox Mode for Windows 11 at GDC 2026 and reveals new game console promising superior performance.

Microsoft prepares for major step forward for Xbox

|

Microsoft reveals new generation Xbox codenamed Project Helix, aiming for high performance and a gaming experience combining console and PC.

Base salary increase: There will be new regulations to handle the balance of the year-end bonus fund

|

The Government will amend regulations on handling the balance of the year-end bonus fund to avoid generating different understandings in implementation when increasing the base salary.

Large fire in the old Nha Trang airport area, suspected of burning electric wires to remove copper core

|

Khanh Hoa - A sudden fire occurred in the area of the old Nha Trang airport (Nha Trang ward), burning up many areas of dry grass and vegetation.

Ho Chi Minh City provides free screening at 64 wards and communes on the occasion of National Health Day

|

Responding to the All People's Health Day April 7, the Ho Chi Minh City Department of Health organized free screening at 64 wards and communes on April 5.

My Wedding", "50 Years Later" and the vocal race between famous singers and AI

|

The success of the songs "My Wedding", "50 Years Later" as well as the explosion of AI singers poses challenges for Vietnamese singers.

Submitting to Ho Chi Minh City People's Council a plan to exempt bus tickets in April

|

Ho Chi Minh City - The plan to exempt buses for people is being finalized to submit to the Ho Chi Minh City People's Council for consideration and approval at the session scheduled to take place at the end of April.

Microsoft upgrades Copilot, increases research power by combining AI tools

QUANG MINH |

Microsoft is improving Copilot with better research capabilities thanks to the combination of two artificial intelligence systems of OpenAI and Anthropic.

Microsoft launches Xbox Mode on Windows 11, reveals new generation game console

Cát Tiên |

Microsoft introduces Xbox Mode for Windows 11 at GDC 2026 and reveals new game console promising superior performance.

Microsoft prepares for major step forward for Xbox

Cát Tiên |

Microsoft reveals new generation Xbox codenamed Project Helix, aiming for high performance and a gaming experience combining console and PC.