Home > Quick > Body

Microsoft Open Source Multimodal AI Agent - Magma

clock
2025-02-25 22:47:45
At 3 am today, Microsoft open-sourced the basic model of multi-modal AI Agent - Magma on its official website. Compared with traditional agents, Magma has multi-modal capabilities across the digital and physical world, and can automatically process different types of data such as images, videos, and text. For example, you can use Magma to automatically place e-commerce orders and check the weather; you can also automatically operate physical robots, or get help when playing real chess. In addition, Magma can also have built-in psychological prediction functions, which enhances the ability to understand the spatiotemporal dynamics in future video frames, and can accurately predict the intentions and future behaviors of people or objects in the video.
Disclaimer:
1. The information provided does not constitute investment advice. Investors should make independent decisions and bear all risks themselves.
2. The copyright of this content belongs to the original author. The views expressed herein are solely those of the author and do not represent the stance or position of this website.
New Tab Page - Desk3 | Plugin
Stay ahead of the game in the cryptocurrency space.