Close Menu
Breaking News in Technology & Business – Tech Geekwire

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Millions of PCs Face Security Risks as Microsoft Ends Windows 10 Support

    May 10, 2025

    My Complicated Relationship with AI-Generated Photos

    May 10, 2025

    Trump Shares AI-Generated Image of Himself as Pope After Joking About Becoming Pontiff

    May 10, 2025
    Facebook X (Twitter) Instagram
    Breaking News in Technology & Business – Tech GeekwireBreaking News in Technology & Business – Tech Geekwire
    • New
      • Amazon
      • Digital Health Technology
      • Microsoft
      • Startup
    • AI
    • Corporation
    • Crypto
    • Event
    Facebook X (Twitter) Instagram
    Breaking News in Technology & Business – Tech Geekwire
    Home » Microsoft’s Magma AI: A Single Model that Controls Software and Robots
    Microsoft

    Microsoft’s Magma AI: A Single Model that Controls Software and Robots

    techgeekwireBy techgeekwireFebruary 25, 2025No Comments2 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email

    Microsoft Unveils Magma: An AI for the Physical and Digital Worlds

    Microsoft Research has introduced Magma, a new AI model that integrates visual and language processing capabilities. According to a report by Ars Technica, Magma is designed to control both software interfaces and robotic systems. If Magma’s performance surpasses Microsoft’s internal benchmarks, it could represent a major stride in the development of true, versatile multimodal AI capable of interactive operation in the physical and digital realms.

    Microsoft claims Magma is unique because it’s the first AI model that not only processes various data forms, including text, images, and video, but also directly acts on it. This holds true whether the model is navigating a user interface or manipulating physical objects. The project is the result of a collaboration between researchers from Microsoft, KAIST, the University of Maryland, the University of Wisconsin-Madison, and the University of Washington.

    Advancing Beyond Previous AI Systems

    Similar AI-driven robotics projects have been developed previously. Examples include Google’s PaLM-E and RT-2, as well as Microsoft’s own ChatGPT for Robotics. These projects often use large language models (LLMs) as interfaces. However, unlike many prior multimodal AI systems, which rely on separate models for perception and control, Magma consolidates these capabilities into a single base model.

    Magma as a Step Toward Agentic AI

    Microsoft is positioning Magma as an advance toward ‘agentic AI.’ This type of AI system is designed to autonomously create plans and perform complex tasks on behalf of a human, rather than simply responding to queries about its environment. In its research report, Microsoft indicates that Magma can formulate plans and take actions, enabling the AI to achieve a user’s specified objective.

    Microsoft is not alone in this pursuit. OpenAI is also exploring agentic AI, with projects such as Operator, which can perform UI tasks within a web browser. Google has several agentic AI projects as well, including Gemini 2.0.

    A Truly Multimodal Agent

    Magma builds on transformer-based LLM technology, training a neural network on extensive data. However, it distinguishes itself from traditional language models like GPT-4V by integrating ‘spatial intelligence’ in addition to verbal intelligence. Microsoft asserts that its training data, a collection of images, videos, robotics data, and UI interactions, has allowed Magma to become a truly multimodal agent, not just a perceptual model.

    Agentic AI AI Magma Microsoft Multimodal AI robotics
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    techgeekwire
    • Website

    Related Posts

    Millions of PCs Face Security Risks as Microsoft Ends Windows 10 Support

    May 10, 2025

    My Complicated Relationship with AI-Generated Photos

    May 10, 2025

    Trump Shares AI-Generated Image of Himself as Pope After Joking About Becoming Pontiff

    May 10, 2025

    Google to Make Gemini AI Chatbot Available to Children Under 13

    May 10, 2025

    The Evolving Role of Teachers in AI-Driven Classrooms

    May 10, 2025

    Amazon’s Alexa+ Voice Assistant Reaches 100,000 Users as Rollout Continues

    May 10, 2025
    Leave A Reply Cancel Reply

    Top Reviews
    Editors Picks

    Millions of PCs Face Security Risks as Microsoft Ends Windows 10 Support

    May 10, 2025

    My Complicated Relationship with AI-Generated Photos

    May 10, 2025

    Trump Shares AI-Generated Image of Himself as Pope After Joking About Becoming Pontiff

    May 10, 2025

    Google to Make Gemini AI Chatbot Available to Children Under 13

    May 10, 2025
    Advertisement
    Demo
    About Us
    About Us

    A rich source of news about the latest technologies in the world. Compiled in the most detailed and accurate manner in the fastest way globally. Please follow us to receive the earliest notification

    We're accepting new partnerships right now.

    Email Us: info@example.com
    Contact: +1-320-0123-451

    Our Picks

    Millions of PCs Face Security Risks as Microsoft Ends Windows 10 Support

    May 10, 2025

    My Complicated Relationship with AI-Generated Photos

    May 10, 2025

    Trump Shares AI-Generated Image of Himself as Pope After Joking About Becoming Pontiff

    May 10, 2025
    Categories
    • AI (1,957)
    • Amazon (793)
    • Corporation (745)
    • Crypto (873)
    • Digital Health Technology (784)
    • Event (416)
    • Microsoft (945)
    • New (7,027)
    • Startup (814)
    © 2025 TechGeekWire. Designed by TechGeekWire.
    • Home

    Type above and press Enter to search. Press Esc to cancel.