Microsoft introduces Magma, a novel foundational model that bridges the gap between language and visual data, enabling AI agents to perform complex tasks in both digital and physical environments.
Microsoft Research unveils Magma, a new integrated AI model that combines visual and language processing to control software interfaces and robotic systems, potentially representing a significant advancement in versatile multimodal AI.