Close Menu
Breaking News in Technology & Business – Tech Geekwire

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    IEEE Spectrum: Flagship Publication of the IEEE

    July 4, 2025

    GOP Opposition Mounts Against AI Provision in Reconciliation Bill

    July 4, 2025

    Navigation Help

    July 4, 2025
    Facebook X (Twitter) Instagram
    Breaking News in Technology & Business – Tech GeekwireBreaking News in Technology & Business – Tech Geekwire
    • New
      • Amazon
      • Digital Health Technology
      • Microsoft
      • Startup
    • AI
    • Corporation
    • Crypto
    • Event
    Facebook X (Twitter) Instagram
    Breaking News in Technology & Business – Tech Geekwire
    Home » MIT Researchers Develop AI Tool to Generate High-Quality Images Faster
    AI

    MIT Researchers Develop AI Tool to Generate High-Quality Images Faster

    techgeekwireBy techgeekwireMarch 28, 2025No Comments5 Mins Read
    Facebook Twitter Pinterest Telegram LinkedIn Tumblr WhatsApp Email
    Share
    Facebook Twitter LinkedIn Pinterest Telegram Email

    MIT News

    AI Tool Generates High-Quality Images Faster

    By Adam Zewe, MIT News

    Researchers from MIT and NVIDIA have developed a new artificial intelligence tool capable of generating high-quality images more quickly and efficiently than existing methods. This innovative approach combines the strengths of two popular AI model types, promising advancements in various fields.

    An image being put together by two sets of tweezers; one branded as autoregressive and the other as diffusion
    An image being put together by two sets of tweezers; one branded as autoregressive and the other as diffusion

    Researchers combined two types of generative AI models, an autoregressive model and a diffusion model, to create a tool that leverages the best of each model to rapidly generate high-quality images. Credit: Christine Daniloff, MIT; image of astronaut on horseback courtesy of the researchers

    Four AI-generated images of an astronaut riding a horse
    Four AI-generated images of an astronaut riding a horse

    The new image generator, called HART (short for Hybrid Autoregressive Transformer), can generate images that match or exceed the quality of state-of-the-art diffusion models, but do so about nine times faster. Credit: Courtesy of the researchers

    This new tool, called HART (Hybrid Autoregressive Transformer), can generate images that match or exceed the quality of state-of-the-art diffusion models. However, it operates about nine times faster and consumes fewer computational resources. This allows HART to run locally on a standard laptop or smartphone. Users simply enter a natural language prompt into HART to generate an image.

    “If you are painting a landscape, and you just paint the entire canvas once, it might not look very good. But if you paint the big picture and then refine the image with smaller brush strokes, your painting could look a lot better. That is the basic idea with HART,” says Haotian Tang SM ’22, PhD ’25, co-lead author of a new paper on HART.

    The Challenge of Image Generation

    The ability to generate high-quality images quickly is crucial for producing realistic simulated environments. These environments can be used to train self-driving cars to avoid unpredictable hazards, enhancing their safety on real streets. However, generative AI techniques currently used for this purpose have limitations. Diffusion models can create very realistic images, but they are slow and computationally intensive.

    “If you are painting a landscape, and you just paint the entire canvas once, it might not look very good. But if you paint the big picture and then refine the image with smaller brush strokes, your painting could look a lot better. That is the basic idea with HART,” according to Haotian Tang. On the other hand, autoregressive models, like those used in LLMs such as ChatGPT, are much faster but often produce lower-quality images.

    A Hybrid Approach: HART

    To overcome these limitations, the MIT and NVIDIA researchers combined the best features of both methods. Their hybrid image-generation tool uses an autoregressive model to quickly capture the broad outlines of an image and then a small diffusion model to refine the details.

    This approach allows HART to generate images with quality that matches or surpasses state-of-the-art diffusion models, but about nine times faster. Moreover, the generation process requires fewer computational resources than typical diffusion models. As a result, HART can run locally on a commercial laptop or smartphone. Users only need to enter a natural language prompt into the HART interface to generate an image.

    Applications and Impact

    HART has the potential to be applied in various fields. It could assist researchers training robots to perform complex real-world tasks and aid designers in creating striking scenes for video games.

    HART’s efficiency and compatibility with existing models open up exciting possibilities. For example, it can be integrated with unified vision-language generative models. In the future, users may be able to interact with these models by asking them to show the intermediate steps involved in actions like assembling furniture. Tang indicates that, efficient image-generation models could unlock many possibilities.

    Technical Details

    Popular diffusion models, such as Stable Diffusion and DALL-E, are known for producing highly detailed images, generating them through iterative steps of predicting and subtracting noise across each pixel. However, since these models process all pixels at each step, the process is slow and computationally expensive. On the other hand, autoregressive models predict patches of an image sequentially but are faster. They use tokens to make predictions; an autoregressive model uses an autoencoder to compress raw image pixels into discrete tokens and reconstruct the image from predictions. With HART, the researchers developed a hybrid approach that predicts compressed, discrete image tokens using an autoregressive model, and then uses a small diffusion model to predict residual tokens to capture high-frequency image details.

    “We can achieve a huge boost in terms of reconstruction quality. Our residual tokens learn high-frequency details, like edges of an object, or a person’s hair, eyes, or mouth. These are places where discrete tokens can make mistakes,” says Tang. The use of the small diffusion model as the final step of the process allows HART to maintain the speed advantages of autoregressive models.

    HART’s method uses an autoregressive transformer model with 700 million parameters. However, it generates images comparable quality to larger diffusion model and does so nine times faster. The model uses about 31 percent less computation than state-of-the-art models.

    Funding and Collaboration

    This research was supported, in part, by the MIT-IBM Watson AI Lab, the MIT and Amazon Science Hub, the MIT AI Hardware Program, and the U.S. National Science Foundation. NVIDIA provided the GPU infrastructure for training the model.

    Originally published March 21, 2025

    AI computer vision HART image generation Machine Learning MIT Nvidia
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    techgeekwire
    • Website

    Related Posts

    IEEE Spectrum: Flagship Publication of the IEEE

    July 4, 2025

    GOP Opposition Mounts Against AI Provision in Reconciliation Bill

    July 4, 2025

    Navigation Help

    July 4, 2025

    Andreessen Horowitz Backs Controversial Startup Cluely Despite ‘Rage-Bait’ Marketing

    July 4, 2025

    Invesco QQQ ETF Hits All-Time High as Tech Stocks Continue to Soar

    July 4, 2025

    ContractPodAi Partners with Microsoft to Advance Legal AI Automation

    July 4, 2025
    Leave A Reply Cancel Reply

    Top Reviews
    Editors Picks

    IEEE Spectrum: Flagship Publication of the IEEE

    July 4, 2025

    GOP Opposition Mounts Against AI Provision in Reconciliation Bill

    July 4, 2025

    Navigation Help

    July 4, 2025

    Andreessen Horowitz Backs Controversial Startup Cluely Despite ‘Rage-Bait’ Marketing

    July 4, 2025
    Advertisement
    Demo
    About Us
    About Us

    A rich source of news about the latest technologies in the world. Compiled in the most detailed and accurate manner in the fastest way globally. Please follow us to receive the earliest notification

    We're accepting new partnerships right now.

    Email Us: info@example.com
    Contact: +1-320-0123-451

    Our Picks

    IEEE Spectrum: Flagship Publication of the IEEE

    July 4, 2025

    GOP Opposition Mounts Against AI Provision in Reconciliation Bill

    July 4, 2025

    Navigation Help

    July 4, 2025
    Categories
    • AI (2,696)
    • Amazon (1,056)
    • Corporation (990)
    • Crypto (1,130)
    • Digital Health Technology (1,079)
    • Event (523)
    • Microsoft (1,230)
    • New (9,568)
    • Startup (1,164)
    © 2025 TechGeekWire. Designed by TechGeekWire.
    • Home

    Type above and press Enter to search. Press Esc to cancel.