Breaking News in Technology & Business – Tech Geekwire


    Inception Emerges with Diffusion-Based AI Model, Promising Faster Performance and Reduced Costs

    By techgeekwire | February 28, 2025

    Inception Launches Diffusion-Based AI Model to Challenge LLMs

    Inception, a new Palo Alto-based company founded by Stanford computer science professor Stefano Ermon, has announced an AI model built on “diffusion” technology. The company calls it a diffusion-based large language model, or “DLM.” Inception joins a generative AI landscape that currently revolves around two primary model types: large language models (LLMs) and diffusion models.

    LLMs are the workhorses behind text generation. Diffusion models, by contrast, power popular art and media generators like Midjourney and OpenAI’s Sora, and are used mainly to create images, video, and audio. Inception’s DLM attempts to bridge these typically separate types of AI models. According to the company, it offers the capabilities of conventional LLMs, including code generation and question answering, but with significantly faster performance and lower computing costs.

    Ermon told TechCrunch that he has spent considerable time in his Stanford lab investigating how to apply diffusion models to text. His research was driven by the observation that traditional LLMs are slow relative to what diffusion technology makes possible. With LLMs, he said, “you cannot generate the second word until you’ve generated the first one, and you cannot generate the third one until you generate the first two.” This sequential process inherently limits speed.

    Diffusion models work differently. Rather than producing output one piece at a time, they start with a rough representation of the data they are generating, be it an image or a block of text, and then refine it all at once. Ermon hypothesized that diffusion models could generate and modify large blocks of text in parallel.
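
The contrast between the two generation styles can be illustrated with a toy sketch. It is not Inception's algorithm, which the article does not describe: the vocabulary, the `<mask>` placeholder, both function names, and the "fill a share of masked positions per step" rule are all assumptions made for illustration, and random choices stand in for real model calls. The only point is the step count: the sequential loop needs one step per token, while the parallel loop needs a fixed number of refinement passes over the whole block.

```python
import random

# Hypothetical toy vocabulary and placeholder; not taken from any real model.
VOCAB = ["the", "model", "writes", "text", "fast"]
MASK = "<mask>"


def autoregressive_generate(length, rng):
    """Produce one token per step; step i must wait for steps 0..i-1."""
    tokens = []
    steps = 0
    for _ in range(length):
        # Stand-in for a model call conditioned on all tokens so far.
        tokens.append(rng.choice(VOCAB))
        steps += 1
    return tokens, steps


def parallel_refine_generate(length, num_steps, rng):
    """Start from a fully masked block and fill several positions per pass."""
    tokens = [MASK] * length
    steps = 0
    for step in range(num_steps):
        masked = [i for i, tok in enumerate(tokens) if tok == MASK]
        if not masked:
            break
        remaining_steps = num_steps - step
        # Ceiling division: spread the remaining positions over the remaining passes.
        k = -(-len(masked) // remaining_steps)
        # Stand-in for one model call that updates many positions at once.
        for i in rng.sample(masked, k):
            tokens[i] = rng.choice(VOCAB)
        steps += 1
    return tokens, steps


if __name__ == "__main__":
    rng = random.Random(0)
    _, sequential_steps = autoregressive_generate(32, rng)
    _, parallel_steps = parallel_refine_generate(32, 4, rng)
    print(f"sequential steps: {sequential_steps}, parallel passes: {parallel_steps}")
```

In this sketch a 32-token block takes 32 sequential steps but only 4 parallel passes. Whether a real diffusion language model is faster in practice depends on the cost of each pass, which the toy does not model.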

    After years of dedicated research, Ermon and a student achieved a significant breakthrough, which they detailed in a research paper published last year. Recognizing the potential of this advancement, Ermon established Inception last summer, bringing in two former students, UCLA professor Aditya Grover and Cornell professor Volodymyr Kuleshov, to co-lead the company.

    While Ermon declined to disclose the specific funding Inception has received, TechCrunch understands that the Mayfield Fund has invested in the company. Inception has already secured several customers, including unnamed Fortune 100 companies, by addressing their critical need for reduced AI latency and increased speed, Ermon stated.

    “What we found is that our models can leverage the GPUs much more efficiently,” Ermon said, speaking of the computer chips typically used to run models in production. “I think this is a big deal. This is going to change the way people build language models.”

    Inception offers an API, along with on-premises and edge device deployment options, support for model fine-tuning, and a suite of out-of-the-box DLMs for a range of use cases. The company claims its DLMs can run up to 10 times faster than conventional LLMs while costing 10 times less.

    “Our ‘small’ coding model is as good as [OpenAI’s] GPT-4o mini while more than 10 times as fast,” a company spokesperson told TechCrunch. “Our ‘mini’ model outperforms small open-source models like [Meta’s] Llama 3.1 8B and achieves more than 1,000 tokens per second.”

    “Tokens” is industry terminology for the units of raw data that models process. One thousand tokens per second is an impressive speed, assuming Inception’s claims hold up.
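
To put the quoted throughput in perspective, the short calculation below converts tokens per second into wait time for a single response. The 500-token response length is a hypothetical figure, and the 100 tokens-per-second baseline is simply the claimed 1,000 divided by the company's "10 times" figure, not a measured number for any particular model.

```python
def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Time to emit num_tokens at a steady throughput, ignoring startup latency."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


claimed_tps = 1000.0              # throughput quoted by the company spokesperson
baseline_tps = claimed_tps / 10   # assumption: derived from the "10 times" claim
response_tokens = 500             # hypothetical response length

fast = generation_seconds(response_tokens, claimed_tps)
slow = generation_seconds(response_tokens, baseline_tps)
print(f"claimed: {fast} s, assumed baseline: {slow} s")
```

Under these assumptions the response arrives in 0.5 seconds instead of 5 seconds, which is the kind of latency gap the company says its customers care about.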
