Deepgram Launches Nova-3 Medical: An AI Solution for Healthcare Transcription
Deepgram has introduced Nova-3 Medical, a new artificial intelligence (AI) speech-to-text (STT) model specifically designed for the demanding environment of healthcare. This innovative model promises to integrate seamlessly into existing clinical workflows, addressing the crucial need for accurate and efficient transcription within both the UK’s National Health Service (NHS) and private healthcare systems.

With the increasing adoption of electronic health records (EHRs), telemedicine, and digital health platforms, the demand for reliable AI-powered transcription has never been greater. Traditional speech-to-text models often struggle with the specialized vocabulary and complex terminology used in clinical settings. This can lead to errors and what are sometimes called “hallucinations,” potentially compromising patient care.
Deepgram’s Nova-3 Medical is engineered to overcome these challenges. It combines advanced machine learning with training on specialized medical vocabulary to capture medical terms, acronyms, and clinical jargon accurately, even in less-than-ideal audio conditions. This is particularly beneficial in environments where healthcare professionals may move away from a recording device.
“Nova-3 Medical represents a significant leap forward in our commitment to transforming clinical documentation through AI,” said Scott Stephenson, CEO of Deepgram. “By addressing the nuances of clinical language and offering unprecedented customization, we are empowering developers to build products that improve patient care and operational efficiency.”
Streamlined Workflows and Customization
A key feature of the model is its ability to deliver structured transcriptions that integrate seamlessly with clinical workflows and EHR systems. This ensures that vital patient data is accurately organized and readily accessible. Moreover, the model offers flexible, self-service customization, including Keyterm Prompting for up to 100 key terms. This allows developers to tailor the solution to the unique needs of various medical specialties.
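As a rough illustration of how a developer might pass specialty-specific key terms with a transcription request, the sketch below builds a request URL with one query parameter per term and enforces the 100-term limit mentioned above. The endpoint URL, the `nova-3-medical` model identifier, and the `keyterm` parameter name are assumptions that do not come from this article and should be checked against Deepgram's API documentation; `build_listen_url` is a hypothetical helper.

```python
from urllib.parse import urlencode

MAX_KEYTERMS = 100  # limit stated in the article


def build_listen_url(keyterms, model="nova-3-medical",
                     base="https://api.deepgram.com/v1/listen"):
    """Build a request URL with one `keyterm` query parameter per term.

    The base URL, model name, and parameter name are assumptions,
    not details confirmed by the article.
    """
    # Drop empty entries and surrounding whitespace.
    terms = [t.strip() for t in keyterms if t.strip()]
    if len(terms) > MAX_KEYTERMS:
        raise ValueError(
            f"at most {MAX_KEYTERMS} key terms allowed, got {len(terms)}"
        )
    params = [("model", model)] + [("keyterm", t) for t in terms]
    return f"{base}?{urlencode(params)}"


# Example: a cardiology-flavoured term list.
url = build_listen_url(["metoprolol", "atrial fibrillation"])
print(url)
```

Keeping the term list in one place per specialty (cardiology, oncology, and so on) would let an application swap vocabularies without changing the rest of the request.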
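Deployment options include on-premises and Virtual Private Cloud (VPC) configurations, which Deepgram says provide enterprise-grade security and HIPAA compliance. Because HIPAA is a US standard, these options are also positioned as a way for UK organizations to keep patient data within environments that satisfy UK data protection regulations.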
“Speech-to-text for enterprise use cases is not trivial, and there is a fundamental difference between voice AI platforms designed for enterprise use cases vs entertainment use cases,” said Kevin Fredrick, Managing Partner at OneReach.ai. “Deepgram’s Nova-3 model and Nova-3-Medical model, are leading voice AI offerings, including TTS, in terms of the accuracy, latency, efficiency, and scalability required for enterprise use cases.”
Benchmarking Results: Accuracy, Speed, and Cost
Deepgram has conducted extensive benchmarking of Nova-3 Medical, which the company claims delivers industry-leading transcription accuracy. According to Deepgram, the model improves both overall word recognition and, vitally, the accuracy of critical medical terms.
- Word Error Rate (WER): With a median WER of 3.45%, Nova-3 Medical outperforms its competitors, achieving a 63.6% reduction in errors compared to the next best competitor. This improvement minimizes manual corrections, streamlining workflows for healthcare providers.
- Keyword Error Rate (KER): Nova-3 Medical achieves a KER of 6.79%, marking a 40.35% reduction in errors compared to the next best competitor. Accuracy on these keywords is critical because essential medical terms, such as drug names and conditions, must be transcribed correctly to reduce the risk of miscommunication and improve patient safety.
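To make the two metrics more concrete, the sketch below computes a word error rate using the standard definition (word-level edit distance divided by the number of reference words) and back-calculates the competitor error rates implied by the figures above. The article does not describe Deepgram's benchmark methodology, so the lowercasing and whitespace tokenization here are assumptions, as is reading "reduction in errors" as a relative reduction in the error rate. The sample sentences are invented for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)


def implied_baseline(rate: float, relative_reduction: float) -> float:
    """Error rate a competitor would need for `rate` to be the stated reduction."""
    return rate / (1.0 - relative_reduction)


ref = "patient was started on metoprolol for atrial fibrillation"
hyp = "patient was started on metoprolol for a trial fibrillation"
wer = word_error_rate(ref, hyp)  # one substitution + one insertion over 8 words

competitor_wer = implied_baseline(3.45, 0.636)   # roughly 9.5%
competitor_ker = implied_baseline(6.79, 0.4035)  # roughly 11.4%
print(f"WER={wer:.2%}, implied competitor WER={competitor_wer:.2f}%, "
      f"implied competitor KER={competitor_ker:.2f}%")
```

Under these assumptions, the reported figures imply a next-best competitor WER of about 9.5% and KER of about 11.4%. A keyword error rate would apply the same kind of counting to a chosen list of medical terms only; the exact definition Deepgram uses is not given here.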
Beyond accuracy, Nova-3 Medical is also built for real-time applications. According to Deepgram, the model transcribes speech 5-40 times faster than many alternative speech recognition vendors, making it well suited to telemedicine and digital health platforms. Its scalable architecture is designed to maintain high performance even as transcription volumes increase.
Finally, Nova-3 Medical is also designed to be cost-effective. Pricing starts at $0.0077 per minute of streaming audio, which Deepgram claims is less than half the price of leading cloud providers. The company says this enables healthcare tech companies to reinvest in innovation and accelerate product development.
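For a sense of scale, the short sketch below turns the quoted per-minute starting price into a monthly estimate. The workload (10,000 consultations of 15 minutes each) is a hypothetical example, and real pricing may depend on plan and volume.

```python
from decimal import Decimal

PRICE_PER_MINUTE = Decimal("0.0077")  # starting streaming price quoted in the article


def streaming_cost(minutes) -> Decimal:
    """Cost in US dollars for the given number of streamed audio minutes."""
    return PRICE_PER_MINUTE * Decimal(str(minutes))


# Hypothetical workload: 10,000 consultations of 15 minutes each per month.
consultations = 10_000
minutes_each = 15
monthly_cost = streaming_cost(consultations * minutes_each)
hourly_cost = streaming_cost(60)
print(f"per hour: ${hourly_cost:.3f}, per month: ${monthly_cost:.2f}")
```

At the quoted rate, an hour of streaming audio costs about $0.46, and the hypothetical 150,000 minutes per month come to $1,155.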
Deepgram’s Nova-3 Medical aims to empower developers to create transformative medical transcription applications, thereby driving exceptional outcomes across healthcare and improving patient care.