Vodia PBX now offers real-time call transcription through a seamless integration with Whisper AI, OpenAI’s advanced speech recognition system. With support for multiple languages, technical vocabulary, and noisy environments, Whisper delivers accurate transcriptions even in complex call scenarios. Administrators can enable transcription per tenant using an OpenAI API key, making setup simple and flexible. Once active, all calls are automatically transcribed and accessible in the user portal for easy review and record-keeping. This powerful integration brings enhanced clarity, compliance, and insight to voice communication—whether you're managing support teams, analyzing conversations, or working across language barriers.
Vodia is pleased to announce another enhancement to our phone system - call transcription with Whisper AI. This seamless integration enables Vodia customers to deploy Whisper AI speech-to-text capability within their communications ecosystem, and to configure it for individual tenants.
Whisper, developed by OpenAI, is an Automatic Speech Recognition (ASR) system. Trained on nearly three quarters of a million hours of supervised multilingual and multitask data from the Web, Whisper deftly handles technical language, background noises, and accents (thanks to this diverse, massive data set). It transcribes in a multitude of languages and translates all of them into English.
The Whisper architecture, an encoder-decoder transformer, is a simple, stem-to-stern approach: it separates audio into 30-second segments, which are then converted into a log-Mel spectrum and delivered through an encoder; the decoder anticipates the correct text caption, combined with specific tokens that steer the single model to accomplish numerous tasks, including multilingual speech transcription, speech translation to English, and timestamps at phrase-level.
Vodia announced a beta version of our PBX that connects a phone system to OpenAI realtime API (beta version) in November of 2024. We are delighted we can now provide our customers with a cloud pathway to leverage the power of Whisper AI.
Getting Started with OpenAI Cloud Transcription
To utilize OpenAI's cloud transcription, an OpenAI account and API key are required.
OpenAI Account: Navigate to the OpenAI platform and create or log in via Google, Microsoft, or email.
API Key Retrieval: Access the API Keys page, generate a new secret key, and securely copy it. This key is displayed only once.
Vodia Integration: Within tenant general settings, enable transcription and input the OpenAI API key.
Upon completion, all calls will be transcribed and available within the user portal.
Accessing Call Transcriptions
To view the transcribed content, simply log in to your user portal, navigate to the History section, select the desired call, and review the Call Content area.
Vodia’s prepaid offering on DigitalOcean provides businesses with a fast, flexible way to deploy a fully licensed PBX system. Our tailored plans include options for SBC functionality, certified for MS Teams, enabling seamless integration of advanced PBX features like intelligent call routing, automated attendants, and voicemail, all while ensuring secure, high-fidelity voice communication. With extension-based licensing, organizations can easily scale their system from 10 to 80 extensions, making deployment simple and quick. Whether you’re new to PBX systems or an experienced developer, our step-by-step installation video and documentation guide you through the setup process.
With over 360 million SMBs across the globe, small and medium-sized businesses face a common challenge: delivering the same high-quality communication experience customers expect from large enterprises—without the enterprise budget. The Vodia phone system is built to meet this need, offering a powerful, scalable solution that transforms any device into a professional business phone. Whether your team is remote, mobile, or in-office, Vodia provides advanced features like call recording, voicemail transcription, Microsoft Teams integration, hot desking, CRM integration, and real-time analytics. Our per-user pricing ensures you get all the functionality without overpaying, and our mobile apps keep you connected from anywhere.
The Vodia PBX User Web Portal offers a comprehensive and intuitive interface that gives users full control over their communication experience. Designed to complement Vodia’s zero-touch provisioning, the portal enables secure browser-based calling via WebRTC, and syncing with Microsoft or Google contacts. Users can monitor real-time presence, manage call forwarding, handle parked calls, and control service flags for dynamic call routing. It also supports advanced call queue management, voicemail access with optional transcription, call recordings, and internal or SMS messaging. CRM integrations with platforms like Zoho further streamline workflows, while granular user settings and admin-controlled visibility ensure tailored access based on roles.