Vodia PBX now offers real-time call transcription through a seamless integration with Whisper AI, OpenAI’s advanced speech recognition system. With support for multiple languages, technical vocabulary, and noisy environments, Whisper delivers accurate transcriptions even in complex call scenarios. Administrators can enable transcription per tenant using an OpenAI API key, making setup simple and flexible. Once active, all calls are automatically transcribed and accessible in the user portal for easy review and record-keeping. This powerful integration brings enhanced clarity, compliance, and insight to voice communication—whether you're managing support teams, analyzing conversations, or working across language barriers.
Vodia is pleased to announce another enhancement to our phone system - call transcription with Whisper AI. This seamless integration enables Vodia customers to deploy Whisper AI speech-to-text capability within their communications ecosystem, and to configure it for individual tenants.
Whisper, developed by OpenAI, is an Automatic Speech Recognition (ASR) system. Trained on nearly three quarters of a million hours of supervised multilingual and multitask data from the Web, Whisper deftly handles technical language, background noises, and accents (thanks to this diverse, massive data set). It transcribes in a multitude of languages and translates all of them into English.
The Whisper architecture, an encoder-decoder transformer, is a simple, stem-to-stern approach: it separates audio into 30-second segments, which are then converted into a log-Mel spectrum and delivered through an encoder; the decoder anticipates the correct text caption, combined with specific tokens that steer the single model to accomplish numerous tasks, including multilingual speech transcription, speech translation to English, and timestamps at phrase-level.
Vodia announced a beta version of our PBX that connects a phone system to OpenAI realtime API (beta version) in November of 2024. We are delighted we can now provide our customers with a cloud pathway to leverage the power of Whisper AI.
Getting Started with OpenAI Cloud Transcription
To utilize OpenAI's cloud transcription, an OpenAI account and API key are required.
OpenAI Account: Navigate to the OpenAI platform and create or log in via Google, Microsoft, or email.
API Key Retrieval: Access the API Keys page, generate a new secret key, and securely copy it. This key is displayed only once.
Vodia Integration: Within tenant general settings, enable transcription and input the OpenAI API key.
Upon completion, all calls will be transcribed and available within the user portal.
Accessing Call Transcriptions
To view the transcribed content, simply log in to your user portal, navigate to the History section, select the desired call, and review the Call Content area.
Vodia’s browser calling solution allows businesses to make and receive VoIP calls directly from any web browser, eliminating the need for apps or desk phones. It offers convenience, cost savings, and a wide range of features including chat, voicemail, call transfers, conference calls, video calls, and CRM integration. The system is secure, operating entirely within the browser to reduce exposure to malware, and scalable to support remote and hybrid work environments. With easy setup through the Vodia PBX web interface, organizations can streamline communication, improve productivity, and provide employees with a flexible, reliable, and fully integrated business communication experience.
JavaScript IVR transforms the way businesses handle incoming calls by enabling fully customizable, intelligent phone menu systems. Unlike static IVR setups with limited, pre-defined options, JavaScript IVR allows you to create dynamic call flows that adapt in real time based on caller input, business data, or even external API integrations. This means you can route calls more efficiently, automate complex processes, and offer highly personalized experiences to your customers. Whether you want to check customer records before transferring a call, adjust menu options based on time of day, or integrate with CRM systems for instant data access, JavaScript IVR gives you the flexibility and control to make it happen - all while improving efficiency and enhancing caller satisfaction.
Vodia support is now easier to access through the Vodia Help Center on Jira, giving partners and customers a centralized platform to submit technical support tickets, ask sales or licensing questions, and suggest new features. With a valid license key, users can open detailed requests and track their status in one place. The portal also brings together Vodia documentation, the PBX API, and the Vodia forum, making it the go-to resource for everything Vodia. Whether you're troubleshooting, planning an upgrade, or just need guidance, the Help Center is designed to streamline your experience and connect you with the right support faster.