Voice transcription is about to be your new favourite ChatGPT function

The extra you employ ChatGPT and related generative AI providers, the higher you’ll get at issuing prompts that ship the outcomes you need. And also you’ll uncover new options and capabilities that aren’t instantly apparent to new customers. But it surely turns on the market’s a superb ChatGPT function that you just’re most likely not utilizing, and that’s as a result of it’s hidden in plain sight contained in the official ChatGPT app for iPhone.
It’s a voice transcription function so good it’ll fully change the way you transcribe audio.
Voice transcription is just accessible on ChatGPT for iPhone
You don’t use your voice to speak to ChatGPT on a pc. You load the generative AI in a browser and begin typing away. However I totally count on to make use of voice to speak to ChatGPT-like AI merchandise sooner or later. Particularly on units like Apple’s new Imaginative and prescient Professional spatial pc that may profit from such performance.
It seems that OpenAI added voice enter to the iPhone app. A function I hadn’t observed till stumbling on Insider’s piece that describes the shocking transcription function.
Enable the official app to entry your iPhone’s microphone, faucet that icon on the correct of the textual content enter subject, and ChatGPT will report the audio it hears. That’s a fantastic function should you work with audio recordings like interviews, podcasts, and movies and must extract textual content from them.
Insider discovered that ChatGPT might precisely transcribe somebody’s speech right down to the punctuation. The distinction between ChatGPT on iPhone and a distinct AI app was “outstanding.” Insider says that “you may nearly hear the particular person talking” within the ChatGPT model.
How good is it?
Naturally, I went on to check it on a latest one-minute promo clip from Marvel’s Secret Invasion Disney Plus TV present, which you’ll see under.
ChatGPT recorded the whole lot, after which I simply issued that transcription as a immediate. Foolish ChatGPT supplied a response, which I ignored. The purpose right here was to check the transcription function, which is admittedly good. To not point out that now I’ve the identical chat on my Mac, the place I can export the transcription and edit it.

I’ll say the transcription isn’t good, however that is perhaps my fault. I ought to have raised the audio quantity.
One other factor to notice right here is that the audio doesn’t acknowledge totally different individuals speaking. However that’s as a result of it’s not programmed to take action. Subsequently, should you plan on utilizing ChatGPT as a voice transcription service, you’ll want to concentrate to who’s talking.
The excellent news in all of that is that OpenAI CEO Sam Altman instructed Insider that ChatGPT makes use of one other OpenAI tech known as Whisper. And Whisper is so good as a result of OpenAI used giant quantities of audio knowledge from the web to coach the AI with out human supervision.
Meaning the AI type of skilled itself to know any audio by processing giant quantities of audio knowledge.

I’m enthusiastic about the way forward for utilizing voice to speak to AI
With that in thoughts, there’s clearly a case right here for transcription apps constructed with Whisper that might provide extra options. Like recognizing every impartial speaker, providing timestamps, and permitting you to rapidly flick thru an audio file by way of prompts. And these apps ought to do the transcription nearly instantaneously, with out you really opening the file.
Whereas I’m simply capturing concepts off the highest of my head, I’m certain we’ll see such AI providers down the street from OpenAI or different firms.
There’s already an app like that for Mac that was found by Insider. Whisper Transcription is offered as a free obtain from the Mac App Retailer. It’s been there for greater than 4 years, apparently.
Additionally, Whisper tech makes me enthusiastic about having voice-based conversations with AI on the Imaginative and prescient Professional. However we’ll cross that bridge after we get there.
That mentioned, right here’s the clip that ChatGPT transcribed for me, so you may see for your self how good the function is.