Skip to main content

Six AI Audio Cleaning and Transcription Resources for Video and Animation Content Creators

AI Audio Cleaning Robot. Image by TET and Leonardo.ai
While AI applications may have been receiving a lot of bad press lately in the visual arts, there are definitely times when AI is a game changer, in a good way, for more mundane applications like audio cleaning and transcribing.

Maybe there's some militant audio engineers or transcribers out there who just love what they do but the ability to give an audio file to an AI, and have it automatically improve the quality of sound or transcribe an hour or more of speech in seconds, is pure magic.

Sure the AI doesn't always get it right, particularly with transcribing, but it's pretty good. Plus, correcting a few short falls is certainly better than doing all the work yourself.

I recently recorded a thirty minute video with audio that was borderline awful. It was clear enough to understand but I hadn't been able to filter out the static noise of the microphone, and it would distort on the louder sections, just enough to notice, even though the levels were well under the clipping threshold.

Nothing I did in post would fix it so I decided to see what AI audio cleaning services were out there (and of course find out if there were any free services).

Clean Voice AI

Sound Wave Before and After Cleaning.
I'm leading with Clean Voice AI because it was the service I used to fix my audio. Your first 30 minutes is completely free (which was all I needed).

Clean Voice over delivers, not just fixing terrible audio but also removing dead air, 'ums' and 'ahs' (as well as other mouth sounds), and more.

Although I didn't try Clean Voice's audio transcribing, its inclusion  makes the site a great, one stop service, for many of your audio needs. It's particularly targeted at podcasters including a number of free tools for them like their Podcast Episode Title Generator.

Clean Voice AI is browser based. Subscription pricing seems quite reasonable to me but what I most liked is that you can pay as you go as well.

Audo Studio

If all you need is just a straight up audio cleaner that can remove almost any unwanted background noise then Audo Studio is a browser based application that may be what you're looking for.

Some examples of the kind of noises they can fix include background restaurant noise, bird squawks, dog barking and more. Audo Studio can also auto adjust volume levels so your voice can be heard.

Another nice feature is that you can upload video files for audio cleaning. No need to separate your audio just to use the service. Which may be useful if you're wanting to fix audio on older completed videos.

Audo Studio is subscription based but they do have a free plan that gives you 20 minutes of audio cleaning per month.


Deciphr

Transcript Sample from Deciphr
Another browser based AI service targeting podcasters (but also can process video files) Deciphr is a one stop shop for turning audio into all kinds of text. 

Not limited to transcribing, it can also generate show notes, show summaries, pull out quote highlights, create captions for social media posts, list keywords for SEO and, on a paid plan includes the creation of audiograms and video reels (highlighted audio and video for social media). 

All output is organized on a sharable page with a nice headline or you can download everything as a Word Document.

Deciphr has a flexi-free plan that gives you 40 minutes of audio/video upload to get started then it's a pay as you go plan. Unfortunately you do need a credit card as they only accept payments through Stripe. Which for me is disappointing because I would definitely subscribe to a plan if I could use PayPal.

Riverside

Riverside is actually a complete browser based studio for professional podcast and video recording which you can try for free. Alongside that Riverside has a host of free and paid tools including their transcription service (which is free).

You don't need an account for their transcription service, just drag'n'drop an audio or video file onto the browser window and you're away. Text can be downloaded as a transcript or caption text file. 

Note that if you're using a browser other than Chrome or Edge you may find this doesn't work. I've been trialing Opera's Browser and the transcript tool wouldn't go past the choose file section. Chrome worked just fine though.

Well worth checking out some of their other free tools which include things like a YouTube Channel Name generator.

Descript

The Descript Editor
The Descript Editor can edit video
direct from your script.
Descript
is also an all in one video creation tool that is browser based but also has a desktop version. Their studio uses a fairly unique concept of editing video based on your text script.

Some of Descript's features include cleaning your audio, removing filler words, and you can clone your own voice and have it speak new dialogue. There's also natural speaking AI voices you can utilize.

Since Descript's video editor relies on a text based script it goes without saying that it can also transcribe your audio and video. Not only that, if you already have a transcription, they can sync it to your media word for word.

Descript is well worth a look since there is a free plan that gives you most features with the ability to create up to an hour of video per month.

* Note: Links to Descript are affiliate links that support this site if you sign up for a Descript paid plan.

AI-coustics

AI-coustics is a fairly basic, browser based, AI audio cleaner if you just need to knock out some background noise from your recordings. Clean up to an hour of audio per month on the free account. Test the service out before you sign up in their Playground area.

The Levelator (Bonus Non-AI Free Software)

Not really an audio cleaning tool but more of an audio enhancing tool for podcasters and video creators too. The Levelator is free software for Mac or Windows that simply takes your voice audio and adjusts the speaking level of all voices to one consistent level.

Pretty much does the job of a compressor filter but the authors say it does more than that, evening out all the voices so none sound too quiet and hard to hear.

The software is quite old now but still does the job. Great if you just need something simple to even out your audio and don't really understand the technicalities of using a compressor filter in your video/audio editing software.
 

Comments

Popular posts from this blog

Inochi2D - Free Open Source 2D VTuber Avatar Rigging and Puppeteering Software (Part 1)

Inochi2D Creator - Free Open Source VTuber Software. If you've been looking for a way to live perform as a 2D cartoon avatar on camera, whether it be for a live stream or for pre-recorded content like educational videos, then VTuber software is a low cost (or even no cost) option worth looking into. In my previous post, How to Become a VTuber - 2D and 3D Software for Creating and Controlling Your Avatar , I took a brief look at the relatively new but completely free and open source Inochi2D  which I thought showed great potential for my own needs of creating a live performance character rig for my own TET Avatar that I use for all my promotional materials. While it is possible to live perform my character using Cartoon Animator itself, Reallusion's MotionLive2D capture system isn't great - with lip sync in particular. More importantly though, I can't exactly teach people how to use Cartoon Animator if I'm using Cartoon Animator to control my Avatar. What is Inochi2D...

LTX Studio (Beta): AI-Powered Visual Storytelling, From Script to Screen in One App.

LTX Studio can generate consistent characters across storyboard panels - even if one character is a dragon! W hile text to image, and text to video (and image to video) AI tend to be getting a lot of the press, the real exciting aspect of generative AI implementation is how it can be used to speed up creator workflow. Being able to realize your creative vision in a shorter length of time can lead to more ambitious projects. Particularly if you're a team of one, with a very limited budget, but you one day dream of creating your own epic animated feature film. LTX Studio (beta), a new 'all-in-one' AI film making tool, is not going to let you realize that dream from a single text prompt but, by bringing a bunch of generative AI technologies together, the developers have created a one platform workflow that can help anyone rapidly visualize and deliver a story from initial idea to finished film in days rather than weeks (depending upon how ambitious the project is). Even bette...

Review: Toon Boom Harmony 14 - What I learned in 21 Days

Toon Boom Harmony is widely considered the industry standard for primarily 2D animation. You don't get to be that if your software isn't exceptional. However, industry standard and exceptional usually translates to steep learning curve and probably contains more features than I'll ever use. So, in reviewing the latest version, Harmony 14, I'm setting out to answer two questions; How easy is it to learn the basics and is it software an independent artist/animator, like myself, should seriously consider as their go to, 2D animation studio of choice?

The Ultimate Independent Animator's App and Resource List - Animation and Video Life

Image created with Cartoon Animator 4. Being an independent animator is not like a studio animation job. There's so much more to do that is indirectly related to the actual task of animating. Over the years I've sought out many apps, tools, and services that can help me achieve that one single task, expressing myself through animation. Below is my Ultimate Independent Animator's Resource List for 2024 (last updated Oct 2024). It started out as a list of free or low cost apps that could help you in every stage of producing either 2D or 3D animation, and then just kind of grew from there. You may not have been looking for a Time Management App as much as you needed something to get you started in 3D animation but when those commissioned projects start coming in you'll have a head start on maximizing your time. All the apps and services on this list had to meet two main criteria: They had to be useful and relevant to an Indy Animator/artist. The base app/se...

Eight 2D Animation Apps For Your Phone or Tablet Mobile Device

M obile productivity apps have become so capable that they can be great alternatives to their PC/MAC equivalents or serve as great tools in their own right when you're away from your desk. While some apps simply mimic their desktop counterparts, others offer well thought out, touch-friendly interfaces that are easier and more fun to use. Every so often I check out what's available for 2D animation for Android devices, since that's what I use, that can complement my workflow with Reallusion's Cartoon Animator 5. Some may be available for Apple devices as well. Below I've listed six free (F) apps (with optional paid (P) upgrades) on the Google Play Store that you might want to explore. Some are just fun apps on their own while others may be useful as part of your workflow on bigger animation projects. Not all are exclusively animation apps and could be used on any production. JotterPad (F/P) The name JotterPad makes this sound like a notepad application but it's ...

Krita AI Diffusion - Generative Image AI For Krita is Seriously Useful, Powerful and Free (If You Can Install it Locally)

Generative AI sequence of a woman in a business suit. From sketch to refined image using Krita AI Diffusion - by TET G enerative image AI, where you describe an image with a text prompt to an Artificial Intelligence model and it produces a new image based on your prompt, is gaining a strong hold as a tool for many artists. Krita AI Diffusion brings generative AI image tools right into your favourite free and opensource, graphics editor, Krita. Not only that, if you have a computer with decent specs (and at least 10GB of hard drive space), Krita AI Diffusion is completely free. What If I Don't Have a Powerful Computer? If you're in my situation, with a computer that was around before anyone in the mainstream had even heard of generative AI, you can still access Krita AI Diffusion for free, using a cloud based AI server, Interstice  and 300 tokens, to get you started. Once your initial tokens run out, purchase 5000 more for 10€ (approx US$11.00). Tokens never expire. I would...

The Future of Animation? Using Reallusion's Cartoon Animator and Image to Video Generative AI to Animate Sequences

Animation software is already capable of auto generating tweens but will AI take it to the next level and tween entire sequences of motion? M y earlier article,  Reallusion's Cartoon Animator Versus PixVerse Image to Video and Lip Sync AI - Battle of the Talking Head Avatars , got me thinking about using generative AI for creating the 'tween' animation in between key poses. This isn't a new idea, and I have explored tweening with AI before in my article,  Five AI Generative Image to Video Tools For Animation You Can Try Free Right Now . That was almost eight months ago and the results were mixed and barely usable. However PixVerse AI is exceptional at maintaining the art style of whatever image you begin with and will also let you include an end frame, making it an ideal tool for tweening, so...  What if, in a future version of Cartoon Animator (or your preferred 2D animation software), all you had to do was create the key moments of the scene, and then the software...