What’s audio extraction?

Audio extraction is used in speech recognition and music analysis to search audio for specific characteristics. It identifies words in speech and can organize music by genre. Dragon® sells AudioMining®, which transcribes audio files and tags them for search. Other manufacturers include Nuance® and Nexidia®.

Audio extraction is most often used in speech recognition software and music analysis. It lets the user search speech or music recordings that have been analyzed for specific characteristics. In speech recognition, audio extraction identifies the words spoken in a recording and stores them in a searchable file. This can be useful for students or business people who attend many meetings, because it makes it easier to find a particular topic in a spoken presentation. In music, the same kind of analysis can determine characteristics such as beats per minute (BPM), key and musical structure, all of which are used to classify music.
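As a rough illustration of how a tempo figure can be derived, the sketch below estimates BPM from a list of beat onset times. It assumes the onsets have already been detected; real analysis software finds them in the audio signal itself, which is not shown here. The function name `estimate_bpm` and the sample onset times are hypothetical.

```python
from statistics import median


def estimate_bpm(onset_times):
    """Estimate tempo in beats per minute from beat onset times (seconds)."""
    if len(onset_times) < 2:
        raise ValueError("need at least two onsets")
    # Time gaps between consecutive onsets, in seconds.
    intervals = [b - a for a, b in zip(onset_times, onset_times[1:])]
    # The median tolerates an occasional missed or extra onset.
    beat_period = median(intervals)
    return 60.0 / beat_period


# Hypothetical onsets: a steady beat every 0.5 s with one missed beat.
onsets = [0.0, 0.5, 1.0, 1.5, 2.0, 3.0]
print(estimate_bpm(onsets))  # 120.0
```

The median is used instead of the mean so that a single missed beat does not drag the estimate down.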

In speech recognition, where the technology is used most often, audio extraction is used to build an acoustic model. An acoustic model teaches speech recognition software to recognize patterns of sound as words. It is built by extracting audio features from a recording of a spoken sentence and comparing them with the text of that sentence. The software then uses this information to recognize words when the user makes sounds similar to those in the acoustic model. The acoustic model works together with a second file that tells the speech recognition program which language to interpret and which word patterns can be spoken in particular sentences and situations.
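The interplay between the acoustic model and the file of permitted word patterns (often called a grammar or language model) can be sketched in a few lines. This is a toy illustration, not how any particular product works: the acoustic scores are made-up numbers standing in for an acoustic model's output, and `best_sentence` and `allowed_pairs` are hypothetical names.

```python
def best_sentence(candidates, allowed_pairs):
    """Pick the highest-scoring word sequence that the grammar permits.

    candidates: one dict per spoken segment, mapping word -> acoustic score
                (higher means the sound matched the model better).
    allowed_pairs: set of (previous_word, next_word) pairs the grammar allows.
    """
    if not candidates:
        return None
    # paths maps the last word of a partial sentence to (total score, words).
    paths = {word: (score, [word]) for word, score in candidates[0].items()}
    for segment in candidates[1:]:
        new_paths = {}
        for word, score in segment.items():
            # Keep only the extensions the grammar allows.
            options = [
                (total + score, words + [word])
                for prev, (total, words) in paths.items()
                if (prev, word) in allowed_pairs
            ]
            if options:
                new_paths[word] = max(options, key=lambda o: o[0])
        paths = new_paths
    if not paths:
        return None
    return max(paths.values(), key=lambda p: p[0])[1]


# Hypothetical scores: the sound alone slightly favors "their".
candidates = [{"their": 0.7, "there": 0.6}, {"is": 0.9}]
allowed_pairs = {("there", "is")}
print(best_sentence(candidates, allowed_pairs))  # ['there', 'is']
```

Here the sound of the first word matches "their" slightly better, but the word-pattern file only permits "there is", so that is the sequence chosen.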

Musicians and music listeners alike can benefit from audio extraction in music. Music software that classifies music by genre sometimes uses audio extraction to organize a collection. The process identifies music files with the sonic similarities that are typical of a genre and groups them together. While this technology can make it easier to organize music and find new music, it can make mistakes by grouping songs that have similar measured characteristics but sound quite different overall. Audio analysis software can also be useful to musicians, especially composers, because it allows the composer to jump to specific parts of a song’s structure, such as key changes and sections of the lyrics.
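Grouping by measured similarity can be sketched as a nearest-profile lookup. The two features used here (tempo and share of bass energy), the reference values and the genre names are all hypothetical; real software typically measures many more characteristics.

```python
import math

# Hypothetical reference profiles: (tempo in BPM, share of energy in the bass, 0-1).
GENRE_PROFILES = {
    "house": (125.0, 0.60),
    "ballad": (70.0, 0.30),
}


def distance(a, b):
    # Divide the tempo difference by 100 so both features span a similar range.
    return math.hypot((a[0] - b[0]) / 100.0, a[1] - b[1])


def closest_genre(features):
    """Return the genre whose reference profile is nearest to the features."""
    return min(GENRE_PROFILES, key=lambda g: distance(features, GENRE_PROFILES[g]))


print(closest_genre((128.0, 0.55)))  # house
print(closest_genre((72.0, 0.35)))   # ballad
```

The sketch also shows where mistakes come from: any track measured at about 126 BPM with strong bass lands in "house", whatever it actually sounds like.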

Speech recognition software maker Dragon® sells a program called AudioMining® that transcribes audio files and tags them so that their contents can be searched as text. Dragon produces computational linguistics programs, computational linguistics being the technical term for the field of software designed to interpret speech and language. Audio mining, written as two words, is a general term for analyzing an audio file for a certain set of audio characteristics. Other manufacturers of audio extraction software include Nuance® and Nexidia®.
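To make the idea of a searchable transcript concrete, here is a minimal sketch of a word index with timestamps. It is not based on how AudioMining® or any other product stores its data; the transcript, the function name `build_index` and the times are invented for illustration.

```python
from collections import defaultdict


def build_index(transcript):
    """Map each lower-cased word to the times (seconds) at which it was spoken."""
    index = defaultdict(list)
    for start_time, word in transcript:
        index[word.lower()].append(start_time)
    return index


# Hypothetical transcript: (start time in seconds, recognized word).
transcript = [(0.0, "Budget"), (0.6, "review"), (12.4, "budget"), (13.1, "approved")]
index = build_index(transcript)
print(index["budget"])  # [0.0, 12.4]
```

Looking up a word returns every point in the recording where it occurs, so a listener can jump straight to those moments.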



