How to separate voice and background music from a video file

8

2

Is it possible to separate voice and background music from a video file?

I only need the background music.

Type of video: MP4

Working on Windows XP 32 bit SP3

SpongeBob SquarePants

Posted 2011-02-14T16:42:35.343

Reputation: 429

Perhaps not so easy http://www.youtube.com/watch?v=LtXrKo8Btfc

– Dr. belisarius – 2011-02-14T17:00:13.200

Answers

4

Unless they're separate audio tracks in your video, not easily. What you'll probably have to do is extract the audio track from the video into a separate file, edit the audio file with a dedicated tool, then remux the result back into the video.

The demux/remux part is easy. What's going to be difficult is attempting to isolate the background music. You'll probably have to experiment with different effects, all of which will most likely result in either a significant loss of fidelity in the audio or not entirely removing the dialogue, if not both. What's more is that you're going to be re-encoding that output into a new mp3/aac file, and between the re-encoding and audio processing, your output is going to sound much worse than the original.

You may have better results by trying to re-master the background music and replacing the audio track in the movie file entirely.

afrazier

Posted 2011-02-14T16:42:35.343

Reputation: 21 316

replacing it would be crazy and hopeless, since I am just music lover, I don't know anything else about music. Is there any 3rd party program out there which can help me? – SpongeBob SquarePants – 2011-02-14T17:08:41.370

No, unless you have source material with separate tracks for the various audio pieces, separating out music from voice is extremely difficult and for the tools there are out there that claim to do it, quality is always poor. – BBlake – 2011-02-14T17:42:22.170

7

Audacity

Audacity – a free and open-source cross-platform audio editor – can do this, using the Vocal Reduction and Isolation effect. You should first extract the audio from the video file, e.g. using ffmpeg:

ffmpeg -i video.mp4 -c:a pcm_s16le audio.wav

And then load the audio.wav file into Audacity:

If you only want to get background music, select the Remove Vocals option; if you want the opposite, choose Isolate Vocals.

Note that this is never going to sound perfect. Vocal isolation is a hard task, as everything you hear is basically mixed into two tracks. An algorithm will never be as good as your brain in isolating different sound sources. Your audio source should be a stereo file with the vocals being panned dead-center. It might also produce false-positives, removing other instruments in the process.

Spleeter

Spleeter is a Python library that can extract music and vocals from a joint audio source. It is machine-learning based and can provide different output types (the number of stems extracted).

slhck

Posted 2011-02-14T16:42:35.343

Reputation: 182 472

This was one of best tutorials I've read about Audacity. Underrated answer. Thank you very much :) – Zeta.Investigator – 2017-12-02T18:59:23.743

Is it just a band-pass filter? Then what is "Strength" meaning in this? Does is attenuate the amplitude of the out-of-band signals or does it have something to do with power (or log of power)? – Zeta.Investigator – 2017-12-02T19:01:18.867

1

@Zeta.Investigator Just briefly skimming the article and the source code, it appears to be the power of the band pass / band stop that will be added or subtracted from the center. Apparently it's normalized by the difference between 1) the power of the middle vs. overall signal and 2) the power of the side vs. overall signal. Then it's exponentiated – so I assume it's on the log scale. (But I'm not an expert on this kind of processing or LISP.)

– slhck – 2017-12-03T11:30:59.153

0

I managed to remove background conversation/noise from a video by running a low-pass filter that was built in to the editor that I was using. The whirrs of my robot, which was the target of my video, remained.

Note that I am not an audio expert, and I cannot guarantee this will work for your particular case, but it's worth a try.

Bigbio2002

Posted 2011-02-14T16:42:35.343

Reputation: 3 804