• by mmastrac on 2/16/2025, 3:53:00 PM

    I tried the Audacity noise-removal plugin recently and it's complete crap. I fed a high-quality audio stream from a Rode mic into a few different options to see which could remove the noise of my server rack. iMovie made the voice sound like a robot and Audacity barely did anything. The only thing that worked was DeepFilterNet and it's free, open-source and cargo installable.

    There's no reason to lock yourself into an intel-only solution. Just use DeepFilterNet. The results of this on my noisy server room were insanely good. Almost no voice dropout with 100% fan noise removal.

    https://github.com/Rikorose/DeepFilterNet

    EDIT: Even more interesting, it looks like OpenVino is just DeepFilterNet glued to Whisper.cpp and tied to Intel hardware.

    https://github.com/intel/openvino-plugins-ai-audacity/tree/m...

  • by sorenjan on 2/16/2025, 3:16:27 PM

    This only works with Intel GPUs, and CPUs and NPUs. No Nvidia support for instance.

    https://docs.openvino.ai/2024/about-openvino/release-notes-o...

  • by smusamashah on 2/16/2025, 3:35:53 PM

    Is there a tool that can remove very noisy audio recording of a song using actual song as a reference?

    I found a very old audio cassette from my childhood with me and some other kids talking while a song is playing in background. I tried subtracting the song using Audacity but for that to work reference song and recording must align "perfectly" which is very very hard. Not just the timing (which i found can be a problem with cassettes) loudness/frequency distribution must also align perfectly.

    Found Smartsubtract https://oxfordwaveresearch.com/products/smartsubtract/ which seems to do exactly the same but it's not available for download.

    Is there any (AI even?) tool that might do that? I tried an online AI tool which claimed it can extract voices but it returned back silence. I want to try OpenVino but not sure it will be useful with faint spoken words in a noisy environment with a song.

  • by kmfrk on 2/16/2025, 4:05:12 PM

    I'm a big fan of RTX Voice, but it seems like the kind of feature you can only use in real-time as virtual audio and not as postprocessing. Anyone if Nvidia makes this possible?

  • by pabs3 on 2/17/2025, 5:34:09 AM

    Wonder if these are open models like RNNoise now is.

    https://github.com/xiph/rnnoise