As someone who's tried this repo extensively with politician sound clips (I wante...

forgingahead · on April 18, 2020

Real Time Voice Cloning certainly has iffy output, but it's probably the most popular because it provides the easiest plug-and-play experience with even a simple UI to get started.

The author says he's working on a more polished toolkit called Resemble.AI, but I've never tried it. https://www.resemble.ai/

There's certainly a market out there for just beautifying existing repos to make it easier for non-scholars to get going. Even having a Colab Notebook ready to click-and-start is quite powerful -- probably a big reason why First Order Model (source paper to the original story) got so much traction so quickly.

egfx · on April 18, 2020

See https://www.descript.com/lyrebird-ai for another one with an on-site demo.

im3w1l · on April 18, 2020

And sometimes it's not even non-scholars, just people from a different field, and not even a very different one.

abledon · on April 18, 2020

Could you post some examples on Dropbox of the 'less than stellar' results you were getting?

Der_Einzige · on April 19, 2020

Find audio clips of Trump speaking. Try running them through this.

See what you get.

To be fair, I was trying this repo almost 6 months ago, so updates may have improved it.