Multi-speaker voice cloning

#5
by thomasgauthier - opened

How does voice cloning work for multi-speaker prompts like "[S1] Speaker 1 says something [S2] Speaker 2 responds"?

Does the voice encoder support multi-voice reference clips?
