In this guide, how to install RVC on your computer and run locally

Download RVC

RVC Project has made portable version in zip file, this method are much quicker as you can download with IDM. At this time of writing, version 1006 is most recent.

Click download as shown below

Wait download to finish

Install

Once download complete, open the 7z file and extract to place you want, for me I extract to E:\ drive and rename to rvc-1006

Run

Once extracting complete, to run as user, execute go-web.bat as shown below.

RVC will open Web GUI automatically with default browser that you set

Training

By default, RVC ship a simple model that you can play with, but this guide show how to train custom model, most lengthy process is prepare dataset

Prepare Dataset

In order training to work, you must gather a clean voice:

No background noise
No background music
No echo
No reverb

Audio format that RVC accept can be any (MP3, MP4, OGG, OPUS). For better, convert voice dataset to WAV file, preprocess will be quicker.

For this example, I will be using my waifu: Ijichi Nijika from Bocchi the Rock. Her voice can be get from this YouTube because uploader has remove background noise, music, echo and reverb. Nijika voice in plain

Convert the MKV (VP9, Opus) file to FLAC file by using Internet Friendly Media Encoder:

Once convert is complete, open the FLAC file in Audio Editor, for me, Adobe Audition, this time we need split her voice into 10 sec or less:

Once you split her voice in 10 seconds or less, save it in WAV file like this and gather around 10 file or more:

If you too lazy, I have prepared the dataset, simply extract it:

Let’s Training

What we need

Give a name (no space!):

Higher Sample Rate:

Use V2

How many CPU thread you have? set to that or less

Folder location that contain wav dataset:

Size of training, adjust Batch size per GPU based on VRAM you have, RTX3090 can use 20 batch:

Follow this

Once everything is set, click One-click training:

Wait training to complete

Training can take time based on how many dataset you have, how powerful your GPU is, for this time just wait until training complete, check the Windows Command Prompt console

Split Music & Voice

This step show how to split music and voice, even background choir, for this to work, we need:

Ultimate Vocal Remover

UVR5 can be download here: GitHub Release

Download UVR Model

Click wrench icon (settings) and go to here:

You need

VR Architecture

UVR-DeEcho-DeReverb
UVR-De-Echo-Aggressive
UVR-DeNoise

MDX-Net

MDX23C-InstVoc HQ
UVR-MDX-NET Karaoke
UVR-MDX-NET-Voc_FT
UVR_MDXNET_9482

Set Input File/Output Folder

Step to Properly Split Audio Stem

Split Music & Voice

If music have background choir, do this: MDX-Net ➡️ UVR-MDX-NET Karaoke

If music only have one singer, do this: MDX-Net ➡️ MDX23C-InstVoc HQ -or- UVR-MDX-NET-Voc_FT

Once process splitting the stem is complete, open the Output folder, UVR will create s sub-folder of music file name like this, then use processed file back to UVR for further cleanup: