HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling

🎉 Discrete Neural Codec With 24 Tokens Per Second (24KHZ) for Spoken Language Modeling!

Installation

To install HHCodec, follow these steps:

conda create -n hhcodec python=3.10 # it must >3.10 beacause use bigvgan
conda activate hhcodec
git clone https://github.com/rongkunxue/HH-Codec.git
cd HH-Codec 
pip install -e .

#if you want to eval by UTMOS
pip install pip==24.0
pip install fairseq

Train

Step 1: Prepare the Training Dataset

Ensure your dataset is preprocessed by following the instructions in dataset

Step 2: Modify Configuration Files

Before starting training, update the configuration settings

# Open and modify the following file "configs/train.yaml"
# Adjust parameters such as:
# - log settings
# - train_path
# - save_dir
# - device (e.g., CPU/GPU)

Step 3: Start Training

Once the dataset is prepared and the configuration is set, launch the training process:

#We expect to finalize and open-source the training code within two weeks.

Acknowledgement

The HHCodec codebase is adapted from the following repositories:

A huge thanks to the authors of these projects for their outstanding contributions! 🎉

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
config		config
dataset		dataset
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
requirements.txt		requirements.txt
setup.py		setup.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling

Installation

Train

Step 1: Prepare the Training Dataset

Step 2: Modify Configuration Files

Step 3: Start Training

Acknowledgement

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

opendilab/HH-Codec

Folders and files

Latest commit

History

Repository files navigation

HH-Codec: High Compression High-fidelity Discrete Neural Codec for Spoken Language Modeling

Installation

Train

Step 1: Prepare the Training Dataset

Step 2: Modify Configuration Files

Step 3: Start Training

Acknowledgement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages