Sound Resurrection

Sound Resurrection collects research results in sound processing for machine learning. The main goal is to design a model that generates sound from masked input, with a focus on audio taken from video conferences. The project also covers audio denoising (here, removing background rustle or conversations, if present) and generating sound with a richer spectral envelope from a downsampled signal, with particular attention to the distortions that result from audio compression.
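To make the two input types concrete, here is a minimal sketch of what a "masked" and a "downsampled" signal could look like. It is an illustration only, not the project's preprocessing code: the function names `mask_segment` and `naive_downsample`, the 16 kHz sample rate, and the synthetic tone are all assumptions, and plain Python lists are used to keep the example dependency-free.

```python
import math


def mask_segment(signal, start, length):
    """Return a copy of `signal` with one contiguous span zeroed, plus a boolean mask."""
    masked = list(signal)
    end = min(start + length, len(masked))
    for i in range(start, end):
        masked[i] = 0.0
    # True marks the samples a generative model would have to fill in.
    mask = [start <= i < end for i in range(len(masked))]
    return masked, mask


def naive_downsample(signal, factor):
    """Keep every `factor`-th sample (no anti-aliasing filter, illustration only)."""
    return signal[::factor]


# One second of a 440 Hz tone at an assumed 16 kHz rate stands in for speech.
sample_rate = 16_000
clean = [math.sin(2 * math.pi * 440.0 * n / sample_rate) for n in range(sample_rate)]

masked, mask = mask_segment(clean, start=4_000, length=1_600)  # hide 100 ms
low_rate = naive_downsample(clean, factor=2)                   # 8 kHz version
```

A model for the first task would receive `masked` (and possibly `mask`) and try to reproduce `clean`; a model for the bandwidth task would receive `low_rate` and try to recover the full-rate signal.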

This document gives a brief description of the project and explains its structure.

Project structure

root:
    doc:                    # project's documentation
        Documentation.md    # mainly non-technical documentation of the project
    scripts:                # executables connecting the user with software modules
    src:                    # the source code
        preprocessing:      # data preprocessing tools
        utilities:          # project-wide utils
    tests:                  # tests for the software
        unit:               # unit tests

How to use

To use the project's functionality, follow these steps:

In the project's root directory, run:

python3 setup.py setup_venv

Then activate the virtual environment with:

source venv/bin/activate

Install the necessary Python packages:

pip install -r requirements.txt

Then you can run scripts from the scripts folder.

If you are a contributor, also complete the following steps after activating the environment.

Install the Python packages needed for project management, testing, etc.:

pip install -r requirements-dev.txt

To install the TensorFlow library properly:

# If you want to use GPU acceleration (recommended)
pip install --extra-index-url https://pypi.nvidia.com tensorrt-bindings==8.6.1 tensorrt-libs==8.6.1
pip install -U "tensorflow[and-cuda]"

# Otherwise
pip install tensorflow==2.15.0
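After either install path, a quick check can confirm that TensorFlow imports and whether it sees a GPU. The snippet below is a sketch rather than part of the project; the helper name `tensorflow_status` is an assumption, while `tf.config.list_physical_devices` is the standard TensorFlow call for listing devices.

```python
import importlib.util


def tensorflow_status():
    """Report whether TensorFlow is importable and how many GPUs it can see."""
    if importlib.util.find_spec("tensorflow") is None:
        # TensorFlow is not installed in the active environment.
        return {"installed": False, "version": None, "gpu_count": 0}
    import tensorflow as tf

    gpus = tf.config.list_physical_devices("GPU")
    return {"installed": True, "version": tf.__version__, "gpu_count": len(gpus)}


status = tensorflow_status()
print(status)
```

A `gpu_count` of 0 after the GPU install usually points to a driver or CUDA library mismatch rather than a problem with the project itself.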

To use code checks:

# Install pre-commit hooks that run on each commit...
pre-commit install
# ...or run the checks manually (especially before marking a feature branch as ready)
pre-commit run --all-files

Make sure that all tests in the tests directory pass before a feature branch is considered for merging into master:

python3 setup.py run_unit_tests

To make use of the .editorconfig file, install the EditorConfig extension for your IDE, e.g. for Visual Studio Code:

code --install-extension EditorConfig.EditorConfig

Running training

The training data is available at datashare.ed.ac.uk. VCTK is a free dataset of short speech samples that vary mainly in the speaker's gender and accent.

To set up the workspace before training, follow the guide above. Training a specific model involves running a script from the scripts folder. Each such executable should come with an external config file whose comments describe its contents and the purpose of each part.
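As a rough illustration of the "commented external config" idea, the sketch below parses an INI-style config with Python's standard `configparser`. Everything specific here is an assumption: the project's real config format, section names, keys, and the helper `load_training_config` are not documented above and may differ.

```python
import configparser

# Hypothetical config contents; the real scripts may use other keys or formats.
EXAMPLE_CONFIG = """
[data]
# Directory holding the downloaded VCTK recordings (assumed path).
dataset_dir = data/vctk

[training]
# Number of passes over the training set.
epochs = 10
# Number of samples per gradient step.
batch_size = 32
learning_rate = 0.001
"""


def load_training_config(text):
    """Parse the config text and return typed values in a plain dict."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return {
        "dataset_dir": parser.get("data", "dataset_dir"),
        "epochs": parser.getint("training", "epochs"),
        "batch_size": parser.getint("training", "batch_size"),
        "learning_rate": parser.getfloat("training", "learning_rate"),
    }


config = load_training_config(EXAMPLE_CONFIG)
```

Keeping the comments next to each key means the config doubles as its own documentation, which matches the intent described above.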

WiktorProsowicz/sound-resurrection
