PyTorch implementation of the method described in:
Schlüter, Jan, and Sebastian Böck. "Improved musical onset detection with convolutional neural networks." 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2014.
- PyTorch
- librosa
- NumPy
- Matplotlib (optional)
- The dataset can be obtained from here (as long as the Google Drive links are alive): "CPJKU/onset_db#1 (comment)"
- Run `gen_songlist.py` to get the list of all songs for which onset annotation data is available (the dataset contains some extra audio files).
- Run `get_data_stats.py` to compute the mean and standard deviation across the 80 mel bands over the entire dataset (see the first sketch after this list).
- Run `gen_data.py` to generate the 15-frame mel-spectrogram chunks and frame-wise labels for all the audio files (see the chunking sketch after this list).
- Run `train.py` to train the network. Specify a fold number on the command line when running this script; it is used to partition the data into train and validation splits using the split files provided by the authors. The training almost exactly follows the procedure described in the paper (a rough training-loop sketch follows this list). The weights at the end of 100 epochs are saved in the `models` folder.
- Run `test.py` to evaluate on the dataset. Again, specify a fold number to get the results for that fold. Results are saved to a text file as the number of true positives, false alarms, and ground-truth onsets, summed over all the validation songs, for a range of evaluation thresholds (see the evaluation sketch after this list).
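For reference, here is a minimal sketch of the kind of statistic `get_data_stats.py` computes: the per-band mean and standard deviation of log-mel frames pooled over a set of audio files. The sample rate, FFT size, hop length and log compression below are assumptions, not necessarily the repository's settings.

```python
import numpy as np
import librosa

def per_band_stats(audio_paths, sr=44100, n_fft=2048, hop_length=441, n_mels=80):
    """Mean and std per mel band over all frames of all files."""
    frames = []
    for path in audio_paths:
        y, _ = librosa.load(path, sr=sr)
        mel = librosa.feature.melspectrogram(
            y=y, sr=sr, n_fft=n_fft, hop_length=hop_length, n_mels=n_mels)
        frames.append(np.log(mel + 1e-8))            # (n_mels, n_frames)
    all_frames = np.concatenate(frames, axis=1)       # pool frames from every file
    return all_frames.mean(axis=1), all_frames.std(axis=1)  # each of shape (80,)
```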
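Similarly, a rough sketch of the chunking step performed by `gen_data.py`: each frame of a log-mel spectrogram becomes one 15-frame context window with a binary onset label. The edge padding, label tolerance and frame-rate handling are assumptions.

```python
import numpy as np

def make_chunks(logmel, onset_times, frame_rate, context=15, tolerance_frames=1):
    """logmel: (80, n_frames) array; onset_times: annotated onsets in seconds."""
    half = context // 2
    n_frames = logmel.shape[1]
    # mark a frame positive if an annotated onset falls within +/- tolerance_frames
    labels = np.zeros(n_frames, dtype=np.float32)
    onset_frames = np.round(np.asarray(onset_times) * frame_rate).astype(int)
    for f in onset_frames:
        lo, hi = max(0, f - tolerance_frames), min(n_frames, f + tolerance_frames + 1)
        labels[lo:hi] = 1.0
    # pad the spectrogram so every frame has a full 15-frame context around it
    padded = np.pad(logmel, ((0, 0), (half, half)), mode="edge")
    chunks = np.stack([padded[:, i:i + context] for i in range(n_frames)])
    return chunks, labels   # chunks: (n_frames, 80, 15), labels: (n_frames,)
```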
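A compact sketch of a training loop along the lines of `train.py`: binary cross-entropy on the frame-wise onset labels for 100 epochs. The class name `OnsetCNN`, the optimizer settings, batch size and checkpoint filename are assumptions.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset
from utils import OnsetCNN   # hypothetical class name; see utils.py for the real one

def train_fold(chunks, labels, epochs=100, fold=0):
    """chunks: (N, 1, 80, 15) float tensor; labels: (N,) float tensor in {0, 1}."""
    loader = DataLoader(TensorDataset(chunks, labels), batch_size=256, shuffle=True)
    model = OnsetCNN()
    optimizer = torch.optim.SGD(model.parameters(), lr=0.05, momentum=0.9)  # assumed settings
    criterion = nn.BCEWithLogitsLoss()   # assumes the model outputs raw logits
    for epoch in range(epochs):
        for x, y in loader:
            optimizer.zero_grad()
            loss = criterion(model(x).squeeze(-1), y)
            loss.backward()
            optimizer.step()
    torch.save(model.state_dict(), f"models/fold{fold}.pt")  # hypothetical filename
    return model
```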
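Finally, a sketch of the evaluation bookkeeping that `test.py` reports: detected onsets are matched to ground-truth onsets within a tolerance window and counted as true positives or false alarms. The 25 ms tolerance and the greedy matching are assumptions.

```python
import numpy as np

def count_matches(detected, ground_truth, tolerance=0.025):
    """detected, ground_truth: sorted onset times in seconds."""
    remaining = list(ground_truth)
    true_pos = 0
    for d in detected:
        hits = [i for i, g in enumerate(remaining) if abs(g - d) <= tolerance]
        if hits:                      # match each detection to at most one onset
            remaining.pop(hits[0])
            true_pos += 1
    false_alarms = len(detected) - true_pos
    return true_pos, false_alarms, len(ground_truth)
```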
If you wish to use the trained model on different data, `utils.py` contains the model class definition (and some other helper functions). Import the model class from there and load one of the saved model state dicts from the `models` folder.
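A minimal sketch of that workflow (the class name `OnsetCNN`, the checkpoint filename and the input shape are assumptions; check `utils.py` and the `models` folder for the actual names):

```python
import torch
from utils import OnsetCNN   # hypothetical class name; see utils.py

model = OnsetCNN()
state = torch.load("models/fold0.pt", map_location="cpu")  # hypothetical filename
model.load_state_dict(state)
model.eval()

with torch.no_grad():
    x = torch.randn(1, 1, 80, 15)            # one 80-band x 15-frame chunk (shape is an assumption)
    onset_probability = torch.sigmoid(model(x))  # assumes the network outputs a raw logit
```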