Automatic Music Transcription Evaluation

This repository contains evaluation tools and MIDI files for measuring F1 scores of our automatic music transcription model. The evaluation framework is implemented in Google Colab, providing a cloud-based environment for transparent assessment of model performance and detailed inspection of the evaluation process.

Features

Cloud-based evaluation environment
Pre-selected MIDI test files
F1 score calculation implementation
Interactive result inspection
Reproducible evaluation process

Notebooks

MusScribeF1Augmentation.ipynb — broader playground: standard F1 across HPPNet, Sony HFT, Basic Pitch, and Transkun checkpoints on Spretten and Godvaersdagen, plus a final section adding strict F1 and onset/offset/pitch MAE for the post-processing stages in postpros/.
paper_evaluation.ipynb — paper-focused: produces Table 1 of the ISMIR submission "Raw Note Transcription for Hardanger Fiddle via a Hybrid Neural/Rule-Based Approach". Loads postpros/, runs metrics for raw / +pitch / +offset stages, and writes table_results.csv and table_results.tex.
eval_utils.py — shared loader (.mid and post-processing CSVs), F1, and MAE helpers used by both notebooks.

Metrics and thresholds

Note-level metrics use mir_eval.transcription with the standard MIREX/MAESTRO tolerances: onset ±50 ms, offset max(50 ms, 20% duration), pitch 50 cents (raffel2014mireval, hawthorne2018onsets, hawthorne2019maestro, bay2009mirex). The strict F1 is the same metric with offset tolerance reduced to max(25 ms, 5% duration); it is a sensitivity variant of the standard metric, configured through the same mir_eval API. See references.bib for the BibTeX entries.

Getting Started

Prerequisites

Google account
Web browser
Internet connection
Human being

Running the Evaluation

Open the Google Colab notebook
Click "Runtime" in the top menu
Select "Restart and run all"
Follow the cell-by-cell execution to view results

Contributing

We welcome contributions to improve the evaluation framework. Please follow these steps:

Fork the repository
Create a feature branch (git checkout -b feature/improvement)
Commit your changes (git commit -am 'Add some improvement')
Push to the branch (git push origin feature/improvement)
Open a Pull Request

For major changes, please open an issue first to discuss your proposed changes.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
midi-files		midi-files
model3_postpros		model3_postpros
per_note		per_note
per_note_test1		per_note_test1
postpros		postpros
recheckingcorrectionsfiddletranscriptionsduringthisw		recheckingcorrectionsfiddletranscriptionsduringthisw
.gitignore		.gitignore
MusScribeF1Augmentation.ipynb		MusScribeF1Augmentation.ipynb
eval_utils.py		eval_utils.py
evaluate_all.py		evaluate_all.py
graph123.png		graph123.png
paper_evaluation.ipynb		paper_evaluation.ipynb
readme.md		readme.md
references.bib		references.bib
table_results.csv		table_results.csv
table_results.tex		table_results.tex
table_results_diagnostics.csv		table_results_diagnostics.csv
test_split_results.csv		test_split_results.csv
test_split_results.tex		test_split_results.tex
test_split_results_diagnostics.csv		test_split_results_diagnostics.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automatic Music Transcription Evaluation

Features

Notebooks

Metrics and thresholds

Getting Started

Prerequisites

Running the Evaluation

Contributing

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Automatic Music Transcription Evaluation

Features

Notebooks

Metrics and thresholds

Getting Started

Prerequisites

Running the Evaluation

Contributing

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages