Binaural-Source-Localization-CNN

Basic Information

Author: Gregory Hunkins

Organization: University of Rochester

License: MIT

Abstract: A Convolutional Neural Network (CNN) classification system was designed to localize human voices in 3-D space. A new dataset, VoiceBin100K, is introduced for this task and for future work in the field. The CNN takes variable-length binaural short-time Fourier transform (STFT) magnitude and phase features as input and predicts the location of the speaker’s voice as one of 168 location classes.
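
To illustrate the kind of input the abstract describes, here is a minimal sketch of how binaural STFT magnitude and phase features could be computed with NumPy. It is not the repository's actual preprocessing: the function names `stft` and `binaural_features`, the 16 kHz sample rate, the 512-point FFT, the 256-sample hop, and the channel ordering are all assumptions made for this example.

```python
import numpy as np

def stft(signal, n_fft=512, hop=256):
    """Hann-windowed STFT of a 1-D signal.

    Returns a complex array of shape (n_fft // 2 + 1, n_frames).
    n_fft and hop are illustrative defaults, not values from the project.
    """
    window = np.hanning(n_fft)
    # Zero-pad the tail so the last partial frame is kept.
    n_frames = 1 + max(0, int(np.ceil((len(signal) - n_fft) / hop)))
    padded = np.zeros(n_fft + (n_frames - 1) * hop)
    padded[:len(signal)] = signal
    frames = np.stack(
        [padded[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    return np.fft.rfft(frames, axis=1).T

def binaural_features(left, right, n_fft=512, hop=256):
    """Stack magnitude and phase for both ears.

    Output shape is (4, freq_bins, n_frames), ordered as
    [left magnitude, right magnitude, left phase, right phase]
    (an assumed ordering). n_frames grows with clip length, which is
    what makes the features variable-length.
    """
    specs = [stft(left, n_fft, hop), stft(right, n_fft, hop)]
    mags = [np.abs(s) for s in specs]
    phases = [np.angle(s) for s in specs]
    return np.stack(mags + phases, axis=0)

# Usage: one second of synthetic two-channel audio at an assumed 16 kHz.
rng = np.random.default_rng(0)
left = rng.standard_normal(16000)
right = rng.standard_normal(16000)
features = binaural_features(left, right)
print(features.shape)  # (4, 257, 62)
```

A classifier over the 168 location classes would consume arrays like `features`; because the time axis depends on the clip duration, a model accepting them would need to handle variable-length input, for example through pooling over time.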

Running The Code

Reference: https://cs.rochester.edu/~cxu22/t/577F17/bluehive_tutorial.html

Data

Please contact ghunkins@u.rochester.edu for access to the data. A public link will be available shortly.

