Commit 34d1556

[Issue #25] Add links to Turn-to-Diarize paper
1 parent 3b5057f commit 34d1556

3 files changed, +36 -20 lines changed

README.md

+28 -16
@@ -1,24 +1,26 @@
 # Spectral Clustering
 [![Python application](https://github.com/wq2012/SpectralCluster/workflows/Python%20application/badge.svg)](https://github.com/wq2012/SpectralCluster/actions) [![PyPI Version](https://img.shields.io/pypi/v/spectralcluster.svg)](https://pypi.python.org/pypi/spectralcluster) [![Python Versions](https://img.shields.io/pypi/pyversions/spectralcluster.svg)](https://pypi.org/project/spectralcluster) [![Downloads](https://pepy.tech/badge/spectralcluster)](https://pepy.tech/project/spectralcluster) [![codecov](https://codecov.io/gh/wq2012/SpectralCluster/branch/master/graph/badge.svg)](https://codecov.io/gh/wq2012/SpectralCluster) [![Documentation](https://img.shields.io/badge/api-documentation-blue.svg)](https://wq2012.github.io/SpectralCluster)

-## Note
+## Overview
+
+This is a Python re-implementation of the spectral clustering and
+constrained spectral clustering algorithms in these two papers:

-We are currently adding new functionalities to this library to include
-some algorithms to appear in an upcoming paper. We are updating the APIs as
-well.
+* [Speaker Diarization with LSTM](https://google.github.io/speaker-id/publications/LstmDiarization/)
+* [Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection](https://arxiv.org/abs/2109.11641)
+
+![refinement](https://raw.githubusercontent.com/wq2012/SpectralCluster/master/resources/refinement.png)
+
+## Notice
+
+We recently added new functionalities to this library to include
+algorithms in a [new paper](https://arxiv.org/abs/2109.11641). We updated the APIs as well.

 If you depend on our old API, please use an **older version** of this library:
 ```
 pip3 install spectralcluster==0.1.0
 ```

-## Overview
-
-This is a Python re-implementation of the spectral clustering algorithm in the
-paper [Speaker Diarization with LSTM](https://google.github.io/speaker-id/publications/LstmDiarization/).
-
-![refinement](https://raw.githubusercontent.com/wq2012/SpectralCluster/master/resources/refinement.png)
-
 ## Disclaimer

 **This is not a Google product.**
@@ -49,7 +51,8 @@ python3 -m pip install spectralcluster

 Simply use the `predict()` method of class `SpectralClusterer` to perform
 spectral clustering. The example below should be closest to the original C++
-implemention used my our [ICASSP 2018 paper](https://google.github.io/speaker-id/publications/LstmDiarization/).
+implemention used our
+[ICASSP 2018 paper](https://google.github.io/speaker-id/publications/LstmDiarization/).

 ```python
 from spectralcluster import configs
@@ -163,7 +166,9 @@ For the complete list of parameters of `AutoTune`, see

 ### Constrained spectral clustering

-We also implemented 2 constrained spectral clustering methods:
+In the [Turn-to-Diarize paper](https://arxiv.org/abs/2109.11641),
+the spectral clustering is constrained by speaker turns.
+We implemented two constrained spectral clustering methods:

 * Affinity integration.
 * Constraint propagation (see paper [[1](https://link.springer.com/chapter/10.1007/978-3-642-15567-3_1)] and [[2](https://arxiv.org/abs/1109.4684)]).
@@ -203,22 +208,29 @@ constraint_matrix = constraint.ConstraintMatrix(

 ## Citations

-Our paper is cited as:
+Our papers are cited as:

 ```
 @inproceedings{wang2018speaker,
-title={Speaker diarization with lstm},
+title={{Speaker Diarization with LSTM}},
 author={Wang, Quan and Downey, Carlton and Wan, Li and Mansfield, Philip Andrew and Moreno, Ignacio Lopz},
 booktitle={2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
 pages={5239--5243},
 year={2018},
 organization={IEEE}
 }
+
+@article{xia2021turn,
+title={{Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection}},
+author={Wei Xia and Han Lu and Quan Wang and Anshuman Tripathi and Ignacio Lopez Moreno and Hasim Sak},
+journal={arXiv preprint arXiv:2109.11641},
+year={2021}
+}
 ```

 ## Misc

-Our new speaker diarization systems are now fully supervised, powered by
+We also have fully supervised speaker diarization systems, powered by
 [uis-rnn](https://github.com/google/uis-rnn).
 Check this [Google AI Blog](https://ai.googleblog.com/2018/11/accurate-online-speaker-diarization.html).

docs/configs.html

+4 -2
@@ -71,8 +71,10 @@ <h1 class="title">Module <code>spectralcluster.configs</code></h1>
 refinement_options=icassp2018_refinement_options,
 custom_dist=&#34;cosine&#34;)

-# Configurations of Turn-To-Diarize system using the
-# Turn + Constraint Propagation + AutoTune method
+# Configurations of Turn-to-Diarize system using the
+# Turn + Constraint Propagation + AutoTune method described in the paper
+# &#34;Turn-to-Diarize: Online Speaker Diarization Constrained by
+# Transformer Transducer Speaker Turn Detection&#34;.
 TURNTODIARIZE_REFINEMENT_SEQUENCE = [
 RefinementName.RowWiseThreshold, RefinementName.Symmetrize
 ]

spectralcluster/configs.py

+4 -2
@@ -42,8 +42,10 @@
 refinement_options=icassp2018_refinement_options,
 custom_dist="cosine")

-# Configurations of Turn-To-Diarize system using the
-# Turn + Constraint Propagation + AutoTune method
+# Configurations of Turn-to-Diarize system using the
+# Turn + Constraint Propagation + AutoTune method described in the paper
+# "Turn-to-Diarize: Online Speaker Diarization Constrained by
+# Transformer Transducer Speaker Turn Detection".
 TURNTODIARIZE_REFINEMENT_SEQUENCE = [
 RefinementName.RowWiseThreshold, RefinementName.Symmetrize
 ]
