|
1 | 1 | # Spectral Clustering
|
2 | 2 | [](https://github.com/wq2012/SpectralCluster/actions) [](https://pypi.python.org/pypi/spectralcluster) [](https://pypi.org/project/spectralcluster) [](https://pepy.tech/project/spectralcluster) [](https://codecov.io/gh/wq2012/SpectralCluster) [](https://wq2012.github.io/SpectralCluster)
|
3 | 3 |
|
4 |
| -## Note |
| 4 | +## Overview |
| 5 | + |
| 6 | +This is a Python re-implementation of the spectral clustering and |
| 7 | +constrained spectral clustering algorithms in these two papers: |
5 | 8 |
|
6 |
| -We are currently adding new functionalities to this library to include |
7 |
| -some algorithms to appear in an upcoming paper. We are updating the APIs as |
8 |
| -well. |
| 9 | +* [Speaker Diarization with LSTM](https://google.github.io/speaker-id/publications/LstmDiarization/) |
| 10 | +* [Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection](https://arxiv.org/abs/2109.11641) |
| 11 | + |
| 12 | + |
| 13 | + |
| 14 | +## Notice |
| 15 | + |
| 16 | +We recently added new functionalities to this library to include |
| 17 | + algorithms in a [new paper](https://arxiv.org/abs/2109.11641). We updated the APIs as well. |
9 | 18 |
|
10 | 19 | If you depend on our old API, please use an **older version** of this library:
|
11 | 20 | ```
|
12 | 21 | pip3 install spectralcluster==0.1.0
|
13 | 22 | ```
|
14 | 23 |
|
15 |
| -## Overview |
16 |
| - |
17 |
| -This is a Python re-implementation of the spectral clustering algorithm in the |
18 |
| -paper [Speaker Diarization with LSTM](https://google.github.io/speaker-id/publications/LstmDiarization/). |
19 |
| - |
20 |
| - |
21 |
| - |
22 | 24 | ## Disclaimer
|
23 | 25 |
|
24 | 26 | **This is not a Google product.**
|
@@ -49,7 +51,8 @@ python3 -m pip install spectralcluster
|
49 | 51 |
|
50 | 52 | Simply use the `predict()` method of class `SpectralClusterer` to perform
|
51 | 53 | spectral clustering. The example below should be closest to the original C++
|
52 |
| -implemention used my our [ICASSP 2018 paper](https://google.github.io/speaker-id/publications/LstmDiarization/). |
| 54 | +implemention used our |
| 55 | +[ICASSP 2018 paper](https://google.github.io/speaker-id/publications/LstmDiarization/). |
53 | 56 |
|
54 | 57 | ```python
|
55 | 58 | from spectralcluster import configs
|
@@ -163,7 +166,9 @@ For the complete list of parameters of `AutoTune`, see
|
163 | 166 |
|
164 | 167 | ### Constrained spectral clustering
|
165 | 168 |
|
166 |
| -We also implemented 2 constrained spectral clustering methods: |
| 169 | +In the [Turn-to-Diarize paper](https://arxiv.org/abs/2109.11641), |
| 170 | +the spectral clustering is constrained by speaker turns. |
| 171 | +We implemented two constrained spectral clustering methods: |
167 | 172 |
|
168 | 173 | * Affinity integration.
|
169 | 174 | * Constraint propagation (see paper [[1](https://link.springer.com/chapter/10.1007/978-3-642-15567-3_1)] and [[2](https://arxiv.org/abs/1109.4684)]).
|
@@ -203,22 +208,29 @@ constraint_matrix = constraint.ConstraintMatrix(
|
203 | 208 |
|
204 | 209 | ## Citations
|
205 | 210 |
|
206 |
| -Our paper is cited as: |
| 211 | +Our papers are cited as: |
207 | 212 |
|
208 | 213 | ```
|
209 | 214 | @inproceedings{wang2018speaker,
|
210 |
| - title={Speaker diarization with lstm}, |
| 215 | + title={{Speaker Diarization with LSTM}}, |
211 | 216 | author={Wang, Quan and Downey, Carlton and Wan, Li and Mansfield, Philip Andrew and Moreno, Ignacio Lopz},
|
212 | 217 | booktitle={2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
|
213 | 218 | pages={5239--5243},
|
214 | 219 | year={2018},
|
215 | 220 | organization={IEEE}
|
216 | 221 | }
|
| 222 | +
|
| 223 | +@article{xia2021turn, |
| 224 | + title={{Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection}}, |
| 225 | + author={Wei Xia and Han Lu and Quan Wang and Anshuman Tripathi and Ignacio Lopez Moreno and Hasim Sak}, |
| 226 | + journal={arXiv preprint arXiv:2109.11641}, |
| 227 | + year={2021} |
| 228 | +} |
217 | 229 | ```
|
218 | 230 |
|
219 | 231 | ## Misc
|
220 | 232 |
|
221 |
| -Our new speaker diarization systems are now fully supervised, powered by |
| 233 | +We also have fully supervised speaker diarization systems, powered by |
222 | 234 | [uis-rnn](https://github.com/google/uis-rnn).
|
223 | 235 | Check this [Google AI Blog](https://ai.googleblog.com/2018/11/accurate-online-speaker-diarization.html).
|
224 | 236 |
|
|
0 commit comments