From 90725dbf5bc3809e0364d20d0837c58968ceb2b1 Mon Sep 17 00:00:00 2001 From: Ricardo Rei Date: Sun, 3 Oct 2021 18:23:59 +0100 Subject: [PATCH] Update README.md --- README.md | 8 ++++++++ 1 file changed, 8 insertions(+) diff --git a/README.md b/README.md index ef05496..74a60db 100644 --- a/README.md +++ b/README.md @@ -5,6 +5,14 @@ In this repository, we explore different strategies for automatic transcription > [Download IWSLT corpus](https://unbabel-experimental-data-sets.s3-eu-west-1.amazonaws.com/video-pt2020/IWSLT-punkt.tar.gz) +# Publications: +- [Multilingual Simultaneous Sentence End and Punctuation Prediction](http://ceur-ws.org/Vol-2957/sepp_paper3.pdf) +- [Towards better subtitles: A multilingual approach for punctuation restoration of speech transcripts](https://www.sciencedirect.com/science/article/abs/pii/S0957417421011180) +- [Automatic truecasing of video subtitles using BERT: a multilingual adaptable approach](https://link.springer.com/chapter/10.1007/978-3-030-50146-4_52) + +## Sentence end and punctuation prediction shared task +To replicate our winning submission to SEPP 2021 please go to the `shared-task` branch. + ## Model architecture: ![base_model](images/base_model.png)