TDAC is the first time-domain astrophysics corpus based on astronomical observation reports (GCN Circulars, ATel Telegrams and AstroNotes).
@inproceedings{alkan-etal-2022-tdac,
title = "{TDAC}, The First Corpus in Time-Domain Astrophysics: Analysis and First Experiments on Named Entity Recognition",
author = "Alkan, Atilla Kaan and
Grouin, Cyril and
Schussler, Fabian and
Zweigenbaum, Pierre",
editor = "Ghosal, Tirthankar and
Blanco-Cuaresma, Sergi and
Accomazzi, Alberto and
Patton, Robert M. and
Grezes, Felix and
Allen, Thomas",
booktitle = "Proceedings of the first Workshop on Information Extraction from Scientific Publications",
month = nov,
year = "2022",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2022.wiesp-1.15",
pages = "131--139",
abstract = "The increased interest in time-domain astronomy over the last decades has resulted in a substantial increase in observation reports publication leading to a saturation of how astrophysicists read, analyze and classify information. Due to the short life span of the detected astronomical events, the information related to the characterization of new phenomena has to be communicated and analyzed very rapidly to allow other observatories to react and conduct their follow-up observations. This paper introduces TDAC: the first Corpus in Time-Domain Astrophysics, based on observation reports. We also present the NLP experiments we made for named entity recognition based on annotations we made and annotations from the WIESP NLP Challenge.",
}