_pages/publications.md
14 additions & 28 deletions
@@ -7,76 +7,62 @@ author_profile: true

 ## Preprints

-[**Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation**](https://arxiv.org/abs/2502.05151)
-
+[**Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation**](https://arxiv.org/abs/2502.05151)\
 Steffen Eger, Yong Cao, Jennifer D'Souza, Andreas Geiger, Christian Greisinger, Stephanie Gross, Yufang Hou, Brigitte Krenn, Anne Lauscher, Yizhi Li, Chenghua Lin, Nafise Sadat Moosavi, Wei Zhao, Tristan Miller.\
 In ArXiv, 2025

-[**FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models**](https://arxiv.org/abs/2502.18573)
-
+[**FactReasoner: A Probabilistic Approach to Long-Form Factuality Assessment for Large Language Models**](https://arxiv.org/abs/2502.18573)\
 Radu Marinescu, Debarun Bhattacharjya, Junkyu Lee, Tigran Tchrakian, Javier Carnerero Cano, Yufang Hou, Elizabeth Daly, Alessandra Pascale.\
 In ArXiv, 2025

-[**The Nature of NLP: Analyzing Contributions in NLP Papers**](https://arxiv.org/abs/2409.19505)
-
+[**The Nature of NLP: Analyzing Contributions in NLP Papers**](https://arxiv.org/abs/2409.19505)\
 Aniket Pramanick, Yufang Hou, Saif M. Mohammad, Iryna Gurevych.\
 In ArXiv, 2024

 ## 2025

-[**Grounding Fallacies Misrepresenting Scientific Publications in Evidence**](https://www.arxiv.org/abs/2408.12812)
-
+[**Grounding Fallacies Misrepresenting Scientific Publications in Evidence**](https://www.arxiv.org/abs/2408.12812)\
 Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych.\
 In NAACL, 2025

 ## 2024

-[**WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia**](https://openreview.net/pdf/8039d279e177deb3cae86e8385213135651a047c.pdf)
-
+[**WikiContradict: A Benchmark for Evaluating LLMs on Real-World Knowledge Conflicts from Wikipedia**](https://openreview.net/pdf/8039d279e177deb3cae86e8385213135651a047c.pdf)\
 Yufang Hou, Alessandra Pascale, Javier Carnerero-Cano, Tigran Tchrakian, Radu Marinescu, Elizabeth Daly, Inkit Padhi, Prasanna Sattigeri.\
 In NeurIPS D&B Track, 2024

-[**Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards**](https://aclanthology.org/2024.emnlp-main.453.pdf)
-
+[**Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards**](https://aclanthology.org/2024.emnlp-main.453.pdf)\
-[**SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement**](https://aclanthology.org/2024.findings-emnlp.780.pdf)
-
+[**SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement**](https://aclanthology.org/2024.findings-emnlp.780.pdf)\
 Ishani Mondal, Zongxia Li, Yufang Hou, Anandhavelu Natarajan, Aparna Garimella, Jordan Lee Boyd-Graber.\
 In EMNLP Findings, 2024

-[**MISSCI: Reconstructing Fallacies in Misrepresented Science**](https://aclanthology.org/2024.acl-long.240.pdf)
-
+[**MISSCI: Reconstructing Fallacies in Misrepresented Science**](https://aclanthology.org/2024.acl-long.240.pdf)\
 Max Glockner, Yufang Hou, Preslav Nakov, Iryna Gurevych.\
 In ACL, 2024

-[**How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study**](https://aclanthology.org/2024.acl-long.795.pdf)
-
+[**How to Handle Different Types of Out-of-Distribution Scenarios in Computational Argumentation? A Comprehensive and Fine-Grained Field Study**](https://aclanthology.org/2024.acl-long.795.pdf)\
 Andreas Waldis, Yufang Hou, Iryna Gurevych.\
 In ACL, 2024

-[**A Course Shared Task on Evaluating LLM Output for Clinical Questions**](https://aclanthology.org/2024.teachingnlp-1.11.pdf)
-
+[**A Course Shared Task on Evaluating LLM Output for Clinical Questions**](https://aclanthology.org/2024.teachingnlp-1.11.pdf)\
 Yufang Hou, Thy Thy Tran, Doan Nam Long Vu, Yiwen Cao, Kai Li, Lukas Rohde, Iryna Gurevych.\
 In Proceedings of the 6th Workshop on Teaching NLP at ACL 2024

-[**Beyond Abstracts: A New Dataset, Prompt Design Strategy and Method for Biomedical Synthesis Generation**](https://aclanthology.org/2024.acl-srw.42.pdf)
-
+[**Beyond Abstracts: A New Dataset, Prompt Design Strategy and Method for Biomedical Synthesis Generation**](https://aclanthology.org/2024.acl-srw.42.pdf)\
 James O'Doherty, Cian Nolan, Yufang Hou, Anya Belz.\
 In ACL Student Research Workshop, 2024

-[**On the Role of Summary Content Units in Text Summarization Evaluation**](https://aclanthology.org/2024.naacl-short.25.pdf)
-
+[**On the Role of Summary Content Units in Text Summarization Evaluation**](https://aclanthology.org/2024.naacl-short.25.pdf)\
 Marcel Nawrath, Agnieszka Wiktoria Nowak, Tristan Ratz, Danilo Constantin Walenta, Juri Opitz, Leonardo F. R. Ribeiro, João Sedoc, Daniel Deutsch, Simon Mille, Yixin Liu, Sebastian Gehrmann, Lining Zhang, Saad Mahamood, Miruna Clinciu, Khyathi Chandu, Yufang Hou.\
 In NAACL, 2024

-[**Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization**](https://aclanthology.org/2024.findings-eacl.146.pdf)
-
+[**Dive into the Chasm: Probing the Gap between In- and Cross-Topic Generalization**](https://aclanthology.org/2024.findings-eacl.146.pdf)\
 Andreas Waldis, Yufang Hou, Iryna Gurevych.\
 In EACL Findings, 2024

-[**Holmes: Benchmark the Linguistic Competence of Language Models**](https://aclanthology.org/2024.tacl-1.88.pdf)
-
+[**Holmes: Benchmark the Linguistic Competence of Language Models**](https://aclanthology.org/2024.tacl-1.88.pdf)\
 Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou, Iryna Gurevych.\