Skip to content

Commit 390e62b

Browse files
committed
Add Jin Wang's talk details
1 parent a058add commit 390e62b

1 file changed

Lines changed: 25 additions & 1 deletion

File tree

nwds/nwds.markdown

Lines changed: 25 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ Faisal Nawab is an assistant professor in the computer science department at the
4949
---
5050

5151
<p><a name="Pat_Helland_2024_01_26"></a>
52-
<strong>Speaker</strong>: <a href="">Pat Helland</a> </p>
52+
<strong>Speaker</strong>: <a href="pathelland.substack.com">Pat Helland</a> </p>
5353

5454
<p><strong>Where</strong>: University of Washington, Seattle.<br>
5555
Allen School of Computer Science and Engineering.<br>
@@ -72,6 +72,30 @@ Pat Helland has been building distributed systems, database systems, high-perfor
7272

7373
---
7474

75+
<p><a name="Jin_Wang_2024_01_19"></a>
76+
<strong>Speaker</strong>: <a href="">Jin Wang</a> </p>
77+
78+
<p><strong>Where</strong>: University of Washington, Seattle.<br>
79+
Allen School of Computer Science and Engineering.<br>
80+
Paul G. Allen Center, CSE 291</p>
81+
82+
<p><strong>When</strong>:
83+
Friday, January 19th, 2024, 2:30pm-3:30pm</p>
84+
85+
<p><strong>Title</strong>:
86+
Towards End-to-end Data Pipeline for Effective Data Science
87+
</p>
88+
89+
<p><strong>Abstract</strong>:
90+
Nowadays data-driven approaches have become a mainstream research methodology in multiple communities. To support effective and scalable data science applications on the ever growing datasets, researchers from both academic and industrial fields have made great efforts in building end-to-end data pipelines. In this talk, I will present my efforts in improving two essential components of an end-to-end data pipeline: data preparation and data processing. First, I will present a unified self-supervised learning paradigm that can improve the performance of a variety of data preparation tasks, such as dataset discovery, table annotation and entity matching. Next, I will introduce my work in optimizing parallel recursive queries to support analytical workloads in data processing. Finally, I will conclude with the vision for future work of data pipelines.
91+
</p>
92+
93+
<p><strong>Bio</strong>:
94+
Jin Wang is a research scientist and research lead from Megagon Labs. Before that he obtained his PhD degree of Computer Science from University of California, Los Angeles in July 2020. His research interests lie in the board area of data management and data science. In particular, his research focuses on Database systems, Datalog, Data Integration and Table Representation Learning. His work appears in leading conferences and journals of data management such as SIGMOD, VLDB, ICDE and VLDB Journal.
95+
</p>
96+
97+
---
98+
7599
#### Winter 2023
76100

77101
---

0 commit comments

Comments
 (0)