This PhD course delivered at the University Jaume I of Castellon in June 2024. The course is organised in 4 sessions of 3 hours each combining theory and practice. This course is a tailored version of City's Semantic Web Technologies and Knowledge Graphs module.
The example codes will be in Python (tested with Python version >=3.10). I strongly recommend using Python environments to use the right library versions. Follow the instructions in the lab session 0 and run the scripts in the test folder to check you have the required infrastructure and Python libraries.
To avoid clashes with the changes on the main branch it is suggested to add generated codes and data in the folder student-codes-data.
- Lecture: Introduction to Knowledge Graphs (Slides)
- Laboratory: Creating a small KG and ontology (Lab notes) (support slides)
- Support codes.
- Lecture: Reasoning and Querying with Knowledge Graphs (Slides)
- Laboratory: First steps with the SPARQL query language (Lab notes) (support slides)
- Support codes.
- Small tutorial on GraphDB we follow with my students at City.
- Lecture: Matching: KG-to-KG and CSV-to-KG (Slides)
- Laboratory: Creation of a (simple) matching system (Lab notes)
- Support codes.
- Evaluation campaigns: Ontology Alignment Evaluation Initiative (OAEI) and Semantic Web Challenge on Tabular Data to Knowledge Graph Matching (SemTab).
- Tough Tables (2T) Dataset: https://doi.org/10.5281/zenodo.7419275
- Lecture: Knowledge Graphs and Language Models (Slides)
- Laboratory: Ontology Embeddings with OWL2Vec* (Lab notes)
- Support codes.
- OWL2Vec* paper: Springer link.
Students need to work on a small project (max 2 students per group). There are two options (see Lab 3 notes):
- Create a (simple) system that performs KG to KG alignment. Try to implement some sophisticated ideas as described in the lecture. Selected systems may try to participate in the OAEI campaign.
- Create a (simple) system that performs CSV to KG matching. Try to implement disambuguation techniques as shown in the lecture. Selected systems may try to participate in the SemTab challenge.
Submission:
- When:
June 30, 23:59 CEST (strict deadline)
- What: a link to the GitHub repository where the system codes are. Please document the repository so that the codes are easy to run
- How: via this Google form.