Codebase for TSAR 2025 research paper Template-Based Text-to-Image Alignment for Language Accessibility: A Study on Visualizing Text Simplifications, conducted at the University of Zurich (UZH). The project develops a framework for generating accessible images from simplified text using structured prompt templates, specifically designed to help users with cognitive disabilities and learning difficulties through cognitively accessible AI-generated visual content.
-
Master’s Thesis: Template-Based Text-to-Image Alignment for Language Accessibility: A Study on Visualizing Simplified Text
(Link will be added once publicly available via UZH ZORA repository) -
TSAR 2025 Paper: Template-Based Text-to-Image Alignment for Language Accessibility: A Study on Visualizing Text Simplifications
(To appear in the ACL Anthology after the TSAR 2025 Workshop on Text Simplification, Accessibility, and Readability)
Belkiss_ThesisFiles/
├── Code/
│ ├── annotation_analysis/ # Analysis of annotation data and inter-annotator agreement
│ │ ├── 100 score analysis/
│ │ ├── IAA/ # Inter-Annotator Agreement calculations
│ │ ├── Style Recognition/
│ │ ├── best dataset and style/
│ │ ├── correlation study/
│ │ └── qualitative study/
│ ├── intermediate_exploration/ # Exploratory analysis and experiments
│ ├── phase1_template_selection/ # Phase 1: Template selection methodology
│ ├── phase2_style_evaluation/ # Phase 2: Style evaluation framework
│ └── phase3_label_studio/ # Phase 3: Label Studio annotation setup
└── required files/ # Essential datasets and configuration files
├── complete_dataset_400.json
├── expert_distribution_mapping.csv
├── refined_prompts.json
└── renamed_images_mapping.csv
Due to file size limitations, the annotation data and generated images are hosted on SwissUbase:
- Annotation Data: Contains individual annotator files.
- Generated Images: Complete dataset of the generated images used in the study.
- Access: SwissUbase Dataset Link
- Click the dataset link above.
- In the left sidebar, select Files.
- Click on any of the listed file collections (e.g., Annotation Files, Simplified Sentences and Prompts, or Generated Images).
- Open the menu icon (☰) on the top left.
- Select File → then click Download to save the file locally.
We compiled a text-to-image text simplification (TS) corpus by sampling from four established datasets:
- OneStopEnglish (Vajjala & Lučić, 2018) — A corpus of news articles rewritten at different reading levels for automatic readability assessment and text simplification.
- SimPA (Scarton et al., 2018) — A sentence-level simplification corpus for the public administration domain.
- Wikipedia (Sun et al., 2020) — A dataset exploring the effect of document context on sentence simplification (used without context).
- ASSET (Alva-Manchego et al., 2020) — A dataset for tuning and evaluating sentence simplification models with multiple rewriting transformations.
Together, these resources form a balanced corpus of 400 simplified sentence pairs covering diverse text domains including news, encyclopedic, and administrative content.
Five structured prompt templates were developed, each following different layout logic based on cognitive accessibility principles:
- Basic Object Focus: Simple, object-centered visual representations
- Contextual Scene: Scene-based imagery with environmental context
- Educational Layout: Structured layouts optimized for learning
- Multi-Level Detail: Hierarchical visual information presentation
- Grid Layout: Organized grid-based visual arrangements
- Automatic metrics: CLIP similarity scores for semantic alignment
- Expert annotations: Multi-dimensional human evaluation across accessibility criteria
- Style analysis: Assessment of visual styles impact on accessibility and clarity
- Objective: Develop and select optimal prompt templates for accessibility-focused image generation
- Key Files:
phase1_template_selection/prompt_generator_p1.ipynbphase1_template_selection/image_generator_p1.ipynbphase1_template_selection/clip_evaluator_p1.ipynb
- Objective: Evaluate different visual styles across the five prompt templates for accessibility impact
- Key Files:
phase2_style_evaluation/code/image_generator_p2.ipynbphase2_style_evaluation/code/clip_evaluator_p2.ipynbphase2_style_evaluation/code/p2_prompt_generator.ipynb
- Objective: Conduct expert evaluation of generated images across multiple accessibility dimensions
- Key Files:
phase3_label_studio/image_processing.ipynbannotation_analysis/(complete analysis suite)
@mastersthesis{souayed2025thesis,
title={Template-Based Text-to-Image Alignment for Language Accessibility: A Study on Visualizing Simplified Text},
author={Souayed, Belkiss},
year={2025},
school={University of Zurich},
type={Master's thesis}
}@inproceedings{souayed2025tsar,
title={Template-Based Text-to-Image Alignment for Language Accessibility: A
Study on Visualizing Text Simplifications},
author={Souayed, Belkiss and Ebling, Sarah and Gao, Yingqiang},
booktitle={},
year={2025},
organization={},
address={},
note={To appear},
url={https://aclanthology.org/2025.tsar-1.XX} % to update once published
}This dataset was developed as part of a research study exploring the alignment between simplified text and accessible image generation.
The simplified sentences were sampled from four publicly available text simplification datasets:
ASSET, OneStopEnglish, SimPA, and Wikipedia.
Only sentence sampling and minimal preprocessing (removal of artifacts, special characters, and formatting inconsistencies) were performed.
No semantic modifications were made to the original texts.
The dataset also includes automatically generated text-to-image prompts for each sentence.
No original datasets are redistributed in full.
| Dataset | License | Link |
|---|---|---|
| ASSET | Creative Commons Attribution–NonCommercial 4.0 International (CC BY-NC 4.0) | https://github.com/facebookresearch/asset/blob/main/LICENSE |
| OneStopEnglish | Creative Commons Attribution–ShareAlike 4.0 International (CC BY-SA 4.0) | https://creativecommons.org/licenses/ |
| SimPA | Freely available under a Creative Commons Licence (SimPA: A Simplification Corpus for Arabic, English and Spanish, LREC 2018) | https://aclanthology.org/L18-1685/ |
| Wikipedia | Creative Commons Attribution 4.0 International (CC BY 4.0) | https://creativecommons.org/licenses/by/4.0/ |
All listed licenses permit academic reuse with proper attribution.
Belkiss Souayed
Email: [email protected]
Institution: University of Zurich (UZH)
Supervisor: Dr. Yingqiang Gao — [email protected]
Examiner: Prof. Dr. Sarah Ebling — [email protected]
For questions about the data, methodology, or collaboration opportunities, please contact the author.