mothra-text

Experiments in automatic text/music line segmentation on medieval manuscript folios.

Current experiment: line segmentation model comparison

Three models compared head-to-head on a set of medieval manuscript folio images:

Model	Tool	HuggingFace ID
YOLOv9 lines	htrflow	`Riksarkivet/yolov9-lines-1`
RTMDet lines	htrflow	`Riksarkivet/rtmdet_lines`
BLLA baseline segmenter	Kraken	(built-in default model)

Repo layout

mothra-text/
├── data/
│   └── folios/                  # original manuscript folio images
├── experiments/
│   └── pylaia_baseline/         # zero-shot HTR baselines (multiple models)
│       ├── 01_segment.py        # shared: Kraken BLLA → line coord JSON + visualisation
│       ├── 02_extract_crops.py  # shared: JSON → 128px-high grayscale line crop PNGs
│       ├── folios.txt           # shared: list of folios used in all sub-experiments
│       ├── README.md            # index of all sub-experiments
│       ├── pylaia_home_alcar/   # Teklia/pylaia-home-alcar (Latin medieval)
│       │   ├── 03_run_pylaia.py
│       │   ├── run_experiment.py
│       │   └── README.md
│       └── pylaia_himanis/      # Teklia/pylaia-himanis (French medieval)
│           ├── 03_run_pylaia.py
│           ├── run_experiment.py
│           └── README.md
├── outputs/
│   ├── htrflow_yolo/            # annotated images — YOLO line polygons (green)
│   ├── htrflow_rtmdet/          # annotated images — RTMDet masks (blue-orange)
│   ├── kraken_blla/             # annotated images — baselines (orange) + polygons (purple)
│   └── pylaia_baseline/
│       ├── segmentation/        # Kraken BLLA line coord JSON per folio (shared)
│       ├── crops/               # 128px-high grayscale line crop PNGs per folio (shared)
│       ├── pylaia_home_alcar/   # home-alcar transcriptions + results.csv
│       └── pylaia_himanis/      # himanis transcriptions + results.csv
├── pipelines/
│   ├── yolo_pipeline.yaml
│   └── rtmdet_pipeline.yaml
├── run_all.py           # runs all three segmentation models
├── run_htrflow.py       # runs YOLO and/or RTMDet via htrflow Python API
└── run_kraken.py        # runs Kraken BLLA segmenter

Environment setup

conda create -n line-seg-eval python=3.10 -y
conda activate line-seg-eval

pip install htrflow kraken

# OpenMMLab stack for htrflow's RTMDet adapter
pip install yapf==0.40.1 mmengine --no-build-isolation
pip install mmcv==2.0.1 --no-build-isolation   # builds from source
pip install mmdet==3.1.0 mmocr==1.0.1

Apple Silicon note: mmcv 2.0.1 compiled against torch 2.10.0 references at::mps::MPSStream::commit(bool), a symbol removed from torch's MPS backend after 2.0. run_htrflow.py works around this at runtime by preloading a stub dylib (/tmp/libmps_stub.dylib) before importing mmcv. Build the stub once:
cat > /tmp/mps_stub.cpp << 'EOF'
namespace at { namespace mps {
class MPSStream { public: void commit(bool); };
void MPSStream::commit(bool) {}
}}
EOF
clang++ -dynamiclib -std=c++17 -o /tmp/libmps_stub.dylib /tmp/mps_stub.cpp

Running

# Run all three models (images from data/folios/, outputs to outputs/)
python run_all.py

# Use your own image directory (outputs still go to the repo's outputs/)
python run_all.py --folios /path/to/your/images

# Use your own image and output directories (nothing written to the repo)
python run_all.py --folios /path/to/your/images --output /path/to/your/outputs

Output subfolders are created automatically:

<output>/htrflow_yolo/
<output>/htrflow_rtmdet/
<output>/kraken_blla/

Already-processed images are skipped on re-runs.

The individual scripts also accept the same flags and can be run separately:

python run_htrflow.py --model yolo
python run_htrflow.py --model rtmdet
python run_kraken.py
# all accept --folios and --output

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mothra-text

Current experiment: line segmentation model comparison

Repo layout

Environment setup

Running

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
data/folios		data/folios
experiments/pylaia_baseline		experiments/pylaia_baseline
outputs		outputs
pipelines		pipelines
steps		steps
tests		tests
.gitignore		.gitignore
README.md		README.md
run_all.py		run_all.py
run_htrflow.py		run_htrflow.py
run_kraken.py		run_kraken.py

Folders and files

Latest commit

History

Repository files navigation

mothra-text

Current experiment: line segmentation model comparison

Repo layout

Environment setup

Running

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages