This repository contains a dataset of images from 17th century American Spanish notary documents. The dataset is organized into one directory: images
. And a JSON file Labeled Data
which is in JSON formate. The images
, directory contains the actual images of the documents and the annotations for the images, and the JSON file contians all information related to the images with the contents and classes which is annotated and extracted using the labelImg software.
images/
: Contains the scanned images and the xml files of 17th century American Spanish notary documents.Labeled Data
: This JSON file contians all information related to the images with the contents and classes.
To view the annotations, you can use the labelImg software. Follow these steps to load the dataset and view the annotations:
- Download and install LabelImg.
- Clone this repository to your local machine:
git clone https://github.com/raopr/SpanishNotaryCollection.git
- The main page of LabelImg will look like the image shown below. At the beginning, you have to set the directory where your images and XML files are saved. After that, you need to set the directory where changes will be saved. To view the annotations in the LabelImg software, make sure that the scanned images and their corresponding XML files are in the same directory, as organized in the images directory. The annotations will look like the image below. The bounding boxes with the green circles in the corners represent the labeling or annotation process we performed.
![Notary](https://private-user-images.githubusercontent.com/58792703/340327717-98732301-2875-44d8-999d-eba70bc038c4.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNzM3NjEsIm5iZiI6MTczOTM3MzQ2MSwicGF0aCI6Ii81ODc5MjcwMy8zNDAzMjc3MTctOTg3MzIzMDEtMjg3NS00NGQ4LTk5OWQtZWJhNzBiYzAzOGM0LnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEyVDE1MTc0MVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTExNDIwMmJjMWRjNmNjM2FlMjg3NmUxMTUwOTg3ODVhYzc0ZjEzYTE5OGZiNDdhZTMyZGUxYmUyMTJkMmFjM2MmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.G3ImvVMiYCafOrJypXODhLt6qFkTN6afd_Fq8qp18v4)
Below are some sample images from rollos 40 and 38:
![rollo](https://private-user-images.githubusercontent.com/58792703/336920977-9f40fdcc-f8ed-443b-afda-866aec771730.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNzM3NjEsIm5iZiI6MTczOTM3MzQ2MSwicGF0aCI6Ii81ODc5MjcwMy8zMzY5MjA5NzctOWY0MGZkY2MtZjhlZC00NDNiLWFmZGEtODY2YWVjNzcxNzMwLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEyVDE1MTc0MVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTI3ZGU2NGY5YjUxYTIyY2IyNDE2MGJhZmU1MTU1YjE0OTc0YzBjZDFlZjQxYTUyN2E5ODUyZjAxNWMxNGM1ZjQmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.6Lc-50cwcviVtOdBHWWvLZwSB0duSKlsp40B-r9Xw2s)
![Rolloss](https://private-user-images.githubusercontent.com/58792703/336921934-30880d76-b0f1-4743-8b2f-6ac0dfe22182.png?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MzkzNzM3NjEsIm5iZiI6MTczOTM3MzQ2MSwicGF0aCI6Ii81ODc5MjcwMy8zMzY5MjE5MzQtMzA4ODBkNzYtYjBmMS00NzQzLThiMmYtNmFjMGRmZTIyMTgyLnBuZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNTAyMTIlMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjUwMjEyVDE1MTc0MVomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTQzYzAxMzZiMGE0NDBiNjYzMGQ4OWMxNDc3NjRjNGI2YTI5ZTI3YTEwNmU4Mzg2MDI0MGUxZWE5MjVmMDE3NTAmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0In0.sxbQi6AY2-tv3tDF2aIvlYUWGE9VGnlrj2mGlBqjZTM)