Develop an algorithm that is capable of checking if a dataset is conformant to a given schema and generating a machine-readable report about the conformity status.
The algorithm can be manually configured to check whether the dataset format matches the one declared by the user (specified as a MIME type, such as text/csv) and whether the dataset conforms to the provided schema (XML Schema, JSON Schema, SHACL Shapes, CSVW, ...).
Based on the provided information, the validation algorithm first checks whether the format is correct, then whether the data complies with the defined schema. The conformsTo value should be a URL pointing to the file (XSD, JSON, TTL, ...) that contains the schema definition, so that conformity checking can be automated.
For instance, for JSON-LD or other RDF serialisation formats, SHACL can be used and the validation automated, as is done by the online service: https://www.itb.ec.europa.eu/shacl/any/upload
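
The two-stage check described above (format first, then schema) could be sketched roughly as follows. This is only an illustration under assumptions: the names validate, FORMAT_PROBES and require_keys, the report fields, and the example schema URL are all hypothetical, and the schema check is a toy stand-in for a real validator (JSON Schema, SHACL, XSD, CSVW), which would be plugged in through the same callable.

```python
import csv
import io
import json
from typing import Callable, List


def _parses_as_json(raw: bytes) -> bool:
    """Format probe: True if the bytes decode and parse as JSON."""
    try:
        json.loads(raw.decode("utf-8"))
        return True
    except ValueError:  # covers UnicodeDecodeError and JSONDecodeError
        return False


def _parses_as_csv(raw: bytes) -> bool:
    """Format probe: True if the bytes parse as CSV with rows of equal width."""
    try:
        rows = list(csv.reader(io.StringIO(raw.decode("utf-8"), newline="")))
    except (UnicodeDecodeError, csv.Error):
        return False
    return bool(rows) and len({len(row) for row in rows}) == 1


# Hypothetical registry of format probes, keyed by MIME type.
FORMAT_PROBES = {"application/json": _parses_as_json, "text/csv": _parses_as_csv}

# A schema check takes the raw data and returns a list of violation messages.
SchemaCheck = Callable[[bytes], List[str]]


def validate(raw: bytes, mime_type: str, conforms_to: str,
             schema_check: SchemaCheck) -> dict:
    """Check the format first, then the schema, and return a report dict."""
    report = {"format": mime_type, "conformsTo": conforms_to,
              "formatValid": False, "schemaValid": None, "violations": []}
    probe = FORMAT_PROBES.get(mime_type)
    if probe is None:
        report["violations"].append(f"unsupported format: {mime_type}")
        return report
    if not probe(raw):
        # The schema check is skipped when the format check fails.
        report["violations"].append(f"data does not parse as {mime_type}")
        return report
    report["formatValid"] = True
    violations = schema_check(raw)
    report["schemaValid"] = not violations
    report["violations"].extend(violations)
    return report


def require_keys(*keys: str) -> SchemaCheck:
    """Toy stand-in for a schema validator: require top-level JSON properties."""
    def check(raw: bytes) -> List[str]:
        doc = json.loads(raw)
        return [f"missing property: {key}" for key in keys if key not in doc]
    return check


report = validate(b'{"id": 1}', "application/json",
                  "https://example.org/schema.json", require_keys("id", "name"))
print(json.dumps(report, indent=2))  # machine-readable conformity report
```

Because the report is a plain dictionary, it can be serialised to JSON as shown, or mapped onto another machine-readable vocabulary if the service requires one.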
An additional feature might be to offer this same service without the consumer parameters for format and conformsTo (or with them left empty by the consumer). In that case, the algorithm will try to obtain them from the dataset's DDO metadata, looking for dct:format and dct:conformsTo.
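
The fallback to metadata might look like the minimal sketch below. It assumes, purely for illustration, that the DDO metadata is available as a flat dictionary keyed by dct:format and dct:conformsTo; the actual DDO layout may differ, and the function name resolve_parameters is hypothetical.

```python
from typing import Optional, Tuple


def resolve_parameters(ddo_metadata: dict,
                       fmt: Optional[str] = None,
                       conforms_to: Optional[str] = None) -> Tuple[str, str]:
    """Prefer consumer-supplied values; fall back to the DDO metadata.

    Assumption: ddo_metadata is a flat dict that uses the keys
    "dct:format" and "dct:conformsTo".
    """
    # Empty strings count as "left empty by the consumer".
    fmt = fmt or ddo_metadata.get("dct:format")
    conforms_to = conforms_to or ddo_metadata.get("dct:conformsTo")
    missing = [name for name, value in
               (("format", fmt), ("conformsTo", conforms_to)) if not value]
    if missing:
        raise ValueError("cannot determine: " + ", ".join(missing))
    return fmt, conforms_to


metadata = {"dct:format": "text/csv",
            "dct:conformsTo": "https://example.org/schema.json"}
resolved = resolve_parameters(metadata, fmt="")
```

Raising an error when neither source provides a value is one possible design choice; the service could instead record the missing parameter in the conformity report.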