Dataset Catalog Vocabulary (DCAT)
Presentation, document
What is DCAT?
The W3C Dataset Catalog vocabulary (DCAT) is designed to facilitate interoperability between data catalogs published on the Web. It is based on a generic model for catalog contents including
record
description of (conceptual) entity
related concrete artefacts
DCAT includes properties from the well known Dublin Core metadata element set, which are associated with classes for
(Original 2014 edition of DCAT)
Catalog - a set of dataset descriptions and associated catalog records
Record - the registration information for an item in the catalog
Dataset - the description of a dataset, considered as a conceptual entity
Distribution - the description of an actual representation of a dataset, e.g. a file using a specified format
since a Dataset may be available in more than one format or representation, there is in general a 1:N relationship between Dataset and Distribution
(Upcoming 2019 extension of DCAT)
DataService - the description of a facility for discovery, access or processing data or related resources
and sub-classes DataDistributionService and DiscoveryService
catalogued Resource
superclass of Dataset, DataService, and potentially other things that need cataloguing
Where is it used?
Europe - various APs, incl GeoDCAT-AP
CKAN add-on
Implementation
DCAT is formalised as an RDF vocabulary.
dcat: Classes + skos:Concept/ConceptScheme + foaf:Agent
properties from dct, dcat
new classes for Services
new properties from PROV-O, DQV
Mix and match with any (compatible) RDF
ORG
SSN/SOSA
QB
VoID