I have a set of RDF documents (ontologies that describe datasets, meta-data, business ontologies etc.) in turtle file format. I would to import these RDFs into my newly setup CKAN data catalog.
Based on my limited knowledge, my understanding is that I need to have this " ckanext-dcat" extension/plugin installed on my ckan distribution. And the feature specifically I would be requiring is: An RDF Harvester that allows importing RDF serializations from other catalogs to create CKAN datasets (
Further my understanding is that this harvester needs a remote source, to download the remote file, extract all datasets using the parser and create or update actual CKAN datasets based on that. It will also handle deletions, ie if a dataset is not present any more in the DCAT dump anymore it will get deleted from CKAN.
Now, comes a list of my questions:
- I have a set of .ttl files on my fileshare. How can I import them using this harvester?
- Do my turtle file .ttl need to comply to some specific ckan schema in order for them to be parsed successfully by the harvester?
- Any pointer/example as to how I can get started with some test rdf datasets import into my newly setup ckan instance?