ArCo, GVP and DBpedia Linking Initiative
Stefano Faralli, Andrea Lenzi, Paola Velardi
Department of Computer Science, Sapienza University of Rome, Italy
Intelligent Information Mining - http://iim.di.uniroma1.it/
The Linking Initiative
The ArCo, GVP and DBpedia Linking Initiative (AGDLI) is a research activity within the project "SMARTOUR: intelligent platforms for tourism", funded by the Italian Ministry of University and Research.
Our initiative aims to link ArCo’s cultural entities [1,4] to the well-known Getty Vocabulary Program [3,5] and DBpedia [2,6] ontologies, with the main goal of providing a semantically rich representation of the Italian cultural heritage for tourism-related knowledge-based applications.
Resources
Neural Word Sense Disambiguation to prune a large Knowledge Graph of the Italian Cultural Heritage (September 2022)
code and data repository:
AGDLI (ver 2.0 - January 2022)
The current version of the graph is released as a tab-separated values (.tsv) file with three columns: subject, predicate and object. The resource can therefore be processed directly with state-of-the-art tools such as PyKEEN.
The zip archive containing the 'AGDLI_2_0.tsv' file.
Example:
...
https://w3id.org/arco/resource/ScientificOrTechnologicalHeritage/2000236322 type http://vocab.getty.edu/aat/300431697
https://w3id.org/arco/resource/ScientificOrTechnologicalHeritage/2000236323 subject http://vocab.getty.edu/aat/300193182
...
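As a rough illustration of the three-column layout described above, the following sketch reads triples with Python's standard library and counts predicates. The two sample rows are copied from the example; the tab delimiter, the absence of a header row, and the helper name `read_triples` are assumptions made for illustration, not a documented loader for the resource.

```python
import csv
import io
from collections import Counter

# Two sample rows copied from the example above. The real file is assumed
# to separate subject, predicate and object with a tab, with no header row.
SAMPLE = (
    "https://w3id.org/arco/resource/ScientificOrTechnologicalHeritage/2000236322"
    "\ttype\thttp://vocab.getty.edu/aat/300431697\n"
    "https://w3id.org/arco/resource/ScientificOrTechnologicalHeritage/2000236323"
    "\tsubject\thttp://vocab.getty.edu/aat/300193182\n"
)


def read_triples(handle):
    """Yield (subject, predicate, object) tuples from a tab-separated stream."""
    for row in csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE):
        if len(row) != 3:
            continue  # skip blank or malformed lines
        yield tuple(row)


triples = list(read_triples(io.StringIO(SAMPLE)))
predicate_counts = Counter(predicate for _, predicate, _ in triples)
print(predicate_counts)
```

To try this on the extracted 'AGDLI_2_0.tsv' file, the `io.StringIO(SAMPLE)` argument could be replaced with an open file handle; UTF-8 encoding is assumed here rather than confirmed.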
AGDLI (ver 1.0 - June 2021)
To cite version 1.0 of our resource, please use the following:
@inproceedings{DBLP:conf/semweb/0001LV21,
  author    = {Stefano Faralli and Andrea Lenzi and Paola Velardi},
  editor    = {Oshani Seneviratne and Catia Pesquita and Juan Sequeda and Lorena Etcheverry},
  title     = {{AGDLI:} ArCo, {GVP} and DBpedia Linking Initiative},
  booktitle = {Proceedings of the {ISWC} 2021 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice co-located with 20th International Semantic Web Conference {(ISWC} 2021), Virtual Conference, October 24-28, 2021},
  series    = {{CEUR} Workshop Proceedings},
  volume    = {2980},
  publisher = {CEUR-WS.org},
  year      = {2021},
  url       = {http://ceur-ws.org/Vol-2980/paper304.pdf},
  timestamp = {Mon, 25 Oct 2021 15:03:56 +0200},
  biburl    = {https://dblp.org/rec/conf/semweb/0001LV21.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}
AAT_translation_EN_IT.tsv (download)
A tab-separated values (.tsv) file containing the automatic English-to-Italian translation of the 55K AAT terms.
Example:
...
http://vocab.getty.edu/aat/300379603 salinization salinizzazione
http://vocab.getty.edu/aat/300379600 thermochemistry termochimica
http://vocab.getty.edu/aat/300379601 aerobiology aerobiologia
...
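The translation file lends itself to a simple lookup table keyed by AAT concept URI. The sketch below assumes three tab-separated columns (URI, English term, Italian term) and no header row, as the example suggests; `load_translations` is a hypothetical helper name, not part of the released resource.

```python
import csv
import io

# Sample rows copied from the example above; tabs between the columns
# are an assumption based on the .tsv extension.
SAMPLE = (
    "http://vocab.getty.edu/aat/300379603\tsalinization\tsalinizzazione\n"
    "http://vocab.getty.edu/aat/300379600\tthermochemistry\ttermochimica\n"
    "http://vocab.getty.edu/aat/300379601\taerobiology\taerobiologia\n"
)


def load_translations(handle):
    """Map each AAT concept URI to its (English, Italian) term pair."""
    table = {}
    for row in csv.reader(handle, delimiter="\t", quoting=csv.QUOTE_NONE):
        if len(row) != 3:
            continue  # skip blank or malformed lines
        uri, english, italian = row
        table[uri] = (english, italian)
    return table


translations = load_translations(io.StringIO(SAMPLE))
english, italian = translations["http://vocab.getty.edu/aat/300379600"]
print(english, "->", italian)
```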
An N-Triples (.nt) file containing triples, where:
1) subjects are ArCo entities;
2) predicates are skos:related or skos:relatedMatch (for relations extracted from dc:label and dc:type respectively);
3) objects are GVP AAT concepts.
Example:
...
<https://w3id.org/arco/resource/HistoricOrArtisticProperty/1200230637> <http://www.w3.org/2004/02/skos/core#related> <http://vocab.getty.edu/aat/300041872> .
<https://w3id.org/arco/resource/HistoricOrArtisticProperty/1200230637> <http://www.w3.org/2004/02/skos/core#relatedMatch> <http://vocab.getty.edu/aat/300041872> .
...
An N-Triples (.nt) file containing triples, where:
1) subjects are of type clvapit:City;
2) predicates are skos:relatedMatch;
3) objects are GVP TGN concepts.
Example:
...
<https://w3id.org/arco/resource/City/tarquinia> <http://www.w3.org/2004/02/skos/core#relatedMatch> <http://vocab.getty.edu/tgn/7006776> .
...
An N-Triples (.nt) file containing triples, where:
1) subjects are of type clvapit:City;
2) predicates are owl:sameAs;
3) objects are DBpedia entities.
Example:
...
<https://w3id.org/arco/resource/City/treviso> <http://www.w3.org/2002/07/owl#sameAs> <https://dbpedia.org/resource/Treviso> .
...
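Since every triple in the three N-Triples examples above consists only of IRIs in angle brackets, a small regular expression is enough to read them for a quick inspection. The sketch below is a deliberately narrow illustration: it ignores literals, blank nodes and comments, so a full RDF library such as rdflib would be a safer choice for general N-Triples data. The names `IRI_TRIPLE` and `parse_iri_triples` are hypothetical.

```python
import re
from collections import defaultdict

# Matches a line whose subject, predicate and object are all IRIs in angle
# brackets, followed by the terminating dot. Literals are NOT handled.
IRI_TRIPLE = re.compile(
    r"^\s*<([^<>\s]+)>\s+<([^<>\s]+)>\s+<([^<>\s]+)>\s*\.\s*$"
)

# Sample lines copied from the three examples above.
SAMPLE = """\
<https://w3id.org/arco/resource/HistoricOrArtisticProperty/1200230637> <http://www.w3.org/2004/02/skos/core#related> <http://vocab.getty.edu/aat/300041872> .
<https://w3id.org/arco/resource/HistoricOrArtisticProperty/1200230637> <http://www.w3.org/2004/02/skos/core#relatedMatch> <http://vocab.getty.edu/aat/300041872> .
<https://w3id.org/arco/resource/City/tarquinia> <http://www.w3.org/2004/02/skos/core#relatedMatch> <http://vocab.getty.edu/tgn/7006776> .
<https://w3id.org/arco/resource/City/treviso> <http://www.w3.org/2002/07/owl#sameAs> <https://dbpedia.org/resource/Treviso> .
"""


def parse_iri_triples(lines):
    """Yield (subject, predicate, object) for each IRI-only triple line."""
    for line in lines:
        match = IRI_TRIPLE.match(line)
        if match:
            yield match.groups()


# Group the outgoing links of each ArCo subject.
links = defaultdict(list)
for subject, predicate, obj in parse_iri_triples(SAMPLE.splitlines()):
    links[subject].append((predicate, obj))

for predicate, obj in links["https://w3id.org/arco/resource/City/treviso"]:
    print(predicate, obj)
```

Grouping by subject makes it straightforward to see, for a given ArCo entity, which AAT, TGN or DBpedia resources it has been linked to.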
License
Resources are licensed under: Creative Commons Attribution 4.0 International (CC BY 4.0)
Acknowledgements
This work was carried out within the research project "SMARTOUR: intelligent platform for tourism", funded by the Italian Ministry of University and Research with the Regional Development Fund of the European Union (PON Research and Competitiveness 2007-2013).
ISWC2021 POSTER P1 video teaser (26 October 2021)
References
1. Carriero, V.A., Gangemi, A., Mancinelli, M.L., Marinucci, L., Nuzzolese, A.G., Presutti, V., Veninata, C.: ArCo: The Italian Cultural Heritage Knowledge Graph. In: ISWC 2019. pp. 36–52. Springer Int. Publishing, Cham (2019).
2. Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., Ives, Z.: DBpedia: A Nucleus for a Web of Open Data. In: ISWC’07/ASWC’07. pp. 722–735. Springer-Verlag, Berlin, Heidelberg (2007).
3. Harpring, P.: Development of the Getty Vocabularies: AAT, TGN, ULAN, and CONA. Art Documentation: Journal of the Art Libraries Society of North America 29(1), 67–72 (2010).
4. ArCo: http://wit.istc.cnr.it/arco/index.php?lang=en
5. GVP: https://www.getty.edu/research/tools/vocabularies/
6. DBpedia: https://www.dbpedia.org/
Contacts
Stefano Faralli: surname[at]di[dot]uniroma1[dot]it
Last Update: 05 January 2022, 12:00 CET