Deployment of RDFa, Microdata, and Microformats on the Web - A Quantitative Analysis
Bizer, Christian; Eckert, Kai; Meusel, Robert; Mühleisen, Hannes; Schuhmacher, Michael; Völker, Johanna
DOI: https://doi.org/10.1007/978-3-642-41338-4_2
URL: http://dws.informatik.uni-mannheim.de/fileadmin/le...
Additional URL: http://hannes.muehleisen.org/Bizer-etal-Deployment...
Document type: Conference publication
Year of publication: 2013
Book title: The Semantic Web - ISWC 2013 : 12th International Semantic Web Conference, Sydney, NSW, Australia, October 21-25, 2013, Proceedings, Part II
Journal or series title: Lecture Notes in Computer Science
Volume: 8219
Page range: 17-32
Event date: Oct. 21-25, 2013
Editor: Alani, Harith
Place of publication: Berlin [et al.]
Publisher: Springer
ISBN: 978-3-642-41337-7
ISSN: 0302-9743, 1611-3349
Language of publication: English
Institution: Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Information Systems V: Web-based Systems (Bizer 2012-); Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Practical Computer Science II: Artificial Intelligence (Stuckenschmidt 2009-)
Subject: 004 Computer science
Keywords (English): Web Science, Web of Data, RDFa, Microdata, Microformats
Abstract:
More and more websites embed structured data describing for instance products, reviews, blog posts, people, organizations, events, and cooking recipes into their HTML pages using markup standards such as Microformats, Microdata and RDFa. This development has accelerated in the last two years as major Web companies, such as Google, Facebook, Yahoo!, and Microsoft, have started to use the embedded data within their applications. In this paper, we analyze the adoption of RDFa, Microdata, and Microformats across the Web. Our study is based on a large public Web crawl dating from early 2012 and consisting of 3 billion HTML pages which originate from over 40 million websites. The analysis reveals the deployment of the different markup standards, the main topical areas of the published data as well as the different vocabularies that are used within each topical area to represent data. What distinguishes our work from earlier studies, published by the large Web companies, is that the analyzed crawl as well as the extracted data are publicly available. This allows our findings to be verified and to be used as starting points for further domain-specific investigations as well as for focused information extraction endeavors.
This entry is part of the university bibliography.
ORCID (Bizer, Christian): https://orcid.org/0000-0003-2367-0237