Towards automatic topical classification of LOD datasets
Meusel, Robert
;
Spahiu, Blerina
;
Bizer, Christian
;
Paulheim, Heiko
URL:
|
http://ceur-ws.org/Vol-1409/paper-03.pdf
|
Additional URL:
|
http://www.slideshare.net/BlerinaSpahiu/towards-au...
|
Document Type:
|
Conference or workshop publication
|
Year of publication:
|
2015
|
Book title:
|
LDOW 2015 : Proceedings of the Workshop on Linked Data on the Web ; co-located with the 24th International World Wide Web Conference (WWW 2015) ; Florence, Italy, May 19th, 2015
|
The title of a journal, publication series:
|
CEUR Workshop Proceedings
|
Volume:
|
1409
|
Page range:
|
Paper 03
|
Conference title:
|
LDOW 2015
|
Location of the conference venue:
|
Florence, Italy
|
Date of the conference:
|
May 19th 2015
|
Publisher:
|
Bizer, Christian
|
Place of publication:
|
Aachen, Germany
|
Publishing house:
|
RWTH Aachen
|
ISSN:
|
1613-0073
|
Publication language:
|
English
|
Institution:
|
School of Business Informatics and Mathematics > Information Systems V: Web-based Systems (Bizer 2012-) School of Business Informatics and Mathematics > Web Data Mining (Juniorprofessur) (Paulheim 2013-2017)
|
Subject:
|
004 Computer science, internet
|
Keywords (English):
|
LOD , LDOW , Classification
|
Abstract:
|
The datasets that are part of the Linking Open Data cloud
diagramm (LOD cloud) are classified into the following topical
categories: media, government, publications, life sciences,
geographic, social networking, user-generated content,
and cross-domain. The topical categories were manually
assigned to the datasets. In this paper, we investigate to
which extent the topical classification of new LOD datasets
can be automated using machine learning techniques and the
existing annotations as supervision. We conducted experiments
with different classification techniques and different
feature sets. The best classification technique/feature set
combination reaches an accuracy of 81.62% on the task of
assigning one out of the eight classes to a given LOD dataset.
A deeper inspection of the classification errors reveals problems
with the manual classification of datasets in the current
LOD cloud.
|
Additional information:
|
Online-Ressource
|
| Dieser Eintrag ist Teil der Universitätsbibliographie. |
Search Authors in
BASE:
Meusel, Robert
;
Spahiu, Blerina
;
Bizer, Christian
;
Paulheim, Heiko
Google Scholar:
Meusel, Robert
;
Spahiu, Blerina
;
Bizer, Christian
;
Paulheim, Heiko
ORCID:
Meusel, Robert, Spahiu, Blerina, Bizer, Christian ORCID: https://orcid.org/0000-0003-2367-0237 and Paulheim, Heiko ORCID: https://orcid.org/0000-0003-4386-8195
["search_editors_ORCID" not defined]
Bizer, Christian ORCID: https://orcid.org/0000-0003-2367-0237
You have found an error? Please let us know about your desired correction here: E-Mail
Actions (login required)
|
Show item |
|
|