Exploiting microdata annotations to consistently categorize product offers at web scale
Meusel, Robert
;
Primpeli, Anna
;
Meilicke, Christian
;
Paulheim, Heiko
;
Bizer, Christian
DOI:
|
https://doi.org/10.1007/978-3-319-27729-5_7
|
URL:
|
https://www.researchgate.net/publication/300113438...
|
Weitere URL:
|
http://dws.informatik.uni-mannheim.de/fileadmin/le...
|
Dokumenttyp:
|
Konferenzveröffentlichung
|
Erscheinungsjahr:
|
2015
|
Buchtitel:
|
E-Commerce and Web Technologies : 16th International Conference on Electronic Commerce and Web Technologies, EC-Web 2015, Valencia, Spain, September 2015, revised selected papers
|
Titel einer Zeitschrift oder einer Reihe:
|
Lecture Notes in Business Information Processing : LNBIP
|
Band/Volume:
|
239
|
Seitenbereich:
|
83-99
|
Veranstaltungstitel:
|
EC-Web 2015
|
Veranstaltungsort:
|
Valencia, Spain
|
Veranstaltungsdatum:
|
September 01-02, 2015
|
Herausgeber:
|
Stuckenschmidt, Heiner
|
Ort der Veröffentlichung:
|
Berlin [u.a.]
|
Verlag:
|
Springer
|
ISBN:
|
978-3-319-27728-8 , 978-3-319-27729-5
|
ISSN:
|
1865-1348 , 1865-1356
|
Sprache der Veröffentlichung:
|
Englisch
|
Einrichtung:
|
Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Information Systems V: Web-based Systems (Bizer 2012-) Fakultät für Wirtschaftsinformatik und Wirtschaftsmathematik > Web Data Mining (Juniorprofessur) (Paulheim 2013-2017)
|
Fachgebiet:
|
004 Informatik
|
Freie Schlagwörter (Englisch):
|
Microdata , RDFa , Structured Web Data , Classification
|
Abstract:
|
Semantically annotated data, using markup languages like RDFa and Microdata, has become more and more publicly available in the Web, especially in the area of e-commerce. Thus, a large amount of structured product descriptions are freely available and can be used for various applications, such as product search or recommendation. However, little efforts have been made to analyze the categories of the available product descriptions. Although some products have an explicit category assigned, the categorization schemes vary a lot, as the products originate from thousands of different sites. This heterogeneity makes the use of supervised methods, which have been proposed by most previous works, hard to apply. Therefore, in this paper, we explain how distantly supervised approaches can be used to exploit the heterogeneous category information in order to map the products to set of target categories from an existing product catalogue. Our results show that, even though this task is by far not trivial, we can reach almost 56% accuracy for classifying products into 37 categories.
|
| Dieser Eintrag ist Teil der Universitätsbibliographie. |
Suche Autoren in
BASE:
Meusel, Robert
;
Primpeli, Anna
;
Meilicke, Christian
;
Paulheim, Heiko
;
Bizer, Christian
Google Scholar:
Meusel, Robert
;
Primpeli, Anna
;
Meilicke, Christian
;
Paulheim, Heiko
;
Bizer, Christian
ORCID:
Meusel, Robert, Primpeli, Anna ORCID: https://orcid.org/0000-0002-1783-2482, Meilicke, Christian ORCID: https://orcid.org/0000-0002-0198-5396, Paulheim, Heiko ORCID: https://orcid.org/0000-0003-4386-8195 and Bizer, Christian ORCID: https://orcid.org/0000-0003-2367-0237
Sie haben einen Fehler gefunden? Teilen Sie uns Ihren Korrekturwunsch bitte hier mit: E-Mail
Actions (login required)
|
Eintrag anzeigen |
|