RUSSE'2018 : a shared task on word sense induction for the Russian language
Panchenko, Alexander
;
Lopukhina, Anastasiya
;
Ustalov, Dmitry
;
Lopukhin, Konstantin
;
Arefyev, Nikolay
;
Leontyev, Alexey
;
Loukachevitch, Natalia
URL:
|
https://ub-madoc.bib.uni-mannheim.de/44299
|
Additional URL:
|
http://www.dialog-21.ru/media/4280/dialog2018scopu...
|
URN:
|
urn:nbn:de:bsz:180-madoc-442992
|
Document Type:
|
Conference or workshop publication
|
Year of publication:
|
2018
|
Book title:
|
Computational Linguistics and Intellectual Technologies : Papers from the Annual conference "Dialogue" 2018 : 24th International Conference on Computational Linguistics and Intellectual Technologies, May 30 - June 2, 2018 Moscow
|
The title of a journal, publication series:
|
Dialogue
|
Volume:
|
17
|
Page range:
|
547-564
|
Conference title:
|
24rd International Conference on Computational Linguistics and Intellectual Technologies 2018
|
Location of the conference venue:
|
Moscow, Russia
|
Date of the conference:
|
May 30-June 2, 2018
|
Place of publication:
|
Moscow, Russia
|
Publishing house:
|
RSUH
|
ISSN:
|
2221-7932 , 2075-7182
|
Publication language:
|
English
|
Institution:
|
School of Business Informatics and Mathematics > Information Systems III: Enterprise Data Analysis (Ponzetto 2016-)
|
License:
|
Creative Commons Attribution 4.0 International (CC BY 4.0)
|
Subject:
|
004 Computer science, internet 400 Language, linguistics
|
Keywords (English):
|
lexical semantics , word sense induction , word sense disambiguation , polysemy , homonymy
|
Abstract:
|
The paper describes the results of the first shared task on word sense induction (WSI) for the Russian language. While similar shared tasks were conducted in the past for some Romance and Germanic languages, we explore the performance of sense induction and disambiguation methods for a Slavic language that shares many features with other Slavic languages, such as rich morphology and free word order. The participants were asked to group contexts with a given word in accordance with its senses that were not provided beforehand. For instance, given a word “bank” and a set of contexts with this word, e.g. “bank is a financial institution that accepts deposits” and “river bank is a slope beside a body of water”, a participant was asked to cluster such contexts in the unknown in advance number of clusters corresponding to, in this case, the “company” and the “area” senses of the word “bank”. For the purpose of this evaluation campaign, we developed three new evaluation datasets based on sense inventories that have different sense granularity. The contexts in these datasets were sampled from texts of Wikipedia, the academic corpus of Russian, and an explanatory dictionary of Russian. Overall 18 teams participated in the competition submitting 383 models. Multiple teams managed to substantially outperform competitive state-of-the-art baselines from the previous years based on sense embeddings.
|
| Dieser Eintrag ist Teil der Universitätsbibliographie. |
| Das Dokument wird vom Publikationsserver der Universitätsbibliothek Mannheim bereitgestellt. |
Search Authors in
BASE:
Panchenko, Alexander
;
Lopukhina, Anastasiya
;
Ustalov, Dmitry
;
Lopukhin, Konstantin
;
Arefyev, Nikolay
;
Leontyev, Alexey
;
Loukachevitch, Natalia
Google Scholar:
Panchenko, Alexander
;
Lopukhina, Anastasiya
;
Ustalov, Dmitry
;
Lopukhin, Konstantin
;
Arefyev, Nikolay
;
Leontyev, Alexey
;
Loukachevitch, Natalia
ORCID:
Panchenko, Alexander ORCID: https://orcid.org/0000-0001-6097-6118, Lopukhina, Anastasiya, Ustalov, Dmitry ORCID: https://orcid.org/0000-0002-9979-2188, Lopukhin, Konstantin, Arefyev, Nikolay, Leontyev, Alexey and Loukachevitch, Natalia
You have found an error? Please let us know about your desired correction here: E-Mail
Actions (login required)
|
Show item |
|