Sentence alignment methods for improving text simplification systems


Štajner, Sanja ; Franco-Salvador, Mark ; Ponzetto, Simone Paolo ; Rosso, Paolo ; Stuckenschmidt, Heiner


DOI: https://doi.org/10.18653/v1/P17-2016
URL: http://aclanthology.coli.uni-saarland.de/pdf/P/P17...
Additional URL: http://www.aclweb.org/anthology/P17-2016
Document Type: Conference or workshop publication
Year of publication: 2017
Book title: The 55th Annual Meeting of the Association for Computational Linguistics - proceedings of the conference : July 30-August 4, 2017, Vancouver, Canada : ACL 2017
Volume: 2
Page range: 97-102
Conference title: The 55th Annual Meeting of the Association for Computational Linguistics (ACL)
Location of the conference venue: Vancouver, Canada
Date of the conference: July 30 - August 4 2017
Author/Publisher of the book
(only the first ones mentioned)
:
Barzilay, Regina
Place of publication: Stroudsburg, PA
Publishing house: Association for Computational Linguistics
ISBN: 978-1-945626-76-0
Related URLs: http://aclweb.org/anthology/P17-2
Publication language: English
Institution: School of Business Informatics and Mathematics > Wirtschaftsinformatik III (Ponzetto 2016-)
Außerfakultäre Einrichtungen > SFB 884
School of Business Informatics and Mathematics > Praktische Informatik II (Stuckenschmidt 2009-)
Subject: 004 Computer science, internet
Keywords (English): automated text simplification , sentence alignment , natural language processing
Abstract: We provide several methods for sentence alignment of texts with different complexity levels. Using the best of them, we sentence-align the Newsela corpora, thus providing large training materials for automatic text simplification (ATS) systems. We show that using this dataset, even the standard phrase-based statistical machine translation models for ATS can outperform the state-of-the-art ATS systems.

Dieser Eintrag ist Teil der Universitätsbibliographie.




+ Citation Example and Export

Štajner, Sanja ; Franco-Salvador, Mark ; Ponzetto, Simone Paolo ; Rosso, Paolo ; Stuckenschmidt, Heiner ORCID: 0000-0002-0209-3859 Sentence alignment methods for improving text simplification systems. Barzilay, Regina 2 97-102 In: The 55th Annual Meeting of the Association for Computational Linguistics - proceedings of the conference : July 30-August 4, 2017, Vancouver, Canada : ACL 2017 (2017) Stroudsburg, PA The 55th Annual Meeting of the Association for Computational Linguistics (ACL) (Vancouver, Canada) [Conference or workshop publication]


+ Search Authors in

+ Page Views

Hits per month over past year

Detailed information



You have found an error? Please let us know about your desired correction here: E-Mail


Actions (login required)

Show item Show item