Analyzing Text Complexity and Text Simplification: Connecting Linguistics, Processing and Educational Applications

Vajjala Balakrishna, Sowmya

Publikationsdienste
→
TOBIAS-lib - Publikationen und Dissertationen
→
5 Philosophische Fakultät
→
Dokumentanzeige

dc.contributor.advisor	Meurers, Detmar (Prof. Dr.)
dc.contributor.author	Vajjala Balakrishna, Sowmya
dc.date.accessioned	2015-08-03T07:57:17Z
dc.date.available	2015-08-03T07:57:17Z
dc.date.issued	2015-08-03
dc.identifier.other	442984162	de_DE
dc.identifier.uri	http://hdl.handle.net/10900/64359
dc.identifier.uri	http://nbn-resolving.de/urn:nbn:de:bsz:21-dspace-643598	de_DE
dc.identifier.uri	http://dx.doi.org/10.15496/publikation-5781
dc.identifier.uri	http://nbn-resolving.org/urn:nbn:de:bsz:21-dspace-643596	de_DE
dc.description.abstract	Reading plays an important role in the process of learning and knowledge acquisition for both children and adults. However, not all texts are accessible to every prospective reader. Reading difficulties can arise when there is a mismatch between a reader’s language proficiency and the linguistic complexity of the text they read. In such cases, simplifying the text in its linguistic form while retaining all the content could aid reader comprehension. In this thesis, we study text complexity and simplification from a computational linguistic perspective. We propose a new approach to automatically predict the text complexity using a wide range of word level and syntactic features of the text. We show that this approach results in accurate, generalizable models of text readability that work across multiple corpora, genres and reading scales. Moving from documents to sentences, We show that our text complexity features also accurately distinguish different versions of the same sentence in terms of the degree of simplification performed. This is useful in evaluating the quality of simplification performed by a human expert or a machine-generated output and for choosing targets to simplify in a difficult text. We also experimentally show the effect of text complexity on readers’ performance outcomes and cognitive processing through an eye-tracking experiment. Turning from analyzing text complexity and identifying sentential simplifications to generating simplified text, one can view automatic text simplification as a process of translation from English to simple English. In this thesis, we propose a statistical machine translation based approach for text simplification, exploring the role of focused training data and language models in the process. Exploring the linguistic complexity analysis further, we show that our text complexity features can be useful in assessing the language proficiency of English learners. Finally, we analyze German school textbooks in terms of their linguistic complexity, across various grade levels, school types and among different publishers by applying a pre-existing set of text complexity features developed for German.	en
dc.language.iso	en	de_DE
dc.publisher	Universität Tübingen	de_DE
dc.rights	ubt-podok	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=de	de_DE
dc.rights.uri	http://tobias-lib.uni-tuebingen.de/doku/lic_mit_pod.php?la=en	en
dc.subject.classification	Computerlinguistik , Sprachdaten , Lesbarkeit	de_DE
dc.subject.ddc	400	de_DE
dc.subject.other	Readability Assessment	en
dc.subject.other	Text Simplification	en
dc.subject.other	Text Komplexität	de_DE
dc.subject.other	Educational Applications	en
dc.subject.other	Text Klassifikation	de_DE
dc.subject.other	Text Classification	en
dc.subject.other	Complexity of Textbooks	en
dc.subject.other	Simplification	de_DE
dc.subject.other	L2 Proficiency Assessment	en
dc.subject.other	Zweitspracherweb	de_DE
dc.subject.other	Lesbarkeit	de_DE
dc.title	Analyzing Text Complexity and Text Simplification: Connecting Linguistics, Processing and Educational Applications	en
dc.type	PhDThesis	de_DE
dcterms.dateAccepted	2015-07-27
utue.publikation.fachbereich	Allgemeine u. vergleichende Sprachwissenschaft	de_DE
utue.publikation.fakultaet	5 Philosophische Fakultät	de_DE

Dateien:	THESIS-FINAL.pdf 2.18 MB PDF

Das Dokument erscheint in:

5 Philosophische Fakultät [1767]

Zur Kurzanzeige

Veröffentlichen

Stöbern

Gesamter Bestand
Diese Sammlung

Mein Benutzerkonto

Einloggen

Analyzing Text Complexity and Text Simplification: Connecting Linguistics, Processing and Educational Applications

DSpace Repositorium (Manakin basiert)

Das Dokument erscheint in:

Stöbern

Gesamter Bestand

Diese Sammlung

Mein Benutzerkonto