Uncertainty Detection in Natural Language Texts

Vincze Veronika
Uncertainty Detection in Natural Language Texts.
PhD, University of Szeged.
(2015)


Abstract in foreign language

Uncertainty is an important linguistic phenomenon that is relevant in many fields of language processing. In its most general sense, it can be interpreted as a lack of information: the hearer or the reader cannot be certain about some pieces of information. Thus, uncertain propositions are those whose truth value or reliability cannot be determined due to a lack of information. Distinguishing between factual (i.e. true or false) and uncertain propositions is of primary importance both in linguistics and in natural language processing applications. For instance, in information extraction an uncertain piece of information might also be of interest to an end user, but such information must not be confused with factual textual evidence (reliable information), and the two should be kept separate. The main objective of this thesis is to detect uncertainty in English and Hungarian natural language texts. As opposed to earlier studies, which focused on specific domains and were English-oriented, we offer here a comprehensive approach to uncertainty detection that can be easily adapted to the specific needs of many domains and languages. In our investigations, we pay particular attention to creating linguistically plausible models of uncertainty. These models are used to create manually annotated corpora, which in turn serve as the basis for implementing our uncertainty detectors for several domains with the help of supervised machine learning techniques. Furthermore, we demonstrate that uncertainty detection can be fruitfully applied in a real-world application, namely information extraction from clinical discharge summaries.

Item Type: Thesis (PhD)
Creators: Vincze Veronika
Hungarian title label: Bizonytalanság azonosítása természetes nyelvű szövegekben
Title of the thesis in foreign language: Uncertainty Detection in Natural Language Texts
Divisions: Doctoral School Informatics
Discipline label: engineering sciences > computer science
Defence date label: 2015. June 24.
Supervisor label:
Supervisor | Supervisor scientific name label | Email
Prof. Csirik János | DSc., full professor, SZTE TTIK Számítógépes Algoritmusok és Mesterséges Intelligencia Tanszék | csirik@inf.u-szeged.hu
Reviewer label:
Reviewer name label | Reviewer scientific name label | Email
Prof. Prószéky Gábor | full professor, DSc., Pázmány Péter Katolikus Egyetem ITK | proszeky.gabor@itk.ppke.hu
Dr. Miháltz Márton | Ph.D., research fellow, MTA Nyelvtudományi Intézet | mihaltz.marton@nytud.mta.hu
President label:
President name label | President scientific name label | Email
Prof. Gyimóthy Tibor | DSc., head of department, full professor, SZTE TTIK Szoftverfejlesztés Tanszék | gyimi@inf.u-szeged.hu
Member label:
Member name label | Member scientific name label | Email
Dr. Simon Eszter | Ph.D., research fellow, MTA Nyelvtudományi Intézet | simon.eszter@nytud.mta.hu
Dr. Bánhelyi Balázs | Ph.D., assistant professor, SZTE TTIK Számítógépes Optimalizálás Tanszék | banhelyi@inf.u-szeged.hu
Item ID: 2291
Identification Number: 2804039
doi: https://doi.org/10.14232/phd.2291
Date Deposited: 2014. Jul. 04. 22:38
Last Modified: 2016. Feb. 19. 15:44
Depository no.: B 5902
Supervisor: Prof. Csirik János
Supervisor scientific name label: DSc., full professor, SZTE TTIK Számítógépes Algoritmusok és Mesterséges Intelligencia Tanszék
URI: http://doktori.bibl.u-szeged.hu/id/eprint/2291
Defence/Citable status: Defended.
