Think Twice Before You Answer: Mitigating Biases of Question Answering Models

Mikula, Lukáš

CS SKLog in Log in (EduId)

Theses s8blbw

Think Twice Before You Answer: Mitigating Biases of Question Answering Models – Mgr. Lukáš Mikula

Zpět na vyhledávání

Mgr. Lukáš Mikula

Master's thesis

Think Twice Before You Answer: Mitigating Biases of Question Answering Models

Abstract:

Veľké jazykové modely (z angl. Large Language Models) založené na architektúre Transformerov predstavujú najlepšie modely pre vačšinu problémov spracovania prirodzeného jazyka (z angl. Natural Language Modeling). Napriek tomu majú tieto modely tendenciu učiť sa skreslenia a systematické chyby z trénovacích datasetov. Toto učenie im síce môže pomôcť na datasetoch s rovnakou distribúciou, lenže znižuje …more

Abstract:

Large Language Models based on Transformer architecture hold state-of-the-art in a majority of Natural Language Modeling tasks. Nevertheless, these models tend to learn biases from training dataset, which help them on the training dataset, but hurt their out-of-domain accuracy as a result. This work focuses on obtaining more robust BERT models for the Extractive Question Answering task. We explore …more

Keywords

BERT SQuAD biases Extractive Question Answering robustness super-sampling fine-tuning Transformers

Language used: English

Date on which the thesis was submitted / produced: 17. 5. 2022

Identifier: https://is.muni.cz/th/adh58/

Thesis defence

Date of defence: 22. 6. 2022
Supervisor: Mgr. Michal Štefánik
Reader: RNDr. Vít Suchomel, Ph.D.

Citation record

Cite this text

ISO 690-compliant citation record:

MIKULA, Lukáš. \textit{Think Twice Before You Answer: Mitigating Biases of Question Answering Models}. Online. Master's thesis. Brno: Masaryk University, Faculty of Informatics. 2022. Available from: https://theses.cz/id/s8blbw/.

{{Citace kvalifikační práce
 | příjmení = Mikula
 | jméno = Lukáš
 | instituce = Masaryk University, Faculty of Informatics
 | titul = Think Twice Before You Answer: Mitigating Biases of Question Answering Models
 | url = https://theses.cz/id/s8blbw/
 | typ práce = Master's thesis
 | vedoucí = Mgr. Michal Štefánik
 | rok = 2022
 | počet stran =
 | strany =
 | citace = 2024-06-09
 | poznámka =
 | jazyk = 
}}

Full text of thesis

Contents of on-line thesis archive

Published in Theses:

světu

Other ways of accessing the text

Institution archiving the thesis and making it accessible: Masarykova univerzita, Fakulta informatiky

Reference to the local database directory of the institution

Masaryk University

Faculty of Informatics

Master programme / field:
Artificial intelligence and data processing / Big data

Theses on a related topic

Cross-lingual sentiment analysis with BERT
Mohsen Amini Riseh
BERT models in document classification
Ahmad Arsalan Khateeb
Analýza díla Jiřiny Prekopové a Berta Hellingera z transkulturní perspektivy
Lukáš Nosek
Nestandardní nástroje měnové politiky centrálních bank
Berta Smékalová
Přenos a tvorba elektronických položek v platebním systému České republiky
Berta Smékalová