Publikace: Towards the use of entropy as a measure for the reliability of automatic MT evaluation metrics
Článekopen accesspeer-reviewedpostprintNačítá se...
Soubory
Datum
Autoři
Munk, Michal
Munkova, Dasa
Benko, Ľubomír
Název časopisu
ISSN časopisu
Název svazku
Nakladatel
IOS Press
Abstrakt
The study describes an experiment with different estimations of reliability. Reliability reflects the technical quality of the measurement procedure such as an automatic evaluation of Machine Translation (MT). Reliability is an indicator of accuracy, the reliability of measuring, in our case, measuring the accuracy and error rate of MT output based on automatic metrics (precision, recall, f-measure, Bleu-n, WER, PER, and CDER). The experiment showed metrics (Bleu-4 and WER) that reduce the overall reliability of the automatic evaluation of accuracy and error rate using entropy. Based on the results we can say, that the use of entropy for the estimation of reliability brings more accurate results than conventional estimations of reliability (Cronbach's alpha and correlation). MT evaluation, based on n-grams or edit distance, using entropy could offer a new view on lexicon-based metrics in comparison to commonly used ones.
Popis
Klíčová slova
Entropy, machine translation, reliability estimation, quality, automatic MT evaluation, entropie, strojový překlad, odhad spolehlivosti, kvalita, automatická evaluace MT