The MIIS Eprints Archive

Estimation of errors in text and data processing

Slavova, A. and Valkov, B. and Tonchev, K. and Daskalova, N. and Nikolova, M. and Bivas, M. and Mateev, P. and Yordanova, R. and Zhelezova, S. (2013) Estimation of errors in text and data processing. [Study Group Report]



The company Adiss Lab Ltd. obtained 1 000 000 medical reports that are either in free-form text or in XML format. One of the main goals of their development is to integrate an algorithm for information extraction (IE) into their platform. The verification of the algorithm’s output for a report is done by a medical doctor (MD) for a certain fee. Validating the correctness of all data would be overwhelming and very expensive. Hence, the problem, as presented by the company, is to provide a method (algorithm) which determines the minimum number of reports needed to validate the correctness of the IE algorithm, together with a procedure for selecting these reports.

In order to solve the problem we have considered an algorithm-centric approach that uses active learning and semi-supervised learning.
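
To give a sense of scale for the "minimum number of reports" question, the sketch below computes a classical random-sampling baseline: how many reports an MD would need to check to estimate the IE algorithm's error rate within a chosen margin. This is not the method of the report, which relies on active and semi-supervised learning. The function name, the 95% confidence level, the 1% margin, and the worst-case error rate of 0.5 are all assumptions made for illustration.

```python
import math
from statistics import NormalDist


def baseline_sample_size(population, margin, confidence=0.95, expected_error_rate=0.5):
    """Number of reports to validate under simple random sampling.

    Uses the normal-approximation formula for a proportion with a
    finite population correction. All parameter defaults are
    illustrative assumptions, not values taken from the report.
    """
    # Two-sided critical value for the requested confidence level.
    z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    p = expected_error_rate
    # Sample size for an effectively infinite population.
    n0 = (z ** 2) * p * (1 - p) / (margin ** 2)
    # Finite population correction for a pool of `population` reports.
    n = n0 / (1 + (n0 - 1) / population)
    return math.ceil(n)


if __name__ == "__main__":
    # 1 000 000 reports, error rate estimated to within +/- 1 percentage point.
    print(baseline_sample_size(population=1_000_000, margin=0.01))
```

Under these assumed settings the baseline comes to somewhat under ten thousand reports out of the 1 000 000 available. Any approach that selects reports adaptively, such as the active learning considered here, would be judged against a figure of this kind.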

Item Type:Study Group Report
Problem Sectors:Medical and pharmaceutical
Data processing
Study Groups:European Study Group with Industry > ESGI 95 (Sofia, Bulgaria, Sept 23-27, 2013)
Company Name:Adiss Lab Ltd.
ID Code:630
Deposited By: Matthew Hennessy
Deposited On:04 Dec 2013 23:04
Last Modified:29 May 2015 20:15
