A Survey of Data Processing of EMR (Electronic Medical Record) Based on Data Mining

Wencheng Sun; Fang Liu; Zhiping Cai; Shengqun Fang; Guoyan Wang

doi:10.20944/preprints201708.0055.v1

Submitted:

11 August 2017

Posted:

15 August 2017

You are already at the latest version

Abstract

At present, medical institutes generally use EMR to record patient's condition, including diagnostic information, procedures performed and treatment results. EMR has been recognized as a valuable resource for large scale analysis. However, EMR has the characteristics of diversity, incompleteness, redundancy and privacy, which make it difficult to carry out data mining and analysis directly. Therefore, it is necessary to preprocess the source data in order to improve data quality and improve the data mining results. Different types of data require different processing technologies. Most structured data commonly needs classic preprocessing technologies, including data cleansing, data integration, data transformation and data reduction. For semi-structured or unstructured data, such as medical text, containing more health information, it requires more complex and challenging processing methods. The task of information extraction for medical texts mainly includes NER (Named Entity Recognition) and RE (Relation Extraction). In this paper, we introduce the process of EMR processing, including data collection, data preprocessing, data mining, evaluation and knowledge application, analyze the current status of the key technologies, such as data preprocessing and data mining, and provide an overview of the application domains and prospects of EMR mining technologies. Finally, we summarize the existing problems in the research of EMR mining, and review the development trends.

Keywords:

EMR

;

data preprocessing

;

text mining

;

information extraction

;

medical decision support system

Subject:

Computer Science and Mathematics - Information Systems

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

A Survey of Data Processing of EMR (Electronic Medical Record) Based on Data Mining

Abstract

Keywords:

Subject:

MDPI Initiatives

Important Links

Subscribe