Medical text data-oriented filtering method and system

A technology of text data and filtering methods, which is applied in the fields of text database clustering/classification, unstructured text data retrieval, neural learning methods, etc. The effect of small training memory overhead, reducing redundancy, and improving data filtering quality

Pending Publication Date: 2021-04-27
ENJOYOR COMPANY LIMITED
View PDF0 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the length of the text sequence of medical records is far greater than the length of the sequence that can be processed by the deep neural network, resulting in the neural network not being able to obtain the entire text sequence information well, and training to obtain the optimal model, so it is necessary to compress and filter the medical data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medical text data-oriented filtering method and system
  • Medical text data-oriented filtering method and system
  • Medical text data-oriented filtering method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0118] The present invention will be further described below in conjunction with specific examples, but the present invention is not limited to these specific implementations. Those skilled in the art will realize that the present invention covers all alternatives, modifications and equivalents as may be included within the scope of the claims.

[0119] Such as figure 1 As shown, this embodiment provides a filtering system for medical text data, including a data preprocessing module, a model training module, a fusion voting mechanism module, and a feedback correction module. The data preprocessing module is used for data preprocessing, including obtaining the medical record corpus of patients in the medical field as training corpus, data cleaning, regular extraction, sentence segmentation, text segmentation, and text labeling; the model training module is in the convolution Five basic models constructed on the basis of the two model structures of the neural network CNN and th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a medical text data-oriented filtering method, which comprises the following specific steps of: obtaining a medical record data set S which is not labeled with a medical category label, inputting the medical record data set S into a trained DSSM-C-BiLSTM model, outputting a predicted medical category label Label of the data set S, and performing data filtering through the label Label; the DSSM-C-BiLSTM model training process comprises the following steps: (1) collecting a medical record data set A, performing data preprocessing on the medical record data set A to obtain a data set B, and dividing the data set B into a training set and a test set; (2) constructing a DSSM-C-BiLSTM model, inputting a training set of the data set B into the DSSM-C-BiLSTM model for training and learning, and inputting a test set of the data set B into the trained DSSM-C-BiLSTM model to obtain a medical category label probability; and (3) performing model evaluation index calculation according to the predicted medical category label and the real medical category label, and finishing model training when the model evaluation index meets the condition.

Description

technical field [0001] The invention belongs to the field of natural language processing, and in particular relates to a filtering method and system for medical text data. Background technique [0002] With the rapid development of Internet technology, hospitals store a large amount of medical information and resources, but due to the hysteresis of relevant laws and regulations in the field of health and medical big data, the data has not been fully mined and utilized, and related development has been restricted. In recent years, health and medical big data related industries have been included in the national big data strategic layout, and policies related to medical and health big data have been frequently issued, as well as the development and progress of artificial intelligence technology, using machine learning algorithms to fully mine useful information in medical data become a research hotspot. [0003] The most core, largest and most valuable data in medical data is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06F16/35G06N3/04G06N3/08
CPCG06F16/35G06N3/08G06N3/044G06N3/045G06F18/2415G06F18/25G06F18/259G06F18/214
Inventor 郑申文韩振兴刘祥丁锴陈涛李建元
Owner ENJOYOR COMPANY LIMITED
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products