Medical information data item name standardization method and system, equipment and medium

A project name, medical information technology, applied in the field of data standardization of medical data sources, can solve problems such as large amount of calculation, difficult data source data, time-consuming and labor-intensive, etc., to achieve the effect of strong adaptability and simplified calculation amount

Pending Publication Date: 2021-12-28
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This brings difficulties to the fusion of data from multiple data sources. Unified standardization of data from multiple data sources requires a lot of labor, time-consuming and labor-intensive. The existing standardization methods, on the one hand, vectorize the names and compare them, and the amount of calculation Large, and different vectorization will bring deviations in the results, it is impossible to unify, and standard adjustment is also difficult. On the other hand, it will establish a standard library and then standardize it after comparison, which has poor adaptability. The requirements are high, and the update is slow, and it is easy to encounter problems that cannot be matched, resulting in incomplete standardization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Medical information data item name standardization method and system, equipment and medium
  • Medical information data item name standardization method and system, equipment and medium
  • Medical information data item name standardization method and system, equipment and medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The present invention will be further described in detail below in conjunction with specific embodiments, which are explanations of the present invention rather than limitations.

[0044] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0045] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to data standardization of medical data sources, in particular to a medical information data item name standardization method and system, equipment and a medium, which can automatically standardize data of a plurality of data sources from a literal description level, is reasonable in design, simple in processing and high in adaptability, greatly liberates manpower and improves efficiency. The method comprises the following steps: unifying and de-duplicating acquired initial data item names of a plurality of medical information data sources in a character level to obtain data items with different names; constructing an n-gram feature set of each data item according to the number of characters of the name of each data item; according to the n-gram feature set of each data item, obtaining a character level-based name similarity between every two data items, and constructing a similar matrix; and clustering the data items greater than a similarity threshold in the similar matrix, and assigning the same standardization name for all the data items in each cluster for standardization.

Description

technical field [0001] The invention relates to data standardization of medical data sources, in particular to a method, system, device and medium for standardizing names of medical information data items. Background technique [0002] With the advancement of informatization construction in various industries, massive amounts of data are stored electronically. For example, in the medical industry, more and more medical institutions use a Hospital Information System (Hospital Information System, HIS system) to manage collected data. This type of information system improves the ability of data collection and management, but also brings about the problem of data standardization from different data sources. [0003] The HIS system of each medical institution has a set of data standard methods. However, the data standardization methods of different medical institutions are usually different, and it is very difficult to implement data standardization in multiple medical institut...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/247G06F40/216G06K9/62
CPCG06F40/247G06F40/216G06F18/23G06F18/22
Inventor 唐蕊
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products