Combination drug recognition and ranking method based on medical literature database

A ranking method for medical literature, applied to computer technology in the clinical medical field, that addresses errors in ranking results, among other problems.

Pending Publication Date: 2017-05-24

CHONGQING UNIV

6 Cites, 16 Cited by

AI Technical Summary

Problems solved by technology

Although the existing method proposes a scientific way to rank treatments from the medical literature, it has a problem. MedRank ranks individual drugs for a given disease, but many papers now describe treatment plans that combine several drugs. When MedRank processes such a paper, it treats the drugs mentioned as parallel, that is, as if each drug had its own therapeutic effect on the disease. This misreads the paper and introduces errors into the results.

Examples

Embodiment 1

[0056] As shown in Figure 1, which is a schematic diagram, the method for recognizing and ranking combination drugs based on a medical literature database provided in this embodiment first uses text mining to extract classification features from the abstracts that meet the requirements. It then classifies the abstracts with a support vector machine (SVM) model from machine learning, using a genetic algorithm to optimize the parameters of the SVM model. This identifies the documents that mention multiple drugs and the combination relationship between those drugs. Finally, the MedRank algorithm ranks these documents to produce recommended combination drugs for a given disease.

[0057] The classification features can be extracted with a straightforward Java implementation, and the support vector machine classification can use a simple, easy-to-use, fast and effective...
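The parameter optimization step in this embodiment can be illustrated with a minimal genetic algorithm sketch. It is not the patent's implementation: the source does not say which SVM parameters are tuned or how fitness is measured, so the sketch assumes two parameters searched on a log scale (log10 C and log10 gamma) and uses a hypothetical `surrogate_fitness` function as a stand-in for cross-validated SVM accuracy. The population size, mutation settings, and the name `genetic_search` are likewise illustrative choices.

```python
import math
import random


def surrogate_fitness(log_c, log_gamma):
    """ASSUMPTION: stand-in for cross-validated SVM accuracy.

    A smooth synthetic surface that peaks at log_c=1.0, log_gamma=-2.0.
    In a real pipeline this would train and score an SVM instead.
    """
    return math.exp(-((log_c - 1.0) ** 2 + (log_gamma + 2.0) ** 2) / 4.0)


def genetic_search(fitness, bounds, pop_size=30, generations=40,
                   mutation_rate=0.2, seed=0):
    """Maximize fitness(*genes) with elitism, tournament selection,
    uniform crossover and Gaussian mutation."""
    rng = random.Random(seed)

    def score(individual):
        return fitness(*individual)

    def clip(value, low, high):
        return max(low, min(high, value))

    population = [[rng.uniform(low, high) for low, high in bounds]
                  for _ in range(pop_size)]

    for _ in range(generations):
        ranked = sorted(population, key=score, reverse=True)
        children = list(ranked[: pop_size // 5])  # elitism: keep the top 20%
        while len(children) < pop_size:
            # Tournament selection: best of three random candidates.
            parent_a = max(rng.sample(ranked, 3), key=score)
            parent_b = max(rng.sample(ranked, 3), key=score)
            # Uniform crossover: each gene comes from either parent.
            child = [a if rng.random() < 0.5 else b
                     for a, b in zip(parent_a, parent_b)]
            # Gaussian mutation, clipped to the search bounds.
            child = [clip(gene + rng.gauss(0.0, 0.3), low, high)
                     if rng.random() < mutation_rate else gene
                     for gene, (low, high) in zip(child, bounds)]
            children.append(child)
        population = children

    return max(population, key=score)


SEARCH_BOUNDS = [(-5.0, 5.0), (-5.0, 5.0)]  # log10(C), log10(gamma)
best_log_c, best_log_gamma = genetic_search(surrogate_fitness, SEARCH_BOUNDS)
best_c, best_gamma = 10 ** best_log_c, 10 ** best_log_gamma
```

Searching in log space is a common choice for SVM parameters because useful values span several orders of magnitude. Replacing `surrogate_fitness` with a function that trains the classifier on the labeled abstracts and returns its validation accuracy would turn the sketch into an actual tuning loop.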

Embodiment 2

[0059] The method provided in this embodiment is as follows:

[0060] First, retrieve the records for articles on the specified disease from the MEDLINE literature database, and use drug entity recognition to identify the records that mention multiple drugs. The abstract and title of each article form the data set. Part of this data set is then labeled manually to serve as the training set and the test set, with each document marked as expressing either a combination relationship or a non-combination relationship between the drugs. Next, the chi-square (CHI) statistic, a feature selection method from text mining, is used to extract classification keywords, and TF-IDF is used to weight each keyword as a feature. The selected classification features include the classification keywords, whether the drugs appear in the same sentence, word features, part-of-speech features, logical features, and dependency syntax features of this sente...
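The keyword selection and weighting described in this embodiment can be sketched as follows. The six labeled sentences are invented for illustration and are not from MEDLINE. The chi-square score uses the standard 2x2 contingency form for a term and a class, and the TF-IDF variant shown (relative term frequency times the natural log of N/df) is one common definition; the source does not say which variant the patent uses. The function names are hypothetical.

```python
import math

# INVENTED toy data: 1 = combination relationship, 0 = no combination.
LABELED_DOCS = [
    ("amlodipine combined with valsartan reduced blood pressure", 1),
    ("combination of enalapril plus hydrochlorothiazide was effective", 1),
    ("losartan combined with amlodipine therapy improved control", 1),
    ("atenolol was compared with captopril in hypertensive patients", 0),
    ("nifedipine versus enalapril showed similar reduction", 0),
    ("patients received either losartan or atenolol", 0),
]
TOKENIZED = [(text.split(), label) for text, label in LABELED_DOCS]


def chi_square(term, tokenized_docs, target=1):
    """Chi-square statistic between a term and the target class."""
    a = b = c = d = 0
    for tokens, label in tokenized_docs:
        present = term in tokens
        if present and label == target:
            a += 1      # term present, in class
        elif present:
            b += 1      # term present, not in class
        elif label == target:
            c += 1      # term absent, in class
        else:
            d += 1      # term absent, not in class
    n = a + b + c + d
    denominator = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - c * b) ** 2 / denominator if denominator else 0.0


def tf_idf(term, tokens, tokenized_docs):
    """Relative term frequency times log inverse document frequency."""
    document_frequency = sum(1 for toks, _ in tokenized_docs if term in toks)
    if document_frequency == 0:
        return 0.0
    term_frequency = tokens.count(term) / len(tokens)
    return term_frequency * math.log(len(tokenized_docs) / document_frequency)


# Rank the vocabulary by chi-square and keep the top k terms as keywords.
vocabulary = sorted({tok for tokens, _ in TOKENIZED for tok in tokens})
keywords = sorted(vocabulary,
                  key=lambda t: (-chi_square(t, TOKENIZED), t))[:4]

# One TF-IDF weighted feature vector per document, one column per keyword.
feature_vectors = [[tf_idf(k, tokens, TOKENIZED) for k in keywords]
                   for tokens, _ in TOKENIZED]
```

On this toy data, terms that occur only in one class (such as "combined") score highest, while a term that occurs equally in both classes (such as "enalapril") scores zero. The other features listed in the paragraph, such as same-sentence co-occurrence or part-of-speech features, would be appended to these vectors as extra columns.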

Embodiment 3

[0097] This embodiment uses data from the MEDLINE medical literature data set covering 1966 to 2015, in the XML format provided by MEDLINE. The format of the data set is as follows:

[0098] Each bibliographic record is delimited by a start tag and an end tag. The key fields included are described below:

[0099]

[0100] The disease studied in this example is hypertension.

[0101] 2. Specific steps:

[0102] Retrieve the records whose MeSH terms contain the keywords "humans" and "hypertension";

[0103] Retrieve the documents whose abstracts contain multiple drug entities, obtaining 7911 abstracts as the original corpus;

[0104] Manually annotate some of the abstracts, labeling each as an abstract with a combination relationship or an abstract without a combination relationship;

[0105] Use the text representation method and text feature selection method in text mining to extract classification keywords. Finally, 20 classification keywords are selected, and thei...
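The first two steps above (filtering records by MeSH term, then keeping abstracts that mention multiple drugs) can be sketched as follows. Several things here are assumptions: the tag names in this copy of the patent text were lost, so the sketch uses element names from the public MEDLINE/PubMed XML format (`MedlineCitation`, `PMID`, `AbstractText`, `DescriptorName`); the sample records are invented; and simple matching against a hypothetical `DRUG_LEXICON` stands in for the drug entity recognition step, whose method the source does not specify.

```python
import re
import xml.etree.ElementTree as ET

# INVENTED sample records in a simplified MEDLINE-style layout.
SAMPLE_XML = """
<MedlineCitationSet>
  <MedlineCitation>
    <PMID>1</PMID>
    <Article>
      <ArticleTitle>Invented title A</ArticleTitle>
      <Abstract><AbstractText>Amlodipine combined with valsartan lowered blood pressure.</AbstractText></Abstract>
    </Article>
    <MeshHeadingList>
      <MeshHeading><DescriptorName>Humans</DescriptorName></MeshHeading>
      <MeshHeading><DescriptorName>Hypertension</DescriptorName></MeshHeading>
    </MeshHeadingList>
  </MedlineCitation>
  <MedlineCitation>
    <PMID>2</PMID>
    <Article>
      <ArticleTitle>Invented title B</ArticleTitle>
      <Abstract><AbstractText>Enalapril alone was evaluated in adults.</AbstractText></Abstract>
    </Article>
    <MeshHeadingList>
      <MeshHeading><DescriptorName>Humans</DescriptorName></MeshHeading>
      <MeshHeading><DescriptorName>Hypertension</DescriptorName></MeshHeading>
    </MeshHeadingList>
  </MedlineCitation>
  <MedlineCitation>
    <PMID>3</PMID>
    <Article>
      <ArticleTitle>Invented title C</ArticleTitle>
      <Abstract><AbstractText>Losartan and amlodipine were given to rats.</AbstractText></Abstract>
    </Article>
    <MeshHeadingList>
      <MeshHeading><DescriptorName>Animals</DescriptorName></MeshHeading>
      <MeshHeading><DescriptorName>Hypertension</DescriptorName></MeshHeading>
    </MeshHeadingList>
  </MedlineCitation>
</MedlineCitationSet>
"""

# HYPOTHETICAL lexicon; a real system would use a full drug vocabulary
# or a trained entity recognizer.
DRUG_LEXICON = {"amlodipine", "valsartan", "enalapril", "losartan"}


def mesh_terms(citation):
    """Lower-cased MeSH descriptor names attached to one record."""
    return {node.text.strip().lower()
            for node in citation.iter("DescriptorName") if node.text}


def drugs_in(text):
    """Distinct lexicon drugs mentioned in the text."""
    return {word for word in re.findall(r"[a-z]+", text.lower())
            if word in DRUG_LEXICON}


def select_corpus(xml_text, required_mesh=("humans", "hypertension")):
    """Return (PMID, drugs) for records that carry every required MeSH term
    and mention at least two distinct drugs in the abstract."""
    root = ET.fromstring(xml_text.strip())
    corpus = []
    for citation in root.iter("MedlineCitation"):
        if not set(required_mesh) <= mesh_terms(citation):
            continue
        abstract = " ".join(node.text or ""
                            for node in citation.iter("AbstractText"))
        found = drugs_in(abstract)
        if len(found) >= 2:
            corpus.append((citation.findtext("PMID"), sorted(found)))
    return corpus


corpus = select_corpus(SAMPLE_XML)
```

Of the three invented records, only the first passes both filters: the second mentions a single drug and the third lacks the "humans" MeSH term. For the full MEDLINE files, `xml.etree.ElementTree.iterparse` would avoid loading a whole file into memory.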

Abstract

The invention discloses a combination drug recognition and ranking method based on a medical literature database. First, public medical literature abstracts are retrieved from the medical literature database and the drug entities in them are recognized. Then a feature extraction method from text mining extracts features, a machine learning classification algorithm classifies the drugs, and an optimization algorithm tunes the parameters of the classification algorithm. Finally, MedRank ranks the combination drugs to obtain a recommended combination drug regimen for a given disease. Using text mining and machine learning, the method addresses the problem that medical researchers cannot read through, and find patterns in, a body of medical literature that grows exponentially every year. The ranking of combination drugs for treating a given disease, and how that ranking has changed over the years in the literature, can be obtained quickly, which relieves the pressure on medical researchers of reading the mass of literature.

Description

Technical field

[0001] The invention relates to computer technology in the clinical medical field, in particular to a combination drug recognition and ranking method based on a medical literature database.

Background technique

[0002] Medical literature has become an important source of information for medical researchers and practitioners, but as the volume of information in society grows explosively, so does the volume of medical information. According to statistics, medical information resources account for more than 30% of Internet information resources, and the amount of medical literature is growing at an alarming rate. There are nearly 30,000 medical journals in the world, and more than 2 million papers are published every year, with an annual growth rate of 7%. Keeping up with the constantly updated medical literature has become a major challenge for medical researchers and practitioners. On average, clinicians have to read a large amount of professional li...

Application Information

IPC(8): G06F17/30

CPC: G06F16/35, G06F16/367

Inventor: 李学明, 张琦

Owner: CHONGQING UNIV
