Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Novel molecular similarity search algorithm based on graph structure attention

A similarity search and attention technology, applied in the field of molecular similarity search, can solve the problems of increased computing time, huge amount of data, limiting the speed of shape recognition methods, etc., to achieve the effect of ensuring universality and improving accuracy

Pending Publication Date: 2020-07-17
OCEAN UNIV OF CHINA +1
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the huge amount of data in the molecular database, a molecule needs to be subjected to exponentially-sized incremental experiments, which greatly increases the calculation time and limits the speed of using the shape recognition method.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Novel molecular similarity search algorithm based on graph structure attention
  • Novel molecular similarity search algorithm based on graph structure attention
  • Novel molecular similarity search algorithm based on graph structure attention

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0041] according to Figure 1 to Figure 4 As shown, the center of a new molecular similarity search algorithm based on graph structure attention is the similarity calculation formula, which uses similar molecules as pre-training data to train the parameters in the formula. Use the formula obtained from training to perform similarity calculation on the input molecular graph, and obtain two molecular similarity scores.

[0042] Using the existing method to convert t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a novel molecular similarity search algorithm based on graph structure attention, which includes the following steps: 1) with similar molecules as a pretraining data, expressingthe similar molecules into the format of molecular graph through a graph convolution neural network, and converting the molecular graph into feature vectors by introducing the attention mechanism into the graph convolution neural network; 2) training the network to obtain the parameters in the formula, thus generating the similarity calculation formula; 3) expressing the new molecular by the formula of the new molecular graph through the method in the step 1), and converting the new molecular graph into a new feature vector, and performing similarity calculation by using the similarity calculation formula to obtain similarity score. By means of the method, the accuracy of the molecular similarity calculation is improved on the basis of guaranteeing the universality of searching.

Description

technical field [0001] The invention belongs to the field of molecular similarity search, in particular to a novel molecular similarity search algorithm based on graph structure attention. Background technique [0002] At present, with the rapid development of information technology, computers play an increasingly important role in information technology in the field of drug discovery. In recent years, the development of artificial intelligence has brought more possibilities for drug discovery. Molecular similarity search is an important technology in drug small molecule discovery engineering. It has laid a theoretical foundation for many existing virtual screening technologies. Molecules are a commonly used screening tool. [0003] In chemical analysis, the analysis of physical and chemical properties based on similarity, the principle of similar miscibility, etc. are all based on [0004] "Compounds with similar structures generally have similar physical and chemical pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16C20/20G16C20/70G06N3/04G06N3/08
CPCG16C20/20G16C20/70G06N3/08G06N3/045Y02D10/00
Inventor 魏志强李臻王爽乔木张树刚
Owner OCEAN UNIV OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products