A protein complex identification method based on node vectors

A protein complex identification technology, applied in the field of network data mining, that addresses the problem of low recognition performance and achieves the effects of reducing experimental costs and saving manpower and material resources.

Active Publication Date: 2020-01-14
DALIAN UNIV OF TECH

AI Technical Summary

Problems solved by technology

Compared with experimental methods, computational identification of protein complexes is low-cost and efficient, and can further promote the development of the life sciences. However, most current research does not fully exploit the topological characteristics of protein interaction networks or capture the characteristics of protein complexes within them, so recognition performance is low.




Embodiment Construction

[0033] The present invention is described below with reference to the accompanying drawings and specific embodiments:

[0034] Figure 1 is a schematic flowchart of the node-vector-based protein complex identification method of the present invention. As shown in Figure 1, the method includes the following steps:

[0035] S1. Collect a protein pair dataset containing protein interaction relationships: collect protein pairs that have interaction relationships from existing protein interaction databases, remove duplicate pairs and self-interacting pairs (a protein paired with itself), and store the remaining pairs in a unified format as the protein pair dataset;
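The cleaning described in step S1 could be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name `clean_protein_pairs`, the placeholder protein identifiers `P1` to `P3`, and the choice to treat interactions as undirected (so that `(A, B)` and `(B, A)` count as duplicates) are all assumptions.

```python
from typing import Iterable, List, Tuple


def clean_protein_pairs(pairs: Iterable[Tuple[str, str]]) -> List[Tuple[str, str]]:
    """Remove self-interactions and duplicate pairs (hypothetical helper).

    Assumption: interactions are undirected, so each pair is stored in
    sorted order as a simple "unified format".
    """
    seen = set()
    cleaned = []
    for a, b in pairs:
        a, b = a.strip(), b.strip()
        if a == b:
            continue  # drop self-connected pairs
        key = (a, b) if a <= b else (b, a)  # canonical order for the pair
        if key in seen:
            continue  # drop duplicates, including reversed duplicates
        seen.add(key)
        cleaned.append(key)
    return cleaned


# Illustrative input only; P1..P3 are placeholder identifiers.
raw_pairs = [("P1", "P2"), ("P2", "P1"), ("P1", "P1"), ("P3", "P2")]
pairs = clean_protein_pairs(raw_pairs)
print(pairs)
```

Storing each pair in sorted order makes reversed duplicates collide on the same key, which keeps the deduplication to a single pass.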

[0036] S2. Construct the protein interaction network: use the protein pair dataset to construct a protein interaction network G(V, E, W), where V is the set of nodes, E is the set of edges, and W is a weight set of...
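A network G(V, E, W) of this kind could be built from the cleaned pairs roughly as shown here. The sketch is hedged: the source text is truncated before it defines W, so every edge receives a placeholder weight of 1.0, and the names `build_network` and `adjacency` are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple

Edge = Tuple[str, str]


def build_network(pairs: Iterable[Edge]):
    """Build node set V, edge set E, weight map W and an adjacency map.

    Assumption: the graph is undirected and every edge starts with a
    placeholder weight of 1.0, since the source does not show how W is defined.
    """
    nodes: Set[str] = set()
    edges: Set[Edge] = set()
    weights: Dict[Edge, float] = {}
    adjacency: Dict[str, Set[str]] = defaultdict(set)
    for a, b in pairs:
        edge = (a, b) if a <= b else (b, a)  # canonical undirected edge
        nodes.update(edge)
        edges.add(edge)
        weights[edge] = 1.0  # placeholder weight (assumption)
        adjacency[a].add(b)
        adjacency[b].add(a)
    return nodes, edges, weights, dict(adjacency)


V, E, W, adj = build_network([("P1", "P2"), ("P2", "P3")])
print(sorted(V), sorted(E))
```

The adjacency map is not part of G(V, E, W) itself; it is included only because neighbor lookups are convenient for later graph traversal.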


Abstract

A protein complex identification method based on node vectors comprises the following steps: S1, collecting a protein pair dataset containing protein interaction relationships; S2, constructing a protein interaction network; S3, vectorizing the network nodes; S4, weighting the network edges; S5, selecting seed nodes; S6, expanding the seed nodes to form candidate protein complex subgraphs; S7, filtering the candidate protein complex subgraphs and outputting the finally identified protein complex subgraphs. The invention is applicable to identifying protein complexes from existing protein interaction relationships, is not limited by the source of those relationships, can effectively identify protein complexes, and will help to reveal, at the protein level, the basic mechanisms of life activities such as diseases.
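To make the relationship between steps S3 and S4 concrete, the sketch below weights each edge by the similarity of its two endpoint vectors. The abstract does not state which weighting function the invention uses, so cosine similarity is an assumption swapped in purely for illustration, and the toy vectors and the names `cosine_similarity` and `weight_edges` are hypothetical.

```python
import math
from typing import Dict, Iterable, Sequence, Tuple


def cosine_similarity(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity of two vectors; returns 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def weight_edges(
    edges: Iterable[Tuple[str, str]],
    vectors: Dict[str, Sequence[float]],
) -> Dict[Tuple[str, str], float]:
    """Assign each edge the similarity of its endpoint vectors (assumed scheme)."""
    return {(a, b): cosine_similarity(vectors[a], vectors[b]) for a, b in edges}


# Toy node vectors standing in for the output of step S3 (illustrative only).
vectors = {"P1": [1.0, 0.0], "P2": [1.0, 0.0], "P3": [0.0, 1.0]}
weights = weight_edges([("P1", "P2"), ("P2", "P3")], vectors)
print(weights)
```

Under this assumed scheme, proteins with similar vectors receive heavier edges, which is the kind of signal a seed-expansion step such as S5 and S6 could draw on.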

Description

Technical field

[0001] The invention relates to the field of network data mining methods, in particular to a method for identifying protein complexes based on node vectors.

Background

[0002] A protein complex is a group of proteins that interact to form a whole and carry out a particular biological function. Understanding the structure and function of protein complexes is the basis for exploring the mechanisms of various life activities. It can help reveal, at the protein level, the basic mechanisms of life activities such as diseases, and provide a comprehensive and holistic understanding of physiological processes such as disease occurrence and cell metabolism. Protein complex identification is the first step in protein complex research and an important basis for protein-related research. Therefore, how to identify protein complexes effectively has great theoretical and practical value.

[0003] The current methods for identifying protein com...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G16B20/00, G16B40/30
Inventor: 杨志豪, 刘晓霞
Owner: DALIAN UNIV OF TECH