Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Prediction Method of Scientific Research Collaboration Based on Heterogeneous Information Network

A heterogeneous information network and heterogeneous network technology, which is applied in the field of scientific research cooperation relationship prediction based on heterogeneous information network, can solve the problems of high computational complexity of topology features, error in prediction results, loss of semantic information, etc., to avoid calculation errors. , improve accuracy, and solve the effect of excessive dimension

Active Publication Date: 2021-05-25
ANHUI UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the literature [The link-prediction problem for social network, 2007] judges the similarity between authors by comparing the node topology similarity index in several networks in the co-authored network, including common neighbors, Jaccard, SimRank and Katz, etc. Through experimental analysis, the Katz index is superior to several other similarity calculations. The method based on machine learning mainly uses the classification model, especially the idea of ​​binary classification, to predict the cooperation between authors. The literature [Predicting co-author relationship in medical co-authorship network, 2014 ] Treat scientific research cooperation prediction as a binary classification problem, use the structural similarity index as a feature in the co-authorship network, and then train the model, and use logistic regression and SVM as the prediction model to predict the links between author nodes, but the above methods are not It is based on a homogeneous network. Although the calculation is simple, it loses rich semantic information, which may lead to errors in prediction results.
[0004] In addition, there are also a small number of studies based on heterogeneous networks. Among them, the literature [Co-author relationship prediction in heterogeneous bibliographic network, 2011] took the lead in applying meta-path-based topological features to heterogeneous networks and using logistic regression models to predict cooperative relationships. , but the computational complexity of several topological features used in this method is relatively high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Prediction Method of Scientific Research Collaboration Based on Heterogeneous Information Network
  • A Prediction Method of Scientific Research Collaboration Based on Heterogeneous Information Network
  • A Prediction Method of Scientific Research Collaboration Based on Heterogeneous Information Network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0066] In this embodiment, a method for predicting scientific research collaborations based on heterogeneous information networks is applied to any two authors who have not collaborated before, predicting the possibility of their future cooperation; specifically, as figure 1 As shown, proceed as follows:

[0067] Step 1. The heterogeneous network of scientific and technological literature is transformed into an author-author isomorphic network:

[0068] Step 1.1, build a heterogeneous network of scientific and technological literature G=(V, E); where, V represents the node set in the heterogeneous network of scientific and technological literature, and the types of node sets include author node sets, paper node sets, conference node sets and term nodes set; author node set is denoted as A={A 1 ,A 2 ,...,A i ,...,A a}, A i Indicates the i-th author node, 1≤i≤a; the paper node set is recorded as P={P 1 ,P 2 ,...P w ,...P p}, P w Indicates the wth paper node, 1≤w≤p; the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for predicting scientific research cooperation relationships based on heterogeneous information networks, comprising the following steps: 1. Transforming the heterogeneous network of scientific and technological literature into an author-author isomorphic network; 2. Vector representation of author nodes in the isomorphic network; 3. Calculation of similarity between authors. The present invention is applied to any two authors who have not yet cooperated to predict the possibility of their cooperation in the future, thereby effectively solving the problem of information loss and simplification in the author-author homogeneous network, so as to increase the accuracy of cooperation prediction and help Scholars conduct scientific research more efficiently.

Description

technical field [0001] The invention belongs to the technical field of information retrieval, and in particular relates to a method for predicting scientific research cooperation relationships based on heterogeneous information networks. Background technique [0002] In recent years, with the diversification and complexity of research problems, multidisciplinary integration has become more and more common. At the same time, changes in scholars' research directions have also led to an increase in the number of author collaborations in various fields. Finding the most valuable collaborators from the vast academic digital library is a great challenge. This makes the prediction of scientific research partnerships more and more important. [0003] At present, the existing scientific research cooperation relationship prediction mainly adopts similarity-based methods and machine learning-based methods. The similarity-based methods mainly focus on text similarity and structural sim...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06Q10/04G06F40/205G06N3/04
CPCG06Q10/04G06N3/045
Inventor 陈志立杨晴叶凡仲红
Owner ANHUI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products