Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text mining method for heterogeneous graph conversion based on meta-structure learning

A text mining and heterogeneous graph technology, applied in neural learning methods, unstructured text data retrieval, data mining, etc., can solve problems such as mining of difficult semantic patterns, loss of semantic information, and complex real relationships

Pending Publication Date: 2021-12-17
SHIJIAZHUANG TIEDAO UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the limitations of human experience, manually constructed feature engineering can only capture part of the information
And the meta-path considers each type of relationship between nodes separately. However, the real relationship in a heterogeneous environment is often quite complex, and there may be different types of relationships between nodes, and the meta-path cannot represent two semantic relationships that work at the same time. , which further leads to the loss of semantic information
In traditional text classification methods, limited by word order and lack of text information types, it is difficult to mine information-rich semantic patterns

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text mining method for heterogeneous graph conversion based on meta-structure learning
  • Text mining method for heterogeneous graph conversion based on meta-structure learning
  • Text mining method for heterogeneous graph conversion based on meta-structure learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the purpose, technical solution and advantages of the present invention clearer, the present invention will be further elaborated below in conjunction with the accompanying drawings.

[0052] In this example, see figure 1 and 2 As shown, the present invention proposes a text mining method for heterogeneous graph conversion based on meta-structure learning, including steps:

[0053] S10, for the text data, extracting information in the text to construct a heterogeneous information network graph;

[0054] S20, using the graph conversion layer to obtain meta-paths to capture the relationship between nodes;

[0055] S30, extracting the metagraph structure by establishing a channel-type Hadamard product module, so as to capture multiple interactions between nodes;

[0056] S40, using a graph convolutional network for the extracted meta-structure including meta-paths and meta-graphs, to generate node embeddings;

[0057] S50, using the obtained node embed...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a text mining method for heterogeneous graph conversion based on meta-structure learning, which comprises the following steps of: aiming at text data, extracting information in a text to construct a heterogeneous information network graph; acquiring a meta-path through a graph conversion layer to capture a relationship between nodes; extracting a meta-graph structure by establishing a channel type Hadamard product module, so that multiple interaction conditions existing between nodes at the same time are captured; generating node embedding for the extracted meta-structure containing the meta-path and the meta-graph by using a graph convolutional network; and embedding and mining a downstream text by using the obtained nodes. The method can be suitable for a complex text recognition environment, loss of semantic information is effectively avoided, and rich and complete semantic information can be obtained.

Description

technical field [0001] The invention belongs to the technical field of text mining, in particular to a text mining method for heterogeneous graph conversion based on meta-structure learning. Background technique [0002] With the development of Internet technology, global informatization data presents the characteristics of explosive growth, mass accumulation, and rapid dissemination. We have entered a "big data era", which has had a major impact on cultural communication, information management, etc. Natural language Processing technology has received more and more attention and has become a current hot spot. Natural language processing refers to allowing computers to accept user input in the form of natural language, and internally perform a series of operations such as processing and calculation through algorithms defined by humans, so as to simulate human understanding of natural language and return the results expected by users. The purpose of natural language processi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/33G06F16/335G06F16/35G06F17/16G06F40/30G06N3/04G06N3/08
CPCG06F16/3344G06F16/335G06F16/35G06F17/16G06F40/30G06N3/08G06F2216/03G06N3/045Y02D10/00
Inventor 王书海彭浩刘明瑞刘欣
Owner SHIJIAZHUANG TIEDAO UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products