Scientific and technological paper data text semantic feature extraction method and system and storage medium

A technology of semantic features and extraction methods, applied in the computer field, can solve the problems of inability to extract semantic features of scientific papers, inability to extract features, and ignoring associations, etc., to achieve the effect of rich semantic representation

Active Publication Date: 2022-07-29
BEIJING UNIV OF POSTS & TELECOMM
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In the existing feature extraction methods based on the pre-trained corpus model, the semantic feature extraction is carried out for the text context. However, for scientific and technological papers, various attributes of the paper, especially keywords and titles, have a large number of correlations. These attributes can also be associated through the co-occurrence relationship of keywords. These attributes cover the main semantic information of the paper, but the existing feature extraction methods ignore the association between these paper attributes, and cannot be analyzed from the context and the paper association relationship at the same time. Feature extraction, which leads to the inability of existing feature extraction methods to extract the semantic features of scientific papers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Scientific and technological paper data text semantic feature extraction method and system and storage medium
  • Scientific and technological paper data text semantic feature extraction method and system and storage medium
  • Scientific and technological paper data text semantic feature extraction method and system and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] In order to make the purposes, technical solutions and advantages of the embodiments of the present invention more clearly understood, the embodiments of the present invention will be further described in detail below with reference to the accompanying drawings. Here, the exemplary embodiments of the present invention and their descriptions are used to explain the present invention, but not to limit the present invention.

[0040] Here, it should be noted that, in order to avoid obscuring the present invention due to unnecessary details, only structures and / or processing steps closely related to the solution according to the present invention are shown in the drawings, and the Invent other details that are less relevant.

[0041] It should be emphasized that the term "comprising / comprising / having" as used herein refers to the presence of a feature, element, step or component, but does not preclude the presence or addition of one or more other features, elements, steps o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a science and technology paper data text semantic feature extraction method and system and a storage medium, and the method comprises the steps: obtaining the text information of a science and technology paper, and constructing an entity relation graph based on the obtained text information of the science and technology paper, the text information comprises a paper title and a keyword, and the entity relation graph comprises the keyword; nodes in the entity relationship graph are paper titles or keywords, and edges in the entity relationship graph are association relationships among the nodes; semantic features are extracted based on the obtained text information of the science and technology papers, and a semantic feature matrix is obtained; determining an original adjacency matrix based on the entity relation graph, and inputting the semantic feature matrix and the original adjacency matrix into a graph network model to obtain a spatial feature matrix; and carrying out feature fusion on the semantic feature matrix and the spatial feature matrix to obtain final semantic features of the science and technology paper. According to the feature extraction method, on the basis of extracting the semantic features of the scientific and technological paper corpus, the semantic features of the scientific and technological paper can be well extracted by utilizing the spatial association of the knowledge graph.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method, system and storage medium for extracting semantic features of scientific and technological paper data text. Background technique [0002] As an important source of research results display and information acquisition, a large number of scientific papers are published almost every day. These academic results contain a variety of the latest professional field information. These scientific papers can be obtained effectively and quickly and the semantic features can be represented. and learning is particularly important. However, the data of scientific papers often contain a large number of complex attributes, such as abstracts, keywords, citations, etc. of the papers, and the correlation between papers is closer. In addition, a large amount of professional knowledge in papers covers a wide range of disciplines, which makes the feature extraction of scientific papers It ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/30G06K9/62G06N3/04G06N3/08
CPCG06F40/30G06N3/08G06N3/045G06F18/22G06F18/253
Inventor 薛哲杜军平郑长伟李文玲梁美玉邵蓥侠寇菲菲
Owner BEIJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products