Neural network and tag library-based statement similarity algorithm

A neural network and similarity calculation technology applied to sentence similarity algorithms. It addresses problems such as data sparseness, failure to account for synonym replacement, and inflexible editing operations.

Active Publication Date: 2010-07-14
成都安客云网络科技有限公司
Cites: 3; Cited by: 29

AI Technical Summary

Problems solved by technology

Among existing approaches, methods based on shared vocabulary have an obvious limitation: they cannot handle synonym replacement. Methods that use a semantic dictionary handle synonym replacement well, but a method that relies on the semantic dictionary alone does not consider the internal structure of the sentence or the interaction between words, so its accuracy is not high. Edit distance methods are usually used for fast fuzzy matching of sentences, but the editing operations they specify are not flexible enough, and they do not consider synonym replacement between words. Statistics-based methods require a large training corpus, which involves a very heavy workload, and they also suffer from data sparseness.
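
The edit distance limitation described above can be illustrated with a small sketch. It is not the patented algorithm: it is a standard word-level Levenshtein distance in which the substitution cost is lowered for word pairs found in a synonym set. The function names, the `syn_cost` value of 0.5, and the toy synonym set are all assumptions made for illustration.

```python
def synonym_edit_distance(a, b, synonyms, syn_cost=0.5):
    """Word-level edit distance with a cheaper substitution for synonyms.

    a, b      : lists of words
    synonyms  : set of frozenset({w1, w2}) pairs (hypothetical toy lexicon)
    syn_cost  : assumed cost of replacing a word by its synonym
    """
    def sub_cost(x, y):
        if x == y:
            return 0.0
        if frozenset((x, y)) in synonyms:
            return syn_cost
        return 1.0

    # prev[j] holds the distance between the first i-1 words of a
    # and the first j words of b.
    prev = [float(j) for j in range(len(b) + 1)]
    for i, x in enumerate(a, start=1):
        curr = [float(i)]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1.0,                 # delete x
                curr[j - 1] + 1.0,             # insert y
                prev[j - 1] + sub_cost(x, y),  # substitute x by y
            ))
        prev = curr
    return prev[-1]


def similarity(a, b, synonyms):
    """Map the distance to a score in [0, 1] (an assumed normalization)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - synonym_edit_distance(a, b, synonyms) / longest


if __name__ == "__main__":
    toy_synonyms = {frozenset(("buy", "purchase"))}
    s1 = "I want to buy a car".split()
    s2 = "I want to purchase a car".split()
    print(synonym_edit_distance(s1, s2, set()))         # plain edit distance
    print(synonym_edit_distance(s1, s2, toy_synonyms))  # synonym-aware
```

With an empty synonym set the function reduces to the ordinary word-level edit distance, which treats "buy" and "purchase" as unrelated words.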

Examples

Embodiment Construction

[0038] The present invention will be described in detail below by way of examples.

[0039] Here we first introduce the dependency-based semantic similarity algorithm and the edit distance algorithm.

[0040] 1. Dependency-based semantic similarity algorithm

[0041] Dependency syntax was proposed by the French linguist L. Tesnière in his book "Elements of Structural Syntax" (1959). Dependency grammar reveals the syntactic structure of a language unit by analyzing the dependency relationships between its components. It holds that the verb in a sentence is the central component that dominates the other components, while the verb itself is not dominated by any other component. Every dominated component is subordinate to its dominator through some dependency relation. In the 1970s, Robinson proposed four axioms about dependency relations in dependency grammar. In research on Chinese information processing, Chinese scholars added a fifth axiom of dependency. The axioms are as follows:

[0042] ① Only one element in a sentence is independent;

[0043] ② Other components are ...
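
As a minimal sketch of how a dependency structure can be represented, the example below stores each word's dominator as an index in a `heads` list and checks only axiom ① (exactly one independent element). The encoding, the use of `-1` for "no dominator", and the function names are assumptions for illustration; the remaining axioms are not covered because they are not given in full here.

```python
def independent_elements(heads):
    """Return the indices of elements that have no dominator.

    heads[i] is the index of the word that dominates word i,
    or -1 if word i is independent (an assumed encoding).
    """
    return [i for i, h in enumerate(heads) if h == -1]


def satisfies_first_axiom(heads):
    """Axiom 1: only one element in the sentence is independent."""
    return len(independent_elements(heads)) == 1


if __name__ == "__main__":
    words = ["She", "reads", "books"]
    heads = [1, -1, 1]  # the verb "reads" dominates both other words
    root = independent_elements(heads)[0]
    print(words[root], satisfies_first_axiom(heads))
```

In this toy sentence the verb is the single independent element, which matches the view that the verb dominates the other components without being dominated itself.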

Abstract

The invention discloses a neural network and tag library-based statement similarity algorithm in the information retrieval field, which is characterized by comprising the following steps: (1) loading a semantic dictionary and a synonym lexicon with a neural network respectively; (2) inputting a complete statement to be analyzed; (3) analyzing the integral syntactic structure of the statement by using a dependency grammar analyzer, then layering the statement, and acquiring an effective component sequence of the statement; (4) determining a corresponding header field of the statement in an exUCL tag library according to the layering and the effective component sequence thereof; and (5) judging whether the statement has similar word pairs, if so, calculating the similarity of the statement, otherwise, re-inputting a new statement to be analyzed, and performing the similarity calculation again. The algorithm combines the advantages of dependency-based statement similarity algorithm and edit distance algorithm so that the calculation precision is greatly improved.

Description

Technical field

[0001] The invention relates to a sentence similarity algorithm, in particular to a sentence similarity algorithm based on a neural network and a tag library.

Background

[0002] In recent years, the continuous emergence of new network applications, and especially the introduction and deepening of the Internet concept, has greatly changed network traffic and behavior. This has shaken the traditional theoretical basis of the Internet: the traffic model has changed from a Poisson distribution to one with self-similar properties. The lack of an accurate understanding and precise description of network traffic distribution, traffic characteristics, transmission efficiency, and user and network behavior seriously affects the effective use of network resources and the development of the network itself. As a result, the controllability and manageability of the network are getting worse and worse, and there is a sharp contradiction between the quality of service provide...

Application Information

IPC(8): G06F17/27, G06F17/30, G06N3/02
Inventor: 马建国, 邢玲, 王娟娟
Owner 成都安客云网络科技有限公司