Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Text similarity calculation method and device

A text similarity and calculation method technology, applied in the field of text similarity calculation methods and devices, can solve the problems affecting the service quality of chat robots, user experience, inaccurate similarity, inaccuracy, etc., so as to improve service quality and user satisfaction. Experience, overcome the low accuracy of semantic understanding, and improve the effect of computing accuracy

Active Publication Date: 2019-02-15
安徽省泰岳祥升软件有限公司
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The problem is that each individual vocabulary cannot accurately express the original meaning of the corresponding text, which results in inaccurate similarity between the texts calculated using each vocabulary, for example, there are two texts: I like you and you Like me, the meanings of these two texts are completely different, but the vocabulary after word segmentation of the two texts is exactly the same, then the similarity of the two texts calculated by using the existing technology is 1, which is obviously inaccurate
Furthermore, since the calculation of text similarity in the prior art is not accurate enough, the replies that the chat robot pushes for the user based on the text similarity must not be accurate enough, which seriously affects the service quality of the chat robot and the user experience.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Text similarity calculation method and device
  • Text similarity calculation method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of the present invention.

[0051] In one embodiment, a method for calculating text similarity is provided, such as figure 1 As shown, the method includes the following steps:

[0052] 110. Obtain the longest common subsequence of the first text and the second text;

[0053] In this step, the first text and the second text are two texts whose similarity needs to be calculated;

[0054] The longest common subsequence (LCS Longest Common Subsequence) refers to the longest subsequence i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method and a device for calculating text similarity are provided in that embodiment of the present invention, At first, that embodiment of the invention obtain the longest common sub-sequence of thetwo texts, and then computes the intersection and union of the vocabulary sets corresponding to the two texts, Then, the first similarity is calculated according to the obtained intersection and union, the second similarity is calculated according to the vocabulary set corresponding to the longest common subsequence and the union obtained before, and the target similarity of the two texts is calculated according to the first similarity and the second similarity. The technical scheme combines the longest common sub-sequence and each word in the text to calculate the similarity between the twotexts, which effectively improves the accuracy of text similarity calculation. Furthermore, the accurate text similarity can provide users with more accurate answers, which improves the quality of service and user experience of intelligent interaction.

Description

[0001] This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on June 5, 2018, the application number is 201810569749.6, and the invention title is "Text Similarity Calculation Method and Device", the entire content of which is incorporated into this application by reference . Technical field [0002] The embodiments of the present invention relate to the technical field of text processing, and more specifically, to a method and device for calculating text similarity. Background technique [0003] Chatbot is a popular application driven by big data and artificial intelligence technology. In the process of use, the user enters the chat content, that is, the user enters the question raised by the user. The chat robot automatically generates the corresponding reply according to the question entered by the user. And feedback to users. This artificial intelligence processing method can greatly improve service efficiency and user exper...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F17/27G06F17/22
CPCG06F40/194G06F40/289
Inventor 杨凯程李健铨蒋宏飞
Owner 安徽省泰岳祥升软件有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products