Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Method and device for screening similar statements in a database

A database and sentence technology, applied in the field of data processing, can solve the problems of insufficient screening results, high experience correlation, and time-consuming, etc., and achieve the effects of perfect screening results, improving screening efficiency, and saving time

Active Publication Date: 2019-05-31
IFLYTEK CO LTD
View PDF8 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] After research, the inventor found that in the prior art, the proportion of random selection by the staff is limited and only some sentence patterns can be extracted, which has a certain risk of missed detection; according to the staff's Experience manual screening requires a lot of manpower and time-consuming, and has high professional requirements for the staff. The screening results are highly correlated with the experience of the staff, which ultimately leads to low screening efficiency and insufficient screening results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for screening similar statements in a database
  • Method and device for screening similar statements in a database
  • Method and device for screening similar statements in a database

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] In order to enable those skilled in the art to better understand the solution of the present application, the technical solution in the embodiment of the application will be clearly and completely described below in conjunction with the accompanying drawings in the embodiment of the application. Obviously, the described embodiment is only It is a part of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0053] At present, more and more fields need to apply knowledge bases, such as multi-channel intelligent customer service, intelligent robots, and intelligent voice navigation. However, the extended sentences of different standard sentences in the knowledge base may be intertwined, and the gap between different standard sentences It may be confusing. There are cert...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and device for screening similar statements in a database. The method comprises the steps that semantic vectors and semantic vectors of multiple target extension statements of a target standard statement are utilized to expect to screen all the target extension statements to obtain extension statements to be subjected to quality inspection; Calculating the similarity between the semantic vector of each extended statement to be subjected to quality inspection and the semantic vectors of other extended statements in the database; Wherein the semantic vector is obtained through a semantic measurement model; And based on the similarity and a first preset screening condition, screening each extended statement to be subjected to quality inspection and each otherextended statement to obtain similar extended statements to be subjected to quality inspection and corresponding similar other extended statements. Therefore, only the target extension statement needing quality inspection is screened as the to-be-quality-inspected extension statement, and the number of the to-be-quality-inspected extension statements is reduced; Compared with a screening result obtained through manual screening, the automatic similar statement screening is more perfect and accurate, manpower and time are saved, and the screening efficiency is improved.

Description

technical field [0001] The present application relates to the technical field of data processing, in particular to a method and device for screening similar sentences in a database. Background technique [0002] With the rapid development of artificial intelligence, the application of intelligent customer service system that best embodies artificial intelligence technology is becoming more and more extensive. The realization of intelligent customer service system depends on its core "brain" - knowledge base. The knowledge points in the knowledge base generally adopt the “question-answer” input-output form, in which the text input representing the knowledge point is called a standard sentence, and the text input derived from the expansion and deformation of the standard sentence is called an extended sentence, which has the same semantics as the standard sentence Same, slightly different text. Usually, the knowledge base includes multiple standard sentences, and each standar...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/33G06F17/27G06Q30/02
Inventor 黄永江邱志国庄纪军张毅赵乾
Owner IFLYTEK CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products