Test corpus generation method and device and electronic equipment

A corpus and corpus technology, applied in the field of big data, can solve problems such as limited testing time and human resource investment, and the inability to test the hit rate of question-and-answer robots.

Pending Publication Date: 2020-08-04
BANK OF CHINA
View PDF8 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] During the test process, the test time and human resources are limited, and it is impossible to formulate enough test corpus under t

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Test corpus generation method and device and electronic equipment
  • Test corpus generation method and device and electronic equipment
  • Test corpus generation method and device and electronic equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] Technical terms:

[0061] Dependency syntax was first proposed by French linguist L. Tesniere, which analyzes sentences into a dependency syntax tree and describes the dependency relationship between words. That is to say, it points out the syntactic collocation relationship between words, which is related to semantics.

[0062] The following will clearly and completely describe the technical solutions in the embodiments of the application with reference to the drawings in the embodiments of the application. Apparently, the described embodiments are only some of the embodiments of the application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0063] It can be understood that there are multiple business function points in the business system, and each business function point has several historica...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a test corpus generation method and device and electronic equipment. The method comprises the steps of obtaining historical test corpora on a service function point line; determining a grammatical structure and a vocabulary set corresponding to the service function points in the historical test corpus; judging whether a grammar structure white list corresponding to the service function point contains the grammar structure or not; if so, querying synonym sets in one-to-one correspondence with the vocabularies in the vocabulary set from a synonym library of the service function points; establishing a plurality of synonym phrases based on the synonym sets and the syntax structure; and respectively replacing the vocabulary set in the historical test corpus with each synonym phrase to obtain a plurality of test corpuses of the service function point. A plurality of test corpora are extended from an online real historical test corpus, so that the test corpus is greatlyenriched.

Description

technical field [0001] The present application relates to the technical field of big data, in particular to a test corpus generation method, device and electronic equipment. Background technique [0002] Question-answering robots have gradually entered banks and some customer service systems. In order to test the performance of the question-answering robot, the hit rate is usually used to measure the performance of the robot. That is, some test corpus is used to test the question-answering robot, so as to obtain the hit rate of the question-answering robot. [0003] During the testing process, the testing time and human resource investment are limited, and it is impossible to formulate enough test corpus under the condition of limited resources, and it is impossible to accurately and comprehensively test the hit rate of the question-answering robot. [0004] Therefore, there is a need for a solution that can automatically expand the test corpus so as to enrich the test tes...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F40/253G06F40/247
CPCG06F40/253G06F40/247Y02D10/00
Inventor 宋雨
Owner BANK OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products