Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Question and answer corpus generation method and device based on text generation model

A technology for generating models and texts, applied in natural language data processing, special data processing applications, instruments, etc., can solve problems such as the inability to provide FAQ corpus, insufficient matching between questions and answers, and achieve quality improvement Effect

Pending Publication Date: 2021-02-05
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The quantity and quality of the FAQ corpus is the basis of the entire system, but there is currently no way to give a general and fully covered FAQ corpus, so each vertical field needs to start building the FAQ corpus from scratch
Reconstructing the corpus usually uses the method of entering historical data to create FAQs. However, this data entry method will result in insufficient matching between some of the entered questions and answers.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Question and answer corpus generation method and device based on text generation model
  • Question and answer corpus generation method and device based on text generation model
  • Question and answer corpus generation method and device based on text generation model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, and are not intended to limit the present application.

[0059] The question and answer corpus generation method based on the text generation model provided by this application can be applied to such as figure 1 shown in the application environment. Wherein, the terminal 102 communicates with the server 104 through the network. The server responds to the question-and-answer corpus generation request of the terminal, generates the request according to the question-and-answer corpus, obtains historical questions and standard documents, extracts keywords in standard documents and paraphrases corresponding to keywords, performs wor...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of artificial intelligence, and provides a question and answer corpus generation method and device based on a text generation model, computer equipment and a storagemedium. The method comprises the steps of obtaining historical questions and a standard document, extracting keywords in the standard document and paraphrasing sentences corresponding to the keywords, performing word segmentation processing on the historical questions, identifying and discarding entity nouns in the historical questions to obtain syntactic feature words of the historical questions, combining the syntactic feature words with the keywords, and inputting the combined data into a pre-trained text generation model to obtain a target question corresponding to the keyword, wherein the text generation model by training based on a training sample marked with the keyword and syntax feature words are obtained, and according to the target question corresponding to the keyword and a paraphrasing statement corresponding to the keyword, a question-answer pair comprising the target question sentence and the paraphrasing sentence is constructed so as to improve the quality of the target question sentence and the question-answer pair.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, in particular to a question-answer corpus generation method, device, computer equipment and storage medium based on a text generation model. Background technique [0002] With the development of artificial intelligence technology, artificial intelligence has been applied in more and more scenarios. Among them, the question answering system is one of the important fields of artificial intelligence, especially for many businesses that currently need a customer service system to solve some of the user's questions, and most of the user's questions are concentrated on some high-frequency problems in the head. It is the motivation of Frequently Asked Questions (FAQ, frequently asked questions). [0003] The quantity and quality of the FAQ corpus are the basis of the entire system, but there is currently no way to provide a general and fully covered FAQ corpus, so each vertica...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/332G06F16/335G06F40/279
CPCG06F16/3329G06F16/335G06F40/279Y02D10/00
Inventor 谢忠玉陈立
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products