Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Dialogue system training data construction method, device, electronic equipment and storage medium

A dialogue system and training data technology, applied in the field of data processing, can solve problems such as low efficiency and high cost, achieve the effect of reducing cycle time, improving accuracy and reliability, and saving labor costs

Active Publication Date: 2021-08-31
BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The dialog system training data construction method, device, electronic equipment, and storage medium proposed in this application are used to solve the problem of high cost and low efficiency in the related art of manually labeling data to construct training data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dialogue system training data construction method, device, electronic equipment and storage medium
  • Dialogue system training data construction method, device, electronic equipment and storage medium
  • Dialogue system training data construction method, device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] Embodiments of the present application are described in detail below, examples of which are illustrated in the accompanying drawings, wherein the same or similar reference numerals designate the same or similar elements throughout. The embodiments described below by referring to the figures are exemplary, and are intended to explain the present application, and should not be construed as limiting the present application.

[0018] The embodiment of the present application aims at the problem of high cost and low efficiency in the method of manually marking data to construct training data in related technologies, and proposes a method for constructing training data of a dialog system.

[0019] The dialogue system training data construction method provided by the embodiment of the present application can perform statistical processing on the historical use data of the dialogue system, determine the historical query statement set corresponding to the dialogue system, the que...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

This application proposes a dialogue system training data construction method, device, electronic equipment, and storage medium, wherein the method includes: performing statistical processing on the historical use data of the dialogue system, determining the historical query statement set corresponding to the dialogue system, each history The query frequency corresponding to the query statement and the identification result corresponding to each historical query statement; according to the query frequency and identification result corresponding to each historical query statement, the reference query statement is obtained from the historical query statement set; determine whether the number of all reference query statements is greater than The first threshold; if yes, use all reference query sentences and the recognition results corresponding to all reference query sentences to construct a training data set for the dialogue system. Therefore, the method for constructing the training data of the dialogue system not only saves labor cost, improves the construction efficiency of the training data set, but also further improves the accuracy and reliability of the dialogue system.

Description

technical field [0001] The present application relates to the technical field of data processing, and in particular to a method, device, electronic equipment and storage medium for constructing training data of a dialogue system. Background technique [0002] With the development of machine learning technology, especially the rapid development of neural networks in recent years, effective training data has become more and more important, and it is even called the "data oil" of the future. The Spoken Language Understanding (SLU) task in the field of Natural Language Processing (Natural Language Processing, referred to as NLP) aims to solve the problem of semantic understanding in human-computer dialogue, and parse the spoken dialogue (query) into intent (intent) and Slots are structured data for computer processing. [0003] In related technologies, machine learning technology is a main method for realizing SLU tasks. Realizing the SLU task through machine learning technolo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/332G06F16/33G06F16/335
Inventor 韩磊张红阳陈雷
Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products