Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Neural machine translation system-oriented sentence granularity metamorphic test method

A technology of machine translation and metamorphosis testing, which is applied in natural language translation, biological neural network models, instruments, etc., and can solve problems such as ignoring word order, grammar, and semantics

Pending Publication Date: 2022-08-09
XIAN UNIV OF POSTS & TELECOMM
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although the similarity calculation method based on cosine distance is well constructed, it ignores the original word order, grammar and semantics of the text

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural machine translation system-oriented sentence granularity metamorphic test method
  • Neural machine translation system-oriented sentence granularity metamorphic test method
  • Neural machine translation system-oriented sentence granularity metamorphic test method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to make the purpose, technical solutions and advantages of the present invention clearer, taking Google translation engine as an example, the specific implementation of the sentence granularity transformation test method for neural machine translation system proposed by the present invention will be described.

[0031]Step 1: Select the Google translation engine as the translation model T, and select 5000 pairs of Chinese and English sentences in the five fields of education, Weibo, news, spoken language and subtitles in the UM-Corpus dataset as the original dataset D 1 , D 2 ;

[0032] Step 2: Put D 1 Input into the translation model T to get the Chinese test case set D 1t , D 1t Input into the translation model T to get the English test case set D 1 ’, and then D 1 'Enter the translation model T to get the Chinese test case set D 1t ';

[0033] Step 3: For the original dataset D 1 and the generated test case set D 1t , D 1 ', D 1t ' Perform data pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a sentence granularity metamorphic testing method oriented to a neural machine translation system, and belongs to the field of metamorphic testing, aiming at the translation robustness evaluation problem existing in the neural machine translation system. The method comprises the following steps: firstly, carrying out data preprocessing operation on two groups of texts which need to be subjected to similarity calculation to generate two groups of word lists; then, using a TF-IDF-based word bag model to construct text vectors for the two groups of word lists, and calculating an included angle cosine value of the two groups of text vectors; meanwhile, calculating a Jaccard similarity coefficient and an editing distance of the two groups of word lists; and finally, calculating the similarity between the sentences and the metamorphic relationship satisfaction rate according to the defined similarity calculation formula and the metamorphic relationship. According to the method, the problem that a single cosine similarity method is difficult to reflect semantic changes caused by different word sequences of the sentences is solved, the capability of distinguishing the semantic changes caused by the word sequence changes of the sentences is enhanced, and the sentence similarity calculation accuracy and the translation quality evaluation accuracy are improved.

Description

technical field [0001] The invention belongs to the field of metamorphosis testing, in particular to the problem of sentence granularity similarity calculation in the metamorphosis test process of a neural machine translation system, and proposes a sentence granularity metamorphosis test method oriented to a neural machine translation system. Background technique [0002] With the development of deep learning, machine translation based on neural network has made great progress. However, the neural network model adopted by the neural machine translation system lacks interpretability and understandability, which leads to the test prediction problem in the testing process of the neural machine translation system. The transformation test is a test method proposed to solve the test prediction problem. Therefore, the transformation test It is also one of the main methods for testing neural machine translation systems. During the metamorphosis test of the neural machine translatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F40/58G06F40/216G06F40/289G06F40/30G06N3/02
CPCG06F40/58G06F40/216G06F40/289G06F40/30G06N3/02
Inventor 王曙燕马晶晶孙家泽王小银
Owner XIAN UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products