Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Preprocessing module of multi-language intelligent preprocessing real-time statistical machine translation system

A technology for statistical machine translation and preprocessing modules, applied in natural language translation, electronic digital data processing, special data processing applications, etc. and other problems to achieve the effect of improving the accuracy

Inactive Publication Date: 2017-08-11
唐亮
View PDF2 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] At present, the function of the preprocessing module of machine translation is not perfect. Most of them are trained and translated by the translation module after simple typo judgment and punctuation prediction after receiving by the receiving module. This not only increases the difficulty of machine translation, but also for small probability Words, the translation module may have inaccurate translation problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Preprocessing module of multi-language intelligent preprocessing real-time statistical machine translation system
  • Preprocessing module of multi-language intelligent preprocessing real-time statistical machine translation system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art fall within the protection scope of the present invention.

[0023] Such as Figure 1-2 As shown, the preprocessing module of a multi-language intelligent preprocessing real-time statistical machine translation system according to an embodiment of the present invention includes a text preprocessing module and a speech recognition result preprocessing module, the text preprocessing module The processing module is used to perform word standardization operations, category recognition and labeling, and word order adjustment f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a preprocessing module of a multi-language intelligent preprocessing real-time statistical machine translation system. The preprocessing module comprises a text preprocessing module and an automatic speech recognition result preprocessing module, wherein the text preprocessing module is used for carrying out word normalized operation, class recognition labeling and chuck and word order adjustment on languages input in a text manner; and the automatic speech recognition result preprocessing module is used for carrying out word normalized operation and punctuation prediction on the languages. The preprocessing module disclosed by the invention is capable of carrying out basic operations such as word normalized operation, class recognition labeling and chuck and word order adjustment on to-be-translated text languages, so that convenience is brought to the translation carried out on the to-be-translated language texts by a subsequent translation module; or the preprocessing module disclosed by the invention is capable of carrying out work normalized operation on speech languages or carrying out preprocessing such as prediction and the like on the punctuations in speech flows, so that convenience is brought to the translation carried out by a subsequent machine translation module. The preprocessing module has the effect of carrying out labelling and preferential translation on small-probability words, so as to improve the correctness of translating the small-probability words.

Description

Technical field [0001] The invention relates to the technical field of artificial intelligence machine translation, and in particular to a preprocessing module of a multi-language intelligent preprocessing real-time statistical machine translation system. Background technique [0002] Machine translation is a technology that uses computers to automatically translate human natural languages. It is a process of using computers to convert one natural language into another natural language, and the two natural languages ​​should be equivalent in meaning. [0003] At present, a relatively mature and mainstream machine translation method is based on statistics. The advantage of this method is that there is almost no need to manually write translation rules. All translation information is automatically learned from the corpus, so this method is the biggest To a great extent, the characteristics of computer high-speed operation are brought into play, and labor costs are greatly reduced. [...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
CPCG06F40/44G06F40/58
Inventor 张昱琪唐亮
Owner 唐亮
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products