Speech translation apparatus and speech translation method

A speech translation and speech technology, applied in speech analysis, speech recognition, speech synthesis, etc., can solve problems such as inability to fluent speech dialogue, and achieve the effect of fluent communication

Inactive Publication Date: 2015-03-25
KK TOSHIBA
View PDF8 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, voice conversations cannot be smoothly performed between users

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech translation apparatus and speech translation method
  • Speech translation apparatus and speech translation method
  • Speech translation apparatus and speech translation method

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0031] image 3 is a block diagram of main components of the speech translation apparatus according to the first embodiment.

[0032] exist image 3 The block diagram, shows the figure 1 Component example for . However, it is possible to apply figure 2Component example for . In order to enable user A and user B (located in a remote place) to communicate bidirectionally, user terminal A ( 100 ) includes: a first voice input unit 401 and a first voice output unit 402 . Likewise, the user terminal B (150) includes: a second voice input unit 411 and a second voice output unit 412 . The first voice input unit 401 of the user terminal A (100) is equivalent to figure 1 The microphone 113, and the first speech output unit 402 are equivalent to figure 1 speaker 111. The second voice input unit 411 of the user terminal B (150) is equivalent to figure 1 The microphone 153, and the second voice output unit 412 are equivalent to figure 1 speaker 151 .

[0033] Speech recog...

no. 2 example

[0057] In the first embodiment, the first speech recognition unit 421, the second speech recognition unit 422, the first machine translation unit 423, the second machine translation unit 424, the first speech synthesis unit 425 and the second speech synthesis unit 426 according to order to perform processing. However, in the second embodiment, by operating these units in parallel, processing can be performed asynchronously. In the following instructions, refer to figure 1 and image 3 hardware components.

[0058] Figures 7A-7C is a flowchart of the operation of the second embodiment. In short, in parallel operation of the first speech recognition unit 421, the second speech recognition unit 422, the first machine translation unit 423, the second machine translation unit 424, the first speech synthesis unit 425, and the second speech synthesis unit 426 case, Figures 7A-7C is a flowchart.

[0059] First, by pressing the voice input button (114) of user terminal A (100...

no. 3 example

[0069] Figure 9 is a block diagram of a speech translation device according to the third embodiment. In the third embodiment, compared with the first embodiment, the difference is that a volume adjustment unit 700 is equipped. The volume adjustment unit 700 is capable of adjusting volumes of voices output from the first voice output unit 402 and the second voice output unit 412 .

[0070] Figure 10 is a flowchart of control processing by the volume adjustment unit 700 . To simplify the description, the Figure 10 In , only the flow chart of adjusting the volume of the first voice output unit 402 is shown. Through the same flowchart, the volume of the second voice output unit 412 can be adjusted.

[0071] First, at S710, the volume adjustment unit 700 confirms whether the first voice input unit 401 is operating. If the first voice input unit 700 is operating, the volume adjustment unit 700 measures the volume of the first voice input unit 401 at S720. Next, the volume ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A first speech processing device includes a first speech input unit and a first speech output unit. A second speech processing device includes a second speech input unit and a second speech output unit. In a server therebetween, a speech of a first language sent from the first speech input unit is recognized. The speech recognition result is translated into a second language. The translation result is back translated into the first language. A first speech synthesis signal of the back translation result is sent to the first speech output unit. A second speech synthesis signal of the translation result is sent to the second speech output unit. Duration of the second speech synthesis signal or the first speech synthesis signal is measured. The first speech synthesis signal and the second speech synthesis signal are outputted by synchronizing a start time and an end time thereof, based on the duration.

Description

technical field [0001] Embodiments described herein generally relate to speech translation devices and speech translation methods. Background technique [0002] In recent years, with the globalization of culture and economy, speech translation devices that support communication between people with different native languages ​​are highly expected. For example, voice translation applications that operate in conjunction with smartphones are commercialized. In addition, a service that presents a speech translation function is used. [0003] In these application software and services, when the user speaks the voice of the first voice to the voice translation device in a short unit (one sentence or several sentences), the voice is converted into a character string corresponding to the voice by the voice recognition function. Furthermore, this character string in the first language (source language) is translated into a character string in the second language (target language). ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28G10L15/26
CPCG10L13/00G10L15/22G10L15/30G10L15/00
Inventor 住田一男河村聪典釜谷聪史
Owner KK TOSHIBA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products