Video voice conversion method, video voice conversion device and server
A technology of video voice and conversion method, applied in voice analysis, voice synthesis, voice recognition, etc., can solve the problems of high cost, low efficiency, difficult to guarantee accuracy, etc., achieve the accuracy of results, reduce costs, and avoid accurate lower sex effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0025] Figure 1A It is a flow chart of the video-to-speech conversion method provided in Embodiment 1 of the present invention, Figure 1B It is a schematic diagram of segmentation of a speech signal in a source language provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation where it is necessary to convert the speech signal of the source language in the video into the speech signal of the target language, and the method can be executed by a video-to-speech conversion device, which can be set in a server. The method specifically includes the following operations:
[0026] 101: Extracting the speech signal of the source language in the video, segmenting the speech signal of the source language, and obtaining at least one sub-speech signal of the source language;
[0027] Here, when the speech signal of the source language in the video is relatively long, segmenting the speech signal of the source language according to a certain metho...
Embodiment 2
[0046] Figure 2A For the video-to-speech conversion method provided in Embodiment 2 of the present invention, Figure 2B A schematic diagram of an interface for selecting a target language type for a user in Embodiment 2 of the present invention. This embodiment is applicable to the situation that the voice signal of the source language in the video is converted into the voice signal of the target language before playing the video. The playback device can be set in the same server or in different servers. The method specifically includes the following operations:
[0047] 201: The video-to-voice conversion device determines at least one target language to be converted according to the setting information;
[0048] 202: The video-to-speech conversion device performs the following operations for each target language that needs to be converted: extract the speech signal of the source language in the video, segment the speech signal of the source language, and obtain at least ...
Embodiment 3
[0055] image 3 It is the video-to-speech conversion method provided by Embodiment 3 of the present invention. This embodiment is applicable to the situation that the voice signal of the source language in the video is converted into the voice signal of the target language in real time after receiving the play request, and the method can be performed by a video voice conversion device and a video playback device, and the video voice conversion device and the video playback device can be set in the same server or different servers. The method specifically includes the following operations:
[0056] 301: The video and audio playback device receives a video playback request, and the playback request includes the target language type selected by the user or automatically selected;
[0057] Among them, an example of the user selecting the target language type can be found in Figure 2B , the user can select Mandarin or Sichuanese as the target language type in the menu of "Simul...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com