Patents
Literature
Hiro is an intelligent assistant for R&D personnel, combined with Patent DNA, to facilitate innovative research.
Hiro

84 results about "Change voice" patented technology

System for handling frequently asked questions in a natural language dialog service

A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.
Owner:NUANCE COMM INC

Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof

InactiveUS7487093B2Continuously and easily changeSound input/outputSpeech synthesisMorphingSynthesis methods
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “<morphing type=“emotion” start=“happy” end=“angry”>” and end tag < / morphing>, a feature of synthetic voice is continuously changed while gradually changing voice from a happy voice to an angry voice upon outputting synthetic voice.
Owner:CANON KK

Voice-enabled dialog system

A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer the frequently asked question.
Owner:NUANCE COMM INC

Dynamically Changing Voice Attributes During Speech Synthesis Based upon Parameter Differentiation for Dialog Contexts

A method of speech synthesis can include automatically identifying spoken passages within a text source and converting the text source to speech by applying different voice configurations to different portions of text within the text source according to whether each portion of text was identified as a spoken passage. The method further can include identifying the speaker and / or the gender of the speaker and applying different voice configurations according to the speaker identity and / or speaker gender.
Owner:CERENCE OPERATING CO

Car navigation device

There is provided a car navigation device for performing voice guidance on a guiding route to a predetermined destination. The car navigation device has history recording means and change means. The history recording means records the vehicle's driving history on a storage medium. The change means changes voice guidance quantities for a given section in the guiding route based on the driving history recorded by the history recording means concerning the given section.
Owner:DENSO CORP

Method of handling frequently asked questions in a natural language dialog service

A voice-enabled help desk service is disclosed. The service comprises an automatic speech recognition module for recognizing speech from a user, a spoken language understanding module for understanding the output from the automatic speech recognition module, a dialog management module for generating a response to speech from the user, a natural voices text-to-speech synthesis module for synthesizing speech to generate the response to the user, and a frequently asked questions module. The frequently asked questions module handles frequently asked questions from the user by changing voices and providing predetermined prompts to answer frequently asked questions.
Owner:NUANCE COMM INC

Text structure for voice synthesis, voice synthesis method, voice synthesis apparatus, and computer program thereof

InactiveUS20050065795A1Continuously and easily changeSound input/outputSpeech synthesisMorphingSynthesis methods
In a voice synthesis apparatus, by bounding a desired range of input text to be output by, e.g., a start tag “<morphing type=“emotion” start=“happy” end=“angry”>” and end tag < / morphing>, a feature of synthetic voice is continuously changed while gradually changing voice from a happy voice to an angry voice upon outputting synthetic voice.
Owner:CANON KK

Apparatus for controlling depth/distance of sound and method thereof

ActiveUS20120243689A1More sense of realismImage enhancementImage analysisParallaxComputer science
An apparatus for controlling depth / distance of sound and method thereof are disclosed, by which an audio signal can be outputted to correspond to a depth of an image, i.e., a disparity in displaying a stereoscopic image. The present invention includes extracting at least one object from an image, measuring a depth change value in accordance with a motion of the object within the image, and changing a depth / distance level of the sound based on the depth change value of the object.
Owner:LG ELECTRONICS INC

Changed voice detection method and device

The invention provides a changed voice detection method and device. The method comprises: acquiring voice data to be tested and to be subjected to certification matching with a target object; using apreset changed voice detection model to determine voiceprint characteristic information matching with the voice data to be tested and voice faking judgment results; determining similarity between thevoiceprint characteristic information to be tested and registered voiceprint characteristic information of the target object to obtain voiceprint similarity; determining whether the voice data to be tested is artificially-faked changed voice data according to the voice faking judgment results and the voiceprint similarity. The changed voice detection model is used herein to determine voiceprint characteristic information to be tested, which matches with the voice data to be tested, as well as voice faking judgment results; therefore, detection of the voice data to be tested is achieved, detection efficiency is greatly improved for the voice data to be tested, and precision of test results is greatly improved.
Owner:IFLYTEK CO LTD

Method, system and device for converting voice into lip shape and storage medium

The invention discloses a method, system and device for converting voice into a lip shape and a storage medium. The method comprises the steps: acquiring a voice sequence; receiving and processing thevoice sequence by using a trained generative adversarial network model; and obtaining a lip-shaped image output by the trained generative adversarial network model. According to the method, the generative adversarial network model (GAN) is trained, and the trained generative adversarial network model is utilized to convert the voice into the lip shape, so that the lip-shaped image with high quality and high resolution can be obtained; the generative adversarial network model is trained in an unsupervised learning mode, so that the voice quality can be obviously improved, the voice distortionis reduced, and the robustness of the system is enhanced; when changed voice is continuously input, a dynamic lip-shaped image can be finally output, and a smooth visual effect can be provided; and meanwhile, the generated lip-shaped image is combined with the voice, so that a high-quality face speaking video can be synthesized. The method, system and device are widely applied to the technical field of voice data.
Owner:RES INST OF TSINGHUA PEARL RIVER DELTA +1

Method for realizing sound speed-variation without tone variation and system for realizing speed variation and tone variation

The invention discloses a system for realizing sound speed variation and tone variation, which comprises an input cache module, a tone variation processing module, a speed-variation no-tone-variation processing module and a data output module, wherein the input cache module is used for reading the sound signal data to be processed into the cache; the tone variation processing module is used for carrying out the tone variation processing on the sound signal to change the sound tone; the speed-variation no-tone-variation processing module is used for carrying out the speed-variation no-tone-variation processing on the sound signal, thereby changing the sound speed without changing the tone; and the data output module is used for outputting the speed-variation tone-variation signal. The speed-variation no-tone-variation processing module comprises a segmentation data module and a connection data module, wherein the speed-variation no-tone-variation processing module extracts a string of signal subfamilies (namely small sections of sound) from the original speech signal according to the coefficient of variation in speed by using a window function; and the connection data module connects the signal subfamilies according to the time sequence, thereby obtaining the speed-variation no-tone-variation signal. The invention realizes the speed-variation no-tone-variation function and the speed-variation tone-variation function of the audio frequency by using very low algorithm complexity, and does not introduce noise, thereby enhancing the quality of the processed sound.
Owner:刘盛举 +1

Multiplayer Gaming Machine Capable Of Changing Voice Pattern

Herein disclosed is a gaming machine executing a game and paying out a predetermined amount of credits according to a game result; generating voice data based on a player's voice; identifying a voice pattern corresponding to the voice data by retrieving the dialogue voice database and identifying a type of voice corresponding to the voice data, so as to store the voice data along with the voice pattern into the memory; calculating a value indicative of a game result, and updating the play history data stored in the memory using the result of the calculation; comparing the play history data thus updated with a predetermined threshold value data; generating voice data according to the voice pattern based on the play history data if the play history data thus updated exceeds the predetermined threshold value data; and outputting voices from the speaker.
Owner:INTERBLOCK DOO

Shiatsu type fundamental frequency adjustment electronic artificial larynx

The invention relates to a shiatsu type fundamental frequency adjustment electronic artificial larynx. The shiatsu type fundamental frequency adjustment electronic artificial larynx is characterized by changing fundamental frequency of a glottal wave by a shiatsu switch button at any time so as to change voice tones and mainly comprising a shiatsu sensing part, a waveform generating and processing system, a power amplification circuit and an electricity-force conversion system. The shiatsu type fundamental frequency adjustment electronic artificial larynx is characterized in that a glottal waveform having individual voice characteristics is stored in the waveform generating and processing system; the fundamental frequency of the waveform is changed under the control of the switch / shiatsubutton at any time during the process of waveform generation; the generated waveform is converted to an analog signal through a digital to analog conversion module in the system; and the signal waveform output by a digital to analog converter is applied to the electricity-force conversion system after power amplification. The amplified waveform is converted to mechanical vibration through an electricity-force energy converter of a high magnetic field, the vibration is applied tp the neck of a patient through a vibration film to produce the glottal wave, and the waveform forms sound outside a lip after being modulated by a tongue, a nasal cavity, an oral cavity, a lip, and the like of the patient.
Owner:BEIHANG UNIV +1

Implementing method of software capable of being used by disabled people

The invention relates to an implementing method of software capable of being used by disabled people. In order to solve the problem that the disabled people have difficulty in using software, voice prompt is used to replace a display-based displaying manner, and the operation of a keyboard and a mouse is replaced by voice input and voice control. To help the blind to find a mouse, the software can produce a proper voice prompt when the mouse is dragged within or beyond a software interface, and reminds users of taking proper operations. An operational guidance for the disabled is set in the software, and voice prompt is carried out in a tree form to remind the disabled of making choices. The disabled can make choices in voice until the specific content is broadcasted in voice. The keyboard reminding function is provided, and the key pressing operation in the software operation process can be changed into corresponding voice for reminding. The focus switching can be controlled by voice, and the implementation of controls in the focus state can be controlled by voice instructions. The software can change voice input into corresponding texts and can enter the texts in an input box.
Owner:GUILIN UNIV OF ELECTRONIC TECH

Phoneme changing method based on digital signal processing

InactiveCN1567428AFacilitate real-time transmissionSimple methodSpeech synthesisDigital signal processingSpeech sound
This invention published a kind of voice variation method that based on digital signal processing. It includes the steps of (1) selecting the original voice signal that needs to change; (2) finding the basic tone cycle length of original voice signal; (3) confirming the position of every basic tone cycle of whole original voice signal according to the basic tone cycle length; (4) inserting or deleting the basic tone cycle between the basic tone cycle of original voice signal, get the shortened or prolonged voice signal; (5) linearly extending or compressing the shortened or prolonged voice signal to the same length of original voice signal and getting the changed voice signal. This invention can be realized real-time on DSP chip. The changed voice is very natural.
Owner:BEIJING KEXIN TECH +1

Bone conductive speaker

The provide manufacturing technology and method for a bone conductive speaker that enables a person to be aware and recognize audio signal, with the cranial bones of a human body, through the vibrations of a vibrator. In a bone conductive speaker, a magnet is not installed proximately to a voice coil in a configuration box, but one magnet is installed in an upper portion of an iron piece. Thus, the trouble of matching magnet directionality is eliminated, not only characteristics adjustment can be made wider by enlarging the size of the voice coil, but also strength adjustment that depends on the changes in the size of the magnet can be significantly facilitated, and the bone conductive speaker can be made small-sized and lightweight.
Owner:GOLDENDANCE

Audio signal processing apparatus, audio signal processing method, program, and input apparatus

Audio signal processing apparatus is disclosed. The audio signal processing apparatus includes a first audio signal extracting section, a second audio signal extracting section, a sense-of-depth controlling section, a sense-of-sound-expansion controlling section, a control signal generating section, and a mixing section. The first audio signal extracting section extracts a main audio signal. The second audio signal extracting section extracts a sub audio signal. The sense-of-depth controlling section processes the extracted main audio signal to control a sense of depth. The sense-of-sound-expansion controlling section processes the extracted sub audio signal to vary a sense of sound expansion. The control signal generating section generates a first control signal with which the sense-of-depth controlling section is controlled and a second control signal with which the sense-of-sound-expansion controlling section is controlled. The mixing section mixes an output audio signal of the sense-of-depth controlling section and an output audio signal of the sense-of-sound-expansion controlling section.
Owner:SONY CORP

Multiplayer gaming machine capable of changing voice pattern

Herein disclosed is a gaming machine executing a game and paying out a predetermined amount of credits according to a game result; generating voice data based on a player's voice; identifying a voice pattern corresponding to the voice data by retrieving the dialogue voice database and identifying a type of voice corresponding to the voice data, so as to store the voice data along with the voice pattern into the memory; calculating a value indicative of a game result, and updating the play history data stored in the memory using the result of the calculation; comparing the play history data thus updated with a predetermined threshold value data; generating voice data according to the voice pattern based on the play history data if the play history data thus updated exceeds the predetermined threshold value data; and outputting voices from the speaker.
Owner:INTERBLOCK DOO

Sound apparatus, method of changing sound characteristics, and data recording medium on which a sound correction program

This invention is a sound apparatus having a plurality of different kinds of speakers and that independently performs correction of the sound signal for each speaker to obtain optimum sound and sound field. This sound apparatus has an output device that receives audio signals and outputs sound, and comprises: a correction device for correcting the audio signals that are input to each of the output devices; and a correction-characteristic-setting device which sets correction characteristic for each of the output devices; and where the correction device correct the audio signals based on the set correction characteristic.
Owner:PIONEER CORP

Voiceprint identity authentication device and authentication optimization method and system

The invention discloses an authentication optimization method of a voiceprint identity authentication device. The authentication optimization method comprises the steps that the Mel-frequency cepstral coefficients corresponding to registration voice signals are extracted and preset number binding is performed on the Mel-frequency cepstral coefficients; the Mel-frequency cepstral coefficients act as an input layer and the bound numbers act as an output layer to perform differentiated deep belief network training and acquire the parameter space; the Mel-frequency cepstral coefficients are inputted to the differentiated deep belief network to acquire the hidden layer output to act as the feature vectors; all the feature vectors act as the input to construct a Gaussian mixture model; and the corresponding Mel-frequency cepstral coefficient of any registration voice signal is inputted to the differentiated deep belief network to acquire multiple hidden layer outputs, and the hidden layer outputs of which the degree of distinction is higher than the preset threshold are selected to act as the training data to update the Gaussian mixture model. The following spontaneously changed voice signal of the registrant acts as the raining data to update the Gaussian mixture model so as to be more adaptive to the present sound production state of the registrant, and the recognition rate can be guaranteed.
Owner:GUANGDONG UNIV OF TECH

Speaker

A speaker includes a housing; a support assembly installed on the lower end of the housing to support the housing; at least one first speaker unit installed in the housing to reproduce a sound signal; and a second speaker unit installed in the support assembly to reproduce a sound signal which belongs to a low sound band. Openings are defined through both sides of the support assembly to transmit to the outside sound reproduced by the second speaker unit. A guide cone is provided to the lower end of the support assembly to guide sound toward the openings. The speaker unit can be installed in a baffle which is supported at the opened front end of the housing to be rotated about a vertical axis. Through rotating the baffle, the speaker unit can be actually rotated within a predetermined range to change a sound transmission direction.
Owner:LG ELECTRONICS INC

Display apparatus and voice conversion method thereof

The voice conversion method of a display apparatus includes: in response to the receipt of a first video frame, detecting one or more entities from the first video frame; in response to the selection of one of the detected entities, storing the selected entity; in response to the selection of one of a plurality of previously-stored voice samples, storing the selected voice sample in connection with the selected entity; and in response to the receipt of a second video frame including the selected entity, changing a voice of the selected entity based on the selected voice sample and outputting the changed voice.
Owner:SAMSUNG ELECTRONICS CO LTD

Dynamically changing voice attributes during speech synthesis based upon parameter differentiation for dialog contexts

A method of speech synthesis can include automatically identifying spoken passages and non-spoken passages within a text source and converting the text source to speech by applying different voice configurations to different portions of text within the text source according to whether each portion of text was identified as a spoken passage or a non-spoken passage. The method further can include identifying the speaker and / or the gender of the speaker and applying different voice configurations according to the speaker identity and / or speaker gender.
Owner:CERENCE OPERATING CO

Remote-controlling system with wireless earphone function and method thereof

The present invention relates to a remote control system with wireless earphone function and its method. Said method includes the following steps: providing a main machine and a remote control terminal; the remote control terminal can detect that an earphone is inserted into the voice signal output interface or not; if an earphone is inserted into the voice signal output interface, sending a first instruction for regulating volume into main machine; according to said instruction main machine can change the sound volume of sound playback unit; main machine can utilize a voice-frequency transmitting unit to send the voice-frequency signal into remote control terminal; said remote control terminal can receive said voice-frequency signal; and utilize voice signal output interface to output said voice-frequency signal.
Owner:HONG FU JIN PRECISION IND (SHENZHEN) CO LTD +1
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products