Method and device for recognizing emotion of voice session, server and storage medium
An emotion recognition and voice technology, applied in voice recognition, voice analysis, instruments, etc., can solve the problems of high manpower consumption, inability to guarantee better model time, long model training time, etc., and achieve the effect of improving optimization efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] figure 1 It is a flow chart of a voice conversation emotion recognition method provided by Embodiment 1 of the present invention. This embodiment is applicable to the situation of recognizing user voice emotion in an intelligent voice conversation scene, and the method can be performed by a voice conversation emotion recognition method device to execute. The method specifically includes the following steps:
[0031] S110. Recognize the conversational voice by using a priori emotion recognition rule to obtain a first recognition result.
[0032] In a specific embodiment of the present invention, emotion is a general term for a series of subjective cognitive experiences, and refers to a user's psychological and physiological state comprehensively generated through various feelings, thoughts and behaviors. Furthermore, emotions reflect the user's mental state during human-computer voice interaction. Correspondingly, in order to provide users with better quality and more ...
Embodiment 2
[0050] On the basis of the above-mentioned first embodiment, this embodiment provides a preferred implementation of the emotion recognition method for voice conversation, which can generate and select currently available prior emotion recognition rules. figure 2 The flow chart of voice conversation emotion recognition based on prior emotion recognition rules provided by Embodiment 2 of the present invention, such as figure 2 As shown, the method includes the following specific steps:
[0051] S210. Perform audio feature extraction on historical conversation speech associated with each preset emotional state.
[0052] In a specific embodiment of the present invention, the historical conversational voice refers to the user's voice generated during the intelligent voice interaction process between the user and the smart product or smart platform, and the historical conversational voice is determined by the emotion recognition result and the emotion recognition The result is co...
Embodiment 3
[0063] On the basis of the above-mentioned first embodiment, this embodiment provides a preferred implementation manner of an emotion recognition method for speech conversation, which can use a neural network to perform emotion recognition on the spectrogram of conversation speech. Figure 4 The flow chart of the speech conversation emotion recognition based on the emotion recognition model provided by Embodiment 3 of the present invention, as Figure 4 As shown, the method includes the following specific steps:
[0064] S410. Generate a conversational spectrogram according to the conversational voice information.
[0065] In the specific embodiment of the present invention, in order to simplify the speech emotion recognition process and improve the accuracy of speech emotion recognition, in view of the fact that the image recognition technology is more mature than the speech recognition technology, this embodiment converts the speech recognition into image recognition, based ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com