The invention discloses a cross-language emotional
speech synthesis method and
system. The method comprises steps of establishing a context-dependent
label format and a context-dependent clustering
problem set; determining a
first language annotation document, a second language
annotation document, a target emotional Putonghua
annotation document, an annotation document to be synthesized, a
first language acoustic parameter, a second language acoustic parameter, a target emotional acoustic parameter; based on the
first language annotation document, the second language annotation document, the target emotional Putonghua annotation document, the first language acoustic parameter, the second language acoustic parameter, and the target emotional acoustic parameter, determining the multi-speaker target emotional average
acoustic model; and finally, inputting the annotation document to be synthesized into the multi-speaker target emotional average
acoustic model to obtain a first language or / and a second language target emotional
speech synthesis document to realize the synthesis of the same speaker or a different speaker cross-language emotional voice.