The invention provides a multimedia transliteration method is applied to a multimedia transliteration system and comprises the following steps of: S1, receiving a demonstration manuscript, and constructing a key information tree of the demonstration manuscript; S2, receiving voice data, carrying out voice identification on the voice data, and obtaining transliteration texts of the voice data; S3, synchronizing the voice data and the transliteration texts with the demonstration manuscript by means of the key information tree; and S4, displaying the demonstration manuscript with the synchronized voice data and transliteration texts to a user. The user can hear the voices of a speaker and see the texts transliterated by the voices of the speaker while seeing the demonstration manuscript, and furthermore, the transliteration texts are segmented according to sub-themes included in each page of the transliteration texts, the transliteration texts of the same sub-theme is in one segment, and the transliteration texts of different sub-theme serve as different segments, so that the user can conveniently understand the transliteration texts, and the experience of the user is further improved.