The invention discloses an audio processing method and device, a language model training method and device and system, computer equipment and a storage medium, and belongs to the technical field of signal processing. A phoneme sequence of a target audio is obtained, and a first word sequence is obtained from it using a conventional speech recognition approach. In addition, contextual information is introduced to carry out speech recognition again under a specific context, so that a second word sequence conforming to the contextual information of that context is obtained. The first word sequence and the second word sequence are considered together, and the final semantic information is planned and decoded from them. Introducing the second word sequence in effect raises the probability that words conforming to the specific context occur in the semantic information, which reduces the misjudgment of keywords in the semantic information, improves the accuracy of the automatic speech recognition process, and thus improves the accuracy of the audio processing process.
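As an illustration only, the idea of weighing a general-purpose hypothesis against a context-specific one can be sketched as a log-linear combination of two scored candidate sets. The abstract does not state how the two word sequences are combined, so everything here is an assumption: the function name `combine_hypotheses`, the `context_weight` and `floor` parameters, the interpolation rule, and the example scores are all hypothetical and are not taken from the invention.

```python
from typing import Dict, Tuple

# A hypothesis is a tuple of words; each recognizer maps hypotheses to log-probabilities.
Hypothesis = Tuple[str, ...]


def combine_hypotheses(
    general: Dict[Hypothesis, float],
    contextual: Dict[Hypothesis, float],
    context_weight: float = 0.5,
    floor: float = -20.0,
) -> Tuple[Hypothesis, Dict[Hypothesis, float]]:
    """Pick the word sequence with the best interpolated log-probability.

    Assumption: a simple log-linear interpolation stands in for the
    (unspecified) decoding step described in the abstract.
    `floor` is the log-probability assigned to a hypothesis that one
    recognizer did not propose at all.
    """
    candidates = set(general) | set(contextual)
    scores: Dict[Hypothesis, float] = {}
    for words in candidates:
        g = general.get(words, floor)      # score from the conventional recognizer
        c = contextual.get(words, floor)   # score from the context-aware recognizer
        scores[words] = (1.0 - context_weight) * g + context_weight * c
    best = max(scores, key=lambda w: scores[w])
    return best, scores


# Hypothetical scores for one utterance in a fantasy-game context.
first_pass = {
    ("summon", "the", "knight", "king"): -1.0,
    ("summon", "the", "night", "king"): -1.5,
}
second_pass = {
    ("summon", "the", "knight", "king"): -3.0,
    ("summon", "the", "night", "king"): -0.5,
}

best, scores = combine_hypotheses(first_pass, second_pass, context_weight=0.5)
print(" ".join(best))
```

With `context_weight=0.0` the sketch falls back to the first-pass result alone; raising the weight lets context-conforming words win over acoustically similar alternatives, which mirrors the effect the abstract describes.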