The invention discloses a man-
machine voice
interaction method. The method includes the following steps: dividing a recording
signal and reference
signal echo cancellation process into multiple sub-processes in a
system state, splicing the sub-processes, and obtaining an output result of recording
signal and reference signal echo cancellation, the reference signal being a voice signal emitted by a
machine player; inputting the output result of recording signal and reference signal echo cancellation to a VAD model, and outputting a voice recognition result in real time when a user voice signal is detected; stopping a
system from playing the voice signal when determining, based on the real-time output voice recognition result, the intention of a user and interrupting output of the
system voice signal, entering a
user state, and monitoring and recognizing the user voice signal; and sending feedback to the user according to the final voice recognition result when detecting that the output of the user voice signal is finished, and completing the interruption and responding process of one round of users. Interruption behaviors of users during an interaction process can be effectively detected, talking opportunities and contents of a
machine can be decided, and thus the machine becomes smarter.