The embodiment of the invention relates to the technical field of audio control, in particular to a method, system and equipment for automatically controlling video playing according to an external environment. A video playing control method comprises the following steps: monitoring a sound in the external environment through one or more external sensors; analyzing the sound through an internal analysis module, and if the sound is a human voice, carrying out voice analysis on the human voice, wherein the voice analysis comprises intelligent identification of a voice meaning, and analysis of avoice volume and / or a voice duration; and if a voice analysis result meets preset conditions, generating a video control signal, wherein the video control signal is used for controlling video playing.The method, system and equipment adopting the technical scheme have the advantages that whether the sound in the external environment is a human language or an external noise can be automatically identified, and whether the video playing needs to be automatically suspended or not is further analyzed and judged once the sound is identified as the human language, so that artificial intelligence control on the video program playing is achieved, manual intervention is avoided, and better use experience is obtained.