The invention relates to a multimedia visual entrance guard system comprising an audio collecting unit, a video collecting unit, a digital signal processor (DSP) unit, an Advanced RISC Machine (ARM) controller, a video synthesis unit, a display unit, a human face detecting unit, a gain regulating unit, a storage unit, a keyboard input unit, a door lock control unit and a loudspeaker. The audio collecting unit, the video collecting unit and the DSP unit are located in an outdoor part; the ARM controller, the video synthesis unit, the display unit, the human face detecting unit, the gain regulating unit, the storage unit, the keyboard input unit, the door lock control unit and the loudspeaker are located in an indoor part. The indoor part and the outdoor part are connected through an Ethernet bus controlled by an Ethernet controller. The audio signal gain is dynamically regulated according to the detected distance and position of the visitor. In addition, when no visitor is present, multimedia files are played through the audio and video devices of the system, which improves the user experience and also raises the equipment utilization rate.
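
The gain regulation and idle playback behaviour described above can be illustrated with a minimal sketch. The abstract does not state how distance and position are measured or how they map to a gain value, so everything below is an assumption made for illustration: distance is estimated from the width of the detected face (a pinhole-camera approximation), the gain follows the inverse-distance law plus a small off-axis correction, and the names `Face`, `estimate_distance_m`, `gain_db`, `select_mode` and all constants are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import Optional

# Hypothetical calibration values; the abstract gives no figures.
REF_WIDTH_PX = 120.0     # face width in pixels at the reference distance
REF_DISTANCE_M = 0.5     # reference distance in metres
OFF_AXIS_MAX_DB = 6.0    # extra gain when the face is at the frame edge
MIN_GAIN_DB = 0.0
MAX_GAIN_DB = 24.0


@dataclass
class Face:
    """Assumed output of the human face detecting unit."""
    width_px: float   # width of the face bounding box, in pixels
    center_x: float   # horizontal centre of the face, normalised to 0..1


def estimate_distance_m(face: Face) -> float:
    """Apparent face width is taken as inversely proportional to distance."""
    if face.width_px <= 0:
        raise ValueError("face width must be positive")
    return REF_DISTANCE_M * REF_WIDTH_PX / face.width_px


def gain_db(face: Optional[Face]) -> Optional[float]:
    """Return the microphone gain in dB, or None when no visitor is seen."""
    if face is None:
        return None
    distance = estimate_distance_m(face)
    # Inverse-distance law: level falls by 20*log10(d / d_ref) dB.
    gain = 20.0 * math.log10(distance / REF_DISTANCE_M)
    # Position term: 0 dB on axis, OFF_AXIS_MAX_DB at the frame edge.
    gain += OFF_AXIS_MAX_DB * abs(face.center_x - 0.5) * 2.0
    return max(MIN_GAIN_DB, min(MAX_GAIN_DB, gain))


def select_mode(face: Optional[Face]) -> str:
    """Intercom when a visitor is present, multimedia playback otherwise."""
    return "intercom" if face is not None else "media_playback"
```

For example, under these assumed constants a visitor whose face appears 60 pixels wide at the centre of the frame is estimated to stand about 1 m away and receives roughly 6 dB of gain, while an empty frame switches the system to multimedia playback.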