The invention provides a voice control system for multimedia equipment. The voice control system comprises an image sensing module used for collecting action images of a user, an image recognition module for determining a control command type or state according to the action image of the user, a voice recognition state management module for activating or suspending voice recognition according to the current control command type, a voice picking module for collecting voice data, a voice recognition module for recognizing collected voice data, and a multimedia function module for executing a control command and providing corresponding multimedia functions to the user. The invention further provides a voice control method for multimedia equipment. According to the invention, the technologies of image recognition and voice recognition are combined, and then the purpose of realizing voice control freely and conveniently without a handheld remote controller or a near voice picking module can be achieved, so that voice recognition of a control command can be effectively prevented from being disturbed by voice output by multimedia equipment, environment background sound and voice signals of a non-control command of the user, and a control command sent by the user can be recognized accurately.