A system and method are disclosed for enabling user-friendly interaction with a camera system. Specifically, the inventive system and method have several aspects that improve interaction with a camera system, including voice recognition,
gaze tracking, touch-sensitive inputs, and others. The voice recognition unit is operable for, among other things, receiving multiple different voice commands, recognizing those voice commands, associating the different voice commands with one camera command, and controlling at least some aspect of the digital camera operation in response to these voice commands. The
gaze tracking unit is operable for, among other things, determining the location on the viewfinder image at which the user is gazing. One aspect of the touch-sensitive inputs provides a mouse-like touch-sensitive pad that is operable for, among other things, receiving user touch inputs to control at least some aspect of the camera operation. Another aspect of the disclosed invention provides for gesture recognition to be used to interface with and control the camera system.
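
The association of multiple different voice commands with one camera command can be illustrated with a minimal sketch. It assumes the recognizer has already produced text. The phrases, the command names, and the `resolve_command` helper are hypothetical, chosen only for illustration, and are not taken from the disclosure.

```python
from typing import Optional

# Hypothetical camera command names, for illustration only.
SHOOT = "shoot"
ZOOM_IN = "zoom_in"

# Many-to-one table: several different spoken phrases map to one camera command.
PHRASE_TO_COMMAND = {
    "shoot": SHOOT,
    "take picture": SHOOT,
    "cheese": SHOOT,
    "zoom in": ZOOM_IN,
    "closer": ZOOM_IN,
}


def resolve_command(recognized_text: str) -> Optional[str]:
    """Return the camera command for a recognized phrase, or None if unknown."""
    # Normalize case and collapse whitespace before the lookup.
    normalized = " ".join(recognized_text.lower().split())
    return PHRASE_TO_COMMAND.get(normalized)
```

In this sketch, "cheese" and "take picture" resolve to the same command, so a control layer would only need to handle one command per camera function regardless of which phrase was spoken.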