In an embodiment, speech is recorded and converted to digital text based on a shared vocabulary dictionary. During the session, voice recognition errors, that is, speech that could not be automatically identified (if any exists), are identified by the system and associated with digital text. When text for the voice recognition error is identified (e.g., by an editor), the shared vocabulary dictionary is updated (so that that particular voice recognition error will not occur again), thereby improving the performance of the system for all users that use the shared vocabulary dictionary. The identification of voice recognitions errors and the updated of the vocabulary dictionary are performed on an ongoing basis, so that the performance of the system for all users continually improves.