A
system and method of automatically classifying a communication involving at least one human, e.g., a human-to-human telephone conversation, into predefined categories of interest, e.g., “angry customer, etc. The
system automatically, or semi-automatically with user interaction, expanding
user input into semantically equivalent events. The
system recognizes these events, each of which is given a confidence level, and classifies the telephone conversation based on an overall confidence level. In some embodiments, a different base unit is utilized. Instead of recognizing individual words, the system recognizes composite words, each of which is pre-programmed as an atomic unit, in a given context. The recognition includes semantically relevant composite words and contexts automatically generated by the system. The composite words based contextual recognition technique enables the system to efficiently and logically classifying and indexing large volumes of communications and audio collections such as call center calls, Webinars, live news feeds, etc.