This invention concerns a method and system for monitoring an automated dialog system for the automatic recognition of language understanding errors based on a user's input communications in a task classification
system. The method may include determining whether the user's input communication can be understood in order to make a task classification decision. If the user's input communication cannot be understood and a task classification decision cannot be made, a probability of understanding the user's input communication may be determined. If the probability exceeds a first threshold, further dialog may be conducted with the user. Otherwise, the user may be directed to a human for assistance. In another possible embodiment, the method operates as above except that if the probability exceeds a second threshold, the second threshold being higher than the first, then further dialog may be conducted with the user using the current dialog strategy. However, if the probability falls between the first threshold and the second threshold, the dialog strategy may be adapted in order to improve the chances of conducting a successful dialog with the user. This process may be cumulative. In particular, the first dialog exchange may be stored in a
database. Then, a second dialog exchange is conducted with the user. As a result, a second determination as to whether the user's input communication can be understood may be made based on the stored first exchange and the current second exchange. This cumulative process may continue using a third and fourth exchange, if necessary.
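
The two-threshold embodiment and the cumulative process described above may be illustrated by the following minimal Python sketch. It is an illustration under stated assumptions, not the claimed implementation: the names `Action`, `route`, and `DialogMonitor`, the threshold values 0.3 and 0.7, and the `estimator` callable are all hypothetical, and the treatment of a probability exactly equal to a threshold is an assumption the text does not settle. The sketch further assumes that `observe` is called only for exchanges in which no task classification decision could be made.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Callable, List


class Action(Enum):
    CONTINUE = "continue with the current dialog strategy"
    ADAPT = "adapt the dialog strategy"
    HANDOFF = "direct the user to a human"


def route(probability: float, first: float = 0.3, second: float = 0.7) -> Action:
    """Map a probability of understanding to a next step.

    The threshold defaults are placeholder values, not values from the source.
    """
    if not 0.0 <= first < second <= 1.0:
        raise ValueError("require 0 <= first < second <= 1")
    if probability > second:
        return Action.CONTINUE
    if probability > first:
        # Assumption: a probability equal to the second threshold is
        # treated as falling between the two thresholds.
        return Action.ADAPT
    # A probability that does not exceed the first threshold.
    return Action.HANDOFF


@dataclass
class DialogMonitor:
    # Hypothetical model: scores all exchanges seen so far and returns
    # a probability of understanding in [0, 1].
    estimator: Callable[[List[str]], float]
    first: float = 0.3
    second: float = 0.7
    # Stands in for the database of stored exchanges.
    history: List[str] = field(default_factory=list)

    def observe(self, exchange: str) -> Action:
        """Store the exchange, then decide using every stored exchange."""
        self.history.append(exchange)
        probability = self.estimator(list(self.history))
        return route(probability, self.first, self.second)
```

In this sketch, each call to `observe` appends the current exchange to `history` before scoring, so the second decision is based on the stored first exchange together with the current second exchange, and likewise for a third and fourth exchange.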