The present invention relates to methods for mining real-world databases that have mixed data types (e.g., scalar, binary, category, etc.) to extract an implicit time-sequence to the data and to utilize the extracted information to compute categories for the input data and to predict
categorization of future input data vectors. Many real-world databases may not have explicit
time data yet there may be inherent
time data which may be extracted from the
database itself. The present invention extracts such inherent
time sequence data and utilizes it to classify the data vectors at each instant in time for purposes of categorizing the data at that time instant. The present invention has wide applicability and may find use in fields such as manufacturing, financial services, or government. In particular, the present invention may be used to identify potential threats, to predict the presence of a
threat, and even to evaluate the degree of
threat posed. For purposes of this discussion, the threats may be security threats or other adverse events occurring at a particular company, location, or systems, such as a manufacturing or information systems.