The invention discloses an operation and maintenance
automation system and method. The
system comprises a
data acquisition module, a pre-
processing and storing module, a prediction module, an
algorithm evaluation module and an operation and maintenance monitoring management module, wherein the
data acquisition module is used for acquiring key performance indexes and running states of monitored units in an operation and
maintenance system through a
network management protocol or a log file; the pre-
processing and storing module is used for performing pre-
processing work and sorted storing on data acquired by the
data acquisition module; the prediction module is used for performing predictions, including a CPU (
Central Processing Unit) load prediction and a disk load prediction according to the data processed by the pre-processing and storing module; the
algorithm evaluation module is used for establishing an evaluation criterion of a prediction
algorithm and the prediction module, comparing an actual value with a predicted value of the prediction algorithm, and establishing a self-learning process; and the operation and maintenance monitoring management module is used for interacting with operation and
maintenance management personnel. A load prediction mechanism and an algorithm prediction model are established in order to finish predictions specific to
resource use situations of CPUs, memories, disks and the like. Alarm information is analyzed by further referring to a load prediction result in order to give a relevant auxiliary decision. Resource expansion and
fault handling are realized in a way of using scripts, an API (
Application Programming Interface) interface and the like.