The invention discloses a
big data-based
online analytical processing system and method. The
system can be used for carrying out quick multi-dimensional query and analysis on data sets with different scales and levels under a Hadoop environment. A
query plan selected through query, planning and
estimation comprises MDX query supporting Hive and Hbase
precomputation cache mechanism-based multi-dimensional query. According to the
system and method, optimization of the MDX query supporting Hive data warehouses on extensible cluster nodes and of the Hbase
precomputation cache mechanism-based multi-dimensional query are realized, the low-
delay multi-dimensional query requirements of the data sets with different scales and levels are satisfied, and the OLAP multi-dimensional query of different OLAP data organization models under a single
data source background is solved. Aiming at the performance
optimization problem of Hive multi-dimensional query on large-scale data sets, an Hbase cache-based segmented layered dimensionality-reduction aggregation
algorithm is proposed, and the
algorithm brings MOLAP for solving the multi-dimensional query calculation of large-scale data into a
big data OLAP system, so that the extendibility and effectiveness of the multi-dimensional query of data with different scales and levels under a
big data background are greatly enhanced.