Partitioning and Multi-Core Parallelization of Multi-Equation Forecast Models

Forecasting is an important analysis technique used in many application domains such as electricity management, sales and retail and, traffic predictions. The employed statistical models already provide very accurate predictions, but recent developments in these domains pose new requirements on the calculation speed of the forecast models. Especially, the often used multi-equation models tend to be very complex and their estimation is very time consuming. To still allow the use of these highly accurate forecast models, it is necessary to improve the data processing capabilities of the involved data management systems. For this purpose, we introduce a partitioning approach for multi-equation forecast models that considers the specific data access pattern of these models to optimize the data storage and memory access. With the help of our approach we avoid the redundant reading of unnecessary values and improve the utilization of the CPU cache. Furthermore, we utilize the capabilities of modern multi-core hardware and parallelize the model estimation. Our experimental results on real-world data show speedups of up to 73x for the initial model estimation. Thus, our partitioning and parallelization approach significantly increases the efficiency of multi-equation models.

[read more]

Lars Dannecker, Matthias Boehm, Wolfgang Lehner, Gregor Hackenbroich: Partitioning and Multi-Core Parallelization of Multi-Equation Forecast Models; In: Proceedings of the 24th Scientific and Statistical Database Management Conference (SSDBM 2012, Chania, Crete, Greece), 2012.