65th ISI World Statistics Congress 2025

65th ISI World Statistics Congress 2025

Research on Optimizing Seasonal Adjustment of Time Series Data Using DTW and Machine Learning

Conference

65th ISI World Statistics Congress 2025

Format: CPS Abstract - WSC 2025

Keywords: dtw, machine learning, random-forest, randomforest, seasonal_adjustment, x-13_arima_seats

Session: CPS 49 - Seasonal Adjustment and Time Series Analysis for Official Statistics

Monday 6 October 5:10 p.m. - 6:10 p.m. (Europe/Amsterdam)

Abstract

This study introduces a methodology aimed at enhancing and streamlining the process of seasonal adjustment with regards to time series data within the economic domain. The significance of seasonal adjustments is to reveal fundamental trends and cycles by removing seasonal fluctuations from time series data from national statistics.
However, the seasonal adjustment process becomes more complex due to economic fluctuations, requiring practical experience and industry-specific knowledge. This study focuses on the question, "Can practical experience optimize seasonal adjustments?" and aims to identify this issue and propose a solution.
The study focuses on the "Mining Manufacturing Industry Trend Survey" by the Statistics Korea. It analyzes the monthly production index (both raw and seasonally adjusted) for manufacturing sectors with distinct seasonality. Using Python to automate the X-13 ARIMA SEATS program generated diverse option combinations, allowing verification of similarity between seasonally adjusted and forecast indexes via DTW. A random forest regression model identified an optimal model with high explanatory power (R² scores are 0.93 to 0.99). Key features include Sum Seas MA for the C265 industry and D11F 3yr for the C331 industry. Industry-specific application of these elements is expected to enhance seasonal adjustment efficiency.
This study has enabled the automation of repetitive tasks to lessen workload, with the incorporation of DTW and machine learning enabling the identification of pertinent option combinations and diagnostic outcomes tailored to distinct industry classifications. Furthermore, the results of this research study are anticipated to improve the operational efficiency of national statistical bodies, thereby supporting more accurate economic assessments and informed policy-making processes.

Figures/Tables

Figure 1 Comparison of Traditional Process and Python Automation Process for X-13A-S

Figure 2 C291(the industry of general purpose machinery manufacturing) Production Index

Figure 3 C152(the sector of footwear and shoe parts manufacturing) Production Index

Figure 4 Feature Importance of C265(manufacturing of imaging and audio equipment)

Figure 5 Feature Importance of C331(manufacturing of precious metals and jewelry)