USA Daily Energy Consumption Forecasting with Prophet & NeuralProphet - American Electric Power (AEP)
This project focuses on forecasting daily energy demand for the AEP region, leveraging advanced time series modeling techniques using the PJM Interconnection dataset. PJM Interconnection LLC is a regional transmission organization (RTO) responsible for coordinating wholesale electricity movement across multiple U.S. states. The dataset provides hourly energy consumption data, which is aggregated to daily levels to extract broader consumption patterns.
The primary objective is to predict future energy usage to support energy grid planning, load balancing, and long-term infrastructure management. This work builds a robust, interpretable, and scalable pipeline utilizing:
- Prophet (with external regressors, hyperparameter tuning, and holidays)
- NeuralProphet (with autoregressive memory and recursive forecasting)
- Multi-horizon forecasting (7 days, 3 months, 1 year)
Author: Einstein Ebereonwu • GitHub
Dataset: Hourly Energy Consumption
Notebooks: GitHub Notebook | Kaggle Version
Stage | Key Highlights |
---|---|
Data Preparation | • Aggregated hourly -> daily • Checked for missing days • Verified completeness across years • Mapped US holidays using holidays lib |
Feature Engineering | • Seasonal OHE • Lag-based AR features • Holiday indicator columns • Multi-season encodings • Recursive forecasting frame setup |
Stationarity Testing | • ADF test + visual checks • Differenced rolling plots • Structural shift inspection |
Visual Decomposition | • Seasonally grouped boxplots • Trend overlays • ACF/PACF inspection post-modeling |
Modeling Techniques | • Base Prophet • Hyperparameter-tuned Prophet • NeuralProphet + AR lags • Recursive daily forecast loop • Seasonal regressor injection |
Diagnostics & Evaluation | • MAE, RMSE, MAPE, SMAPE • Visual residual inspection • ACF/PACF residuals • Changepoint tuning • Full horizon tracking for 7, 90, 365-day targets |
- Time-Aware Cross Validation: Prophet’s
cross_validation()
used withinitial
,horizon
, andperiod
settings to simulate real forecasting scenarios. - Hyperparameter Grid Search: 60+ combinations explored across:
changepoint_prior_scale
seasonality_prior_scale
seasonality_mode
holidays_prior_scale
- Holiday Effects Modeling: Official U.S. holidays injected as events; model learns temporal context of holiday-driven consumption behavior.
- External Regressors: Seasonal OHE (
season_spring
,season_summer
, etc.) integrated into Prophet for contextual modeling. - Recursive Forecasting: NeuralProphet AR model used recursively to build step-by-step daily forecasts into the future.
- Multi-Horizon Strategy:
- Short-Term (7 days) - high accuracy focus
- Medium-Term (90 days) - directional insight
- Long-Term (1 year) - trend guidance only
Python • pandas • Prophet • NeuralProphet • scikit-learn • statsmodels • seaborn • ACF/PACF • holidays • matplotlib • plotly • Git/GitHub
- Time-only trends, no regressors
- Captured general direction but missed seasonal/holiday dips
- Residuals had strong autocorrelation (especially lag-2)
- Seasonal one-hot encoding + holidays as regressors
- Changepoint, seasonality, and holiday priors tuned
- Used time-aware CV for hyper-parameter optimization
- Substantial reduction in test RMSE and autocorrelated residuals
- Built-in seasonality/trend handling
- Slightly outperformed base Prophet
- Lacked long-memory structure
- Introduced AR(2) lags
- Recursive forecasting strategy
- Achieved lowest test error metrics, indicating robust memory modeling
Horizon | Purpose | Accuracy Focus |
---|---|---|
Short-term | Next 7 days - operational accuracy | Critical focus |
Medium-term | Next 3 months - strategic planning | Approximate trend |
Long-term | Next 1 year - infrastructure forecasting | Directional only |
This updated pipeline demonstrates how combining interpretable models, external calendar data, and robust CV-tuned hyperparameters can produce high-accuracy short-term forecasts, while also capturing seasonal and holiday-driven trends in long-term energy usage.
It reflects a rigorous modeling strategy that scales beyond the academic realm and is deployable in real-world energy planning and sustainability workflows.