Updated 1st August 2023
Summary
The retraining process resulted in two new models, one for PD and one for 7day price forecast. After considerable experimentation we retrained the price forecasting models with a considerable increase in accuracy (see validation results table below comparing the previous models with the new models).
Training period
Both PD and 7day price forecast models were trained on data between 01-10-2020 and 20-08-2023.
The market suspension period and a period after the Callide explosion were removed from the training data.
New features (input data) exceptions
A new “recent prices feature” was added to give the model some context for the current pricing environment. This feature is the median price for a window around the forecast time for the most recent three days. For example, for a forecast occurring in 2 days at 11am, the model generated the median value for prices between 10am-12pm for the three days before the forecast was run.
A number of additional experiments were conducted on the 7 day forecast model, with the major modifications including:
The 50th percent probability of demand features were augmented with 10th percent probability of demand data
The new recent prices features mentioned previously
A new time of day feature
A new weekday/weekend feature.
Predispatch model | p7day model | |
---|---|---|
Model_id | 20230729_213406_CB_2 | 20230213_012319_CB_7D_1 |
Previous model release date | 20th April 2023 | 20th April 2023 |
Latest release date (after retraining) | 1st August 2023 | 1st August 2023 |
Machine Learning Algorithm | Ensemble predictor: gradient boosting | Ensemble predictor: gradient boosting |
Training Period | 1st October 2020 to 20th July 2023 | 1st October 2020 to 20th July 2023 |
Hyper-parameter type | A | A |
Hyper-parameter notes | No weighting across time periods | No weighting across time periods |
Target | Dispatchprice (30min resolution) | Dispatchprice (30min resolution) |
Input (and training) variables | PREDISPATCHPRICE rrp DISPATCHPRICE last 3 day average actual RRP for same time PREDISPATCHREGIONSUM (intervention = 0) hours_out time_of_day day_type (weekday_or_weekend) dispatchableload netinterchage (availablegeneration - uigf) (dispatchablegeneration - scheduled_clearedmw) ss_solar_clearedmw ss_solar_uigf ss_wind_clearedmw ss_wind_uigf totaldemand totalintermittentgeneration | STPASA_REGIONSOLUTION hours_out time_of_day day_type (weekday_or_weekend) aggregatepasaavailability aggregatescheduledload netinterchangeunderscarcity aggregatecapacityavailable ss_solar_cleared ss_wind_uigf ss_wind_cleared ss_solar_uigf demand10 demand50 totalintermittentgeneration demand_and_nonschedgen |
...