Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Model updated

...

9th November 2023

Summary

The retraining process resulted in two new models, one for PD and one for 7day price forecast. After considerable experimentation we retrained the price forecasting models with a considerable using existing features and hyper parameter settings. The increase in accuracy (using Absolute Mean Error, AME) was between 10% and 15% (with the exception of SA which was 50%). (see validation results table below comparing the previous models with the new models).

...

  • Both PD and 7day price forecast models were trained on data between 01-1011-2020 and 01-11-2023.

    • The market suspension period and a period after the Callide explosion were removed from the training data.

New features (input data) exceptions

  • The feature set is consistent with the previous

  • Validation set 1st November 7th November.

  • A new “recent prices feature” was added to give the model some context for the current pricing environment. This feature is the median price for a window around the forecast time for the most recent three days. For example, for a forecast occurring in 2 days at 11am, the model generated the median value for prices between 10am-12pm for the three days before the forecast was run.

  • A number of additional experiments were conducted on the 7 day forecast model, with the major modifications including:

  • The 50th percent probability of demand features were augmented with 10th percent probability of demand data

  • The new recent prices features mentioned previously

  • A new time of day feature

  • A new weekday/weekend feature

    the same as the previous model. The main benefit of the new model is that more recent market data are consumed in the training process. The training does not weight the most recent market data.

    • A number of experiments were conducted by changing the feature list and some hyper parameters however this didn’t not achieve a significant benefit to the model tests. Hence for consistency we have not made changes to the model other than extending the training period to the most recent market data. Unlike the August model update included significant changes to both the feature set and the hyper parameters.

  • Validation set is from the 1st of November to the 7th of November. Oddly enough the impact of the Melbourne Cup did not have a significant impact on the performance metrics.

Predispatch model

p7day model

Model_id

2023072920231108_213406142510_CB_2

2023021320231108_012319140809_CB_7D_12

Previous model release date

20th April 8th August 202320th April

8th August 2023

Latest release date (after retraining)

8th August 9th November 20238th August

9th November 2023

Machine Learning Algorithm

Ensemble predictor: gradient boosting

Ensemble predictor: gradient boosting

Training Period

1st October 2020 to 20th July 31st October 2023

1st October 2020 to 20th July 31st October 2023

Hyper-parameter type

A

A

Hyper-parameter notes

No weighting across time periods

No weighting across time periods

Target

Dispatchprice (30min resolution)

Dispatchprice (30min resolution)

Input (and training) variables

PREDISPATCHPRICE

rrp

DISPATCHPRICE

last 3 day average actual RRP for same time

PREDISPATCHREGIONSUM (intervention = 0)

hours_out

time_of_day

day_type (weekday_or_weekend)

dispatchableload

netinterchage

(availablegeneration - uigf)

(dispatchablegeneration - scheduled_clearedmw)

ss_solar_clearedmw

ss_solar_uigf

ss_wind_clearedmw

ss_wind_uigf

totaldemand

totalintermittentgeneration

STPASA_REGIONSOLUTION

hours_out

time_of_day

day_type (weekday_or_weekend)

aggregatepasaavailability

aggregatescheduledload

netinterchangeunderscarcity

aggregatecapacityavailable

ss_solar_cleared

ss_wind_uigf

ss_wind_cleared

ss_solar_uigf

demand10

demand50

totalintermittentgeneration

demand_and_nonschedgen

...