T-MAAC

This is the official implementation of the paper Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer (KDD2022 Research Track).

The Transformer-based Multi-Agent Actor-Critic Framework (T-MAAC) is based on MAPDN. Please refer to that repo for more documentation.

Installation

We suggest you install dependencies with Dockerfile and run the code with Docker.

docker build . -t tmaac

Downloading the Dataset

We use load profiles and PV profiles provided by MAPDN.

Download the data from the link.
Unzip the zip file and there are 3 folders as following:
- case33_3min_final
- case141_3min_final
- case322_3min_final
Go to the directory [Your own parent path]/T-MAAC/environments/var_voltage_control/ and create a folder called data.
Move the 3 folders in step 2 to the directory [Your own parent path]/T-MAAC/environments/var_voltage_control/data.

Running experiments

Training

You can train the model using the following command.

source activate mapdn

## running in case141
python train.py --alg icstransmaddpg --alias example_0 --mode distributed --scenario case141_3min_final --qweight 0.01 --voltage-barrier-type l2 --save-path trial

## running in case322
python train.py --alg icstransmatd3 --alias example_0 --mode distributed --scenario case141_3min_final --qweight 0.1 --voltage-barrier-type l2 --save-path trial

The meanings of the arguments:

--alg is the MARL algorithm, e.g. maddpg, matd3, icstransmaddpg, icstransmatd3.
--alias is the alias to distinguish different experiments.
--mode is the mode of environment, e.g. distributed.
--scenario is the power system on which you like to train, e.g. case141_3min_final, case322_3min_final.
--qweight is the q_weight used in training. We recommend 0.01 for case141 and 0.1 for case322.
--voltage-barrier-type is the voltage barrier function in training, e.g. l1, l2, bowl.
--save-path is the path to save the model and configures.

Testing

It is worth noting that the difficulty of voltage control problem varies during different months of a year. For example, during the midday summer, excessive active power from intense sunlight is injected into the grid, creating a more significant challenge for the voltage control task than in winter. Thus, a series of fixed scenarios must be chosen to evaluate algorithms fairly.

We randomly select 10 episodes per month, a total of 120 episodes, which constitute the test dataset test_data.csv. Each episode lasts for 480 time steps (i.e., a day). And fixed 10 episodes of 120 episodes are selected to evaluate performance in training phase. (see def evaluation() in models/model.py)

python test.py --save-path trial/model_save --alg icstransmaddpg --alias example_0 --scenario case141_3min_final --qweight 0.01 --voltage-barrier-type l2 --test-mode test_data

The meanings of the arguments:

--alg is the MARL algorithm, e.g. maddpg, matd3, icstransmaddpg, icstransmatd3.
--alias is the alias to distinguish different experiments.
--scenario is the power system on which you like to train, e.g. case141_3min_final, case322_3min_final.
--qweight is the q_weight used in training. We recommend 0.01 for case141 and 0.1 for case322.
--voltage-barrier-type is the voltage barrier function in training, e.g. l1, l2, bowl.
--save-path is the path to save the model and configures.
--test-mode is the test mode, e.g. single, test_data. test_data means use test_data.csv to evaluate algorithms.
--test-day is the day that you would like to do the test. Note that it is only activated if the --test-mode is single.
--render indicates activating the rendering of the environment.

cjdjr / t-maac Goto Github PK

t-maac's Introduction

T-MAAC

Installation

Downloading the Dataset

Running experiments

Training

Testing

t-maac's People

Contributors

Stargazers

Watchers

Forkers

t-maac's Issues

Big differences between training and testing curves

Agent Observation

How to get the testing results for Spring, Summer, Fall, and Winter

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent