A Hybrid Imitation-Reinforcement Learning Framework for Optimal Operation of Soft Open Points in Unbalanced Distribution Networks
This paper proposes a hybrid deep actor-critic framework for the optimal operation of a phase-changing soft open point (PCSOP) in an unbalanced distribution network. The framework combines algorithmic features of off-policy reinforcement learning and imitation learning. The proposed method comprises a policy-guiding module based on the PCSOP physics and an adaptive dynamic experience replay buffer.…