Abstract: In steel energy systems, an optimal assignment of multiple energy media of gas, electricity and steam is crucial. Reinforcement learning (RL) becomes an effective approach to realize dynamic ...