连续动作空间搜寻的离线强化学习轧钢机械性能软测量算法
Continuous action space exploration enabled offline reinforcement learning for mechanical property soft sensing in steel rolling processes