数学科学学院

The power of depth in deep Q-Learning

来源:数学科学学院 发布时间:2020-10-19   506

报告人:林绍波 教授(西安交通大学)

时间:2020年10月22日(星期四)15: 30-16: 30

地点:腾讯会议

会议ID:118 601 263

会议密码:310027

摘要:With the help of massive data and rich computational resource, deep Q-learning has been widely used in operations research and management science and receives great success in numerous applications including, recommender system, games and robotic manipulation. Compared with avid research activities in practice, there lack solid theoretical verifications and interpretability for the success of deep Q-learning, making it be a little bit mystery. The aim of this talk is to discuss the power of depth in deep Q-learning. In the framework of learning theory, we rigorously prove that deep Q-learning outperforms the traditional one by showing its good generalization error bound.  Our results show that the main reason of the success of deep Q-learning is due to the excellent performance of  deep neural networks (deep nets) in capturing special properties of rewards such as the spatially sparse and piecewise constant rather than due to their large capacities. In particular, we provide answers to questions why and when deep Q-learning performs better than the traditional one and how about the generalization capability of deep Q-learning. 

联系人:郭正初(guozhengchu@zju.edu.cn


Copyright © 2023 浙江大学数学科学学院    版权所有

    浙ICP备05074421号

技术支持: 寸草心科技     管理登录

    您是第 1000 位访问者