首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Stochastic chaining and strengthened information-theoretic generalization bounds
Institution:1. School of Electronics and Communication Engineering, Sun Yat-sen University, Shenzhen, 518107 China;2. Guangxi Key Lab of Multi-source Information Mining & Security, Guangxi Normal University, Guilin, 541004 China;3. Shenzhen Key Laboratory of Navigation and Communication Integration, Shenzhen, 518107 China;1. School of Automation, Southeast University, Nanjing 210096, China;2. Department of Materials Science and Engineering, Harbin Institute of Technology, Weihai, 264209, China;3. College of Artificial Intelligence, Nankai University, Tianjin 300350, China;4. School of Computer Science, Northwestern Polytechnical University, Xi’an 710129, China;1. College of Electrical Engineering and Control Science, Nanjing Tech University, Nanjing 211816, China;2. School of Automation, Southeast University, Nanjing 210096, China;3. Department of Electrical Engineering, Yeungnam University, 280 Daehak-Ro, Kyonsan 38541, South Korea;1. College of Electrical and Information Engineering, Hunan University of Technology, Zhuzhou, Hunan, 412007, China;2. Guangxi Power Grid Company Guilin Power Supply Bureau, Guilin,Guangxi, 541000, China;1. Department of Control Science and Engineering, Tongji University, Shanghai, 201804, China;2. Shanghai Institute of Intelligent Science and Technology, Tongji University, Shanghai, 200092, China
Abstract:We propose a new approach to apply the chaining technique in conjunction with information-theoretic measures to bound the generalization error of machine learning algorithms. Different from the deterministic chaining approach based on hierarchical partitions of a metric space, previously proposed by Asadi et al., we propose a stochastic chaining approach, which replaces the hierarchical partitions with an abstracted Markovian model borrowed from successive refinement source coding. This approach has three benefits over deterministic chaining: (1) the metric space is not necessarily bounded, (2) facilitation of subsequent analysis to yield more explicit bound, and (3) further opportunity to optimize the bound by removing the geometric rigidity of the partitions. The proposed approach includes the traditional chaining as a special case, and can therefore also utilize any deterministic chaining construction. We illustrate these benefits using the problem of estimating Gaussian mean and that of phase retrieval. For the former, we derive a bound that provides an order-wise improvement over previous results, and for the latter we provide a stochastic chain that allows optimization over the chaining parameter.
Keywords:
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号