Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions
Fan, Jianqing1,2; Li, Quefeng3; Wang, Yuyan1,4
刊名JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY
2017
卷号79期号:1页码:247-265
关键词High dimension Huber loss M-estimator Optimal rate Robust regularization
ISSN号1369-7412
DOI10.1111/rssb.12166
英文摘要Data subject to heavy-tailed errors are commonly encountered in various scientific fields. To address this problem, procedures based on quantile regression and least absolute deviation regression have been developed in recent years. These methods essentially estimate the conditional median (or quantile) function. They can be very different from the conditional mean functions, especially when distributions are asymmetric and heteroscedastic. How can we efficiently estimate the mean regression functions in ultrahigh dimensional settings with existence of only the second moment? To solve this problem, we propose a penalized Huber loss with diverging parameter to reduce biases created by the traditional Huber loss. Such a penalized robust approximate (RA) quadratic loss will be called the RA lasso. In the ultrahigh dimensional setting, where the dimensionality can grow exponentially with the sample size, our results reveal that the RA lasso estimator produces a consistent estimator at the same rate as the optimal rate under the light tail situation. We further study the computational convergence of the RA lasso and show that the composite gradient descent algorithm indeed produces a solution that admits the same optimal rate after sufficient iterations. As a by-product, we also establish the concentration inequality for estimating the population mean when there is only the second moment. We compare the RA lasso with other regularized robust estimators based on quantile regression and least absolute deviation regression. Extensive simulation studies demonstrate the satisfactory finite sample performance of the RA lasso.
资助项目National Science Foundation[DMS-1206464] ; National Science Foundation[DMS-1406266] ; National Institutes of Health[R01-GM072611-9] ; National Institutes of Health[R01-GM100474-4]
WOS研究方向Mathematics
语种英语
出版者WILEY-BLACKWELL
WOS记录号WOS:000392486000012
内容类型期刊论文
源URL[http://ir.amss.ac.cn/handle/2S8OKBNM/24610]  
专题中国科学院数学与系统科学研究院
通讯作者Fan, Jianqing
作者单位1.Princeton Univ, Princeton, NJ 08544 USA
2.Acad Math & Syst Sci, Beijing, Peoples R China
3.Univ N Carolina, Chapel Hill, NC USA
4.Princeton Univ, Princeton, NJ USA
推荐引用方式
GB/T 7714
Fan, Jianqing,Li, Quefeng,Wang, Yuyan. Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions[J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY,2017,79(1):247-265.
APA Fan, Jianqing,Li, Quefeng,&Wang, Yuyan.(2017).Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions.JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY,79(1),247-265.
MLA Fan, Jianqing,et al."Estimation of high dimensional mean regression in the absence of symmetry and light tail assumptions".JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY 79.1(2017):247-265.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace