Jiang Du

Assistant Professor
College of Applied Sciences
Beijing University of Technology

Home

Research

CV

Code

Teaching

Personal

Social
Email
GitHub

Contact:
College of Applied Sciences
Beijing University of Technology
No. 100 Pingle Yuan, Chaoyang District, Beijing 100124
P. R. China

Published & Forthcoming Papers

* represents the corresponding author.

Estimation for generalized partially functional linear additive regression model
Jiang Du, Ruiyuan Cao*, Eddy Kwessi & Zhongzhan Zhang.
Journal of Applied Statistics, 2019, 46(5), pp. 914-925.

In practice, it is not uncommon to encounter the situation that a discrete response is related to both a functional random variable and multiple real-value random variables whose impact on the response is nonlinear. In this paper, we consider the generalized partial functional linear additive models (GPFLAM) and present the estimation procedure. In GPFLAM, the nonparametric functions are approximated by polynomial splines and the infinite slope function is estimated based on the principal component basis function approximations. We obtain the estimator by maximizing the quasi-likelihood function. We investigate the finite sample properties of the estimation procedure via Monte Carlo simulation studies and illustrate our proposed model by a real data analysis.

Dynamic partially functional linear regression model
Jiang Du*, Hui Zhao, Zhongzhan Zhang.
Journal of the Korean Statistical Society, 2019, In press.

This paper considers the nonparametric inverse probability weighted estimation for functional data with missing response data at random. Under mild conditions, the asymptotic properties of the proposed estimation method are established. Based on the resampling method, the estimation of the asymptotic variance of the proposed estimator is obtained. Finally, the finite sample properties of the proposed estimation method are investigated via Monte Carlo simulation studies. A real data analysis is given to illustrate the use of the proposed method.

A nonparametric inverse probability weighted estimation for functional data with missing response data at random
Longbing Wang, Ruiyuan Cao, Jiang Du & Zhongzhan Zhang*.
Statistical Methods and Applications, 2019, In press.

In this paper, we develop a dynamic partially functional linear regression model in which the functional dependent variable is explained by the first order lagged functional observation and a finite number of real-valued variables. The bivariate slope function is estimated by bivariate tensor-product B-splines. Under some regularity conditions, the large sample properties of the proposed estimators are established. We investigate the finite sample performance of the proposed methods via Monte Carlo simulation studies, and illustrate its usefulness by the analysis of the electricity consumption data.

FPCA-based estimation for generalized functional partially linear models
Ruiyuan Cao, Jiang Du*, Jianjun Zhou & Tianfa Xie.
Statistical Papers, 2019, In press.

In real data analysis, practitioners frequently come across the case that a discrete response will be related to both a function-valued random variable and a vector-value random variable as the predictor variables. In this paper, we consider the generalized functional partially linear models (GFPLM). The infinite slope function in the GFPLM is estimated by the principal component basis function approximations. Then, we consider the theoretical properties of the estimator obtained by maximizing the quasi likelihood function. The asymptotic normality of the estimator of the finite dimensional parameter and the rate of convergence of the estimator of the infinite dimensional slope function are established, respectively. We investigate the finite sample properties of the estimation procedure via Monte Carlo simulation studies and a real data analysis.

Estimation and variable selection for partially functional linear models
Jiang Du, Dengke Xu, Ruiyuan Cao*.
Journal of the Korean Statistical Society, 2018, 47(4), pp. 436-449.

In this paper, a new estimation procedure based on composite quantile regression and functional principal component analysis (PCA) method is proposed for the partially functional linear regression models (PFLRMs). The proposed estimation method can simultaneously estimate both the parametric regression coefficients and functional coefficient components without specification of the error distributions. The proposed estimation method is shown to be more efficient empirically for non-normal random error, especially for Cauchy error, and almost as efficient for normal random errors. Furthermore, based on the proposed estimation procedure, we use the penalized composite quantile regression method to study variable selection for parametric part in the PFLRMs. Under certain regularity conditions, consistency, asymptotic normality, and Oracle property of the resulting estimators are derived. Simulation studies and a real data analysis are conducted to assess the finite sample performance of the proposed methods.

Model averaging for M-estimation
Jiang Du*, Zhongzhan Zhang, Tianfa Xie.
Statistics, 2018, 52(6), pp. 1417-1432.

M-estimation is a widely used technique for robust statistical inference. In this paper, we study model selection and model averaging for M-estimation to simultaneously improve the coverage probability of confidence intervals of the parameters of interest and reduce the impact of heavy-tailed errors or outliers in the response. Under general conditions, we develop robust versions of the focused information criterion and a frequentist model average estimator for M-estimation, and we examine their theoretical properties. In addition, we carry out extensive simulation studies as well as two real examples to assess the performance of our new procedure, and find that the proposed method produces satisfactory results.

Estimation for the censored partially linear quantile regression models
Jiang Du*, Zhongzhan Zhang, Dengke Xu.
Communications in Statistics: Simulation and Computation, 2018, 47(8), pp. 2393-2408.

In this article, we develop estimation procedures for partially linear quantile regression models, where some of the responses are censored by another random variable. The nonparametric function is estimated by basis function approximations. The estimation procedure is easy to implement through existing weighted quantile regression, and it requires no specification of the error distributions. We show the large-sample properties of the resulting estimates, the proposed estimator of the regression parameter is root-n consistent and asymptotically normal and the estimator of the functional component achieves the optimal convergence rate of the nonparametric function. The proposed method is studied via simulations and illustrated with the analysis of a primary biliary cirrhosis (BPC) data.

Model averaging based on rank
Jiang Du*, Xiuping Chen, Eddy Kwessi & Zhimeng Sun.
Journal of Applied Statistics, 2018, 45(10), pp. 1900-1919.

In this paper, we investigate model selection and model averaging based on rank regression. Under mild conditions, we propose a focused information criterion and a frequentist model averaging estimator for the focused parameters in rank regression model. Compared to the least squares method, the new method is not only highly efficient but also robust. The large sample properties of the proposed procedure are established. The finite sample properties are investigated via extensive Monte Claro simulation study. Finally, we use the Boston Housing Price Dataset to illustrate the use of the proposed rank methods.

A weighted M-estimator for linear regression models with randomly truncated data
Jiang Du*, Zhongzhan Zhang, Tianfa Xie.
Statistics and Probability Letters, 2018, 138, pp. 90-94.

This paper considers M-estimation for randomly truncated data. We propose a new estimation for left truncated data, and establish the sample properties of the proposed estimator. Finite sample performance of the proposed estimator is investigated via simulation studies.

Statistical inference for partially linear additive spatial autoregressive models
Jiang Du*, Xiaoqian Sun, Ruiyuan Cao & Zhongzhan Zhang.
Spatial Statistics, 2018, 25, pp. 52-67.

In this paper, a class of partially linear additive spatial autoregressive models (PLASARM) is studied. With the nonparametric functions approximated by basis functions, we propose a generalized method of moments estimator for PLASARM. Under mild conditions, we obtain the asymptotic normality for the finite parametric vector and the optimal convergence rate for nonparametric functions. In order to make statistical inference for parametric component, we propose the estimator for asymptotic covariance matrix of the parameter estimator and establish the asymptotic properties for the resulting estimators. Finite sample performance of the proposed method is assessed by Monte Carlo simulation studies, and the developed methodology is illustrated by an analysis of the Boston housing price data.

Variable selection for spatial autoregressive models with a diverging number of parameters
Tianfa Xie, Ruiyuan Cao*, Jiang Du.
Statistical Papers, 2018, pp. 1-21. In press.

Variable selection has played a fundamental role in regression analysis. Spatial autoregressive model is a useful tool in econometrics and statistics in which context variable selection is necessary but not adequately investigated. In this paper, we consider conducting variable selection in spatial autoregressive models with a diverging number of parameters. Smoothly clipped absolute deviation penalty is considered to obtain the estimators. Moreover the dimension of the covariates are allowed to vary with sample size. In order to attenuate the bias caused by endogeneity, instrumental variable is adopted in the estimation procedure. The proposed method can do parametric estimation and variable selection simultaneously. Under mild conditions, we establish the asymptotic and oracle property of the proposed estimators. Finally, the performance of the proposed estimation procedure is examined via Monte Carlo simulation studies and a data set from a Boston housing price is analyzed as an illustrative example.

Single-index partially functional linear regression model
Ping Yu, Jiang Du, Zhongzhan Zhang*.
Statistical Papers, 2018, pp. 1-17. In press.

In this paper, we propose a flexible single-index partially functional linear regression model, which combines single-index model with functional linear regression model. All the unknown functions are estimated by B-spline approximation. Under some mild conditions, the convergence rates and asymptotic normality of the estimators are obtained. Finally, simulation studies and a real data analysis are conducted to investigate the performance of the proposed methodologies.

Varying-coefficient partially functional linear quantile regression models
Ping Yu, Jiang Du, Zhongzhan Zhang*.
Journal of the Korean Statistical Society, 2017, 46(3), pp. 462-475.

In this paper, we introduce a new varying-coefficient partially functional linear quantile regression model, which combines varying-coefficient quantile regression model with functional linear quantile regression model. The functional principal component basis and regression splines are employed to estimate the slope function and varying-coefficient functions, respectively, and the convergence rates of the estimators are obtained under some regularity conditions. Simulations and an illustrative real example are presented.

Focused information criterion and model averaging in censored quantile regression
Jiang Du*, Zhongzhan Zhang, Tianfa Xie.
Metrika, 2017, 80(5), pp. 547-570.

In this paper, we study model selection and model averaging for quantile regression with randomly right censored response. We consider a semi-parametric censored quantile regression model without distribution assumptions. Under general conditions, a focused information criterion and a frequentist model averaging estimator are proposed, and theoretical properties of the proposed methods are established. The performances of the procedures are illustrated by extensive simulations and the primary biliary cirrhosis data.

M-Estimation for partially functional linear regression model based on splines
Jianjun Zhou, Jiang Du*, Zhimeng Sun.
Communications in Statistics - Theory and Methods, 2016, 45(21), pp. 6436-6446.

M-estimation is a widely used technique for robust statistical inference. In this paper, we study robust partially functional linear regression model in which a scale response variable is explained by a function-valued variable and a finite number of real-valued variables. For the estimation of the regression parameters, which include the infinite dimensional function as well as the slope parameters for the real-valued variables, we use polynomial splines to approximate the slop parameter. The estimation procedure is easy to implement, and it is resistant to heavy-tailederrors or outliers in the response. The asymptotic properties of the proposed estimators are established. Finally, we assess the finite sample performance of the proposed method by Monte Carlo simulation studies.

A test of linearity in partial functional linear regression
Ping Yu, Zhongzhan Zhang*, Jiang Du.
Metrika, 2016, 79(8), pp. 953-969.

This paper investigates the hypothesis test of the parametric component in partial functional linear regression. We propose a test procedure based on the residual sums of squares under the null and alternative hypothesis, and establish the asymptotic properties of the resulting test. A simulation study shows that the proposed test procedure has good size and power with finite sample sizes. Finally, we present an illustration through fitting the Berkeley growth data with a partial functional linear regression model and testing the effect of gender on the height of kids.

Variable Selection for Semiparametric Partially Linear Covariate-Adjusted Regression Models
Jiang Du, Gaorong Li, Heng Peng*.
Communications in Statistics - Theory and Methods, 2015, 44(13), pp. 2809-2826.

In this article, the partially linear covariate-adjusted regression models are considered, and the penalized least-squares procedure is proposed to simultaneously select variables and estimate the parametric components. The rate of convergence and the asymptotic normality of the resulting estimators are established under some regularization conditions. With the proper choices of the penalty functions and tuning parameters, it is shown that the proposed procedure can be as efficient as the oracle estimators. Some Monte Carlo simulation studies and a real data application are carried out to assess the finite sample performances for the proposed method.

Skew-normal semiparametric varying coefficient model and score test
Dengke Xu, Zhongzhan Zhang*, Jiang Du.
Journal of Statistical Computation and Simulation, 2015, 85(2), pp. 216-234.

In this paper, we consider inference aspects of skew-normal semiparametric varying coefficient models which provide a useful extension of the normal regression models. The maximum likelihood estimation based on B-spline is proposed. Further, we discuss the score test for homogeneity of the variance in skew-normal semiparametric varying coefficient models. Their asymptotical properties are investigated. Some simulated examples are used to examine our proposed methods.

Variable selection for covariate adjusted regression model
Xuejing Li, Jiang Du, Gaorong Li* & Mingzhi Fan.
Journal of Systems Science and Complexity, 2014, 27(6), pp. 1227-1246.

This paper employs the SCAD-penalized least squares method to simultaneously select variables and estimate the coefficients for high-dimensional covariate adjusted linear regression models. The distorted variables are assumed to be contaminated with a multiplicative factor that is determined by the value of an unknown function of an observable covariate. The authors show that under some appropriate conditions, the SCAD-penalized least squares estimator has the so called “oracle property”. In addition, the authors also suggest a BIC criterion to select the tuning parameter, and show that BIC criterion is able to identify the true model consistently for the covariate adjusted linear regression models. Simulation studies and a real data are used to illustrate the efficiency of the proposed estimation algorithm.

Focused Information Criterion and Model Averaging in Quantile Regression
Jiang Du, Zhongzhan Zhang*, Tianfa Xie.
Communications in Statistics - Theory and Methods, 2013, 42(20), pp. 3716-3734.

In this article, we study model selection and model averaging in quantile regression. Under general conditions, we develop a focused information criterion and a frequentist model average estimator for the parameters in quantile regression model, and examine their theoretical properties. The new procedures provide a robust alternative to the least squares method or likelihood method, and a major advantage of the proposed procedures is that when the variance of random error is infinite, the proposed procedure works beautifully while the least squares method breaks down. A simulation study and a real data example are presented to show that the proposed method performs well with a finite sample and is easy to use in practice.

Semiparametric analysis of isotonic errors-in-variables regression models with randomly right censored response
Zhimeng Sun*, Zhongzhan Zhang, Jiang Du.
Journal of Systems Science and Complexity, 2013, 26(3), pp. 441-461.

This paper considers the estimation of a semiparametric isotonic regression model when the covariates are measured with additive errors and the response is randomly right censored by a censoring time. The authors show that the proposed estimator of the regression parameter is rootn consistent and asymptotically normal. The authors also show that the isotonic estimator of the functional component, at a fixed point, is cubic root-n consistent and converges in distribution to the slope at zero of the greatest convex minorant of the sum of a two-sided standard Brownian motion and the square of the time parameter. A simulation study is carried out to investigate the performance of the estimators proposed in this article.

VARIABLE SELECTION FOR PARTIALLY LINEAR VARYING COEFFICIENT QUANTILE REGRESSION MODEL
Jiang Du, Zhongzhan Zhang, Zhimeng Sun*.
International Journal of Biomathematics, 2013, 6(3),1350015.

In this paper, we propose a variable selection procedure for partially linear varying coefficient model under quantile loss function with adaptive Lasso penalty. The functional coefficients are estimated by B-spline approximations. The proposed procedure simultaneously selects significant variables and estimates unknown parameters. The major advantage of the proposed procedures over the existing ones is easy to implement using existing software, and it requires no specification of the error distributions. Under the regularity conditions, we show that the proposed procedure can be as efficient as the Oracle estimator, and derive the optimal convergence rate of the functional coefficients. A simulation study and a real data application are undertaken to assess the finite sample performance of the proposed variable selection procedure.

M-estimation for the partially linear regression model under monotonic constraints
Jiang Du*, Zhimeng Sun, Tianfa Xie.
Statistics and Probability Letters, 2013, 83(5), pp. 1353-1363.

In this paper, we study M-estimation for the partially linear model under monotonic constraints. We use monotone B-splines to approximate the monotone nonparametric function. We show the large sample properties of the resulting estimators. The proposed estimator of parameter part is root-n consistent, and asymptotically normal and the estimator for the nonparametric component achieves the optimal convergence rate. A simulation study is conducted to evaluate the finite sample performance of the method. The proposed procedure is illustrated by an air pollution study.

Semiparametric Analysis of Isotonic Errors-in-Variables Regression Models with Missing Response
Zhimeng Sun*, Zhongzhan Zhang, Jiang Du.
Communications in Statistics - Theory and Methods, 2012, 41(11), pp. 2034-2060.

This article is concerned with the estimation problem in the semiparametric isotonic regression model when the covariates are measured with additive errors and the response is missing at random. An inverse marginal probability weighted imputation approach is developed to estimate the regression parameters and a least-square approach under monotone constraint is employed to estimate the functional component. We show that the proposed estimator of the regression parameter is root-n consistent and asymptotically normal and the isotonic estimator of the functional component, at a fixed point, is cubic root-n consistent. A simulation study is conducted to examine the finite-sample properties of the proposed estimators. A data set is used to demonstrate the proposed approach.

Variable Selection for Semiparametric Isotonic Regression Models
Jiang Du, Zhongzhan Zhang, Tianfa Xie.
Advances in Intelligent and Soft Computing, 2011, 100, pp. 525-532.

In this paper, we propose a penalized constrained least squares method for variable selection in semiparametric isotonic regression model. Under certain regularity conditions, asymptotic properties of the proposed estimators are established. A simulation study is presented for illustrations.


Papers under Revision to Resubmit


Working Papers


Works in Progress