Estimation in semiparametric models with missing data munich. Semiparametric estimation of treatment effect in a pretestposttest study with missing data. Semiparametric theory and missing data springer series in statistics anastasios tsiatis. Semiparametric theory and missing data researchgate.
Missing data often appear as a practical problem while applying classical models in the statistical analysis. This paper investigates a class of estimation problems of the semiparametric model with missing data. In this paper, we consider a semiparametric regression model in the presence of missing covariates for nonparametric components under a bayesian framework. Introduction to semiparametric models springerlink. Kriging regression imputation method to semiparametric model.
Both estimators are semiparametric as they do not require any model assumptions regarding the missing data mechanism nor the speci. Values in a data set are missing completely at random mcar if the events that lead to any particular dataitem being missing are independent both of observable variables and of unobservable parameters of interest, and occur entirely at random. V \displaystyle \theta \subseteq \mathbb r k\times v, where v \displaystyle v is an infinitedimensional space. This book summarizes current knowledge regarding the theory of estimation for semiparametric models with missing data, in an organized and. Semiparametric regression analysis under imputation for.
Semiparametric models allow at least part of the datagenerating process to be unspecified and unrestricted, and can often yield robust estimators that nonetheless behave similarly to those based. The proposed model contains one smooth term and a set of possible linear predictors. Sep 15, 2017 semiparametric models allow at least part of the data generating process to be unspecified and unrestricted, and can often yield robust estimators that nonetheless behave similarly to those based. Semiparametric theory and missing data by tsiatis, a. Semiparametric theory and missing data book summary. Tsiatis and others published semiparametric theory and missing data find, read and cite all the research you need on researchgate. There are of course many other good ones not listed. Asymptotic theory for the semiparametric accelerated. To remove this serious limitation on the methodology, we. We introduce below novel bounded influence function estimators. Semiparametric models allow at least part of the data generating process to be unspecified and unrestricted, and can often yield robust estimators that nonetheless behave similarly to those based.
With a semiparametric model, the parameter has both a finitedimensional component and an infinitedimensional component often a realvalued function defined on the real line. We then establish the consistency, asymptotic normality, and semiparametric efficiency of the resulting estimators for the regression parameters by appealing to modern empirical process theory. On weighting approaches for missing data lingling li. When data are mcar, the analysis performed on the data is unbiased. Semiparametric theory and missing data springerlink. Strategies for bayesian modeling and sensitivity analysis m. Methods for estimating parameters with missing or coarsened data in as e. In this article, we consider a general rankbased estimating method for model 1.
In statistics, a semiparametric model is a statistical model that has parametric and nonparametric components a statistical model is a parameterized family of distributions. Efficiency bounds, multiple robustness and sensitivity analysis. Semiparametric theory for causal mediation analysis. Semiparametric estimation of logistic regression model. The main difficulty in solving such problems is that the missing probability and. Aug 04, 2019 during the past few decades, missing data problems have been studied extensively, with a focus on the ignorable missing case, where the missing probability depends only on observable quantities.
Semiparametric theory and missing data springer series in statistics kindle edition by anastasios tsiatis. Search semiparametric theory and missing data pdf ebook for download and read online. Despite the popularity of this design, a consensus on an appropriate analysis when no data are missing, let alone for. The following are some the books on survival analysis that i have found useful. A semiparametric regression imputation estimator, a marginal average estimator and a marginal propensity score weighted estimator are defined.
A normal semiparametric mixture regression model is proposed for longitudinal data. Semiparametric regression analysis with missing response. A common method for handling missing data in a large data set is to impute i. Covariates are usually introduced in the models to partially explain interindividual variations. By adopting nonparametric components for the model, the estimation method can be made robust. A semiparametric estimation of mean functionals with. Semiparametric modelbased inference in the presence of. Model terms are estimated using the penalized likelihood method with the emalgorithm. This book combines much of what is known in regard to the theory of estimation for semiparametric models with missing data in an organized and comprehensive manner. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Values in a data set are missing completely at random mcar if the events that lead to any particular data item being missing are independent both of observable variables and of unobservable parameters of interest, and occur entirely at random. Use pdf download to do whatever you like with pdf files on the web and regain control web to pdf convert any web pages to high quality. Calibration estimation of semiparametric copula models.
Classical semiparametric inference with missing outcome data is not robust to contamination of the observed data and a single observation can have arbitrarily large influence on estimation of a parameter of interest. Analysis of semiparametric regression models for repeated. We propose a nonparametric imputation method for the missing values, which then leads to imputed. The main difficulty in solving such problems is that the missing probability and the regression likelihood function are. A semiparametric mixture regression model for longitudinal. Maximum likelihood for general patterns of missing data. Maximum likelihood estimation for multivariate normal examples, ignoring the missing data mechanism. The asymptotic distribution theory is developed under the assumption that all covariate. In the past 20 years or so, there has been a serious attempt. A semiparametric inference to regression analysis with. The semiparametric models allow for estimating functions that are nonsmooth with respect to the parameter.
Xuejournalofmultivariateanalysis1022011723740 727 linearmodelandthesingleindexmodel,arealsoaniceselection. Semiparametric theory and fundamentals of missing data. The ipw methods rely on the intuitive idea of creating a pseudopopulation of weighted copies of the complete cases to remove selection bias introduced by the missing data. C mathematical and quantitative methods c7 game theory and bargaining theory. Semiparametric theory and missing data anastasios tsiatis. The theory of missing data applied to semiparametric models is scattered throughout the literature with no thorough comprehensive treatment of the subject. Semiparametric efficiency in gmm models with auxiliary data. Semiparametric theory and missing data missing data arise in almost all scienti.
It starts with the study of semiparametric methods when there are no missing data. We propose a class of calibration estimators for the nonparametric marginal. During the past few decades, missingdata problems have been studied extensively, with a focus on the ignorable missing case, where the missing probability depends only on observable quantities. This comprehensive monograph offers an indepth look at the associated theory. Kriging regression imputation method to semiparametric. Semiparametric estimation of logistic regression model with. The description of the theory of estimation for semiparametric models is both rigorous and intuitive, relying on geometric ideas to reinforce the intuition and understanding of the theory.
This sensitivity is exacerbated when inverse probability weighting methods are used, which may overweight contaminated observations. Calibration estimation of semiparametric copula models with. Parameter estimation in parametric regression models with missing covariates is considered under a survey sampling setup. Semiparametric theory and missing data pdf free download. This book summarizes current knowledge regarding the theory of estimation for semiparametric models with missing data, in an organized and comprehensive manner. Identification relies on auxiliary data containing information about the distribution of the missing variables conditional on proxy variables that are observed in both the primary and the auxiliary database, when such distribution is common to the two. In addition, we show through extensive simulation studies that the proposed methods perform well in realistic situations. The theory and methods for measurement errors and missing. Maximum likelihood estimation for multivariate normal examples, ignoring the missingdata mechanism. In order to overcome the robust defect of traditional complete data estimation method and regression imputation estimation technique, we propose a modified imputation estimation approach called krigingregression imputation. We consider a class of doubly weighted rankbased estimating methods for the transformation or accelerated failure time model with missing data as arise, for. Semiparametric nonlinear mixedeffects nlme models are flexible for modelling complex longitudinal data. In many cases, the treatment of missing data in an analysis is carried out in a casual and adhoc manner, leading, in many cases, to invalid inference and erroneous conclusions.
Missing data arise in almost all scientific disciplines. In this paper, based on the exponential tilting model, we propose a semiparametric estimation method of mean functionals with nonignorable missing data. Moreover, the responses may be missing and the missingness may be nonignorable. In order to derive asymptotic properties for singleindex models. A semiparametric regression imputation estimator, a marginal. To remove this serious limitation on the methodology. Get your kindle here, or download a free kindle reading app.
Search semiparametric theory and missing data pdf ebook for. All the estimators are proved to be asymptotically normal, with the same asymptotic variance. A semiparametric mixture regression model for longitudinal data. We propose a nonparametric imputation method for the missing values, which then leads to imputed estimating equations for the finite dimensional parameter of interest. Some covariates, however, may be measured with substantial errors. This paper considers the problem of parameter estimation in a general class of semiparametric models when observations are subject to missingness at random. The application to missing data is also clearly of great interest. Henceforth, i refer to this model as the semiparametric missing data model or the missing at random mar setup. Semiparametric theory and missing data springer series in. Models for pertilliy classified contingency tables, ignoring the missingdata mechanism. Empirical likelihood for semiparametric regression model.
Under missingness at random, a semiparametric maximum likelihood approach is proposed which requires no parametric specification of the marginal covariate distribution. This paper investigates the estimation of semiparametric copula models with data missing at random. Semiparametric inverse propensity weighting for nonignorable. We develop inference tools in a semiparametric regression model with missing response data. These methods are then applied to problems with missing, censored, and coarsened data with the goal of deriving estimators that are as robust and efficient. We study semiparametric efficiency bounds and efficient estimation of parameters defined through general moment restrictions with missing data. Abstract we develop inference tools in a semiparametric partially linear regression model with missing response data. Missing available for download and read online in other formats. In order to overcome the robust defect of traditional complete data estimation method and regression imputation estimation technique, we propose a modified imputation estimation approach. A process point of view, by aalen, borgan and gjessing. Efficient and adaptive estimation for semiparametric models. By contrast, research into nonignorable missing data problems is quite limited. Semiparametric theory and missing yg262322020 adobe acrobat reader dcdownload adobe.
Semiparametric theory and missing data springer series in statistics series by anastasios tsiatis. Journal of the american statistical association volume 90, 1995 issue 429. Estimation in semiparametric models with missing data. This treatment will give the reader a deep understanding of the underlying theory for missing and coarsened data. Semiparametric modelling is, as its name suggests, a hybrid of the parametric and nonparametric approaches to construction, fitting, and validation of statistical models. We consider a semiparametric model that parameterizes the conditional density of the response, given covariates, but allows the marginal distribution of the covariates to be completely arbitrary. Semiparametric theory and missing data pdf free download epdf. Modelling survival data in medical research, by collett 2nd edition 2003 survival and event history analysis. A computationally feasible alternative method that provides an ap. In many cases, the treatment of missing data in an analysis is carried out in a casual and.
The maximum pseudolikelihood estimation of genest et al. Fullsemiparametriclikelihoodbased inference for non. The geometric ideas for semiparametric fulldata models are extended to missingdata models. Models for pertilliy classified contingency tables, ignoring the missing data mechanism. Kosorok 2007, as well as many modern developments in missing data and causal. A semiparametric logistic regression model is assumed for the response probability and a nonparametric regression approach for missing data discussed in cheng 1994 is used in the estimator. We propose a nonparametric imputation method for the missing values, which then.
787 481 185 878 148 1519 632 1280 1408 449 1650 1539 1395 1423 1424 824 1518 1136 987 465 157 169 588 547 164 802 353 534 1105 423 86 780 847 1036 378 248 1206 814 262