Home > Department of Statistics > Events > abstracts > Variable selection using Adaptive Non-linear Interaction Structures in High dimensions

 

Variable selection using Adaptive Non-linear Interaction Structures in High dimensions

When 2.00pm on Friday 22nd October 2010
Where B617, Leverhulme Library, Columbia House
Presentations  
Speaker Gareth James
From University of Southern California
Abstract

Numerous penalization based methods have been proposed for fitting a traditional linear regression model in which the number of predictors, p, is large relative to the number of observations, n. Most of these approaches assume sparsity in the underlying coefficients and perform some form of variable selection. Recently, some of this work has been extended to non-linear additive regression models. However, in many contexts one wishes to allow for the possibility of interactions among the predictors. This poses serious statistical and computational difficulties when p is large, as the number of candidate interaction terms is of order p squared.

In this talk I will introduce a new approach, "Variable selection using Adaptive Non-linear Interaction Structures in High dimensions" (VANISH), that is based on a penalized least squares criterion and is designed for high dimensional non-linear problems. The criterion is convex and enforces the heredity constraint, in other words if an interaction term is added to the model, then the corresponding main effects are automatically included. Detailed simulation results will also be provided, demonstrating that VANISH is computationally efficient and can be applied to non-linear models involving thousands of terms while producing superior predictive performance over other approaches. Time permitting, I will also discuss theoretical conditions under which VANISH can be shown to select the correct main effects and interactions.

These conditions suggest that VANISH should outperform certain natural competitors when the true interaction structure is sufficiently sparse.

This is joint work with Peter Radchenko.

For further information Postgraduate Administrator Ext. 6879
Department of Statistics, Columbia House
Share:Facebook|Twitter|LinkedIn|