Project title: High-dimensional variable selection problems with applications to genetics
Our current ability to measure massive numbers of variables, for example in the biological sciences (millions of DNA variants), in personal health applications (thousands of time points in activity monitoring), or in customer behavior modeling (credit card usage history), calls for efficient ways to separate the important variables from the unimportant ones. In this project, we study such variable selection methods when variables have been measured incompletely, and we implement new ways to account for the missing information in our statistical inference. Our motivating example is a large study on migraine genetics, where these new variable selection methods are needed to assess which genetic variants, out of thousands of candidates, are biologically connected to migraine susceptibility.