Monte Carlo likelihood calculation for identity by descent data
(1999) Two individuals are identical by descent at a genetic locus if they share the same gene copy at that locus due to inheritance from a recent common ancestor. Identity by descent can be thought of as a continuous process ... 
Nonparametric inference on monotone functions, with applications to observational studies
In this dissertation, we study general strategies for constructing nonparametric monotone function estimators in two broad statistical settings. In the first setting, a sensible initial estimator of the monotone function ... 
Parameter Identification and Assessment of Independence in Multivariate Statistical Modeling
We are interested in the extent to which possibly causal relationships can be statistically quantified from multivariate data obtained from a system of random variables. In the ideal setting, we would begin with refined ... 
Phylogenetic Stochastic Mapping
Phylogenetic stochastic mapping is a method for reconstructing the history of trait changes on a phylogenetic tree relating species/organisms carrying the trait. State-of-the-art methods assume that the trait evolves ... 
Portfolio Optimization with Tail Risk Measures and Non-Normal Returns
(2010-08-20) The traditional Markowitz mean-variance portfolio optimization theory uses volatility as the sole measure of risk. However, volatility is flawed both intuitively and theoretically: being symmetric it does not differentiate ... 
Predictive Modeling of Cholera Outbreaks in Bangladesh
Despite seasonal cholera outbreaks in Bangladesh, little is known about the relationship between environmental conditions and cholera cases. We seek to develop a predictive model for cholera outbreaks in Bangladesh based ... 
Preferential sampling and model checking in phylodynamic inference
Estimating population size fluctuations is one of the key tasks in ecology. Traditional sampling-based approaches to this task have limitations when populations of interest are extinct or are hard to reach, as is the case ... 
Probabilistic Population Projection for Countries with Generalized HIV/AIDS Epidemics
Population projection has long been an issue for researchers, governments and international organizations so that they can monitor and plan development and resources. The United Nations Population Division (UNPD) publishes ... 
Projection and Estimation of International Migration
I propose techniques for improving both estimation and projection of international migration. By applying a Bayesian hierarchical modeling approach to net migration data, I produce projections of international migration ... 
R-squared inference under non-normal error
Assessment of the relationship between diet and health status, especially the association between diet and chronic disease risk, has attracted a lot of research interest in statistical and epidemiologic studies. However, due to ... 
A resampling approach to clustering with confidence
(2012-09-13) We propose a method for estimating the number of groups in a data set. Our method is an extension of Generalized Single Linkage clustering (GSL) (Stuetzle and Nugent 2010), a nonparametric clustering method based on the ... 
Robust estimation of factor models in finance
(2005) Standard asset-pricing models entail expressions for expected returns in terms of coefficients relative to risk factors. Methods to estimate premiums of risk factors have, at their core, a single or multiple linear regression ... 
Scalable Manifold Learning and Related Topics
The subject of manifold learning is vast and still largely unexplored. As a subset of unsupervised learning it has a fundamental challenge in adequately defining the problem but whose solution is to an increasingly important ... 
Scalable Methods for the Inference of Identity by Descent
Identity by descent (IBD) describes the shared inheritance of DNA and underlies genetic similarity between individuals. Estimated IBD graphs describing the IBD relationships among individuals have many uses in statistical ... 
Shape-Constrained Inference for Concave-Transformed Densities and their Modes
(2013-11-14) We consider inference about functions estimated via shape constraints based on concavity. We consider log-concave densities and other “concave-transformed” densities on the real line, where a concave-transformed class is ... 
Space-Time Smoothing Models for Surveillance and Complex Survey Data
Area- and time-specific estimates of disease rates, cause-specific mortality rates and other key health indicators are of great interest for health care and policy purposes. Such estimates provide the information needed to ... 
Statistical Hurdle Models for Single Cell Gene Expression: Differential Expression and Graphical Modeling
This dissertation describes a set of statistical methods developed for analysis of single cell gene expression. A characteristic of single cell expression is bimodal expression, in which two clusters of expression are ... 
Statistical inference using Kronecker structured covariance
(2013-11-14) We present results for testing and estimation in the context of separable covariance models. We concentrate on two types of data: relational data and cross-classified data. Relational data is frequently represented by a ... 
Statistical Methods for Manifold Recovery and C^{1, 1} Regression on Manifolds
High-dimensional data sets often have lower-dimensional structure taking the form of a submanifold of a Euclidean space. It is challenging but necessary to develop statistical methods for these data sets that respect the ...