Browsing Statistics by Title
Now showing items 1433 of 65

Degeneracy, Duration, and Coevolution: Extending Exponential Random Graph Models (ERGM) for Social Network Analysis
We address three aspects of statistical methodology in the application of Exponential family Random Graphs to modeling social network processes. The first is the topic of model degeneracy in ERGMs. We show this is a ... 
Detecting and extracting complex patterns from images and realizations of spatial point processes
(2000)A common goal in the field of Computer Vision is the detection and extraction of patterns (e.g. lines, object boundaries) from binary image data . These images routinely occur as the product of edge detection algorithms, ... 
Discovering Interactions in Multivariate Time Series
In large collections of multivariate time series it is of interest to determine interactions between each pair of time series. Classically, interactions between time series have been studied using linear vector autoregressive ... 
DiscreteTime Threshold Regression for Survival Data with TimeDependent Covariates
A natural approach to survival analysis in many settings is to model the subject's ``health'' status as a latent stochastic process, where the terminal event is represented by the first time that the process crosses a ... 
Estimating Population Size Using the Network Scale Up Method
(20130725)We develop methods for estimating hardtoreach populations from data collected using networkbased questions on standard surveys. Such data arise by asking respondents how many people they know in a specific group (e.g. ... 
Estimation and Testing Following Model Selection
The field of postselection inference focuses on developing solutions for problems in which a researcher uses a single dataset to both identify a promising set of hypotheses and conduct statistical inference. One promising ... 
Finite Population Inference for Causal Parameters
Randomized experiments are often employed to determine whether a treatment X has a causal effect on an outcome Y. Under the NeymanRubin causal model with binary X and Y, each patient is characterized by two binary potential ... 
Finite Sampling Exponential Bounds
This dissertation develops new exponential bounds for the tail of the hypergeometric distribution. It is organized as follows. In Chapter 1, it reviews existing exponential bounds used to control the hypergeometric tail. ... 
Functional Quantitative Genetics and the Missing Heritability Problem
In classical quantitative genetics, the correlation between the phenotypes of individuals with unknown genotypes and a known pedigree relationship is expressed in terms of probabilities of IBD states. In existing models ... 
Generalization of boosting algorithms and applications of Bayesian inference for massive datasets
(1999)In recent years statisticians, computational learning theorists, and engineers have developed more advance techniques to learn complex nonlinear relationships from datasets. However, not only have models increased in ... 
Generalized linear mixed models: development and comparison of different estimation methods
(2002)The use of generalized linear mixed models is growing in popularity in the modelling of correlated data. To date, methods available are either computationally intensive or asymptotically biased. The following work examines ... 
Genetic restoration on complex pedigrees
(1990)Analyses of genetic data observed on groups of related individuals frequently require the computation of probabilities on pedigrees. Existing methods are computationally intensive and can be infeasible on large and complex ... 
Gravimetric Anomaly Detection using Compressed Sensing
We address the problem of identifying underground anomalies (e.g. holes) based on gravity measurements. This is a theoretically wellstudied yet difficult problem. In all except a few special cases, the inverse problem has ... 
Inference for HighDimensional Instrumental Variables Regression
This thesis concerns statistical inference for the components of a highdimensional regression parameter despite possible endogeneity of each regressor. Given a firststage linear model for the endogenous regressors and a ... 
Latent models for crosscovariance
(2001)Crosscovariance problems arise in the analysis of multivariate data that can be divided naturally into two blocks of variables, X and Y, observed on the same units. In a crosscovariance problem we are interested, not in ... 
Learning and Manifolds: Leveraging the Intrinsic Geometry
(20130723)In this work, we explore and exploit the use of differential operators on manifolds  the LaplaceBeltrami operator in particular  in learning tasks. In particular, we are interested in uncovering the geometric structure ... 
The Likelihood Pivot: Performing Inference with Confidence
Maximum likelihood estimation is a popular statistical method. To account for possible model misspecification, the sandwich estimate of variance can be used to generate asymptotically correct confidence intervals. Several ... 
LikelihoodBased Inference for Partially Observed MultiType Markov Branching Processes
Markov branching processes are a class of continuoustime Markov chains (CTMCs) frequently used in stochastic modeling with ubiquitous applications. Bivariate or multitype processes are necessary to model phenomena such ... 
Linear Structural Equation Models with NonGaussian Errors: Estimation and Discovery
Linear structural equation models (SEMs) are multivariate models which encode direct causal effects. We focus on SEMs in which unobserved latent variables have been marginalized and only observed variables are explicitly ... 
Lord's Paradox and Targeted Interventions: The Case of Special Education
Lord (1967) describes a hypothetical “paradox” in which two statisticians, analyzing the same dataset using different but defensible methods, come to very different conclusions about the effects of an intervention on student ...