Browsing Statistics by Title
Now showing items 1635 of 77

Degeneracy, Duration, and Coevolution: Extending Exponential Random Graph Models (ERGM) for Social Network Analysis
We address three aspects of statistical methodology in the application of Exponential family Random Graphs to modeling social network processes. The first is the topic of model degeneracy in ERGMs. We show this is a ... 
Detecting and extracting complex patterns from images and realizations of spatial point processes
(2000)A common goal in the field of Computer Vision is the detection and extraction of patterns (e.g. lines, object boundaries) from binary image data . These images routinely occur as the product of edge detection algorithms, ... 
Discovering Interactions in Multivariate Time Series
In large collections of multivariate time series it is of interest to determine interactions between each pair of time series. Classically, interactions between time series have been studied using linear vector autoregressive ... 
DiscreteTime Threshold Regression for Survival Data with TimeDependent Covariates
A natural approach to survival analysis in many settings is to model the subject's ``health'' status as a latent stochastic process, where the terminal event is represented by the first time that the process crosses a ... 
Estimating Population Size Using the Network Scale Up Method
(20130725)We develop methods for estimating hardtoreach populations from data collected using networkbased questions on standard surveys. Such data arise by asking respondents how many people they know in a specific group (e.g. ... 
Estimation and Inference in Changepoint Models
This thesis is motivated by statistical challenges that arise in the analysis of calcium imaging data, a new technology in neuroscience that makes it possible to record from huge numbers of neurons at singleneuron resolution. ... 
Estimation and Testing Following Model Selection
The field of postselection inference focuses on developing solutions for problems in which a researcher uses a single dataset to both identify a promising set of hypotheses and conduct statistical inference. One promising ... 
Estimation and testing under shape constraints
This thesis consists of three projects, the common thread to all of which is using shaperestricted densities in inference problems. In the first project, we revisit the problem of estimating the center of symmetry of ... 
Finite Population Inference for Causal Parameters
Randomized experiments are often employed to determine whether a treatment X has a causal effect on an outcome Y. Under the NeymanRubin causal model with binary X and Y, each patient is characterized by two binary potential ... 
Finite Sampling Exponential Bounds
This dissertation develops new exponential bounds for the tail of the hypergeometric distribution. It is organized as follows. In Chapter 1, it reviews existing exponential bounds used to control the hypergeometric tail. ... 
Fitting Stochastic Epidemic Models to Multiple Data Types
Traditional infectious disease epidemiology focuses on fitting deterministic and stochastic epidemics models to surveillance case count data. Recently, researchers began to make use of infectious disease agent genetic data ... 
Functional Quantitative Genetics and the Missing Heritability Problem
In classical quantitative genetics, the correlation between the phenotypes of individuals with unknown genotypes and a known pedigree relationship is expressed in terms of probabilities of IBD states. In existing models ... 
Generalization of boosting algorithms and applications of Bayesian inference for massive datasets
(1999)In recent years statisticians, computational learning theorists, and engineers have developed more advance techniques to learn complex nonlinear relationships from datasets. However, not only have models increased in ... 
Generalized linear mixed models: development and comparison of different estimation methods
(2002)The use of generalized linear mixed models is growing in popularity in the modelling of correlated data. To date, methods available are either computationally intensive or asymptotically biased. The following work examines ... 
Genetic restoration on complex pedigrees
(1990)Analyses of genetic data observed on groups of related individuals frequently require the computation of probabilities on pedigrees. Existing methods are computationally intensive and can be infeasible on large and complex ... 
Gravimetric Anomaly Detection using Compressed Sensing
We address the problem of identifying underground anomalies (e.g. holes) based on gravity measurements. This is a theoretically wellstudied yet difficult problem. In all except a few special cases, the inverse problem has ... 
Inference for HighDimensional Instrumental Variables Regression
This thesis concerns statistical inference for the components of a highdimensional regression parameter despite possible endogeneity of each regressor. Given a firststage linear model for the endogenous regressors and a ... 
Inferring Network Structure From Partially Observed Graphs
Collecting social network data is notoriously difficult, meaning that indirectly observed or missing observations are very common. In this dissertation, We address two of such scenarios: inference on network measures without ... 
LargeScale B Cell Receptor Sequence Analysis Using Phylogenetics and Machine Learning
The adaptive immune system synthesizes antibodies, the soluble form of B cell receptors (BCRs), to bind to and neutralize pathogens that enter our body. B cells are able to generate a diverse set of high affinity antibodies ... 
Latent models for crosscovariance
(2001)Crosscovariance problems arise in the analysis of multivariate data that can be divided naturally into two blocks of variables, X and Y, observed on the same units. In a crosscovariance problem we are interested, not in ...