Browsing Statistics by Title
Now showing items 4052 of 52

A resampling approach to clustering with confidence
(20120913)We propose a method for estimating the number of groups in a data set. Our method is an extension of Generalized Single Linkage clustering (GSL) (Stuetzle and Nugent 2010), a nonparametric clustering method based on the ... 
Robust estimation of factor models in finance
(2005)Standard assetpricing models entail expressions for expected returns in terms of coefficients relative to risk factors. Methods to estimate premiums of risk factors have, at its core, a single or multiple linear regression ... 
Scalable Manifold Learning and Related Topics
The subject of manifold learning is vast and still largely unexplored. As a subset of unsupervised learning it has a fundamental challenge in adequately defining the problem but whose solution is to an increasingly important ... 
Scalable Methods for the Inference of Identity by Descent
Identity by descent (IBD) describes the shared inheritance of DNA and underlies genetic similarity between individuals. Estimated IBD graphs describing the IBD relationships among individuals have many uses in statistical ... 
ShapeConstrained Inference for ConcaveTransformed Densities and their Modes
(20131114)We consider inference about functions estimated via shape constraints based on concavity. We consider logconcave densities and other “concavetransformed” densities on the real line, where a concavetransformed class is ... 
SpaceTime Smoothing Models for Surveillance and Complex Survey Data
Area and timespecific estimates of disease rates, causespecific mortality rates and other key health indicators are of great interest for health care and policy purposes. Such estimates provide the information needed to ... 
Statistical Hurdle Models for Single Cell Gene Expression: Differential Expression and Graphical Modeling
This dissertation describes a set of statistical methods developed for analysis of single cell gene expression. A characteristic of single cell expression is bimodal expression, in which two clusters of expression are ... 
Statistical inference using Kronecker structured covariance
(20131114)We present results for testing and estimation in the context of separable covariance models. We concentrate on two types of data: relational data and crossclassified data. Relational data is frequently represented by a ... 
Testing Independence in High Dimensions & Identifiability of Graphical Models
In this thesis two problems in multivariate statistics will be studied. In the first chaper, we treat the problem of testing independence between m continuous observations when m can be larger than the available sample ... 
Tests for Differences between Least Squares and Robust Regression Parameter Estimates and Related Topics
(20130417)At the present time there is no well accepted test for comparing least squares and robust linear regression coefficient estimates. To fill this gap we propose and demonstrate the efficacy of two Waldlike statistical tests ... 
Theory and Methods for Tensor Data
We present novel methods and new theory in the statistical analysis of tensorvalued data. A tensor is a multidimensional array. When data come in the form of a tensor, special methods and models are required to capture ... 
Topics in Graph Clustering
In this thesis, two problems in social networks will be studied. In the first part of the thesis, we focus on community recovery problems for social networks. There have been many recent theoretical advances in the modelbased ... 
The weighted likelihood bootstrap and an algorithm for prepivoting
(1991)The method of bootstrapping, which has transformed the theory and practice of frequentist statistical inference, is applicable within the Bayesian paradigm. Rather than simulating data that might have been observed, this ...