Bootstrap Quantification of Estimation Uncertainties in Network Degree Distributions

Date

2018-08-20

ORCID

Journal Title

Journal ISSN

Volume Title

Publisher

Springer Nature

item.page.doi

Abstract

We propose a new method of nonparametric bootstrap to quantify estimation uncertainties in functions of network degree distribution in large ultra sparse networks. Both network degree distribution and network order are assumed to be unknown. The key idea is based on adaptation of the "blocking" argument, developed for bootstrapping of time series and re-tiling of spatial data, to random networks. We first sample a set of multiple ego networks of varying orders that form a patch, or a network block analogue, and then resample the data within patches. To select an optimal patch size, we develop a new computationally efficient and data-driven cross-validation algorithm. The proposed fast patchwork bootstrap (FPB) methodology further extends the ideas for a case of network mean degree, to inference on a degree distribution. In addition, the FPB is substantially less computationally expensive, requires less information on a graph, and is free from nuisance parameters. In our simulation study, we show that the new bootstrap method outperforms competing approaches by providing sharper and better-calibrated confidence intervals for functions of a network degree distribution than other available approaches, including the cases of networks in an ultra sparse regime. We illustrate the FPB in application to collaboration networks in statistics and computer science and to Wikipedia networks.

Description

Includes supplementary material

Keywords

Nonparametric statistics, Bootstrap (Statistics), Confidence intervals, Computer networks, Sampling (Statistics)

item.page.sponsorship

NSF grant nos. (IIS 1633331 and IIS 1633355); EPSRC grant EP/K032208/1.

Rights

CC BY 4.0 (Attribution), ©2017 The Authors

Citation

Collections