Spatial Autocorrelation for Massive Spatial Data: Verification of Efficiency and Statistical Power Asymptotics

dc.contributor.ORCID: 0000-0001-5125-6450 (Griffith, DA)
dc.contributor.VIAF: 14855602 (Griffith, DA)
dc.contributor.author: Luo, Q.
dc.contributor.author: Griffith, Daniel A.
dc.contributor.author: Wu, H.
dc.contributor.utdAuthor: Griffith, Daniel A.
dc.date.accessioned: 2019-11-18T21:33:33Z
dc.date.available: 2019-11-18T21:33:33Z
dc.date.created: 2019-03-25
dc.description: Due to copyright restrictions, full text access from Treasures at UT Dallas is restricted to current UTD affiliates (use the provided Link to Article).
dc.description.abstract: Spatial data containing massive numbers of observations have become a hot topic in recent years, and many studies have been conducted with them. Because initial developments for classical spatial autocorrelation statistics are based on rather small sample sizes, this paper presents, in the context of massive spatial datasets, extensions to efficiency and statistical power comparisons between the Moran coefficient and the Geary ratio for different variable distribution assumptions and selected geographic neighborhood definitions. The question addressed is whether or not earlier results for small n extend to large and massively large n, especially for non-normal variables; the implications established are relevant to big spatial data. To achieve these comparisons, this paper summarizes proofs of limiting variances, also called asymptotic variances, for the efficiency analysis, and derives the relationship function between the two statistics to compare their statistical power at the same scale. Visualization of this statistical power analysis employs an alternative technique that already appears in the literature, furnishing additional understanding and clarity about these spatial autocorrelation statistics. Results include the following: the Moran coefficient is more efficient than the Geary ratio for most surface partitionings, because this index has a relatively smaller asymptotic as well as exact variance; and the superior power of the Moran coefficient vis-à-vis the Geary ratio for positive spatial autocorrelation depends upon the type of geographic configuration, with this power approaching one as sample sizes become increasingly large. Because spatial analysts usually calculate these two statistics for interval/ratio data, this paper also includes comments about the join count statistics used for nominal data. ©2019 Springer-Verlag GmbH Germany, part of Springer Nature.
dc.description.department: School of Economic, Political and Policy Studies
dc.description.sponsorship: Funding was provided by The National Key Research and Development Program of China (Grant No. 2017YFB0503802) and China Scholarship Council (Grant No. 201406270075).
dc.identifier.bibliographicCitation: Luo, Q., D. A. Griffith, and H. Wu. 2019. "Spatial autocorrelation for massive spatial data: verification of efficiency and statistical power asymptotics." Journal of Geographical Systems 21: art. 237, doi: 10.1007/s10109-019-00293-3
dc.identifier.issn: 1435-5930
dc.identifier.uri: https://hdl.handle.net/10735.1/7112
dc.identifier.volume: 21
dc.language.iso: en
dc.publisher: Springer Verlag
dc.relation.uri: https://dx.doi.org/10.1007/s10109-019-00293-3
dc.rights: ©2019 Springer-Verlag GmbH Germany, part of Springer Nature
dc.source.journal: Journal of Geographical Systems
dc.subject: Spatial data mining
dc.subject: Coefficients, Moran
dc.subject: Analysis of variance, Asymptotic
dc.subject: Statistics, Spatial autocorrelation
dc.subject: Statistics, Join count
dc.title: Spatial Autocorrelation for Massive Spatial Data: Verification of Efficiency and Statistical Power Asymptotics
dc.type.genre: article

Files

Original bundle

Name: EPPS-3736-260419.12-LINK.pdf
Size: 164.53 KB
Format: Adobe Portable Document Format
Description: Link to Article