Friday, April 5, 2019

Inference with Social Network Dependence

I'm running behind as usual. I meant to post this right after the seminar, about two weeks ago.  Really interesting stuff -- spatial correlation due to network dependence.  A Google search will find the associated paper(s) instantly. Again, really good stuff.  BUT I would humbly suggest that the biostat people need to read more econometrics. A good start is this survey (itself four years old, and distilled for practitioners as the basic insights were known/published decades ago). The cool question moving forward is whether/when/how network structure can be used to determine/inform clustering.

Elizabeth L. Ogburn
Department of Biostatistics
Johns Hopkins University

Social Network dependence,
the replication crisis, and (in)valid inference

In the first part of this talk, I will show that social network structure can result in a new kind of structural confounding, confounding by network structure, potentially contributing to replication crises across the health and social sciences.  Researchers in these fields frequently sample subjects from one or a small number of communities, schools, hospitals, etc., and while many of the limitations of such convenience samples are well-known, the issue of statistical dependence due to social network ties has not previously been addressed. A paradigmatic example of this is the Framingham Heart Study (FHS). Using a statistic that we adapted to measure network dependence, we test for network dependence and for possible confounding by network structure in several of the thousands of influential papers published using FHS data. Results suggest that some of the many decades of research on coronary heart disease, other health outcomes, and peer influence using FHS data may be biased (away from the null) and anticonservative due to unacknowledged network structure.

But data with network dependence abounds, and in many settings researchers are explicitly interested in learning about social network dynamics.  Therefore, there is high demand for methods for causal and statistical inference with social network data. In the second part of the talk, I will describe recent work on causal inference for observational data from a single social network, focusing on (1) new types of causal estimands that are of interest in social network settings, and (2) conditions under which central limit theorems hold and inference based on approximate normality is licensed.

No comments:

Post a Comment

Note: Only a member of this blog may post a comment.