Abstract
In an effort to quantify and combat sexual assault, U.S. colleges and universities are required to disclose the number of reported sexual assaults on their campuses each year. However, many instances of sexual assault are never reported to authorities, and consequently, the number of reported assaults does not fully reflect the true total number of assaults that occurred; the reported values could arise from many combinations of reporting rate and true incidence. In this paper we estimate these underlying quantities via a hierarchical Bayesian model of the reported number of assaults. We use informative priors, based on national crime statistics, to act as a tiebreaker to help distinguish between reporting rates and incidence. We outline a Hamiltonian Monte Carlo (HMC) sampling scheme for posterior inference regarding reporting rates and assault incidence at each school and apply this method to campus sexual assault data from 2014–2019. Results suggest an increasing trend in reporting rates for the overall college population during this time. However, the extent of underreporting varies widely across schools. That variation has implications for how individual schools should interpret their reported crime statistics.
Funding Statement
The second author is supported by NSF IIS-2127869, NSF DMS-2311108, ONR N000142412243, and the Simons Foundation.
Acknowledgments
The authors would like to thank the referees, Associate Editor, and the Editor for their constructive comments and suggestions that improved the quality of this paper.
David M. Blei is also affiliated with the Department of Computer Science at Columbia University.
Citation
Casey Bradshaw. David M. Blei. "A Bayesian model of underreporting for sexual assault on college campuses." Ann. Appl. Stat. 18 (4) 3146 - 3164, December 2024. https://doi.org/10.1214/24-AOAS1928
Information