Statistical Science

Highly Structured Models for Spectral Analysis in High-Energy Astrophysics

David A. van Dyk and Hosung Kang

Source: Statist. Sci. Volume 19, Number 2 (2004), 275-293.

Abstract

The Chandra X-Ray Observatory, launched by the space shuttle Columbia in July 1999, has taken its place with the Hubble Space Telescope, the Compton Gamma Ray Observatory and the Spitzer Infrared Space Telescope in NASA’s fleet of state of the art space-based Great Observatories. As the world’s premier X-ray observatory, Chandra gives astronomers a powerful tool to investigate black holes, exploding stars and colliding galaxies in the hot turbulent regions of the universe. Chandra uses four pairs of ultra-smooth high-resolution mirrors and efficient X-ray photon counters to produce images at least 30 times sharper than any previous X-ray telescope. Unlocking the information in these images, however, requires subtle statistical analysis; currently popular statistical methods typically involve Gaussian approximations (e.g., minimum χ2 fitting), which are not justifiable for the high-resolution low-count data. In this article, we employ modern Bayesian computational techniques (e.g., expectation–maximization-type algorithms, the Gibbs sampler and Metropolis–Hastings) to fit new highly structured models that account for the Poisson nature of photon counts, background contamination, image blurring due to instrumental constraints, photon absorption, photon pileup and source features such as spectral emission lines and absorption features. This application demonstrates the flexibility and power of modern Bayesian methodology and algorithms to handle highly structured models that are convolved with complex data collection mechanisms involving nonignorable missing data.

Keywords: Astrostatistics; Bayesian methods; the Chandra X-Ray Observatory; data augmentation; EM algorithm; Markov chain Monte Carlo; missing data; Poisson model; posterior predictive checks; nonignorable missing data; spectral analysis; pileup

Full-text: Open access

Links and Identifiers

Permanent link to this document: http://projecteuclid.org/euclid.ss/1105714163
Digital Object Identifier: doi:10.1214/088342304000000314
Mathematical Reviews number (MathSciNet): MR2140542
Zentralblatt MATH identifier: 1100.62637

References

Ballet, J. (1999). Pile-up on X-ray CCD instruments. Astronomy and Astrophysics Suppl. Ser. 135 371--381.
Bayarri, M. J. and Berger, J. O. (1999). Quantifying surprise in the data, and model verification. In Bayesian Statistics 6 (J. M. Bernardo, J. O. Berger, A. P. Dawid and A. F. M. Smith, eds.) 53--82. Oxford Univ. Press, London.
Mathematical Reviews (MathSciNet): MR1723493
Zentralblatt MATH: 0974.62021
Cash, W. (1979). Parameter estimation in astronomy through application of the likelihood ratio. Astrophysical J. 228 939--947.
Davis, J. E. (2001). Event pileup in charge-coupled devices. Astrophysical J. 562 575--582.
Dempster, A. P., Laird, N. M. and Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. Roy. Statist. Soc. Ser. B 39 1--38.
Mathematical Reviews (MathSciNet): MR501537
Esch, D. N. (2003). Extensions and applications of three statistical models. Ph.D. dissertation, Dept. Statistics, Harvard Univ.
Esch, D. N., Connors, A., Karovska, M. and van Dyk, D. A. (2004). An image restoration technique with error estimates. Astrophysical J. 610 1213--1227.
Feigelson, E. D. and Babu, G. J., eds. (2003). Statistical Challenges in Modern Astronomy III. Springer, New York.
Fessler, J. A. and Hero, A. O. (1994). Space-alternating generalized expectation--maximization algorithm. IEEE Trans. Signal Process. 42 2664--2677.
Freeman, P., Graziani, C., Lamb, D., Loredo, T., Fenimore, E., Murakami, T. and Yoshida, A. (1999). Statistical analysis of spectral line candidates in gamma-ray burst GRB 870303. Astrophysical J. 524 753--771.
Gelfand, A. E. and Smith, A. F. M. (1990). Sampling-based approaches to calculating marginal densities. J. Amer. Statist. Assoc. 85 398--409.
Mathematical Reviews (MathSciNet): MR1141740
Digital Object Identifier: doi:10.2307/2289776
Gelman, A., Meng, X.-L. and Stern, H. (1996). Posterior predictive assessment of model fitness via realized discrepancies (with discussion). Statist. Sinica 6 733--807.
Mathematical Reviews (MathSciNet): MR1422404
Gelman, A. and Rubin, D. B. (1992). Inference from iterative simulation using multiple sequences (with discussion). Statist. Sci. 7 457--472.
Mathematical Reviews (MathSciNet): MR1196276
Zentralblatt MATH: 0767.62021
Hans, C. M. and van Dyk, D. A. (2003). Accounting for absorption lines in high energy spectra. In Statistical Challenges in Modern Astronomy III (E. Feigelson and G. Babu, eds.) 429--430. Springer, New York.
Kang, H., van Dyk, D. A., Yu, Y., Siemiginowska, A., Connors, A. and Kashyap, V. (2003). New MCMC methods to address pile-up in the Chandra X-ray observatory. In Statistical Challenges in Modern Astronomy III (E. Feigelson and G. Babu, eds.) 449--450. Springer, New York.
Lampton, M., Margon, B. and Bowyer, S. (1976). Parameter estimation in X-ray astronomy. Astrophysical J. 208 177--190.
Lange, K. and Carson, R. (1984). EM reconstruction algorithms for emission and transmission tomography. J. Computer Assisted Tomography 8 306--316.
Lucy, L. B. (1974). An iterative technique for the rectification of observed distributions. Astronomical J. 79 745--754.
Meng, X.-L. (1994). Posterior predictive $p$-values. Ann. Statist. 22 1142--1160.
Mathematical Reviews (MathSciNet): MR1311969
Digital Object Identifier: doi:10.1214/aos/1176325622
Project Euclid: euclid.aos/1176325622
Meng, X.-L. and van Dyk, D. A. (1997). The EM algorithm---An old folk song sung to a fast new tune (with discussion). J. Roy. Statist. Soc. Ser. B 59 511--567.
Mathematical Reviews (MathSciNet): MR1452025
Digital Object Identifier: doi:10.1111/1467-9868.00082
Morrison, R. and McCammon, D. (1983). Interstellar photoelectric absorption cross sections, 0.03--10 keV. Astrophysical J. 270 119--122.
Protassov, R., van Dyk, D. A., Connors, A., Kashyap, V. and Siemiginowska, A. (2002). Statistics: Handle with care---Detecting multiple model components with the likelihood ratio test. Astrophysical J. 571 545--559.
Richards, W. H. (1972). Bayesian-based iterative method of image restoration. J. Opt. Soc. Amer. 62 55--59.
Rubin, D. B. (1984). Bayesianly justifiable and relevant frequency calculations for the applied statistician. Ann. Statist. 12 1151--1172.
Mathematical Reviews (MathSciNet): MR760681
Digital Object Identifier: doi:10.1214/aos/1176346785
Project Euclid: euclid.aos/1176346785
Shepp, L. A. and Vardi, Y. (1982). Maximum likelihood reconstruction for emission tomography. IEEE Trans. on Medical Imaging 1 113--122.
Siemiginowska, A., Elivis, M., Alanna, C., Freeman, P., Kashyap, V. and Feigelson, E. (1997). AXAF data analysis challanges. In Statistical Challenges in Modern Astronomy II (G. Babu and E. Feigelson, eds.) 241--258. Springer, New York.
Smith, A. F. M. and Roberts, G. O. (1993). Bayesian computation via the Gibbs sampler and related Markov chain Monte Carlo methods. J. Roy. Statist. Soc. Ser. B 55 3--23.
Mathematical Reviews (MathSciNet): MR1210421
Sourlas, N., van Dyk, D. A., Kashyap, V., Drake, J. and Pease, D. (2003). Bayesian spectral analysis of ``MAD'' stars. In Statistical Challenges in Modern Astronomy III (E. Feigelson and G. Babu, eds.) 489--490. Springer, New York.
Tanner, M. A. and Wong, W. H. (1987). The calculation of posterior distributions by data augmentation (with discussion). J. Amer. Statist. Assoc. 82 528--550.
Mathematical Reviews (MathSciNet): MR898357
Digital Object Identifier: doi:10.2307/2289457
Titterington, D. M., Smith, A. F. M. and Makov, U. E. (1985). Statistical Analysis of Finite Mixture Distributions. Wiley, New York.
Mathematical Reviews (MathSciNet): MR838090
Zentralblatt MATH: 0646.62013
van Dyk, D. A. (2000a). Fast new EM-type algorithms with applications in astrophysics. Technical report.
van Dyk, D. A. (2000b). Nesting EM algorithms for computational efficiency. Statist. Sinica 10 203--225.
Mathematical Reviews (MathSciNet): MR1742109
van Dyk, D. A. (2003). Hierarchical models, data augmentation, and Markov chain Monte Carlo (with discussion). In Statistical Challenges in Modern Astronomy III (E. Feigelson and G. Babu, eds.) 41--56. Springer, New York.
van Dyk, D. A. and Hans, C. M. (2002). Accounting for absorption lines in images obtained with the Chandra X-ray Observatory. In Spatial Cluster Modelling (A. Lawson and D. Denison, eds.) 175--198. CRC Press, London.
Mathematical Reviews (MathSciNet): MR2015037
van Dyk, D. A., Connors, A., Kashyap, V. and Siemiginowska, A. (2001). Analysis of energy spectra with low photon counts via Bayesian posterior simulation. Astrophysical J. 548 224--243.
van Dyk, D. A., Connors, A., Esch, D. N., Freeman, P., Kang, H., Karovska, M., Kashyap, V., Siemiginowska, A. and Zezas, A. (2004). Deconvolution in high energy astrophysics: Science, instrumentation, and methods (with discussion). Bayesian Analysis. To appear.
van Dyk, D. A. and Meng, X.-L. (2000). Algorithms based on data augmentation. In Computing Science and Statistics: Proc. 31st Symposium on the Interface (M. Pourahmadi and K. Berk, eds.) 230--239. Interface Foundation of North America, Fairfax Station, VA.
van Dyk, D. A. and Meng, X.-L. (2001). The art of data augmentation (with discussion). J. Comput. Graph. Statist. 10 1--111.
Mathematical Reviews (MathSciNet): MR1936358
Digital Object Identifier: doi:10.1198/10618600152418584

2009 © Institute of Mathematical Statistics