Taiwanese Journal of Mathematics

Compressed Hierarchical Schur Algorithm for Frequency-domain Analysis of Photonic Structures

Cheng-Han Du, Yih-Peng Chiou, and Weichung Wang

Full-text: Open access


Three-dimensional finite-difference frequency-domain analyses of partially periodic photonic structures result in large-scale ill-conditioned linear systems. Due to the lack of efficient preconditioner and reordering scheme, existed general-purpose iterative and direct solvers are inadequate to solve these linear systems in time or memory. We propose an efficient direct solver to tackle this problem. By exploring the physical properties, the coefficient matrix structure, and hardware computing efficiency, we extend the concepts of grid geometry manipulation and multi-level Schur method to propose the Compressed Hierarchical Schur algorithm (CHiS). The proposed CHiS algorithm can use less memory and remove redundant computational workloads due to the homogeneity and periodicity of photonic structures. Moreover, CHiS relies on dense BLAS3 operations of sub-matrices that can be computed efficiently with strong scalability by the latest multicore processors or accelerators. The implementation and benchmarks of CHiS demonstrate promising memory usage, timing, and scalability results. The feasibility of future hardware acceleration for CHiS is also addressed using computational data. This high-performance analysis tool can improve the design and modeling capability for various photonic structures.

Article information

Taiwanese J. Math., Volume 23, Number 2 (2019), 473-501.

Received: 7 May 2018
Revised: 24 November 2018
Accepted: 25 November 2018
First available in Project Euclid: 10 December 2018

Permanent link to this document

Digital Object Identifier

Mathematical Reviews number (MathSciNet)

Primary: 65F05: Direct methods for linear systems and matrix inversion 65Y05: Parallel computation 65Z05: Applications to physics

partially periodic photonic structures finite-difference frequency-domain method direct solver for ill-conditioned linear systems Schur complements BLAS3 operations multithreading parallelism


Du, Cheng-Han; Chiou, Yih-Peng; Wang, Weichung. Compressed Hierarchical Schur Algorithm for Frequency-domain Analysis of Photonic Structures. Taiwanese J. Math. 23 (2019), no. 2, 473--501. doi:10.11650/tjm/181113. https://projecteuclid.org/euclid.twjm/1544432420

Export citation


  • Y. Akahane, T. Asano, B.-S. Song and S. Noda, Erratum: High-Q photonic nanocavity in a two-dimensional photonic crystal, Nature 425 (2003), no. 6961, 944–947.
  • W. Bogaerts, M. Fiers and P. Dumon, Design challenges in silicon photonics, IEEE J. Sel. Top. Quantum Electron. 20 (2014), no. 4, 1–8.
  • J. H. Bramble and J. E. Pasciak, Analysis of a Cartesian PML approximation to the three dimensional electromagnetic wave scattering problem, Int. J. Numer. Anal. Model. 9 (2012), no. 3, 543–561.
  • W. C. Chew, J. M. Jin and E. Michielssen, Complex coordinate stretching as a generalized absorbing boundary condition, Microw. Opt. Techn. Lett. 15 (1997), no. 6, 363–369.
  • J. García, P. Sanchis, A. Martínez and J. Martí, 1D periodic structures for slow-wave induced non-linearity enhancement, Opt. Express 16 (2008), no. 5, 3146–3160.
  • P. Ghysels, X. S. Li, F.-H. Rouet, S. Williams and A. Napov, An efficient multicore implementation of a novel HSS-structured multifrontal solver using randomized sampling, SIAM J. Sci. Comput. 38 (2016), no. 5, S358–S384.
  • T.-M. Huang, H.-E. Hsieh, W.-W. Lin and W. Wang, Eigendecomposition of the discrete double-curl operator with application to fast eigensolver for three-dimensional photonic crystals, SIAM J. Matrix Anal. Appl. 34 (2013), no. 2, 369–391.
  • ––––, Eigenvalue solvers for three dimensional photonic crystals with face-centered cubic lattice, J. Comput. Appl. Math. 272 (2014), 350–361.
  • J. D. Joannopoulos, S. G. Johnson, J. N. Winn and R. D. Meade, Photonic Crystals: Molding the Flow of Light, Princeton University Press, Princeton, NJ, 2008.
  • S. Johnson and J. Joannopoulos, Block-iterative frequency-domain methods for Maxwell's equations in a planewave basis, Opt. Express 8 (2001), no. 3, 173–190.
  • S. G. Johnson, P. R. Villeneuve, S. Fan and J. D. Joannopoulos, Linear waveguides in photonic-crystal slabs, Phys. Rev. B 62 (2000), no. 12, 8212–8222.
  • G. Karypis and V. Kumar, A fast and high quality multilevel scheme for partitioning irregular graphs, SIAM J. Sci. Comput. 20 (1999), no. 1, 359–392.
  • Z. Li, Y. Saad, M. Sosonkina, pARMS: A parallel version of the algebraic recursive multilevel solver, Numer. Linear Algebra Appl. 10 (2003), no. 5-6, 485–509.
  • K. Nakajima, Parallel iterative solvers for ill-conditioned problems with heterogeneous material properties, Procedia Comput. Sci. 80 (2016), 1635–1645.
  • S. Operto, J. Virieux, P. Amestoy, J.-Y. L'Excellent, L. Giraud and H. Ben Hadj Ali, 3D finite-difference frequency-domain modeling of visco-acoustic wave propagation using a massively parallel direct solver: A feasibility study, Geophysics 72 (2007), no. 5, SM195–SM211.
  • F. Pellegrini and J. Roman, Scotch: A software package for static mapping by dual recursive bipartitioning of process and architecture graphs, in: Lecture Notes in Computer Science, 493–498, High-Performance Computing and Networking, 1996.
  • M. Qiu, M. Mulot, M. Swillo, S. Anand, B. Jaskorzynska, A. Karlsson, M. Kamp and A. Forchel, Photonic crystal optical filter based on contra-directional waveguide coupling, Appl. Phys. Lett. 83 (2003), no. 25, 5121–5123.
  • R. C. Rumpf, Simple implementation of arbitrarily shaped total-field/scattered-field regions in finite-difference frequency-domain, Progr. Electromagn. Res. B 36 (2012), 221–248.
  • Y. Saad and B. Suchomel, ARMS: an algebraic recursive multilevel solver for general sparse linear systems, Numer. Linear Algebra Appl. 9 (2002), no. 5, 359–378.
  • P. Sao, X. Liu, R. Vuduc and X. Li, A sparse direct solver for distributed memory Xeon Phi-accelerated systems, IEEE Int. Parallel Distrib. Proc. Symp., (2015), 71–81.
  • P. Sao, R. Vuduc and X. S. Li, A distributed CPU-GPU sparse direct solver, in: Lecture Notes in Computer Science, 487–498, Euro-Par 2014 Parallel Processing, 2014.
  • O. Schenk, M. Christen and H. Burkhart, Algorithmic performance studies on graphics processing units, J. Parallel Distrib. Comput. 68 (2008), no. 10, 1360–1369.
  • O. Schenk and K. Gärtner, Solving unsymmetric sparse systems of linear equations with PARDISO, Future Gener. Comput. Syst. 20 (2004), no. 3, 475–487.
  • P. G. Schmitz and L. Ying, A fast nested dissection solver for Cartesian 3D elliptic problems using hierarchical matrices, J. Comput. Phys. 258 (2014), 227–245.
  • S. Shi, C. Chen and D. W. Prather, Plane-wave expansion method for calculating band structure of photonic crystal slabs with perfectly matched layers, J. Opt. Soc. Amer. A 21 (2004), no. 9, 1769–1775.
  • W. Shin and S. Fan, Choice of the perfectly matched layer boundary condition for frequency-domain Maxwell's equations solvers, J. Comput. Phys. 231 (2012), no. 8, 3406–3431.
  • ––––, Accelerated solution of the frequency-domain Maxwell's equations by engineering the eigenvalue distribution of the operator, Opt. Express 21 (2013), no. 19, 22578–22595.
  • A. Taflove and S. C. Hagness, Computational Electrodynamics: the Finite-difference Time-domain Method, Artech House, Boston, MA, 2005.
  • P. Tsuji and L. Ying, A sweeping preconditioner for Yee's finite difference approximation of time-harmonic Maxwell's equations, Front. Math. China 7 (2012), no. 2, 347–363.
  • Y. A. Vlasov, M. O'Boyle, H. F. Hamann and S. J. McNab, Active control of slow light on a chip with photonic crystal waveguides, Nature 438 (2005), no. 7064, 65–69.
  • J. Vučković and Y. Yamamoto, Photonic crystal microcavities for cavity quantum electrodynamics with a single quantum dot, Appl. Phys. Lett. 82 (2003), no. 15, 2374–2376.
  • K. Yee, Numerical solution of initial boundary value problems involving Maxwell's equations in isotropic media, IEEE Trans. Antenn. Propag. 14 (1966), no. 3, 302–307.
  • C. D. Yu, W. Wang and D. Pierce, A CPU-GPU hybrid approach for the unsymmetric multifrontal method, Parallel Comput. 37 (2011), no. 12, 759–770.