Toeplitz matrix
In-game article clicks load inline without leaving the challenge.
In linear algebra, a Toeplitz matrix or diagonal-constant matrix, named after Otto Toeplitz, is a matrix in which each descending diagonal from left to right is constant. For instance, the following matrix is a Toeplitz matrix:
[ a b c d e f a b c d g f a b c h g f a b i h g f a ] . {\displaystyle \qquad {\begin{bmatrix}a&b&c&d&e\\f&a&b&c&d\\g&f&a&b&c\\h&g&f&a&b\\i&h&g&f&a\end{bmatrix}}.}
Any n × n {\displaystyle n\times n} matrix A {\displaystyle A} of the form
A = [ a 0 a − 1 a − 2 ⋯ ⋯ a − ( n − 1 ) a 1 a 0 a − 1 ⋱ ⋮ a 2 a 1 ⋱ ⋱ ⋱ ⋮ ⋮ ⋱ ⋱ ⋱ a − 1 a − 2 ⋮ ⋱ a 1 a 0 a − 1 a n − 1 ⋯ ⋯ a 2 a 1 a 0 ] {\displaystyle A={\begin{bmatrix}a_{0}&a_{-1}&a_{-2}&\cdots &\cdots &a_{-(n-1)}\\a_{1}&a_{0}&a_{-1}&\ddots &&\vdots \\a_{2}&a_{1}&\ddots &\ddots &\ddots &\vdots \\\vdots &\ddots &\ddots &\ddots &a_{-1}&a_{-2}\\\vdots &&\ddots &a_{1}&a_{0}&a_{-1}\\a_{n-1}&\cdots &\cdots &a_{2}&a_{1}&a_{0}\end{bmatrix}}}
is a Toeplitz matrix. If the i , j {\displaystyle i,j} element of A {\displaystyle A} is denoted A i , j {\displaystyle A_{i,j}} then we have
A i , j = A i + 1 , j + 1 = a i − j . {\displaystyle A_{i,j}=A_{i+1,j+1}=a_{i-j}.}
A Toeplitz matrix is not necessarily square.
Solving a Toeplitz system
A matrix equation of the form
A x = b {\displaystyle Ax=b}
is called a Toeplitz system if A {\displaystyle A} is a Toeplitz matrix. If A {\displaystyle A} is an n × n {\displaystyle n\times n} Toeplitz matrix, then the system has at most only 2 n − 1 {\displaystyle 2n-1} unique values, rather than n 2 {\displaystyle n^{2}}. We might therefore expect that the solution of a Toeplitz system would be easier, and indeed that is the case.
Toeplitz systems can be solved by algorithms such as the Schur algorithm or the Levinson algorithm in O ( n 2 ) {\displaystyle O(n^{2})} time. Variants of the latter have been shown to be weakly stable (i.e. they exhibit numerical stability for well-conditioned linear systems). The algorithms can also be used to find the determinant of a Toeplitz matrix in O ( n 2 ) {\displaystyle O(n^{2})} time.
A Toeplitz matrix can also be decomposed (i.e. factored) in O ( n 2 ) {\displaystyle O(n^{2})} time. The Bareiss algorithm for an LU decomposition is stable. An LU decomposition gives a quick method for solving a Toeplitz system, and also for computing the determinant. Using displacement rank we obtain method requiring O ~ ( α ω − 1 n ) {\displaystyle {\tilde {O}}({\alpha ^{\omega -1}}n)} ops with the use of fast matrix multiplication algorithms, where α {\displaystyle \alpha } is the rank and ∼ 2.37 ≤ ω < 3 {\displaystyle ^{\sim }2.37\leq \omega <3}.
Properties
- An n × n {\displaystyle n\times n} Toeplitz matrix may be defined as a matrix A {\displaystyle A} where A i , j = c i − j {\displaystyle A_{i,j}=c_{i-j}}, for constants c 1 − n , … , c n − 1 {\displaystyle c_{1-n},\ldots ,c_{n-1}}. The set of n × n {\displaystyle n\times n} Toeplitz matrices is a subspace of the vector space of n × n {\displaystyle n\times n} matrices (under matrix addition and scalar multiplication).
- Two Toeplitz matrices may be added in O ( n ) {\displaystyle O(n)} time (by storing only one value of each diagonal) and multiplied in O ( n 2 ) {\displaystyle O(n^{2})} time.
- Toeplitz matrices are persymmetric. Symmetric Toeplitz matrices are both centrosymmetric and bisymmetric.
- Toeplitz matrices are also closely connected with Fourier series, because the multiplication operator by a trigonometric polynomial, compressed to a finite-dimensional space, can be represented by such a matrix. Similarly, one can represent linear convolution as multiplication by a Toeplitz matrix.
- Toeplitz matrices are asymptotically equivalent to circulant matrices as the dimension grows, a result known as the Grenander–Szegő theorem. This asymptotic circulant property is the reason that the discrete Fourier transform approximately diagonalizes large Toeplitz matrices, and underlies the effectiveness of DFT-based spectral density estimation for stationary processes.
- Toeplitz matrices commute asymptotically. This means they diagonalize in the same basis when the row and column dimension tends to infinity.
- For symmetric Toeplitz matrices, there is the decomposition
1 a 0 A = G G T − ( G − I ) ( G − I ) T {\displaystyle {\frac {1}{a_{0}}}A=GG^{\operatorname {T} }-(G-I)(G-I)^{\operatorname {T} }}
where G {\displaystyle G} is the lower triangular part of 1 a 0 A {\displaystyle {\frac {1}{a_{0}}}A}.
- The inverse of a nonsingular symmetric Toeplitz matrix has the representation
A − 1 = 1 α 0 ( B B T − C C T ) {\displaystyle A^{-1}={\frac {1}{\alpha _{0}}}(BB^{\operatorname {T} }-CC^{\operatorname {T} })}
where B {\displaystyle B} and C {\displaystyle C} are lower triangular Toeplitz matrices and C {\displaystyle C} is a strictly lower triangular matrix.
Discrete convolution
The convolution operation can be constructed as a matrix multiplication, where one of the inputs is converted into a Toeplitz matrix. For example, the convolution of h {\displaystyle h} and x {\displaystyle x} can be formulated as:
y = h ∗ x = [ h 1 0 ⋯ 0 0 h 2 h 1 ⋮ ⋮ h 3 h 2 ⋯ 0 0 ⋮ h 3 ⋯ h 1 0 h m − 1 ⋮ ⋱ h 2 h 1 h m h m − 1 ⋮ h 2 0 h m ⋱ h m − 2 ⋮ 0 0 ⋯ h m − 1 h m − 2 ⋮ ⋮ h m h m − 1 0 0 0 ⋯ h m ] [ x 1 x 2 x 3 ⋮ x n ] {\displaystyle y=h\ast x={\begin{bmatrix}h_{1}&0&\cdots &0&0\\h_{2}&h_{1}&&\vdots &\vdots \\h_{3}&h_{2}&\cdots &0&0\\\vdots &h_{3}&\cdots &h_{1}&0\\h_{m-1}&\vdots &\ddots &h_{2}&h_{1}\\h_{m}&h_{m-1}&&\vdots &h_{2}\\0&h_{m}&\ddots &h_{m-2}&\vdots \\0&0&\cdots &h_{m-1}&h_{m-2}\\\vdots &\vdots &&h_{m}&h_{m-1}\\0&0&0&\cdots &h_{m}\end{bmatrix}}{\begin{bmatrix}x_{1}\\x_{2}\\x_{3}\\\vdots \\x_{n}\end{bmatrix}}}
y T = [ h 1 h 2 h 3 ⋯ h m − 1 h m ] [ x 1 x 2 x 3 ⋯ x n 0 0 0 ⋯ 0 0 x 1 x 2 x 3 ⋯ x n 0 0 ⋯ 0 0 0 x 1 x 2 x 3 … x n 0 ⋯ 0 ⋮ ⋮ ⋮ ⋮ ⋮ ⋮ ⋮ 0 ⋯ 0 0 x 1 ⋯ x n − 2 x n − 1 x n 0 0 ⋯ 0 0 0 x 1 ⋯ x n − 2 x n − 1 x n ] . {\displaystyle y^{T}={\begin{bmatrix}h_{1}&h_{2}&h_{3}&\cdots &h_{m-1}&h_{m}\end{bmatrix}}{\begin{bmatrix}x_{1}&x_{2}&x_{3}&\cdots &x_{n}&0&0&0&\cdots &0\\0&x_{1}&x_{2}&x_{3}&\cdots &x_{n}&0&0&\cdots &0\\0&0&x_{1}&x_{2}&x_{3}&\ldots &x_{n}&0&\cdots &0\\\vdots &&\vdots &\vdots &\vdots &&\vdots &\vdots &&\vdots \\0&\cdots &0&0&x_{1}&\cdots &x_{n-2}&x_{n-1}&x_{n}&0\\0&\cdots &0&0&0&x_{1}&\cdots &x_{n-2}&x_{n-1}&x_{n}\end{bmatrix}}.}
This approach can be extended to compute autocorrelation, cross-correlation, moving average etc.
Infinite Toeplitz matrix
A bi-infinite Toeplitz matrix (i.e. entries indexed by Z × Z {\displaystyle \mathbb {Z} \times \mathbb {Z} }) A {\displaystyle A} induces a linear operator on ℓ 2 {\displaystyle \ell ^{2}}.
A = [ ⋮ ⋮ ⋮ ⋮ ⋯ a 0 a − 1 a − 2 a − 3 ⋯ ⋯ a 1 a 0 a − 1 a − 2 ⋯ ⋯ a 2 a 1 a 0 a − 1 ⋯ ⋯ a 3 a 2 a 1 a 0 ⋯ ⋮ ⋮ ⋮ ⋮ ] . {\displaystyle A={\begin{bmatrix}&\vdots &\vdots &\vdots &\vdots \\\cdots &a_{0}&a_{-1}&a_{-2}&a_{-3}&\cdots \\\cdots &a_{1}&a_{0}&a_{-1}&a_{-2}&\cdots \\\cdots &a_{2}&a_{1}&a_{0}&a_{-1}&\cdots \\\cdots &a_{3}&a_{2}&a_{1}&a_{0}&\cdots \\&\vdots &\vdots &\vdots &\vdots \end{bmatrix}}.}
The induced operator is bounded if and only if the coefficients of the Toeplitz matrix A {\displaystyle A} are the Fourier coefficients of some essentially bounded function f {\displaystyle f}.
In such cases, f {\displaystyle f} is called the symbol of the Toeplitz matrix A {\displaystyle A}, and the spectral norm of the Toeplitz matrix A {\displaystyle A} coincides with the L ∞ {\displaystyle L^{\infty }} norm of its symbol. The proof can be found as Theorem 1.1 of Böttcher and Grudsky.
See also
- Circulant matrix, a square Toeplitz matrix with the additional property that a i = a i + n {\displaystyle a_{i}=a_{i+n}}
- Hankel matrix, an "upside down" (i.e., row-reversed) Toeplitz matrix
- Szegő limit theorems – Determinant of large Toeplitz matrices
- Toeplitz operator
Notes
- Bojanczyk, A. W.; Brent, R. P.; de Hoog, F. R.; Sweet, D. R. (1995), "On the stability of the Bareiss and related Toeplitz factorization algorithms", SIAM Journal on Matrix Analysis and Applications, 16: 40–57, arXiv:, doi:, S2CID
- Böttcher, Albrecht; Grudsky, Sergei M. (2012), , Birkhäuser, ISBN 978-3-0348-8395-5
- Brent, R. P. (1999), "Stability of fast algorithms for structured linear systems", in Kailath, T.; Sayed, A. H. (eds.), Fast Reliable Algorithms for Matrices with Structure, SIAM, pp. 103–116, arXiv:, doi:, hdl:, ISBN 978-0-89871-431-9, S2CID
- Chan, R. H.-F.; Jin, X.-Q. (2007), An Introduction to Iterative Toeplitz Solvers, SIAM, doi:, ISBN 978-0-89871-636-8
- Chandrasekeran, S.; Gu, M.; Sun, X.; Xia, J.; Zhu, J. (2007), "A superfast algorithm for Toeplitz systems of linear equations", SIAM Journal on Matrix Analysis and Applications, 29 (4): 1247–66, CiteSeerX , doi:
- Chen, W. W.; Hurvich, C. M.; Lu, Y. (2006), "On the correlation matrix of the discrete Fourier transform and the fast solution of large Toeplitz systems for long-memory time series", Journal of the American Statistical Association, 101 (474): 812–822, CiteSeerX , doi:, S2CID
- Hayes, Monson H. (1996), Statistical digital signal processing and modeling, Wiley, ISBN 0-471-59431-8
- Krishna, H.; Wang, Y. (1993), , SIAM Journal on Numerical Analysis, 30 (5): 1498–1508, doi:
- Monahan, J. F. (2011), Numerical Methods of Statistics, Cambridge University Press, doi:, ISBN 978-1-139-08211-2
- Mukherjee, Bishwa Nath; Maiti, Sadhan Samar (1988), (PDF), Linear Algebra and Its Applications, 102: 211–240, doi:
- Press, W. H.; Teukolsky, S. A.; Vetterling, W. T.; Flannery, B. P. (2007), Numerical Recipes: The Art of Scientific Computing (3rd ed.), Cambridge University Press, ISBN 978-0-521-88068-8
- Stewart, M. (2003), "A superfast Toeplitz solver with improved numerical stability", SIAM Journal on Matrix Analysis and Applications, 25 (3): 669–693, doi:, S2CID
- Yang, Zai; Xie, Lihua; Stoica, Petre (2016), "Vandermonde decomposition of multilevel Toeplitz matrices with application to multidimensional super-resolution", IEEE Transactions on Information Theory, 62 (6): 3685–3701, arXiv:, doi:, S2CID
Further reading
- Bareiss, E. H. (1969), "Numerical solution of linear equations with Toeplitz and vector Toeplitz matrices", Numerische Mathematik, 13 (5): 404–424, doi:, S2CID
- Goldreich, O.; Tal, A. (2018), "Matrix rigidity of random Toeplitz matrices", Computational Complexity, 27 (2): 305–350, doi:, S2CID
- Golub, G. H.; van Loan, C. F. (1996), Matrix Computations, Johns Hopkins University Press, §4.7—Toeplitz and Related Systems, ISBN 0-8018-5413-X, OCLC
- Gray, R. M. (2005), (PDF), Foundations and Trends in Communications and Information Theory, 2 (3), Now Publishers: 155–239, doi:
- Noor, F.; Morgera, S. D. (1992), "Construction of a Hermitian Toeplitz matrix from an arbitrary set of eigenvalues", IEEE Transactions on Signal Processing, 40 (8): 2093–4, Bibcode:, doi:
- Pan, Victor Y. (2001), Structured Matrices and Polynomials: unified superfast algorithms, Birkhäuser, ISBN 978-0817642402
- Ye, Ke; Lim, Lek-Heng (2016), "Every matrix is a product of Toeplitz matrices", Foundations of Computational Mathematics, 16 (3): 577–598, arXiv:, doi:, S2CID