Unilateral shift operator
In-game article clicks load inline without leaving the challenge.
In operator theory, the unilateral shift is a one-sided shift operator, that is, a shift operator acting on one-sided sequences or shift spaces. The term "operator" is used to draw contrast to finite-dimensional shift matrices. The term "unilateral" draws a distinction to the bilateral shift operator, of which the Baker's map is an example.
Shift operators are commonly studied in the context of measure-preserving dynamical systems. In such general settings, the unilateral shift operator is usually called the transfer operator or the Frobenius-Peron operator; it's inverse is the Koopman operator. The properties of shift operators depend very strongly on the topology of the spaces on which they act; for example, the Bernoulli shift famously has a discrete spectrum given by the Bernoulli polynomials when acting on the space of bounded smooth functions on the unit interval, but has a continuous spectrum (on the unit disk), when acting on the Hilbert space of square-integrable functions. When acting on a measure space, the eigenfunctions of shift operators are characteristically fractal in shape, often differentiable-nowhere or even continuous-nowhere. Eigenvalues on the unit circle are associated with unitary time evolution, while those inside the unit disk are conventionally identified with decaying modes in statistical systems. In quantum mechanics, the prototypical unilateral shift operator is the annihilation operator of the quantum harmonic oscillator; it's eigenfunctions correspond to coherent states.
This article deals primarily with unilateral shifts acting on Hilbert space, specifically in two representations: as an operator on the sequence space ℓ 2 {\displaystyle \ell ^{2}}, or as a multiplication operator on a Hardy space. Its properties, particularly its invariant subspaces, are well-understood and serve as a model for more general theories.
Definition
Let ℓ 2 {\displaystyle \ell ^{2}} be the Hilbert space of square-summable sequences of complex numbers, i.e., ℓ 2 = { ( a 0 , a 1 , a 2 , … ) : a n ∈ C and ∑ n = 0 ∞ | a n | 2 < ∞ } {\displaystyle \ell ^{2}=\left\{(a_{0},a_{1},a_{2},\dots ):a_{n}\in \mathbb {C} {\text{ and }}\sum _{n=0}^{\infty }|a_{n}|^{2}<\infty \right\}}The unilateral shift is the linear operator S : ℓ 2 → ℓ 2 {\displaystyle S:\ell ^{2}\to \ell ^{2}} defined by: S ( a 0 , a 1 , a 2 , … ) = ( 0 , a 0 , a 1 , a 2 , … ) {\displaystyle S(a_{0},a_{1},a_{2},\dots )=(0,a_{0},a_{1},a_{2},\dots )}This operator is also called the forward shift.
With respect to the standard orthonormal basis ( e n ) n = 0 ∞ {\displaystyle (e_{n})_{n=0}^{\infty }} for ℓ 2 {\displaystyle \ell ^{2}}, where e n {\displaystyle e_{n}} is the sequence with a 1 in the n-th position and 0 elsewhere, the action of S {\displaystyle S} is S e n = e n + 1 {\displaystyle Se_{n}=e_{n+1}}. Its matrix representation is:S = [ 0 0 0 0 ⋯ 1 0 0 0 ⋯ 0 1 0 0 ⋯ 0 0 1 0 ⋯ ⋮ ⋮ ⋮ ⋮ ⋱ ] {\displaystyle S={\begin{bmatrix}0&0&0&0&\cdots \\1&0&0&0&\cdots \\0&1&0&0&\cdots \\0&0&1&0&\cdots \\\vdots &\vdots &\vdots &\vdots &\ddots \end{bmatrix}}}This is a Toeplitz operator whose symbol is the function f ( z ) = z {\displaystyle f(z)=z}. It can be regarded as an infinite-dimensional lower shift matrix.
Properties
Adjoint operator
The adjoint of the unilateral shift, denoted S ∗ {\displaystyle S^{*}}, is the backward shift. It acts on ℓ 2 {\displaystyle \ell ^{2}} as: S ∗ ( b 0 , b 1 , b 2 , b 3 , … ) = ( b 1 , b 2 , b 3 , … ) {\displaystyle S^{*}(b_{0},b_{1},b_{2},b_{3},\dots )=(b_{1},b_{2},b_{3},\dots )}The matrix representation of S ∗ {\displaystyle S^{*}} is the conjugate transpose of the matrix for S {\displaystyle S}: S ∗ = [ 0 1 0 0 ⋯ 0 0 1 0 ⋯ 0 0 0 1 ⋯ 0 0 0 0 ⋯ ⋮ ⋮ ⋮ ⋮ ⋱ ] {\displaystyle S^{*}={\begin{bmatrix}0&1&0&0&\cdots \\0&0&1&0&\cdots \\0&0&0&1&\cdots \\0&0&0&0&\cdots \\\vdots &\vdots &\vdots &\vdots &\ddots \end{bmatrix}}}It can be regarded as an infinite-dimensional upper shift matrix.
Basic properties
- S , S ∗ {\displaystyle S,S^{*}} are both continuous but not compact.
- S ∗ S = I {\displaystyle S^{*}S=I}.
- S , S ∗ {\displaystyle S,S^{*}} make up a pair of unitary equivalence between ℓ 2 {\displaystyle \ell ^{2}} and the set of ℓ 2 {\displaystyle \ell ^{2}}-sequences whose first element is zero.
The resolvent operator has matrix representation( z I − S ) − 1 = [ z − 1 0 0 0 ⋯ z − 2 z − 1 0 0 ⋯ z − 3 z − 2 z − 1 0 ⋯ z − 4 z − 3 z − 2 z − 1 ⋯ ⋮ ⋮ ⋮ ⋮ ⋱ ] {\displaystyle (zI-S)^{-1}={\begin{bmatrix}z^{-1}&0&0&0&\cdots \\z^{-2}&z^{-1}&0&0&\cdots \\z^{-3}&z^{-2}&z^{-1}&0&\cdots \\z^{-4}&z^{-3}&z^{-2}&z^{-1}&\cdots \\\vdots &\vdots &\vdots &\vdots &\ddots \end{bmatrix}}}which is bounded iff | z | > 1 {\displaystyle |z|>1}. Similarly, ( z I − S ∗ ) − 1 = ( ( z ∗ I − S ) − 1 ) ∗ {\displaystyle (zI-S^{*})^{-1}=((z^{*}I-S)^{-1})^{*}}.
For any z ∈ C , a ∈ ℓ 2 {\displaystyle z\in \mathbb {C} ,a\in \ell ^{2}} with ‖ a ‖ = 1 {\displaystyle \|a\|=1},‖ ( z I − S ) a ‖ 2 = 1 + | z | 2 − 2 ℜ ( ⟨ S a , a ⟩ z ) , ‖ ( z I − S ∗ ) a ‖ 2 = 1 − | a 0 | 2 + | z | 2 − 2 ℜ ( ⟨ S a , a ⟩ z ∗ ) {\displaystyle \|(zI-S)a\|^{2}=1+|z|^{2}-2\Re (\langle Sa,a\rangle z),\quad \|(zI-S^{*})a\|^{2}=1-|a_{0}|^{2}+|z|^{2}-2\Re (\langle Sa,a\rangle z^{*})}where ℜ {\displaystyle \Re } is the real part.
Spectral theory
Spectrum of the forward shift—Let D {\displaystyle \mathbb {D} } be the open unit disk, D ¯ {\displaystyle {\overline {\mathbb {D} }}} the closed unit disk, and T {\displaystyle \mathbb {T} } the unit circle.
- The spectrum of S {\displaystyle S} is σ ( S ) = D ¯ {\displaystyle \sigma (S)={\overline {\mathbb {D} }}}.
- The point spectrum of S {\displaystyle S} is empty: σ p ( S ) = ∅ {\displaystyle \sigma _{p}(S)=\emptyset }.
- The approximate point spectrum of S {\displaystyle S} is the unit circle: σ a p ( S ) = T {\displaystyle \sigma _{ap}(S)=\mathbb {T} }.
To show σ ( S ) = D ¯ {\displaystyle \sigma (S)={\overline {\mathbb {D} }}}, use the matrix representation of ( z I − S ) − 1 {\displaystyle (zI-S)^{-1}}, and note that it is bounded iff | z | > 1 {\displaystyle |z|>1}. To show σ p ( S ) = ∅ {\displaystyle \sigma _{p}(S)=\emptyset }, directly show that S a = λ a {\displaystyle Sa=\lambda a} implies a = 0 {\displaystyle a=0}.
To show σ a p ( S ) = T {\displaystyle \sigma _{ap}(S)=\mathbb {T} }, note that ‖ ( z I − S ) a ‖ 2 ≥ 1 + | z | 2 − 2 | z | = ( 1 − | z | ) 2 {\displaystyle \|(zI-S)a\|^{2}\geq 1+|z|^{2}-2|z|=(1-|z|)^{2}} for any z ∈ C , a ∈ ℓ 2 {\displaystyle z\in \mathbb {C} ,a\in \ell ^{2}} with ‖ a ‖ = 1 {\displaystyle \|a\|=1}, so σ a p ( S ) ⊂ D ¯ ∖ D = T {\displaystyle \sigma _{ap}(S)\subset {\overline {\mathbb {D} }}\setminus \mathbb {D} =\mathbb {T} }. Conversely, for any z ∈ T {\displaystyle z\in \mathbb {T} }, construct the following unit vectora = 1 N ( 1 , z − 1 , z − 2 , … , z − ( N − 1 ) , 0 , 0 , … ) {\displaystyle a={\frac {1}{\sqrt {N}}}(1,z^{-1},z^{-2},\dots ,z^{-(N-1)},0,0,\dots )}then ‖ ( z I − S ) a ‖ 2 = 2 / N {\displaystyle \|(zI-S)a\|^{2}=2/N}, which converges to 0 at N → ∞ {\displaystyle N\to \infty }.
The spectral properties of S ∗ {\displaystyle S^{*}} differ significantly from those of S {\displaystyle S}:
- σ ( S ∗ ) = D ¯ {\displaystyle \sigma (S^{*})={\overline {\mathbb {D} }}} (since σ ( A ∗ ) = σ ( A ) ¯ {\displaystyle \sigma (A^{*})={\overline {\sigma (A)}}}).
- The point spectrum σ p ( S ∗ ) {\displaystyle \sigma _{p}(S^{*})} is the entire open unit disk D {\displaystyle \mathbb {D} }. For any λ ∈ D {\displaystyle \lambda \in \mathbb {D} }, the corresponding eigenvector is the geometric sequence ( 1 , λ , λ 2 , λ 3 , … ) {\displaystyle (1,\lambda ,\lambda ^{2},\lambda ^{3},\dots )}.
- The approximate point spectrum σ a p ( S ∗ ) {\displaystyle \sigma _{ap}(S^{*})} is the entire closed unit disk D ¯ {\displaystyle {\overline {\mathbb {D} }}}. To show this, it remains to show T ⊂ σ a p ( S ∗ ) {\displaystyle \mathbb {T} \subset \sigma _{ap}(S^{*})}, which can be proven by a similar construction as before, using a = 1 N ( 1 , z 1 , z 2 , … , z ( N − 1 ) , 0 , 0 , … ) {\displaystyle a={\frac {1}{\sqrt {N}}}(1,z^{1},z^{2},\dots ,z^{(N-1)},0,0,\dots )}.
Hardy space model
The unilateral shift can be studied using complex analysis.
Define the Hardy space H 2 {\displaystyle H^{2}} as the Hilbert space of analytic functions f ( z ) = ∑ n = 0 ∞ a n z n {\displaystyle f(z)=\sum _{n=0}^{\infty }a_{n}z^{n}} on the open unit disk D {\displaystyle \mathbb {D} } for which the sequence of coefficients ( a n ) {\displaystyle (a_{n})} is in ℓ 2 {\displaystyle \ell ^{2}}.
Define the multiplication operator M z {\displaystyle M_{z}} on H 2 {\displaystyle H^{2}}: ( M z f ) ( z ) = z f ( z ) {\displaystyle (M_{z}f)(z)=zf(z)}then S {\displaystyle S} and M z {\displaystyle M_{z}} are unitarily equivalent via the unitary map U : ℓ 2 → H 2 {\displaystyle U:\ell ^{2}\to H^{2}} defined byU ( a 0 , a 1 , a 2 , … ) = ∑ n = 0 ∞ a n z n {\displaystyle U(a_{0},a_{1},a_{2},\dots )=\sum _{n=0}^{\infty }a_{n}z^{n}}which gives U ∗ M z U = S {\displaystyle U^{*}M_{z}U=S}. Using this unitary equivalence, it is common in the literature to use S {\displaystyle S} to denote M z {\displaystyle M_{z}} and to treat H 2 {\displaystyle H^{2}} as the primary setting for the unilateral shift.
Commutant
The commutant of an operator A {\displaystyle A}, denoted { A } ′ {\displaystyle \{A\}'}, is the algebra of all bounded operators that commute with A {\displaystyle A}. The commutant of the unilateral shift is the algebra of multiplication operators on H 2 {\displaystyle H^{2}} by bounded analytic functions.{ S } ′ = { M φ : φ ∈ H ∞ } {\displaystyle \{S\}'=\{M_{\varphi }:\varphi \in H^{\infty }\}}Here, H ∞ {\displaystyle H^{\infty }} is the space of bounded analytic functions on D {\displaystyle \mathbb {D} }, and ( M φ f ) ( z ) = φ ( z ) f ( z ) {\displaystyle (M_{\varphi }f)(z)=\varphi (z)f(z)}.
Cyclic vectors
A vector x {\displaystyle x} is a cyclic vector for an operator A {\displaystyle A} if the linear span of its orbit { A n x : n ≥ 0 } {\displaystyle \{A^{n}x:n\geq 0\}} is dense in the space. We have:
- For the unilateral shift S {\displaystyle S} on H 2 {\displaystyle H^{2}}, the cyclic vectors are the outer functions.
- A function f ∈ H 2 {\displaystyle f\in H^{2}} that has a zero in the open unit disk D {\displaystyle \mathbb {D} } is not a cyclic vector. This is because every function in the span of its orbit will also be zero at that point, so the subspace cannot be dense.
- A function f ∈ H 2 {\displaystyle f\in H^{2}} that is bounded away from zero (i.e., inf z ∈ D | f ( z ) | > 0 {\displaystyle \inf _{z\in \mathbb {D} }|f(z)|>0}) is a cyclic vector.
- A function f ∈ H 2 {\displaystyle f\in H^{2}}, that is in the open unit disk D {\displaystyle \mathbb {D} } is nonzero but inf z ∈ D | f ( z ) | = 0 {\displaystyle \inf _{z\in \mathbb {D} }|f(z)|=0}, may or may not be cyclic. For example, f ( z ) = 1 − z {\displaystyle f(z)=1-z} is a cyclic vector.
The cyclic vectors are precisely the outer functions.
Lattice of invariant subspaces
The S {\displaystyle S}-invariant subspaces of H 2 {\displaystyle H^{2}} are completely characterized analytically. Specifically, they are precisely M u ( H 2 ) {\displaystyle M_{u}(H^{2})} where u {\displaystyle u} is an inner function.
The S {\displaystyle S}-invariant subspaces make up a lattice of subspaces. The two lattice operators, join and meet, correspond to operations on inner functions.
Given two invariant subspaces M u ( H 2 ) , M v ( H 2 ) {\displaystyle M_{u}(H^{2}),M_{v}(H^{2})}, we have M u ( H 2 ) ⊂ M v ( H 2 ) {\displaystyle M_{u}(H^{2})\subset M_{v}(H^{2})} iff u / v ∈ H 2 {\displaystyle u/v\in H^{2}}.