Intuitively, the matrix 2-norm measures the greatest action achievable by applying A to a vector x. Write the full singular value decomposition A = UΣV*, where U and V are unitary matrices and Σ is a non-negative diagonal matrix. (If A is Hermitian and positive semidefinite, then U = V.) Basically, applying A amounts to an orthonormal basis change (V*), followed by a scaling (Σ), followed by another orthonormal basis change (U). The squared L^2 norm of a vector is the "energy" of a discrete signal in the signal-processing sense:
L^2 norm of a vector: $\|x\|_2 = (x^*x)^{1/2}$
Energy of a discrete signal: $x^*x$
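As a quick sanity check, here is a minimal NumPy sketch (not from the original text, purely illustrative): it computes a full SVD, verifies A = UΣV*, and confirms that the squared L^2 norm of a vector equals its energy $x^*x$.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((4, 3)) + 1j * rng.standard_normal((4, 3))

U, s, Vh = np.linalg.svd(A)            # full SVD: U is 4x4, s has 3 entries, Vh is 3x3
Sigma = np.zeros((4, 3))
Sigma[:3, :3] = np.diag(s)             # embed the singular values on the diagonal
print(np.allclose(A, U @ Sigma @ Vh))  # True: A = U Σ V*

x = rng.standard_normal(3) + 1j * rng.standard_normal(3)
print(np.isclose(np.linalg.norm(x) ** 2, np.vdot(x, x).real))  # ||x||_2^2 is the energy x*x
```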
Unitary (and orthogonal) transforms, like rotations and reflections, preserve norms, which is the content of Parseval's theorem:
$(Qx)^*(Qx) = x^*Q^*Qx = x^*x$
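A small illustrative sketch of this identity, using a unitary Q obtained from a QR factorization (the particular choice of Q is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(1)
Q, _ = np.linalg.qr(rng.standard_normal((5, 5)) + 1j * rng.standard_normal((5, 5)))  # Q is unitary

x = rng.standard_normal(5) + 1j * rng.standard_normal(5)
print(np.allclose(Q.conj().T @ Q, np.eye(5)))                # Q*Q = I
print(np.isclose(np.linalg.norm(Q @ x), np.linalg.norm(x)))  # ||Qx||_2 = ||x||_2
```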
This is why the induced matrix 2-norm of A equals its largest singular value. The induced 2-norm is defined as the greatest amplification A can apply to a nonzero vector:
$\|A\|_2 = \sup_{x \in \mathbb{C}^n,\, x \neq 0} \frac{\|Ax\|_2}{\|x\|_2}$
Because U and V preserve the L^2 norm, the supremum is achieved when all of a vector's (or discrete signal's) energy is scaled by the largest singular value:
$\|A\|_2 = \sup_{x \in \mathbb{C}^n,\ \|x\|_2 = 1} \|Ax\|_2 = \sup_{\|x\|_2 = 1} \|U \Sigma V^* x\|_2$
$\|A\|_2 = \sup_{\|y\|_2 = 1} \|\Sigma y\|_2, \qquad \text{where } y = V^*x \text{ also ranges over the unit sphere}$
The unit vector that achieves this largest scaling is the first right singular vector $v_1$, i.e. the first column of V.
Notice that $V^* v_1 = e_1$ and $\Sigma e_1 = \sigma_{\max}(A)\, e_1$, where $\sigma_{\max}(A)$ is the largest singular value of A, so $\|A v_1\|_2 = \sigma_{\max}(A)$.
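A short NumPy sketch of this argument (illustrative only): the induced 2-norm reported by `np.linalg.norm(A, 2)` matches the largest singular value, and the first right singular vector attains it.

```python
import numpy as np

rng = np.random.default_rng(2)
A = rng.standard_normal((6, 4))

U, s, Vh = np.linalg.svd(A)
v1 = Vh[0]                                       # first right singular vector (first column of V, real case)

print(np.isclose(np.linalg.norm(A, 2), s[0]))    # induced 2-norm equals sigma_max(A)
print(np.isclose(np.linalg.norm(A @ v1), s[0]))  # the supremum is attained at v1
```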
This is relevant to Lipschitz continuous functions: functions $f : \mathbb{R}^n \to \mathbb{R}^m$ for which the following holds for some $K \ge 0$ and all $x_1 \neq x_2$ in $\mathbb{R}^n$:
$\frac{\|f(x_1) - f(x_2)\|_2}{\|x_1 - x_2\|_2} \le K$
If f is linear, then $f(x_1) - f(x_2)$ becomes $f(x_1 - x_2) = A(x_1 - x_2)$ for some matrix A. In this case, the smallest valid Lipschitz constant K is simply the largest singular value of A. In particular, if $\sigma_{\max}(A) < 1$, then $Ax$ always has less signal energy than $x$, i.e. A is a contraction.
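A hedged sketch of this claim: probing a linear map with random pairs of points never produces a difference ratio above $\sigma_{\max}(A)$, and the best ratio found creeps toward that bound.

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((5, 5))
sigma_max = np.linalg.norm(A, 2)         # largest singular value of A

ratios = []
for _ in range(10_000):
    x1, x2 = rng.standard_normal(5), rng.standard_normal(5)
    ratios.append(np.linalg.norm(A @ (x1 - x2)) / np.linalg.norm(x1 - x2))

print(max(ratios) <= sigma_max + 1e-12)  # sigma_max(A) bounds every ratio
print(max(ratios) / sigma_max)           # approaches 1 as the sampling gets denser
```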
With a little more work, this observation extends to nonlinear functions via their linearization (the Jacobian matrix). I won't explain it in full here, but this line of thinking is very useful in convergence proofs for dynamical systems and indirect/iterative solvers.
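As a rough illustration of that line of thinking (the map `g` and the helper `numerical_jacobian` below are made up for this example, not taken from the text): a fixed-point iteration $x_{k+1} = g(x_k)$ converges locally when the spectral norm of the Jacobian of $g$ at the fixed point stays below 1, i.e. when the local Lipschitz constant makes $g$ a contraction.

```python
import numpy as np

def g(x):
    # A simple made-up nonlinear map whose Jacobian has spectral norm well below 1.
    return 0.25 * np.array([np.sin(x[0]) + x[1], np.cos(x[1]) - x[0]])

def numerical_jacobian(f, x, eps=1e-6):
    # Forward-difference approximation of the Jacobian (the linearization of f at x).
    n = x.size
    J = np.zeros((n, n))
    fx = f(x)
    for j in range(n):
        step = np.zeros(n)
        step[j] = eps
        J[:, j] = (f(x + step) - fx) / eps
    return J

x = np.array([1.0, -1.0])
for _ in range(50):
    x = g(x)                                # fixed-point iteration x_{k+1} = g(x_k)

J = numerical_jacobian(g, x)
print(np.linalg.norm(J, 2) < 1)             # local contraction => local convergence
print(np.allclose(x, g(x), atol=1e-8))      # x is (numerically) a fixed point of g
```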