This and subsequent posts continue the discussion in my previous post about multiplying block matrices. In this series of posts I consider block diagonal matrices and in particular how to partition an existing matrix into block diagonal form. I’m doing this both to gain a better understanding of what it means to have a block diagonal matrix, and also as background for a subsequent post discussing how to multiply and invert block diagonal matrices.
First, what is a block diagonal matrix? As defined by the relevant Wikipedia article,
A block diagonal matrix is a block matrix which is a square matrix, and having [as] main diagonal blocks square matrices, such that the off-diagonal blocks are zero matrices.
Wolfram Mathworld provides a similar definition:
A block diagonal matrix, also called a diagonal block matrix, is a square diagonal matrix in which the diagonal elements are square matrices of any size (possibly even 1×1), and the off-diagonal elements are 0. A block diagonal matrix is therefore a block matrix in which the blocks off the diagonal are the zero matrices, and the diagonal matrices are square.
Here’s another definition of block diagonal form consistent with the above definitions; it uses
partition in the same sense as in my previous post on multiplying block matrices.
Let be an by square matrix, and suppose we partition into submatrices with row and column partitions:
Thus partitioned, is a block diagonal matrix (or, alternately, has been partitioned into block diagonal form) if .
Note that by convention we consider any square matrix to be in block diagonal form with a single partition, i.e., when . (This follows from the Wolfram Mathworld definition above, which defines a block diagonal matrix as a special type of diagonal matrix, and also from the Wolfram Mathworld definition of a diagonal matrix, which allows it to have just one element.)
We could stop with this definition and then proceed to derive the results of multiplying or inverting block diagonal matrices using the formula above. However in this series of posts I have tried to take a more rigorous approach (though I have probably failed to achieve full rigor in some areas of the argument). In particular I was interested in the question of how to best partition a matrix into block diagonal form, and felt it was important to address the fine details in order to convince myself that the informal approach was actually correct.
By convention a matrix is trivially in block diagonal form if considered to have a single partition, and a diagonal matrix is also in block diagonal form as well, with each entry on the diagonal considered as a submatrix with a single entry. What about the in-between cases though? Is there a
canonical way to partition an arbitrary matrix so that it could (depending on its values) have multiple submatrices on the diagonal, with each submatrix having multiple entries?
I therefore want to prove a more definitive statement:
For any by square matrix we can uniquely and maximally partition into a block diagonal matrix with submatrices, for some where , such that
This partitioning is maximal in the sense that cannot also be partitioned as a block diagonal matrix with partitions, where .
This partitioning is unique in the sense that if we partition into block diagonal form as
and also into block diagonal form as
In the next post I start the process of trying to prove the above statement, by constructing a way to partition as described.