[Linear Algebra] 6. Inverse matrices, column space and null space

2022. 3. 12. 12:50 · Mathematics/Linear Algebra

Let's think about the usefulness of linear algebra. One of the main reasons that linear algebra is so broadly applicable is that it can solve systems of linear equations.

 

 

"System of linear equations":

$$
\begin{matrix}
ax + by + cz = l \\
dx + ey + fz = m \\
gx + hy + iz = n \\
\end{matrix}
$$

 

Let's package the "system of linear equations" into a single vector equation, which consists of 1) a matrix ($A$) that contains all of the constant coefficients, 2) a vector ($\mathbf{\vec{x}}$) that contains all of the variables, and 3) a constant vector ($\mathbf{\vec{v}}$), the result of the matrix-vector multiplication.

$$
\begin{matrix}
ax + by + cz = l \\
dx + ey + fz = m \\
gx + hy + iz = n \\
\end{matrix}
\longrightarrow
\underset{A}{\begin{bmatrix}a & b & c \\ d & e & f \\ g & h & i \end{bmatrix}}
\underset{\mathbf{\vec{x}}}{\begin{bmatrix}x \\ y \\ z\end{bmatrix}}
=
\underset{\mathbf{\vec{v}}}{\begin{bmatrix}l \\ m \\ n\end{bmatrix}}
$$
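
To make this packaging concrete, here is a small sketch (the specific numbers are my own illustration, assuming NumPy is available):

```python
import numpy as np

# A hypothetical 3x3 system, purely for illustration:
#   2x + 1y + 1z = 5
#   1x + 3y + 2z = 8
#   1x + 0y + 0z = 1
A = np.array([[2.0, 1.0, 1.0],
              [1.0, 3.0, 2.0],
              [1.0, 0.0, 0.0]])   # coefficient matrix A
v = np.array([5.0, 8.0, 1.0])     # constant vector v

# The packaged equation is A @ x == v: multiplying A by a candidate x
# applies the transformation and should land exactly on v.
x = np.array([1.0, 1.0, 2.0])     # a candidate solution vector
print(np.allclose(A @ x, v))      # True: this x solves the system
```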

 

There is a pretty cool geometric interpretation of this problem. The matrix $A$ corresponds to some linear transformation, so solving $A\mathbf{\vec{x}} = \mathbf{\vec{v}}$ means we're looking for a vector $\mathbf{\vec{x}}$ which, after applying the transformation, lands on $\mathbf{\vec{v}}$.

 

Now, let's think about how to solve these equations. The approach depends on whether the transformation associated with $A$ squishes all of space into a lower dimension (i.e. its determinant is $0$) or not.

 

Non-zero determinant

 

In this case, the transformation $A$ does not squish space into a lower dimension. So there is always one and only one vector that lands on $\mathbf{\vec{v}}$. Therefore, we can solve these equations by playing the transformation $A$ in reverse.

 

Playing the transformation $A$ in reverse is itself another linear transformation, commonly called "the inverse of $A$" and denoted $A^{-1}$. The core property of this transformation is that if we first apply $A$, then follow it with $A^{-1}$, we end up back where we started. So $A^{-1}A$ equals the matrix known as the "identity transformation", which does nothing.

 

Therefore, once we find the inverse of $A$, we can solve these equations by multiplying $\mathbf{\vec{v}}$ by $A^{-1}$, i.e. $\mathbf{\vec{x}} = A^{-1}\mathbf{\vec{v}}$. If the determinant of $A$ is non-zero, there is a unique solution. This idea also makes sense in higher dimensions.
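
As a rough sketch of this idea (again with made-up numbers, assuming NumPy):

```python
import numpy as np

# Same illustrative system as above (my own numbers, not from the original post).
A = np.array([[2.0, 1.0, 1.0],
              [1.0, 3.0, 2.0],
              [1.0, 0.0, 0.0]])
v = np.array([5.0, 8.0, 1.0])

print(np.linalg.det(A))           # non-zero (about -1.0), so an inverse exists

A_inv = np.linalg.inv(A)          # the inverse transformation A^{-1}
print(A_inv @ v)                  # x = A^{-1} v  ->  [1. 1. 2.]

# Numerically, np.linalg.solve(A, v) is preferred over forming A^{-1} explicitly.
print(np.linalg.solve(A, v))      # same unique solution
```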

 

Zero determinant

 

In this case, the transformation $A$ squishes space into a lower dimension, so there is no inverse of $A$: we cannot restore a line back into a plane. That would require transforming each individual vector into a whole line of vectors, and no function can do that.

 

It's still possible that a solution exists even when there is no inverse. If the transformation squishes space onto a line, there will be a solution only when the vector $\mathbf{\vec{v}}$ happens to sit somewhere on that line.
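
A minimal sketch of the zero-determinant case, using an illustrative 2×2 matrix of my own (assuming NumPy):

```python
import numpy as np

# A singular 2x2 matrix: both columns point along [1, 2], so the whole
# plane gets squished onto that single line.
A = np.array([[1.0, 2.0],
              [2.0, 4.0]])
print(np.linalg.det(A))            # 0.0: no inverse exists

try:
    np.linalg.inv(A)               # un-squishing the line back into a plane is impossible
except np.linalg.LinAlgError as err:
    print("no inverse:", err)

# A solution still exists because this v happens to lie on that output line.
v = np.array([3.0, 6.0])
x = np.array([3.0, 0.0])           # one of infinitely many solutions
print(np.allclose(A @ x, v))       # True
```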

 

Pseudo-inverse matrix

 

If there is no inverse matrix, we can still find a best-fit (least-squares) solution using the pseudo-inverse matrix $A^+$. For an $n \times m$ matrix $A$ of full rank:

$$
A^+ =
\begin{cases}
(A^\top A)^{-1} A^\top & \text{if } n \ge m \ \text{(full column rank)} \\
A^\top (AA^\top)^{-1} & \text{if } n \le m \ \text{(full row rank)}
\end{cases}
$$
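
For instance, a small sketch comparing the explicit formula for the $n \ge m$ case with NumPy's built-in `np.linalg.pinv` (the matrix here is my own illustrative example):

```python
import numpy as np

# A tall (n = 3 rows >= m = 2 columns) matrix with full column rank,
# chosen purely for illustration.
A = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [1.0, 1.0]])
v = np.array([1.0, 2.0, 2.0])

# Explicit formula for the n >= m case: A^+ = (A^T A)^{-1} A^T.
A_plus = np.linalg.inv(A.T @ A) @ A.T
print(A_plus @ v)                               # least-squares solution of A x ≈ v

# NumPy's built-in pseudo-inverse (computed via SVD) agrees with the formula.
print(np.allclose(A_plus, np.linalg.pinv(A)))   # True
```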


Rank

There is a difference in degree among the cases where the determinant is $0$. Given a $3 \times 3$ matrix, for example, it seems a lot harder for a solution to exist when the transformation squishes space onto a line than when it squishes space onto a plane, even though both cases have a zero determinant.

 

So there is terminology that's a bit more specific than "zero determinant". When the output of a transformation is a line, meaning it's one-dimensional, we say the transformation has a "rank" of $1$. If all the vectors land on some two-dimensional plane, we say the transformation has a rank of $2$. So the word "rank" means the number of dimensions of the output of a transformation.
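
A quick sketch of this idea with NumPy's `matrix_rank` (the example matrices are my own):

```python
import numpy as np

# matrix_rank reports the number of dimensions of the output of the transformation.
A_full = np.array([[2.0, 1.0, 1.0],
                   [1.0, 3.0, 2.0],
                   [1.0, 0.0, 0.0]])    # output is all of 3-D space
A_plane = np.array([[1.0, 0.0, 1.0],
                    [0.0, 1.0, 1.0],
                    [0.0, 0.0, 0.0]])   # output is a plane
A_line = np.array([[1.0, 2.0, 3.0],
                   [2.0, 4.0, 6.0],
                   [3.0, 6.0, 9.0]])    # output is a line

for M in (A_full, A_plane, A_line):
    print(np.linalg.matrix_rank(M))     # 3, then 2, then 1
```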

Column space

The column space of a matrix is the set of all possible outputs (e.g. a line, a plane, or 3-D space) of the matrix. The columns of the matrix tell us where the basis vectors land, and the span of those transformed basis vectors gives us all possible outputs. In other words, the column space is the span of the columns of the matrix. So a more precise definition of rank would be "the number of dimensions in the column space". When the rank equals the number of columns, we call the matrix "full rank".
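
One practical consequence: a solution of $A\mathbf{\vec{x}} = \mathbf{\vec{v}}$ exists exactly when $\mathbf{\vec{v}}$ lies in the column space. A rough sketch of that check (the helper `in_column_space` is my own illustrative function, assuming NumPy):

```python
import numpy as np

# A rank-1 matrix: its column space is the line spanned by [1, 2, 3].
A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0],
              [3.0, 6.0, 9.0]])

def in_column_space(A, v):
    """v is reachable exactly when appending it as an extra column
    does not enlarge the span, i.e. does not increase the rank."""
    return np.linalg.matrix_rank(np.column_stack([A, v])) == np.linalg.matrix_rank(A)

print(in_column_space(A, np.array([2.0, 4.0, 6.0])))   # True: a solution exists
print(in_column_space(A, np.array([1.0, 0.0, 0.0])))   # False: no solution
```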

Null space

When a matrix isn't full rank, meaning it squishes space to a smaller dimension, there are a bunch of vectors that land on zero. This set of vectors that lands on the origin is called the "null space" or the "kernel" of the matrix. When $\mathbf{\vec{v}}$ is the zero vector in the system of linear equations, the null space represents all of the possible solutions.
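
A small sketch of computing a null-space basis via the SVD (an assumption on my part; the original post does not prescribe a method):

```python
import numpy as np

# Same rank-1 matrix as above: it squishes 3-D space onto a line, so a whole
# 2-D plane of vectors gets squished onto the origin.
A = np.array([[1.0, 2.0, 3.0],
              [2.0, 4.0, 6.0],
              [3.0, 6.0, 9.0]])

# The right-singular vectors whose singular values are (numerically) zero
# form a basis of the null space / kernel.
_, s, Vt = np.linalg.svd(A)
null_basis = Vt[np.isclose(s, 0.0)].T    # shape (3, 2): two basis vectors
print(null_basis.shape)
print(np.allclose(A @ null_basis, 0.0))  # True: they all land on the zero vector
```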


Summary

In this section, we took a very high-level overview of "systems of linear equations". 1) Each system of linear equations matches with some kind of linear transformation, and when that transformation has an inverse, we can solve the equations with that inverse. Otherwise, 2) the concept of column space helps us understand whether a solution exists or not, and 3) the concept of null space helps us understand what the set of possible solutions is.