Machine Learning Mathematics Study Notes: Chapter 2. Linear Algebra

Chapter 2. Linear Algebra

Resources

Gilbert Strang’s Linear Algebra course

Linear Algebra Series by 3Blue1Brown

Algebra

Construct a set of objects (symbols) and a set of rules to manipulate these objects.

Vector objects:

Geometric vectors

Polynomials

Audio signals

Elements of $\R^n$ (tuples of $n$ real numbers)

2.1 Systems of Linear Equations

$$a_{11}x_1 + \cdots + a_{1n}x_n = b_1$$
$$\vdots$$
$$a_{m1}x_1 + \cdots + a_{mn}x_n = b_m$$

where $a_{ij} \in \R$ and $b_i \in \R$

The above is the general form of a system of linear equations. A tuple $(x_1, \cdots, x_n) \in \R^n$ that satisfies all of the above equations simultaneously is a solution of the linear equation system.

In a system with two variables, every linear equation defines a line in the $x_1x_2$-plane

The solution set is the intersection of these lines

$$\left[\begin{matrix} a_{11} \\ \vdots \\ a_{m1} \end{matrix}\right] x_1 + \left[\begin{matrix} a_{12} \\ \vdots \\ a_{m2} \end{matrix}\right] x_2 + \cdots + \left[\begin{matrix} a_{1n} \\ \vdots \\ a_{mn} \end{matrix}\right] x_n = \left[\begin{matrix} b_1 \\ \vdots \\ b_m \end{matrix}\right]$$

$$\left[\begin{matrix} a_{11} & \cdots & a_{1n} \\ \vdots & & \vdots \\ a_{m1} & \cdots & a_{mn} \end{matrix}\right] \left[\begin{matrix} x_1 \\ \vdots \\ x_n \end{matrix}\right] = \left[\begin{matrix} b_1 \\ \vdots \\ b_m \end{matrix}\right]$$
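The matrix form $Ax = b$ can be checked numerically; a minimal NumPy sketch (the specific system is an arbitrary illustrative choice):

```python
import numpy as np

# An invertible 2x2 system, chosen only for illustration:
# 2*x1 + 3*x2 = 8
# 1*x1 - 1*x2 = -1
A = np.array([[2.0, 3.0],
              [1.0, -1.0]])
b = np.array([8.0, -1.0])

x = np.linalg.solve(A, b)             # solves Ax = b for square, invertible A
print(x)                              # [1. 2.]
np.testing.assert_allclose(A @ x, b)  # verify the solution satisfies the system
```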

2.2 Matrices

$$A = \left[\begin{matrix} a_{11} & a_{12} & \cdots & a_{1n} \\ a_{21} & a_{22} & \cdots & a_{2n} \\ \vdots & \vdots & & \vdots \\ a_{m1} & a_{m2} & \cdots & a_{mn} \end{matrix}\right], \quad a_{ij} \in \R$$

$(1, n)$-matrices are called rows (row vectors)

$(m, 1)$-matrices are called columns (column vectors)

2.2.1 Addition and Multiplication

  • Addition

    • Element-wise: $(A + B)_{ij} = a_{ij} + b_{ij}$ for matrices of the same size
  • Multiplication

    • $c_{ij} = \displaystyle \sum^{n}_{l=1} a_{il} b_{lj}$, $i = 1, \cdots, m$, $j = 1, \cdots, k$
  • For dimensions: an $(m \times n)$ matrix times an $(n \times k)$ matrix gives an $(m \times k)$ matrix
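A minimal NumPy sketch of the product rule above, with arbitrary small matrices to show how the dimensions combine:

```python
import numpy as np

A = np.array([[1, 2, 3],
              [4, 5, 6]])        # shape (2, 3)
B = np.array([[1, 0],
              [0, 1],
              [1, 1]])           # shape (3, 2)

C = A @ B                        # (2, 3) @ (3, 2) -> (2, 2)
print(C.shape)                   # (2, 2)

# c_ij = sum_l a_il * b_lj, spelled out explicitly:
C_manual = np.array([[sum(A[i, l] * B[l, j] for l in range(3))
                      for j in range(2)]
                     for i in range(2)])
assert (C == C_manual).all()
```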

2.2.2 Inverse and Transpose

  • Inverse

    • $AB = I_n = BA$: $B$ is called the inverse of $A$ and is denoted by $A^{-1}$
    • $A = \left[\begin{matrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{matrix}\right] \Rightarrow A^{-1} = \dfrac{1}{a_{11}a_{22} - a_{12}a_{21}} \left[\begin{matrix} a_{22} & -a_{12} \\ -a_{21} & a_{11} \end{matrix}\right]$
  • Transpose

    • For $A \in \R^{m\times n}$, the matrix $B \in \R^{n\times m}$ with $b_{ij} = a_{ji}$ is called the transpose of $A$; we write $B = A^T$
  • Important Properties

    • $AA^{-1} = I = A^{-1}A$
    • $(AB)^{-1} = B^{-1}A^{-1}$
    • $(A + B)^{-1} \neq A^{-1} + B^{-1}$
    • $(A^T)^T = A$
    • $(A + B)^T = A^T + B^T$
    • $(AB)^T = B^T A^T$
  • Symmetric Matrix

    • $A^T = A$, which requires $A \in \R^{n\times n}$
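These inverse and transpose properties are easy to spot-check numerically; a sketch with random matrices (a random Gaussian matrix is almost surely invertible, so this is illustrative, not a proof):

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))
B = rng.standard_normal((3, 3))
I = np.eye(3)

assert np.allclose(A @ np.linalg.inv(A), I)              # A A^{-1} = I
assert np.allclose(np.linalg.inv(A @ B),
                   np.linalg.inv(B) @ np.linalg.inv(A))  # (AB)^{-1} = B^{-1} A^{-1}
assert np.allclose((A + B).T, A.T + B.T)                 # (A+B)^T = A^T + B^T
assert np.allclose((A @ B).T, B.T @ A.T)                 # (AB)^T = B^T A^T
```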

2.2.3 Multiplication by Scalar

  • Let $A \in \R^{m\times n}$ and $\lambda \in \R$. Then $\lambda A = K$ with $K_{ij} = \lambda a_{ij}$

  • Associativity

    • $(\lambda \psi)C = \lambda(\psi C)$, $C \in \R^{m\times n}$
    • $\lambda(BC) = (\lambda B)C = B(\lambda C) = (BC)\lambda$, $B \in \R^{m\times n}$, $C \in \R^{n\times k}$
    • $(\lambda C)^T = C^T\lambda^T = C^T\lambda = \lambda C^T$, since $\lambda = \lambda^T$ for all $\lambda \in \R$
  • Distributivity

    • $(\lambda + \psi)C = \lambda C + \psi C$, $C \in \R^{m\times n}$
    • $\lambda(B + C) = \lambda B + \lambda C$, $B, C \in \R^{m\times n}$
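A quick numerical check of these scalar-multiplication laws (all values are arbitrary):

```python
import numpy as np

lam, psi = 2.0, -3.0
C = np.array([[1.0, 2.0], [3.0, 4.0]])
B = np.array([[0.0, 1.0], [1.0, 0.0]])

assert np.allclose((lam * psi) * C, lam * (psi * C))    # associativity
assert np.allclose((lam + psi) * C, lam * C + psi * C)  # distributivity (scalars)
assert np.allclose(lam * (B + C), lam * B + lam * C)    # distributivity (matrices)
assert np.allclose((lam * C).T, lam * C.T)              # (lambda C)^T = lambda C^T
```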

2.3 Solving Systems of Linear Equations

2.3.1 Particular and General Solution

  • The general approach consists of three steps:

    1. Find a particular solution to $Ax = b$
    2. Find all solutions to $Ax = 0$
    3. Combine the solutions from steps 1 and 2 to obtain the general solution
  • Gaussian Elimination

    • Transforms the system of linear equations into a simple form to which the three steps above can be applied
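A sketch of the three steps with NumPy/SciPy on an underdetermined system (the numbers are an illustrative choice); `scipy.linalg.null_space` returns an orthonormal basis of the solution space of $Ax = 0$:

```python
import numpy as np
from scipy.linalg import null_space

# Underdetermined system (2 equations, 4 unknowns), chosen for illustration.
A = np.array([[1.0, 0.0, 8.0, -4.0],
              [0.0, 1.0, 2.0, 12.0]])
b = np.array([42.0, 8.0])

# Step 1: a particular solution of Ax = b (least squares picks one exactly here).
x_p = np.linalg.lstsq(A, b, rcond=None)[0]

# Step 2: all solutions of Ax = 0 (orthonormal basis of the null space).
N = null_space(A)             # shape (4, 2): two free directions

# Step 3: general solution x = x_p + N @ lambda for any coefficients lambda.
lam = np.array([1.0, -2.0])   # arbitrary coefficients
x = x_p + N @ lam
np.testing.assert_allclose(A @ x, b)
```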

2.3.2 Elementary Transformation

  • Keep the solution set the same, but transform the equation system into a simpler form (equation = row):

    • Exchange of two equations (rows)
    • Multiplication of an equation (row) with a constant $\lambda \in \R \setminus \{0\}$
    • Addition of two equations (rows)
  • Pivot

    • The leading coefficient of a row (the first nonzero number from the left) is called the pivot.
  • REF - Row Echelon Form

    • The variables corresponding to the pivots in the REF are called basic variables; the other variables are free variables.
  • Gaussian Elimination

    • An algorithm that performs elementary transformations to bring a system of linear equations into reduced REF
  • Reduced REF

    • $A = \left[\begin{matrix} 1 & 3 & 0 & 0 & 3 \\ 0 & 0 & 1 & 0 & 9 \\ 0 & 0 & 0 & 1 & -4 \end{matrix}\right]$

    • The pivots are in column 1 (row 1), column 3 (row 2), and column 4 (row 3).

    • The key idea for finding the solutions of $Ax = 0$ is to look at the non-pivot columns, which we need to express as linear combinations of the pivot columns.

    • Solutions:

      • $\left[\begin{matrix} 3 \\ -1 \\ 0 \\ 0 \\ 0 \end{matrix}\right]$ and $\left[\begin{matrix} 3 \\ 0 \\ 9 \\ -4 \\ -1 \end{matrix}\right]$
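Both solution vectors can be verified directly; a minimal NumPy check:

```python
import numpy as np

A = np.array([[1, 3, 0, 0,  3],
              [0, 0, 1, 0,  9],
              [0, 0, 0, 1, -4]])

v1 = np.array([3, -1, 0,  0,  0])   # built from non-pivot column 2
v2 = np.array([3,  0, 9, -4, -1])   # built from non-pivot column 5

assert (A @ v1 == 0).all() and (A @ v2 == 0).all()  # both solve Ax = 0
```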

2.3.3 The Minus-1 Trick

  • The columns of $\widetilde{A}$ that contain the $-1$ pivots are solutions of the homogeneous equation system $Ax = 0$. These columns form a basis of the solution space of $Ax = 0$, which we call the kernel or null space

  • Example

    • $A = \left[\begin{matrix} 1 & 3 & 0 & 0 & 3 \\ 0 & 0 & 1 & 0 & 9 \\ 0 & 0 & 0 & 1 & -4 \end{matrix}\right]$
    • Augment this matrix to a $5 \times 5$ matrix by adding rows of the form $[0 \cdots 0\ {-1}\ 0 \cdots 0]$ at the places where the pivots on the diagonal are missing.
    • Obtain: $\widetilde{A} = \left[\begin{matrix} 1 & 3 & 0 & 0 & 3 \\ 0 & -1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 9 \\ 0 & 0 & 0 & 1 & -4 \\ 0 & 0 & 0 & 0 & -1 \end{matrix}\right]$
    • Solutions: $\left[\begin{matrix} 3 \\ -1 \\ 0 \\ 0 \\ 0 \end{matrix}\right]$ and $\left[\begin{matrix} 3 \\ 0 \\ 9 \\ -4 \\ -1 \end{matrix}\right]$
  • Calculating the Inverse

    • The matrix is augmented by inserting the identity matrix on the right. Then Gaussian elimination is used to bring the left side into reduced REF; the right side is then the inverse of the matrix

    • $AX = I_n$, hence $X = A^{-1}$

    • Example:

      • Original matrix:
      • $A = \left[\begin{matrix} 1 & 0 & 2 & 0 \\ 1 & 1 & 0 & 0 \\ 1 & 2 & 0 & 1 \\ 1 & 1 & 1 & 1 \end{matrix}\right]$
      • Augmented:
      • $\left[\begin{array}{cccc|cccc} 1 & 0 & 2 & 0 & 1 & 0 & 0 & 0 \\ 1 & 1 & 0 & 0 & 0 & 1 & 0 & 0 \\ 1 & 2 & 0 & 1 & 0 & 0 & 1 & 0 \\ 1 & 1 & 1 & 1 & 0 & 0 & 0 & 1 \end{array}\right]$
      • After Gaussian elimination:
      • $\left[\begin{array}{cccc|cccc} 1 & 0 & 0 & 0 & -1 & 2 & -2 & 2 \\ 0 & 1 & 0 & 0 & 1 & -1 & 2 & -2 \\ 0 & 0 & 1 & 0 & 1 & -1 & 1 & -1 \\ 0 & 0 & 0 & 1 & -1 & 0 & -1 & 2 \end{array}\right]$
      • The inverse is $A^{-1} = \left[\begin{matrix} -1 & 2 & -2 & 2 \\ 1 & -1 & 2 & -2 \\ 1 & -1 & 1 & -1 \\ -1 & 0 & -1 & 2 \end{matrix}\right]$
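The hand computation can be checked with `np.linalg.inv`:

```python
import numpy as np

A = np.array([[1.0, 0.0, 2.0, 0.0],
              [1.0, 1.0, 0.0, 0.0],
              [1.0, 2.0, 0.0, 1.0],
              [1.0, 1.0, 1.0, 1.0]])

A_inv = np.linalg.inv(A)
print(np.round(A_inv))   # matches the matrix obtained by Gaussian elimination
np.testing.assert_allclose(A @ A_inv, np.eye(4), atol=1e-12)
```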

2.4 Vector Spaces

A set of elements and an operation defined on these elements that keeps some structure of the set intact.

2.4.1 Groups

  • Consider a set $\mathcal{G}$ and an operation $\otimes : \mathcal{G} \times \mathcal{G} \rightarrow \mathcal{G}$ defined on $\mathcal{G}$. Then $G := (\mathcal{G}, \otimes)$ is called a group if the following hold:

    • Closure of $\mathcal{G}$ under $\otimes$: $\forall x, y \in \mathcal{G} : (x \otimes y) \in \mathcal{G}$
    • Associativity: $\forall x, y, z \in \mathcal{G} : (x \otimes y) \otimes z = x \otimes (y \otimes z)$
    • Neutral element: $\exists\, e \in \mathcal{G}\ \forall x \in \mathcal{G} : x \otimes e = x$ and $e \otimes x = x$
    • Inverse element: $\forall x \in \mathcal{G}\ \exists\, y \in \mathcal{G} : x \otimes y = e$ and $y \otimes x = e$, where $e$ is the neutral element.
  • If additionally $\forall x, y \in \mathcal{G} : x \otimes y = y \otimes x$, then $G = (\mathcal{G}, \otimes)$ is an Abelian group (commutative)

  • Examples:

    • $(\Z, +)$ is an Abelian group
    • $(\N_0, +)$ is not a group, because the inverse elements are missing.
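A brute-force sketch that checks the four group axioms above for $(\Z_5, +_5)$, i.e., the integers modulo 5 under addition, a finite stand-in for $(\Z, +)$:

```python
from itertools import product

G = range(5)
op = lambda x, y: (x + y) % 5          # addition modulo 5

closure = all(op(x, y) in G for x, y in product(G, G))
assoc   = all(op(op(x, y), z) == op(x, op(y, z)) for x, y, z in product(G, G, G))
e = 0                                   # neutral element
neutral = all(op(x, e) == x and op(e, x) == x for x in G)
inverse = all(any(op(x, y) == e and op(y, x) == e for y in G) for x in G)
abelian = all(op(x, y) == op(y, x) for x, y in product(G, G))

print(closure, assoc, neutral, inverse, abelian)   # all True -> Abelian group
```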

2.4.2 Vector Spaces

  • A real-valued vector space $V = (\mathcal{V}, +, \cdot)$ is a set $\mathcal{V}$ with two operations $$+ : \mathcal{V} \times \mathcal{V} \rightarrow \mathcal{V}$$ $$\cdot\ : \R \times \mathcal{V} \rightarrow \mathcal{V}$$ where
    1. $(\mathcal{V}, +)$ is an Abelian group
    2. Distributivity: $\forall \lambda \in \R,\ x, y \in \mathcal{V} : \lambda \cdot (x + y) = \lambda \cdot x + \lambda \cdot y$ and $\forall \lambda, \psi \in \R,\ x \in \mathcal{V} : (\lambda + \psi) \cdot x = \lambda \cdot x + \psi \cdot x$
    3. Associativity (outer operation): $\forall \lambda, \psi \in \R,\ x \in \mathcal{V} : \lambda \cdot (\psi \cdot x) = (\lambda\psi) \cdot x$
    4. Neutral element with respect to the outer operation: $\forall x \in \mathcal{V} : 1 \cdot x = x$
  • The elements $x \in \mathcal{V}$ are called vectors.
  • The neutral element of $(\mathcal{V}, +)$ is the zero vector $0 = [0, \cdots, 0]^T$; the inner operation $+$ is called vector addition.
  • The $\lambda \in \R$ are called scalars; the outer operation $\cdot$ is multiplication by scalars.
  • A "vector multiplication" $ab$, $a, b \in \R^n$, is not defined.
  • Defined multiplications: $ab^T \in \R^{n\times n}$ (outer product) and $a^Tb \in \R$ (inner/scalar/dot product)
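The two defined multiplications in NumPy, with arbitrary vectors:

```python
import numpy as np

a = np.array([1.0, 2.0, 3.0])
b = np.array([4.0, 5.0, 6.0])

inner = a @ b              # a^T b, a scalar in R
outer = np.outer(a, b)     # a b^T, an (n, n) matrix

print(inner)               # 32.0
print(outer.shape)         # (3, 3)
```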

2.4.3 Vector Subspaces

  • Let $V = (\mathcal{V}, +, \cdot)$ be a vector space and $\mathcal{U} \subseteq \mathcal{V}$, $\mathcal{U} \neq \emptyset$. Then $U = (\mathcal{U}, +, \cdot)$ is called a vector subspace of $V$ (or linear subspace) if $U$ is a vector space with the vector space operations $+$ and $\cdot$ restricted to $\mathcal{U} \times \mathcal{U}$ and $\R \times \mathcal{U}$. We write $U \subseteq V$ to denote a subspace $U$ of $V$
  • Every subspace $U \subseteq (\R^n, +, \cdot)$ is the solution space of a homogeneous system of linear equations $Ax = 0$ for $x \in \R^n$

2.5 Linear Independence

Linear Combination

  • Consider a vector space $V$ and a finite number of vectors $x_1, \cdots, x_k \in V$. Then every $v \in V$ of the form $$v = \lambda_1 x_1 + \cdots + \lambda_k x_k = \sum^{k}_{i=1} \lambda_i x_i \in V$$ with $\lambda_1, \cdots, \lambda_k \in \R$ is a linear combination of the vectors $x_1, \cdots, x_k$.

Linear (In)dependence

  • Let us consider a vector space $V$ with $k \in \N$ and $x_1, \cdots, x_k \in V$. If there is a non-trivial linear combination such that $0 = \sum_{i=1}^{k} \lambda_i x_i$ with at least one $\lambda_i \neq 0$, the vectors $x_1, \cdots, x_k$ are linearly dependent. If only the trivial solution exists, i.e., $\lambda_1 = \cdots = \lambda_k = 0$, the vectors $x_1, \cdots, x_k$ are linearly independent.

Vectors are either linearly independent or linearly dependent; there is no third option.

The vectors $\{x_1, \cdots, x_k : x_i \neq 0,\ i = 1, \cdots, k\}$, $k \geqslant 2$, are linearly dependent if and only if at least one of them is a linear combination of the others. In particular, if one vector is a multiple of another vector, then the set is linearly dependent.

In practice, linear (in)dependence is determined by writing the vectors as the columns of a matrix and bringing it into REF via Gaussian elimination:

  • All column vectors are linearly independent if and only if all columns are pivot columns. If there is at least one non-pivot column, the columns are linearly dependent.
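This test is easy to run numerically: stack the vectors as columns and compare the rank with the number of columns (a standard, if numerically approximate, criterion):

```python
import numpy as np

x1 = np.array([1.0, 2.0, 3.0])
x2 = np.array([0.0, 1.0, 1.0])
x3 = x1 + 2 * x2            # deliberately dependent on x1 and x2

A = np.column_stack([x1, x2, x3])
rank = np.linalg.matrix_rank(A)
print(rank == A.shape[1])   # False -> the columns are linearly dependent
```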

2.6 Basis and Rank

2.6.1 Generating Set and Basis

  • Generating Set and Span

    • Consider a vector space $V = (\mathcal{V}, +, \cdot)$ and a set of vectors $\mathcal{A} = \{x_1, \cdots, x_k\} \subseteq \mathcal{V}$. If every vector $v \in \mathcal{V}$ can be expressed as a linear combination of $x_1, \cdots, x_k$, then $\mathcal{A}$ is called a generating set of $V$. The set of all linear combinations of vectors in $\mathcal{A}$ is called the span of $\mathcal{A}$. If $\mathcal{A}$ spans the vector space $V$, we write $V = \mathrm{span}[\mathcal{A}]$ or $V = \mathrm{span}[x_1, \cdots, x_k]$.
    • Generating sets are sets of vectors that span vector (sub)spaces.
  • Basis

    • A basis is a minimal generating set: there exists no smaller set that spans $V$. Every linearly independent generating set of $V$ is minimal and is called a basis of $V$.
  • In $\R^3$, the canonical/standard basis is $$\mathcal{B} = \left\{ \left[\begin{matrix} 1 \\ 0 \\ 0 \end{matrix}\right], \left[\begin{matrix} 0 \\ 1 \\ 0 \end{matrix}\right], \left[\begin{matrix} 0 \\ 0 \\ 1 \end{matrix}\right] \right\}$$

  • Determining a Basis

    • REF: $\left[\begin{matrix} 1 & 2 & 3 & -1 \\ 0 & 1 & 2 & -2 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \end{matrix}\right]$
    • The pivot columns indicate which vectors are linearly independent: from the REF we see that $x_1, x_2, x_4$ are linearly independent, because $\lambda_1 x_1 + \lambda_2 x_2 + \lambda_4 x_4 = 0$ can only be solved with $\lambda_1 = \lambda_2 = \lambda_4 = 0$. Therefore, $\{x_1, x_2, x_4\}$ is a basis of $U$.
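SymPy's `rref` returns both the reduced REF and the indices of the pivot columns, which directly identifies a basis among the original columns; a sketch on an arbitrary matrix (not the one above):

```python
from sympy import Matrix

A = Matrix([[1, 2, 3, -1],
            [2, 5, 8, -4],
            [0, 0, 0,  1]])

R, pivot_cols = A.rref()
print(pivot_cols)                     # (0, 1, 3): columns 1, 2, 4 (0-indexed)
basis = [A.col(j) for j in pivot_cols]  # the corresponding original columns
```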

2.6.2 Rank

  • The number of linearly independent columns of a matrix $A \in \R^{m\times n}$ equals the number of linearly independent rows and is called the rank of $A$, denoted by $rk(A)$

  • Important properties

    • $rk(A) = rk(A^T)$
    • The columns of $A \in \R^{m\times n}$ span a subspace $U \subseteq \R^m$ with $\dim(U) = rk(A)$. We call this subspace the image or range. A basis of $U$ can be found by applying Gaussian elimination to $A$ to identify the pivot columns.
    • The rows of $A \in \R^{m\times n}$ span a subspace $W \subseteq \R^n$ with $\dim(W) = rk(A)$. A basis of $W$ can be found by applying Gaussian elimination to $A^T$
    • For $A \in \R^{n\times n}$, it holds that $A$ is regular (invertible) if and only if $rk(A) = n$
    • $Ax = b$ can be solved if and only if $rk(A) = rk(A|b)$, where $A|b$ denotes the augmented system
    • The solution space of $Ax = 0$ possesses dimension $n - rk(A)$. We call this subspace the kernel or the null space
    • A matrix has full rank if its rank equals the largest possible rank for a matrix of the same dimensions; that is, the rank of a full-rank matrix is the lesser of the number of rows and the number of columns
    • If a matrix does not have full rank, it is said to be rank deficient
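`np.linalg.matrix_rank` computes the rank numerically; a sketch checking a full-rank and a rank-deficient case (arbitrary matrices):

```python
import numpy as np

A = np.array([[1.0, 0.0, 1.0],
              [0.0, 1.0, 1.0]])    # rank 2 = min(2, 3): full rank
B = np.array([[1.0, 2.0],
              [2.0, 4.0]])         # second row = 2 * first row: rank 1

print(np.linalg.matrix_rank(A))    # 2
print(np.linalg.matrix_rank(A.T))  # 2, since rk(A) = rk(A^T)
print(np.linalg.matrix_rank(B))    # 1 -> rank deficient, not invertible
```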

2.7 Linear Mappings (Linear Transformations)

Definition: For vector spaces $V, W$, a mapping $\Phi : V \rightarrow W$ is called a linear mapping (or vector space homomorphism / linear transformation) if $$\forall x, y \in V\ \forall \lambda, \psi \in \R : \Phi(\lambda x + \psi y) = \lambda \Phi(x) + \psi \Phi(y)$$

Consider a mapping $\Phi : \mathcal{V} \rightarrow \mathcal{W}$, where $\mathcal{V}, \mathcal{W}$ can be arbitrary sets. Then $\Phi$ is called

  • Injective if $\forall x, y \in \mathcal{V} : \Phi(x) = \Phi(y) \Rightarrow x = y$
  • Surjective if $\Phi(\mathcal{V}) = \mathcal{W}$
  • Bijective if it is injective and surjective.

Finite-dimensional vector spaces $V$ and $W$ are isomorphic if and only if $\dim(V) = \dim(W)$.

2.7.1 Matrix Representation of Linear Mapping

  • $A_{\Phi} = [a_1, a_2, a_3] = \left[\begin{matrix} 1 & 2 & 0 \\ -1 & 1 & 3 \\ 3 & 7 & 1 \\ -1 & 2 & 4 \end{matrix}\right]$, where the $a_j$, $j = 1, 2, 3$, are the coordinate vectors of $\Phi(b_j)$ with respect to $C$
  • Basis vectors: the $j$-th column of $A_{\Phi}$ stores the coordinates, with respect to the codomain basis $C$, of the image of the $j$-th basis vector $b_j$ of the domain
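Given the transformation matrix, the image of a vector is computed from its coordinates; a sketch using the $A_{\Phi}$ above (the input coordinates are an arbitrary choice):

```python
import numpy as np

# Transformation matrix of Phi w.r.t. ordered bases B (domain) and C (codomain).
A_Phi = np.array([[ 1, 2, 0],
                  [-1, 1, 3],
                  [ 3, 7, 1],
                  [-1, 2, 4]])

x_B = np.array([1, 0, 2])   # coordinates of x w.r.t. B (arbitrary example)
y_C = A_Phi @ x_B           # coordinates of Phi(x) w.r.t. C
print(y_C)                  # [1 5 5 7]
```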

2.7.2 Basis Change

  • Equivalence

    • Two matrices $A, \tilde{A} \in \R^{m\times n}$ are equivalent if there exist regular matrices $S \in \R^{n\times n}$ and $T \in \R^{m\times m}$ such that $\tilde{A} = T^{-1}AS$
  • Similarity

    • Two matrices $A, \tilde{A} \in \R^{n\times n}$ are similar if there exists a regular matrix $S \in \R^{n\times n}$ with $\tilde{A} = S^{-1}AS$
  • Transformation formula: $$\tilde{A}_{\Phi} = T^{-1} A_{\Phi} S$$ where $A_{\Phi} : B \rightarrow C$, $\tilde{A}_{\Phi} : \tilde{B} \rightarrow \tilde{C}$, $S : \tilde{B} \rightarrow B$, $T : \tilde{C} \rightarrow C$, and $T^{-1} : C \rightarrow \tilde{C}$; the composition is $\tilde{B} \rightarrow \tilde{C} = \tilde{B} \rightarrow B \rightarrow C \rightarrow \tilde{C}$
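A numerical sketch of the transformation formula, with arbitrary matrices standing in for the basis changes ($S$ and $T$ are random and assumed regular, which holds almost surely here):

```python
import numpy as np

rng = np.random.default_rng(1)
A_Phi = rng.standard_normal((4, 3))   # mapping w.r.t. bases B, C
S = rng.standard_normal((3, 3))       # basis change B~ -> B (assumed regular)
T = rng.standard_normal((4, 4))       # basis change C~ -> C (assumed regular)

A_tilde = np.linalg.inv(T) @ A_Phi @ S   # mapping w.r.t. bases B~, C~

# Consistency check: mapping coordinates through either route agrees.
x_Btilde = rng.standard_normal(3)
lhs = A_tilde @ x_Btilde                           # directly in the new bases
rhs = np.linalg.inv(T) @ (A_Phi @ (S @ x_Btilde))  # via the old bases
np.testing.assert_allclose(lhs, rhs)
```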

2.7.3 Image and Kernel

  • For $\Phi : V \rightarrow W$, we define the kernel/null space $$\ker(\Phi) := \Phi^{-1}(0_W) = \{v \in V : \Phi(v) = 0_W\}$$ and the image/range $$\mathrm{Im}(\Phi) := \Phi(V) = \{w \in W \mid \exists\, v \in V : \Phi(v) = w\}$$ We call $V$ and $W$ the domain and codomain of $\Phi$, respectively.

  • $rk(A) = \dim(\mathrm{Im}(\Phi))$

  • $\left[\begin{matrix} 1 & 0 & 0 & 1 \\ 0 & 1 & -\frac{1}{2} & -\frac{1}{2} \end{matrix}\right]$ This matrix is in REF, and we can use the Minus-1 Trick to compute a basis of the kernel. Alternatively, we can express the non-pivot columns (3 and 4) as linear combinations of the pivot columns (1 and 2). The third column $a_3$ equals $-\frac{1}{2}$ times the second column $a_2$; therefore, $0 = a_3 + \frac{1}{2}a_2$. In the same way, we see that $a_4 = a_1 - \frac{1}{2}a_2$ and, therefore, $0 = a_1 - \frac{1}{2}a_2 - a_4$. Overall, this gives the kernel (null space) $$\ker(\Phi) = \mathrm{span}\left[ \left[\begin{matrix} 0 \\ \frac{1}{2} \\ 1 \\ 0 \end{matrix}\right], \left[\begin{matrix} -1 \\ \frac{1}{2} \\ 0 \\ 1 \end{matrix}\right] \right]$$

  • Rank-Nullity Theorem

    • For vector spaces $V, W$ and a linear mapping $\Phi : V \rightarrow W$, it holds that $$\dim(\ker(\Phi)) + \dim(\mathrm{Im}(\Phi)) = \dim(V)$$
    • Also called the fundamental theorem of linear mappings
    • If $\dim(\mathrm{Im}(\Phi)) < \dim(V)$, then $\ker(\Phi)$ is non-trivial, i.e., the kernel contains more than just $0_V$ and $\dim(\ker(\Phi)) \geqslant 1$
    • If $A_{\Phi}$ is the transformation matrix of $\Phi$ with respect to an ordered basis and $\dim(\mathrm{Im}(\Phi)) < \dim(V)$, then the system of linear equations $A_{\Phi}x = 0$ has infinitely many solutions
    • If $\dim(V) = \dim(W)$, then $\Phi$ is injective if and only if it is surjective if and only if it is bijective, since $\mathrm{Im}(\Phi) \subseteq W$
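The theorem can be checked numerically for the matrix from the kernel example above: `np.linalg.matrix_rank` gives $\dim(\mathrm{Im}(\Phi))$ and `scipy.linalg.null_space` gives $\dim(\ker(\Phi))$:

```python
import numpy as np
from scipy.linalg import null_space

A = np.array([[1.0, 0.0,  0.0,  1.0],
              [0.0, 1.0, -0.5, -0.5]])

dim_V   = A.shape[1]                 # dimension of the domain
dim_im  = np.linalg.matrix_rank(A)   # dim(Im(Phi)) = rk(A)
dim_ker = null_space(A).shape[1]     # dim(ker(Phi))

assert dim_ker + dim_im == dim_V     # rank-nullity: 2 + 2 == 4
```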

2.8 Affine Spaces

Affine spaces and mappings resemble vector spaces and linear mappings, but are offset from the origin.

Affine Subspace

  • Let $V$ be a vector space, $x_0 \in V$, and $U \subseteq V$ a subspace. Then the subset $$L = x_0 + U := \{x_0 + u : u \in U\} = \{v \in V \mid \exists\, u \in U : v = x_0 + u\} \subseteq V$$ is called an affine subspace or linear manifold of $V$. $U$ is called the direction or direction space, and $x_0$ is called the support point.
  • If $x_0 \notin U$, an affine subspace is not a linear subspace (vector subspace) of $V$.
  • Examples are points, lines, and planes in $\R^3$ that do not (necessarily) go through the origin.

Affine Mappings

  • For two vector spaces $V, W$, a linear mapping $\Phi : V \rightarrow W$, and $a \in W$, the mapping $$\phi : V \rightarrow W, \quad x \mapsto a + \Phi(x)$$ is an affine mapping from $V$ to $W$. The vector $a$ is called the translation vector of $\phi$.
  • Affine mappings keep the geometric structure invariant. They also preserve the dimension and parallelism.
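A minimal sketch of an affine mapping $x \mapsto a + \Phi(x)$ in the plane, with an arbitrary choice of linear part (a rotation) and translation vector:

```python
import numpy as np

theta = np.pi / 2
Phi = np.array([[np.cos(theta), -np.sin(theta)],
                [np.sin(theta),  np.cos(theta)]])   # linear part: 90-degree rotation
a = np.array([1.0, 2.0])                            # translation vector

def phi(x):
    """Affine mapping x -> a + Phi(x)."""
    return a + Phi @ x

print(phi(np.array([1.0, 0.0])))   # [1. 3.]: rotate (1,0) to (0,1), then translate
```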