Part 1
Definition: An element \(r\in R\) is irreducible if whenever \(r=st\), then either \(s\) or \(t\) is a unit.
Definition: Two elements \(r,s\in R\) are associates if \(r = \ell s\) for some unit \(\ell\).
A ring \(R\) is a unique factorization domain iff for every \(r\in R\), there exists a set \(\left\{{p_i \mathrel{\Big|}1\leq i \leq n}\right\}\) such that \(r = u \prod_{i=1}^n p_i\) where \(u\) is a unit and each \(p_i\) is irreducible.
Moreover, this factorization is unique in the sense that if \(r = w \prod_{i=1}^n q_i\) for some \(w\) a unit and \(q_i\) irreducible elements, then each \(q_i\) is an associate of some \(p_i\).
Part 2
A ring \(R\) is a principal ideal domain iff whenever \(I {~\trianglelefteq~}R\) is an ideal of \(R\), there is a single element \(r_i \in R\) such that \(I = (r_i)\).
Part 3
An example of a UFD that is not a PID is given by \(R = k[x, y]\) for \(k\) a field.
That \(R\) is a UFD follows from the fact that if \(k\) is a field, then \(k\) has no prime elements since every non-zero element is a unit. So the factorization condition holds vacuously for \(k\), and \(k\) is a UFD. But then we can use the following result:
Theorem: If \(R\) is a UFD, then \(R[x]\) is a UFD.
Since \(k\) is a UFD, the theorem implies that \(k[x]\) is a UFD, from which it follows that \(k[x][y] = k[x, y]\) is also a UFD.
To see that \(R\) is not a PID, consider the ideal \(I = (x, y)\), and suppose \(I = (g)\) for some single \(g\in k[x, y]\).
Note that \(I \neq R\), since \(I\) contains no degree zero polynomials. Moreover, since \((x) \subset I = (g)\) (and similarly for \(y\)), we have \(g\divides x\) and \(g\divides y\), which forces \(\deg g = 0\).
So in fact \(g\in k\) and thus \(g\) is invertible, but then \((g) = g^{-1}(g) = (1) = k\), so this forces \(I = k {~\trianglelefteq~}k[x, y]\). However, \(x\not\in k\) (nor \(y\)), which is a contradiction.
Problem 2
Lemma 1
\(A\) has \(n\) distinct eigenvalues \(\iff m_A(x) = \chi_A(x)\).
Proof:
The eigenvalues are always root of both \(m_A(x)\) and \(\chi_A(x)\) (potentially with differing multiplicities), so we can write
\begin{align*} m_A(x) = \prod_i (x-\lambda_i)^{p_i} \\ \chi_A(x) = \prod_i (x-\lambda_i)^{q_i} \\ \end{align*}
where \(1 \leq p_i \leq q_i\) for every \(i\).
\(\implies\): If \(A\) has \(n\) distinct eigenvalues, then \(\chi_A(x) = \prod_{i=1}^n (x-\lambda_i)\) in \(\overline{k} [x]\). Noting that every exponent is 1, we have \(q_i = 1\) for all \(i\), which forces \(p_i = 1\) and thus \(m_A(x) = \chi_A(x)\).
\(\impliedby\): If \(m_A(x) = \chi_A(x)\), then \(p_i = q_i\) for all \(i\). If we then consider \(JCF(A)\), we have
- The number of Jordan block \(J_{\lambda_i}\) is the dimension of the eigenspace \(E_{\lambda_i}\),
- \(q_i =\) the sum of the sizes of all Jordan blocks \(J_{\lambda_i}\), and
- \(p_i =\) the size of the largest Jordan block \(J_{\lambda_i}\).
Thus \(p_i = q_i\) for every \(i\) \(\iff\) there is one Jordan block for every \(\lambda_i\) \(\iff\) \(\dim E_{\lambda_i} = 1\) for every \(i\).
But \(\dim E_{\lambda_i}\) is precisely the multiplicity of \(\lambda_i\) in \(\chi_A(x)\), which means that \(\chi_A(x) = \prod_{i} (x-\lambda_i)\). Since \(\chi_A(x)\) is a degree \(n\) polynomial, this says that \(\chi_A\) has \(n\) distinct linear factors, corresponding to \(n\) distinct eigenvalues of \(A\).
\(\hfill\blacksquare\)
Lemma 2
Let \(k[x] \curvearrowright V\) in the usual way with \(A\) to obtain an invariant factor decomposition \begin{align*} V = \frac{k[x]}{(f_1)} \oplus \frac{k[x]}{(f_2)} \oplus \cdots \oplus \frac{k[x]}{(f_k)}, \quad f_1 \divides f_2 \divides \cdots \divides f_k. \end{align*}
Then it is always the case that
- \(m_A(x) = f_k(x)\), i.e. the minimal polynomial is the invariant factor of largest degree,
- \(\chi_A(x) = \prod_{i=1}^k f_i(x)\), i.e. the characteristic polynomial is the product of all of the invariant factors.
\(\hfill\blacksquare\)
Main Result
\((1) \implies (2)\):
Suppose \begin{align*} V = \mathop{\mathrm{span}}_k\left\{{\mathbf{v}, A\mathbf{v}, A^2 \mathbf{v}, \cdots A^{n-1}\mathbf{v}}\right\} \coloneqq\mathop{\mathrm{span}}_k \mathcal B \end{align*} where \(\dim_k V = n\).
Then \(A^n \mathbf{v}\) is necessarily a linear combination of these basis elements, and in particular, there are coefficients \(c_i\) (not all zero) such that \begin{align*} A^n \mathbf{v} = \sum_{i=0}^{n-1} c_i A^i \mathbf{v}. \end{align*}
The consider computing the matrix of \(A\) in \(\mathcal B\) by considering the images of all basis elements under \(A\).
Letting \(\mathcal B = \left\{{\mathbf{w}_i \coloneqq A^i \mathbf{v} \mathrel{\Big|}0\leq i \leq n-1}\right\}\), we have
\begin{align*} \mathbf{w}_0 \coloneqq\mathbf{v} &\mapsto A\mathbf{v} \coloneqq\mathbf{w}_1 \\ \mathbf{w}_1 \coloneqq A\mathbf{v} &\mapsto A^2\mathbf{v} \coloneqq\mathbf{w}_2 \\ \mathbf{w}_2 \coloneqq A^2\mathbf{v} &\mapsto A^3\mathbf{v} \coloneqq\mathbf{w}_3 \\ \vdots \quad & \quad \vdots \\ \mathbf{w}_{n-2} \coloneqq A^{n-2}\mathbf{v} &\mapsto A^{n-1}\mathbf{v} \coloneqq\mathbf{w}_{n-1} \\ \mathbf{w}_{n-1} \coloneqq A^{n-1}\mathbf{v} &\mapsto A^n\mathbf{v} = \sum_{i=0}^{n-1} c_i A^i \mathbf{v}_i \coloneqq\sum_{i=0}^{n-1} c_i \mathbf{w}_i .\end{align*}
This means that with respect to the basis \(\mathcal B\), \(A\) has the following matrix representation:
\begin{align*} [A]_{\mathcal B} = \left[\begin{array}{ccccc}{0} & {0} & {\dots} & {0} & {c_{0}} \\ {1} & {0} & {\dots} & {0} & {c_{1}} \\ {0} & {1} & {\dots} & {0} & {c_{2}} \\ {} & {} & {\ddots} & {} & {\vdots} \\ {0} & {0} & {\dots} & {1} & {c_{n-1}}\end{array}\right] \end{align*}
But this is the companion matrix for \(p(x) = \sum_{i=0}^{n-1} c_i x^i\), which always satisfy the property that \(p(x)\) equals both their characteristic and their minimal polynomial.
Thus by lemma 1, the matrix \([A]_{\mathcal{B}}\) has distinct eigenvalues, and thus so does \(A\).
\((2) \implies (1)\):
Suppose \(A\) has distinct eigenvalues. By Lemma 1, \(\chi_A(x) = m_A(x)\), and so we have \begin{align*} \chi_A(x) = f_k(x) = \prod_{i=1}^k f_i(x) = m_A(x), \end{align*}
which can only happen if \(f_1(x) = f_2(x) = \cdots = f_{n-1}(x) = 1\), in which case there is only one nontrivial invariant factor.
So we have \begin{align*} V \cong \frac{k[x]}{(f_k)}, \quad \operatorname{Ann}(V) = (f_k), \quad \deg f_k = n. \end{align*}
If we now take the Rational Canonical Form of \(A\), it follows that \(RCF(A)\) has only a single block in a suitable ordered basis \(\mathcal B = \left\{{\mathbf{w}_0, \cdots, \mathbf{w}_{n-1}}\right\}\).
So write \(f_k(x) = \sum_{i=0}^n c_i x^i\); then \([A]_{\mathcal B}\) is the companion matrix to \(f_k(x)\) in the basis \(\mathcal B\), which by construction satisfies \begin{align*} A = \left[\begin{array}{ccccc}{0} & {0} & {\dots} & {0} & {c_{0}} \\ {1} & {0} & {\dots} & {0} & {c_{1}} \\ {0} & {1} & {\dots} & {0} & {c_{2}} \\ {} & {} & {\ddots} & {} & {\vdots} \\ {0} & {0} & {\dots} & {1} & {c_{n-1}}\end{array}\right] \implies A \mathbf{w}_i = \begin{cases} \mathbf{w}_{i+1} & 0 \leq i < n-2 \\ \sum_{i=0}^{n-1} c_i \mathbf{w}_i & i = n-1, \end{cases} \end{align*}
and thus we have
\begin{align*} V \cong \mathop{\mathrm{span}}_k \mathcal{B} = \mathop{\mathrm{span}}_k\left\{{\mathbf{w}_0, \cdots \mathbf{w}_{n-1}}\right\} \cong \mathop{\mathrm{span}}_k\left\{{\mathbf{w}_0, A\mathbf{w}_0, A^2\mathbf{w}_0, \cdots, A^{n-1} \mathbf{w}_0}\right\}. \end{align*}
\(\hfill\blacksquare\)
Problem 3
Part 1
Let \(\mathbf{v} = [0,1,0]^t\), We compute
\begin{align*} M\mathbf{v} = \left[\begin{array}{ccc} 1 & 0 & x \\ 0 & 1 & 0 \\ y & 0 & 1 \end{array}\right] \left[\begin{array}{c} 0 \\ 1 \\ 0 \end{array}\right] = \left[\begin{array}{cc} 1(0) + 0(1) + x(0) \\ 0(0) + 1(1) + 0(0) \\ y(0) + 0(1) + 1(0) \end{array}\right] = \left[\begin{array}{cc} 0 \\ 1 \\ 0 \end{array}\right] = 1 \left[\begin{array}{cc} 0 \\ 1 \\ 0 \end{array}\right] ,\end{align*}
which shows that \(\mathbf{v}\) is an eigenvector of \(M\) with eigenvalue \(\lambda = 1\).
Part 2
Noting that the rank is the dimension of the column space, we find that
- \(\operatorname{rank}(M) \geq 1\), since it is not the zero matrix,
- \(\operatorname{rank}(M) \geq 2\), since neither \([1,0,y]^t\) or \([x,0,1]^t\) can be in the span of \([0,1,0]^t\), and
- \(\operatorname{rank}(M) = 3 \iff \operatorname{det}(M) \neq 0\).
So we compute
\begin{align*} \operatorname{det}_M(x, y) = \left|\begin{array}{ccc} 1 & 0 & x \\ 0 & 1 & 0 \\ y & 0 & 1 \end{array}\right| = 1(1-0) - 0(1-xy) + x(-y) = 1 - xy ,\end{align*}
and so \(\operatorname{det}_M(x, y) = 0 \iff xy = 1\). Thus
\begin{align*} \operatorname{rank}(M) = \begin{cases} 3 & xy = 1 \\ 2 & \text{else.} \end{cases} \end{align*}
Part 3
Since \(M\) is diagonalizable \(\iff M\) is full rank, which in this case means \(\operatorname{rank}(M) = 3\), we have \begin{align*} S = \left\{{(x, y) \in {\mathbf{R}}^2 \mathrel{\Big|}M \text{ is diagonalizable }}\right\} = \left\{{ \left( x, \frac 1 x \right) \mathrel{\Big|}x\in {\mathbf{R}}\setminus\left\{{0}\right\} }\right\} \subset {\mathbf{R}}^2. \end{align*}