What's the intution behind defining the cotangent sheaf as $\Delta^\ast(\mathscr{I}/\mathscr{I}^2)$?

Take $B$ a ring over another ring $A$. Look at $C = B\otimes_A B$. Let $I$ be the kernel of the ring morphism $C = B\otimes_A B \rightarrow B$ induced by the multiplication $B\times B\rightarrow B$. (This ideal is generated by all $1\otimes x - x\otimes 1$ for $x\in B$.) This ideal is an $B$-module, and has an $B$-submodule $I^2$. The quotient $B$-module $\Omega_{B / A}^1 = I / I^2$ is the differentials module of $A\rightarrow B$. The $\mathscr{O}_{\textrm{Spec}(B)}$-Module $\Delta^{*} ( \mathscr{I} / \mathscr{I}^2 )$ is but $\widetilde{\Omega_{B / A}^1}$. Note that the universal property of $\Omega_{B / A}^1$ (representing the derivations towards a $B$-module) justifies the denomination "cotangent sheaf" for its dual sheaf.

Let me give more details in the differentiable case. The three different ways one can define the tangent space $T_x M$ of a smooth differentiable variety $M$ at a point $x$ (and then the tangent bundle $TM$) are the following :

  • (1) Using equivalent classes of smooth parametrized curves passing through $x$
  • (2) Using derivations at $x$
  • (3) Using cotangent vectors at $x$

What (1), (2) and (3) have in common is that they describe the "first order" behavior of a smooth function on $M$ locally at the point $x$. More precisely :

(1) The derivative of a smooth function $f$ along a curve $c$ with $c(0) = x$ depends on $c$ only through $c'(0)$, and indeed it recovers the directional derivative of $f$ at $x$ in the direction $c'(0)$. The directional derivatives of $f$ determine the derivative of $f$ at $x$ which in turn determines the first order behavior of $f$ at $x$.

(2) Since a derivation $D$ at $x$ sees only the values of a function $f$ and its derivatives at $x$, you can legitimately replace $f$ by a polynomial $P$ by Taylor's theorem. By the Leibniz rule $D(P)$ depends only on the linear part of a $P$ and $D(f)$ depends only on the first order part of $f$.

(3) Recall that the cotangent bundle of $M$ at $x$ is the space $I / I^2$ where $I$ is the ideal consisting of (germs of) functions $f$ in ${\mathscr{C}}^\infty(M,\mathbf{R})$ (defined at $x$) such that $f(x) = 0$. If we imagine replacing the ring of germs by a ring of "polynomials" then $I$ represents the ideal of polynomials whose lowest order part has degree $1$ and $I^2$ is the ideal of polynomials whose lowest order part has degree $2$. In this case $I/I^2$ is naturally identified with the space of linear polynomials. Thus the cotangent bundle at $x$ is in a sense the space of "first order parts" of smooth functions on $M$.

The "diagonal" trick comes explicitely at play in (3), but let me make it clear in the schem context. Let me suppose for simplicity that $f : X \rightarrow S$. By abuse let $\Delta$ denote the image of the diagonal map in $X\times_S X$.

Following Sandor Kovacs here (and in fact EGA $\textrm{IV}_4$...) from now :

For a submanifold of a manifold you have the well-known short exact sequence connecting the tangent bundle of the ambient manifold restricted to the submanifold, the tangent bundle of the submanifold and the normal bundle of that submanifold in the ambient manifold. The geometric explanation to why the definition of the cotangent sheaf via the conormal bundle of the diagonal in $X\times_S X$ is the right one is that the normal bundle of the diagonal is isomorphic to its tangent bundle and the (co)normal bundle can be defined without the tangent bundle, so the tangent bundle may be defined as the normal bundle for this particular embedding.

In algebraic geometry one prefers the dual version involving the cotangent bundles (or cotangent sheaves as I initially wrote). The story goes like this : the short exact sequence for $\Delta\subset X\times_S X$ is :

$$ 0 \to \mathscr I/\mathscr I^2 \to \Omega_{X\times_S X/S}\otimes \mathscr O_{\Delta} \to \Omega_{\Delta/S} \to 0. $$

Now $\Omega_{X\times_S X/S}\simeq p_1^*\Omega_{X/S}\oplus p_2^*\Omega_{X/S}$ (where $p_i$ are the $S$-projection to $X$) so that $\Omega_{X\times_S X/S}\otimes \mathscr O_{\Delta} \simeq \Omega_{\Delta/S}\oplus \Omega_{\Delta/S}$. The natural morphism $\Omega_{X\times_S X/S}\otimes \mathscr O_{\Delta} \to \Omega_{\Delta/S}$ in the above short exact sequence may be identified with either projection to one of the direct summands as restricting either projections to the diagonal induces an isomorphism, which is another way to say the diagonal locally closed immersion is an $S$-section of the $p_i$'s. This implies that $\mathscr I/\mathscr I^2\simeq \Omega_{\Delta/S}$. As the diagonal morphism is an isomorphism between $X$ and $\Delta$, whatever way we may define $\Omega_{X/S}^1$, it has to be isomorphic to the pull-back of $\mathscr I/\mathscr I^2$.