New notation yielded new insights into identities

There are a few examples of important notational changes which come to mind. I'll put them all in separate sections, in various levels of fleshed out description. Keep in mind, notation typically comes after mathematics. Oftentimes we choose a certain notation because it fits a form we are used to, or in order to simplify a proof. With that caveat out of the way...


One fairly recent example that comes to mind is this paper ("Extending the Algebraic Manipulability of Differentials", Bartlett and Khurshudyan).

In it, the authors notice the typical (second) differential notion $\frac{d^2f}{dx^2}$ is not ideal, because it obscures what algebraic manipulations are permitted. One basic example they give is that $\frac{dy}{dx} \cdot \frac{dx}{dt} = \frac{dy}{dt}$, but $\frac{d^2 y}{dx^2} \cdot \frac{dx^2}{dt^2} = \frac{d^2 y}{dx^2} \left ( \frac{dx}{dt} \right )^2 \not = \frac{d^2 y}{dt^2}$.

The authors argue that the notation is confusing, and misleads students into trying to perform illegal algebraic manipulations. Their solution is to reform the notation for higher derivatives into something which can be algebraically manipulated. They have a simple methodology for figuring out what the correct notion is, but it is in the paper and this answer is already quite long.

Suffice to say, using $\frac{d^2 y}{dx^2} - \frac{dy}{dx} \frac{d^2 x}{dx^2}$ allows us to algebraically manipulate second derivatives as well. (Note - they do not suggest actually using this notation, instead they suggest we use $D^2_x y$ for this. It is just useful to be able to expand it when need arises, much like foiling a polynomial).

This notation, while more cumbersome to write out, bears fruits somewhat immediately. Just as we can show $\frac{dx}{dy} = \frac{1}{\frac{dy}{dx}}$ by algebraic manipulation, we can show a similar formula for the second derivative using the new notation:

$- D^2_x y = \left ( \frac{1}{D^1_x y} \right )^3 = D^2_y x$

In traditional notation, this says $- \frac{d^2 y}{dx^2} \left ( \frac{1}{\frac{dy}{dx}} \right )^3 = \frac{d^2 x}{dy^2}$, a formula which is widely unknown.

The proof is nothing but algebraic manipulation, and is in the linked paper. Indeed the authors of the paper thought that nobody had proven this before! (As with most things in basic calculus, somebody had already done it. That said the original proof was almost certainly less clear than the one in the paper.)


As a potentially controversial point, using $\tau = 2 \pi$ as the fundamental circle constant is, at its core, a notational change. The arguments are well documented, but the gist is that it makes many identities and patterns involving circles seem more natural. While this is not responsible for proving new results, it could be responsible for making the concepts seem simpler, which I think is worth mentioning.


A (negative) anecdote comes from mathematical logic, where we really dodged a bullet fired by Frege. The way we typically write formulas nowadays (using symbols like $\forall$, and $\exists$, $\land$, $\lor$, $\lnot$, $\to$, etc.) was not the first notation tried. Indeed Frege invented a notation for logical formulas called Begriffsschrift, which would have completely killed logic if it had been adopted widely. It is 2 dimensional, and here's an example of it in use:

$[\forall a. (P(x,a) \to Q(a)) \to P(x,y) \to Q(y)] \to [\forall b . Q(b) \to \forall a . P(b,a) \to Q(a)] \to Q(x) \to P(x,y) \to Q(y)$

(I grant you, this is a complicated formula, but I bet you can read it fairly quickly. Look what it becomes):

Frege

While you could make an argument saying we read the first one faster because it's what we're used to, it is objectively more compact. Imagine writing formulas at a blackboard, or deriving things with proof trees using Frege's notation. I shudder at the thought.


Back to calculus for a moment, one piece of notation that you probably don't even notice is the integral! The fact that we use the same notation to refer to antiderivatives and areas under curves from the start clues students into the fact that they are the same concept! This falls into the $\tau$ vs $\pi$ camp again, as it is primarily a didactic argument instead of a research-focused one, but students can see the connection much quicker than the inventors of calculus because our notation emphasizes it! A priori, it is extremely unclear that these two topics should have anything to do with each other, and yet students tend to grasp this quite quickly nowadays. I think that the notation is a big part of this, and while I have no direct results to cite, I'm sure this was responsible for countless results after its adoption.


Categorically, we use $X \times Y$ and $X + Y$ to denote product and coproduct, and we use $X^Y$ to denote an internal hom object. Why do we use this notation? Because (in many important categories) all of the standard rules work. For example, $(X \times Y)^Z \cong X^Z \times Y^Z$, $X^{Y + Z} \cong X^Y \times X^Z$, $X \times (Y + Z) \cong X \times Y + X \times Z$, etc.

Knowing only the notation, one might conjecture that some of the above results are true, and then try to prove them! Even without having seen the proof, and without me telling you, I bet you can already guess other identities that hold between these constructions. That is the power of good notation!


One last proposed change in notation is the Triangle of Power, popularized by this youtube video. The idea is to better relate three connected topics (exponentiation, radicals, and logrithms) by giving them a more related notation.

Instead of summarizing the notation here, I'll link to the original mse post, which makes a good argument for how it clarifies pre-existing theorems (as most new notation does).

To whet your appetite, it makes the following results seem obvious: $\log_{\sqrt[y]{z}}(z)=y$, and $\sqrt[\log_x(z)]{z} = x$.


Again, I'm not sure if this is exactly what you're looking for, but finding examples of genuinely "new" notation which prove genuinely new results is hard. Oftentimes we change the notation to make results we already know seem more obvious, as in the case of the integral and the triangle above. Sometimes, however, we get lucky, and our new (clearer) notation lets us prove new results related to the topic which we have clarified, as in the case of the algebraic notation for iterated derivatives. Of course, just because the result was already known doesn't mean that changing notation is worthless -- as mathematicians it is in part our job to make mathematics easier for future generations to do. Progress is not just made by geniuses seeing patterns with their amazing intuition, progress can also be made by taking the patterns we found hard to see, and making them easy to see. In this way, our students can spend less time rediscovering what we have found, and spend more time pushing into new areas of math.

I hope this was interesting ^_^

Tags:

Notation