What is approximate error of Pythagorean Theorem vs. Haversine Formula in measuring distances on sphere at various scales?
Using the Pythagorean formula on positions given in latitude and longitude makes as little sense as, say, computing the area of a circle using the formula for a square: although it produces a number, there is no reason to suppose it ought to work.
Although at small scales any smooth surface looks like a plane, the accuracy of the Pythagorean formula depends on the coordinates used. When those coordinates are latitude and longitude on a sphere (or ellipsoid), we can expect that
Distances along lines of longitude will be reasonably accurate.
Distances along the Equator will be reasonably accurate.
All other distances will be erroneous, in rough proportion to the differences in latitude and longitude.
The error depends on the start and end point of the distance calculations. However, because both the sphere and ellipsoid have a circular symmetry around the axis, the error depends only on the difference of the longitudes, so to study this error we might as well take the point of origin to be at the Prime Meridian. Because both the sphere and ellipsoid are symmetric under a north-south reflection, we only need to study points of origin in the southern hemisphere. For any such point we may draw a contour map of the relative error, equal to [Pythagorean calculation] / [True distance].
The Pythagorean formula, using the mean radius of the earth, is
Pythagorean distance = 6371000. * Sqrt[dx^2 + dy^2]] * pi / 180 meters
where dx is the difference in longitudes and dy is the difference in latitudes, both in degrees. (The difference in longitude values is reduced modulo 360 to give the correct value of dx when crossing the antimeridian; not doing so would introduce artificially large errors that tell us nothing about the Pythagorean formula itself.)
The following plots show the relative error compared to the correct distance on the WGS 84 ellipsoid for latitudes from -70 to 0 in increments of 10 degrees. The horizontal coordinate is the difference in longitudes and the vertical coordinate is the latitude of the destination. Light regions have relatively small error: the contour lines are at 1, 1.01, 1.02, 1.05, 1.1, 1.2, 1.5, 2, etc. (The pure white areas in the corners are places where the error goes beyond the range of these contours.) The red dots show the point of origin.
The vertical white bands testify to the correctness of expectation (1): Pythagorean distances are accurate when there is a small difference in longitudes. The horizontal white bands at low latitudes confirm expectation (2): near the Equator, horizontal distances are reasonably accurate. Otherwise, as witnessed by the extensive darker regions, at all other distances the Pythagorean formula is bad.
We can make quantitative estimates of the maximum error attained for pairs of nearby points (within, say, a few hundred kilometers of each other). Scale--using an appropriate value for the radius--is true along the meridian but along a circle of latitude it errs approximately by the secant of the latitude. For example, at a latitude of 40 degrees the secant is 1.31, implying the Pythagorean formula will give distances about 31% too large in the east-west direction. (This is evident in the upper right contour plot, for a point of origin at -40 degrees latitude, where the region immediately east-west of the red dot lies between the 1.2 and 1.5 contours.) Short distances in all other directions will be too large by some amount between 0% and 31%; longer distances may err by even more (as the contour plots show).
I interpreted "Pythagonean distance" as "Euclidean distance". Then the answer is the same as "what is the difference between the length of a chord of a circle and the subtended perimeter?" Let radius be R, subtended angle is A (radians).
perimeter = L = A*R
chord = C = 2*sin(A/2)*R
diff = D = L - C
= (A-2*sin(A/2))*R
= A^3/24 * R (for A small)
= L^3/(24*R^2) (eliminating A)
relative error = D/L
= (L/R)^2/24
For the earth, substitute R = 6400 km. By the way, call it "great circle distance" (what it is) not "haversine distance" (how it is calculated). (This is similar to the distinction between Pythagorean distance and Euclidean distance.)
For a full and rigorous answer look to whuber's answer above. I'm going to answer in a more visual and basic way.
The reason why planar/Pythagorean calculations are inappropriate is because the calculations rely on the fact that moving one step in any direction is a constant change in magnitude regardless of where you are on the graph.
Longitude doesn't conform to this requirement. Lines of longitude converge at the poles.
That is why when we flatten the Earth to reflect the rules of a planar graph we get distortion.
If you look at that map it appears as if Greenland is roughly the size of Africa and Antarctica is about the size of Eurasia. Of course that isn't true. Greenland and Antarctica are both extremely distorted because they're close to the poles where longitude converges.
As you can see Greenland is approximately the size of Mexico.
And Antarctica is about the size of southern Africa (not South Africa).
As you can see the errors you'll get applying Pythagorean formulas depend more on where the points are than the distance between the points. With the important caveat that longer distances will magnify any errors. This is why planar solutions, while tempting, are a poor choice. Distortions will bite you and it isn't as simple as an offset. The errors are a result of warping the earth to fit inappropriate rules.