Limits (essay): Difference between revisions

From Objective Mathematics
Jump to navigation Jump to search
No edit summary
 
(6 intermediate revisions by the same user not shown)
Line 1: Line 1:
'''essay prompt''': Is the idea of 'limit' indispensable? Neutral? A hindrance?
'''Essay prompt''': Is the idea of 'limit' indispensable? Neutral? A hindrance?


I contend that some concept of 'limit' is indispensable in mathematics, physics, and beyond. I will make this case inductively, beginning with an exposition of the facts of reality that gave rise to the need for such a concept, and concluding with a proper definition of 'limit'.
I contend that ''some'' concept of limit is indispensable in mathematics, but that one must be very cautious in using standard mathematics' concept of limit.  


[TODO maybe also talk about invalid versions of the concept?]
The focus of this essay is mainly negative. I criticize two common attitudes towards limits: the "empiricist" attitude, which believes in effect that we don't need limits at all, and the "rationalist" attitude (the attitude of modern mathematics), which believes that we can use limits without any thought as to how they connect to reality. Like epistemological empiricism and rationalism, the problem with these approaches is that the former overloads the crow through its use of percepts without concepts, and the latter floats up into the clouds through its use of concepts without percepts. What is needed is an ''objective'' attitude towards limits. However, it is a very difficult task to lay out positive universal principles defining exactly what that objective attitude is, and so in this essay I will only attempt to demonstrate it through examples. 


== Fact of reality 1: ==
== What is a limit? ==
The Earth is a sphere, but its radius is so large, and the circumstances of everyday life take place on a scale so small, that in most contexts, it is valid to think of the Earth as a plane.
 
We may say, informally, that a plane is the limit of a sphere as its radius "tends towards infinity". [TODO one has to be careful about which limit one is taking, TODO harry doesn't like the word infinity, but I don't know how else to say it]
 
We may also say that in the context set by many everyday situations, the difference between a plane, and a sphere with 4000mi radius, is nill.
 
One might ask: why would I ever treat the Earth as a plane when it is in fact a sphere? What's the point of passing to the limit? The point is that it helps the crow. Here are some hypothetical examples:
 
* You might find it useful to know that the angles of a triangle add up to 180˚. However, this fact is only valid for triangles on a ''plane''; "triangles" on a sphere can have angles that add up to more than 180˚. If you are thinking of your back yard as a sphere, then you have to go through an extra step, where you first think: the size of this triangle is very small relative to the sphere, and therefore its angles add up to something very close to 180˚. This is true, but it is extra cognitive work, which can be avoided by thinking of your back yard as a plane.
* You might find it useful to know that the two parallel lines you drew in the ground will not intersect. However, this fact is only valid for parallel lines on a ''plane''; "parallel lines" on a sphere will eventually intersect. If you are thinking of the ground as a sphere, then you will have to go through an extra step, where you first think: the point at which the curvature of the earth will cause these line to intersect lies many miles out into the Pacific Ocean, and so it's completely irrelevant for my goal of making a pretty design (or whatever the goal is). Again, that is true, but it is extra cognitive work, which can be avoided by thinking of the ground as a plane. 
 
== Fact of reality 2: ==
[TODO write neatly] The sum from j = 0 to N of 2^{-j} is tabulated below for various values of N
 
  N   sum
 
<nowiki>----</nowiki>|-----
 
  0   1
 
  1   1.5
 
  2   1.75
 
  3   1.875
 
  4   1.9375
 
  5   1.96875
 
  6   1.984375
 
    ...
 
  30  1.999999999068677425384521484375
 
It appears that the larger N gets, the closer this sum gets to 2.
 
One might ask: How do we really know that if we continue for long enough, we'll get increasingly closer to 2? All we've seen is that we get very close to 2 after we sum the first 30 numbers. But maybe something unexpected will happen if we sum up to 3000, 300, or even 31.
 
Such a question can be answered as follows. First, note that
 
  (1 + x + x^2 + ... + x^N)(1 - x) = 1 + x + x^2 + ... + x^N
 
                                       -(x + x^2 + ... + x^N) - x^{N+1}
 
                                   = 1 - x^{N+1}.
 
Therefore, rearranging, we find
 
  1 + x + x^2 + ... + x^N = (1 - x^{N+1})/(1 - x)
 
or for x = 1/2,
 
  1 + 1/2 + 1/2^2 + ... + 1/2^N = 2 - 2^{-N}.
 
Thus we have precisely quantified the difference between (the sum up to N) and 2; it is 2^{-N}. The larger our value of N gets, the smaller this difference will become.
 
In fact, for any nill value epsilon, there exists some N such that the difference between (the sum up to N) and 2 is nill.
 
--
 
One might ask: Who CARES about passing to the limit? What do we gain from talking about 2 instead of 1.984375 ? The answer is that the former is simpler than the latter.
 
I will introduce the principle of [[Occam's razor]],[TODO]
 
=== Real example? ===
Why might you actually have to do this sum? Well let's say you're a contractor for a new airport, and they picked a design for the marble tiles that looks like a spiral: [TODO show picture]. Each tile has an area of 1 square meter. How much marble do you need per tile?
 
One way of thinking about it is that each tile is made up of a sequence of subtitles. The first subtile will be made of 1/2 of a square meter of marble, the second 1/4th of a square meter marble, the third 1/8th of a square meter of marble, and so on. You could add those quantities up for each subtile, and find that you need <math display="block">1/2 + 1/4 + 1/8 + 1/16 + 1/32 + 1/64 + 1/128 + 1/256 = 0.99609375</math>square meters of marble, then round that up to 1 square meter of marble. Another, easier way of thinking about it, is to note that the pattern basically fills up the entire square meter (which is perceptually obvious in this case), so therefore you need 1 square meter of marble per tile.
 
== Fact of reality 3: [TODO] ==
This one should be something having to do with a practical application of a derivative or an integral.
 
one idea is find the average height of a sine wave. This is contrived (when would I ever do it IRL?), and it's something you could figure out without calculus or limits.
 
== starting over ==
I contend that ''some'' concept of limit is indispensable in mathematics, but that one must be very cautious in using standard mathematics' concept of limit.
 
Outline:
 
# Explain what a limit is (on my view)
# Explain why limits are useful, e.g. by giving examples of limits
# Explain standard mathematics' definition of limit, and the problems that I think it has
 
=== What is a limit? ===
In this section, I will explain what a limit is. A limit can only be taken with respect to a ''sequence'', so first, I must explain what a sequence is.
In this section, I will explain what a limit is. A limit can only be taken with respect to a ''sequence'', so first, I must explain what a sequence is.


A sequence is a function<ref>A function is a quantitative relation between two groups of existents (called the "domain" and "codomain" of the function), which relates any existent in the domain to a ''unique'' existent in the codomain.</ref> from the natural numbers to a group <math>C</math> of existents, where there is some notion of ''closeness'' or ''distance'' between the units of <math>C</math>.<ref>Modern mathematics would say that <math>C</math> is a ''topological space'', but I do not wish to commit myself to that concept, because it raises potential philosophical problems. </ref> A sequence is like a set of instructions: you give me any natural number <math>n</math>, and I'll show you how to produce or identify some thing <math>a_n</math>, where <math>a_n</math> is a <math>C</math>. If the sequence consists of decimal numbers, then another way of thinking of the sequence is instructions for continuing the "...". Some examples of sequences:  
A sequence is a function<ref>A function is a quantitative relation between two groups of existents (called the "domain" and "codomain" of the function), which relates any existent in the domain to a ''unique'' existent in the codomain.</ref> from the natural numbers to a group <math>C</math> of existents, where there is some notion of ''closeness'' or ''distance'' between the units of <math>C</math>.<ref>Modern mathematics would say that <math>C</math> is a ''topological space'', but I do not wish to commit myself to that concept, because it raises potential philosophical problems. </ref> A sequence is like a set of instructions: you give me any natural number <math>n</math>, and I'll produce or identify some thing <math>a_n</math>, where <math>a_n</math> is a <math>C</math>. If the sequence consists of fractions with an inverse power of 10 in the denominator, then another way of thinking of the sequence is as instructions for continuing the "..." in a decimal expansion; the expression "3.14159..." can be thought of as shorthand for a sequence. Some examples of sequences:  


# "<math>a_1, a_2, a_3, \cdots </math>", where <math>a_n := \sum_{i = 1}^n 2^{-i}</math>.  
# "<math>a_1, a_2, a_3, \cdots </math>", where <math>a_n := \sum_{i = 1}^n 2^{-i}</math>.  
# "<math>b_1, b_2, b_3, \cdots </math>", where <math>b_n</math> is a circle of radius <math>n</math>, centered at <math>(x,y) = (n, 0)</math>.  
# "<math>b_1, b_2, b_3, \cdots </math>", where <math>b_n</math> is a circle of radius <math>n</math>, centered at <math>(x,y) = (n, 0)</math>. See the figure.  
# "<math>c_1, c_2, c_3, \cdots </math>", where <math>c_n</math> is a secant line to the graph of <math>f(t) = 10-10 t^2 </math> near <math>t = 1</math>. More formally, <math>c_n</math> is the unique line passing through the points <math>\left(1 + 1/n, f(1 + 1/n) \right)</math> and <math> \left(1 - 1/n, f(1 - 1/n )\right)</math>.  
# "<math>c_1, c_2, c_3, \cdots </math>", where <math>c_n</math> is a secant line to the graph of <math>f(t) = 10-10 t^2 </math> near <math>t = 1</math>. More formally, <math>c_n</math> is the unique line passing through the points <math>\left(1 + 1/n, f(1 + 1/n) \right)</math> and <math> \left(1 - 1/n, f(1 - 1/n )\right)</math>.  
# "<math>d_1, d_2, d_3, \cdots </math>", where <math>d_n := n</math>.  
# "<math>d_1, d_2, d_3, \cdots </math>", where <math>d_n := \log n</math>.  
# "<math>e_1,e_2,e_3, \cdots</math>", where <math>e_n := \sum_{k = 0}^n \frac{(2k+1)!}{2^{3k+1} (k!)^2}</math>.  
# "<math>e_1,e_2,e_3, \cdots</math>", where <math>e_n := \sum_{k = 0}^n \frac{(2k+1)!}{2^{3k+1} (k!)^2}</math>.  
 
[[File:Limit of circles.png|thumb|Circles of increasingly large radius becoming closer and closer to the vertical line on the left. ]]
Sometimes, but not always, sequences have the property that the elements of the sequence get closer and closer to a fixed point <math>L </math> as the sequence progresses. In such cases, the sequence is said to converge to <math>L </math>, and <math>L </math> is said to be the limit of the sequence. Examples:   
Sometimes, but not always, sequences have the property that the elements of the sequence get closer and closer to a fixed point <math>L </math> as the sequence progresses. In such cases, the sequence is said to converge to <math>L </math>, and <math>L </math> is said to be the limit of the sequence. Examples:   


Line 110: Line 24:
# The sequence "<math>e_1,e_2,e_3, \cdots</math>" converges to <math>\sqrt{2}</math>.<ref>https://en.wikipedia.org/wiki/Square_root_of_2#Representations[TODO]</ref> That is, as <math>n</math> gets larger and larger, <math>e_n^2</math> gets closer and closer to 2.  
# The sequence "<math>e_1,e_2,e_3, \cdots</math>" converges to <math>\sqrt{2}</math>.<ref>https://en.wikipedia.org/wiki/Square_root_of_2#Representations[TODO]</ref> That is, as <math>n</math> gets larger and larger, <math>e_n^2</math> gets closer and closer to 2.  


I will conclude with a concept-definition<ref>By "concept-definition," I mean a definition of a concept, to which to the principles of definition laid out in ITOE apply. I mean this in contrast to e.g. a legal definition, a definition in a computer program, or---most importantly for our purposes---a formal mathematical definition that is reducible back to the basic notions of set theory. </ref> of a limit: The limit of a sequence is the place it goes to once we are well beyond the level of precision'' ''that is ''relevant in our given context''.
=== Why are limits useful? ===
In this section, I will give some fictional but realistic scenarios in which limits could be used to great effect. [TODO]
 
== Converging to 1 ==
[[File:Pictorial proof of geometric sum.png|thumb|The design for the tile at the airport. White area represents places where there is granite, black area represents places where there is putty. ]]
Suppose that the following design is proposed for some of the granite floor tiles at a new airport terminal: 
 
That is, each tile consists of many sub-tiles, forming the pattern above. Each big tile is 1 square meter, so the sub-tiles have sizes 0.5 square meters, 0.25 square meters, etc. How much granite is needed to make each big tile? 
 
The answer to that question is perceptually obvious, given the picture of the design: about 1 square meter of granite is needed for each tile. But if what if we had to justify this to a blind person, or a person who hasn't seen the picture? 
 
The empiricist way of proceeding would be to say: The big tile contains sub-tiles of sizes 1/2, 1/4, 1/8, 1/16, 1/32, 1/64, 1/128, 1/256, 1/512, and 1/1024 square meters, and if you perform a long arithmetic computation, you will find that <math display="block">\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \frac{1}{64} + \frac{1}{128} + \frac{1}{256} + \frac{1}{512} + \frac{1}{1024} = 0.9990234375,</math>which is close enough to 1 in the context at hand. Who cares about limits? 
 
The rationalist way of proceeding would be to say: The big tile contains sub-tiles of sizes 1/2, 1/4, 1/8, 1/16, 1/32, 1/64, 1/128, 1/256, 1/512, and 1/1024 square meters. The sequence of partial sums of these numbers is approximately the sequence <math>\{ a_n\} </math> (example 1 above). The limit of that sequence is 1. Therefore, <math display="block">\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \frac{1}{64} + \frac{1}{128} + \frac{1}{256} + \frac{1}{512} + \frac{1}{1024} \approx 1 . </math>The advantage of the empiricist approach in this case is clear: it produces an answer which is correct, and we know why. [TODO]
 
There are, however, some advantages of the rationalist approach. One advantage is that it doesn't involve any explicit computation, so it would save time and effort (especially in the days where people didn't have calculators). Another advantage of the rationalist approach is that it is more conceptual, and seems to give us a better idea about what's really going on: the sub-tiles form a pattern which, if continued in the natural way, would make their total area arbitrarily close to 1. In other words, the rationalist approach gives us an ''explanation'' for why the area of the sub-tiles is so close to 1.  


Although the reader should keep in mind that the concept of limit extends far beyond the realm of numbers, for simplicity I will henceforth only consider sequences of numbers.  
The problem with the rationalist approach is that, unlike the empiricist approach, its connection to reality is not obvious. What do we actually mean by saying that "[t]he sequence of partial sums of these numbers is '''approximately''' the sequence <math>a_n</math> "? And in our conclusion that 1/2 + ... + 1/1024 ≈ 1, what do we mean by "≈", i.e. exactly ''how'' close can we conclude that 1/2 + ... + 1/1024 is to 1?


=== Why are limits useful? ===
To see a concrete way in which the rationalist approach can go awry, let's use the same logic and consider the sequence <math>a_1', a_2', a_3', \cdots </math>, where <math display="block">a_n' := \begin{cases}
In this section, I will give some fictional but realistic scenarios in which limits could be used to great effect. [TODO]
\sum_{i=0}^n 2^{-i}, &\text{if }n \leq 1000 \\
42, &\text{otherwise} 
\end{cases}.</math>One might accidentally subsume 1/2 + ... + 1/256 under the sequence  <math>\{a_n'\}</math> instead of the sequence <math>\{a_n\}</math>, because in real life we're only going up to <math>n = 8</math>, and these two sequences agree as long as <math>n \leq 1000</math>. The limit of the sequence <math>\{a_n'\}</math> is 42, so if one proceeded as before by taking the limit, he would erroneously conclude that  <math display="block">\frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \frac{1}{16} + \frac{1}{32} + \frac{1}{64} + \frac{1}{128} + \frac{1}{256} + \frac{1}{512} + \frac{1}{1024}  \approx 42 . </math>A more objective approach can be found by digging into the proof that the limit of <math>\{a_n\}</math> is 1. By basic algebra, one may verify that the following identity holds


==== 1. Converging to 1 ====
<math display="block">x + x^2 + \cdots + x^n = \left( \frac{1}{1 - x} - 1\right) - \frac{x^{n+1}}{1 - x}</math>
example with the design in a square


==== 2. circles ====
for any number <math>x</math> such that <math>x\neq 1</math>. The term in parentheses should be thought of as a ''limit'', and the other term should be thought of as an ''error term'', or a deviation from the limit. Plugging in <math>x = 1/2</math>, the formula yields<math display="block">\frac{1}{2} + \frac{1}{2^2} + \cdots + \frac{1}{2^n} = 1 - \frac{1}{2^n},</math>so as <math>n</math> gets larger and larger, this quantity gets closer and closer to 1 (incidentally, that's the proof that the limit of <math>\{ a_n \}</math> is 1). In our case, <math>n = 8</math>, so the sum will be almost equal to the limit, but with an error of <math>1/2^8</math>, which is negligible for our purposes. Therefore 1 square meter of granite is needed. 
earth is like a plane locally


==== 3. tangent lines ====
A typewriter[?TODO] is dropped from a height of 10 meters, and we wish to know if it will break when it hits the ground. As an intermediate step, we must determine how fast it will be moving when it hits the ground.


The height of the typewriter[TODO], as a function of time, is <math>h(t) = 10-10 t^2 </math>, and it hits the ground after <math>t = 1</math> second has passed.  
Observe that in contrast to the empiricist approach, the objective approach didn't require a calculator, and it used concepts which clearly generalize far beyond the problem at hand; and in contrast to the rationalist approach, the objective approach produced a complete and correct answer, and didn't rely on any floating abstractions which could lead us down the wrong path.  


To find the speed of the typewriter as it hits the ground, we could find the slope of the line <math>c_n</math> for some large <math>n</math>. For example, the slope of <math>c_{12}</math> is
== Tangent lines ==
In Boulder, Colorado, there is a street called Baseline that runs precisely along the 40th parallel—that is, the coordinate line at 40˚ north latitude. From my parents' house, Baseline stretches out to the east for many miles, and throughout entire length it runs along the 40th parallel.


==== 5. square root of 2 ====
There is a fact familiar to every educated adult, that the Earth is a sphere, but that the radius of the sphere is so large that perceptually, to a person standing on the surface, it looks flat. This is an example of thinking of something as a limit; the plane is a limit of spheres of increasingly large radius. Similarly, any great circle on a sphere (such as the 40th parallel), in the limit that the radius of the sphere approaches "infinity," is a line (see the sequence <math>\{ b_n\} </math> above).
blahblahblah


==== criticism ====
Now, consider the following very practical question, with a very obvious answer: If you start at my parents' house and walk east down Baseline for a mile, how far will you need to walk to return to my parents' house?  
Well, why do we have to pass all the way to the limit? Can't we just pick a large n and say that that's good enough?  


The reason why is that limits ''simplify'' things. If you're designing a fence for your back yard, it is inc 
Empiricist approach: Well, okay, the 40th parallel is a circle on a sphere, and hmm let's see... the sphere has a very large radius... and so if you walk down this line... hmmm could there be a faster way to get back? I guess not because the radius is so large.


=== What are the problems with limits? ===
Rationalist approach: I passed to the limit, so really we are just considering a line in a plane. You're proposing to walk down a line in a plane, so obviously the shortest way back is to walk the same distance, in the opposite direction, down the same line. ''By the exact same logic, if you walked east down the 40th parallel for 20,000 miles, you'd have to walk 20,000 miles to get back.''
[TODO] in a sense, the problem comes in when you have to make a formal definition. I don't know if it's actually a problem with the formal definition, or if it's just that formal definitions encourage you to drop context.  


=== Rationalist approach ===
The empiricist approach does not produce an incorrect answer, but it's a lot of unnecessary cognitive work, all done in some futile attempt to think of things "as they really are," and not make any simplifications using concepts. The rationalist approach gets the correct answer to the question asked, and it does so in a cognitively easy way, but it can go wrong because it's dropping context and disconnected from reality. (The italicized part of the rationalist's answer is wrong because if you travelled down the 40th parallel for long enough, you would end up back where you started, because the 40th parallel is a circle.)


=== Empiricist approach ===
The objective approach is to note that Boulder is very small compared to the area of the entire Earth, so it's okay to think of Boulder as a plane. You won't leave Boulder if you only walk for a mile, so this approximation remains valid for the scenario in question. Therefore, the (un-italicized part of the) rationalist reasoning applies: you'll have to walk 1 mile to return to my parents' house.  
What I will dub the "empiricist approach" to limits is basically to say, we don't ever need to pass to the limit of a sequence, we can just work with some member of the sequence <math>a_n</math>, where <math>n</math> is large enough.  


For example, in the sequence <math>e_1, e_2, e_3, \cdots</math> above which converges to <math>\sqrt{2}</math>, an empiricist might say: for my purposes, <math>e_4 = 1.414</math> is good enough. That's what I mean by the square root of 2, and I don't need to think about all this extra nonsense (<math>e_5, e_6, e_7, </math> and so on) that tells me how to compute <math>\sqrt{2}</math> up to arbitrary accuracy.  
== square root of 2 ==
I don't have enough room to describe in detail a scenario in which the following might happen, but I claim that one runs into things like this all the time in mathematics and physics.  


For example, in the sequence <math>a_1, a_2, a_3, \cdots</math> above which converges to 1, an empiricist might say: for my purposes, <math>e_4 = 1.414</math> is good enough. That's what I mean by [TODO], and I don't need all this extra nonsense (<math>e_5, e_6, e_7, </math> and so on) that tells me how to compute 1 up to arbitrary accuracy.  
Suppose that I am doing some long computation, and involved somewhere in the computation is a square of area 2. At some point in the computation, I have to think about the side length of this square, which is √2. Then, even later in the computation, I have to square that side length, getting back to the square.  


In many cases, the empiricist is right. However, we will see that his approach is missing something important.
The math you should be writing down should look like


== Notes ==
== Notes ==


<references group="note" />
<references group="note" />
== References ==

Latest revision as of 05:20, 22 August 2024

Essay prompt: Is the idea of 'limit' indispensable? Neutral? A hindrance?

I contend that some concept of limit is indispensable in mathematics, but that one must be very cautious in using standard mathematics' concept of limit.

The focus of this essay is mainly negative. I criticize two common attitudes towards limits: the "empiricist" attitude, which believes in effect that we don't need limits at all, and the "rationalist" attitude (the attitude of modern mathematics), which believes that we can use limits without any thought as to how they connect to reality. Like epistemological empiricism and rationalism, the problem with these approaches is that the former overloads the crow through its use of percepts without concepts, and the latter floats up into the clouds through its use of concepts without percepts. What is needed is an objective attitude towards limits. However, it is a very difficult task to lay out positive universal principles defining exactly what that objective attitude is, and so in this essay I will only attempt to demonstrate it through examples.

What is a limit?

In this section, I will explain what a limit is. A limit can only be taken with respect to a sequence, so first, I must explain what a sequence is.

A sequence is a function[1] from the natural numbers to a group of existents, where there is some notion of closeness or distance between the units of .[2] A sequence is like a set of instructions: you give me any natural number , and I'll produce or identify some thing , where is a . If the sequence consists of fractions with an inverse power of 10 in the denominator, then another way of thinking of the sequence is as instructions for continuing the "..." in a decimal expansion; the expression "3.14159..." can be thought of as shorthand for a sequence. Some examples of sequences:

  1. "", where .
  2. "", where is a circle of radius , centered at . See the figure.
  3. "", where is a secant line to the graph of near . More formally, is the unique line passing through the points and .
  4. "", where .
  5. "", where .
Circles of increasingly large radius becoming closer and closer to the vertical line on the left.

Sometimes, but not always, sequences have the property that the elements of the sequence get closer and closer to a fixed point as the sequence progresses. In such cases, the sequence is said to converge to , and is said to be the limit of the sequence. Examples:

  1. The numbers "" get closer and closer to 1, as gets larger and larger.
  2. The circles "" gets closer and closer to being the y-axis (from the perspective of someone sitting at the origin), as gets larger and larger.
  3. The lines "" get closer and closer to being the tangent line to the graph of at , as gets larger and larger.
  4. The numbers "" get larger and larger as gets larger and larger. The sequence does not converge; it does not have a limit.
  5. The sequence "" converges to .[3] That is, as gets larger and larger, gets closer and closer to 2.

Why are limits useful?

In this section, I will give some fictional but realistic scenarios in which limits could be used to great effect. [TODO]

Converging to 1

The design for the tile at the airport. White area represents places where there is granite, black area represents places where there is putty.

Suppose that the following design is proposed for some of the granite floor tiles at a new airport terminal:

That is, each tile consists of many sub-tiles, forming the pattern above. Each big tile is 1 square meter, so the sub-tiles have sizes 0.5 square meters, 0.25 square meters, etc. How much granite is needed to make each big tile?

The answer to that question is perceptually obvious, given the picture of the design: about 1 square meter of granite is needed for each tile. But if what if we had to justify this to a blind person, or a person who hasn't seen the picture?

The empiricist way of proceeding would be to say: The big tile contains sub-tiles of sizes 1/2, 1/4, 1/8, 1/16, 1/32, 1/64, 1/128, 1/256, 1/512, and 1/1024 square meters, and if you perform a long arithmetic computation, you will find that

which is close enough to 1 in the context at hand. Who cares about limits?

The rationalist way of proceeding would be to say: The big tile contains sub-tiles of sizes 1/2, 1/4, 1/8, 1/16, 1/32, 1/64, 1/128, 1/256, 1/512, and 1/1024 square meters. The sequence of partial sums of these numbers is approximately the sequence (example 1 above). The limit of that sequence is 1. Therefore,

The advantage of the empiricist approach in this case is clear: it produces an answer which is correct, and we know why. [TODO]

There are, however, some advantages of the rationalist approach. One advantage is that it doesn't involve any explicit computation, so it would save time and effort (especially in the days where people didn't have calculators). Another advantage of the rationalist approach is that it is more conceptual, and seems to give us a better idea about what's really going on: the sub-tiles form a pattern which, if continued in the natural way, would make their total area arbitrarily close to 1. In other words, the rationalist approach gives us an explanation for why the area of the sub-tiles is so close to 1.

The problem with the rationalist approach is that, unlike the empiricist approach, its connection to reality is not obvious. What do we actually mean by saying that "[t]he sequence of partial sums of these numbers is approximately the sequence "? And in our conclusion that 1/2 + ... + 1/1024 ≈ 1, what do we mean by "≈", i.e. exactly how close can we conclude that 1/2 + ... + 1/1024 is to 1?

To see a concrete way in which the rationalist approach can go awry, let's use the same logic and consider the sequence , where

One might accidentally subsume 1/2 + ... + 1/256 under the sequence instead of the sequence , because in real life we're only going up to , and these two sequences agree as long as . The limit of the sequence is 42, so if one proceeded as before by taking the limit, he would erroneously conclude that
A more objective approach can be found by digging into the proof that the limit of is 1. By basic algebra, one may verify that the following identity holds

for any number such that . The term in parentheses should be thought of as a limit, and the other term should be thought of as an error term, or a deviation from the limit. Plugging in , the formula yields

so as gets larger and larger, this quantity gets closer and closer to 1 (incidentally, that's the proof that the limit of is 1). In our case, , so the sum will be almost equal to the limit, but with an error of , which is negligible for our purposes. Therefore 1 square meter of granite is needed.


Observe that in contrast to the empiricist approach, the objective approach didn't require a calculator, and it used concepts which clearly generalize far beyond the problem at hand; and in contrast to the rationalist approach, the objective approach produced a complete and correct answer, and didn't rely on any floating abstractions which could lead us down the wrong path.

Tangent lines

In Boulder, Colorado, there is a street called Baseline that runs precisely along the 40th parallel—that is, the coordinate line at 40˚ north latitude. From my parents' house, Baseline stretches out to the east for many miles, and throughout entire length it runs along the 40th parallel.

There is a fact familiar to every educated adult, that the Earth is a sphere, but that the radius of the sphere is so large that perceptually, to a person standing on the surface, it looks flat. This is an example of thinking of something as a limit; the plane is a limit of spheres of increasingly large radius. Similarly, any great circle on a sphere (such as the 40th parallel), in the limit that the radius of the sphere approaches "infinity," is a line (see the sequence above).

Now, consider the following very practical question, with a very obvious answer: If you start at my parents' house and walk east down Baseline for a mile, how far will you need to walk to return to my parents' house?

Empiricist approach: Well, okay, the 40th parallel is a circle on a sphere, and hmm let's see... the sphere has a very large radius... and so if you walk down this line... hmmm could there be a faster way to get back? I guess not because the radius is so large.

Rationalist approach: I passed to the limit, so really we are just considering a line in a plane. You're proposing to walk down a line in a plane, so obviously the shortest way back is to walk the same distance, in the opposite direction, down the same line. By the exact same logic, if you walked east down the 40th parallel for 20,000 miles, you'd have to walk 20,000 miles to get back.

The empiricist approach does not produce an incorrect answer, but it's a lot of unnecessary cognitive work, all done in some futile attempt to think of things "as they really are," and not make any simplifications using concepts. The rationalist approach gets the correct answer to the question asked, and it does so in a cognitively easy way, but it can go wrong because it's dropping context and disconnected from reality. (The italicized part of the rationalist's answer is wrong because if you travelled down the 40th parallel for long enough, you would end up back where you started, because the 40th parallel is a circle.)

The objective approach is to note that Boulder is very small compared to the area of the entire Earth, so it's okay to think of Boulder as a plane. You won't leave Boulder if you only walk for a mile, so this approximation remains valid for the scenario in question. Therefore, the (un-italicized part of the) rationalist reasoning applies: you'll have to walk 1 mile to return to my parents' house.

square root of 2

I don't have enough room to describe in detail a scenario in which the following might happen, but I claim that one runs into things like this all the time in mathematics and physics.

Suppose that I am doing some long computation, and involved somewhere in the computation is a square of area 2. At some point in the computation, I have to think about the side length of this square, which is √2. Then, even later in the computation, I have to square that side length, getting back to the square.

The math you should be writing down should look like

Notes


References

  1. A function is a quantitative relation between two groups of existents (called the "domain" and "codomain" of the function), which relates any existent in the domain to a unique existent in the codomain.
  2. Modern mathematics would say that is a topological space, but I do not wish to commit myself to that concept, because it raises potential philosophical problems.
  3. https://en.wikipedia.org/wiki/Square_root_of_2#Representations[TODO]