<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Narrow Road</title>
	<atom:link href="http://jedidiah.stuff.gen.nz/wp/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://jedidiah.stuff.gen.nz/wp</link>
	<description>Zen and the Art of Mathematics</description>
	<lastBuildDate>Sat, 15 Dec 2007 00:43:49 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Good News and Bad News</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=24</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=24#comments</comments>
		<pubDate>Sat, 15 Dec 2007 00:43:49 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Site news]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=24</guid>
		<description><![CDATA[The good news is that I am now gainfully employed in a full time job doing mathematics research. The bad news is that I have a lot less spare time to write new entries here. While the long term goal of eventually developing enough mathematics to motivate category theory and topos theory is firmly in [...]]]></description>
			<content:encoded><![CDATA[<p>The good news is that I am now gainfully employed in a full time job doing mathematics research. The bad news is that I have a lot less spare time to write new entries here. While the long term goal of eventually developing enough mathematics to motivate category theory and topos theory is firmly in place, the rate at which we get there will be drastically reduced. In the new year I am afraid that you can expect fewer entries, each of which will likely be shorter. Hopefully by reducing the quantity of work per entry and I can still keep them somewhat frequent (I am aiming for about 1 per month), and continue to advance toward the goal of motivating and explaining some of the truly interesting and inspiring ideas in modern mathematics.</p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=24</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>A Brief Tangent</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=23</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=23#comments</comments>
		<pubDate>Wed, 24 Oct 2007 15:31:07 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Calculus]]></category>
		<category><![CDATA[Journey]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=23</guid>
		<description><![CDATA[I would like to get temporarily sidetracked. The straight road is often not always the best road, at least not when the journey matters at least as much as the destination. Basho often took sidetracks, and they often provided some of the highlights of his journeys. Wandering off track gives you the opportunity to learn [...]]]></description>
			<content:encoded><![CDATA[<p>I would like to get temporarily sidetracked. The straight road is often not always the best road, at least not when the journey matters at least as much as the destination. Basho often took sidetracks, and they often provided some of the highlights of his journeys. Wandering off track gives you the opportunity to learn a little more about the country that you are travelling through; that&#8217;s precisely the sort of sidetrack we&#8217;re about to take.</p>
<p>
We&#8217;ve spent some time <a href="http://jedidiah.stuff.gen.nz/wp/?p=9">contemplating</a> and <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">discussing</a> the <a href="http://jedidiah.stuff.gen.nz/wp/?p=17">intricacies of the infinite</a>. We started off with a very natural abstraction, and quickly got lead into a mire of technicality of complexity. With a little work we came through unscathed and ended up with a new appreciation for the unfolding beauty and complexity that we were lead to. All of that, however, was a matter of &#8220;head in the clouds&#8221; contemplation as to what it truly meant to be infinite, and what the continuum actually was. Now that we have that understanding it is time to wander off the path and explore the countryside that such understanding begins to open up for us.</p>
<p>
<span id="more-23"></span><br />
We are going to take a rather circuitous route through all of this, so please bear with me as we wander in odd directions. I want to start by picking up from <a href="http://jedidiah.stuff.gen.nz/wp/?p=9">Paradoxes of the Continuum, Part I</a> and Zeno&#8217;s paradoxes &#8212; in particular it is finally time to have a look at the previously undiscussed third paradox of Zeno, knows as <i>The Arrow</i>. The paradox generally runs as follows: picture an arrow speeding toward its target. Now imagine freezing time to just a single moment as the arrow travels; the arrow will be stuck, stationary and not moving. And yet this will be true at each and every moment of the arrows flight &#8212; the arrow is always stationary! Thus, would conclude Zeno, the concept of movement and change is false; merely an illusion.</p>
<p>
As with Zeno&#8217;s other paradoxes, also meant to demonstrate that movement is an illusion, most people aren&#8217;t convinced. It sounds all well and good, but I think we tend to all suspect there&#8217;s a trick somewhere in that reasoning. And indeed there is, but it is a subtle one, and teasing it out will require our <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">new understanding of the continuum</a>. To start, however, we&#8217;re going ask some questions about <i>speed</i>.</p>
<p>
We tend to have intuitions about speed (or more correctly velocity, but I&#8217;m going to avoid vectors for now), and it is these inexact intuitions that Zeno preys upon. We think of speed as the rate of movement of an object. An object that has zero speed is stationary &#8212; not moving. But how do we know the speed of an object? We observe is change in position over time, and then normalize that value based on the span of time we were observing the object for. This works well enough, giving an average speed over reasonable spans of time, but fools us when we have to deal with Zeno&#8217;s thought experiment: If we freeze time then surely the change in position will be zero, and doesn&#8217;t that mean that the speed will be zero? Well no, contrary to intuition the <i>instantaneous</i> speed isn&#8217;t zero; to see that, however, we need to think about time as a continuum.</p>
<p>
At the core this is the heart of the trick in Zeno&#8217;s third paradox: he effectively manages a straddle, viewing time as either discrete or continuous at different times to suit his needs. If we think carefully, and view time as truly continuous, the problem will evaporate.</p>
<p>
Given a span of time it is easy enough to calculate a change in position by taking a difference (that is, subtracting the first position from the second). What are we to make, however, of a change in position when there is no span of time involved? What we need, is a <i>continuous difference</i> between positions &#8212; the difference as positions change smoothly and continuously. So let&#8217;s think about a moment in time: it is a point on the continuum. We have, however, learned about continuums, and found that <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">a point on a continuum is really an infinite sequence of ever more accurate approximations &#8212; a Cauchy sequence</a>. That is, a point on the continuum is an infinite sequence of rational points that &#8220;home in&#8221; around the desired point; for many points on the continuum (indeed, <a href="http://jedidiah.stuff.gen.nz/wp/?p=17">for most of them!</a>) we can only specify them as an infinite sequence of approximations that get ever closer to, but never quite reach, the point. More importantly, however, we learned that there are many different Cauchy sequences that all refer to the same point (in much the same way as there are many fractions that refer to the same ratio): we simply pick a representative sequence. We could, of course, choose two different representative sequences for the same point. If we&#8217;re careful we can choose sequences that have no terms in common &#8212; perhaps one sequence is consistently increasing, each term larger than the next, but by a smaller and smaller amount, toward our desired &#8220;limit point&#8221;, while the other is consistently decreasing, each term slightly smaller than the next, down toward the same desired &#8220;limit point&#8221;. </p>
<p>
To make this a little easier to talk about we&#8217;ll provide some labels. Let</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic"><br />
t<sub>1</sub>, t<sub>2</sub>, t<sub>3</sub>, t<sub>4</sub>, &#8230;<br />
</span>
</p></blockquote>
<p>be the moment in time expressed as the sequence tending from below, and let</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic"><br />
T<sub>1</sub>, T<sub>2</sub>, T<sub>3</sub>, T<sub>4</sub>, &#8230;<br />
</span>
</p></blockquote>
<p>be the same moment, expressed as the sequence tending from above. Now, for each term in each sequence, we&#8217;ll have an associated position of the arrow at that time (the term is, in a sense, shorthand for a constant sequence). Thus for the first sequence we&#8217;ll have a sequence of positions:</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic"><br />
p<sub>1</sub>, p<sub>2</sub>, p<sub>3</sub>, p<sub>4</sub>, &#8230;<br />
</span>
</p></blockquote>
<p>and for the second sequence we&#8217;ll have a different sequence of positions:</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic"><br />
P<sub>1</sub>, P<sub>2</sub>, P<sub>3</sub>, P<sub>4</sub>, &#8230;<br />
</span>
</p></blockquote>
<p>
With all of that in mind we can set about calculating the speed of the arrow at the chosen moment in time. How do we do that? Well we can certainly find the difference between positions <span style="font-family:serif;font-style:italic">p<sub>1</sub></span> and <span style="font-family:serif;font-style:italic">P<sub>1</sub></span>, and since <span style="font-family:serif;font-style:italic">t<sub>1</sub></span> and <span style="font-family:serif;font-style:italic">T<sub>1</sub></span> are different we can normalize against the span of time between those positions to get <i>a</i> speed &#8212; call it <span style="font-family:serif;font-style:italic">s<sub>1</sub></span>. That is, we can calculate</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic"><br />
s<sub>1</sub> = (P<sub>1</sub> &#8211; p<sub>1</sub>)/(T<sub>1</sub> &#8211; t<sub>1</sub>)<br />
</span>
</p></blockquote>
<p>with solid assurance that neither the numerator nor denominator are zero (we specifically chose our sequences <span style="font-family:serif;font-style:italic">t<sub>n</sub></span> and <span style="font-family:serif;font-style:italic">T<sub>n</sub></span> so this would be the case). Indeed, we can do the same calculation for each set of terms, and find</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic"><br />
s<sub>2</sub> = (P<sub>2</sub> &#8211; p<sub>2</sub>)/(T<sub>2</sub> &#8211; t<sub>2</sub>)<br/><br />
s<sub>3</sub> = (P<sub>3</sub> &#8211; p<sub>3</sub>)/(T<sub>3</sub> &#8211; t<sub>3</sub><br />
</span>
</p></blockquote>
<p>and so on, giving us an infinite sequence <span style="font-family:serif;font-style:italic">s<sub>1</sub>, s<sub>2</sub>, s<sub>3</sub>, &#8230;</span>, which shouldn&#8217;t be that daunting since, if we were being realistic, we were expecting speed to be a continuum as well, and thus we expect speeds to be, ultimately, Cauchy sequences too. So is the sequence Cauchy? In the case of our arrow, the answer is yes. Does the sequence get closer and closer to zero? In the case of our arrow, the answer is no. And so we have a non-zero speed that we calculated by finding the change in position between a moment, and itself (normalized, of course, against span of time over which we observed the change). In other words, at any given moment the arrow isn&#8217;t actually stationary &#8212; it has some non-zero speed.</p>
<p>
There is a technicality that it is worth being aware of: for all of this to make sense we want to know that the speed we get is independent of our choice of Cauchy sequence to describe the chosen moment of time. That is, we don&#8217;t want to find that, if we choose two new and different Cauchy sequences that express our chosen moment and proceed with the calculation as above, we end up with a different speed as the result. We can, of course, end up with a different sequence, as long as the sequence tends to the same limit (recall that the same point on the continuum can be expressed as many different Cauchy sequences). Clearing this particular hurdle will be easier later when we have better mathematical machinery and notation, so for now it will suffice to say that, presuming our arrow travels as any normal arrow, we won&#8217;t have an issue here.</p>
<p>
The real question, however, is: what just happened? With a wave of my hand and a dance of symbols we&#8217;ve apparently made the problem disappear, but I can certainly forgive you if you don&#8217;t feel any more enlightened as to where Zeno&#8217;s paradox breaks down. Let&#8217;s go over it all again, but at a higher level, to try and get a feel for what&#8217;s really going on here.</p>
<p>
Zeno wants us to conclude that the arrow is frozen in a moment in time, that it is stationary. We intuitively think this was because we presume, in that moment of time, the change in position of the arrow is zero, and from this we think of it as having zero speed. The catch is that, for speed to make sense, we require the moment of time in which the arrow is frozen to be non-zero &#8212; that is, a moment should capture some span of time. We can think of the moment that way because we are used to thinking of things in terms of discrete units, and we intuitively associate a moment with a discrete unit of time. We think of it as  the smallest possible span of time; the fundamental tick of the universe that takes us from one moment to the next. The problem is that time is a continuum, and continuums don&#8217;t work that way: there  is there any clear &#8220;next moment&#8221;; more specifically, as noted in <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">Paradoxes of the Continuum, part II</a>, the continuum contains incommensurable points &#8212; there simply is no fundamental unit for continuums, it cannot exist.</p>
<p>
Now, in <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">Paradoxes of the Continuum, Part II</a> we made sense of the continuum, and these baffling points, by working in terms of Cauchy sequences. The points of the continnum are, in some sense, evanescent; we cannot pint them down, but instead must specify them as an infinite sequence of progressively more accurate approximations. Or, to put it another way, a point on the continuum, a moment in time, is a completed infinite, and as we saw in <a href="http://jedidiah.stuff.gen.nz/wp/?p=17">A Transfinite Landscape</a> we must be careful of our intuitions when dealing with completed infinities.</p>
<p>
Now, just as a moment is expressible as an infinite increasingly accurate approximation, the speed of the arrow in that moment (which exists in the continuum of speeds &#8212; since we expect speeds to change smoothly just as time does) will also be expressible as an infinite increasingly accurate approximation. How do we find increasingly good approximations of the speed at a given moment? By building each approximation from approximations of the moment in time! This is, in essence, exactly what we have done above. The evanescent nature of points on the continuum &#8212; that they can only be apprehended as ever better approximations rather than clear discrete points &#8212; is the catch here. Zeno&#8217;s paradox relies on us failing to fully grasp this bizarre aspect of the continuum.</p>
<p>
To defeat the paradox we have shifted from thinking in terms of discrete differences in position, to continuous (normalized) differences in position &#8212; differences that don&#8217;t require a discrete &#8220;next moment&#8221;. A natural question begins to arise: if we can take continuous normalized differences instead of the usual discrete differences, can we evaluate continuous normalized sums instead of the usual discrete sums? And what do we mean by that anyway? It is time to follow our nose, and attempt to resolve these questions, for in doing so we will eventually come to a better understanding of continuous differences as well!</p>
<p>
Before trying to work out what we mean by a continuous normalized sum, let&#8217;s sort out what a discrete normalized sum looks like, and what would have to change for it to be a continuous sum. A discrete normalized sum is, in practice, an average. If we have a list of populations for each country in the world we can find the average population for a country. To do that we sum up all the populations, and divide by the number of countries.</p>
<p>
That covers discrete normalized sums &#8212; so what does it mean to be continuous? All we need to do is shift to values that are spread over a continuous rather than discrete domain. Suppose we have a metal bar with a temperature that varies (continuously!) along it&#8217;s length. We can reasonably ask for the average temperature of the metal bar as a whole. Now, however, we don&#8217;t have discrete values as we did with the populations of countries; the temperatures are spread over a continuum. Our usual methods of summation won&#8217;t work &#8212; we need a continuous normalized sum!</p>
<p>
As with continuous differences we should expect to find our answer takes the form of an infinite sequence of ever better approximations since we expect our answer to, itself, be a value on a continuum. How do we arrive at these approximations however?</p>
<p>
If we look back at how the continuous differences case worked, we see that we arrived at the desired continuous difference via a series of ever better approximations using discrete differences. Following that line of thought, we should look to create discrete sums that approximate our desired continuous sum. To create a discrete sum we can simply pick a finite number of points along the bar, sum up the temperature values at those points, and normalize by dividing by the number of points on the bar we picked &#8212; that gives us a discrete normalized sum that we can calculate. Now, obviously in picking merely a finite number points to sample the temperature at we are getting only an approximation; if we add more points to our existing sample, however, the approximation will improve. Thus if we choose to sample the temperature at more and more points for each successive approximation we will have a sequence of approximations that will get more and more accurate. It&#8217;s not hard to see that such a sequence will be a Cauchy sequence, and thus we have specified a point on the continuum of temperatures that will be the average temperature of the bar.</p>
<p>
As with continuous differences, we again have the technical issue: in this case ensuring that which finite set of points we choose for each approximation doesn&#8217;t effect the final result; that is, at each stage we have a choice of which (and how many!) new points to add to our sample, and we want to be sure that our choice of points doesn&#8217;t actually change the end result: the Cauchy sequences of approximations are allowed to differ, but their limit, the point of the continuum that they specify, must be the same. For now that&#8217;s a little to technical to get into, but suffice to say that, with sufficient care, we can surmount this issue.</p>
<p>
Another issue is that of why we are using <i>normalized</i> sums; why not just use ordinary sums instead? This comes down to the nature of continuous sums, and can be elucidated by our examples of populations of countries and the temperature of a metal bar. In the discrete example, summing populations of countries, there is a clear way of finding the ordinary sum, the total population of all countries. We can do that because we have a clear sense of units that we are summing over: a country is a unit, each country has a population value, and all of those population values are of equal weight with regard to one another. Indeed any discrete sum, by default, defines the units that are being summed over, since we can regard each discrete value as a discrete unit. In contrast, when we have a continuous domain to sum over, such as the metal bar, there are <i>no basic units</i> since such a thing simply does not exist for a continuum! Given that we have no units to sum over, and thus cannot ensure that the values we are summing are of comparable weight to one another, a standard sum no longer quite makes sense. Instead we simply need to pick an arbitrary unit and convert (i.e. <i>normalize</i>) our values into that unit system.</p>
<p>
In the case of the calculation we made for the metal bar our unit was the bar itself, and thus the continuous normalized sum was the average temperature for the length of bar (the average, normalized per our unit of choice). We could equally well have worked in meters and normalized according to that unit; what we would get is the sum of the temperatures along the bar averaged over one meter of bar. What exactly does that mean? Well if the bar was, for example, three meters long, then the resulting sum over the length of the bar normalized to meters would be three times that of the average for the bar &#8212; we&#8217;re summing over three meters of bar, but normalizing (averaging) that total over only one meter of bar. Of course if we wanted the average temperature of just the first meter of the bar, and summed only over that first meter things would again make sense as a standard average. The key, however, is having a standardized unit to normalize each individual contribution to the sum against. </p>
<p>
This same sort thinking applies to the discrete differences example: we choose an arbitrary unit of speed (be it meters per second, feet per hour, or what have you) and, for each discrete speed calculation we do, normalize to those units so each successive approximation is comparable to the last. Thus we see that our continuous differences and sums are necessarily <i>normalized</i> differences and sums because we simply don&#8217;t have a default set of units in the continuous case, and thus have to pick and arbitrary unit and consistently normalize to those units. </p>
<p>
So, we now have a method for finding continuous sums, as well as continuous differences. The natural question is: how do they relate to one another? If they are to behave at all similar to discrete sums and differences, we would expect them to be inverses of one another. Does this work? Let&#8217;s go back to our example of the arrow, and recall that we can calculate, for each point in time, the speed of the arrow at that moment &#8212; the continuous (normalized) difference in position. It should be possible, using continuous sums, to sum up all those speeds. And indeed we can, but if we look a little closer at exactly what is going on, we&#8217;ll find something very interesting is happening.</p>
<p>
Both the continuous differences and continuous sums need to be normalized; the continuous differences in position are normalized with respect to time, giving us speed, while a continuous sum of speeds is also normalized with respect to time. Intriguingly, because of the way these normalizations work, they will cancel out. Recall that, if we normalized our metal bar against meters, we got a average temperature scaled by the length of the bar that we summed over in meters. If we were to sum up the continuous differences, the speeds, then we would get a average speed scaled by the amount of time we were summing over &#8212; but what is an average speed multiplied by the amount of time that average was maintained? Why it is the total change is position over that time! Thus if we take a continuous sum of the continuous differences in position, we arrive back at the total change in position. Likewise if we were to sum up instantaneous acceleration values (continuous differences in speed) we expect to get the total change in speed, and so on. This relationship between continuous sums and continuous differences (as relatively clear as it is when phrased in terms of sums and differences) is <i>The Fundamental Theorem of Calculus</i>!</p>
<p>
Indeed, what we have been doing here is the heart of calculus. Ultimately this is what calculus is: the arithmetic of the continuous. And this also begins to explain why it is that calculus is so important: the continuous is all around us; time and space are continuous, and so are many things that inhabit it such as electromagnetic fields. And even things that aren&#8217;t strictly continuous, such as the flow of water, can be modelled or approximated as being continuous (calculating the movement of every molecule of water is intractable, but if we approximate water as continuous medium the problem becomes quite manageable). When so much is best handled as continuous, it is clear that an arithmetic of the continuous, rather than the basic discrete arithmetic we are used to, is vital: it opens up vast new areas of our universe for mathematical exploration that were inaccessible using only discrete arithmetic. And so it was that, with the development of an organised calculus, an arithmetic of the continuous, in the 17th century, physics and our understanding of the universe underwent a stunning revolution that is still on-going today. And at the heart of this remarkable revolution was a deeper understanding of of the age old abstractions of the infinite and the continuous.</p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=23</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>A Brief Apology</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=22</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=22#comments</comments>
		<pubDate>Thu, 18 Oct 2007 13:51:00 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Site news]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=22</guid>
		<description><![CDATA[This is a brief apology to let readers know that I am still working on the site. I have been otherwise occupied for several weeks, but the larger issue currently is that I am struggling to write the next entry &#8220;A Brief Tangent&#8221;. The reason for this is that I am battling to find the [...]]]></description>
			<content:encoded><![CDATA[<p>This is a brief apology to let readers know that I am still working on the site. I have been otherwise occupied for several weeks, but the larger issue currently is that I am struggling to write the next entry &#8220;A Brief Tangent&#8221;. The reason for this is that I am battling to find the right way to introduce and discuss the ideas I wish to cover, and this has proved harder than I initially expected. Many words have been written, and not a small number erased again, and the entry is progressing slowly. I hope to be able to finish it in the next few weeks and get it posted. Thank you for your patience.</p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=22</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Parallel Paths</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=21</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=21#comments</comments>
		<pubDate>Tue, 11 Sep 2007 23:55:59 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Journey]]></category>
		<category><![CDATA[Overview]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=21</guid>
		<description><![CDATA[Some time ago we set out on two diverging roads. One road sent us exploring the infinite, while the other started out looking at discrete patterns, and how they may be abstracted. As we moved along the roads, however, we found the first hints of cross fertilization: our results about the infinite allowed us to [...]]]></description>
			<content:encoded><![CDATA[<p>Some time ago we set out on two diverging roads. One road sent us exploring the infinite, while the other started out looking at discrete patterns, and how they may be abstracted. As we moved along the roads, however, we found the first hints of cross fertilization: our results about the infinite allowed us to propose pattern algebras for infinite patterns &#8212; the result being that the algebra was precisely that of numbers. From here on in the two separate paths begin to align, running quite separate but parallel courses. The cross fertilization of ideas will continue, and we will begin to see increasing similarities between these two very separate worlds of abstraction. The paths ahead will soon become interesting indeed.</p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=21</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Marriage</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=20</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=20#comments</comments>
		<pubDate>Sat, 25 Aug 2007 22:12:18 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=20</guid>
		<description><![CDATA[I am now the very happy husband of Tuula Kristiina Talvila. The ceremony was held at Rockcliffe Park Pavilion in Ottawa on the 24th of August. I could not hope to be married to more wonderful woman, and am looking forward to a lifetime together with her. Hopefully, understandably, new entries may be somewhat delayed.
]]></description>
			<content:encoded><![CDATA[<p>I am now the very happy husband of Tuula Kristiina Talvila. The ceremony was held at Rockcliffe Park Pavilion in Ottawa on the 24th of August. I could not hope to be married to more wonderful woman, and am looking forward to a lifetime together with her. Hopefully, understandably, new entries may be somewhat delayed.</p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=20</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Grouping Symmetries</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=19</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=19#comments</comments>
		<pubDate>Sun, 05 Aug 2007 01:05:44 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[General Algebra]]></category>
		<category><![CDATA[Group Theory]]></category>
		<category><![CDATA[Journey]]></category>
		<category><![CDATA[Overview]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=19</guid>
		<description><![CDATA[On his journey north, Basho stopped at Matsushima and was spellbound by its beauty. The town itself was small, but the bay is studded with some 260 tiny islands. The white stone of the islands has been eaten away by the sea, leaving a multitude of endlessly different shapes, pillars and arches, all crowned with [...]]]></description>
			<content:encoded><![CDATA[<p>On his journey north, Basho stopped at Matsushima and was spellbound by its beauty. The town itself was small, but the bay is studded with some 260 tiny islands. The white stone of the islands has been eaten away by the sea, leaving a multitude of endlessly different shapes, pillars and arches, all crowned with pine trees. Each island is different and unique, and each, with its sculpted white cliffs tufted with pine, is beautiful. It is, however, the whole bay, the combined diversity, that ultimately makes Matsushima one of the three great scenic locations in Japan. So far on our journey we have been admiring the beauty of the islands; the subtlety and intricacy of the different algebras that arise from different patterns, different symmetries. It is time to step back and begin to admire the whole &#8212; and in so doing gain a deeper and richer perspective on all the sights of our journey so far.</p>
<p>
<span id="more-19"></span><br />
In the previous entries <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">Shifting Patterns</a> and <a href="http://jedidiah.stuff.gen.nz/wp/?p=15">Permutations and Applications</a> we have looked at how patterns and symmetry can be abstracted into algebras. Each different pattern provided its own unique algebra, with different algebraic rules. These are, in a sense, our islands; each different and unique and beautiful. What we need to explore now is the entire bay. To do this we need to take a step back, or more accurately up, and try to examine the whole. The aim is to abstract across all the different algebras that different patterns generate. We could, perhaps, liken this to the process of developing algebra from numbers by abstraction; here, however, we have different pattern-algebras standing in place of different numbers. In abstracting up from numbers to basic algebra we looked for those properties that held true regardless of the particular numbers under consideration &#8212; each different number has its own unique properties (indeed, there is a vast richness of material here that is studied in number theory) &#8212; but there are certain properties that are common to all. What we seek is a set of basic algebraic rules that are true for all pattern-algebras; each different pattern algebra may have its own idiosyncratic rules, but hopefully we can find a basic set of underlying rules. In practice we&#8217;re going to hope for even more than that. Not only do we want the algebraic rules we determine to be common to all pattern-algebras, we would very much like it if <i>any algebra</i> that satisfied the basic rules turned out to be the pattern-algebra for some pattern. That is, we want a two way correspondence: every pattern-algebra should satisfy these rules, and everything that satisfies these rules should be a pattern-algebra.</p>
<p>
So where to start? First and foremost we know that every pattern always has the null symmetry &#8212; the do nothing symmetry: doing nothing to a pattern will always preserve the pattern. Thus it follows that our rules should somehow express this fact. The question then is how to express the existence of a null symmetry in terms of algebraic rules. That, in turn, leads to the question of what algebraic rules make a given symmetry element a null symmetry. Now, the algebraic rules we can have for pattern-algebras are all expressed in terms of combining together different symmetries, so a rule for the null symmetry will be one that expresses how it interacts with other symmetries in combination. How does the null symmetry interact with other symmetries? Since it is the &#8220;do nothing&#8221; symmetry, combining it with other symmetries gives the same result as if we hadn&#8217;t done it at all. That is, we know that a symmetry <span style="font-family:serif;font-style:italic">e</span> is a null symmetry if, for any symmetry <span style="font-family:serif;font-style:italic">a</span> we have</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">ae</span> = <span style="font-family:serif;font-style:italic">ea</span> = <span style="font-family:serif;font-style:italic">a</span>
</p></blockquote>
<p>Thus our first requirement, or rule, for general pattern-algebras is that there must be some element <span style="font-family:serif;font-style:italic">e</span> of the algebra such that <span style="font-family:serif;font-style:italic">ae</span> = <span style="font-family:serif;font-style:italic">ea</span> = <span style="font-family:serif;font-style:italic">a</span> for any (i.e. every) element <span style="font-family:serif;font-style:italic">a</span> of the algebra.</p>
<p>
Next it would be nice to start to characterise the <i>symmetry</i> aspect of the pattern-algebras. Ultimately this can be reduced to just a couple of properties, though such a reductionist approach does diminish the transparency of the relationship to symmetry. The first, and easier to grasp, property boils down to the reversibility of symmetries. That is, if a certain action is a symmetry, then doing it&#8217;s opposite and taking everything back to how you started is also a symmetry. This is perhaps a little non-obvious since the &#8220;opposite&#8221; action doesn&#8217;t appear to necessarily be a pattern preserving symmetry in its own right; however, since the the result of our initial action results in a state that preserves the pattern, and the initial state to which we go back via the &#8220;opposite&#8221; action also necessarily preserves the pattern (by definition essentially) it follows that the opposite action necessarily takes pattern preserved states to pattern preserved states (that is, as long as we start in a pattern preserved state, we&#8217;ll always end up at another one) and is thus a symmetry. Looking at it from a broader perspective this is really a relatively deep statement about the nature of what we are calling symmetries: they are actions that move from one one state to another preserving some pattern that we have deemed it relevant to conserve; it is in the nature of such actions that they be reversible and that the inverse or opposite action is also a pattern preserving action. How do we write that in terms of algebraic rules? The opposite action is going to be one that combines with initial action to result in effectively a null action. Thus our second requirement is that for any symmetry <span style="font-family:serif;font-style:italic">a</span> there exists some symmetry <span style="font-family:serif;font-style:italic">b</span> such that</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">ab</span> = <span style="font-family:serif;font-style:italic">ba</span> = <span style="font-family:serif;font-style:italic">e</span>
</p></blockquote>
<p>where <span style="font-family:serif;font-style:italic">e</span> is the null symmetry whose existence we were guaranteed by the first requirement.</p>
<p>
The final property we require is the hardest to explain in clear practical terms. It will be easier to just state it, and then try and discuss exactly what it means, and why it might be relevant. The property we require is that for any three elements of the pattern-algebra, call them <span style="font-family:serif;font-style:italic">a</span>, <span style="font-family:serif;font-style:italic">b</span>, and <span style="font-family:serif;font-style:italic">c</span>, we have</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">a</span>(<span style="font-family:serif;font-style:italic">bc</span>) = (<span style="font-family:serif;font-style:italic">ab</span>)<span style="font-family:serif;font-style:italic">c</span>
</p></blockquote>
<p>This is the associative law which you should recall from <a href="http://jedidiah.stuff.gen.nz/wp/?p=7">A Fraction of Algebra</a>. What we are essentially saying by applying it here is that how we group together composition of symmetries is unimportant to the end result. That this is true of symmetries is relatively clear: given a sequence of symmetry actions the order of the actions matters, but how we group them does not; we can think of some pair, or group of actions in the sequence, as a single atomic action and the end result will be the same. For a more explicit example of this we can think of our example of the symmetries of a square from <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">Shifting Patterns</a>. There we expressed things in terms of two basic actions: a rotation by 90 degrees, <span style="font-family:serif;font-style:italic">r</span>; and a flip about the vertical axis, <span style="font-family:serif;font-style:italic">f</span>. These combined to provide other symmetries, for example <span style="font-family:serif;font-style:italic">fr</span> was the symmetry action of flipping the square about its trailing diagonal. Now, given a sequence of such actions, it didn&#8217;t matter whether we thought it as a diagonal flip about the trailing diagonal axis followed by a rotation by 90 degrees, or a flip about the vertical axis followed by a rotation by 180 degrees, or simply as a flip about the horizontal axis; all amount to the same result, the same rearrangement of corners of the square. Stretching your mind to abstract this to general symmetries will let you see that they too will have the same property. Seeing that this property is the last piece we need to characterise symmetries is a little harder, and perhaps beyond the scope of this entry. Suffice to say this third requirement is the last one we need.</p>
<p>
There are, of course, a few unstated assumptions that we&#8217;ve been getting away with here. For the purposes of informal discussion that&#8217;s fine, but when we get into hammering out the specifics for mathematical purposes we can&#8217;t afford to let such ambiguity stand. So, let&#8217;s spell out these final details&#8230;</p>
<p>
In assuming that there are symmetries of  pattern we are assuming that there are a set of actions, that is, rearrangements, and that those actions are composable &#8212; that we can combine two actions together one after the other. In terms of our pattern algebra that amounts to assuming the existence of some algebraic objects, which we can denote by letters (or other characters if we prefer) and some sort of <i>binary operation</i> for those algebraic objects. A binary operation is essentially just a rule that allows us to combine together any two elements and arrive at a third. Addition is a binary operation on numbers; you add together two numbers to get a third. Likewise multiplication is a binary operation for numbers; you multiply two numbers together and get a new number. So what is the binary operation for pattern-algebras? There isn&#8217;t one in particular. Indeed, the binary operation is essentially defined by the algebraic rules unique to that pattern-algebra, so different pattern-algebras have different binary operations. There are a set of rules that help narrow down binary operations for pattern algebras in general, and those are, of course, precisely the rules we&#8217;ve been discussing previously. What matters, however, is that there exist <i>some</i> binary operation that can act on any pair of algebraic objects.</p>
<p>
Putting all of this together we can arrive at a formal description of what it takes to be a pattern-algebra. Using abstraction to pare it down the the minimum set of requirements we have: a pattern-algebra is a set of objects, with a binary operation on those objects, such that the following hold:</p>
<ol>
<li>There is an object <span style="font-family:serif;font-style:italic">e</span> such that, for any other object <span style="font-family:serif;font-style:italic">a</span> in the set, <span style="font-family:serif;font-style:italic">ea</span> = <span style="font-family:serif;font-style:italic">ae</span> = <span style="font-family:serif;font-style:italic">a</span>.</li>
<li>For each object <span style="font-family:serif;font-style:italic">a</span> in the set, there is some object <span style="font-family:serif;font-style:italic">b</span>, also in the set, such that <span style="font-family:serif;font-style:italic">ab</span> = <span style="font-family:serif;font-style:italic">ba</span> = <span style="font-family:serif;font-style:italic">e</span> (Where <span style="font-family:serif;font-style:italic">e</span> is the special object mentioned in requirement 1).</li>
<li>For any three objects <span style="font-family:serif;font-style:italic">a</span>, <span style="font-family:serif;font-style:italic">b</span>, and <span style="font-family:serif;font-style:italic">c</span> in the set, we have (<span style="font-family:serif;font-style:italic">ab</span>)<span style="font-family:serif;font-style:italic">c</span> = <span style="font-family:serif;font-style:italic">a</span>(<span style="font-family:serif;font-style:italic">bc</span>).</li>
</ol>
<p>and that&#8217;s it. This sort of very abstract and minimalist definition is exactly what you&#8217;ll find at the very beginning of most books on <i>Group Theory</i>, since this is the formal definition of what mathematicians call a <i>group</i>. Indeed a <i>group</i> is really just a pattern-algebra, though in the more rarefied areas of group theory the patterns they relate to can be so hideously complex as to be effectively unimaginable. Since using the same terms as mathematicians will help keep us on the straight and narrow when referring to any outside sources I&#8217;ll henceforth be using &#8220;group&#8221; to mean the sort of pattern-algebra we&#8217;ve been discussing throughout <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">Shifting Patterns</i>, <a href="http://jedidiah.stuff.gen.nz/wp/?p=15">Permutations and Applications</a> and this entry. You can, of course, mentally translate &#8220;group&#8221; to mean &#8220;pattern-algebra&#8221; to help keep the mental connections to patterns and symmetry clearer.</p>
<p>
So what was all this abstraction for? What exactly have we gained by reducing things to this very abstruse definition in terms of sets and binary operations and algebraic rules? One may as well ask what the point of abstracting over different collections of objects to arrive at the abstruse notion of numbers and arithmetic is. We&#8217;ve dropped away all the fine grained particularity, and reduced things to a simple matter of clearly defined rules. Any set of objects with a binary relation that meets these three simple rules can be thought of as a group relating to some pattern. We don&#8217;t have to know about the pattern however, we can simply work within the specific rules of the group. More importantly, however, we now have an overview of the entire bay of islands rather than finding ourselves inspecting each unique island. These three simple rules provide the minimal basis for any group. Any group must have at least these three rules, and we can add any extra rules we like to these three base rules (as long as the resulting set of rules is self-consistent) and arrive at a new group. We aren&#8217;t bound by patterns any longer, we can explore the unique complexity of any island in the bay by simply building the group of our choosing; by imposing the extra rules, the extra structure, we wish. We need only choose a set of some size, and rules regarding how elements interact with one another.</p>
<p>
This, in turn, begins to draw us full circle. The perceptive reader may have noticed that each of the three rules defining a group is listed amongst the algebraic rules for addition of numbers back in <a href="http://jedidiah.stuff.gen.nz/wp/?p=7">A Fraction of Algebra</a>; we had the existence of an additive identity (the number zero) which covers requirement 1, the existence of additive inverses (negative numbers) which covers requirement 2, and the associativity of addition which covers requirement 3. Furthermore, you will hopefully recall that, as discussed in <a href="http://jedidiah.stuff.gen.nz/wp/?p=5">The Slow Road</a>, higher order operations such as subtraction, multiplication, and division, could be built from addition. Numbers form a group; they are a pattern-algebra! </p>
<p>
What pattern does the group of numbers under addition relate to? If we are thinking of the integers then you can think of an infinite line of marbles (infinite in both directions), with the various symmetries being horizontal shifts of varying sizes. The fact that such a shift is indeed a symmetry, that it results in exactly the same pattern of marbles, is something we discussed and resolved in our parallel road <a href="http://jedidiah.stuff.gen.nz/wp/?p=17">considering the nature of the infinite</a>. And so now, having scaled the heights, we find we can look back, out across the vast bay with an infinite number and diversity of unique and interesting islands, and see that what we had thought of as the vast plain of numbers with all its intricacies, is, in fact, just a single island amidst a sea of many many more. Certainly there is much to explore on that original island, but that is just the beginning; we now have the perspective to see how much wider the world is; to put our narrow beginnings in their proper place. And still we have only just begun! There is a vast bay of islands yet to explore, and once we have begun to comprehend some of their mysteries we can strike out for higher ground again from which we can look back and see even how narrow our current vista is. </p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=19</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Ph.D. Oral Exam</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=18</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=18#comments</comments>
		<pubDate>Sun, 15 Jul 2007 22:43:41 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Site news]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=18</guid>
		<description><![CDATA[On Friday the 13th of July I successfully defended my thesis, entitled &#8220;Pro-finite Lie rings and p-adic Lie algebras&#8220;. From here I have only the minor hurdles of getting the Faculty of Graduate Studies to sign off on formatting, and organising copying and binding, to successfully complete my degree requirements. I will therefore be graduating [...]]]></description>
			<content:encoded><![CDATA[<p>On Friday the 13th of July I successfully defended my thesis, entitled &#8220;<i>Pro-finite Lie rings and p-adic Lie algebras</i>&#8220;. From here I have only the minor hurdles of getting the Faculty of Graduate Studies to sign off on formatting, and organising copying and binding, to successfully complete my degree requirements. I will therefore be graduating in October, having moved to Ottawa at the end of this month, and gotten married to my fiancée Tuula on the 24th of August. For those who are curious, a copy of the my thesis can now be found in the PDFs section on the lower right; I can&#8217;t promise as to how understandable it will be for a general audience, but you may be interested to know what a mathematics Ph.D. thesis looks like.</p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=18</wfw:commentRss>
		<slash:comments>12</slash:comments>
		</item>
		<item>
		<title>A Transfinite Landscape</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=17</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=17#comments</comments>
		<pubDate>Mon, 02 Jul 2007 19:03:43 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Calculus]]></category>
		<category><![CDATA[General Mathematics]]></category>
		<category><![CDATA[Journey]]></category>
		<category><![CDATA[Logic]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=17</guid>
		<description><![CDATA[Problems that involve infinity have a tendency to read a little like Zen koans. Take, for example, this problem: Suppose we have three bins (labelled &#8220;bin A&#8221;, &#8220;bin B&#8221; and &#8220;bin C&#8221;) and an infinite number of tennis balls. We start by numbering the tennis balls 1,2,3,&#8230; and so on, and put them all in [...]]]></description>
			<content:encoded><![CDATA[<p>Problems that involve infinity have a tendency to read a little like Zen koans. Take, for example, this problem: Suppose we have three bins (labelled &#8220;bin A&#8221;, &#8220;bin B&#8221; and &#8220;bin C&#8221;) and an infinite number of tennis balls. We start by numbering the tennis balls 1,2,3,&#8230; and so on, and put them all in bin C. Then we take the two lowest numbered balls in bin C (that&#8217;s ball 1, and ball 2 to start) and put them in bin A, and then move the lowest numbered ball in bin A from bin A to bin B (that would be ball 1 in the first round). We repeat this process, moving two balls from bin C to bin A, and one ball from bin A to bin B, an infinite number of times. The question is, how many balls are in bin A and how many balls are in bin B when we&#8217;re done? Think carefully!</p>
<p>
<span id="more-17"></span><br />
The difficulty is in being sure of your answer*. We require a way to think consistently and coherently about such matters. So what is the answer? bin A has <i>no tennis balls</i> in it, while bin B has an infinite number! Does that sound wrong? It certainly seems confusing: we are consistently putting two balls into bin A and only taking one out at each step, so how can we end up with no balls in bin A? The key is to think in terms of a finished state, when the infinite process is somehow &#8220;complete&#8221;. Every ball is eventually moved to bin B, thus after an infinite number of steps all the balls must have been moved to bin B. The counterintuitive aspect is that we don&#8217;t expect moving one ball at a time to ever catch up with moving two balls at a time, yet oddly this happens.</p>
<p>
Another tale that highlights this point is that of the hotel with an infinite number of rooms**. The story usually begins with the hotel finding itself full one evening. A lone traveller then arrives, very weary, and asks the hotel manager if there is any chance at all that he can get a room. The hotel manager ponders this for a moment, and then has an idea. He asks each guest to move to the room numbered one higher than their current room. Since every number has a number one greater, and there are an infinite number of rooms, everyone is housed; and yet room number 1 is now empty, and the traveller has somewhere to stay. It doesn&#8217;t end there though. After the lone traveller, an infinite tour bus arrives, carrying an <i>infinite</i> number of passengers all looking for rooms. After having solved the first problem, however, the hotel manager isn&#8217;t phased. He asks each guest to move into the room with number twice that of their current room. Again, each number has a number twice as large, and there are an infinite number of rooms, so again everyone is housed; this time, however, everyone is housed in even numbered rooms, which leaves infinitely many odd numbered rooms in which the bus-load of tourists can be put up for the night.</p>
<p>
This brings us a little closer to the sticky point where our intuitions start to go astray. For any finite number <span style="font-family:serif;font-style:italic">n</span>  we expect there to be (roughly, it depends on whether <span style="font-family:serif;font-style:italic">n</span> is odd or even) half as many even numbers less than <span style="font-family:serif;font-style:italic">n</span> than there are natural numbers less than <span style="font-family:serif;font-style:italic">n</span>; when we have infinitely many numbers, however, there seems to be exactly as many even numbers as natural numbers. It&#8217;s this sort of unexpected equality with a set we initially intuitively think should be half as big that allows the tennis ball problem to fool us. In a sense, looking back from a completed infinity, 1 and 2 look pretty much the same. What it really comes down to, however, is the very simple question of what we mean by &#8220;how many&#8221;.  As we&#8217;ve often seen before, the devil is usually in the details, and even simple things that we think we know and understand bear some thinking about if we want to be sure we actually know what we mean.</p>
<p>
What happens when we count things? Because counting is a fairly innate skill for most adults it is helpful to consider what children, for whom counting is still somewhat new, do. Usually they count on their fingers (or other similar things), making a correlation between objects counted and fingers held up. At the more advanced adult level we do much the same thing, but we correlate with abstract objects (numbers, which by that time we&#8217;ve solidly beaten into instinctual memory). The point I&#8217;m trying to get at here is that counting is a matter of correlation; more importantly it is a very particular kind of correlation: in mathematics it is known as a one-to-one correlation. This means that each object corresponds with exactly one other object, and vice versa &#8212; in practice each object corresponds to exactly one number in our count, and each number in the count corresponds to exactly one object. If you can accept that, at the heart of it, it is that one-to-one correspondence that matters in counting, that it is the correspondence that ultimately determines what we mean by quantity, then we can pull out the mathematicians handy tool of abstraction and forget the other unimportant trivialities we might associate with counting, and use the idea of one-to-one correspondence to &#8220;count&#8221; infinite quantities. It might not be counting exactly as you would normally do it, but it will have the same core properties that matter about counting and quantity, and in the end as long as we can agree as to what important parts need to be preserved we can happily abstract all the rest away.</p>
<p>
So how do we count infinite sets with one-to-one correspondences?  Rather than actually counting infinitely many things, we provide an explicit process by which the count could (in theory) be done. Thus, we simply try to set up a  one-to-one correspondence just as before, the difference being that it will be given as a rule we can apply element by element as needed, rather than having every single element to element correspondence laid out ahead of time. If such a correspondence exists then the sets have the same infinite quantity. And that is exactly what we are doing, for example, with the infinite hotel story. First we are comparing the sizes of the sets {1,2,3,&#8230;} and {2,3,4,&#8230;} by noting that we have a correspondence<br />
<center></p>
<table width=80%>
<tr>
<td align="center">1</td>
<td align="center">2</td>
<td align="center">3</td>
<td align="center">4</td>
<td align="center">&#8943;</td>
<td align="center"><span style="font-family:serif;font-style:italic">n</span></td>
<td align="center">&#8943;</td>
</tr>
<tr>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center"></td>
<td align="center">&#8597;</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">2</td>
<td align="center">3</td>
<td align="center">4</td>
<td align="center">5</td>
<td align="center">&#8943;</td>
<td align="center"><span style="font-family:serif;font-style:italic">n</span>+1</td>
<td align="center">&#8943;</td>
</tr>
</table>
<p></center><br />
and that since both sets are infinite we will have exactly one element in the second set for every element in the first set, and vice versa; a one-to-one correspondence. The sets have the same quantity &#8212; thus we can shuffle everyone down one room and still house them all. When the tour bus shows up we end up comparing the sizes of the sets {1,2,3,&#8230;} and {2,4,6,&#8230;} by making the correspondence<br />
<center></p>
<table width=80%>
<tr>
<td align="center">1</td>
<td align="center">2</td>
<td align="center">3</td>
<td align="center">4</td>
<td align="center">&#8943;</td>
<td align="center"><span style="font-family:serif;font-style:italic">n</span></td>
<td align="center">&#8943;</td>
</tr>
<tr>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center"></td>
<td align="center">&#8597;</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">2</td>
<td align="center">4</td>
<td align="center">6</td>
<td align="center">8</td>
<td align="center">&#8943;</td>
<td align="center">2<span style="font-family:serif;font-style:italic">n</span></td>
<td align="center">&#8943;</td>
</tr>
</table>
<p></center><br />
where again the infinite sets ensure that each element corresponds to exactly one element in either direction; another one-to-one correspondence, demonstrating the sets have the same quantity &#8212; we can move everyone to even numbered rooms and still house them all.</p>
<p>
At this point most people are happy enough to accept that there are the same number of even numbers as there are natural numbers. Their argument runs roughly &#8220;well there is an infinite number of both, and infinity is as big a number as there can be, so of course they&#8217;re the same&#8221;. There is, possibly, a little squirming under the fact that the what is clearly only a part apparently has the same size as the whole, but that tends to get swept under the rug of &#8220;infinity is as big as you can go, so we don&#8217;t have a choice&#8221;. The real problems start, however, when we come to the realisation that using this same idea of quantity (determined in terms of one-to-one correspondences) we can find sets with sizes larger than the set of natural numbers: there may be infinitely many natural numbers, but that is not as large as you can go!</p>
<p>
The classic example of something infinite and larger in size than the natural numbers is the continuum (at least as classically conceived; the constructivist/intuitionist continuum is a little more tricky on this front) as discussed in <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">Paradoxes of the Continuum, Part II</a>. In that post we determined that points on the continuum were able to be identified with Cauchy sequences, which were akin to (though a little more technical than) infinite decimal expansions. We&#8217;ll stick with infinite decimal expansions here as most people have a better intuitive grasp of decimals than they do of Cauchy sequences or Dedekind cuts. To make things simple we&#8217;ll consider the continuum ranging between 0 and 1; that is, all the possible infinite decimals between 0 and 1, such as 0.123123123&#8230; We do have to be a little bit careful here since, as you should recall from <a href="http://jedidiah.stuff.gen.nz/wp/?p=14">Paradoxes of the Continuum, Part II</a>, in the same way that there are many fractions that represent the same ratio, there were many Cauchy sequences that represent the same point in the continuum, and in particular there are different decimal expansions that represent the same point, such as 0.49999999&#8230; and 0.50000000&#8230; ***. To be careful we have to make sure we always pick and deal with just one representative in all such cases; to do this we can simply only consider representations that have infinitely many non-zero places. Showing this is sufficient, and still covers all real numbers between 0 and 1 isn&#8217;t that hard, but amounts to some technical hoop jumping that is necessary for formal proofs, but not terribly elucidating for discussions such as this. Suffice to say that it all works out.</p>
<p>
The catch now is that we need to show not just that we are incapable of finding a one-to-one correspondence between these points on the continuum and the natural numbers, but that no such correspondence can exist. We do this by the somewhat backwards approach of assuming there is such a correspondence, and then showing that a logical contradiction would result. From that we can conclude that any such correspondence would be contradictory, and thus can&#8217;t actually exist (at least not in any system that doesn&#8217;t have contradictions). So, to begin, lets presume we have a correspondence like so****<br />
<center></p>
<table width=100%>
<tr>
<td align="center">1</td>
<td align="center">2</td>
<td align="center">3</td>
<td align="center">4</td>
<td align="center">&#8943;</td>
</tr>
<tr>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center">&#8597;</td>
<td align="center"></td>
</tr>
<tr>
<td align="center">0.<span style="font-family:serif;font-style:italic">a</span><sub>1</sub><span style="font-family:serif;font-style:italic">a</span><sub>2</sub><span style="font-family:serif;font-style:italic">a</span><sub>3</sub>&#8230;</td>
<td align="center">0.<span style="font-family:serif;font-style:italic">b</span><sub>1</sub><span style="font-family:serif;font-style:italic">b</span><sub>2</sub><span style="font-family:serif;font-style:italic">b</span><sub>3</sub>&#8230;</td>
<td align="center">0.<span style="font-family:serif;font-style:italic">c</span><sub>1</sub><span style="font-family:serif;font-style:italic">c</span><sub>2</sub><span style="font-family:serif;font-style:italic">c</span><sub>3</sub>&#8230;</td>
<td align="center">0.<span style="font-family:serif;font-style:italic">d</span><sub>1</sub><span style="font-family:serif;font-style:italic">d</span><sub>2</sub><span style="font-family:serif;font-style:italic">d</span><sub>3</sub>&#8230;</td>
<td align="center">&#8943;</td>
</tr>
</table>
<p></center><br />
where the <span style="font-family:serif;font-style:italic">a</span><sub>1</sub> etc. are just digits in the decimal expansion. The trick is to show that despite our best efforts to set up a one-to-one correspondence, the list of points in the continuum given by this correspondence (and since we haven&#8217;t specified what the correspondence actually is, <i>any</i> such one-to-one correspondence) is actually incomplete: we&#8217;ve missed some. We do this by constructing a decimal as follows: for the first decimal place, choose a digit different from <span style="font-family:serif;font-style:italic">a</span><sub>1</sub>, for the second decimal place choose a digit different from <span style="font-family:serif;font-style:italic">b</span><sub>2</sub>, for the third choose a digit different from <span style="font-family:serif;font-style:italic">c</span><sub>3</sub>, and so on. Now clearly this decimal is between 0 and 1, and hence ought to be in our list somewhere, but by its very manner of construction it isn&#8217;t! How can we be sure of that? Consider any decimal on the list, say the <span style="font-family:serif;font-style:italic">n</span>th one; now since in constructing our new decimal we specifically chose the digit in the <span style="font-family:serif;font-style:italic">n</span>th decimal place to be different from the <span style="font-family:serif;font-style:italic">n</span>th decimal place of the <span style="font-family:serif;font-style:italic">n</span>th decimal on the list, even if our decimal agrees at every other decimal place, we know it differs at at least one &#8212; and if it differs at one decimal place, then it is a different decimal. Since that argument applies to every single decimal on the list, we are guaranteed that our constructed decimal is different from every decimal we&#8217;ve listed! Thus despite our assumption that we had a one-to-one correspondence, it isn&#8217;t, since we&#8217;ve found a point in the continuum for which there is no corresponding natural number. Given such a contradiction, the only conclusion we can draw is that we cannot create a one-to-one correspondence between natural numbers and points in the continuum &#8212; no matter how we try, we&#8217;ll always end up with extra points in the continuum for which there is no corresponding natural number; that is, there are more points in the continuum than there are natural numbers.</p>
<p>
What all of this means is that we have to come to terms with the fact that some infinities are bigger than others. In fact, it gets even worse: some infinities are entirely incompatible with others. This particular catch hides in a slightly over-zealous abstraction of numbers. For the most part we do not differentiate between numbers describing order (2nd position, as opposed to 5th position, etc.) and numbers describing quantity (which is generally the notion of number with which we&#8217;ve been dealing). There is a perfectly good reason for this: when it comes to using and manipulating numbers using standard arithmetic, the numbers describing order (so called <i>ordinal numbers</i>) behave completely identically to those describing quantity (which we call <i>cardinal numbers</i>). As so often happens with abstraction (indeed, it essentially is the core idea of abstraction) if there are no practical differences (at least as far as the practical purposes we care about are concerned) between objects, we simply forget that there are any differences at all. And, indeed, for finite numbers this is a perfectly reasonable thing to do. The catch is that, once we start dealing with infinities, ordinals and cardinals start behaving rather differently &#8212; it is no longer safe to consider them the same, or even, for that matter, comparable to one another.</p>
<p>
I won&#8217;t go into the rather technical theory of transfinite ordinals here, and instead just give you a precis of where the difficulties lie. To start, lets&#8217; introduce some standard mathematical notation, and let &#8501;<sub>0</sub> denote the first infinite <i>cardinal</i> (that is, the quantity of natural numbers), and let &omega; denote the first infinite <i>ordinal</i> (that is, the first position reached after we&#8217;ve exhausted all finite positions). Now, as we&#8217;ve already seen, if we have infinite rooms, we can house an extra guest even if they&#8217;re all full; that is, &#8501;<sub>0</sub> + 1 = &#8501;<sub>0</sub>. On the other hand, if we tack on an extra position after &omega; and all the finite ones, i.e. we have 1st,2nd,3rd,&#8230;,&omega;,<span style="font-family:serif;font-style:italic">a</span>th, then the <span style="font-family:serif;font-style:italic">a</span>th position turns out to be appreciably different to all the positions before it. In other words &omega; + 1 &ne; &omega;. Now, whereas with finite numbers where are adding one produces the same result for both ordinals and cardinals, for infinite numbers it makes a huge difference (in one case we simply end up with what we started with, and in the other we end up with something entirely new). It shouldn&#8217;t be too hard to see that, from that simple difference, whether you have are dealing with a cardinal or ordinal transfinite number is going to matter for arithmetic operations; we can no longer ignore the difference; cardinals and ordinals have to be considered as quite separate and distinct kinds of objects!</p>
<p>
At this point you might be trying to reconcile the fact that &#8501;<sub>0</sub> + 1 = &#8501;<sub>0</sub> with the previously observed fact that there are bigger cardinal infinities. How can we get to a bigger infinity if adding to &#8501;<sub>0</sub> ends up going nowhere? To answer that I&#8217;m going to need to discuss power sets. Given a set <span style="font-family:serif;font-style:italic">A</span> (and for now we&#8217;ll keep things informal, the finer technicalities of what actually is and is not a set will come further along our road), the <i>power set</i> of <span style="font-family:serif;font-style:italic">A</span> is the set of all possible subsets of <span style="font-family:serif;font-style:italic">A</span>. An example will help clarify. If we have a set <span style="font-family:serif;font-style:italic">A</span> =  {a, b, c}, then the power set of <span style="font-family:serif;font-style:italic">A</span> is &#8473;(<span style="font-family:serif;font-style:italic">A</span>) = {{}, {a}, {b}, {c}, {a,b}, {a,c}, {b,c}, {a,b,c}}. Thus each element of the set &#8473;(<span style="font-family:serif;font-style:italic">A</span>) is itself a set, and in particular, a subset of <span style="font-family:serif;font-style:italic">A</span>; note that we consider both the empty set and <span style="font-family:serif;font-style:italic">A</span> itself to be subsets of <span style="font-family:serif;font-style:italic">A</span>. With a little combinatorics you can see that if a set has <span style="font-family:serif;font-style:italic">n</span>  elements (where <span style="font-family:serif;font-style:italic">n</span> is finite), then its power set will have 2<sup><span style="font-family:serif;font-style:italic">n</span></sup> elements (to make a subset we have to decide if each element is either in, or out, of the subset, thus we have 2 choices, multiplied together <span style="font-family:serif;font-style:italic">n</span> times, or 2<sup><span style="font-family:serif;font-style:italic">n</span></sup>). The trick is that, using an argument that closely parallels the previous argument showing that there is not a one-to-one correspondence between the natural numbers and points in a continuum,  we can show that even if a set has infinite cardinality it&#8217;s power set will have a larger cardinality. Thus, borrowing notation from the finite case, &#8501;<sub>0</sub> &lt; 2<sup>&#8501;<sub>0</sub></sup>. Using this trick, which applies to any infinite set, we can develop an entire hierarchy of different orders of infinity:</p>
<blockquote><p>
&#8501;<sub>1</sub> = 2<sup>&#8501;<sub>0</sub></sup> &lt; &#8501;<sub>2</sub> = 2<sup>&#8501;<sub>1</sub></sup> &lt; &#8501;<sub>3</sub> = 2<sup>&#8501;<sub>2</sub></sup> &lt; &#8943;
</p></blockquote>
<p>A similar, but different, hierarchy of infinite ordinals also exists (above and beyond the obvious option of simply adding one to an existing infinite ordinal to get a larger one), spiralling ever higher, this time using the concept of tetration***** rather than exponentiation. Contrary to initial expectation, infinities exist in infinite variety. How many infinities are there? We cannot say, on pain of paradox, since such a statement would only reflect back on itself in a vicious circle of contradiction.</p>
<p>
While we began with just a hazy view of the infinite, the mists have cleared to reveal a strange and remarkable valley; a transfinite landscape with a veritable zoo of infinities of different kinds and sizes; it is, indeed, a whole new landscape of numbers and possibilities to explore; a hidden valley of the infinite. Running through the middle of the valley is a large river, and if we wade in we will find very deep waters. It is all a matter of asking the right questions. The question we can start with seems innocently simple:</p>
<blockquote><p>
Is the cardinality of points on the continuum bigger, smaller, or the same as &#8501;<sub>1</sub> = 2<sup>&#8501;<sub>0</sub></sup>?
</p></blockquote>
<p>The answer is deceptively complex. It can be established, with a little work, that the number of points in the continuum is not bigger than &#8501;<sub>1</sub>, which leaves us with either smaller, or the same size as &#8501;<sub>1</sub>. From there things get complicated quickly, and mired in a certain degree of technicality, but essentially the result is that the answer &#8220;doesn&#8217;t matter&#8221;. What I mean by that is we may assume that the number of points in the continuum is &#8501;<sub>1</sub> and no problems or contradictions will arise, yet at the same time we can equally well assume that the number of points in the continuum is <i>strictly less than</i> &#8501;<sub>1</sub> and still no problems or contradictions arise. Indeed, whether there is any cardinal number between &#8501;<sub>0</sub> and &#8501;<sub>1</sub> falls into this category. There is, in a sense, no &#8220;truth&#8221; here, merely preference.</p>
<p>
This highlights deep facts about mathematics. When our journey began we considered numbers, and fractions, and algebra. Relatively speaking these are fairly simple abstractions, and, more importantly, they are abstractions that we tend to use each and every day (particularly in the case of numbers and fractions). Through a mix of immediate concrete associations due to the relatively low level of abstraction, and the sense reality imbued by constant use and exposure, we tend to think of numbers, fractions, algebra, and even mathematics itself, as something real, fixed, and concrete. That is, we think of mathematics as describing some platonic reality, that the objects it describes, while abstract, have some real existence. It is natural, then, to think that a number between &#8501;<sub>0</sub> and &#8501;<sub>1</sub> either exists, or doesn&#8217;t exist, in some real and concrete way &#8212; yet that is not how things have worked out. Imagine being told that whether the number five existed or not was quite optional, arithmetic would work just fine either way! We have, in essence, been told that existence is merely a preference, not a reality; &#8220;truth&#8221; is up for grabs, an option rather than a cold hard absolute.</p>
<p>
Going all the way back to the first entry, <a href="http://jedidiah.stuff.gen.nz/wp/?p=4">On Abstraction</a>, things start to get a little clearer however. As long as we view mathematics as a matter of making effective and powerful abstractions from the real world, rather than describing some platonic universe, having a choice of abstraction doesn&#8217;t seem so bad. We can choose how to interpret the continuum to suit our needs &#8212; indeed, we can even reject transfinite arithmetic and opt for the intuitionist conception of the continuum if we wish; we choose the abstraction that best suits our purposes for the moment. You could view it as little different than choosing to work at the genetic level as a molecular biologist instead of the considering subatomic particles as a physicist would: the level and manner of abstraction matters only with regard to the level and manner of detail you wish to obtain in the way of results.  The more layers of abstraction we apply, the greater the chances of running into quandaries and choices; by abstracting away more and more detail, and by piling abstractions upon abstractions, we push further and further into the realm of pure possibility. This has the potential to lead us to strange and confusing trails, but it also gives us the power to see beyond our own limited horizons. In broadening our minds to embrace worlds of possibility we conceive of realities that transcend our conceptions, and probe our own reality in ways far beyond the limits evolution has shackled our perceptions with.</p>
<p>
It has been a steep climb, but we have left the plains of ordinary finite numbers far behind us. We&#8217;ve crested the peak, and found a world that is strange and new. It gives us a chance to stretch our minds and our conceptions, and to begin to change how we look at the world. Equally importantly, it gives us a foundation for the further climb to come, providing a glimpse of the dance between logic and mathematics that will follow. We passed by the crossroads of unreality some time ago, yet there is still a very long way to go.</p>
<p>
<small><br />
* If you are sure of your answer then either you already know a decent amount of transfinite theory, or you&#8217;re most likely wrong &#8212; don&#8217;t be disappointed about being wrong though; the very reason we need clear logical guidelines for reasoning about the infinite is <i>precisely because</i> our intuitions are woefully inadequate and misleading.</p>
<p>
** The story is usually attributed to David Hilbert, but this is my own spin on it; errors and lack of clarity that may have crept in via the retelling are, of course, mine.</p>
<p>
*** People have a tendency to object to this, and other similar claims, such as that 0.99999&#8230; is equal to 1. The easiest way to see this simple but slightly unintuitive fact is to note that we should be able to take 0.99999&#8230;, move the decimal place right one place, subtract 9, and arrive back at the same value (this is akin to shifting the hotel guests down one room to make a spare room at the front, but in reverse). That is, if <span style="font-family:serif;font-style:italic">x</span>=0.99999&#8230; we can say that 10<span style="font-family:serif;font-style:italic">x</span> &#8211; 9 = <span style="font-family:serif;font-style:italic">x</span>. A little simple algebraic manipulation quickly yields <span style="font-family:serif;font-style:italic">x</span>=1. This sort of sleight of hand with infinite expansions will also play a role if and when we come to p-adic numbers, and discover that, in that case, negative numbers are really just very big positive numbers!</p>
<p>
**** The wary should be asking how we can even have a first element among points in the continuum (keeping in mind, of course, that there are cunning ways of ordering fractions such that there is a first element). This question quickly wades into very deep waters indeed &#8212; we can appeal to the Well Ordering Principle, which essentially just asserts that this can be done, or its equivalents such as the Axiom of Choice, or Zorn&#8217;s Lemma; all of these are somewhat contentious and tricky. If you&#8217;re interested it is well worth doing a little reading about them. We may come to discuss these issues ourselves later when we start to cross back and forth between mathematics and logic further down the road with Topos theory.</p>
<p>
***** Tetration is kind of like exponentiation on steroids; or, more accurately, it&#8217;s the next layer of abstraction up: whereas the number in exponentiation counts the multiplications to be performed, the number in tetration counts the exponentiations to be performed. Thus the 4th tetration of 3, written <sup>4</sup>3 is equal to 3<sup>(3<sup>(3<sup>3</sup>)</sup>)</sup>, which is 3<sup>19683</sup>, or really rather remarkably large.<br />
</small></p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=17</wfw:commentRss>
		<slash:comments>26</slash:comments>
		</item>
		<item>
		<title>Permutations and Applications</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=15</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=15#comments</comments>
		<pubDate>Sun, 27 May 2007 23:06:19 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[General Algebra]]></category>
		<category><![CDATA[Group Theory]]></category>
		<category><![CDATA[Journey]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=15</guid>
		<description><![CDATA[Numbers are remarkably tricky. We tend not to notice because we live in a world that is immersed in a sea of numbers. We see and deal with numbers all the time, to the point where most basic manipulations seem simple and obvious. It was not always this way of course. In times past anything [...]]]></description>
			<content:encoded><![CDATA[<p>Numbers are remarkably tricky. We tend not to notice because we live in a world that is immersed in a sea of numbers. We see and deal with numbers all the time, to the point where most basic manipulations seem simple and obvious. It was not always this way of course. In times past anything much beyond counting on fingers was the domain of the educated few. If I ask you what half of 60 is, you&#8217;ll tell me 30 straight away; if I ask you to stop and think about how you know that to be true you&#8217;ll have to think a little more, and start to realise that there is a significant amount of learning there; learning that you now take for granted. Almost everyone uses numbers regularly every day in our current society, be it through money, weights and measures, times of day, or in the course of their work. Through this constant exposure and use we&#8217;ve come to instinctively manipulate numbers without having to even think about it anymore (in much the same way that you no longer have to sound out words letter by letter to read). That means that when we meet a new abstraction, like the symmetries discussed in <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">Shifting Patterns</a>, it seems comparatively complex and unnatural. In reality the algebra of symmetries is in many ways just as natural as the algebra of numbers, we just lack experience. Thus, the only way forward is to look at more examples, and see how they might apply to the world around us.</p>
<p>
<span id="more-15"></span><br />
In terms of examples I would like to take a step into the more abstract &#8212; rather than dealing with a physical example and determining an abstraction from it, we&#8217;ll start with a slightly abstract example and explore from there. The example I have in mind is that of permutations. By a permutation I simply mean a rearrangement of unspecified objects, mapping one position to another. We can view a permutation as a kind of wiring diagram, with the permutation:<br/><br />
<img src="images/permutation.png" width="420" height="298" alt="A permutation of three objects" title="A permutation of three objects"/><br/><br />
meaning that we shift whatever is in position 1 to position 3, whatever is in position 2 to position 1, and whatever is in position 3 to position 2. Hopefully you can see how such rearrangements are essentially what we were doing in <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">Shifting Patterns</a>, but here we aren&#8217;t starting out with a specific pattern in mind, but considering all such rearrangements in general.</p>
<p>
As before we can combine two rearrangements together to get another. In this case we simply connect one wiring diagram to the next and follow the paths from the top right the way to the bottom. We can then simplify that down to a direct wiring diagram as before.<br/><br />
<img src="images/permutation-composition.png" width="420" height="298" alt="Combining two permutations to obtain a new permutation" title="Combining two permutations to obtain a new permutation"/><br/></p>
<p>
The first thing to note is that the number of objects we are permuting, or wiring together, matters. If we take, as our first and simplest case, permutations of two items, then we find there are only two permutations: the null permutation where we do nothing (the first item is connected to the first item, and the second item is connected to the second item), and a simple swap where we reverse the items (connect the first item to the second, and the second item to the first). Using the algebraic terms we <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">established previously</a> we end up a two element algebra: let <span style="font-family:serif;font-style:italic">s</span> be the permutation where we swap, and then we have the rule:</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">ss</span> = &sdot;
</p></blockquote>
<p>(swapping the two items, then swapping them again, is the same as doing nothing)<br />
which completely describes our algebra*.</p>
<p>
On the other hand, as soon as we consider permutations of three objects we find things get more complicated. There are a total of 6 (that&#8217;s 3&times;2&times;1) permutations of three objects. If we select two basic permutations appropriately we can generate them all as various combinations of the two. There are, in fact, several different pairs we could select (though the resulting algebra will turn out to be the same, no matter how we do it &#8212; the names might change, but the underlying rules will be the same), and I&#8217;ve opted for these two<br/><br />
<img src="images/named-permutations.png" width="420" height="298" alt="Two permutations, labelled 'a' and 'b'" title="Two permutations, labelled 'a' and 'b'"/><br/><br />
where <span style="font-family:serif;font-style:italic">a</span> swaps the first two elements and leaves the third alone, and <span style="font-family:serif;font-style:italic">b</span> swaps the second two elements, and leaves the first alone. Now as with the permutations of two elements, if we swap a pair, and then swap them again, we end up back where we started, so we can see that we have the following two rules:</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">aa</span> = &sdot;<br/><br />
<span style="font-family:serif;font-style:italic">bb</span> = &sdot;
</p></blockquote>
<p>Now, however, we have the possibility of combining together <span style="font-family:serif;font-style:italic">a</span> and <span style="font-family:serif;font-style:italic">b</span>. We already saw that <span style="font-family:serif;font-style:italic">ab</span> results in the first item going to the third place, the second item to the first position, and the third item to the second position:<br/><br />
<img src="images/permutation-composition.png" width="420" height="298" alt="Showing the result of combining 'a' and 'b'" title="Showing the result of combining 'a' and 'b'"/><br/><br />
but if we swap things around to find <span style="font-family:serif;font-style:italic">ba</span> we get<br/><br />
<img src="images/permutation-composition-ba.png" width="420" height="298" alt="Showing the result of combining 'b' and 'a'" title="Showing the result of combining 'b' and 'a'"/><br/><br />
which reverses the situation, with the third item moving the first place, while the first and second items get bumped to second and third place respectively. So at the very least we know that we don&#8217;t have commutativity &#8212; that is <span style="font-family:serif;font-style:italic">ab</span>&ne;<span style="font-family:serif;font-style:italic">ba</span>. Instead we find the rule that defines this algebra is as follows:<br/><br />
<img src="images/permutation-equality.png" width="420" height="298" alt="Showing how 'aba' is the same as 'bab'" title="Showing how 'aba' is the same as 'bab'"/><br/><br />
We can see that this gives us all the permutations by counting up the combinations of <span style="font-family:serif;font-style:italic">a</span> and <span style="font-family:serif;font-style:italic">b</span> that haven&#8217;t been ruled out as being reducible to something simpler. We have</p>
<ol>
<li>The null permutation: &sdot;</li>
<li><span style="font-family:serif;font-style:italic">a</span></li>
<li><span style="font-family:serif;font-style:italic">b</span></li>
<li><span style="font-family:serif;font-style:italic">ab</span></li>
<li><span style="font-family:serif;font-style:italic">ba</span></li>
<li><span style="font-family:serif;font-style:italic">aba</span></li>
</ol>
<p>and anything with four or more <span style="font-family:serif;font-style:italic">a</span>s and <span style="font-family:serif;font-style:italic">b</span>s will be reducible. Why is that? Since <span style="font-family:serif;font-style:italic">aa</span> = &sdot; and <span style="font-family:serif;font-style:italic">bb</span> = &sdot; any sequence will have to alternate <span style="font-family:serif;font-style:italic">a</span>s and <span style="font-family:serif;font-style:italic">b</span>s, otherwise we can just cancel down consecutive pairs. On the other hand, if we have a sequence of more than three alternating <span style="font-family:serif;font-style:italic">a</span>s and <span style="font-family:serif;font-style:italic">b</span>s then we&#8217;ll have a sequence <span style="font-family:serif;font-style:italic">aba</span> or <span style="font-family:serif;font-style:italic">bab</span> that we can convert using the fact that <span style="font-family:serif;font-style:italic">aba</span> = <span style="font-family:serif;font-style:italic">bab</span>, and end up with a pair of consecutive <span style="font-family:serif;font-style:italic">a</span>s or <span style="font-family:serif;font-style:italic">b</span>s that we can then cancel down. For example, if we tried to have a sequence of four <span style="font-family:serif;font-style:italic">a</span>s and <span style="font-family:serif;font-style:italic">b</span>s like <span style="font-family:serif;font-style:italic">abab</span>, then we can say</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">abab</span> = <span style="font-family:serif;font-style:italic">a(bab)</span> = <span style="font-family:serif;font-style:italic">a(aba)</span> = <span style="font-family:serif;font-style:italic">(aa)ba</span> = &sdot;<span style="font-family:serif;font-style:italic">ba</span> = <span style="font-family:serif;font-style:italic">ba</span>
</p></blockquote>
<p>With a little thought you can see that this sort of procedure can reduce any sequence of four or more <span style="font-family:serif;font-style:italic">a</span>s and <span style="font-family:serif;font-style:italic">b</span>s down to one of three or less. So for permutations of three objects we get an algebra that is described by three rules:</p>
<blockquote><p>
<span style="font-family:serif;font-style:italic">aa</span> = &sdot;<br/><br />
<span style="font-family:serif;font-style:italic">bb</span> = &sdot;<br/><br />
<span style="font-family:serif;font-style:italic">aba</span> = <span style="font-family:serif;font-style:italic">bab</span>
</p></blockquote>
<p>
If we were to consider permutations of four items we would have 24 permutations (that&#8217;s 4&times;3&times;2&times;1) to deal with, and things would be more complicated yet again. Permutations of five items provide a total of 120 (5&times;4&times;3&times;2&times;1) permutations, and an even more complicated algebra with yet more subtle and interesting properties. </p>
<p>
There are two things that you should take notice of here. The first is that even simple changes to a pattern &#8212; as simple as changing the number of items involved &#8212; can give rise to very different dynamics. The character of the algebra that arises from permutations of two objects is very different from that of the three object permutation algebra, and four objects is different again. To reiterate the point: different patterns can have surprisingly different and remarkable dynamics. The second thing that you should be noticing is that while we can work with permutations as wiring diagrams and connect them up to see what combinations will result, ultimately everything about the dynamics of the permutations is contained within the algebra we get from it; and the algebra can be described and manipulated using very simple rules. While the pattern provides the algebra, the algebra in turn tells us everything we need to know about the dynamics of the pattern. The advantage of the algebra is that we can reduce the whole problem of patterns to the simple task of manipulating algebraic expressions according to particular rules. By abstracting up to the algebra, we&#8217;ve made the problem much easier to think about and manipulate.</p>
<p>
Hopefully by this stage you&#8217;re developing a feel for how this abstraction process works. With numbers we start with a collection and abstract away all the details save a single property: the quantity. Here we have something a little more complex; we start with a pattern and abstract away as much of the detail as possible, while still retaining some information about the nature of the pattern. That information can be efficiently encoded into a sort of algebra, in the same way that we encode information about quantity into symbols (numbers). The exact nature and rules of the algebra we generate is the information about the pattern that we have kept. Now, numbers allow us to reason about quantity in general via arithmetic, which we can reduce to a game of manipulating symbols. Our abstraction of pattern allows us to reason about patterns via their associated algebra, which we can also reduce to a game of manipulating symbols. We have turned thinking about patterns into a kind of arithmetic; and doing so allows us to be systematic in studying and analysing such patterns.</p>
<p>
This, of course, raises the question of why we should be interested in studying and analysing patterns at all. The same question can be asked as to why we should be interested in studying and analysing quantity. The difference is that our culture is steeped in analysis of and use of quantity; we take its usefulness for granted. So let&#8217;s step back, and ask why using numbers is useful. As was pointed out in <a href="http://jedidiah.stuff.gen.nz/wp/?p=5">The Slow Road</a>, numbers and quantity are useful because they are everywhere &#8212; we can apply quantitative analysis to almost everything (and often do, sometimes even where it isn&#8217;t appropriate). It is worth pointing out that patterns and symmetry are every bit as prevalent in the world. All around us things can be described in terms of their patterns. Pick any collection of objects you care to set your eyes upon, and they will form some manner of pattern; perhaps they will only have a trivial symmetry, or perhaps they will have more complex symmetries. The point is that, just like numbers, symmetries are all around us. The study of pattern and symmetry in the manner we&#8217;ve been describing is very new however, and this means it hasn&#8217;t entered the mainstream consciousness, nor the language, in the same way that numbers have. We don&#8217;t describe the world around us in the language of mathematical symmetry, at least not in the same way that we describe the world around us in terms of numbers. Slowly that will change, but it will take a very long time indeed (centuries probably). That means that, in the meantime, the areas to which the language of mathematical symmetry will be applied, and the people who will apply it, will be restricted to those already using advanced mathematical methods. Right now that tends to mean fields  such as physics and chemistry.<br />
To give examples of applying our abstraction of pattern to physics and/or chemistry runs the risk of delving into the technical details of those subjects, as well as requiring math that is currently beyond the scope of our discussion. For that reason, you&#8217;ll have to forgive me if I gloss over things quite liberally in what follows.</p>
<p>
Everything has a pattern, and symmetries associated with that pattern, even if it is just the trivial symmetry. In the case of chemistry the obvious thing to start looking at is molecules. Unsurprisingly, the structure of a molecule has a pattern that depends, to a large extent, on its constituent elements. More interestingly, molecules often have interesting symmetries. We can, using a naive view, picture a molecule as a pattern of coloured balls, not dissimilar to our patterns of coloured marbles discussed in <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">Shifting Patterns</a>. Of course the patterns are now in 3 dimensions rather than 2, and connections between the balls/marbles are important, but the fundamental idea is there. Consider, for instance, the following picture of the ammonium molecule (NH<sub>4</sub>):<br/><br />
<img src="images/Ammonium-3D-balls.png" width="420" height="411" alt="An ammonium molecule" title="An ammonium molecule"/><br/><br />
We can, with little trouble, consider the various ways in which we can rearrange the 4 indistinguishable hydrogen atoms and yet keep the underlying structure that makes it an ammonium molecule. That is, using our abstraction of pattern, we can describe an algebra that captures the features that make the pattern of four hydrogen atoms and one nitrogen atom a molecule of ammonium. Furthermore, we can do the same for any other molecule we care to consider. The exact algebra that results will differ from molecule to molecule, with the individual idiosyncrasies of the different algebras describing the the individual idiosyncrasies of the different molecules. To understand the particular nature of the associated algebra is to understand a great deal about the particular nature of the molecule.</p>
<p>
It is, of course, possible to do this sort of analysis just by staring at the patterns and never resorting to the sort of abstraction we&#8217;ve been discussing. This approach runs the risk of being both haphazard, and superficial. By contrast, working in terms of pattern abstraction algebras affords us the ability to be both comprehensive and systematic in our analysis. Rather than trying to divine properties out of thin air via visual inspection, we take the resulting algebra and, by merely pushing symbols around on a piece of paper, pick apart every last nuance of its behaviour. Indeed, this sort of analysis (which extends to a level of detail in characterising the algebra that we won&#8217;t touch on for some time) is now fundamental to understanding much in chemistry, from <a href="http://en.wikipedia.org/wiki/Spectroscopy">spectroscopy</a> to <a href="http://en.wikipedia.org/wiki/Crystallography">crystallography</a>. Similar approaches to patterns and symmetry of particles lead to a variety of important results in quantum physics. </p>
<p>
Our world is filled with patterns that are worth analysing with a systematic approach: understanding the peculiarities of the algebras associated to those patterns can tell us a great deal about our world. New applications of this theory to new fields are still regularly occurring. There is a quiet revolution underway that is changing how we see and describe the world, and the abstraction of pattern is at its heart.</p>
<p>
<small><br />
* Through this post I will continue to use algebra in a generic sense to describe the symbolic calculus that we can associate to a pattern. This is a reminder that, in mathematics, <i>algebra</i> also has a more precise definition that does not apply to the objects under consideration here. I <i>do not mean</i> algebra in that more specific sense when using the word here.<br />
</small></p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=15</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Paradoxes of the Continuum, Part II</title>
		<link>http://jedidiah.stuff.gen.nz/wp/?p=14</link>
		<comments>http://jedidiah.stuff.gen.nz/wp/?p=14#comments</comments>
		<pubDate>Tue, 01 May 2007 19:11:16 +0000</pubDate>
		<dc:creator>Leland McInnes</dc:creator>
				<category><![CDATA[Calculus]]></category>
		<category><![CDATA[Journey]]></category>
		<category><![CDATA[Logic]]></category>
		<category><![CDATA[Topology]]></category>

		<guid isPermaLink="false">http://jedidiah.stuff.gen.nz/wp/?p=14</guid>
		<description><![CDATA[Mathematical arguments can be very persuasive. They lead inexorably toward their conclusion; barring any mistakes in the argument, to argue is to argue with the foundations of logic itself. That is why it is particularly disconcerting when a mathematical argument leads you down an unexpected path and leaves you face to face with a bewildering [...]]]></description>
			<content:encoded><![CDATA[<p>Mathematical arguments can be very persuasive. They lead inexorably toward their conclusion; barring any mistakes in the argument, to argue is to argue with the foundations of logic itself. That is why it is particularly disconcerting when a mathematical argument leads you down an unexpected path and leaves you face to face with a bewildering conclusion. Naturally you run back and retrace the path, looking, often in vain, for the wrong turn where things went off track. People often don&#8217;t deal well with challenges to their world-view. When a winding mountain path leads around a corner to present a view of a new and strange landscape, you realise that the world may be much larger, and much stranger, than you had ever imagined. When faced with such a realisation, some people flee in horror and pretend that such a place doesn&#8217;t exist; the true challenge is to accept it, and try to understand the vast new world. It is time for us to round a corner and glimpse new and strange landscapes; I invite you to follow me down, in the coming entries, and explore the strange hidden valley.</p>
<p>
<span id="more-14"></span><br />
We begin with a mere glimpse of what is to come along this road. Still, even this glimpse has been enough to frighten some. Indeed the (potentially apocryphal) tale of the first man to tread this road, a member of the Pythagorean Brotherhood, makes this very clear. The story goes that the insight came to him while travelling  by ship on the Aegean. Excited, he explained his cunning proof to the fellow members of the Brotherhood aboard the boat. They were so horrified by the implications that they immediately pitched him overboard, and he drowned. For the secretive Pythagorean Brotherhood, who believed that reality was simply numbers, mathematics was worth killing over.</p>
<p>
So what was this truth that the Brotherhood was willing to kill to keep secret? The fact that the square root of 2 is not expressible as a fraction. The proof of this is surprisingly simple, and runs roughly as follows. Let&#8217;s presume that &radic;2 can be expressed as a fraction, and so we have numbers <span style="font-family:serif;font-style:italic">n</span> and <span style="font-family:serif;font-style:italic">m</span> such that &radic;2 = <span style="font-family:serif;font-style:italic">n</span>/<span style="font-family:serif;font-style:italic">m</span>. As you may recall from <a href="http://jedidiah.stuff.gen.nz/wp/?p=7">A Fraction of Algebra</a>, a particular fraction is really just a chosen representative of an infinite number of ways of expressing the same idea &#8212; we can choose whichever representative we wish. For the purposes of the proof we will assume that we have chosen <span style="font-family:serif;font-style:italic">n</span>/<span style="font-family:serif;font-style:italic">m</span> to be as simple as possible (i.e. there is no common factor that divides both <span style="font-family:serif;font-style:italic">n</span> and <span style="font-family:serif;font-style:italic">m</span>); you may want to verify for yourself that such a thing is always possible (it&#8217;s not too hard). Now, using the allowable manipulations of algebra we have:</p>
<blockquote><p>
&radic;2 = <span style="font-family:serif;font-style:italic">n</span>/<span style="font-family:serif;font-style:italic">m</span><br/><br />
&rArr; 2 = <span style="font-family:serif;font-style:italic">n</span><sup>2</sup>/<span style="font-family:serif;font-style:italic">m</span><sup>2</sup><br/><br />
&rArr; 2<span style="font-family:serif;font-style:italic">m</span><sup>2</sup> = <span style="font-family:serif;font-style:italic">n</span><sup>2</sup><br/>
</p></blockquote>
<p>Now 2<span style="font-family:serif;font-style:italic">m</span><sup>2</sup> is an even number no matter what number <span style="font-family:serif;font-style:italic">m</span> is, so <span style="font-family:serif;font-style:italic">n</span><sup>2</sup> must be an even number as well. However, an odd number squared is always odd (again, this is worth verifying yourself if you&#8217;re uncertain, again it isn&#8217;t hard). That means the only way <span style="font-family:serif;font-style:italic">n</span><sup>2</sup> can be even is if <span style="font-family:serif;font-style:italic">n</span> itself is even. That means there must be some number <span style="font-family:serif;font-style:italic">x</span> such that 2<span style="font-family:serif;font-style:italic">x</span> = <span style="font-family:serif;font-style:italic">n</span>. But then</p>
<blockquote><p>
2<span style="font-family:serif;font-style:italic">m</span><sup>2</sup> = <span style="font-family:serif;font-style:italic">n</span><sup>2</sup><br/><br />
&rArr; 2<span style="font-family:serif;font-style:italic">m</span><sup>2</sup> = (2<span style="font-family:serif;font-style:italic">x</span>)<sup>2</sup><br/><br />
&rArr; 2<span style="font-family:serif;font-style:italic">m</span><sup>2</sup> = 4<span style="font-family:serif;font-style:italic">x</span><sup>2</sup><br/><br />
&rArr; <span style="font-family:serif;font-style:italic">m</span><sup>2</sup> = 2<span style="font-family:serif;font-style:italic">x</span><sup>2</sup><br/>
</p></blockquote>
<p>and so <span style="font-family:serif;font-style:italic">m</span><sup>2</sup>, and hence <span style="font-family:serif;font-style:italic">m</span>, must also be even. If both <span style="font-family:serif;font-style:italic">n</span> and <span style="font-family:serif;font-style:italic">m</span> are even then they have a common factor: 2; yet we specifically chose <span style="font-family:serif;font-style:italic">n</span> and <span style="font-family:serif;font-style:italic">m</span> so that wouldn&#8217;t be the case. Clearly, then, no such <span style="font-family:serif;font-style:italic">n</span> and <span style="font-family:serif;font-style:italic">m</span> exist, and we simply can&#8217;t express &radic;2 as a fraction!*</p>
<p>
This result (if not necessarily the proof) is well known these days; sufficiently so that many people take it for granted. It is therefore worth probing a little deeper to see what it actually means, and perhaps gain a better understanding of why it so incensed the Pythagorean Brotherhood. The first point to note is that &radic;2 does crop up in geometry: if you draw a square with sides of unit length (and we can always choose our units such that this is so) then, by Pythagoras&#8217; Theorem, the diagonal of the square has length &radic;2. That, by itself, is not necessarily troubling; but consider that we&#8217;ve just seen that &radic;2 is not expressible as a fraction. Recall that a fraction can be considered <a href="http://jedidiah.stuff.gen.nz/wp/?p=7">a re-interpretation of the basic unit</a>, and you see that what we&#8217;re really saying is that there simply doesn&#8217;t exist a unit of length such that the diagonal of the square can be measured with respect to it. If you were measuring a length in feet and found that it was between 2 and 3 feet then you could simply change your units and work in inches &#8212; the distance is hopefully an integer number of inches. If inches aren&#8217;t accurate enough we can just use a smaller unit again (eighths of an inch for example). What we are saying when we say that &radic;2 cannot be expressed as a fraction is that, no matter how small a unit we choose, we can still <i>never</i> accurately measure the diagonal of the square. Because we can simply keep dividing indefinitely to get smaller and smaller units, that means we need <i>infinitely</i> small units. And note the difference here:, unlike in <a href="http://jedidiah.stuff.gen.nz/wp/?p=9">Part I</a>, arbitrarily small is not good enough, we need to go past arbitrarily small to actually infinitely small. For the Pythagoreans infinity was unreachable &#8212; something that could never be completed or achieved &#8212; and thus an infinitely small unit could never be realised. Therefore, in their world-view, the diagonal of a square couldn&#8217;t exist since its length was an unreachable, unattainable, distance**. That, as you an imagine, caused quite a bit of cognitive dissonance! Hence their desire to pretend such a thing never happened.</p>
<p>
As you can see, it turns out (even though it may not have looked that way at first) that we are really butting our heads up against infinity again, just from a different direction this time. Things get worse however: if we had a line of length 2 then there surely exists a point somewhere along that line that is a distance of &radic;2 away from the origin. We have just seen, however, that such a distance is not one we can deal with in terms of fractions. If we were to put points at every possible fractional distance between 0 and 2 we would have a hole at &radic;2, and continuous lines don&#8217;t have holes in them. A new problem starts to raise its head.</p>
<p>
If we wish to have a continuum we have to fill in all the holes. The question is how we can do that &#8212; where exactly are the holes? And, for that matter, how many holes are there? The first of these questions turns out to be rather easier than the second (which we will address next time we venture down this fork of the road). The trick to finding holes is to note that, since fractions allow us the arbitrarily (if not infinitely) small, we can get arbitrarily close to any point in the continuum, holes included. That is, while we can&#8217;t actually express a hole in terms of fractions, we can sidle up as close beside it as we like using only fractions. And that means we must reach again for the useful tools of <a href="http://jedidiah.stuff.gen.nz/wp/?p=9"><i>distance</i> and <i>convergence</i></a> to determine that we are getting closer and closer to, and hence converging to, a hole.</p>
<p>
For our current purposes the definition of distance between numbers defined in <a href="http://jedidiah.stuff.gen.nz/wp/?p=9">Part I</a> will be sufficient. What we want to do is figure out a way to ensure that a sequence of fractions converges &#8212; that is, that it gets closer and closer to <i>something</i>, without necessarily knowing what the something is. The trick to this is to require that the distance between different terms in the sequence gets smaller and smaller. In this way we can slowly but surely squeeze tighter and tighter about a limit point, without necessarily knowing what it is that we are netting. More formally, if we have an infinite sequence <span style="font-family:serif;font-style:italic">S</span><sub>1</sub>, <span style="font-family:serif;font-style:italic">S</span><sub>2</sub>, <span style="font-family:serif;font-style:italic">S</span><sub>3</sub>, &#8230; then we require that for any &epsilon; > 0 there exists an integer <span style="font-family:serif;font-style:italic">N</span>&ge;1 such that, for all <span style="font-family:serif;font-style:italic">m</span>, <span style="font-family:serif;font-style:italic">n</span>&gt;<span style="font-family:serif;font-style:italic">N</span> ,</p>
<blockquote><p>
|<span style="font-family:serif;font-style:italic">S</span><sub><span style="font-family:serif;font-style:italic">m</span></sub> &minus; <span style="font-family:serif;font-style:italic">S</span><sub><span style="font-family:serif;font-style:italic">n</span></sub>|&lt;&epsilon;
</p></blockquote>
<p>(recalling that |<span style="font-family:serif;font-style:italic">x</span> &minus; <span style="font-family:serif;font-style:italic">y</span>| gives the distance between numbers <span style="font-family:serif;font-style:italic">x</span> and <span style="font-family:serif;font-style:italic">y</span>). Such a sequence is called a <i>Cauchy sequence</i>. Now, since any Cauchy sequence converges to something, we can identify (consider equivalent) the sequence and the point at its limit. Furthermore, since we know that using fractions we can get arbitrarily close to any point on the continuum, there must be some sequence of fractions that converges to that point, and so if we consider all the possible infinite Cauchy sequences of fractions, we can cover all the points on the continuum &#8212; we are assured that no holes or gaps can slip in this time. We&#8217;ve caught all the holes &#8212; without even having to find them!</p>
<p>
It is worth looking at an example: can we find a sequence of fractions converges to &radic;2?  Consider the decimal expansion of &radic;2 which starts out 1.41421&#8230; and continues on without any discernible pattern; clearly the sequence 1, 1.4, 1.41, 1.414, 1.4142, 1.41421, &#8230; (where the <span style="font-family:serif;font-style:italic">n</span>th term agrees with &radic;2 for the first <span style="font-family:serif;font-style:italic">n</span>&minus;1 decimal places) converges to &radic;2. More importantly each term can be rewritten as a fraction since each term has only finitely many non-zero decimal places; for example 1.4 = 14/10 and 1.4142 = 14142/10000 etc. Finally it is not hard to see that this sequence is a Cauchy sequence. We can do the same trick for any other decimal expansion, arriving at a Cauchy sequence that converges to the point in question. Of course there are many other Cauchy sequences of fractions that will converge to these values: we are dealing with something similar to our dilemma with fractions when we found that there were an infinite number of different pairs of natural numbers that all described the same fraction. In that case we  simply selected a particular representative pair that was convenient (and could change between different pairs that represented the same fraction if it was later convenient to think of the fraction that way). We can do the same here: noting that a point is described by an infinite number of Cauchy sequences, we can simply select a convenient representative sequence to describe the point. For our purposes the sequence constructed via the decimal expansion will do nicely &#8212; in some sense you can think of the Cauchy sequence as an infinite decimal expansion.</p>
<p>
Now that we at least have some idea of what these sequences might look like, it is time to take a step back and consider what is actually going on here. Back in <a href="http://jedidiah.stuff.gen.nz/wp/?p=5">The Slow Road</a> we constructed natural numbers as a property of collections of objects. Then, in <a href="http://jedidiah.stuff.gen.nz/wp/?p=7">A Fraction of Algebra</a>, we created fractions to allow us to re-interpret an object within a collection. This was another layer of abstraction &#8212; fractions were not really numbers in the same way that natural numbers were &#8212; fractions were a way of re-interpreting collections, and we could describe those re-interpretations by pairs of natural numbers. Perhaps rather providentially it turned out that the rules of algebra, the rules of arithmetic that were true no matter what natural numbers we chose, also happened to be true no matter what fractions we chose. It is this stroke of good fortune, combined with the fact that certain fractions can take the role of the natural numbers, that allows us to treat what are really quite different things in principle (fractions and natural numbers) as the same thing in practice: for practical purposes we usually simply consider natural numbers and fractions as &#8220;numbers&#8221; and don&#8217;t notice that, at heart, they are fundamentally different concepts. Now we are about to add a new layer of abstraction, built atop fractions, to allow us to describe points in a continuum. While all that was required to describe the re-interpretation of objects that constituted a fraction was a pair of numbers, points in a continuum can only*** be described by an infinite Cauchy sequence of fractions. Thus, in the same way that natural numbers and fractions are actually very different object, so fractions and points in a continuum are quite different. Again, however, we find that when we define arithmetic on sequences (which occurs in the obvious natural way) they all behave appropriately under our algebraic rules. When we consider that it is easy enough to find sequences that behave as fractions (any constant sequence for instance) it is clear that, again, for practical purposes, we can call these things numbers and assume we&#8217;re talking about the same thing regardless of whether we are actually dealing with natural numbers, fractions, or points in a continuum.</p>
<p>
It should be pointed out that sometimes these distinctions are actually important. A simple example is computer programming, which does bother to distinguish floating point numbers (ultimately fractions) from integers. You can usually convert or cast from one to the other via a function (and at times that function can be implicit), but the distinction is important. Later we will start getting into mathematics where the distinction becomes important.</p>
<p>
So, now that we have this construction, several layers of abstraction deep, that allows us to describe the continuum, does it resolve the problem the Pythagorean Brotherhood struggled with? Certainly within the continuum there is a point corresponding to &radic;2, but even with our construction it is the <i>limit</i> of an infinite sequence &#8212; we still require a completed infinity. Of course accepting the idea of a completed infinite would get us out of this conundrum; what we require is a coherent theory of the completed infinite &#8212; were we to have that, then we needn&#8217;t fear the idea as the Pythagorean Brotherhood did. The next time we venture along this particular road we will discuss just such a theory, and explore the remarkable transfinite landscape that it leads to. We would be remiss to conclude here, however, without noting that there is some dissent on this topic. While the theory of the continuum based on completed infinites we will cover is remarkably widely accepted and used, there are still those who do not wish to have to deal with the completed infinite. So what is the alternative?</p>
<p>
 The idea is to construct a continuum using infinitesimals: a number &epsilon; such that we have &epsilon;<sup>2</sup>=0, yet &epsilon;&ne;0. Using such a value we can create a continuum without holes as desired. If adding a seemingly arbitrary new element to the number system seems like cheating, remember that both fractions, and the infinite decimals via Cauchy sequences, are just as much artificial additions to the number sequence &#8212; they just happen to be ones we&#8217;re familiar with and now take for granted. The real dilemma is that, assuming the required properties of infinitesimals, we can deduce contradictions. As we noted at the start of this post, when a mathematical argument leads you somewhere you don&#8217;t wish to go you are left having to challenge the very foundations of logic itself. Surprisingly, that turns out to be the resolution: smooth infinitesimal analysis rejects <a href="http://en.wikipedia.org/wiki/Law_of_the_excluded_middle">the law of the excluded middle</a>. The logic used for this alternative conception of the continuum rejects the idea that, given a proposition,  either the proposition is true, or its negation is true. That means that saying that <span style="font-family:serif;font-style:italic">x</span>=<span style="font-family:serif;font-style:italic">y</span> is not true, does <i>not</i> mean that <span style="font-family:serif;font-style:italic">x</span>&ne;<span style="font-family:serif;font-style:italic">y</span>. This sounds like nonsense at first, because we generally take the law of excluded middle for granted, and it is ingrained in our thinking. We have to remember, however, that this is a theory dealing in <i>potential</i>, but not completed infinites, and it is that key word &#8220;potential&#8221; that helps clarify things. Consider two numbers <span style="font-family:serif;font-style:italic">x</span> and <span style="font-family:serif;font-style:italic">y</span> that have infinite decimal expansions; are they equal, or are they unequal? We can check the first decimal place, and they might agree; that does not mean they are equal, they might disagree further down; nor does it mean they are unequal, since they might indeed agree. We can check the first billion decimal places, and they might still agree; that does not mean they are equal, since it might be the billion and first decimal place at which they disagree; and yet we still can&#8217;t conclude they are unequal &#8212; they&#8217;ve agreed so far and could continue to do so. We even check the first 10<sup>20</sup> decimal places, and still we can&#8217;t conclude either way whether <span style="font-family:serif;font-style:italic">x</span> and <span style="font-family:serif;font-style:italic">y</span> are equal or unequal. Because we can never complete the infinity and check <i>all</i> the decimal places, unless we have more information (such as that both number are integers****), it is not possible to conclude either way &#8212; we have an in between state where the numbers are neither equal, nor unequal, and it is this in-between possibility that causes the law of excluded middle to fall apart. To say that <span style="font-family:serif;font-style:italic">x</span>=<span style="font-family:serif;font-style:italic">y</span> is not true simply means we have not yet concluded that <span style="font-family:serif;font-style:italic">x</span>=<span style="font-family:serif;font-style:italic">y</span>, but that does not require that we must have concluded <span style="font-family:serif;font-style:italic">x</span>&ne;<span style="font-family:serif;font-style:italic">y</span> since we may still be torn in between, unable to reach a conclusion. As strange as this sounds at first, it actually provides a surprisingly natural and intuitive model of the continuum &#8212; and a remarkably different one from the classical one we will be developing. Enough sidetracks, however; it is time to return to the path.</p>
<p>
We have rounded the bend, and can make out the rough expanse of the landscape below, but the land itself remains unexplored, and potentially quite alien. The next time we return to this road we&#8217;ll try and understand the implications of a continuum of completed infinites, including a variety of initially unsettling results. In the meantime, however, we will return the study of <a href="http://jedidiah.stuff.gen.nz/wp/?p=13">patterns and symmetry</a>, and try and build a robust theory from our simple examples.</p>
<p>
<small><br />
* This is, of course, a precis version of the proof. The devil is always in the details, and many details here have been glossed over as obvious, or left for the reader to verify. If you are interested in the nitty-gritty however, I recommend you try <a href="http://au.metamath.org/mpegif/sqr2irr.html">the Metamath proof of the irrationality of &radic;2</a>. Each and every step in the proof is referenced and linked to an earlier theorem previously proved. By following the links you can drill all the way down to fundamental axioms of logic and set theory. If you don&#8217;t care to follow the details yourself, you might note that, in this (extremely) explicit form, the proof  can be machine verified.</p>
<p>
** If you think you can get out of this by just starting with the diagonal as your unit of measure you will simply find that now the sides of the square are unmeasurable distances. The sides and diagonal of the square are <i>incommensurable</i> &#8212; we can&#8217;t measure both with the same units, no matter how fine a unit of measure we choose.</p>
<p>
*** Technically other methods of describing such points exist. Indeed a very common formal approach is <a href="http://en.wikipedia.org/wiki/Dedekind_cut">Dedekind cuts</a>. Ultimately, however, Dedekind cuts represent more detail than we need right now, and will serve more as a distraction than anything. The interested reader is, however, encouraged to investigate them, and puzzle out why I chose to go with Cauchy sequences here.</p>
<p>
**** Why does knowing that both numbers are integers help? because integers have fixed decimal expansions &#8211; the first decimal place is necessarily the same as all the rest: 0. As long as they agree up to the first decimal place, we are done. An exercise for the interested reader: what other knowledge about the numbers might allow us to conclude equality or inequality?<br />
</small></p>
]]></content:encoded>
			<wfw:commentRss>http://jedidiah.stuff.gen.nz/wp/?feed=rss2&amp;p=14</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
	</channel>
</rss>
