HOME

Essays on the nature and role of mathematical elegance

Show original manuscript
(2)

On reasoning and pondering.

Of all our thinking activities, one class stands out very clearly, viz. all manipulations that are formalized --or could readily be so-- by techniques such as arithmetic, formula manipulation, symbolic logic, etc., I shall denote these activities by the term "reasoning", a term of which I hope that it is sufficiently descriptive. A single name seems justified, because they have a few common characteristics.

First of all, whenever they have been applied, in principle there never needs to be an argument, whether they have been applied correctly or not, for each step (which is always from a finite repertoire) can be checked.

Secondly, as soon as it has been decided in sufficient detail, what has to be achieved by them, it is no longer a problem how to achieve it.

Thirdly --and this is not independent from the first two-- we know how to teach them: arithmetic at primary schools, formula manipulation at secondary schools, and symbolic logic at universities.

Fourthly, we are very good at doing modest amounts of reasoning. When large amounts of it are needed, however, we are powerless without mechanical aids. Multiplying two two-digit numbers is something we can all do; for the multiplication of two five-digit numbers most of us would prefer the assistance of pencil and paper; the multiplication of two hundred-digit numbers is a task that, even with the aid of pencil and paper, most of us would not care to undertake.

The various instances of reasoning are emotionally appreciated quite differently, so differently as a matter of fact, that quite a few people (I discovered) wondered whether arithmetic should be classified as a thinking activity. But let us be careful: less than 500 years ago a professor of mathematics at one of Europe's universities stated that for the more gifted and industrious student it was not impossible to master long divisions: his students had to do it with Roman numerals! The majority of us might feel that the introduction of the decimal number system has reduced arithmetic to a boring routine, but that is probably only, because most of us are so poor at arithmetic, and can do no more than laboriously applying the routine steps. Calculating prodigies, however, have many different ways of arriving at a result, and as a consequence, can get quite excited by their mental gyrations, excited because they can search for a shorter way.

As said: we are very good at modest amounts of reasoning. In many arguments, however, the amount of reasoning used often becomes the stumbling block, and I therefore relate the effectiveness of the way in which we have arranged our thoughts, to the extent in which we have been able to reduce the demands on our reasoning powers.

Let me give now just one example. In the late 18th century a German schoolmaster gave --with the intention of keeping his pupils busy for another hour-- the task of adding hundred terms of an arithmetic progression to a class of little boys who, of course, had never heard of arithmetic progressions. (To quote E.T.Bell: "The problem was of the following sort: 81297 + 81495 + ... + 100988, where the step from one number to the next is the same all along (here 198) and a given number of terms (here 100) are to be added.") The youngest pupil, however, wrote down the answer instantaneously and waited, gloriously, with his arms folded, for the next hour while his classmates toiled: at the end of the hour it turned out that little Johan Friedrich Carl Gauss had been the only one to hand in the correct answer. Young Gauss had seen instantaneously how to sum such a series analytically: the sum equals the number of terms, multiplied by the average of the first and last term. In two respects this is a classical example: firstly young Gauss produced the answer about a thousand times as fast as his classmates, secondly he was the only one to produce the correct answer. So much for the effective ordering of one's thoughts!

Prior to the existence of automatic computers the need to reduce, whenever possible, the demands on our limited reasoning powers was obvious. With today's possibility of mechanizing not only numerical computations, but also other formal manipulations that we have classified as "reasoning", this need may seem less urgent. Some seem even to believe that the need no longer exists, but I believe that the latter are mistaken. Firstly, laboriously carrying out great numbers of avoidable steps of reasoning remains a waste, whether mechanized or not. Secondly, the discovery of a way of avoiding a large amount of reasoning often transcends the specific instance: the formula for the summation of arithmetic progressions did not only save young Gauss on that specific occasion an hour of hard labour. And thirdly, unless we can control --nearly in the sense of: "prevent"-- the growth of the amount of reasoning needed, mechanization will not help very long: without special precautions, that growth tends to be exponential. (The fact that, already now, so-called Artificial Intelligence projects tend to be most demanding as far as computing power is concerned should be a warning.) My conclusion is that the need to reduce, whenever possible, the amount of reasoning needed, is undiminished. In view of the fact that we are now trying to accomplish so much more difficult things than in the pre-computer age, I am even willing to argue that the need has increased.

*                 *
*

In order to learn as much as I could about how the amount of reasoning needed is reduced most effectively, I decided to have a closer look at (known) mathematical solutions of acknowledged elegance. In the beautiful arguments I encountered, the amount of reasoning has, indeed, been reduced very effectively: they are very short.

We should bear in mind, however, that just the criterion of length is often too crude. Sometimes, arguments have been shortened by omission: a big step is made, the writer has just been too lazy to spell it all out, and the reader has to supply the missing reasoning for himself; that is clearly not the elegance we are looking for. Other arguments have been kept short by an appeal to knowledge that is often so extensive, that one cannot escape the feeling that an egg is being cracked with a sledgehammer; and that isn't the kind of elegance we are looking for either. An active mathematician commands a great manipulative agility, an experienced mathematician has an extensive knowledge, but I think that a wise one tries to use eventually as little as possible of either.

*                 *
*

I did some experiments with the following simple problem in classical mechanics. Given a flat earth, a homogeneous acceleration of gravity g, and neglecting air resistance, we are asked to determine at what distance a cannon ball hits the ground again, if it is shot away with an initial velocity v and angle phi with the horizontal direction. I have posed this problem to a number of grown-up professional mathematicians. A surprisingly large fraction could not really solve it --"It was too long ago that they had done problems like that."--, a few applied mathematicians solved it in a straightforward manner by writing down the differential equations of motion, solving them, adjusting the constants, adjusting the constants so as to satisfy the initial conditions, and finally setting y = 0 in the equation for the trajectory. (The story doesn't tell whether the eventual simplicity of the answer made them suspicious that they could have reached the answer in a simpler manner.) Those that could not solve it and those that solved it in the above manner, essentially reacted in the same way: they tried to solve the problem by means of reasoning techniques that were their daily routine; for the majority this routine pattern was insufficient, only the routine pattern of the applied mathematicians enabled those to solve the problem (but only after a fashion, as well shall see in a moment).

Two of them reacted quite differently: they gave the answer instantaneously because they knew it. And they knew it because they had solved the problem at the age of 14 --when both had posed it to themselves-- by a very simple argument, so simple that it is impossible to forget it once you have seen it. Their argument had to be simple because at that time 14-year old boys had not had analytical geometry --and so they did not know what a parabola was--, and they did not know what differential equations were, let alone that they knew how to solve them. They did not know much more than the definition of velocity --"the rate of change of position"-- and the definition of acceleration --"the rate of change of velocity"-- and a rather informal introduction to Newtonian mechanics. The boys' argument was as follows.

Because the horizontal speed is constant -- vx = v*cos(phi) -- the required distance is equal to t*vx, where t is the time of travel through the air. During the time of travel, the vertical speed has changed from vy = v*sin(phi) --for reasons of symmetry!-- to –vy, i.e the total decrement of the vertical speed equals 2*vy. Because the rate of change of vertical velocity is given to be g, we derive t = 2*vy/g, and the required answer is therefore

2*vx*vy
g

(an answer of which all except the factor of 2 can be derived by dimensional analysis, a derivation which essentially repeats the boys' argument.)

The little experiment is interesting in two ways. Firstly we can try to appreciate why the boy's argument is so much more attractive than the applied mathematicians' derivation of the parabolic trajectory. It is, because in the boys's argument the reasoning consists of two different parts: a --very simple-- argument about the horizontal motion and an --also very simple-- argument about the vertical motion, the two being combined by the very "narrow" interface of t, the time of travel through the air. This simplicity is lost as soon as one writes down the equation of the trajectory. Secondly I don't think it very noteworthy that the little boys each found the argument: they used about the only tools they had. But it is worth observing that the grown-up mathematicians immediately tried to solve the problem with the familiar reasoning apparatus of their daily routines! The ones that did not solve the problem should have realized the unsuitability of their daily used equipment --and should, to my taste, have found the boys' argument--; the applied mathematicians that solved the differential equations should, to my taste, have realized that they were cracking an egg with a sledgehammer --an activity I don't like, no matter how effective a sledgehammer may be for cracking eggs--.

From this experiment I concluded (perhaps a little bit rapidly!) that while many mathematicians acquire a great manipulative agility in certain areas, and become very familiar with certain patterns of reasoning, relatively few tend to challenge the "adequacy" of those methods when faced with a "new" problem --where "inadequacy" may present itself in two ways: either by the inability to solve the problem at all, or by the inability to solve the problem with an amount of reasoning near minimum--. Questions like "Is all that reasoning going to be sufficient?" and "Is all that reasoning really necessary?" seems to be ignored.

This is a very sad observation, because it is quite clear that a versatile and effective problem solver must ask himself these questions all the time. And when, evidently, many otherwise quite capable mathematicians do not ask themselves these questions all the time, I can only suspect that it is because they haven't been taught to do so. Two observations, indeed, point in that direction, the observation of our universities and the observation of our scientific journals.

For centuries we know two ways of transmitting knowledge, insights, and abilities to the next generation. The one is the tradition of the guilds, in which the young apprentice works for seven meagre years with a master, absorbing the skills implicitly, by "osmosis" so to speak. The other is the prevailing tradition at the universities, where via explicit --and recorded!-- formulation teachers try to bring knowledge and insights into the public domain. Till this very moment both teaching techniques are applied. Whereas, for instance, the physicists adhere very strongly to the tradition of the universities, the medical training to a large extent takes place as in the guilds. Mathematics is in a somewhat in-between position: mathematical results are taught quite openly, on the question how to do mathematics, however, most teachers are remarkably silent. (And we can hardly blame them for that. Most mathematicians quite understandably shun the "soft" subject of heuristics, they feel that they can apply it, but that they lack the ability to teach it. Add to that a cultural environment in which "thinking" is a taboo subject, and their silence on how to do mathematics becomes quite understandable.)

Secondly, the same taboo controls the style of most publications. The final results are published, but seldomly the reasons why the author looked for them, and seldomly the way in which the author arrived at them. Such "background noise" is regarded by many editors as irrelevant, and, as a result, by many authors as "no one's business". (To sketch the heuristics and to disclose some of the blind alleys one went into is really unusual: in a number of cases where I did so, the referees' report drew attention to my refreshing "courage". The mere fact that the referees use such terminology in their reports tells quite a tale: describing, or even indicating, how one has arrived at one's results is evidently appreciated as "exposing oneself".) It is one of my purposes to break this taboo: even if we cannot teach the skill of how to do mathematics as solidly as we can teach "mathematics itself", the very least we can do is not to hide the heuristics we used ourselves.

*                 *
*

Besides the explicit and formal thinking that we have called "reasoning", a second thinking activity has now become apparent: all the considerations that help the experienced problem solver in choosing the most effective argument. I think that this second kind of thinking differs so much from the first one, that it deserves a special name; let us call it "pondering".

At least in the early days of Artificial Intelligence, the validity of the distinction between what we have called "reasoning" and "pondering" respectively, has been challenged violently, but that need not worry us. The Artificial Intelligentsia argues --and perhaps they still do!-- that the difference between reasoning and pondering was not essential because, ultimately, both would be reduced to the same activity: symbol manipulation. But the objections of the Artificial Intelligentsia don't worry me --and should not worry you-- for the following reasons. Firstly, if they think it profitable to view all mental activity as symbol manipulation, that is their business, but no one is forced to share their belief. I, for my part, don't think it very profitable (unable to see, for instance, how it can help me to view an important mental activity like falling in love --either with a sweet girl or with a sweet argument-- as an act of symbol manipulation). Secondly I have learned to view the dichotomy between "essential" and "gradual"differences --like most philosophical terms-- with the gravest suspicion, for it is usually fruitless and misleading: to present something --even if it seems technically allowed-- as a "gradual" and, therefore, "inessential" difference becomes a distortion each time the "gradual" difference is large enough. (Which computing scientist has not learned that a difference of only several orders of magnitude usually makes already "all the difference"!)

To maintain the distinction between "reasoning" and "pondering" seems meaningful for the following reasons. Firstly, they serve different purposes: it is the purpose of reasoning to present a solution in a convincing way, it is the purpose of pondering to discover how the amount of reasoning needed can be reduced. (Note that it is usually not the purpose of pondering and reasoning together to solve the problem as quickly as possible. Suppose that by one day of pondering we see a way of solving a problem via an argument that takes another day to write down --and requires a similar amount of time to be read, checked and understood--. Compare this with the situation where four days of pondering shows us a way of writing down the solution in fifteen minutes --on the proverbial"backside of an envelope"--. In the latter case it took us twice as long to solve the problem, but our second solution is vastly superior to the first one!) The second reason why it seems meaningful to me to maintain the distinction between reasoning and pondering is that, their techniques being so different from each other, they have to be taught in quite different ways.

Eventually pondering should become one of the main topics of these essays. Before we can approach that subject, however, we should try to get a clearer idea of what makes an argument a really nice argument. For that purpose I propose to turn my attention to some quantitative aspects of reasonings.

*                 *
*

Illustration. At primary school we were taught how to solve what we called "encountering and overtaking problems". A typical specimen would be:

"At 12 o'clock the two hands of the clock coincide. What are the other moments of the day at which this is the case?"

Another beloved form in which such an implicit linear equation would be posed to us was the question where two walkers, starting at a given distance from each other with given speeds would meet. When we had made hundreds of such sums, someone would pose us the following problem:

"At a given moment Piet starts walking from A to B; at the same moment a fly departs (with greater speed) from his nose towards Jan, who leaves at the same moment B on a walk to A. As soon as the fly reached the nose of Jan, it returns and flies back to the nose of Piet, from where it returns to the nose of Jan, etc. until it is killed between the two colliding noses. If the distance between A and B is 20km, Piet walks with a speed of 4km/h, the fly flies with a speed of 13km/h, and Jan walks with a speed of 5km/h, how far has the fly flown before it is killed?"

Conditioned as we were, we first went into the foreseeable blind alley. When you have found a simple way to derive the answer (28.888...km), you will observe that you have arranged your reasoning in a pattern that is very similar to "the boys' argument" referred to above. (End of Illustration.)

Plataanstraat 5
5671 AL NUENEN
The Netherlands
prof.dr.Edsger W.Dijkstra
Burroughs Research Fellow


transcribed by Tristram Brelstaff
revised Thu, 18 Nov 2004