• Trigonometric Identity Crises

    ‘Sining’ In

    (Source: SlideServe)

    We are in the middle of a pit stop in the middle of our series on complex numbers. The pit stop is intended to introduce the reader to the basics of trigonometry. In the previous post, we saw why we have 360° in a circle and 90° for a right angle. We also defined the radian, which I demonstrated was not an arbitrary unit, unlike the degree. In the post preceding that, we saw how partnership between mathematicians from India, Persia, Greece, and Rome gave us the names of the trigonometric ratios.

    At the conclusion of this pit stop, when we return to complex numbers proper, we will see how integral the ideas we uncover in trigonometry are to the study of these numbers. For now, as we begin the next part of our pit stop, let us remind ourselves about the definitions of the trigonometric ratios.

    Nomenclature

    With respect to the figure above, the angle we are focusing on is ∠ABC, which we are calling θ. The side opposite θ is AC, which has a length equal to b. The side adjacent to θ is BC, which has a length equal to a. Finally, the hypotenuse of the triangle is AB, which has a length equal to c.

    Per our definitions of the trigonometric ratios

    The three ratios relate as follows

    We also define three other ratios as follows

    Here, csc stands for cosecant, which is often shortened to cosec. Similarly, sec is short for secant and cot is short for cotangent. With reference to the post made two week back, I leave it to the reader to discover why the secant has its name.

    Pythagoras Again

    Now, from Pythagoras’ theorem applied to the triangle ABC, we know that

    If we divide the entire equation by c2, we will get

    Pay close attention to the notation above. What the last equal says is that

    Consider the equation

    This is the first identity students learn in trigonometry. It is called a Pythagorean identity because it is derived from the Pythagorean theorem. But what do we mean by an identity?

    An identity is a mathematical statement that is true for all values of the variables. In the above equation, the angle θ is the variable since we did not decide on its value. The equation

    is true for all values of θ and is, therefore, called an identity. An identity, while in this case written in the form of an equation, differs from common equations in that one cannot ‘solve’ an identity to obtain a value of the unknown since it is true for all values.

    Now, starting with this identity, let us divide it by cos2θ. We would then get

    This last equation is also a Pythagorean identity since it too is derived from the Pythagorean theorem. The reader can begin with the first identity and divide it by sin2θ to obtain the third Pythagorean identity

    Proving Identities

    Note that the two additional Pythagorean identities did not require any additional trigonometry apart from the definition of the secant, cosecant, and cotangent. This is a big feature in trigonometry, where new identities can be proved based on existing ones. Let us consider some examples.

    Example 1

    Suppose we were asked to prove that

    We could proceed as follows

    Here LHS and RHS denote ‘left hand side’ and ‘right hand side’ and indicate the direction in which we are moving to prove the identity.

    Example 2

    Let’s consider another example. Suppose we were trying to prove that

    We could proceed as follows

    While this proof is longer than the earlier one, note that in both we use simple algebra, such as was discussed earlier in Primer to Complex Numbers, the post with which I began the series on complex numbers.

    Example 3

    As a final example, suppose we were asked to prove

    We could proceed as follows

    Interrelatedness of the Trigonometric Ratios

    Note that, along the way, we made use of the Pythagorean identities quite often. This is because the trigonometric ratios simply conceal the Pythagorean theorem in their definitions. In fact, because of this feature, namely that all the ratios are related to each other through the Pythagorean theorem, we can actually show the following

    That is, each of the ratios can be expressed completely in terms of any one of the other ratios.

    ‘Sining’ Off

    However, if you have been paying attention, you will realize that we have a problem. We know that no triangle can have an angle equal to 0° or 180°. We also know that a right angled triangle cannot have another right angle. We also know that a right angled triangle cannot have an obtuse angle or a reflex angle. In other words, defining the trigonometric ratios in terms of lengths of sides of a right angled triangle means that most of the angles cannot have trigonometric ratios defined for them. This is a remarkable drawback. However, mathematicians have gotten around this restriction in quite an ingenious manner. We will turn to that in the next post. Till then here are a few identities you can try proving.

  • Round and Round

    A Triggered Review

    We have reached a remarkable junction in our journey through complex numbers. We are in the middle of a pit stop in which we are looking at some trigonometric ideas. And on the occasion of Pi day 2025, I think this post makes a lot of sense. Two weeks back, in My Trigger, No Metric, Beginnings, I introduced the idea of right angled triangles and Pythagorean triples. As a reminder, Pythagorean triples are sets of three integers that satisfy the Pythagorean theorem

    In that post, I looked at different ways of generating these triples, an exercise with which I occupied myself shortly after I was introduced to these triples. Last week, in A Mathematical Partnership, we looked at how the common trigonometric ratios are defined. We also understood how the ratios – tangent, cosine, and sine – obtained their names.

    It is time now to understand how we humans have decided to measure angles. After all, that is precisely what the trigonometric ratios deal with – the ratios of the lengths of sides of a triangle corresponding to the measure of the angle being considered.

    A Degree of Definition

    Most of us would have started our journey into angle measurement with the knowledge that a right angle is equal to 90 degrees, where the ‘degree’ is the unit of measuring angles. However, when I have asked students how to define a ‘degree’, many, if not most, of them are flummoxed. They know that 90 of these units give us a right angle. And some students may be able to then say that a degree is one-ninetieth part of a right angle. And while this is perfectly correct, the definitions are circular since the degree is defined in terms of the right angle and the right angle in terms of the degree. I could just as well define the angle below as a right angle and obtain a different measure of a degree than what we conventionally use.

    You may object and tell me, “That angle is not a right angle.” But I could very well respond, “How do you know?” Indeed, since the definition of each term – right angle and degree – depended on the other, there is no way of deciding what each measure actually is.

    Of course, there is a better way out. Unfortunately, none of my mathematics teachers told me this. And given the flummoxed expressions on the faces of most students when asked to define the degree or the right angle without reference to the other, it seems that this unfortunate practice continues to date.

    So let us begin with two intersecting straight lines as shown below

    Now, apart from their location and orientation, there is nothing that differentiates the line AB from the line CD. Hence, the angle form by each of these lines at the point of intersection, O, must be identical. In other words, since ∠AOC + ∠BOC forms the straight line AB and, since ∠BOC + ∠BOD forms the straight line CD, the two measures should be equal. So we can write

    This is nothing other than saying that vertical angles (or vertically opposite angles) are equal. Now note that ∠BOC is adjacent to both ∠AOC and ∠BOD. These pairs of adjacent angles (∠AOC and ∠BOC or ∠BOC and ∠BOD) have the same angle sum, as we have stated above. However, there is only one configuration in which the adjacent angles will have equal measure. This is shown below

    We can now define the four angles formed, all of which will have equal measure, to be right angles. From here, we can define the degree to be one-ninetieth part of the right angle and have no circularity in our definitions. But why do we define a right angle to be 90°? Why not a neat, nice round number like 100°? Of course, there is a unit of angle measure, the gradian, that is defined as one-hundredth part of a right angle. But I bet not too many people know about it. Why, when 100 is a part of the decimal system, being a power of 10, has this unit not caught on? And why do we function with the right angle being defined as 90°? The answer to this is two-pronged.

    Celestial Constraints

    We live on a strange planet, the only one we know of that can sustain the kind of life that we are accustomed to. Our planet revolves around an unimpressive, medium sized star in a orbit that takes it about 365.25 of its own rotations about its axis. That’s just a convoluted way of saying that one earth year is equal to approximately 365.25 earth days. The earth also has one natural satellite – the moon – which revolves around the earth in an orbit, which when viewed from earth, takes between 29 and 30 earth days to traverse.

    The seasons are, for the most part, governed by the solar cycle of the earth. That is, the earth’s position relative to the sun has a greater impact on the seasons than does the moon’s position relative to the earth. However, it is difficult to determine exactly where in the solar cycle we are, since measuring exact times for sunrise and sunset is impractical almost everywhere. This means that determining the exact dates for the equinoxes, where the length of daytime and nighttime are equal, or the solstices, where the lengths of daytime and nighttime are extreme, is practically impossible.

    Due to this, most human cultures have found it easier to mark the progress through the year ironically with the lunar cycles. After all, it is quite easy to look into the night sky and realize that the moon is not visible! Hence, most cultures have measures this progress through the year by starting months when the new moon cycle began.

    However, as mentioned earlier, the moon takes 29 or 30 days to complete one cycle around the earth, as viewed from earth. This means that we encounter a new moon once every 29 or 30 days. Hence, most cultures have measured the progress through the year in terms of months that are 29 or 30 days long, following the lunar cycles. However, this results in a shortfall of about 11 days when compared with the solar cycle, resulting in a drift. To compensate for this, most cultures have the practice of adding an extra month every 2 or 3 years to get the calendar synchronized with the solar cycle.

    So what we have is a solar cycle of approximately 365.25 days and a lunar cycle, twelve of which amount to between 353 and 355 days. The number 360, which is four times 90, falls neatly in this interval between the period of twelve lunar months and one year. The number 400, which is what we would get from the gradian, lies outside this interval and hence would not have been suitable for measuring anything on planet earth.

    A Deliberation on Divisors

    Of course, the second consideration, beyond what is forced upon us by the weirdness of the interaction between the sun, the earth, and the moon, is that we want a number that has a large number of divisors. This allows for dividing the year into many smaller fractions without having fractional days to account for. It is here that we see another advantage of 360 over 400. Here at their divisors

    Apart from 1, which is a divisor of all numbers, and 400 and 360 respectively, since every number is automatically a divisor of itself, 400 has 11 divisors, while 360 has twice as many. In other words, 360 allows us to divide the year in 22 different ways without fractional days, while 400 only gives us 11 options.

    Now, if we extricate ourselves from the realms of planetary rotations and revolutions and turn to geometry, we can see that dividing one complete rotation about a point into 360 parts gives us more options than dividing by 400 parts. On account of this and the link with the lunar and solar cycles, the degree was defined to be one-ninetieth part of the right angle.

    However, we can see that it is an arbitrary division. Divisibility, which I discussed in Behind the Smoke and Mirrors, plays no role in determining the units of other quantities. Why should it play such a prominent role when it comes to measuring angles? In fact, let us look at how some fundamental units are defined.

    Unimaginable Units

    The second is defined as 9,192,631,770 cycles of radiation associated with the transition between the two hyperfine levels of the ground state of the cesium-133 atom. If that number is strange, consider that the metre is defined as the distance traveled by light in a vacuum in 1/299,792,458 seconds. The kilogram is defined in an even more roundabout way by first defining Planck’s constant to be 6.62607015 × 10−34 joule second and then using the definitions of the second and the metre and the fact that one joule is equal to one kilogram times metre squared per second squared to arrive at the measure of the kilogram. What this lets us know is that, when it comes to units of measurement, divisibility is the least of our concerns.

    What this means is that choosing a unit for angle measure based on divisibility is arbitrary. And since we understand that the twelve lunar cycles and one solar cycle are not sufficient grounds for measuring angles, any number we decide upon, whether 360 or 400, is going to be arbitrary at best. So, how do we decide on a better, more appropriate unit? We have seen earlier, in our series on e and π, that we are often forced to accept constants that are inconvenient. When I say inconvenient, I mean that we would never have thought to consider these numbers as significant. In fact, were these numbers not forced upon us, we would never in our wildest dreams have settled on anything even remotely close to their values.

    So, how do we go about defining a unit for measuring angles? The previously mentioned units (second, metre, and kilogram) were all defined in terms of physical processes or quantities. We do not have anything like this for the angle since an angle is an idealized geometric entity. So, we are forced to consider other ways to define a new unit for angle measure.

    Reining in Rotation

    We start with the observation that an angle simply represents a rotation. So, for example, in the figure below, the angle AOB can be viewed as a rotation of OA in the counterclockwise direction with O being the center of rotation until A coincides with B.

    So the measure of the angle can be quantified as the amount by which we must rotate from the starting point until the angle is formed. We also recognize that a logical point would be one complete rotation about O, which would bring OA back to coincide with itself. Once we have done this, the point A would have traced the circumference of a circle with center at O and radius equal to the length OA as shown below

    Now, we know that, if the radius of this circle is r, then the circumference is 2πr. We can also understand that, if point A traverses a distance that is twice the distance covered from A to B along the circumference, the angle by which OA would have rotated would be twice the original angle AOB. In other words, the distance covered is proportional to the angle through which OA rotates. We can also see that the distance is proportional to the radius. That is, if the radius is doubled, the distance covered would be double the original distance.

    We now look at the formula

    Since r represents the effect that the radius has on the circumference, we ask ourselves what the 2π represents. Remember that that distance covered during a rotation is proportional to both the radius and the angle of rotation. Since, r represents the proportionality to the radius, it must be the case that 2π represents the proportionality to the angle of rotation. However, the circumference itself represents a full rotation. Could it be then that 2π is the angle corresponding to the full rotation? What would that mean?

    Now, while OA is rotating about O, point A traces arcs of the circle. Let the arc length corresponding to an angle θ be s. Since arc length is proportional to the angle of rotation, we can write

    Now, if we keep the angle of rotation fixed at θ, then we can understand that, if the circle was twice as large, that is, if the circle had double the radius, then the arc length would be double what it was. Hence, the constant k must be none other than the radius r. This gives us

    Realizing the Radian

    Comparing this with the equation C = r we can see that, since C corresponds to the arc length for a full rotation, 2π does indeed correspond to the angle of rotation. Using the equation s = θr, we can see that, if we set θ = 1, we will get s = r. This gives us the definition of the new unit of angle measure, called the radian. The radian is the angle subtended by an arc of a circle that has length equal to the radius of the circle at the center of the circle. In other words, starting from OA, when we have rotated through an angle equal to 1 radian, the arc length involves would be equal to the radius of the circle. This is depicted below

    (Source: Wikipedia)

    With this definition of the radian, we can conclude the following

    1. One full rotation or the angle about a point is equal to 2π radians.
    2. A half rotation or the angle for a straight line is equal to π radians
    3. A quarter rotation or a right angle is equal to π/2 radians

    From this we can also get the following conversion equation

    The superscripted ‘c’ is the symbol for the radian, indicating ‘circular measure’ since the radian is the unit of angle measure defined solely based on the measurement of the circle. Of course, since the radian is the standard unit for angle measure, by convention it is often not indicated. Hence, in the equation

    it is understood that the left hand side specifies π/3 radians.

    Coming Full Circle

    What we have seen in this post is that the degree measure is an arbitrary unit. It was adopted on account of two considerations – the orbital period of the earth (i.e. approximately 365.25 days) and the convenience introduced by the fact that 360 is a highly composite number with many distinct divisors. However, we saw that units for other quantities did not pay heed to any such requirements. This allowed us to think of the unit for rotations in terms of the features of the circle, leading us finally to the radian.

    However, while almost all the other units have a certain degree (pun intended) of arbitrariness to them, the radian is completely defined in terms of the internal structure of the object it is designed to describe. For example, with the different measurement systems that humans have devised, we can imagine a world in which the standard unit of length is a cubit instead of a metre and the standard unit of mass is the pound instead of the kilogram. The second, based as it is on the period of rotation of the earth and the convenience of dividing successively by 24, 60, and 60, might have a slightly less amount of arbitrariness to it, at least for creatures who inhabit this planet. However, no matter where one may find oneself in the universe, all circles on Euclidean planes will yield the same measure for angles regardless of the name any extraterrestrial species may have given the quantity.

    Having defined the radian, we will continue with our pit stop in the next post in which we will consider some trigonometric identities. Until then, hope you have seven turns for the better!

  • A Mathematical Partnership

    Looking Back, Looking Forward

    In the previous post, we began our first pit stop during our journey into the wonderful realm of complex numbers. We looked at Pythagorean triples and some ways of generating sequences of triples. I admitted at the end of the post that this was not strictly a study of trigonometry and that we normally begin by defining the ratios since, cosine, and tangent. I promised you that I would tell you how we got these weird names. That is the agenda for this post. We will consider the ratios in the reverse order because the story of sine will take us on two globetrotting journeys that I am sure many, if not most, of you have never undertaken before this.

    Triangle Nomenclature

    We begin our journey by considering some nomenclature. In what follows, I will assume that the triangle is ABC and that the right angle is at vertex C. Further, the length of the side opposite a given vertex will be denoted by the lower case letter of that vertex. Hence, the length of side BC, which is opposite vertex A, will be denoted as a. The figure below should clarify the nomenclature we will use in the discussions from now on.

    Now the trigonometric ratios for angle B are defined as follows

    How did we get these names?

    Off on a Tangent

    For the definition of the tangent, we begin with a unit circle. This is simply a circle with radius equal to 1 unit. Let us consider a unit circle with center at O. A point, P, is chosen on the circumference of the circle and a tangent is drawn at that point. From what we know about circles, this tangent is perpendicular to the radius OP. Now consider another point, Q, on the circumference of the circle, such that the angle made by the radius OQ with the radius OP is θ. Now we extend the radius OQ and the tangent at P so that they intersect at point R. All of this is shown in the figure below.

    Now it is clear that ΔOPR is a right angled triangle with the right angle at vertex P. Now, with respect to the angle θ, RP is the opposite side while OP is the adjacent side. Hence, per the definition of the tangent above it should be clear that

    However, OP = 1, since we are considering a unit circle, hence we obtain

    However, RP is the length of the tangent corresponding to the points P and Q, the radii of which makes the angle θ. Hence, the name of the ratio. The tangent of an angle is the length of the tangent formed as above in a unit circle.

    Cosplay Anyone?

    Now, that’s all well and good. What about the cosine and sine? We go back to our original right angle triangle. However, now we focus on the vertex A. With respect to angle A, the trigonometric ratios will be as follows:

    Note that

    However, since C is a right angle, it follows that

    In other words, A and B are complementary angles. When we say that

    We are saying that the sine of an angle is the cosine of the complementary angle and vice versa. In other words, cosine is simply a shorthand for complementary sine, that is, sine of the complementary angle. But how in the world did the ‘sine’ get its name?

    A Journey from Sanskrit to Latin

    The ratio of the opposite side to the hypotenuse was used in many parts of the world. However, the name ‘sine’ strangely has its roots in the Indian subcontinent.

    (Source: Wikipedia)

    With reference to the figure above, the red arc and the green chord can be seen as a bow and string. Hence, Indian mathematicians named the string, that is, the chord, jyā, which is the Sanskrit word for ‘bowstring’. The half-chord was called ardha-jyā, which means ‘half bowstring’. Over time, the mathematicians realized that the ardha-jyā had more common use in the study of the triangles than did the jyā. Hence, they dropped the prefix and began calling the half-chord jyā.

    From here we have two stories, equally fascinating, and equally plausible (or implausible).

    One pathway indicates that a synonym for jyā in Sanskrit is jīvá. This term was adopted by Arabic scholars, who simply attempted to transliterate it into Arabic script as jība. However, since Arabic is not written with short vowels, this was rendered simply as jb. However, jība is not an Arabic word. In time, jb came to be pronounced as  jayb, Arabic for “bosom”. When Latin scholars found these Arabic works, they were reluctant to use the word for “bosom” in a mathematical work. Instead, they chose the word ‘sinus’, the Latin word for the hanging fold of a toga over the breast, from which we get the final form ‘sine’.

    The second pathway also observes the synonym jīvá but recognizes that jīvá also means ‘life’ in Sanskrit. From here the proposal is that the transformation happened earlier, when the word jīvá, now understood to mean ‘life’, was translated as ‘zind’, the Old Persian word for life. From there the name went westward to the Latin world where, unfortunately, the final ‘d’ of the name was misread as a breathing mark. This is understandable since, in Old Persian the letter for ‘d’ is 𐭣 and the breathing mark is 𐭥, making for a very small difference. Hence, the name got transliterated as ‘sine’, with the initial ‘z’ getting replaced with a ‘s’.

    Both the above pathways are based on hypotheses concerning when the Sanskrit texts were translated into either Arabic or Old Persian. We do not possess any extant manuscripts that indicate which pathway was taken. It is also possible that both pathways were taken a few centuries apart from each other, leading to a strange convergence when the various corruptions and mistranslations reached the Latin world. However, both pathways indicate that the origin of our name ‘sine’ is from the Sanskrit word jīvá.

    Knowledge Transcends Borders

    People from the Indian subcontinent may express pride at the fact that the main trigonometric ratio (sine) obtains its name from a Sanskrit word and from the imagination of some native mathematician who saw a segment of a circle as a bow and string. While there is no denying this, the length corresponding to the ardha-jīvá was used in many places, from Egypt to Persia. In other words, the usefulness of the length of the half-chord in relation to the angle formed by the half-chord was recognized by mathematicians the world over. For me, the journey, regardless of the pathway we decide upon, indicates that the world has always been one in which knowledge moved across all artificial national borders. Beginning with a Sanskrit word jīvá we have ended with a Latinized word ‘sine’. And I can only hope that this permeability of knowledge continues despite all the obstacles that are being introduced.

    Having seen how the trigonometric ratios received their names, we will continue our pit stop in the next post to understand how we decided to measure angles. Why do we have multiple units – the degree and the radian? Which one do I prefer? It will be pi day and I think the next post will be an appropriate one for the occasion. Stay tuned.

  • My Trigger, No Metric, Beginnings

    Brief Flashback

    A plot of the primitive Pythagorean triples with the odd leg on the horizontal axis and the even leg on the vertical axis. (Source: Wikipedia)

    My mathematical world was thrown wide open when, in the ninth grade, my classmates and I were introduced to the study of trigonometry. Following that, when I studied at IIT – Bombay, I wrote a research paper titled The Synthesis of Coupler Curves and designed a mobile industrial robot, both projects requiring the use of trigonometry in the extreme. Hence, this post and the next couple are ones that I have actually been looking forward to.

    Of course, this post forms a part of the ongoing series on complex numbers. However, this post forms the first in a pit stop that I had forewarned the readers about in the first post of the series. Yes, I said ‘first in a pit stop’. You see, unlike in an F-1 race, this pit stop will take some time!

    The Theorem of Pythagoras

    The journey into trigonometry begins with the ubiquitous theorem of Pythagoras or the Pythagorean theorem. This is not the place to debate where the theorem was first discovered or used or even if Pythagoras himself was an actual historical person or just a mythic figure concocted by the school that bore his name. Pythagoras’ theorem (as we call it in India) or the Pythagorean theorem (as people in North American tend to call it) states that, in a right angled triangle, the square of the length of the hypotenuse is equal to the sum of the squares of the lengths of the other two sides. Students are often instructed to memorize

    However, without knowing what the symbols a, b, and c denote, the above equation is meaningless. In particular, without recognizing that c represents the length of the hypotenuse, the equation is not even valid! However, in my years of teaching students in high school, I have realized that many students are not even taught what the hypotenuse is!

    An Unfortunate Realization

    Most students are introduced to right angled triangles with figures like the ones below

    In all three triangles, the side AB is the hypotenuse. However, the unfortunate fact is that in the above three figures AB is oblique (that is, neither vertical nor horizontal). And so many students have come to me after middle school with the understanding that the hypotenuse is the oblique side. So consider the figures below.

    All I have done is rotate the earlier triangles. The hypotenuse should not change. In all three cases it is still side AB. However, now AB is either horizontal or vertical. And some students have understood that the hypotenuse is oblique. Hence, they conclude that the hypotenuse is AC or BC. The hypotenuse, however, is the side opposite to the right angle. This is something that teachers in the middle school need to ensure students understand. It is not the orientation of a side that makes it the hypotenuse but its relation to the right angle.

    Pythagorean Triples

    Anyway, returning to the equation for Pythagoras’ theorem, we have

    I remember when I was introduced to this theorem back in my middle school days. My teacher introduced us to Pythagorean triples like (3, 4, 5), (5, 12, 13), etc. I remember spending a lot of time trying to find patterns to these triples. “Was there a way of finding new triples?” was the question I asked myself.

    My search for patterns revealed that

    That is, the square of the smallest number was the sum of the two larger numbers in the triple. Did this always hold true?

    I discovered that

    with (7, 24, 25) and (9, 40, 41) forming Pythagorean triples. Hence, it seemed that the pattern I had discovered did form Pythagorean triples. Could this be formalized in any way?

    Formalizing the Sequence

    Scatter plot of the legs (a and b) of Pythagorean triples (Source: Wikipedia)

    Later introduction to sequences and series allowed me to recognize that, in all the triples I was dealing with, the smallest number was an odd number. Moreover, the two larger numbers happened to differ by 1, the smaller being even. Knowing that 2n + 1 is always an odd number if n is a natural number, we can do the following

    So, the square of the smallest number 2n + 1 can be split into

    where the two numbers differ by 1 and the smaller is even. But do these three numbers satisfy Pythagoras’ theorem? Let’s check.

    It is clear that

    meaning that I had found a way to generate Pythagorean triples. Using this we can generate the triples (11, 60, 61), (13, 84, 85), (15, 112, 113), etc.

    Another Sequence of Triples

    However, it was clear from the triple (8, 15, 17) that I had not found a way for generating all the Pythagorean triples. Indeed, this triple does not satisfy the original premise for the triples because

    Was there a pattern that governed these three numbers? And could that pattern be extended, as before with the first pattern, to generate a new sequence of triples? Trying different patterns, I realized that the square of 8 was equal to twice the sum of 15 and 17. That is,

    Was this something important? Using this pattern, I discovered that (12, 35, 37) was also a Pythagorean triple in which

    Could this be generalized? Like before, where 2n + 1 is an odd number, 2n will be an even number, which happened to be the case with the smallest numbers in each of the new triples. Further, the two larger numbers differ by 2 and are one less and one more than the square of half the smallest number. So let’s test this. If the smallest number is 2n, then the square of half of 2n would be n2 and the two larger numbers would be n2 – 1 and n2 + 1. When we test these three numbers we get

    We can see that we have found a way of generating a new sequence of Pythagorean triples. Using this method, we can generate (16, 63, 65), (20, 99, 101), etc.

    Yet Another Sequence

    However, the presence of the triple (20, 21, 29) indicates that this second sequence of triples also does not exhaust all the possible Pythagorean triples. Despite this, I tried my hand at many such patterns, just to entertain myself. The first sequence consisted of triples having the form

    The second sequence has the form

    The third sequence is formed with the triples having the form

    from which we can obtain (20, 21, 29), (28, 45, 53), and (36, 77, 85), etc.

    Next Steps in the Journey

    This was all recreational mathematics in the service of some higher understanding of how these triples, which have a geometric significance could relate to each other. There are, of course, other sequences of triples than the three we have uncovered. For example, the triples (33, 56, 65), (39, 80, 89) and (65, 72, 97) do not fit into the three sequences we have uncovered. I leave the reader to think of other possible ways of characterizing Pythagorean triples. And perhaps also think about the question, “While it is true that each sequence for generating Pythagorean triples generates infinitely many triples, is the number of such sequences also infinite? And can it be proved either way?”

    Our journey into complex numbers required a pit stop at which we will learn some trigonometry. However, the journey is way too exciting, as we have hopefully seen in this post. The simple idea of trigonometry based on the right angled triangle, presented an opportunity to look into Pythagorean triples and some patterns with which an infinite number of triples can be generated. This, of course, is not strictly the study of trigonometry, but rather just my idiosyncratic diversion within this pit stop. Most of us know that trigonometry begins with defining ratios like sine, cosine, and tangent. Such strange names, especially the first two. Ever wonder how we got them? Keep your eyes glued for the next post.

  • Modulating an Invariant Metric

    A Brief Recap

    Our journey into the wonders of complex numbers continues. In this series I hope to introduce the reader to the reality of complex numbers. In the previous post, we reached the stage where I introduced the square root of -1 as the unit of the imaginary numbers, parallel to the number 1, which is the unit of the real numbers. We looked at how addition, subtraction, and multiplication works with complex numbers. And I promised to look at division in this post.

    Division of Irrational Numbers

    Now, those who are familiar with mathematics taught till about grade 9 or 10, will know that the set of real numbers includes rational and irrational numbers. Further, a rational number cannot be irrational and vice versa. For example, numbers like

    are rational because they can be expressed as the ratio of two integers. However, numbers like

    are irrational because they cannot be expressed as the ratio of two integers. Now, when we add or subtract one rational number and one irrational number, the result will be an irrational number. Hence, numbers like

    are irrational, with one part being rational and the other part being irrational. Now, if we were asked to divide one such irrational number by another, we would proceed with the method of rationalizing the denominator. So, if we were asked to evaluate

    we would begin by multiplying the numerator and the denominator as follows to get

    Note that the denominator has now become rational, which would allow us easier ways of manipulating the expression. The factor 5 + √7 is known as the irrational conjugate of the denominator 5 – √7.

    Dividing Complex Numbers

    So, the irrational conjugate, when multiplied by the irrational factor results in a rational factor. Could it be, then, that a similar ‘complex conjugate’ could transform a complex number in the denominator into a real number? Before we try it on an actual division, let us see if the product of a complex number and its complex conjugate actually gives us a real number.

    If we consider 3 + 4i, its complex conjugate (taking a hint from the irrational conjugate) would be 3 – 4i. When we multiply the two we would get

    So, our intuition that multiplying a complex number and its complex conjugate would result in a real number was right. And this is because i2 = -1. Let us use this to divide one complex number by another. Let us divide 4 – 5i by 3 + 4i.

    Note that the terms in red contain i2 and hence have a sign reversal, as indicated by the terms in green. Of course, the final result has the form a + bi, where a and b are real numbers. Hence, there is a real part, a, and an imaginary part, bi.

    A New Perspective on Multiplication and Division

    Now, it is all well and good that we managed to divide one complex number by another. However, whatever does the division mean? We know what division with real numbers means. For example 10 ÷ 5 can be viewed as the number of times 5 can fit into 10, giving the answer 2. We can also view it as a reduction of 10 by a factor of 5, giving 2. This allows us to see division and multiplication as, in essence, the same operation. Hence, division by 5 can be considered multiplication by 1/5. So both division and multiplication can be viewed as scaling a given number. Of course, as we have seen, negation, which can be viewed as a multiplication by -1, is nothing other than an anti-clockwise rotation about zero of 180°. And since division and multiplication are inverse operations, we can consider division by -1, which also is equivalent to negation, as a clockwise rotation about zero of 180°. Of course, a 180° rotation clockwise or anti-clockwise results in the same final position. However, we will see that maintaining a difference in direction of rotation makes sense when we consider complex numbers.

    Now, while we have given some meaning, in terms of scaling and rotation, to multiplication and division of real numbers, how can this relate to complex numbers? From the previous post, we saw that we can visualize complex numbers on a plane with the horizontal axis depicting the real part of a complex number, while the vertical axis depicts its imaginary part.

    The ‘Size’ of Real Numbers

    However, viewing multiplication and division as scaling requires us to have some idea about the ‘size’ of the number. I mean it is only meaningful to say that division of 10 by 5 is a scaling of 10 by a factor of 1/5, resulting in 2, if we have some idea that the number 10 has 5 times the ‘size’ of 2.

    Of course, using the number line, we understand that the ‘size’ of a real number, also known as its absolute value, is the ‘distance’ of the point representing that number from the zero on the number line. Hence, the ‘size’ of both 4 and -4, shown below, is 4 units because the points representing 4 and -4 are 4 units away from the zero.

    A Complex Cul-de-sac

    Now we saw that we can represent complex numbers on a two dimensional plane as below

    What ‘measure’ can we use to determine the ‘size’ of these numbers? We could simply add the distances moved in the real and imaginary directions. So the ‘size’ of 3 + 2i would be 3 + 2 = 5. Similarly, the ‘size’ of -4 + 4i would be 4 + 4 = 8. However, this measure is dependent on the orientation of the axes. For example, consider the point 4, depicted below

    If we now rotate the axis by 30° clockwise, we will get the following

    The distance from the imaginary axis will now be the length of the purple line, which happens to be 3.464 (rounded to 3 decimal places). The distance from the real axis will be the length of the green line, which is 0.5. Hence, as indicated, once we have rotated the axes, the new number becomes 3.464 + 2i, as indicated in the diagram. However, this means that the ‘size’ of the number is now 3.464 + 2 = 5.464. This means that a simple rotation of the axes has changed the ‘size’ of the number. We know that this is absurd. Changing the orientation of an object or our orientation with respect to an object should have no effect on the ‘size’ of the object. Hence, the measure involving simply adding the distances along the real and imaginary axes is a mathematically absurd measure.

    Vectors Again to the Rescue

    We could take a cue from the study of vectors, remembering that it was vectors that first gave us the intuition of adding another axis to depict the imaginary numbers. Now, the ‘size’ of a vector is known as its magnitude and is calculated using Pythagoras’ theorem. So if a vector depicts a move of 4 units to the left and 3 units down, we would first depict it as below

    Please do not confuse i, the imaginary unit that is equal to √-1, and î, the unit vector along the x direction. While mathematicians have felt free to adopt letters from other scripts, such as α, β, θ, etc. from the Greek script, we still predominantly use the English alphabet. And there are only 26 symbols to choose from. So some overlap is to be expected.

    Anyway, the magnitude of the above vector (i.e. -4î -3ĵ) is determined using Pythagoras’ theorem as follows

    The Modulus of a Complex Number

    We now use the same method to define the ‘size’ of a complex number, which is called its modulus. Hence, the modulus of the complex number z = a + bi is defined as

    Does this hold up for the earlier experiment with rotating the axes? Recall that the original complex number was 4 + 0i. It is trivial to see that its modulus is 4. After rotating, the number became 3.464 + 2i. We calculate its modulus below

    The final number, 3.999911999, might seem strange. However, this is the result of the rounding we had done earlier. Remember that the length of the purple line was 3.464 rounded to 3 decimal places. But how did I know if was 3.464? The measuring instruments I have at my disposal are an assortment of ruler, none of which has a least count smaller than 0.5 mm. So, if I used the rulers, I couldn’t have gotten anything better than 3.45 cm. What gives? Am I pulling the wool over your eyes? Absolutely not! You know I would not do that! So, there must be a way in which I have obtained the rounded value 3.464. Yes, indeed, there is. The actual value is 23, which the calculator gives as 3.464101615 and Wolfram Alpha gives as 3.46410161513775458705489268301174473388561050762076125611161395890386603381 to 64 decimal places. You can click on ‘More digits’ to get more digits, if that’s something you like to spend your time doing!

    Of course, if the actual length of the purple line is 23, then the complex number is 23 + 2i, which would yield

    Hence, with the exact value of the length, we get the same modulus for the complex number even after rotating the axes. Since, this is what we would expect, this definition of the modulus is what is accepted. But how did we obtain that the length was 23? In the first post of this series I had forewarned you that we would be making some pit stops along the way. Well, it’s time for one such pit stop, which I will deal with in the next post. Till then, modulate or remodulate yourself!

  • It’s Not Rocket Surgery!

    Revisiting the New Kind of Number

    We are in the middle of a series on complex numbers. I started this series with Primer to Complex Numbers, in which I shared some basic algebra tools that we would need for our exploration of complex numbers, ending in a brief discussion about the importance of the discriminant. In the second post, A New Kind of Number?, I continued with the exploration by looking at what the discriminant tells us about the kind of numbers we are considering. That discussion included the idea that negation could be considered an anti-clockwise rotation about zero of 180° rather than a reflection in zero. Quite obviously, rotation would mean ‘stepping out’ of the constraints of the number line into a new realm. What would that entail?

    Operations with the ‘Normal Numbers’

    We begin by considering this new kind of number as something that cannot be combined in normal ways with the numbers we normally encounter. For example, we know that the basic mathematical operations can be performed on all our normal numbers. So, for example, we know that 11 and 12 can be added to give 23; that 13 can be subtracted from 10 to give -3; that 3 can be multiplied by 16 to give 48; and that 50 can be divided by 6 to give a quotient of 8 and a remainder of 2.

    We do not have to restrict ourselves to integers, though. We know that the following are all valid

    We do not even have to restrict ourselves to rational numbers. In fact, the expressions

    can be evaluated to any desired degree of accuracy simply by calculating each irrational number and performing the necessary operations after that. This works even for the transcendental numbers, like e and π. For example, while it may be difficult to evaluate by hand, a computer could give us the value of the expressions below to any desired level of accuracy.

    In all the above cases, the operations, while in some cases extremely tedious, we possible to carry out. Furthermore, we had a clear idea about what these operations meant since all the numbers could be located on the number line.

    However, if we were now proposing that we escape the confines of the number line, what exactly are we attempting to do?

    Assistance from Mechanics and Vectors

    When students in school learn mechanics, either in the context of physics or mathematics, their study of motion is first confined to motion in a single dimension. A particle is considered to move along one axis, normally not even named since there is only one dimension of motion. However, since most motion in real life is three dimensional, the single dimension proves to be too unrealistic a constraint. Students are then introduced to motion in two dimensions, the most common case considered being that of a projectile, like a ball, being thrown at some angle of incline.

    Students are taught a key concept when studying motion in two dimensions – the vertical and horizontal dimensions are orthogonal to each other. The basic meaning of this is that the vertical and horizontal dimensions are at right angles to each other. However, there is a deeper insight to be gained: No horizontal motion will contribute to vertical motion and vice versa. In other words, what happens horizontally does not affect and is not affected by what happens vertically. This allows the students to study the motion in each direction separately before combining the results to describe the motion as a whole.

    This forms the basis of the study of vectors too. We conceive of mutually perpendicular axes so that we can isolate what happens along one axis from what happens along other axes. In both cases, what we are saying is that what happens in one dimension or axis is cannot be combined with what happens in another dimension or axis, but must be kept distinct.

    We express this technically by saying that vertical and horizontal motion are independent of each other. Similarly, the x and y directions are independent of each other. And this is all because the directions are at right angles to each other.

    The Imaginary Unit

    We now use this same insight to think of this new kind of number, which when squared can give a negative result. We recognize that all the numbers we normally encounter do not change when multiplied by 1. In other words

    We, therefore, consider the number 1 as the ‘unit’ of the normal numbers. Further, when faced with the square root of a negative number, we recognize that we can do something like the following.

    In other words, we recognize that any negative number can be written as the product of its absolute value and -1. Now, we proceed to define the square root of -1 as the ‘unit’ of the new kind of number. By convention, we will denote it as i. In other words,

    Linking the Normal and New Numbers

    Now, we allow ourselves to perform operations with the new numbers as usual, but ensuring that the normal numbers and the new numbers are never combined to as to lose their distinctiveness. So, for example, we can do the following.

    But what would happen if we tried to multiply these numbers? We would get the following.

    The above rearrangement is legitimate because multiplication is commutative. However, note that we have the square of i at the end. From the definition of i as the square root of -1, we should have

    However, note that -1 is a normal number. In other words, by squaring the new number, we get a normal number. But not just any normal number! Rather, we have obtained the negation of the ‘unit’ of the normal numbers. Recall that we had defined negation as  an anti-clockwise rotation about zero of 180°. This would indicate that i, the unit of the new numbers, is obtained by  an anti-clockwise rotation about zero of 90°, as a result of which squaring would involve two such rotations for a total of 180° as required by the result that

    So what we have obtained in that 1, the unit of the normal numbers, and i, the unit of the new numbers, are oriented at right angles to each other! In other words, 1 and i are independent of each other. Since the normal numbers have their own number line, we define another number line for the new numbers that is at right angles to the original number line. Also, since

    we realize that zero can be written as a normal number and as a new number. Hence, the two number lines will intersect at zero. We can depict this as below.

    Operations in the Complex Plane

    Now, I have to stop using ‘normal numbers’ and ‘new numbers’ simply because these are not the accepted conventions for referring to them. However, with respect the the above diagram, the normal numbers, known as ‘real numbers’, are represented along the horizontal or real axis, while the new numbers, known as ‘imaginary numbers’, are represented along the vertical or imaginary axis. The entire plane, consisting of the two axes, is known as the ‘complex plane’, another unfortunate term.

    Numbers that are a combination of real and imaginary numbers can be located on the plane as shown below

    We can describe the above numbers as follows:

    • 3+2i : Starting from the zero on both axes, move 3 units to the right and 2 units up.
    • -4+4i : Starting from the zero on both axes, move 4 units to the left and 4 units up.
    • -2-3i : Starting from the zero on both axes, move 2 units to the left and 3 units down.

    Since the real and imaginary axes are perpendicular to each other, they are independent of each other. Hence, we can add and subtract as follows

    This is because moving 3 units to the right and 2 units up followed by 2 units to the left and 3 units down, is, in effect, to move 1 unit to the right and 1 unit down. Also, moving 3 units to the right and 2 units up followed by the reverse of (i.e. negation of) two units to the left and 3 units down, is, in effect, to move 5 units to the right and 5 units up.

    When it comes to multiplication, however, we encounter a new phenomenon. It is meaningless to talk about multiplying directions. Hence, it is absurd to say that two upward directions multiply to give a leftward direction. However, when it comes to numbers, we saw that negation can be considered as  an anti-clockwise rotation about zero of 180°. Further, we saw that i can be considered an anti-clockwise rotation of 90°, allowing for the square of i to be equivalent to negation. This means that it is legitimate to multiply complex numbers and we can do this using FOIL (see Primer to Complex Numbers) as below.

    However, since the square of i is -1, we can simplify the above as

    The Fork Ahead

    So far we have seen that it is possible to conceive of imaginary numbers and real numbers as two quite different species of numbers that are independent of each other. Due to this, they can be added and subtracted in the conventional way with the only restriction being that we keep the real and imaginary parts separate. However, when we conceive of the real and imaginary numbers as independent, we face a corollary that the square of i is equivalent to negation. This gives us the justification for multiplying complex numbers using conventional algebraic rules.

    This also explains the title of this post. No, I did not make a mistake. Yes, I have combined two ideas – rocket science and brain surgery – that normally do not go together, much like the real and imaginary numbers. Not everything is smooth, as we will see. However, with the single insight that squaring the imaginary unit is equivalent to negation, we can actually make the two otherwise disparate elements work together.

    Of course, there is much more to complex numbers. We still have to give further meaning to multiplication. We also have to see how division would work, not to mention exponentiation. And that is just the start. There is so much that the imaginary numbers open up for us and I hope you are as excited as I am for the journey ahead. In the next post, we will consider how division works with complex numbers. Till then, I won’t say, “Stay real” but “Stay true.”

  • A New Kind of Number?

    The Discriminant Revisited

    Last week I began a series on complex numbers with the post Primer to Complex Numbers. At the end of that post, I considered the discriminant in the formula for solving a quadratic equation. We saw that it is easy to comprehend what a zero or positive discriminant would mean simply because the square of zero is zero itself and the square of a positive or a negative number will always give a positive result. However, what do we do with quadratic equations in which the discriminant is negative?

    Visualizing Quadratics vis à vis the Discriminant

    Here, it would help for us to visualize the functions corresponding to the quadratic equations. In other words, given the equation

    we can consider this to be the equation f(x) = 0, where

    This is nothing less than finding the points where the graph of y = f(x) cuts the x axis. So, let us consider three different quadratic functions

    If we plot the graphs of these functions we will get

    We can factorize f(x) and g(x) giving us

    Here we can see that the factors x – 1, x – 5, and x – 2 correspond to the solutions of the corresponding equations f(x) = 0 and g(x) = 0. These also are the x-coordinates of the points where the graphs of y = f(x) and y = g(x) cut the x-axis. However, we can see that we cannot factorize h(x). Nor does the graph of y = h(x) cut the x-axis. If we calculated the discriminants of the three quadratics we would obtain

    So, when the discriminant is negative, we can see that the graph of the corresponding quadratic function does not cut the x-axis. This is precisely what we mean when we say that the quadratic equation does not have real roots or real solutions.

    However, if the quadratic equation does not have real roots, what kind of roots does it have? In other words, what kind of number can we propose which when squared gives us a negative number?

    Reimagining Negation

    Let us take step back and ask ourselves how we can visualize negating a number. What I was taught, and I think what most students today are taught, is that negating a number involves taking its reflection in the zero point. Hence, the negation of 4 is -4 and the negation of -3 is 3. We can visualize both of these as follows.

    However, we can also visualize this as an anticlockwise rotation of 180° as below.

    I will later drop the degree unit for measuring angles and replace it with the radian. I will also explain why I am considering an anti-clockwise rotation when we make this change. But for now, it should be clear that an anti-clockwise rotation about zero of 180° on the number line produces the same effect as negating a number.

    We can now consider the positive sign to represent no rotation and the negative sign to represent a rotation of 180°. Then the product of two positive numbers will involve no rotation, that is, a positive sign. Similarly, the product of a positive and a negative number will involve no rotation and a rotation of 180°, that is, a negative sign. And the product of two negative numbers will involve two rotations of 180°. This will be a rotation of 360°, which is a complete revolution, or the equivalent of no rotation, yielding, once again, a positive sign.

    So what if we considered the square root of a negative number to involve a rotation of 90°? In that case, the product of two such numbers would be two rotations of 90°, which equals an effective rotation of 180°, that is, a negative sign.

    The Next Step

    What we can see is that, if we are willing to ‘step out’ of the constraints of the number line by introducing the idea of rotations, then we can conceive of a way in which a ‘number’ which when squared gives a negative number. Of course, since we have ‘stepped out’ of the number line to understand these ‘numbers’, it is clear that these ‘numbers’ do not belong on the number line. And since the number line is intended to represent all ‘real numbers’ we are forced to conclude that these new ‘numbers’ are not ‘real numbers’. It is an unfortunate fact of mathematical history that these numbers were given the name ‘imaginary’.

    In fact, it was the philosopher and mathematician, René Descartes, who rubbished the idea that there could be numbers whose squares were negative and who then jokingly called them ‘imagined’ numbers, that is, no more than a figment of our imagination. The name stuck. However, while it is great to have some historical humor in the nomenclature, there are many mathematics teachers today who actually think these numbers are entirely made up. As we will see in future posts, this is not the case. So, in the next post, we will look at understanding some arithmetic and algebra related to these numbers. Till then, let your imagination soar!

  • Primer to Complex Numbers

    A Promise Remembered

    At the end of the previous post, I had announced that I will be starting a series on complex numbers. The name ‘complex numbers’ is unfortunate. Mathematics is already surrounded by an aura of mystery. There was no need to name something that would heighten that mystery and increase the trepidation of people – mostly teenagers – who are introduced to these numbers. In this series of posts, I wish to introduce you to the wonders and power and beauty of complex numbers. Along the way we will make some necessary pit stops that might seem irrelevant. But believe me, all of it will be beautifully relevant.

    However, what I am going to cover in this post might be too trivial for some readers. If this is you, you might be wondering why I am beginning our journey here and with such ‘simple’ ideas. Well, it’s always best to start with a firm foundation rather than presume that everyone is on the same page. Hence, bear with me.

    Our journey into the realm of complex numbers begins with a property of numbers that I highlighted two weeks back: The product of two positive or two negative numbers is always positive. “So what?” you may ask. “What’s the big deal?” you may wonder. “How is this relevant?” you may contend.

    The Quadratic Equation

    Let us begin our journey with the simple quadratic equation

    Students in grade 9 and higher may know that this equation can be solved by using the quadratic formula shown below.

    But where does this formula come from? What kind of mathematical wizardry gives rise to it? I wish to devote much of the rest of this post simply to understanding how we get the quadratic formula. To this end, we start at a relatively innocuous result, the expansion of the product of two binomial expressions.

    Product of Binomials

    Since, I may have lost some of you, let me clarify some terms. In algebra, we encounter terms like ‘monomial’, ‘binomial’, and ‘polynomial’. These terms mean ‘one named’, ‘two named’, and ‘many named’ respectively. Expressions like a, m, and x, or even ab, ab2, and a3b5 are monomials because they have a single kind of term. Similarly, expressions like a + b, p + q, and x + y are binomials because they have two kinds of terms.

    Now we can consider the product of pq and r + s. We can visualize it as follows

    Algebraically, we can say that each term in the first binomial needs to be multiplied by each term in the second binomial. We can do this as follows.

    In the above, F represents the product of the first term in each binomial, O the product of the outer terms, I the product of the inner terms, and L the product of the last terms. We get the convenient acronym FOIL from this. When we use FOIL to the expansion of a perfect square, we get the following.

    Perfect Square Expansion

    When we use FOIL to the expansion of a perfect square, we get the following.

    Students in grade 6 or 7 should be familiar with the above expansion. It is crucial to note that the perfect square expansion involves three terms. Hence only an expansion that includes all three terms can be considered a ‘completed square’. Unfortunately, as a high school teacher, I come across students ‘expanding’ the square as

    all too often, causing considerable grief and anxiety to me. We could, of course, visualize the expansion geometrically as follows:

    The above figure shows that the expansion includes the two rectangles, each with area pq, which are excluded when a student ‘expands’ in the aforementioned way that causes me grief and anxiety.

    Deriving the Quadratic Formula

    Anyway, now we know the perfect square expansion, let us use that to solve the quadratic equation

    Let us first divide the whole equation by a and then move the constant term to the right. This will give us

    Now we consider the two terms on the left to be the first two terms of the perfect square expansion. This gives us

    Now, in order to complete the square on the left, we need to add the third term. We need to do this on both sides so the equation has the same solutions. This gives us

    Now, the left is a perfect square. Hence, we can write the equation as

    Now, we can take the square root on both sides to get

    Now, moving the second term on the left to the right we get

    The Promise of the Discriminant

    Now, we observe that the quadratic formula just derived has a term under a square root symbol. This term is called the discriminant because it allows us to discriminate between the kind of solutions the equation will have. In particular, if Δ > 0 then we will have two distinct real solutions and if Δ = 0 then we will have one real solution. But what happens when Δ < 0? Till now we have only encountered non-zero numbers that are either positive or negative. And when we multiply two numbers with the same sign we always obtain a positive number. What sense can we make of a negative number under the radical sign? We will look at this in the next post. Stay tuned.

  • The Presenting Problem

    Recently I came across a video on YouTube that asked the question, “Which is bigger: 99! or 5099?” There are so many such questions on YouTube and I wonder why. After giving it some thought, I realize the truth in the saying, “Give a woman a fish and you feed her for a day. Show a woman to fish and you feed her for a lifetime.” Ok, fine, the original had ‘man’ instead of ‘woman’! But you get the point. When we teach people how to solve a particular problem rather than show them the principles that would enable them to solve problems in general, we are giving them a fish that will last them a day.

    So I’m going to treat problems like this under a single umbrella. You see, the question could have been, “Which is bigger: 101! or 51101?” or “Which is bigger: 199! or 100199?” and the approach would have been exactly the same. Since I’m saying this, try to determine a pattern in the numbers involved in each of the questions before you proceed.

    The Pattern

    What we should be able to see in the pairs {99! 5099}, {101! 51101}, and {199! 100199} is that the number whose factorial forms the first part of the pair shows up as the exponent in the second part of the pair. And these are odd numbers. Also, the base in the second part is half of one more than the exponent (50 = (99 + 1)/2, 51 = (101 + 1)/2, and 100 = (199 + 1)/2). So we can generalize the question to, “Which is bigger (2n-1)! or n(2n-1).

    Breakdown

    We can recognize that (2n-1)! and n(2n-1) are each the products of 2n-1 numbers. Hence,

    Now, the middle number in each product is n. Moreover, the sum of the first and last terms in each product is 2n. And the sum of the second and second last terms is also 2n. In fact, every pair made of numbers equidistant from the ends (or the center) will add up to 2n.

    Let us consider the first pair from the ends. From (2n – 1)! we have 2n – 1 and 1 and from n2n – 1 we have n and n. Now, while the sum is the same for both pairs, the products are not the same.

    In fact

    Now, from the previous post (and elsewhere, of course!) we know that the product of two positive numbers or two negative numbers is a positive number. Hence,

    as long as n is not 1. We can expand the square to get

    Generalization

    Now, if we chose a general pair (say, the kth pair), we would have 2n – (2k – 1) and (2k – 1) from (2n – 1)! and n and n from n2n – 1. Now if we take the products we get

    Once again, let us consider the product of two numbers as below

    Expanding the square we get

    In other words, no matter which pair we pick, the product of the pair from n2n – 1, in which all terms are n, will be greater than the product of the pair from (2n – 1)! It follows, therefore that

    Hence, we can conclude that 99! < 5099, that 101! < 51101, and that 199! < 100199.

    Stepping Back

    What we did in looking at the above was identify patterns between as many numbers involved in the question as possible. Here, specifically four things were crucial. First, both sides consisted of products of n numbers. Second, even though I did not explicitly mention it, the factorial contains all natural numbers from 1 to a particular higher number (n). However, on the other side we had n copies of the same number, the base. Third, the sum of numbers equidistant from the ends was a constant. And fourth, the base happened to be half this sum.

    Recognizing these patterns allowed us to frame the issue in terms of the product of pairs of numbers whose sum is constant. From there it was an issue of recognizing that we can use something we knew about products, namely that the product of two numbers having the same sign is always positive, to produce the inequality that would allow us to proceed to a solution.

    Now, it must be noted that not all solutions will involve identical steps. However, for most problems pattern recognition is crucial. Further, as I have shown here and here, mathematics is a coherent body of knowledge. This means that things we learn in one area can be reframed to be applicable in another area. Hence, in this post, something that a student might learn in a chapter on inequalities turns out to be applicable to algebra in general and number theory in particular.

    The willingness to ‘borrow’ knowledge from another area of mathematics to make it applicable in a new area is something that we realize is important as mathematics becomes more complex. And speaking of that which is complex, I will be starting a new series on complex numbers next week. But next week’s post will only be a beginning. Till then, try proving that, for all n > 3, nn + 1 > (n + 1)n. If you need something to point the way, the series on e might be helpful.

  • Inane Phrases

    If you read my motivations page and the previous post, you will realize that my experience of teaching grade 6 students had an immense impact on me. While the previous post focused on BEDMAS (or BODMAS or PEMDAS, etc.), in the motivations page I also reveal that the students came to me with what I said were “some inane phrases to remember while performing operations.” Unfortunately, I have found these inane phrases all to prevalent even among students in high school.

    You may be wondering what these inane phrases are. Well, for instance, you may hear a student say, “Minus and minus equals plus” or “Minus and plus equals minus.” Another student may say, “Positive and positive gives positive” or “Positive and negative gives negative.” Whatever in the world is this nonsense?

    Most of you might, however, know what they are trying to say. However, without any proper context within which the phrases are to be interpreted, I could conclude, as many students do, that – 2 – 3 = +5. After all, “minus and minus equals plus,” right? Or I could insist that – 7 + 10 = – 3 since “minus and plus equals minus.”

    Need for Precision and Clarity

    What the phrases actually intend to communicate are “the product of two positive numbers is a positive number,” “the product of two negative numbers is a negative number,” and “the product of a positive number and a negative number is a negative number.” Of course, my phrases have 10, 10, and 14 words, while the original inaccurate phrases have 5 words each. The inaccurate phrases are certainly more economical in terms of the number of words they contain and the time it takes to say them. And don’t get me wrong. I’m all for saving time and effort.

    However, I am not for saving time and effort at the cost of precision and accuracy and clarity. If the price exacted by the shorter phrases is the confusion among students about whether these are applicable only for multiplication or also for addition and subtraction, then I think the price is too high to pay.

    Basic BEDMAS

    But you may ask, “If these phrases are applicable only for multiplication, what rules do we follow when it comes to addition and subtraction?” This is a valid question because here too I have found students floundering. It’s not just that they have got the order of operations wrong, but also that they do not understand what is happening, especially when we are asked to add and subtract multiple numbers, some of which are positive and others of which are negative.

    Let us consider one problem. Suppose we are asked to evaluate

    Equipped with our understanding that the convention is to perform division and multiplication first, we obtain

    Now what do we do? We begin by performing the calculations from left to right. What we first do is orient ourselves so we imagine we are on the number line facing the right (i.e. the positive direction). Now, we locate ourselves where the first number is. So we land on the position for -7.

    Now all we have to do is move forward or backward the appropriate number of steps according to whether we encounter an addition or subtraction respectively. So, we first move forward 10 spaces, landing on +3, then back 3 spaces, landing on 0, and then forward 48 spaces, landing on +48.

    BEDMAS with the Brackets

    But what if we have something more complicated, say something that involves brackets? Consider the following

    Here we have two separate parentheses to deal with. Since both are unrelated, in that there is a subtraction operator between the two, we can deal with both simultaneously. So we can first obtain

    We can now complete our dealings with the parentheses to get

    Now the –8 represents the negation of -8, which would give +8. Hence, we have

    Now, the initial multiplication involves one positive number and one negative number, meaning that the product will be negative. The division involves two positive numbers and hence will yield a positive number. So we have

    This gives the final answer as -2.

    In the above example, we had obtained –8 and I stated that this was the negation of -8. We could have also considered

    Here, instead of the negation of a negative number, we are considering the product of -1, which is inherent in the standalone negative sign, and -8, which gives us +8 since it is the product of two negative numbers.

    BEDMAS ‘Brain Teasers’

    Now, almost everyday I get some BEDMAS ‘brain teaser’ in my social media feeds. I don’t know why. Maybe my interest in mathematics related posts makes the idiotic algorithm ‘think’ that I would like such ‘brain teasers’. The reason I’m placing ‘brain teaser’ in inverted commas is because I do not think these problems qualify as brain teasers. They only check whether you know the rules of BEDMAS.

    ‘Brain Teaser’ 1

    Anyway, one such ‘brain teaser’ is

    We know we need to deal with the brackets first. To do this, we need to calculate 8 – 6 + 3, which equals 5. Hence, the problem reduces to

    Since we only have multiplication and division with no grouping, we can proceed from left to right. This gives us

    It is important to note that in the expression 5(8 – 6 + 3), we are not instructed to group the 5 and the value of the parentheses. Hence, the 5, which comes after the division sign, becomes the divisor, while the 5, which is the value of the parentheses, is a normal multiplicand.

    ‘Brain Teaser’ 2

    How would we calculate the value of

    Here, we observe that there is a parentheses that we need to deal with first, yielding

    Now, we have an exponent, which we need to address next, giving us

    Now we have to perform the multiplication to get

    This leaves just addition and subtraction, which we can perform from left to right to get

    which yields 14 as the final answer.

    ‘Brain Teaser’ 3

    But what if we have something like

    We recognize that the first term has a nested bracket. Hence, we need to first deal with the innermost bracket. This bracket contains an exponent, which we need to perform first. The second term has an exponent, which we will perform first in that term. The third and fourth terms have division, which we will perform at this stage. This gives us

    Now, we have to perform the division in the innermost bracket of the first term. At the same time, we can evaluate the brackets of the second, third, and fourth terms. This gives

    Now we can evaluate this innermost bracket of the first term. At the same time, we can evaluate the exponent on the second term. We now have

    Evaluating the first bracket, we get

    Now evaluating the exponent gives us

    Observe that we have two division operators one after the other. When faced with this situation, we need to perform the operations one by one from left to right. So we first get

    because the first division was 3÷1, which equals 3. Now we can perform the remaining division to get

    Now we have only addition and subtraction, which we can perform from left to right to give us 9 as the final answer.

    It’s a Wrap

    What we can see is that following the BEDMAS rules carefully will yield an unambiguous answer. As mentioned in the previous post, the is nothing sacrosanct about BEDMAS. It is not inherent to any of the operations. However, this is the convention that has been adopted by mathematicians so as to remove confusion and to ensure that any particular arithmetic calculation yields one and only one answer. In these two posts I hope I have communicated the rules of BEDMAS well enough for anyone at or above grade 4 to understand. I hope the examples I have given serve as good illustrations of the process. But more than that, I hope I have communicated the fact that BEDMAS is a convention that could only have been agreed upon if there was a spirit of collaboration and camaraderie among mathematicians.