• Contacting Parabolic Polarity

    Brief Recapitulation

    In the previous post, titled Mathematical Parables, I introduced us to the parabola. Just to jog your memory, a conic section is a a geometric figure such that, for any point on the conic section, the ratio of its distance from a fixed point, called the focus, to its distance from a fixed line, called the directrix, is a constant, known as its eccentricity. A parabola is a conic section which has an eccentricity equal to 1.

    In the previous post, we derived the equation y2 = 4ax as that of the parabola in standard form, obtained the parametric coordinates (at2,2at) for a general point on the parabola, derived the equation yy1 = 2a(x + x1) of the tangent at a point (x1y1) on the parabola, and the condition c = a/m under which the line y = mx + c is a tangent to the parabola. In this post, we will deal with the chord of contact and the polar of a point with respect to the parabola.

    Chord of Contact

    Let P(x1, y1) be a point outside the parabola y2 = 4ax. From point P tangents are drawn to touch the parabola at Q(x2, y2) and R(x3, y3). This is shown in the figure below.

    Since PQ is the tangent at Q, its equation must be yy2 = 2a(x + x2). Since P lies on this tangent, its coordinates must satisfy this equation. Hence, y1y2 = 2a(x1 +x2)

    Since PR is the tangent at R, its equation must be yy3 = 2a(x + x3). Since P lies on this tangent, its coordinates must satisfy this equation. Hence, y1y3 = 2a(x1 + x3)

    It follows then that Q(x2, y2) and R(x3, y3 ) satisfy the equation yy1 = 2a(x + x1) and since two points determine a straight line, the equation of the chord of contact QR must be yy1 = 2a(x + x1).

    Pole and Polar

    Having derived the equation of the chord of contact of the point (x1, y1) with respect to the parabola y2 = 4ax let us derive an equation for the polar of the same point with respect to the parabola. Let P(x1, y1) be a fixed point placed anywhere relative to the parabola y2 = 4ax. A line through P is free to pivot about P. In general, this line will cut the parabola at two points. Suppose these points are Q(x2, y2) and R(x3, y3). Tangents are drawn to the parabola at the points Q and R. Let these tangents intersect at S(h, k).This is shown in the figure below.

    It is clear that at the line pivots about P, the points of intersection, Q and R, of the line with the parabola will vary. This means that the tangents at Q and R will vary. Hence, the point S where the two tangents intersect will move. The locus (fanciful name for ‘path’) of S is known as the polar of P with respect to the parabola and P is known as the pole of the polar with respect to the parabola. It can be seen from the diagram that QR is the chord of contact of S wrt the parabola. Hence, its equation should be yk = 2a(x + h). However, P lies on QR. Hence, y1k = 2a(x1 + h). Generalizing, we obtain the locus of S to be yy1 = 2a(x + x1).

    Tangent, Chord of Contact, and Polar

    Once again, as in the case with the circle, we can reach the observation that the same equation represents the tangent at the point (x1, y1), the chord of contact of the point (x1, y1), and the polar of the point (x1, y1) with respect to the parabola y2 = 4ax. In this case, the equation is yy1 = 2a(x + x1). Once again, it is the context within which the equation is used that tells us what the equation represents.

    If you wish to see how the chord of contact, pole and polar relate to a parabola, you can click here for a Geogebra App that I created. With the checkbox on the top left unchecked, you can see how the chord of contact varies. Just click on the point P and drag it around! If you check the checkbox on the top left, you can explore how the pole and polar vary. Again, just click on the point P and drag it around to position the pole. Then move the slider for α to change the orientation of the line through P. This will change the positions of Q and R and, therefore, the bright pink tangents at those two points. The point of intersection of these two tangents will move, tracing the green line, which is the polar.

    Once again, from the applet you will notice that the chord of contact technically only represents the part of the line that is inside the parabola which the polar represents the part that is outside the parabola. And when P is on the parabola, the chord of contact degenerates to the point P, meaning that the tangent itself is the polar. This is why all three features of the parabola, as in the case of all the conic sections, share the same equation.

    A Bone Picked

    In the post Contacting Circular Polarity I had expressed my dismay that concepts such as chord of contact and pole and polar are not taught in most (any?) contemporary curriculums. However, just the past week, I came across a post on LinkedIn in which the author was lamenting the return to archaic topics like analytical geometry in India’s mathematical education. I wanted to refer to that post here. However, the Amazon delivery guy came when I was reading the post on my phone and when I returned to LinkedIn I was unable to locate the post. The author claimed that a number of mathematics teachers or professors had voiced their disapproval of this return to such archaic concepts. The author argued that we should instead be focusing on AI and Data Science. Given that I have embarked on this series on analytical geometry, I had to address the issue raised by the post. If any reader can locate the post, please share it with me.

    As mentioned in earlier posts, analytical geometry helps us link geometry, which can be quite abstract, to algebra, which is less so. This allows us to obtain a geometric interpretation of algebraic equations and an algebraic interpretation of geometric phenomena, both of which help link disparate sides of mathematical knowledge. While algebra is very left brain focused, geometry is more right brain focused. This gives the student an ability to visualize arcane algebraic equations from a geometric perspective and to rationalize geometric insights from an algebraic perspective. Hence, those who seem to think that analytical geometry is archaic have failed to appreciate what this branch of mathematics brings to the table.

    However, in very few other fields do we find a rejection of the archaic. Physics students learn about Newton’s corpuscular theory of light and the ether theory of light even though both have been conclusively disproven. Chemistry students still study Dalton’s and Thomson’s atomic models even though they are now demonstrably wrong. I have mentioned four theories, each proven to be incorrect, that science students still learn to this day. And people complain about analytical geometry not because it is wrong (it is not) but because it is archaic? I wonder how many English teachers would say that English students should not study Shakespeare or Blake, Dickenson or Brontë because they are archaic and we have contemporary poets to study!

    Since I am unable to locate the post, it may be that I have misrepresented the views of the author. In that case, I will be willing to apologize. Not for the stance I take concerning analytical geometry, but for misrepresenting him/her. My stance on analytical geometry remains unwavering. It is an essential part of mathematical education and probably the only relatively easy part of mathematics that is able to bridge the gap between the two hemispheres of the brain. I hope that, through this series on conic sections, I am able to communicate just this.

  • Mathematical Parables

    Starting the Parabolic Path

    Those who know me well, know that I have two main areas of interest. Since you are reading this blog, you probably know or have surmised that mathematics is one of those areas. If you do not yet know, the other area is the exploration of my Christian faith and to fulfill that interest I blog at Purposes Crossed. If the title of this post seems strange, it is because I have intentionally combined the two areas. Obviously, this is a mathematics focused blog. Hence, the first word. For the second word, we need to understand that a parable is a pedagogical tool used widely by many teachers in the past, including Jesus.

    But why do I add this word about parables? Well, a parable is a story in which there is one main point, one focal point, one could say. And as we turn to the second of the conic sections, the parabola, we realize that this geometric figure too has one focal point. Of course, we need to ask ourselves what a focal point would mean for a geometric figure. We will get to that later. However, before we begin our study of the parabola, it pays to realize that the Greek word for parable is parabolé (, which happens to be the word used by Euclid for the parabola. Hence, it is either a strange coincidence or an ongoing Greek joke that has lasted for two and a half millennia! Could it be that Euclid wanted us to see that the parabola is a conic section with a story? What could its story be? Or could it be that he wanted us to think of parables as strangely constructed word parabolas, whatever that could mean? Let us try to determine that.

    Profligate Oddballs

    We began the series on conic sections with a study of circles. There was much more we could have covered. However, I dealt with a few ideas that are not normally taught in high school curriculums around the world. As we turn from the circle to the other conic sections, namely the parabola, the ellipse, and the hyperbola, we need to introduce a new idea, the eccentricity of the conic section. We will see that the circle is simply a degenerate form of the ellipse. The previous two sentences should give you an idea of how mathematicians couch their jargon with tongue in cheek terms. I mean, we are talking about eccentric and degenerate geometric figures as though these inanimate squiggles on the page had personalities and moralities, as though they were oddballs and profligates! We will address the first of these terms, the eccentricity, shortly. The idea of the degeneracy of a conic section will have to wait till we discuss the ellipse.

    The Eccentricity

    All non-degenerate conic sections are defined by a point and a line. The point is said to be the focus and the line is called the directrix. For any general point on the conic section, the ratio of its distance from the focus to its distance from the directrix is a constant, which is called its eccentricity. The eccentricity of the parabola is 1. A conic section with eccentricity between 0 and 1 is an ellipse, with the circle having and eccentricity of 0. Finally, a conic section with eccentricity greater than 1 is a hyperbola. Let us proceed to derive the equation of a parabola in standard form.

    The Equation of the Parabola

    Consider a point, S, given to be the focus of a parabola and a line, L = 0, given to be the directrix of the parabola. Drop a perpendicular from S onto L and let the foot of the perpendicular be X. Let us define the distance SX to be 2a. Let A be the midpoint of SX. Then AS = AX = a, which by the definition of the parabola implies that A is on the parabola. Since A is the midpoint of the perpendicular from the focus to the directrix, A is said to be the vertex of the parabola. Let A be the origin of the coordinate system and let AS produced be the positive x axis. This is depicted below.

    Then the coordinates of A are (0,0) and of S are (a,0). The coordinates of X will be (-a,0). Hence, the equation of L would be L = x + a = 0. Let P(x, y) be any point on the parabola. Let M be the foot of the perpendicular from P onto L. Then the coordinates of M are (-a, y). Then by definition of the parabola SP = PM. This gives us

    Hence the equation of the parabola with axis along the x-axis, vertex at the origin, directrix formed by x + a = 0 and focus at (a, 0) is y2 = 4ax.

    Now the latus rectum is the chord through the focus and perpendicular to the axis. Hence, the equation of the latus rectum is x = a. Substituting x = a in the equation of the parabola gives y = ±2a. Hence, the endpoints of the latus rectum are (a, 2a) and (a, -2a), giving that the length of the latus rectum is 4a.

    The Riddle of the Parameter

    When we dealt with the circle, we said that the coordinates of any point on the circle of radius r and centered at the origin would be (r cos θ, r sin θ), where θ is the angle formed by the radius joining the point to the origin and the positive x-axis. In this case, θ is said to be a ‘parameter’ and the aforementioned coordinates are said to be the parametric coordinates.

    The parabola y2 = 4ax also can be described in terms of a parameter as follows. Suppose x = at2 and y = 2at. Since (2at)2 = 4a(at2 ), the point (at2,2at) lies on the parabola y2 = 4ax for all values of t, where t is a parameter. We will discover what the significance of t is shortly.

    Equation of the Tangent

    Before we get to that, let us obtain the equation of the tangent to the parabola at an arbitrary point (x1, y1) . Differentiating the equation y2 = 4ax we get

    Hence, the slope of the tangent at (x1, y1) is 2a/y1. This means that the equation of the tangent will be

    But (x1, y1) lies on the parabola. Hence,

    So, the equation of the tangent becomes

    Substituting the parametric coordinates x1 = at2 and y1 = 2at, we get

    Condition for Tangency

    We can also obtain the equation of the tangent in terms of the slope of the line. Let the line y = mx + c be tangent to the parabola y2 = 4ax. Solving the two equations we get

    Since the line is a tangent, the above quadratic equation should have a repeated root. Hence,

    The last equation is the condition under which the line y = mx + c will be tangent to the parabola y2 = 4ax. In other words, replacing c with a/m we obtain that the line

    is always a tangent to the parabola.

    Answering the Riddle

    However, if we compare this with the parametric equation of the tangent

    we can see that the two lines will be identical if t = 1/m. This tells us that the parameter t at a point on the parabola is nothing but the reciprocal of the slope of the tangent at that point. In other words, what seemed like a parameter introduced just to satisfy the equation of the parabola, actually turns out to have a deeper significance for the parabola itself. In other words, the parameter itself describes an essential property of the parabola. Or to put it in words with which I began this post, the parameter tells a story about the parabola. It is a parabolic way of describing the parabola.

    In the next post we will look at some more properties of the parabola, including the chord of contact and the polar. If you have forgotten what those words mean, please refer to Contacting Circular Polarity. Till then, stay focused! And remain profligate oddballs!

  • Circular Family Values

    We are in the middle of a series on conic sections. We started with Finding Refreshment in the Desert, in which I introduced us to what a conic section is and demonstrated that the general second degree equation in two variables represents a conic section. After that, in Circular Tangentiality we obtained the general equation of a circle and derived the equation of the tangent to a circle at a point. Last week, in Contacting Circular Polarity, I introduced us to the concepts of the chord of contact and polar of a point with respect to a circle. And toward the end of the post I teased the reader with a mention of a ‘family of circles’. Whatever could that be?

    Recall that the equation of a circle with center at (p, q) and radius r is given by

    When this is expanded and rearranged, we get

    Recall also that the general second degree equation in two variables is

    We can see that the equation of the circle is similar to the general second degree equation in two variables with

    Hence, by convention we say that the general equation of a circle is

    with center at (-g, –f) and radius given by

    Now, suppose we are given the circles

    The radical axis of these two circles is obtained by equating C1 and C2. Hence,

    This is clearly the equation of a straight line and we can easily show that it is perpendicular to the line joining the centers of C1 and C2. Now consider the equation

    When λ = -1, this represents the radical axis. However, when λ ≠ -1, we can write the equation as

    Dividing the equation by 1 + λ we get

    This clearly is the equation of a circle. Hence, by varying the value of λ we can obtain different circles. Suppose we attempted to find the radical axis of this circle and C1. We would get

    If we focused on the coefficient of x we would obtain

    Hence, the equation of the radical axis would be

    If we divide this equation by λ/(1 + λ) we will get

    which is the radical axis of C1 and C2. In other words, any two members of this family of circles shares the same radical axis as the two circles used to generate the family.

    While the study of family of circles began, as is the case for many concepts related to conic sections, simply as an exercise of playing with the mathematics, this obscure concept has found applications in the study of paths of light and designs of tires and gears. This has been the case with many mathematical concepts that were developed without any practical applications in mind. However, as the study proceeds, quite often unforeseen applications become possible.

    There is much more that can be discussed related to the circle, even related to families of circles. However, that will take us really deep down this rabbit hole. Hence, we will leave the circle for now. In the next post, we will turn our focus to the parabola and see what properties this conic section has.

  • Contacting Circular Polarity

    Brief Recapitulation

    We are in the middle of a series of posts on conic sections. In the previous post, we derived the equation of a circle centered at the origin and having a radius of r. We also derived the equation of the tangent at a point on the circle. To jog your memory, given the circle x2 + y2 = r2, the tangent at the point (x1y1) is

    If we use polar coordinates, then the point on the circle would be (r cos θr sin θ) and the equation of the tangent would be

    Chord of Contact

    Now, suppose we have a point P ≡ (x1y1) that lies outside the circle. From this point, we can draw two tangents to the circle x2 + y2 = r2. This is show in the figure below

    Here, PQ and PR are tangents to the circle at Q and R respectively. The chord QR is called the chord of contact of the point P with respect to the circle. Let, the coordinates of Q and R be (x2y2) and (x3y3) respectively. Then it follows that the tangents at Q and R must respectively be

    However, since the tangents were drawn from P, the coordinates of P must satisfy both these equations. Hence, we must have

    Now consider the equation

    Since P ≡ (x1y1) does not lie on the circle, this equation no longer represents the tangent to the circle at P. However, it does represent the equation of some straight line. Moreover, since

    it follows that Q ≡ (x2y2) and R ≡ (x3y3) lie on the straight line. However, two points define a straight line and the only line through both Q and R is the chord QR. Hence, the equation

    must represent the chord of contact QR of P ≡ (x1y1) with respect to the circle.

    Pole and Polar

    Now suppose we have a point P ≡ (x1y1) anywhere relative to the circle. Consider a line through P that is free to pivot about P. In so doing, suppose it cuts the circle at points Q ≡ (x2y2) and R ≡ (x3y3). Now, we draw tangents to the circle at Q and R, which meet at the point S. This is depicted in the diagram below.

    Now, as the green line pivots about P, the points of intersection Q and R of the line with the circle will change. This means that the tangents at Q and R will change, meaning that the point of intersection S of the two tangents will also change. The path traced by S as the green line pivots about P is said to be the polar of P with respect to the circle and P is said to be the corresponding pole.

    Suppose the point S ≡ (h, k). Then The equation of QR, which is the chord of contact of S with respect to the circle will be

    However, QR necessarily passes through P. Hence, the coordinates of P must satisfy the equation of the chord of contact. This gives us

    Now (h, k) are simply the coordinates of any general point on the path traced by S. Hence, the equation of the path traced by S will be

    This is the equation of the polar of P with respect to the circle.

    Tangent, Chord of Contact, and Polar

    If you have been paying attention, you will have observed that the equations of the tangent at a point and the chord of contact of a point and of the polar of a point wrt a given circle is the same All have the form

    What is the relation between the three? We must keep in mind that the tangent is for a point on the circle. In this case, the chord of contact degenerates to the point of tangency and we get the equation of the tangent. Also, in the case of a point on the circle, the polar and the tangent coincide because the variable line needed to generate the polar pivots about a point on the circle. Hence, one of the tangents needed for the point of intersection remains fixed and happens to the the tangent at the fixed point, leading to the result that the tangent itself is the path traced by the variable point S.

    The chord of contact is a chord of the circle. Hence, the chord of contact is strictly speaking only the portion of the line

    that lies inside the circle. The polar is located by the point of intersection of real tangents. This point of intersection must necessarily lie outside the circle. Hence, the polar is strictly speaking only the portion of the line that lies outside the circle.

    Hence, we can see that a single equation describes three different lines. However, it is the context within which we use the equation that tells us which line is being described.

    If you wish to see how the chord of contact, pole and polar relate to a circle, you can click here for a Geogebra App that I created. With the checkbox on the top left unchecked, you can see how the chord of contact varies. Just click on the point P and drag it around! If you check the checkbox on the top left, you can explore how the pole and polar vary. Again, just click on the point P and drag it around to position the pole. Then move the slider for α to change the orientation of the line through P. This will change the positions of Q and R and, therefore, the purple tangents at those two points. The point of intersection of these two tangents will move, tracing the green line, which is the polar.

    If you check and uncheck the checkbox, you will notice that the green line remains fixed since it is the line that represents both the chord of contact and the polar of P with respect to the circle. However, as mentioned earlier, the chord of contact is strictly the portion of the line inside the circle while the polar is the portion outside the circle.

    An Unfortunate Situation

    The chord of contact and polar are not concepts that are normally taught in schools. Both of them require a certain amount of ‘movement’, for want of a better term. More so in the context of the polar, which has not just a rotating line, but also moving points of intersection of that line with the circle, moving tangents at those points and moving point of intersection of those tangents. Due to this, I can understand why both these concepts were no included in the syllabus when I was in high school. It would have required a phenomenal effort on the part of the teacher to draw reasonably accurate diagrams with incremental changes. As someone who is unable to draw to save his life, I can understand why these ideas would have placed a huge burden on the teachers.

    However, now we have readily available tools, like Geogebra, that allow us to create interactive applets to explain pretty much any concept. Hence, there is no reason why concepts like these that would develop the spatial and abstract reasoning of the students cannot be included in the syllabus.

    However, most syllabuses today seem to be bent on cutting down on the amount of pure geometry and analytical geometry that is included. This, in my view, is unfortunate. The problem is that I don’t think most people developing these syllabuses have a clue about some of these concepts. Moreover, there is a tendency to think that pure geometry, and hence analytical geometry, has very little application.

    Now, from earlier posts in this blog, you should be quite aware that I am not too keen on including concepts in the syllabus based primarily on whether there are applications for the concept. Nevertheless, while the chord of contact and polar may seem to be arcane ideas restricted to analytical geometry, this is far from the truth. They are used to solve optimization problems for paths traced by linkages, analyze the center of percussion for rigid bodies, and framing inversive transformations in complex analysis, among other things. It is not possible for me to delve into those areas in this blog. However, in the next post, we will deal with another rarely taught concept related to circles – the idea of a family of circles, whatever that may be.

  • Circular Tangentiality

    Defining a Circle

    Three weeks back, I started a series on conic sections. I did not post the last two weeks due to being ill. We continue today with a study of the first conic section – the circle. Just as a reminder about conic sections, take a look at the figure below.

    Conic sections (Source: Wikipedia)

    As is clear, a circle is the conic section obtained when a cone is cut by a plane that is at right angles to its axis. From the symmetry of the situation, it is clear that every point on the circle will be a fixed distance from the axis. Hence, when defining a circle in two-dimensions, we normally state, “A circle is the locus of a point that moves such that its distance from a fixed point is constant.” Here, the ‘locus of a point’ is the path traced by a point as it moves while satisfying one or more conditions. In this case, the condition is that the distance from a fixed point is constant. In particular, the fixed point is called the center of the circle while the constant distance is the radius.

    The Equation of a Circle

    Suppose we have a circle with radius r and center at the point (p, q). If a point (x, y) lies on the circle, then by definition the distance of this point from the center must be r. This gives us

    When we expand and rearrange, we will obtain

    We can see that this is similar to the general second degree equation in two variables

    with

    Recall that, from the previous post, a general second degree equation in two variables describes a conic section. Since the equation we have obtained for the circle satisfies the conditions for such a general equation, our intuition that the circle is a conic section is validated.

    We can also see that, if the center of the circle is at the origin, its equation will reduce to

    Tangent at a Point

    In our exploration of complex numbers, we determined that we can use polar coordinates to define the usual Cartesian coordinates. When we do this we get

    Now, let us try to obtain the equation of the tangent to a circle at a point on the circle. Using Cartesian coordinates, we can say that a general point is (x1, y1) where

    Now, it is known that the radius at a point on a circle is perpendicular to the tangent at that point. But how do we prove this? The traditional proof is by contradiction. Supposed we have a circle with center at O. Now let’s draw a tangent at the point A on the circle. Suppose radius OA is not perpendicular to the tangent. This means that there must be another point B on the tangent such that OB is perpendicular to the tangent. This means that triangle OAB is right angled at B. Hence, OA is the hypotenuse, meaning that OA > OB. However, since B is a point on the tangent other than the point of tangency (i.e. A), B must be further away from O than A is from O. This means that OB > OA. Since this leads to a contradiction, our assumption that the radius is not perpendicular to the tangent must be incorrect.

    What would the equation of the tangent be at the point (x1, y1)? We will derive this in three different ways to show that the body of mathematical knowledge is internally consistent.

    Derivation 1: Using Coordinate Geometry

    Let us begin with the circle

    The center is at the origin (0,0). Consider the point (x1, y1) on the circle. The radius will have a gradient (slope) equal to y1/x1. This means that the gradient (slope) of the tangent is –x1/y1. Hence, the equation of the tangent must be

    However, since the point (x1, y1) is on the circle, it follows that

    Hence, the equation of the tangent becomes

    Derivation 2: Using Quadratic Equations

    Now let us suppose a line through (x1, y1) with gradient (slope) of m is tangent to the circle. Then if we try to solve the equation of the line and the circle by eliminating y, we should obtain a quadratic equation in x. Now, the line through (x1, y1) with gradient (slope) of m is

    Substituting this expression for y in the equation of the circle we get

    Since the line is a tangent, this quadratic equation must have only one solution, meaning that its discriminant must be zero. Hence, we get

    Since we have obtained the same value of the gradient (slope) as earlier, the rest of the solution proceeds as before and we will obtain the same equation for the tangent.

    Derivation 3: Using Calculus

    Now suppose we proceed by differentiating. In that case, we obtain

    Now if we substitute the coordinates of the point of tangency, we will obtain

    Again the gradient (slope) is the same as obtained earlier. Hence, we will obtain the same equation for the tangent.

    Equation of the Tangent

    In all three cases, we obtained that the tangent at the point (x1, y1) to the circle x2 + y2 = r2 is

    If we substitute the polar coordinates we will get

    Here it is important to remember that θ is the angle made by the radius at the point (x1, y1) ≡ (r cos θ, r sin θ) and the positive x axis. This is depicted in the figure below

    What this equation tells us is also the distance of the tangent from the origin, which happens to be r as expected. In other words, when expressed in polar form, the equation of any tangent to a given circle will lie at a fixed distance from the origin.

    The Road Less Travelled

    There is much more to student about the circle. However, I do not want this to simply be a regurgitation of things the readers were taught in school. Hence, in the next post, we will look at two not to common ideas related to curves in general and conic sections in particular. We will introduce them in the context of the circle, which is the conic section most of us are already familiar with. This will allow us to expand on these ideas when we deal with the other conic sections.

  • Finding Refreshment in the Desert

    A Forty Year Fascination

    I was in the eleventh grade when I was introduced to a part of analytical geometry (or coordinate geometry, if you wish) that would fascinate me over the years. Don’t get me wrong. When I first learned about this aspect of analytical geometry, which is now almost forty years ago, I did not give it much thought because the teacher I had made the subject so dry that it was a educational desert! All he did was rattle off formulas and derivations without pausing even a bit to explain the significance of what he was doing. And if someone like me, who really enjoyed mathematics, was bored out of his wits in those classes, I can only imagine how akin to some fantastical ‘undead’ many of the other students were driven to become.

    Seashell. Woodcut, 1919 or 1920 by M. C. Escher (Source: M. C. Escher Collection)

    The problem with mathematical education in high school, as I have explained elsewhere, is that the syllabus is bloated. It includes far too much that is superfluous or unnecessary for the development of mathematical skill and intuition. As a result, some parts of the syllabus, like the one I’m writing about, which combine different areas of the syllabus in a lyrical dance of mathematical insight, are often given short shrift, probably because the teacher himself/herself has failed to learn the dance steps.

    I am, of course, referring to the study of conic sections. I am grateful to many of my own students who asked me many questions and served as catalysts to fuel the love for this part of mathematics in me. This love, however, was first kindled when I was in the second year at the Indian Institute of Technology, Bombay. That was in 1988-1989 and I was doing a project for a professor on Synthesis of Coupler Curves. Anyway, my study of coupler curves led to an appreciation of the conic sections, which later helped me when I myself began to teach.

    Some of you may be wondering what a ‘conic section’ is. Fair point. It’s not a mathematical term you often come across. In fact, unfortunately, there are many syllabuses around the world in which students study the conic sections without being told that they are studying things that are called conic sections. So, what is a conic section? Simply put, a conic section is a two-dimensional curve obtained by intersecting (hence ‘section’) the surface of a cone (hence ‘conic’) with a plane. That is, if you cut the surface of a cone with a plane, the resultant boundary of the surface of the cut cone is a conic section. This is shown in the figure below.

    Conic sections (Source: Wikipedia)

    As the figure indicates, the commonly referenced conic sections are the circle, the ellipse, the parabola, and the hyperbola. I would add pair of lines to this, formed by sectioning the cone through its apex. In this series of posts I wish to study these five conic sections, introducing us to their properties and idiosyncrasies. In this post I simply want to deal with one issue – how do we represent a conic section analytically?

    Enter the Equation

    The teacher who first introduced me to the conic sections did not tell me and my classmates this. He simply told us that the general second degree equation in two variables is the general equation of a conic section. So he wrote

    on the board, gesticulated at it and told us that this was a general second degree equation in two variables and that it represented all conic sections. Of course, if you give it some thought, you will realize that, since there are two variables (x and y), this was an equation in two variables. Further, since the highest power of either variable (x2 and y2) or variables combined (xy) was 2, this was a second degree equation. Finally, since it contains all possible terms containing x and y with degree not greater than 2, this was the most general form of a second degree equation in two variables.

    But this did not explain why this represented all the conic sections. When I asked my teacher, he just repeated that it did! I was quite frustrated at having my curiosity snubbed in this manner. I laid the matter to rest and picked it up only many years later when a student asked me the same question. Given that I had been snubbed for asking the question, I could not do the same to m student. So I decided to understand why the above equation represents conic sections.

    Unfortunately, none of the resources available explained the why behind the fact that the equation represented conic sections. So I decided that I would undertake this exploration myself. And I realized that the answer is actually quite simple. It is so simple that the resources must either simply be taking this understanding for granted or have themselves not bothered to understand why the equation represents all possible conic sections in the two-dimensional plane. So let us provide some clarity.

    Understanding Intersection

    We begin with the equation of a straight line. We can write it as

    Here, m is the slope or gradient of the line and c is the y-intercept. What this equation tells us is that, if we are given the value of x, in order to find the corresponding value of y we need to multiply x by m and then add c. This will give the point (x, y) that satisfies the equation. Since the above equation is linear, that is the highest degree of the variables is 1, it represents a straight line. Hence, ‘linear’. Now, what if we have two lines, say

    These are two linear equations. If we find the values of x and y that satisfy both equations, we are said to ‘solve’ the system of simultaneous equations. However, looking at this from the perspective of geometry, this mean that the solution x and y represents the point (x, y) that lies on both lines. This is a remarkable insight. What this tells us is that, if we have two curves that can be represented by

    then solving the equation

    will give us the values of x that satisfy both equations simultaneously. From a geometric perspective, the solutions represent the x coordinates of the points that are common to both curves. Of course, having obtained the values of x, we can use either equation to obtain the corresponding values of y. Suppose we obtain solutions

    We can substitute these values of x into either equation to obtain the values of y as

    This means that the points

    are common to the graphs of both functions. They are the points at which the graphs of y = f(x) and y = g(x) intersect. In the language of conic sections, the graph of y = g(x) sections the graph of y = f(x) at the points listed above. Let’s keep this in mind for later when we start to section the cone with a plane.

    Equation of a Plane

    When we move to the plane and the cone, we need to describe them in a three-dimensional space. Let us first obtain the equation of a plane in general form.

    Suppose we have just one number line. We can represent one variable, say x, on this number line. If we had the equation ax = b, this would represent a point. In other words, when we have a one-dimensional ‘space’, that is the single number line, a linear equation represents a zero-dimensional object – a point.

    When we consider our normal two-dimensional space with x and y axes, we understand that an equation of the form ax + by = c represents a straight line. In other words, when we have a two-dimensional ‘space’, that is a plane formed by two number lines, a linear equations represents a one-dimensional object – a line.

    We can extend this and conclude that, given a three-dimensional ‘space’, that is one in which we can represent three variables, x, y, and z, then a linear equation ax + by + cz = d will represent a two-dimensional object, that is, a plane.

    Equation of a Cone

    Now that we have obtained the equation of a plane, let us do the same for a cone. What exactly is a cone? Most definitions online are absolutely useless because they define a cone in terms of what we know about a cone, namely that its surface is conical! Well, no resource actually has such an evidently circular definition. However, given the actual words they use, they might as well have had such a circular definition.

    Technically a cone is obtained by fixing one point on a straight line and rotating it about that point at a constant angle with respect to a fixed straight line passing through the fixed point. The fixed point is then the apex of the cone and the fixed straight line is its axis.

    Let us consider a cone with apex at the origin and axis being the z-axis. Consider the generating line to make an angle θ with the z-axis. Then a point on this line with z coordinate of z, will have a distance from the z axis equal to z tan θ. Now when the line is rotated about the z axis, the point will trace a circular path with center on the z axis and radius equal to z tan θ. In other words, if the moving point has x and y coordinates equal to x and y respectively, then

    Sectioning the Cone

    Suppose we replace tan θ with t, then the equation above becomes

    Suppose we are sectioning the cone with the plane given by

    We can rearrange this as

    Substituting this expression for z in the equation of the cone we get

    We can expand the square to get

    Multiplying the whole equation by c2 and grouping terms we get

    If we make the substitutions

    the solution equation becomes

    Search for Oases in the Desert

    Apart from the upper case letters, it is clear that this equation is identical to the earlier general second degree equation in two variables. In other words, when we solved the equation of the cone with that of the plane, what we obtained was the general second degree equation in two variables. Since the process of solving two equations gives the points that satisfy both equations, this means that solving the equation of the cone and the plane will give the points that lie on the cone and on the plane. However, the points that lie on the cone and the plane are precisely the points on the surface of the cone that are sectioned off by the plane. In other words, the solution of the two equations must represent the sectioning of the cone, that is, the conic section.

    Of course, there are six parameters (A, B, C, F, G, and H) that will affect the kind of conic section as well as the size, position and orientation of the resultant conic. In this series we will attempt to study each of the conic sections in the hope that this study will prove to be an oasis in the desert that analytical geometry can often be. We will continue the search for these oases in the next post, when we will look at the conic section that every student learns about in school – the circle.

  • Infinite Regress

    Arithmetic Blasphemy

    Infinite regress ambigram (Source: Punya Mishra)

    In one of the first posts in this blog, My Unbounded Mathematical Trauma, I had shared about my trauma when students says something like, “One divided by zero equals infinity.” Unfortunately, this is something that I have heard repeated by mathematics teachers too. And I shudder when I hear such blasphemy! I say ‘blasphemy’ because division by zero is the ‘unforgiveable sin’ in mathematics and a statement like “One divided by zero equals infinity” gives the impression that it is possible to divide one by zero and obtain a number.

    However, if it is possible to divide one by zero, then the result must be a number since operations performed on numbers must yield numbers. If the result is infinity, where do we locate it on the number line? After all, the number line is a mathematical artifact that is supposed to be a visual representation on which we can locate every real number. If we are able to locate infinity on the number line, then either that must be where the number line ends, meaning that the number line itself is not unbounded, or there are numbers to the right of infinity, meaning that infinity is not something that is greater than any number.

    What we realize is that infinity is not a number, but an idea, a construct, if you will, to denote unboundedness rather than quantity, which is what every number must denote.

    Strange Infinite Series

    When we take this to the area of infinite series, we have seen that there are series that converge and those that diverge. While I haven’t discussed tests for convergence either formally or comprehensively, I think posts like Infinitely Expressed and Serially Expressed have given us a reasonable idea that infinite series need to have certain properties for them to be convergent.

    Ascending and descending by M. C. Escher (Source: Escher in het Paleis)

    With this in mind, let us consider the series

    S = 1 – 1 + 1 – 1 + 1 – 1 + 1 – 1 +…

    This is an infinite series. What is its sum? Does it converge? If so, to what value does it converge?

    Now, we can add grouping symbols that should not alter the sum. Hence, we can have

    S = (1 – 1) + (1 – 1) + (1 – 1) + (1 – 1) + … = 0 + 0 + 0 + 0 + … = 0

    Alternately, we could have

    S = 1 – (1 – 1) – (1 – 1) – (1 – 1) – (1 – 1) – … = 1 – 0 – 0 – 0 – … = 1

    So grouping the terms changes the sum! Of course, we could use the formula for the sum of an infinite geometric series with the common ratio being -1. This would give us

    S = 1 ÷ (1 – (-1)) = 1/2

    We could have obtained the same result as follows

    S = 1 – (1 – 1 + 1 – 1 + 1 -…) = 1 – S ⇒ 2S = 1 ⇒ S = 1/2

    Clearly adding up the terms of this series is not straightforward. There is no clear indication that it diverges. But there is no clear value to which it converges. However, if we consider the partial sums of the series we get the following:

    S1 = 1, S2 = 0, S3 = 1, S4 = 0, S5 = 1, S6 = 0

    Since the partial sums alternate between 1 and 0, we say that the series is alternating.

    However, we can get even curiouser results with such infinite series. Consider the following

    S = 1 – 2 + 3 – 4 + 5 – 6 + …

    Here, the partial sums are as follows

    S1 = 1, S2 = -1, S3 = 2, S4 = -2, S5 = 3, S6 = -3

    If we group the terms we can get

    S = (1 – 2) + (3 – 4) + (5 – 6) + … = -1 – 1 – 1 -…

    It is clear that this way the series diverges toward negative infinity. However, we can also group the terms as follows:

    S = 1 – (2 – 3) – (4 – 5) – (6 – 7) – … = 1 + 1 + 1 + 1 + …

    In this case, the series diverges to positive infinity! Of course, the partial sums did indicate this since they alternated in signs while increasing the magnitude.

    Since all the methods give us different answers, what we realize is that, with divergent or alternating series, grouping of terms does not help a bit.

    Of course, infinity leads us to some additional strange results. For example, we know that the set of natural numbers is

    N = {1, 2, 3, 4, 5, 6, …}

    We also know that the set of even numbers is

    E = {2, 4, 6, 8, …}

    However, for every natural number, it is possible to multiply it by 2 and obtain an even number. In this sense the set mapping elements in N to elements in E can be listed as

    M = {(1,2), (2,4), (3,6), …}

    It is clear that no element in N will be excluded and that every element in E will be included. In other words the number of elements in N is the same as the number of elements in E even though it is clear that E excludes the odd numbers!

    The Infinity of Rational Numbers

    In fact, even the rational numbers can be mapped to the natural numbers. Let us arrange the rational numbers in a grid as shown below.

    Here, the numerator corresponds to the column while the denominator corresponds to the row. Hence, the element in row 5 and column 3 will be 3/5. Of course, we can have rational numbers that are equal but appear in different cells. This is because we have equivalent fractions. These are indicated above with the same color. Hence, all numbers in red are equivalent to , while all in light green are equivalent to 2, and so on. Now, since each row and each column has an infinite number of elements, we cannot simply go down a row or a column and hope to reach all the rational numbers. For example, if we simply choose to go down the first row, which simply includes all natural numbers, we will never get to row 2! However, as can use a process of diagonalization as shown below

    Here we go down the diagonals in the order indicated by the numbers on the left. With this order, the rational numbers are reached in the following order

    If we eliminate all the equivalent fractions, we will get

    Since the numbers are being reached diagonally, every cell will be reached after a finite number of moves. It is crucial to recognize that the numbers are not arranged in any numerical order. If we include negative numbers, we could do something like

    Hence, we can map every rational number, without repetition, to the natural numbers, meaning that the number of rational numbers and the number of natural numbers is the same!

    This is obviously counter-intuitive. After all, suppose we consider two consecutive integers, say 1 and 2 for convenience. We can generate rational numbers between 1 and 2 as follows

    This means that between any two consecutive natural numbers there are infinitely many rational numbers. Despite this, as we have shown, it is possible to map every rational number to a natural number. Let the weirdness of this sink in. Though there are infinitely many rational numbers between any two consecutive natural numbers, there is a way of mapping every rational number to a natural number.

    Brief Introduction to Cardinality

    The property that between any two rational numbers it is possible to generate at least one more rational number between them constitutes the set of rational numbers as a ‘dense’ set. However, it is clear that the natural numbers are not ‘dense’ since there is no natural number between consecutive natural numbers. Hence, it is strange that a one to one mapping from the rational numbers to the natural numbers exists. In set theory, all sets that share this one to one mapping with the natural numbers are said to have the same cardinality of ℵ0, aleph-nought. Here, ‘cardinality’ gives us a measure of the ‘size’ of the set and ℵ0 is the ‘smallest’ cardinality that a set with infinite members can have. Other sets that have cardinality of ℵ0 are integers, square numbers, cube numbers, constructible numbers, algebraic numbers, etc. If we take the union of all these sets, even this union set would have cardinality of ℵ0 since we can rotate between different sets, ignoring any duplications along the way.

    However, cardinality of the real numbers is different because they include other numbers, like the transcendental numbers, making it impossible to map the real numbers to the natural numbers. The cardinality of the real numbers is hypothesized to be ℵ1, the next larger element in the set of alephs. As of now, this is still a hypothesis.

    The Perils of Infinity

    What we can see is that ‘infinity’ is much weirder than we might ever have imagined. Not only have we seen that it is possible to map one ‘infinity’ (the natural numbers) to an ‘infinity of infinities’ (the rational numbers) but also that there are ‘infinities’ that are ‘infinitely larger’ than the intuitive ‘infinity’ obtained simply by counting upward without end. When students and (shudder) teachers treat ‘infinity’ in a trivial manner as though it were simply a very large number that can be located on the number line, for example by repeating or condoning statements like, “One divided by zero equals infinity,” they reveal that they have failed to understand the fact that the term ‘infinity’ is not used to denote a number but a conglomeration of idea that is itself ‘infinitely’ rich.

  • The Seedbed of Mathematical Fecundity

    Playing around with numbers is something that many, if not most, people who are fascinated with mathematics engage in. Most of the playing around does not lead to anything ‘productive’ in the sense that the results could find some practical application in real life. However, this does not stop people from just playing around with numbers. In a few cases, the playing around leads to some insights, as we saw in Arithmetic Doodling. In some cases, the insights seem so profound or intriguing that the person who reaches these insights decides to formalize it in a form of a conjecture.

    (Source: xkcd)

    In mathematics a conjecture is a statement that is offered without a proof. Since there is no proof, a conjecture is not taken as true. Conjectures are normally the result of uncountable hours of play with numbers. It is, in fact, impossible to start with a conjecture since one must have sufficient evidence to formulate a hypothesis, which would then require many more trials to test the hypothesis before a ‘dabbler’ or ‘doodler’ feels confident enough to propose a conjecture.

    Once a conjecture is proposed it invites others to join in the play time by pooling their resources to proving or disproving the conjecture. Hence, conjectures have often been seedbeds for the development of other insights into mathematics and the properties of numbers. Indeed, if the conjecture is tantalizing enough, it can provide grounds for fruitful exploration of mathematics. Here, I wish to discuss two conjectures that have fascinated me for years.

    The Goldbach Conjecture

    On 7 June 1742, Prussian mathematician Christian Goldbach wrote a letter to Leonard Euler. In this letter he proposed the following: “Every integer that can be written as the sum of two primes can also be written as the sum of as many primes (including unity) as one wishes, until all terms are units.” At that time, 1 was considered to be a prime number. However, in Cooking the Books I discussed why it is no longer considered to be prime. Goldbach’s initial proposition is quite puzzling. I means, every integer can be written as the sum of a series of ‘1’s! Of course, Goldbach himself recognized this and, later in the same letter, wrote, “It seems that every number that is greater than 2 is the sum of three primes.” Removing the idea that 1 is a prime number, we get the following, contemporary version of the conjecture: “Every even counting number greater than 2 is equal to the sum of two prime numbers.”

    Sums of primes yielding even numbers. (Source: Wikipedia)

    We can check these for some even numbers. For example, 4 = 2 + 2, 6 = 3 + 3, 8 = 3 +5, 10 = 3 + 7 = 5 + 5, 12 = 5 + 7, 14 = 3 + 11 = 7 + 7, and so on. As we consider larger even numbers, we are also reaching regions of the number line where the prime numbers are more sparse. Hence, it would seem that the conjecture could break down for sufficiently large even numbers. However, as of 2013, the conjecture has been shown to hold for prime numbers as large as 4×1017.

    (Source: xkcd, for explanation see here.)

    Why would this be important? What kind of use could the conjecture be put to if it could be proven? Today, most methods of encryption rely on large primes. For example, if I gave you the number 19939 and you knew that this was the product of two primes, then a short search would yield 127× 157 = 19939. If I gave you 2991119, you would take a bit longer to realize that 1549 × 1931 = 2991119. If we instead used two large even numbers, the key could be enhanced to involve appropriate factorization of the product and then selecting the larger prime of the Goldbach decomposition of the larger even factor and the smaller prime of the Goldbach decomposition of the smaller even factor (or the other way around), thereby increasing the complexity of the decryption.

    The Collatz Conjecture

    The second conjecture I wish to deal with was proposed by German mathematician Lothar Collatz in 1937. The conjecture involves a starting number and operations performed on the number to get the next number in the sequence. If the number is even, it is divided by 2. If the number is odd, it is multiplied by 3 and 1 added to the result. The question is: Does the sequence reach 1 no matter what the starting number is?

    Depiction of Collatz sequences. (Source: Cantor’s Paradise)

    For example, starting with 15, we get 15, 46, 23, 70, 35, 106, 53, 160, 80, 40, 20, 10, 5, 16, 8, 4, 2, 1. Similarly, starting with 29, we get 29, 88, 44, 22, 11, 34, 17, 52, 26, 13, 40, 20, 5, 16, 8, 4, 2, 1. And starting with108 we get 108, 54, 27, 82, 41, 124, 62, 31, 94, 47, 142, 71, 214, 107, 322, 161, 484, 242, 121, 364, 182, 91, 274, 137, 412, 206, 103, 310, 155, 466, 233, 700, 350, 175, 526, 263, 790, 395, 1186, 593, 1780, 890, 445, 1336, 668, 334, 167, 502, 251, 754, 377, 1132, 566, 283, 850, 425, 1276, 638, 319, 958, 479, 1438, 719, 2158, 1079, 3238, 1619, 4858, 2429, 7288, 3644, 1822, 911, 2734, 1367, 4102, 2051, 6154, 3077, 9232, 4616, 2308, 1154, 577, 1732, 866, 433, 1300, 650, 325, 976, 488, 244, 122, 61, 184, 92, 46, 23, 70, 35, 106, 53, 160, 80, 40, 20, 10, 5, 16, 8, 4, 2, 1

    The Collatz conjecture has been shown to hold to 2.36×1021. However, as of today, no proof has been forthcoming. Indeed, no one has even suggested what its applications might be should the conjecture be proven at some stage. Despite this, the Collatz conjecture highlights the complex behavior of numbers that can be obtained using the simplest rules.

    At its roots, the Collatz conjecture asks the question, “Can repeating the same sets of operations in an iterative manner always lead to the same result?” The fact that it is impossible to predict how many iterative steps it will take for a given starting number makes the Collatz conjecture particularly intractable.

    An Invitation to Play

    The two conjectures reveal how playing with numbers can yield deep insights. The Goldbach conjecture concerns the properties of numbers themselves. While it may have started in Goldbach’s mind as a simple though experiment, perhaps even a game, it has developed into something quite serious with possible applications in the protection of information. The Collatz conjecture concerns the properties of the basic operations on the numbers. It is still in the playful stage, with no real applications even being suggested. We can tweak the numbers to see if there are Collatz-like combinations of numbers and operations that share similar properties.

    Both conjectures germinated in the ‘what ifs’ asked by the mathematicians to whom they owe their existence. Both are extremely simple to state. Both are quite straightforward to test for any given starting number. However, their simplicity belies the deep complexity at the heart of the conjectures.

    Since conjectures are not theorems, they are not normally given the high place on a pedestal that theorems are given. However, once a theorem is proven, there normally isn’t much of a rush to find a different proof. There are, of course, some notable exceptions to this. However, conjectures, by their very nature, invite mathematical exploration. They present something that is still not settled and invite the dabbler or doodler to attempt to settle the matter one way or the other. I have briefly discussed just two such intriguing conjectures. I just hope I am alive when one of them is either proven or disproven.

  • A Mechanism of Domination

    Starting the Critique

    (Source: Euroschool)

    I don’t want you to get the wrong idea either while or after reading this post. At the risk of sounding like I’m blowing my own trumpet, let me just state that I have a really good test taking temperament. I have an excellent memory and am able to think on my feet. Moreover, I am rarely flustered by high pressure situations. However, I have reached a stage in life where I think that tests and exams are detrimental to student learning and health.

    During my career spanning a few decades now, I have encountered students who were exceptionally brilliant. However, closed spaces and proximity to others gave them the heebie jeebies, which adversely affected how they fared during testing conditions. Someone may argue that most exam boards now have accommodations for students with special needs, be these learning needs or mental health needs. This is true. However, in many parts of the world, special needs are either not diagnosed effectively or are too expensive for the school to accommodate. If a school only has funds for one designated exam room, then expecting it to provide a separate room for each student with different needs is not just ridiculous, it is unjust.

    Red Herrings

    Some people may say that exams and tests help students develop time management, stress management, and critical thinking skills. I find these claims to be quite disingenuous. First, if the testing environment itself is generating the stress, then it’s not an environment that is developing stress management skills. That’s like saying that a bully helps you develop the skills to stand up for yourself! Indeed, this argument fails to address the issue of stress itself. Why are we okay with environments that create stress? After all, to say that a student must develop stress management skills in the context of exams is to say that the kind of stress developed in an exam is like what we would experience outside the exam. But this is evidently not true. I have experienced many stressful exams, but none of them were in any way helpful in managing the kind of stress that visits us uninvited in life. Indeed, in the real world, many stressful situations can be diffused precisely by walking away, something that is just not available in an exam.

    Second, I do not know of any situation in the real world, apart from catching a flight or a train or making it for an interview or making an exquisite dessert, where management of time for a period of about 2 hours makes or breaks your future. Time management can be learned in many other ways. In fact, asking someone to make a batch of brownies would be a better way of teaching time management since you have to ensure that the butter is hot enough to stay melted but not so hot as to scramble the eggs when you break them into the batter! The idea of creating an artificial environment in which time management is critical is in my view only a way of exercising power over the student who has no say in the matter.

    Third, exams may help people develop the ability to think on their toes. However, flying by the seat of one’s pants is not critical thinking! That is winging it. Critical thinking must include the ability to reflect on the solutions one has reached and to critique them dispassionately with the intention of discarding unrealistic solutions and improving inefficient ones, which is not possible in the context of an exam. Asking someone to think clearly while they have a gun to their head is not developing critical thinking skills, but allowing survival responses to rule the day.

    Elevating Mediocrity

    However, the problems with tests go much deeper. As mentioned, the environment created is artificial. Each candidate is expected to think on their own within a limited time with no opportunity of seeking the wisdom and guidance of those who have gone before them. This creates a mentality in which collaboration, which is critical in the real world, is thrown into the rubbish heap and declared to be off bounds. Further, an exam, in which no breaks are permitted, is not how anything in life works. In a day at work, most people will have time for a coffee break or lunch with colleagues and friends. These breaks rejuvenate the mind and can even stimulate it. They allow us to step away from the problem at hand, which is often essential to getting the creative juices flowing again. This is proscribed in the exam environment. However, expecting someone – even a strong introvert like me – to sit for 2+ hours on a series of disconnected tasks, is a recipe for mediocrity, not excellence, because the creative process is not allowed to develop and flourish. It conveys the idea that problems can be fixed by a single person in a matter of minutes. It is no wonder that patience and perseverance are in short supply these days. In my view, any real world problem that is worth solving requires investment of time and collaboration between humans that the exam system belies.

    Choosing Conformity

    “Viewing the Pass List” Qiu Ying (仇英) (attributed) (Source: Wikipedia)

    Of course, we can understand that this would be the case if we just inquired about the origins of the exam system. The earliest known system of exams was the Imperial Examination, a civil examination administered in Imperial China for the purpose of selecting candidates who would serve in the state bureaucracy. A system of formal exams was introduced in England in the early nineteenth century, again with the intention of selecting civil servants. When the British took over in India, they introduced similar exams, all with the purpose of selecting civil servants.

    Now, let us ask ourselves a very pertinent question: Do governments select civil servants for creativity or conformity? Quite obviously the latter. We do not want civil servants to be engaged in creative bookkeeping or flights of fancy while interpreting the law of the land! No! In such situations, we desire and demand strict conformity. And there is no denying that exams conducted by governments are good at selecting candidates who will conform.

    However, human flourishing will not happen if conformity is given pride of place. Indeed, we almost certainly ensure that the creative impulses in most people are quenched when we tell them that the straitjacket of exams is the way to move ahead in life. It is no wonder that so many students emerge from high school and college completely jaded. After all, if we have killed in them what made them human, namely the impulse to create and speculate and imagine, then we should not be surprised if they emerge with a cynical view of the world.

    Lipstick on a Pig

    However, let us be brutally honest. The exam system was developed by governments to select people from the citizenry who would best execute and enforce what the government wanted in the nation. They were never designed to promote creative thinking or out of the box imagination. They were intended to be ways in which the nation controlled the masses, not empowered them. This is why memorization of arcane facts, accuracy with basic arithmetic, and repetition or identification of rules and regulations feature prominently in most exams.

    Exams may have changed their face today. Many do not require mere memorization and regurgitation of facts. Many include attempts at promoting creativity. Many include attempts at introducing real world application. However, creativity cannot be dictated by the clock and application often requires a good night’s sleep before the lightning bolt of an idea strikes. Of course, exams still deny the fact that we are human and can develop as humans only in collaboration with others. Expecting someone to think of a bunch of new ideas thrown at them at random in a span of a couple of hours is inhumane. However, if control is what we desire, then exams are the best way to ensure it. And so, all the sweeping changes in exams over the past decades are in reality nothing more than putting lipstick on a pig. The changes are superficial precisely because what the exams are intended to do cannot be aligned with the needs of most humans.

    Control Mechanism

    You may ask me what the way ahead might be. If I am so strongly opposed to exams, how do I think we should assess students? The question presupposes that assessing students is necessary. But what are we assessing them for? And what is the purpose of assessment? Why are high school students, for example, expected to write a series of exams toward the end of their program? Why is the student who just wants to become a field anthropologist expected to display competence in business or mathematics? Why is the student who wants to become an artist expected to demonstrate acumen in biology or geography? Why is the student who wants to become a hairdresser expected to show knowledge of physics or history? Why is the student who wants to undertake biochemistry research expected to indicate skill with economics or a second language? We have straitjacketed programs that do not actually serve most students. And then we complain that they lack motivation and throw up our hands in wonder.

    But someone may say that kids really don’t know what they want to do in life. Hence, we must give them exposure to a wide variety of options. This is simply a way of saying that we do not actually wish to invest the time and effort to mentor each child so that he/she can discover what most motivates him/her. We only want to give them a superficial overview of a whole gamut of human knowledge, but nothing to such a depth as would actually capture their imaginations and fire up their spirits. Expecting every student to learn the same thing is simply a way of throwing in the towel and reneging on our responsibility, as the adults in their lives, of providing meaningful guidance.

    However, if we devote time with each student to help them discover what drives them, we will realize that exams are precisely not the way to do it. Indeed, nothing that provides an external reward, in this case grades or college admissions or a job, can ever fire up the spirits. Our spirits, you see, are not need driven. The basic animal needs – food, shelter, clothing – do not inspire the spirit. Hence, anything that promises to put food on the table or a roof above our heads or clothes on a backs can never become something that captures our imagination. This is why most people consider their jobs just a drudgery to be endured, something that provides options for the weekend. However, what the dichotomy between the week and the weekend develops is the sense that the latter exists just so that we can ‘get away’ from what the former involves. This causes confusion because, with this paradigm, we are always unsure which our real life is, the five or six days of toil or the day or two of getting away from it.

    Straitjacket curriculums culminating in exams are designed to produce people who will conform to this dichotomy and confusion. They are designed precisely to ensure that most people will spend most of their time disliking what they are doing but doing it do that they have some respite from the drudgery for one or two days of each week. In other words, straitjacket curriculums and exams are precisely parts of a system of domination to ensure that very few of us will ever be able to truly flourish by enjoying what we do day in and day out.

    Breaking Free

    What I advocate is a system of mentorship between a student and a mentor in which the vast majority of adults play the part of mentor to some child or the other. The mentor is not a ‘know it all’ but someone who knows of people who have expertise in a wide variety of fields. If a student wants to learn about colonialism in India, the mentor can direct her to an expert. If the student wants to learn about the physiology of a horse, the mentor points her to an expert. In this way we get the modern equivalent of “it takes a village to raise a child.”

    What this means is that industry takes the onus of training the next generation of young learners. They do not merely profit from the investment of parents and teachers and schools over the years, but must actually contribute to the learning of the students who choose to be pointed in their direction. Hence, if a student expresses the desire to learn about designing cars, the mentor points her to an automobile company. And the company is expected to take the student on and show her how cars are designed. If another students wants to learn to cook Korean food then the mentor directs him to a chef who then teaches him the art of making Korean food. I expect there will be push back from many people here. Why should companies play such a role. Well, in the past, a blacksmith who hoped to expand his trade would have taken on an apprentice and shown him the ropes of the blacksmithy. The blacksmith would have invested the time and effort to pass on his skills to the apprentice. Those who hope to benefit from the skills of someone should be expected to contribute to the development of those skills in that person.

    Someone may say that this is impractical. Why? Are we not motivated enough to provide meaningful guidance to the next generation of humans, the ones who will carry the torch after our candles have been snuffed out? Some may say that it makes things very complicated. Of course it does! Each child is different! How can we say that a straitjacket curriculum will ever be beneficial to most of them? However, our responsibility to the next generation cannot be sacrificed on the altar of expediency. Expediency, however, is the way a system of dominance functions. Everyone has to be treated the same way. This ensures that control by the powers that be is possible. If everyone is made to pass through the same doors, then supervision of the majority by a minority is possible.

    Of course, this is done, deceptively in my opinion, under the banner of equality. However, equality is not equity. Equality denies the idiosyncrasies and circumstances of each person while equity recognizes them. A unilateral decision by the powers that be that a child who wants to draw must learn his numbers or that another who wants to play with numbers must make her drawings is not equity. It is actually not even equality. It is oppression. And for too long we the masses have allowed governments, prestigious universities, large companies, and publishing houses to place us in situations of oppression whose quintessential element during our formative years is the examination.

  • Reading Recommendations

    For today’s post, I wish to briefly two books that delve into mathematical themes. So here goes.

    Book Review: A Certain Ambiguity by Gaurav Suri and Hartosh Singh Bal

    Gaurav Suri and Hartosh Singh Bal. A Certain Ambiguity: A Mathematical Novel. (New Delhi: Penguin, 2007). Available on Amazon.

    Quite out of the blue, a colleague of mine gifted me a book toward the end of the 2018-2019 academic year. It was an unexpected gift, but quite timely since I did not have any holiday reading planned. I guess my colleague chose the book because I am a Mathematics teacher. I wonder, though, how many people would be attracted to a book that purports to be ‘a Mathematical novel’!

    I was, however, quite eager to get started with the book. The protagonist of the novel is one Ravi Kapoor, who is a student at Stanford. His grandfather, Vijay Sahni, had been a Mathematician and had instilled in Ravi a love for Mathematics. When Ravi reaches Stanford, he enrolls in a Mathematics course called Thinking About Infinity taught by Professor Nico Aliprantis. During discussions related to the course, Ravi mentions his grandfather, whose name Nico recognizes. When Nico pulls out a paper written by Vijay, Ravi reads a note that indicates that Vijay had been in prison in New Jersey in the early twentieth century. That sets Ravi on a course to discovering why his grandfather had been imprisoned. His digging reveals that Vijay had been charged under an obscure blasphemy law in New Jersey.

    The novel is like a braid, with three strands running through it. One strand, of course, is that related to the discussion in the course taught by Nico. The second consists of fictionalized memoirs of various Mathematicians ranging from Euclid to Gauss. The third consists of conversations between the grandfather, Vijay, and Judge John Taylor, appointed to decide whether Vijay should go on trial or be set free. Since Vijay was charged with blasphemy, these conversations also touch on religious themes.  The three strands are woven intricately and play off each other extremely well.

    As the title suggests, the book narrates how Vijay and Ravi searched for certainty within Mathematics. I’ll just leave it at that without revealing the outcome of their search. That is for you to discover for I wholeheartedly recommend the book to anyone who either loves Mathematics or enjoys a well crafted tale.

    Book Review: In Pursuit of the Unknown by Ian Stewart

    Ian Stewart. In Pursuit of the Unknown: 17 Equations that Changed the World. (New York: Basic Books, 2012). Available on Amazon.

    I stumbled across this book when a friend mentioned it to me, albeit in quite a vague manner. Once I read the subtitle, of course, I was hooked. As someone who has loved Mathematics for as long as he can remember, I could not resist the temptation to read a book about how Mathematical equations have played a key role in giving us the world we live in today.

    As the subtitle suggests, Stewart describes the meaning and influence of seventeen Mathematical equations from all sorts of domains – from geometry to physics and from signal processing to economics. Some, such as the Pythagorean Theorem and E=mc2, are what one may call ‘usual suspects’ given that they have entered common discourse even if most people have no clue about what the equations mean or of how to use them. Others are more elusive, such as the Navier-Stokes equation and the Black-Scholes equation, the knowledge of which is restricted only to a handful of people in the know. Still others, such as the second law of thermodynamics and the definitions of the derivative and the imaginary number i, might cause high school students around the world to shudder.

    Stewart does a commendable job both of describing the meaning of each equation and of outlining its uses. While some knowledge of Mathematics is definitely helpful, Stewart has not targeted the book toward those with much knowledge of Mathematics. This makes the book far more accessible that it otherwise would have been.

    Stewart also does not shy away from the downsides of using Mathematics. He shows that, when Mathematics is used as a tool within other disciplines, the ethical considerations must be dealt with within those disciplines for Mathematics itself is unconcerned about such issues. Thus the power of Mathematics is itself a major drawback for, in allowing itself to be used by other disciplines, it subjects itself to the whims of those disciplines.

    The book, as a whole, is a remarkably good read and I could not put it down once I began. For those who benefit daily from the usefulness of Mathematics and who still wonder about why it is taught in schools, I would recommend this book as an eye-opener. Of course, I would recommend this book to anyone who has an inkling of curiosity coursing through their veins!