• Arithmetic Doodling
    Photo by Scott Webb on Pexels.com

    Some years ago, I came across a puzzle that asked if I could use exactly six ‘4’s and the four common operations (+, -, ×, and ÷) to give the result 24. Of course, the trivial solution is 4 + 4 + 4 + 4 + 4 + 4 = 24. But after a little bit of thought I came up with (4 + 4 + 4) ÷ 4 + 4 + 4 =24. Some more thought yielded (4 + 4 + 4) × (4 + 4) ÷ 4 = 24. And a little more thought yielded (4 + 4 ÷ 4 + 4 ÷ 4) × 4 = 24. And yet more thought yielded 4 × 4 + 4 + 4 + 4 – 4 = 24. And even 44 – 4 × 4 – 4 = 24.

    Of course, if we allow exponentiation and radicals, we can have even more such solutions. For example, (√4 + √4 + √4) × (4 + 4) ÷ √4 = 24 or 4 × 4 ÷ 4 × (4 + 4 ÷ √4) = 24 or (√4 + √4 + √4) × (4 + 4) ÷ 4 = 24.

    So, I wondered if I could get any whole number this way. So here goes

    • 4 – 4 + 4 – 4 + 4 – 4 = 0
    • (4 + 4 + 4) ÷ (4 + 4 + 4) = 1
    • (4 + 4 + 4 + 4) ÷ (4 + 4) = 2
    • 4 ÷ 4 + 4 ÷ 4 + 4 ÷ 4 = 3
    • (4 + 4) ÷ 4 + (4 + 4) ÷ 4 = 4
    • (4 + 4 × 4) × 4 ÷ 4 ÷ 4 = 5
    • 4 × 4 ÷ 4 + (4 + 4) ÷ 4 = 6
    • 4 + 4 – (4 + 4) ÷ (4 + 4) = 7
    • 4 + 4 + 4 – 4 + 4 – 4 = 8
    • (4 + 4 ÷ 4) × (4 + 4 ÷ 4) = 9
    • 4 + 4 + 4 ÷ 4 + 4 ÷ 4 = 10

    We can continue in this way. I encourage you to find other patterns for the above results and to proceed beyond 10.

    Photo by Katerina Holmes on Pexels.com

    However, I wondered what the largest number might be. If we are limited to the four operations then the largest number would be 444 × 444 = 197136. However, if we allow exponentiation, then the largest number is 4^(4^(4^(4^(4^4)))) = 4^(4^(4^(4^256))) = 4^(4^(4^13407807929942597099574024998205846127479365820592393377723561443721764030073546976801874298166903427690031858186486050853753882811946569946433649006084096)). That’s 155 digits in that last number and we still have 3 more exponentiation functions to deal with! In fact, the number is so large that no publicly available computation site that I know of was able to calculate this number.

    Playing with numbers in this way has absolutely no productive purpose. It is purely a recreational activity. However, it does familiarize a person with patterns that develop when we use the mathematical operations. With respect to the above 444 × 444 versus 4^(4^(4^(4^(4^4)))), we may ask, why 4^(4^(4^(4^(4^4)))) is so large, but 4 × 4 × 4 × 4 × 4 × 4 = 4096 is not as large as 444 × 444. Indeed, we can even check to determine that 444^444 = 274181867963021452236233098115415899720776205798649755793667915961236790274589588340150992768838522033181214516284285332773163064469482041901003908126050422955612888919193934414520264211619150516791746766169089527422112735455691316517219258451182646508224779320212392542987483229069236254216598092621103632551390087850968989089099464497128935323467029182130683400149701073650091196852136571107489847564227504319046561398518922741929687815408469723169188873887882230322166952158753390120554531578647460220302381245259634634469767947795755147705092032914527384068630183109187993236769944866212774292942582416757317197927837501391362584057134071021551533313131406893521078023657100146103388103428321685173807760191113376709657657395886097014206038723956927777772422903690643103118361188528532331570074306316944892941075080102793904295150057658633501250162034388403842338437258226367135375352848005596082763545270786578951069417785335762883852191304642416990955392728403464665816357313934692154694224918478621155151771588457206018359592714202916841031166217552839560214893121104836629359007910041332262083007051081312403504450964912888379989399053705177940430208243717095550877696. This is an extremely large number with 1176 digits. However, it is clearly dwarfed by 4^(4^(4^(4^(4^4)))), which the computational engine could not even dare to calculate! It could only tell me that the result would contain over 8 × 10153 digits, which it likely calculated by repeatedly resorting to logarithms. Just to put that into perspective, current estimates of the universe say that there are about 1082 atoms in the universe.

    This gives us some insight into the power of exponentiation (pun intended!). Of course, financiers have know this, when they concocted the idea of compound interest. Indeed, if we compare simple interest of 10% p.a. with compound interest of 10% p.a. compounded annually, we obtain the following for an initial investment of Rs. 1000.

    As we can see, the difference is barely noticeable in the first few years. After 5 years, the difference is about 7.3%. However, after 10 years, it is almost 30%, while after 30 years it is a whopping 336%!

    It is this aspect of the exponential functions that makes 4^(4^(4^(4^(4^4)))) so much larger than 444^444. For large enough values of the exponent, as long as the base is greater than 1, the function will simply balloon out of control. This can be seen in the graphs below, where the graph in red is y = 1.01x, the one in blue y = 1.05x, and the one in green y = 1.1x. While the y values of the green graph will always be much larger than the values of the red graph for the same value of x, the values of all three functions explode for large values of x.

    We began this journey by playing with a set of six ‘4’s and seeing how the four operations can be used to calculate various numbers. I cannot tell you how many times just doodling like this has given me mathematical insights. So let me leave you with some doodling tasks.

    • Can you do the same kind of doodling with six ‘3’s? What about six ‘7’s?
    • How about doodling with the numbers 1, 2, 3, 4, 5, and 6? What’s the largest number you can form with these six numbers?
    • Suppose you limit yourself only to subtraction and division. What’s the smallest number you can calculate using 1, 2, 3, 4, 5, and 6?

  • Cooking the Books

    Abjuring Serialization

    Last week we finished a series on complex numbers. During this series, we took two pit stops to learn about trigonometry and calculus. Quite obviously, I have not exhausted any of these three topics. However, planning these long series is exhausting! So, for a few posts, I will refrain from serialization. Instead, I will focus on some one-off posts. I will also tone down the level of the posts to make them a little more accessible since the series on complex numbers did prove to be quite heavy.

    Now, in the minds of many, mathematics is taken to be the quintessentially objective field. After all, it is governed by logic and uses rigorous methods of proof. While this is true for the most part, there are times when some arbitrariness is permitted into the ‘rules’ of mathematics because the arbitrariness serves a greater goal. I’d like to call these instances of ‘cooking the books’ and here I’d like to discuss two such instances.

    In Its Prime

    I don’t know when I was introduced to the prime numbers. However, I do remember wondering why they were given this name. I mean, ‘prime’ sounds like these numbers are somewhat better or more important than numbers that are not ‘prime’. Despite this initial distraction, I found myself fascinated with these numbers. A prime number is defined as a natural number which is divisible only by 1 and itself. Hence, 2 is a prime number. So are 3, 5, 7, and so on. However, 4, 6, 8, and 9 are not because they have additional divisors other than 1 and themselves.

    Plot of the prime number counting function (Source: ResearchGate)

    Most students learn this kind of definition early on. However, I have discovered of late an alarming trend. Many students reach high school not knowing whether 1 is a prime number or not nor any reason for their answer.

    Now, if you scour the internet, you will find many sites that explicitly state that prime numbers have to be greater than 1. A few state that a prime number has to have two distinct divisors, thereby excluding 1 from the set of primes. In other words, the consensus is that 1 is not a prime.

    But why not? What would be the disastrous consequences of allowing 1 to be a prime? One of the main theorems of arithmetic, betrayed by its name is the Fundamental Theorem of Arithmetic (FTA), which states that every natural number greater than 1 is either prime or can be expressed uniquely as the product of prime numbers, apart from the order of the primes. Note the critical word ‘uniquely’ and the phrase about the order. So, for example, 12 = 2 × 2 × 3 = 2 × 3 × 2 = 3 × 2 × 2. Note that, since multiplication is commutative, that is the order of the products does not affect the product, the three products represent the same set of two ‘2’s and one ‘3’.

    However, note that the FTA itself excludes 1 by starting with natural numbers greater than 1. What gives? Actually, this exclusion gives us a hint of why 1 is excluded from the set of primes.

    Suppose 1 is included in the set of primes. Now, 1 is the multiplicative identity element. (Math jargon alert!) What that means is that, when you multiply any number by 1, the product is the number you started with. So we have 3 × 1 = 3, 7 × 1 = 7, and so on. But what this obviously means is that 1 × 1 = 1. And here we see the beginnings of the problem.

    If 1 = 1 × 1, then 1 = 1 × 1 × 1 and 1 = 1 × 1 × 1 × 1, and so on. Indeed, one of the few things most students remember is that 1n = 1 regardless of what n is. This obviously means that no number can be uniquely expressed as the product of primes. After all, we have 12 = 2 × 2 × 3 = 1 × 2 × 2 × 3 = 1 × 1 × 2 × 2 × 3 and so on. Indeed, there would not just be no uniqueness to the product, there would actually be an infinitely many products that could be written for any given number.

    While having an infinitely many products for a given number might not seem too troubling, it’s the reverse process that raises problems. If any number can be written as the product of any number of numbers, then how do we determine the number of factors that a number has? For example, if 12 = 2 × 2 × 3, we can count and determine that there are 6 factors, inclusive of 1 and 12. However, if 12 = 1 × 2 × 2 × 3, the number of factors remains 6 but only an explicit enumeration of the factors can give us this result. What do I mean?

    Consider 12 = 2 × 2 × 3. Any factor of 12 must necessarily be the product of powers of 2 from 20 to 22 and powers of 3 from 30 to 31. For example, 1=20 × 30, 2 = 21 × 30, 3 = 20 × 31, 4 = 22 × 30, 6 = 21 × 31, and 12 = 22 × 31. Indeed, if a number N can be written as

    where

    are prime numbers and

    then any factor of N can have as its own factors

    That is, there are ak + 1 possibilities for the prime number pk.

    This is why, for 12 = 2 × 2 × 3 = 22 × 31, the number of factors is (2 + 1) × (1 + 1) = 6.

    However, if we allow 1 to be a prime number, how many factors does an arbitrary number have? We wouldn’t be able to count because one person may decide that 1 only appears as 10, while another may say it appears as 123. The first person would determine that 12 has 6 factors, which is correct, while the second would determine that it has 144 factors, which is obviously incorrect.

    Precisely because 1 it the multiplicative identity element, it cannot serve the purpose of determining factors because dividing any number of times by 1 does not change the number. Because of this, including 1 as a prime would not just be problematic, but would actually undermine many things we hope to achieve using the theorems of arithmetic. Hence, 1 is somewhat arbitrarily defined not to be a prime number.

    A Matter of Choice

    Pascal’s triangle (Source: SciencePhotoLibrary)

    The second example of ‘cooking the books’ arises in the field of combinatorics, dealing with permutations and combinations. One of the most common short forms used in this field is that of the factorial, denoted with the ‘!’ symbol. Here, n! is defined as

    That is, the factorial of n is defined as the product of all the natural numbers from 1 to n. Hence, 3! = 1 × 2 × 3 = 6, 4! = 1 × 2 × 3 × 4 = 24, 5! = 1 × 2 × 3 × 4 × 5 = 120, etc. Now, given n distinct objects, the number of ways of choosing r of them is given by nCr, which is defined as

    Hence, we can calculate as follows

    an so on. Let us make the above a little more specific, observing the, when we make a choice, the order in which we choose items is irrelevant. Hence, from the set {A, B, C, D, E}, selecting A and then B is the same as selecting B and then A. So, the 10 different selections we can make from the five distinct items A, B, C, D, and E are AB, AC, AD, AE, BC, BD, BE, CD, CE, and DE.

    In much the same way, given the seven items A, B, C, D, E, F, and G, the 4 item selections will be ABCD, ABCE, ABCF, ABCG, ABDE, ABDF, ABDG, ABEF, ABEG, ABFG, ACDE, ACDF, ACDG, ACEF, ACEG, ACFG, ADEF, ADEG, ADFG, AEFG, BCDE, BCDF, BCDG, BCEF, BCEG, BCFG, BDEF, BDEG, BDFG, BEFG, CDEF, CDEG, CDFG, CEFG, and DEFG. Hence, as calculated above, there are 35 ways of making 4 item selections from 7 distinct items.

    Now, we come to the problem. How many ways are there of selecting 5 out of 5 or 7 out of 7 items? Obviously, the answer is 1. Given the items A, B, C, D, and E, there is only 1 selection, i.e. ABCDE, that contains all 5 items. Similarly, given the items A, B, C, D, E, F, and G, there is only 1 selection, i.e. ABCDEFG, that includes all 7 items. However, what does the formula above give us? Plugging in 5 and 7 into the formula we get

    In general, we obtain nCn as

    We get the same when we want to select none of the n items because

    However, the number of ways of selecting no items out of n items is obviously 1, namely the empty set, denoting that none of the items were selected. Hence, we can conclude that

    However, the equation

    can only be true if 0! = 1. However, this does not align with the definition of the factorial. However, to ensure that nCn and nC0 are meaningful, which they are, and can be meaningfully calculated, we quite arbitrarily define 0! to be equal to 1.

    Fits and Starts

    What we have seen are two instances in which something that defies a definition is force-fit with a new definition because there are many gains to be had by doing so. Mathematics, while objective for the most part, permits some arbitrariness when the benefits outweigh the losses. Actually, we have seen this earlier in The Quantifiers Strike Back, where we saw that, in order to ensure closure of some operation, we were required to expand the number system. This is something that mathematicians have done for ages. However, every new arbitrary inclusion is often met with opposition. And rightly so. We do not want just about any arbitrary idea to be added on to our edifice of mathematical knowledge. However, through debate and discussion, often including some quite harsh ad hominem attacks, mathematicians slowly get convinced that some ideas are better than excluded. And so, in fits and starts, mathematics, like any other branch of human knowledge, advances.

  • Forms of Complex Numbers

    Photo by Tomas Anunziata on Pexels.com

    In the previous post, The Tripartite Fugue, we finally obtained the exponential form for complex numbers. As mentioned in the previous post, we will conclude our series on complex numbers today. Here I wish to explore a couple of aspects of complex numbers that become evident from the exponential form. I also wish to relate the rotational aspect of complex numbers to the idea of negation that we saw in A New Kind of Number?.

    Let us begin our final exploration by reminding ourselves of the three equivalent forms for complex numbers we have obtained. So far we have

    In the polar and exponential forms we recognize that, due to the periodicity of the trigonometric functions and fact that θ represents a rotation, there are infinitely many values of θ that, when used in the polar or exponential forms, could give us the same complex number.

    Addition of Complex Numbers

    Now, let us consider two complex numbers

    In polar form these would be

    If we attempt to add the two complex numbers we would obtain

    Special Case 1: r1 = r2 = r

    Now, there is very little we can do to simplify the above. However, if we have the case where both complex numbers have the same modulus r, we can get

    Here, we can recognize that the term in blue is a pure rotation, which must mean that the term in term gives us the modulus of the resultant complex number. Now, if we step back and think about what this represents, we can see that the two complex numbers z1 and z2 form adjacent sides of a rhombus, one inclined at θ1 with the real axis and the other at θ2. Quite obviously, the addition of the two gives us the complex number represented by the diagonal, which must bisect the angle between the two sides, meaning that it must be inclined at an angle of (θ1 + θ2)/2 with the real axis. Further, it is easy to show that this diagonal must have a length equal to the red term, where the angle (θ1θ2)/2 is that between one side of the rhombus and the diagonal.

    Addition of complex numbers to give a rhombus (Source: Wikipedia)

    Multiplication of Complex Numbers

    Now, suppose we were attempting to multiply the original complex numbers. In exponential form they were

    In polar form they are

    Using the polar form, we can obtain

    Once again, we can see that the term in blue represents a pure rotation, meaning that the term in red must be the modulus. Hence, when we multiply two complex numbers, their moduli get multiplied while their arguments get added.

    This feature is much more easily recognized when we consider the exponential form since

    Hence, if we consider z1 to be our original complex number, then multiplying it by z2 represents a further rotation by θ2 and an enlargement by a factor of r2.

    Special Case 2: r1 = r2 = 1

    Now, if we consider the case where the modulus of both complex numbers is 1, then we have a situation of pure rotation since any power of 1 is 1. So, there is no enlargement anymore, only pure rotation. Now,

    Hence,

    As we can see, since the moduli of both complex numbers is 1, the product represents only a rotation.

    Special Case 3: r1 = r2 = 1, θ2 = π

    Suppose now that the second argument (i.e. θ2) is equal to π. Then,

    Per our understanding of rotation, this represents rotating z1 by π radians in the counterclockwise direction. However, consider

    In other words, rotating a complex number by π radians in the counterclockwise direction, which is what e represents, is the same as negating a complex number. We saw the same thing when we considered real numbers in A New Kind of Number?. That is, we said that negating a real number can be thought of as a rotation by π radians. Of course, in that post, we hadn’t yet introduced the radian measure. Hence, we had said that negation is the same as a rotation by 180°.

    We can solidify this understanding of negation by considering e using the polar form. Hence, we have

    We saw this identity in The Eye of the Beholder, one of the first posts in this blog. In a different form, namely

    this is known as Euler’s identity.

    Special Case 4: r1 = r2 = 1, θ1 = 0, θ2 = π/2

    But if a rotation by π radians is equivalent to negation, what would a rotation by half of π radians be equivalent to? Here, I am intentionally choosing z1 to be a purely real number so that we can see the effect the rotation has. Now,

    What this tells us is that the imaginary unit i represents a rotation in the counterclockwise direction by π/2 radians. We also know that i2 = -1, which would correspond to two rotations of π/2 radians in the counterclockwise direction, which is equivalent to one rotation of π radians in the counterclockwise direction as we have just seen. Moreover, i3 = –i, which would correspond to three rotations of π/2 radians in the counterclockwise direction (i3), which is equivalent to one rotations of π/2 radians in the clockwise direction (-i).

    Source: Wikimedia

    Significance of Rotation by π/2 radians

    Of course, right away we should be able to see some sort of symmetric beauty in this. Each rotation through π/2 radians in the counterclockwise direction represents moving from one term to the next in the cyclic pattern 1, i, -1, –i, 1, i, -1, –i, … It also communicates that a rotation through π/2 radians in the counterclockwise direction represents a shifting of the weights (i.e. relative magnitudes) of the real and complex parts of a complex number. What do I mean? Consider

    This is equivalent to

    Now, when we multiply the two we get

    Geogebra Activity

    So we can see that, apart from any changes in ‘sign’, positive to negative or vice versa, the sine and cosine interchange positions. In other words, when we rotate by π/2 radians, the real and imaginary parts of a complex number exchange places. Much in the same way as multiplication by -1 changes the sign of a real number by negating it, rotation by π/2 radians involves moving what is real to become imaginary and what is imaginary to become real.

    A Pedagogical Note

    If you recall (and you probably don’t), in Primer to Complex Numbers, I had stated that it is unfortunate that we have used the terminology ‘complex numbers’. In A New Kind of Number? I stated that using the term ‘imaginary numbers’ is also unfortunate. This latter term is particularly unfortunate. Indeed, I have come across colleagues, that is mathematics teachers, who tell their students that the ‘imaginary numbers’ do not exist. Presumably, they mean that these numbers are like ‘imaginary friends’! However, these numbers are not figments of our imagination.

    However, consider the Schrödinger equation shown below

    It is clear that the left side of the equation has the imaginary unit i. However, this equation represents a description of the evolution of the wave function of a non-relativistic quantum-mechanical system. In other words, the equation describes a very real physical phenomenon. Could this be possible if one side of the equation represents something that does not exist? In a similar way, Fourier analysis, which is used in signal processing, depends on the use of complex numbers. Finally, the Euler-Lagrange equations, a system of second-order differential equations, are the basis of Lagrangian mechanics, a way of describing motion using energy rather than forces, which are used in Newtonian mechanics.

    I cite the above applications in physics precisely to show that complex numbers are essential to our ability to describe real world phenomena, which would be quite unthinkable if the ‘imaginary numbers’ did not exist. It is unfortunate that René Descartes’ prejudices against such numbers continues to misinform many teachers and, consequently, their students concerning these numbers.

    Hence, when I teach this topic to my students, I prefer to all these numbers ‘non-real’ numbers with the prior definition, which I would normally give them, that “a real number is a number whose square is non-negative,” thereby leaving open the possibility that there could be numbers whose squares are negative.

    Turning the Corner

    And with that we wrap up our exploration of complex numbers. We have been at it since late January with a one week break for Good Friday and a three week hiatus for my grading assignment. During this series, we took a pit stop for trigonometry and another for calculus. Today, we saw how all of this relates back to a seminal idea with which I began this blog. It has been a whirlwind for me and I am sure you feel the same. So, in the next post, we will move in a different direction. Till then, keep it complex.

  • The Tripartite Fugue

    Milestones on the Journey

    Photo by Laura Penwell on Pexels.com

    After a three week break, most of which was spent grading IB papers, I am back. We continue with our series on complex numbers. By the time we last dealt with this, in Deriving Derivatives – Part 2, we had obtained the derivatives for a few functions as follows:

    In the post that preceded it, that is Deriving Derivatives – Part 1, we also were able to obtain the following results:

    Of course, in Anticipating the Exponential Whirligig, we revisited a crucial result we had obtained during our series on e, namely that

    During the series on e, the post Infinitely Expressed introduced us to the idea of infinite series. We explored the idea of infinite series in the post Serially Expressed, during which we derived the above result for the derivative of ex. Let us now proceed with using what we know so far about infinite series to obtain similar expressions for sin x and cos x.

    Infinite Series for sin x and cos x

    Photo by Teona Swift on Pexels.com

    We begin by assuming that

    We also assume for now that this series converges. Now, we can differentiate the above equation repeatedly to obtain

    Note that, in the last two equations above, the superscripted (4) and (5), with the numerals 4 and 5 inside parentheses, indicates the 4th and 5th derivatives of the function respectively, in this case, sin x.

    Now, we can substitute x = 0 in all the equations for sinx and its derivatives to obtain the following:

    Now, we know that sin 0 = 0 and cos 0 = 1. This means that the above set of equations reduces to

    Now, we know that

    This means that the above set of equations can be written as

    We can see that the constants with even subscripts are all zero. The constants with odd subscripts are the reciprocals of the factorial of the subscript with alternating positive and negative signs. Substituting this in the infinite series expression for sin x we get

    In a similar way we can obtain

    Assembling the Fugue

    So, as of now, we have the following three infinite series

    What we can observe from the above is that the series for ex has terms with every power of x and all the coefficients are positive. The series for sin x has terms with odd powers of x and the coefficients alternate between positive and negative. Likewise, the series for cos x has terms with even powers of x and the coefficients alternate between positive and negative.

    Photo by Tayssir Kadamany on Pexels.com

    Now, recall that

    Now, in the series expression for ex let us replace x with . This will give us

    This is the exponential form of complex numbers that we have been pursuing for many weeks now. It links the exponential function to the sine and cosine functions. We can easily see that the modulus of this complex number is 1 while its argument is θ. In case you have forgotten what the modulus and argument of a complex number are, we had introduced the former in Modulating an Invariant Metric and the latter in Pole Vaunting.

    Hence, a complex number z = x + iy can be written in polar and exponential form as follows:

    where

    Here, r is the modulus of z and θ is the argument. Of course, care must be taken when determining the argument of z. When y/x is positive, it could be because both are positive, in which case the angle is in the first quadrant and atan(y/x) will give the correct value. On the other hand, both could be negative, in which case the angle is in the third quadrant, meaning that π radians or 180° need to be added to or subtracted from atan(y/x). In much the same way, when y/x is negative, it could be because x is positive and y negative, in which case the angle is in the fourth quadrant and atan(y/x) will give the correct value. However, it could be because x is negative and y positive, in which case the angle is in the second quadrant, meaning that π radians or 180° need to be added to or subtracted from atan(y/x).

    The Journey Ahead

    Now, the exponential form of complex numbers is a powerful form. However, me simply claiming this is so cannot suffice. So, in the next post, which will be the final one in this series, I will consider the exponential form in greater detail. We will also consider how it showcases the idea of rotation, linking it to the idea of negation that we first saw in A New Kind of Number.

  • It has come to that time of the year when I grade the IB papers. This requires a significant amount of my time and also focus. So I will not be posting for the next few weeks. The next post will only be on 20 June 2025. In the meantime, you can check my series on e or π. Or you could check the ongoing series on complex numbers, with its two pit stops on trigonometry and calculus. We will resume the series on complex numbers on 20 June 2025.

  • Serially Expressed

    Revisiting Convergence

    We are in the middle of the second pit stop on our journey of exploring complex numbers. We have reached a point where I can introduce us to some infinite series, as mentioned in the previous post. Of course, we have encountered infinite series when we studied e as well as when we studied π. An infinite series is simply the summation of all the terms of an infinite sequence.

    The graph of ex and the first 9 approximations from the infinite series

    In case the preceding sentence was confusing, let us recap a bit. A sequence is a succession of numbers generated according to a well defined rule. A sequence can have a finite number of terms or it could be an infinite sequence. A series is formed by adding the terms of a sequence. An infinite series can converge, diverge, or oscillate. A classic case of such a series is the geometric series. We considered the geometric series in Naturally Bounded?, the second post in the series on e. There, I only considered a single case where the common ratio r was 1/2. That series converged. In general, a geometric series will converge if the common ratio lies between -1 and +1. In all other cases, it will diverge. And if r is negative, the series will also oscillate, converging while oscillating if r is between -1 and 0 and diverging while oscillating if r is less than or equal to -1.

    Conditions for Convergence

    What this means is that, when we are faced with an infinite series, there will be conditions under which the series will converge. When these conditions are not met, the series will diverge, either oscillating or not oscillating.

    So suppose we have an infinite series

    We can define the partial sum of n +1 terms as

    Now, if

    we can say that the series converges and that it converges to the finite sum of S. In general, a necessary but not sufficient condition for a series to converge is that each term approaches zero. That is,

    However, this test is not sufficient to guarantee that the series converges. For that, we have to apply many other tests. However, this is not what I wish to focus on. For now, let us simply assume that the series we will introduce are convergent. Will will then be able to obtain some rather intriguing results.

    Infinite Series for ex

    So, suppose that

    is a convergent series. Now, from the previous post we know that the derivative of ex is ex. From the post that preceded it, we know that the derivative of xn is nxn – 1. So, if we differentiate the above equation, we will get

    We can differentiate this again to get

    We can continue in this manner ad infinitum. However, let us see what the value of all of these are when x = 0. When x = 0, ex = e0 = 1. Also, all terms with xk will become 0, leaving us with

    Rearranging each equation, we get

    Recognizing that n! = n(n – 1)(n – 2)···3·2·1, and defining 0! = 1, we can write the above as

    From this we can write the infinite series for ex as

    In the post Infinitely Expressed we looked at the speed with which a series converged. How does this series compare? The table below gives us the relevant information

    What we can see is that, as n increases, the error decreases by about one order of magnitude for each increased value of n. In the aforementioned post, we had seen that this series does not converge very quickly and we introduced other series that converge much more rapidly. However, here we are not concerned with the speed with which the series converges. Rather, the simplicity of the terms themselves provide a strange allure. From the image at the start of this post we can see how each additional term makes the resulting series more closely approximate the graph of y = ex.

    Infinite Series for sinx

    Suppose now we attempt to obtain a similar infinite series for sinx. So let us consider that

    When we repeatedly differentiate the equation we will obtain the following

    Now, if we put x = 0, we will have sinx = 0 and cosx = 1. Then we will obtain

    Rearranging these equations we get

    From this we can obtain

    Infinite Series for cosx

    In a similar way, for the cosx function, we can obtain

    Setting the Stage

    So far we have obtained infinite series expressions for ex, sinx, and cosx. This gets us to the point where we can use these series to obtain the exponential form of the complex numbers. We will deal with that in the next post. Hence, this pit stop too has come to an end. We have, quite obviously, only dealt with those concepts in calculus that directly play a role in determining the exponential form of the complex numbers. Once I have concluded with the series of posts on complex numbers, we will revisit calculus. But till then I hope that this short pit stop has been helpful in your reaching the realization that calculus is not something to fear.

  • Deriving Derivatives – Part 2

    Signing In

    In the previous post, Deriving Derivatives – Part 1, which is part of the second pit stop in our exploration of complex numbers, we had derived the derivatives of xn and sin x. I had informed you that we will derive the derivatives of the exponential and logarithmic functions. In case some of you are unclear what these two words mean, let me begin with a brief introduction to exponents and logarithms.

    Laws of Exponents

    Photo by Pixabay on Pexels.com

    When we write 3x, what we mean is x is added to itself 3 times. Hence, 3x = x + x + x. Hence, we are taught that multiplication is repeated addition. Of course, we can conceive of repeating multiplication itself. For example, there is nothing to stop us from evaluating 4 × 4 × 4 = 64. Of course, if there are a small number of identical numbers being multiplied by each other, we may not feel the need for a more compact notation. However, if we had to multiply 42 identical numbers by each other, it would be quite a tedious business to write this down. Hence, we express 4 × 4 × 4 as 43, where the number in the superscript tells us how many of the numbers in the regular script are multiplied by each other. In this notation, the 4 is said to be the ‘base’ while the 3 is called the ‘exponent’.

    Using the definition of the exponent, we can obtain the following laws of exponents.

    The Exponential Function

    Exponential functions (Source: Ontario Tech University)

    When we refer to an ‘exponential’ function, what we mean is a function in which the variable is in the exponent. Hence, if the variable under consideration is x, then 3x, 5x, πx, etc. are called exponential functions with bases being 3, 5, and π respectively. When dealing with real valued functions, that is functions that relate real numbers to other real numbers, we only allow the base to be positive. This is because the square root of a negative number is not a real number but an imaginary number.

    Now, we had earlier gone through a series on e. The function ex is considered to be the natural exponential function because it represents a 100% growth rate compounded over infinitely many infinitesimally small compounding intervals. For an explanation of this, see What’s Natural About e?, the post that launched the series on e. We will shortly look at some properties of ex related to limits and differentiation.

    Laws of Logarithms

    However, if we can relate one real number to another through exponents, it should be possible to go the other way. This gets us to what is known as a logarithm. Logarithms are defined as follows: If an = x, then n = logax. We read the previous statement as follows: If a to the power of n is equal to x, then n is equal to the logarithm of x to the base a. In other words, what we have been calling the ‘exponent’ is, when we go the other way, called the ‘logarithm’.

    Once again, using the definition of the logarithm, we can obtain the laws of logarithms.

    The Logarithmic Function

    Similar to the exponential function, we can define a logarithm function logax, which is the power to which a must be raised to produce x. And similar to the natural exponential function, we have a natural logarithm function where the base is e. Since logarithms were first used in the decimal system, the notation logx is often taken to imply base 10, while the notation lnx is often used to denote the logarithm to base e. We will also take a look at some of the properties of lnx shortly.

    Logarithmic functions (Source: LibreTexts Mathematics)

    The Derivative of ex

    Since we are dealing with calculus in this pit stop, let us attempt to obtain the derivative of ex. By the definition of the derivative

    Now, that final limit is something we haven’t seen before. Let us obtain it in a rigorous way. From our study of e, we know that 2 < e < 3. This means that e – 1 > 1 > 0. That is e – 1 is a positive quantity.

    Now, consider the expansion of (1 + p)2. Since (1 + p)2 = 1 + 2p + p2, it follows that, for positive values of p, (1 + p)2 > 1 + 2p. In a similar way we can show that (1 + p)3 > 1 + 3p. Continuing in this way, we can generalize that (1 + p)h > 1 + hp. Using this we can obtain

    Replacing h with –h, we get

    Note that, if h > 1, the final inequality would have to change sign. However, since h → 0, 1 – h will be positive and the above will be true. Combining the two inequalities we get

    From this we can obtain the following

    Now we can use the sandwich theorem as follows

    And since the left and right limits evaluate to 1, we obtain

    Hence, we can conclude that

    The Derivative of lnx

    Having obtained the derivative of ex, let us turn our attention to lnx. Rather than use the definition of the derivative, which we can certainly do as we did above, I will use a little bit of intuition. Suppose a car is traveling at 10 meters per second. I can also write this as 0.1 seconds per meter. What this means is that the rate at which one variable (say y) changes with respect to another (say x) is the reciprocal of the rate at which the second (i.e. x) changes with respect to the first (i.e. y). We can write this as

    Now suppose y = lnx. From this we can get

    Now, we have expressed x as a function of y. Differentiating this with respect to y we get

    Signing Off

    In this post, we derived the derivatives of ex and lnx. In the previous post we had obtained the derivatives of xn and sinx. In much the same way as we obtained the derivative of sinx, we can show that the derivative of cosx is -sinx. In the next post, we will look at infinite series and consider some conditions under which these series converge. That will place us in a position to use what we have uncovered so far in this pit stop to step back into the main series on complex numbers where we can return to the exponential form of complex numbers and some associated results.

  • Deriving Derivatives – Part 1

    Rewinding a Bit

    (Source: Cyberchalky)

    We began our exploration of calculus last week with The Sky is the Limit. This is the second pit stop in our exploration of complex numbers. The first pit stop dealt with trigonometry. In the previous post we had looked at the idea of the limit, which I said was a central concept in the study of calculus. In this post I wish to obtain a few standard results that we will be able to use in our exploration of complex numbers.

    In the previous post, we saw that the limiting value of the gradient of the secant through a fixed point on the graph of a function as another point approaches the fixed point is the gradient of the tangent at the fixed point. In the context of calculus, this is known as the derivative of the function at the point.

    Definition of the Derivative

    Suppose we have a function y = f(x). Then the derivative is denoted by

    Per the limit definition of the derivative, we can say that

    Note that the quantity inside the limit is simply the gradient of the secant at the point (x,f(x)). As h approaches 0 the gradient of the secant approaches the gradient of the tangent (i.e. the derivative) and the limiting value of the gradient of the secant is the gradient of the tangent.

    Algebra of Differentiation

    In what follows, u and v represent function of x.

    The Sum Rule

    The derivative of the sum of two functions is equal to the sum of the derivatives of the two functions. Symbolically, we state (u + v)’ = u‘ + v‘. This can be obtained as follows

    Difference Rule

    The derivative of the difference of two functions is equal to the difference of the derivatives of the two functions. Symbolically, we state (uv)’ = u‘ – v‘. This can be obtained as follows

    The Product Rule

    The derivative of the product of two functions is equal to the sum of the product of the first function and the derivative of the second and the second function and the derivative of the first. Symbolically, we state (uv)’ = uv‘ + vu‘. This can be obtained as follows

    The Quotient Rule

    The derivative of a quotient is equal to the difference between the products of the denominator and the derivative of the numerator and the numerator and the derivative of the denominator divided by the square of the denominator. Symbolically, we state (u ÷ v)’ = (vu‘ – uv’) ÷ v2. This can be obtained as follows

    Derivative of xn

    Having obtained the results for the algebra of differentiation, let us derive our first derivative. Let us consider the function y = xn. Per the definition of the derivative we have

    While I have only derived the result for positive integral values of n, the result can easily be obtained for all rational values of n. But what does this mean? Let’s take some concrete cases.

    Suppose f(x) = x3. Here we can see that n = 3. Hence the derivative will be given by f'(x) = 3x2. This means that, at the point where x = 2, the gradient of the tangent is f'(2) = 3(2)2 = 12. Similarly, for the function f(x) = x2/3, n = 2/3. Hence, the derivative will be given by f'(x) = (2/3)x-1/3.

    Derivative of sin x

    Now, we have seen that the polar form of complex numbers involves the trigonometric ratios of the argument of the complex numbers. This means that it is quite likely that the derivatives of the sine and cosine functions will be needed in our further exploration of complex numbers. So let us obtain the derivative of the sine function. Suppose y = sin x.

    Limit of sin x ÷ x as x 0

    Let us first obtain the limit of sin x ÷ x as x approaches 0.

    The Sandwich Theorem

    Here we need to invoke a theorem known as the Sandwich theorem. Here it is

    Let us return to our derivation of the limit and use the Sandwich theorem.

    We now have enough under our belts to obtain the derivative of sin x. Let’s proceed.

    In a similar way we can show that (cos x)’ = -sin x.

    Next Steps

    In the derivations above, I have not shown all the ‘steps’ but have skipped some that are, in my view, decipherable by students who are willing to put in some additional effort. If any of the steps above is unclear, please do reach out to me. At the same time, I realize that, while the above results are old hat for students who have studied calculus, in many cases, these students have not been shown how the results are obtained. Shockingly, I have had students come to me in the twelfth grade having been introduced to differentiation but having no idea how the results are obtained. Due to this, I think this post is already quite heavy. So I will draw it to a close. In the next post, we will consider the derivatives of the exponential and logarithmic functions. Till then, be responsible and don’t drink and derive!

  • The Sky is the Limit

    The Rationale

    We are in the middle of a series on complex numbers. In the previous post, Anticipating the Exponential Whirligig, we had come to a critical point, where I claimed that there was another way of representing complex numbers that is even more powerful than the polar form, which was itself more powerful than the Cartesian form. However, this form is derived using calculus. I could have simply thrown the form in your faces, hoping that you would take me at my word. And while you may be willing (I hope) to take me at my word, this would not have given you any insight into why the form exists nor why it works the way it does.

    In fact, at the end of the previous post, we had actually reached a somewhat counterintuitive idea, namely that, since the modulus (r) gives the ‘size’ of the complex number, the argument (θ), which represents the rotation, must be what is expressed in an exponential form. Since this is counterintuitive, it would not do for me to simply state a result and proceed. I need to show how the result is obtained. Hence, we are forced to take another pit stop, this one dealing with calculus.

    What is Calculus?

    Of course, if we ask, “What is calculus?”, different sources give us different definitions. For example, Britannica states, “calculus, branch of mathematics concerned with the calculation of instantaneous rates of change (differential calculus) and the summation of infinitely many small factors to determine some whole (integral calculus)” (Italics mine.) Wikipedia gives us, “Calculus is the mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizations of arithmetic operations.” (Italics mine.) The MIT course Calculus for Beginners states, “Calculus is the study of how things change. It provides a framework for modeling systems in which there is change, and a way to deduce the predictions of such models.” (Italics mine.) The MIT definition could be said to be only the first sentence. However, this definition does not allow us to recognize that there are two facets to calculus, something that I believe is essential to know right from the start.

    While these definitions are not wrong per se, I think they somewhat miss the point (Wikipedia), beat around the bush (Britannica), or give extraneous information that is confusing (MIT). For example, unless I know what it means to says that “geometry is the study of shape” or that “algebra the study of generalizations of arithmetic operations,” I will not have a cluse about how calculus is the study of continuous change! Similarly, if I have no idea why we would need to calculate instantaneous rates of change (or indeed what that means) or sum infinitely small factors (or indeed what that means), I have no idea what calculus deals with. Finally, why the provision of “a framework for modeling systems in which there is change” is included in the definition of what calculus is when it is something that calculus does just boggles me.

    I would give the following definition: “Calculus is the study of the rate and the development of change.” I think my definition is superior because it is sleek, using just 9 words as opposed to 18 (Britannica), 25 (Wikipedia), or MIT (27). Here I have counted only the predicate clauses given in italics since that is what forms the definition. Of course, the study of rates of change is known as differential calculus, while the study of the development of change is called integral calculus. In this pit stop, I will be dealing with differential calculus because that is what is immediately relevant to our understanding of complex numbers. I will deal with integral calculus as a later date. So let’s get started.

    Initial Intuition

    Suppose we have a straight horizontal road. A car on the road moves from traveling at 10 m/s to 30 m/s in 5 seconds. The previous statement only gives us the starting and ending states of the car on its 5 second ‘journey’. We know nothing concerning what it was doing (i.e. its speed/velocity) or where it was (i.e. its position/displacement) in the intervening period between times t = 0 seconds and t = 5 seconds. For example, the figure below shows four of the infinitely many possible variations of v (velocity) and t (time) that satisfy the conditions given earlier.

    (Source: Wolfram Alpha)

    While all four journey profiles have the same initial conditions (t = 0 s, v = 10 m/s) and final conditions (t = 5 s, v = 30 m/s) all of them differ at every other point from every other profile. Quite obviously, it will not suffice to simply think of initial and final conditions, not if we want to be able to give a thorough description of the profiles. For example, we could draw the tangents at t = 2 s to get

    Source: Wolfram Alpha)

    Of course, the ‘tangent’ to the first graph, which is linear, is the line itself. Hence, only three other distinct tangents are visible. What we can easily observe is that the gradient or slope of each line is distinct from the others. Since we have plotted speed/velocity on the vertical axis and time on the horizontal axis, the gradient would indicate the rate at which the velocity is changing with respect to time when t = 2 s. Of course, the question is, “How do we obtain these gradients?” This is where calculus comes in.

    Introducing the Limit

    To understand calculus, whether differential or integral, we need to grasp the concept of a ‘limit’. Of course, here we are dealing with the limit in the context of differential calculus. To get a feel for what this is before we begin a more formal treatment, click on the link in the caption on the figure below. This will take you to a Geogebra app. It contains the plot of the graph of a function in red. There is a tangent to this graph at the point A in blue. Another point, B, lies on the graph and the secant connecting A and B is drawn in green. Click and drag on the point B. This will allow you to move B along the graph. As you do this, the orientation of the secant (green line) will change. It will, in other words, rotate about the fixed point A.

    Click here to visit the Geogebra app

    Now, since the point A is common to both lines, the only difference between them is their orientations or gradients. You will observe that, as you move B closer to A, the secant will rotate about A with its orientation becoming closer to the orientation of the tangent. In fact, you can compare the gradients in the app itself. In the figure above, the gradient of the tangent is 15.33541, while that of the secant is 8.27131. You will note that there is a quantity h below the two gradients. In the figure above h = 1. This quantity represents the horizontal distance between points A and B. In other words, it is the difference in the x coordinates of the two points.

    The Gradient of the Secant

    Now, in the figure above, x is listed as the x coordinate of A. This means that the x coordinate of B will be x + h. Now, since the graph is that of the curve y = f(x), the corresponding y coordinates will be f(x) and f(x + h).

    Now the gradient of the line joining the points (x1,y1) and (x2,y2) is given by

    Hence, the gradient of the secant through A and B will be given by

    Of course, this only tells us what the gradient of the secant is. We are not yet at the point of determining the gradient of the tangent, which, if you recall, from the earlier exploration of the 5 second journey, will give the rate at which the velocity is changing with respect to time at any given instant.

    At this stage, we have no idea what the function is. Hence, things are quite abstract. In order to allow us to attach some numerical quantities to the above ‘formula’, let us consider the function f(x) = x2. I have intentionally chosen an easy function so we can get to the meat of the matter quickly. Let us also consider the x coordinate of A to be 2. Hence, the gradient of the secant will be given by

    We can either expand the parentheses in the numerator or factorize the numerator to simplify as shown below

    In the step above, the line in red is the approach with expansion, while the one in green is the approach with factorization. As the color scheme indicates, I prefer the factorization approach even though both give the same results.

    Revisiting the Limit

    What we have obtained so far is that, for the graph of y = x2, the gradient of the secant joining points A and B with x coordinates 2 and 2 + h respectively is 4 + h. Now, we saw that, when B approaches A along the curve, the secant rotates about point A with its orientation becoming closer to the orientation of the tangent. So let us see how the gradient of the secant varies as h varies. The table below shows the variation on both sides of x = 2.

    Note that there is no secant defined for x + h = 2, because that corresponds to h = 0, which would make the expression for the gradient indeterminate since the result would be 0 ÷ 0.

    Introducing Indeterminacy

    Here, a note on indeterminacy is in order since I have had teachers and colleagues who have used the word ‘undefined’ in cases when they should have used ‘indeterminate’. The word ‘undefined’ is normally reserved for cases involving division by zero since it is meaningless to divine by zero. However, this is only when we are attempting to divine a finite, non-zero quantity by zero. When we attempt to divide zero by zero, we face a quandary.

    You see, division of a non-zero quantity by zero is, as we have just seen, undefined. However, division of zero by a non-zero quantity results in zero. So in the expression 0 ÷ 0, which of the zeroes takes precedence? Since the whole expression for a function that yielded the problematic 0 ÷ 0 is important for the definition of the function it is impossible to determine which zero should take precedence. Hence, the expression is indeterminate.

    Similar indeterminate forms yield expressions like 0 × ∞, ∞ – ∞, ∞ ÷ ∞, 00, ∞0, and 1. In each of these, there are two parts and it is impossible to determine which part should be given precedence. Hence, these forms are indeterminate.

    The Meaning of the Limit

    Returning to the table, we can see that, as x + h gets closer and closer to 2, the gradient, m, gets closer and closer to 4. So we say that the limit of the gradient as h approaches 0 is 4. It is crucial to observe that this statement does not mean. It does not mean that the gradient of the tangent can be determine by putting h = 0 in the original function. Neither does it mean that the gradient of the secant becomes the gradient of the tangent when B coincides with A.

    What the statement about the limit declares is something about the behavior of some parameter, in this case the gradient of the secant, around a point where it is technically indeterminate. It says that, by making h arbitrarily close to 0, we can obtain a secant whose gradient is arbitrarily close to the gradient of the tangent. And since the gradient of the secant approaches 4, it must mean that the gradient of the tangent is 4.

    A Glimpse Ahead

    The limiting process is the cornerstone of both branches of calculus. We have actually already seen it in this blog in a more informal manner. In A Piece of the Pi, which was the first post in the series on π, we had seen that early attempts at approximating the value of π involved regular polygons with increasing number of sides. This was nothing but a limiting process since the area of the circle was always squeezed between the area of the inscribed polygon and that of the circumscribed polygon. As the number of sides increased, the areas of the two polygons approached each other and in the limiting case would have become the area of the circle.

    Today, we have seen a slightly more formal look at the limiting process. Of course, we only considered one special function, f(x) = x2, which was relatively easy to analyze. However, we need to be able to use the limiting process for functions of all sorts – algebraic, trigonometric, inverse trigonometric, exponential, and logarithmic. And we know that we are aiming for something in the exponential side of things for the exponential form of complex numbers. In the next post, therefore, we will continue our journey into calculus with a look at some algebraic and trigonometric results. Till then, the sky’s the limit.

  • Anticipating the Exponential Whirligig

    Resumption

    Photo by Frank Cone on Pexels.com

    Last week I had taken a break for Good Friday. For those who are interested, I used a random event for my sermon. If you’re interested, you can find it here. Anyway, with the break behind us, we resume the series on complex numbers. In the previous post of the series, Pole Vaunting, we had introduced the idea of the argument of a complex number. Before that, we had a six part pit stop, in which we learned some trigonometry, especially as it relates to complex numbers. Before the start of the pit stop, in Modulating an Invariant Metric, we had introduced the idea of the modulus of a complex number.

    Revisiting the Polar Form

    With the modulus and argument of a complex number, we specify the complex number in terms of its polar coordinates. With these polar coordinates, we are able to locate any complex number on the complex plane. The modulus tells us how far the point representing the complex number is from the origin and the argument tells us the angle made with the positive real axis by the line joining the origin to the point representing the complex number. In particular, we have seen that the complex number z = (r, θ) in polar coordinates is the same as the complex number z = (a, b) in Cartesian coordinates, where

    That Old Friend Again

    If you have been following the blog, you may vaguely recall that I had done a short series on the number e. In Naturally Bounded?, I had shown that

    Photo by Levi Damasceno on Pexels.com

    Then, in Return of an Old Friend, we saw that this could be extended to

    However, we didn’t really prove the above result. And that is because we had not formally dealt with calculus. In fact, till today, we have not done that in the blog. However, I realize that, in order to do a sufficiently robust job in our journey ahead with complex numbers, I will need to at the very least dip my toes into the ocean of calculus. Hence, I will start another pit stop next week with a brief journey through calculus. But for now, I wish to explore some ideas we uncovered.

    Rotational Rumination

    In the previous post we saw that multiplying two complex numbers resulted in the multiplication of their moduli and the adding of their arguments. That is, if z1 = (r1, θ1) and z2 = (r2, θ2) then z1z2 = (r1r2, θ1+θ2). Now, let’s ask ourselves under what conditions does multiplication of a ‘two-part’ number result in the multiplication of one part and an addition of the second part?

    Photo by Francesco Ungaro on Pexels.com

    The term ‘two-part’ number, of course, is poorly defined. Someone could think of a rational number as having two parts – the integral part and the fractional part. For example,

    However, when we multiply two rational numbers, both parts get multiplied. This can be seen by both approaches shown below.

    What this tells us is that the parts of complex numbers in polar form (i.e. the modulus and argument) do not function like the integral and fractional parts of rational numbers. What other ‘two-part’ numbers can we think of?

    We can consider numbers written in scientific notation. For example, 2.3 × 103 has a mantissa or significand of 2.3 and an exponent or power of 3. Now, we can consider

    What we can see is that the mantissas were multiplied while the exponents were added. This is similar to our saying that, when two complex numbers are multiplied, their moduli are multiplied while their arguments are added. Hence, it sees that there should be a way of expressing complex numbers in a ‘two part’ form where one part is a regular real number, similar to the mantissa, while the other part is written in exponential form, similar to the exponent.

    However, we observe that, when we write z1z2 = (r1r2, θ1+θ2), the modulus of the product complex number z1z2 is given by the first part r1r2. However, we have seen that a complex number, represented in polar form, has a ‘size’ (i.e. its modulus) and an ‘orientation’ (i.e. its argument). Since in the product z1z2, r1r2 already expresses the modulus of the product in a manner similar to the mantissa, it must follow that θ1+θ2, which represents the argument of the product, corresponds to the exponent of the ‘two part’ number.

    What this means is that there must be a way of representing complex numbers with two parts, similar to what we use in scientific notation, with the part corresponding to the mantissa representing the modulus and the part corresponding to the exponent representing the argument of the complex number. Since the modulus is a simple number, it must follow that it corresponds to the r of the polar form (r, θ). Hence, there must we a way of expressing the argument θ as the second, exponential part of the ‘two part’ number.

    This is somewhat counterintuitive since, in conventional scientific notation, the exponential part conveys the ‘size’ of the number in terms of consecutive powers of 10 between which the number lies. That is, when we write 2.3 × 103, we know this number lies between103 and 104. However, 2.3 × 1032 is a number that lies between1032 and 1033. So, the exponential part gives the ‘size’ of the overall number, while the mantissa only locates the number between the two consecutive powers of 10.

    However, now we are saying that there is an exponential form for complex numbers in which the modulus (i.e. size) is represented by the part similar to the mantissa while the argument (i.e. orientation) is represented by the part similar to the exponent. In other words, we are looking for some kind of exponential form involving θ that represents pure rotation.

    Spiraling Away

    Of course, since I went on a short detour earlier with e, you might be thinking that e has some role to play in this exponential form that represents pure rotation. You would be right! Of course, if you have studied complex numbers before, this will not be news to you. However, I’ll bet the reasoning toward this realization is somewhat novel. Anyway, the exponential form of complex numbers is an extremely powerful form. I could, of course, simply tell you what it is without any foundation for understanding how it is obtained. But that would be to short-change you – something I’d rather not do. Hence, as mentioned earlier, I will begin a second pit stop in the next post with some introductory explorations of calculus.