In the previous post, Deriving Derivatives – Part 1, which is part of the second pit stop in our exploration of complex numbers, we had derived the derivatives of xn and sin x. I had informed you that we will derive the derivatives of the exponential and logarithmic functions. In case some of you are unclear what these two words mean, let me begin with a brief introduction to exponents and logarithms.
When we write 3x, what we mean is x is added to itself 3 times. Hence, 3x = x + x + x. Hence, we are taught that multiplication is repeated addition. Of course, we can conceive of repeating multiplication itself. For example, there is nothing to stop us from evaluating 4 × 4 × 4 = 64. Of course, if there are a small number of identical numbers being multiplied by each other, we may not feel the need for a more compact notation. However, if we had to multiply 42 identical numbers by each other, it would be quite a tedious business to write this down. Hence, we express 4 × 4 × 4 as 43, where the number in the superscript tells us how many of the numbers in the regular script are multiplied by each other. In this notation, the 4 is said to be the ‘base’ while the 3 is called the ‘exponent’.
Using the definition of the exponent, we can obtain the following laws of exponents.
When we refer to an ‘exponential’ function, what we mean is a function in which the variable is in the exponent. Hence, if the variable under consideration is x, then 3x, 5x, πx, etc. are called exponential functions with bases being 3, 5, and π respectively. When dealing with real valued functions, that is functions that relate real numbers to other real numbers, we only allow the base to be positive. This is because the square root of a negative number is not a real number but an imaginary number.
Now, we had earlier gone through a series on e. The function ex is considered to be the natural exponential function because it represents a 100% growth rate compounded over infinitely many infinitesimally small compounding intervals. For an explanation of this, see What’s Natural About e?, the post that launched the series on e. We will shortly look at some properties of ex related to limits and differentiation.
Laws of Logarithms
However, if we can relate one real number to another through exponents, it should be possible to go the other way. This gets us to what is known as a logarithm. Logarithms are defined as follows: If an = x, then n = logax. We read the previous statement as follows: If a to the power of n is equal to x, then n is equal to the logarithm of x to the base a. In other words, what we have been calling the ‘exponent’ is, when we go the other way, called the ‘logarithm’.
Once again, using the definition of the logarithm, we can obtain the laws of logarithms.
The Logarithmic Function
Similar to the exponential function, we can define a logarithm function logax, which is the power to which a must be raised to produce x. And similar to the natural exponential function, we have a natural logarithm function where the base is e. Since logarithms were first used in the decimal system, the notation logx is often taken to imply base 10, while the notation lnx is often used to denote the logarithm to base e. We will also take a look at some of the properties of lnx shortly.
Since we are dealing with calculus in this pit stop, let us attempt to obtain the derivative of ex. By the definition of the derivative
Now, that final limit is something we haven’t seen before. Let us obtain it in a rigorous way. From our study of e, we know that 2 < e < 3. This means that e – 1 > 1 > 0. That is e – 1 is a positive quantity.
Now, consider the expansion of (1 + p)2. Since (1 + p)2 = 1 + 2p + p2, it follows that, for positive values of p, (1 + p)2 > 1 + 2p. In a similar way we can show that (1 + p)3 > 1 + 3p. Continuing in this way, we can generalize that (1 + p)h > 1 + hp. Using this we can obtain
Replacing h with –h, we get
Note that, if h > 1, the final inequality would have to change sign. However, since h → 0, 1 – h will be positive and the above will be true. Combining the two inequalities we get
From this we can obtain the following
Now we can use the sandwich theorem as follows
And since the left and right limits evaluate to 1, we obtain
Hence, we can conclude that
The Derivative of lnx
Having obtained the derivative of ex, let us turn our attention to lnx. Rather than use the definition of the derivative, which we can certainly do as we did above, I will use a little bit of intuition. Suppose a car is traveling at 10 meters per second. I can also write this as 0.1 seconds per meter. What this means is that the rate at which one variable (say y) changes with respect to another (say x) is the reciprocal of the rate at which the second (i.e. x) changes with respect to the first (i.e. y). We can write this as
Now suppose y = lnx. From this we can get
Now, we have expressed x as a function of y. Differentiating this with respect to y we get
Signing Off
In this post, we derived the derivatives of ex and lnx. In the previous post we had obtained the derivatives of xn and sinx. In much the same way as we obtained the derivative of sinx, we can show that the derivative of cosx is -sinx. In the next post, we will look at infinite series and consider some conditions under which these series converge. That will place us in a position to use what we have uncovered so far in this pit stop to step back into the main series on complex numbers where we can return to the exponential form of complex numbers and some associated results.
We began our exploration of calculus last week with The Sky is the Limit. This is the second pit stop in our exploration of complex numbers. The first pit stop dealt with trigonometry. In the previous post we had looked at the idea of the limit, which I said was a central concept in the study of calculus. In this post I wish to obtain a few standard results that we will be able to use in our exploration of complex numbers.
In the previous post, we saw that the limiting value of the gradient of the secant through a fixed point on the graph of a function as another point approaches the fixed point is the gradient of the tangent at the fixed point. In the context of calculus, this is known as the derivative of the function at the point.
Definition of the Derivative
Suppose we have a function y = f(x). Then the derivative is denoted by
Per the limit definition of the derivative, we can say that
Note that the quantity inside the limit is simply the gradient of the secant at the point (x,f(x)). As h approaches 0 the gradient of the secant approaches the gradient of the tangent (i.e. the derivative) and the limiting value of the gradient of the secant is the gradient of the tangent.
Algebra of Differentiation
In what follows, u and v represent function of x.
The Sum Rule
The derivative of the sum of two functions is equal to the sum of the derivatives of the two functions. Symbolically, we state (u + v)’ = u‘ + v‘. This can be obtained as follows
Difference Rule
The derivative of the difference of two functions is equal to the difference of the derivatives of the two functions. Symbolically, we state (u – v)’ = u‘ – v‘. This can be obtained as follows
The Product Rule
The derivative of the product of two functions is equal to the sum of the product of the first function and the derivative of the second and the second function and the derivative of the first. Symbolically, we state (uv)’ = uv‘ + vu‘. This can be obtained as follows
The Quotient Rule
The derivative of a quotient is equal to the difference between the products of the denominator and the derivative of the numerator and the numerator and the derivative of the denominator divided by the square of the denominator. Symbolically, we state (u ÷ v)’ = (vu‘ – uv’) ÷ v2. This can be obtained as follows
Derivative of xn
Having obtained the results for the algebra of differentiation, let us derive our first derivative. Let us consider the function y = xn. Per the definition of the derivative we have
While I have only derived the result for positive integral values of n, the result can easily be obtained for all rational values of n. But what does this mean? Let’s take some concrete cases.
Suppose f(x) = x3. Here we can see that n = 3. Hence the derivative will be given by f'(x) = 3x2. This means that, at the point where x = 2, the gradient of the tangent is f'(2) = 3(2)2 = 12. Similarly, for the function f(x) = x2/3, n = 2/3. Hence, the derivative will be given by f'(x) = (2/3)x-1/3.
Derivative of sin x
Now, we have seen that the polar form of complex numbers involves the trigonometric ratios of the argument of the complex numbers. This means that it is quite likely that the derivatives of the sine and cosine functions will be needed in our further exploration of complex numbers. So let us obtain the derivative of the sine function. Suppose y = sin x.
Limit of sin x ÷ x as x→ 0
Let us first obtain the limit of sin x ÷ x as x approaches 0.
The Sandwich Theorem
Here we need to invoke a theorem known as the Sandwich theorem. Here it is
Let us return to our derivation of the limit and use the Sandwich theorem.
We now have enough under our belts to obtain the derivative of sin x. Let’s proceed.
In a similar way we can show that (cos x)’ = -sin x.
Next Steps
In the derivations above, I have not shown all the ‘steps’ but have skipped some that are, in my view, decipherable by students who are willing to put in some additional effort. If any of the steps above is unclear, please do reach out to me. At the same time, I realize that, while the above results are old hat for students who have studied calculus, in many cases, these students have not been shown how the results are obtained. Shockingly, I have had students come to me in the twelfth grade having been introduced to differentiation but having no idea how the results are obtained. Due to this, I think this post is already quite heavy. So I will draw it to a close. In the next post, we will consider the derivatives of the exponential and logarithmic functions. Till then, be responsible and don’t drink and derive!
We are in the middle of a series on complex numbers. In the previous post, Anticipating the Exponential Whirligig, we had come to a critical point, where I claimed that there was another way of representing complex numbers that is even more powerful than the polar form, which was itself more powerful than the Cartesian form. However, this form is derived using calculus. I could have simply thrown the form in your faces, hoping that you would take me at my word. And while you may be willing (I hope) to take me at my word, this would not have given you any insight into why the form exists nor why it works the way it does.
In fact, at the end of the previous post, we had actually reached a somewhat counterintuitive idea, namely that, since the modulus (r) gives the ‘size’ of the complex number, the argument (θ), which represents the rotation, must be what is expressed in an exponential form. Since this is counterintuitive, it would not do for me to simply state a result and proceed. I need to show how the result is obtained. Hence, we are forced to take another pit stop, this one dealing with calculus.
What is Calculus?
Of course, if we ask, “What is calculus?”, different sources give us different definitions. For example, Britannica states, “calculus, branch of mathematics concerned with the calculation of instantaneous rates of change (differential calculus) and the summation of infinitely many small factors to determine some whole (integral calculus)” (Italics mine.) Wikipedia gives us, “Calculus is the mathematical study of continuous change, in the same way that geometry is the study of shape, and algebra is the study of generalizations of arithmetic operations.” (Italics mine.) The MIT course Calculus for Beginnersstates, “Calculus is the study of how things change. It provides a framework for modeling systems in which there is change, and a way to deduce the predictions of such models.” (Italics mine.) The MIT definition could be said to be only the first sentence. However, this definition does not allow us to recognize that there are two facets to calculus, something that I believe is essential to know right from the start.
While these definitions are not wrong per se, I think they somewhat miss the point (Wikipedia), beat around the bush (Britannica), or give extraneous information that is confusing (MIT). For example, unless I know what it means to says that “geometry is the study of shape” or that “algebra the study of generalizations of arithmetic operations,” I will not have a cluse about how calculus is the study of continuous change! Similarly, if I have no idea why we would need to calculate instantaneous rates of change (or indeed what that means) or sum infinitely small factors (or indeed what that means), I have no idea what calculus deals with. Finally, why the provision of “a framework for modeling systems in which there is change” is included in the definition of what calculus is when it is something that calculus does just boggles me.
I would give the following definition: “Calculus is the study of the rate and the development of change.” I think my definition is superior because it is sleek, using just 9 words as opposed to 18 (Britannica), 25 (Wikipedia), or MIT (27). Here I have counted only the predicate clauses given in italics since that is what forms the definition. Of course, the study of rates of change is known as differential calculus, while the study of the development of change is called integral calculus. In this pit stop, I will be dealing with differential calculus because that is what is immediately relevant to our understanding of complex numbers. I will deal with integral calculus as a later date. So let’s get started.
Initial Intuition
Suppose we have a straight horizontal road. A car on the road moves from traveling at 10 m/s to 30 m/s in 5 seconds. The previous statement only gives us the starting and ending states of the car on its 5 second ‘journey’. We know nothing concerning what it was doing (i.e. its speed/velocity) or where it was (i.e. its position/displacement) in the intervening period between times t = 0 seconds and t = 5 seconds. For example, the figure below shows four of the infinitely many possible variations of v (velocity) and t (time) that satisfy the conditions given earlier.
While all four journey profiles have the same initial conditions (t = 0 s, v = 10 m/s) and final conditions (t = 5 s, v = 30 m/s) all of them differ at every other point from every other profile. Quite obviously, it will not suffice to simply think of initial and final conditions, not if we want to be able to give a thorough description of the profiles. For example, we could draw the tangents at t = 2 s to get
Of course, the ‘tangent’ to the first graph, which is linear, is the line itself. Hence, only three other distinct tangents are visible. What we can easily observe is that the gradient or slope of each line is distinct from the others. Since we have plotted speed/velocity on the vertical axis and time on the horizontal axis, the gradient would indicate the rate at which the velocity is changing with respect to time when t = 2 s. Of course, the question is, “How do we obtain these gradients?” This is where calculus comes in.
Introducing the Limit
To understand calculus, whether differential or integral, we need to grasp the concept of a ‘limit’. Of course, here we are dealing with the limit in the context of differential calculus. To get a feel for what this is before we begin a more formal treatment, click on the link in the caption on the figure below. This will take you to a Geogebra app. It contains the plot of the graph of a function in red. There is a tangent to this graph at the point A in blue. Another point, B, lies on the graph and the secant connecting A and B is drawn in green. Click and drag on the point B. This will allow you to move B along the graph. As you do this, the orientation of the secant (green line) will change. It will, in other words, rotate about the fixed point A.
Now, since the point A is common to both lines, the only difference between them is their orientations or gradients. You will observe that, as you move B closer to A, the secant will rotate about A with its orientation becoming closer to the orientation of the tangent. In fact, you can compare the gradients in the app itself. In the figure above, the gradient of the tangent is 15.33541, while that of the secant is 8.27131. You will note that there is a quantity h below the two gradients. In the figure above h = 1. This quantity represents the horizontal distance between points A and B. In other words, it is the difference in the x coordinates of the two points.
The Gradient of the Secant
Now, in the figure above, x is listed as the x coordinate of A. This means that the x coordinate of B will be x + h. Now, since the graph is that of the curve y = f(x), the corresponding y coordinates will be f(x) and f(x + h).
Now the gradient of the line joining the points (x1,y1) and (x2,y2) is given by
Hence, the gradient of the secant through A and B will be given by
Of course, this only tells us what the gradient of the secant is. We are not yet at the point of determining the gradient of the tangent, which, if you recall, from the earlier exploration of the 5 second journey, will give the rate at which the velocity is changing with respect to time at any given instant.
At this stage, we have no idea what the function is. Hence, things are quite abstract. In order to allow us to attach some numerical quantities to the above ‘formula’, let us consider the function f(x) = x2. I have intentionally chosen an easy function so we can get to the meat of the matter quickly. Let us also consider the x coordinate of A to be 2. Hence, the gradient of the secant will be given by
We can either expand the parentheses in the numerator or factorize the numerator to simplify as shown below
In the step above, the line in red is the approach with expansion, while the one in green is the approach with factorization. As the color scheme indicates, I prefer the factorization approach even though both give the same results.
Revisiting the Limit
What we have obtained so far is that, for the graph of y = x2, the gradient of the secant joining points A and B with x coordinates 2 and 2 + h respectively is 4 + h. Now, we saw that, when B approaches A along the curve, the secant rotates about point A with its orientation becoming closer to the orientation of the tangent. So let us see how the gradient of the secant varies as h varies. The table below shows the variation on both sides of x = 2.
Note that there is no secant defined for x + h = 2, because that corresponds to h = 0, which would make the expression for the gradient indeterminate since the result would be 0 ÷ 0.
Introducing Indeterminacy
Here, a note on indeterminacy is in order since I have had teachers and colleagues who have used the word ‘undefined’ in cases when they should have used ‘indeterminate’. The word ‘undefined’ is normally reserved for cases involving division by zero since it is meaningless to divine by zero. However, this is only when we are attempting to divine a finite, non-zero quantity by zero. When we attempt to divide zero by zero, we face a quandary.
You see, division of a non-zero quantity by zero is, as we have just seen, undefined. However, division of zero by a non-zero quantity results in zero. So in the expression 0 ÷ 0, which of the zeroes takes precedence? Since the whole expression for a function that yielded the problematic 0 ÷ 0 is important for the definition of the function it is impossible to determine which zero should take precedence. Hence, the expression is indeterminate.
Similar indeterminate forms yield expressions like 0 × ∞, ∞ – ∞, ∞ ÷ ∞, 00, ∞0, and 1∞. In each of these, there are two parts and it is impossible to determine which part should be given precedence. Hence, these forms are indeterminate.
The Meaning of the Limit
Returning to the table, we can see that, as x + h gets closer and closer to 2, the gradient, m, gets closer and closer to 4. So we say that the limit of the gradient as h approaches 0 is 4. It is crucial to observe that this statement does not mean. It does not mean that the gradient of the tangent can be determine by putting h = 0 in the original function. Neither does it mean that the gradient of the secant becomes the gradient of the tangent when B coincides with A.
What the statement about the limit declares is something about the behavior of some parameter, in this case the gradient of the secant, around a point where it is technically indeterminate. It says that, by making h arbitrarily close to 0, we can obtain a secant whose gradient is arbitrarily close to the gradient of the tangent. And since the gradient of the secant approaches 4, it must mean that the gradient of the tangent is 4.
A Glimpse Ahead
The limiting process is the cornerstone of both branches of calculus. We have actually already seen it in this blog in a more informal manner. In A Piece of the Pi, which was the first post in the series on π, we had seen that early attempts at approximating the value of π involved regular polygons with increasing number of sides. This was nothing but a limiting process since the area of the circle was always squeezed between the area of the inscribed polygon and that of the circumscribed polygon. As the number of sides increased, the areas of the two polygons approached each other and in the limiting case would have become the area of the circle.
Today, we have seen a slightly more formal look at the limiting process. Of course, we only considered one special function, f(x) = x2, which was relatively easy to analyze. However, we need to be able to use the limiting process for functions of all sorts – algebraic, trigonometric, inverse trigonometric, exponential, and logarithmic. And we know that we are aiming for something in the exponential side of things for the exponential form of complex numbers. In the next post, therefore, we will continue our journey into calculus with a look at some algebraic and trigonometric results. Till then, the sky’s the limit.
Last week I had taken a break for Good Friday. For those who are interested, I used a random event for my sermon. If you’re interested, you can find it here. Anyway, with the break behind us, we resume the series on complex numbers. In the previous post of the series, Pole Vaunting, we had introduced the idea of the argument of a complex number. Before that, we had a six part pit stop, in which we learned some trigonometry, especially as it relates to complex numbers. Before the start of the pit stop, in Modulating an Invariant Metric, we had introduced the idea of the modulus of a complex number.
Revisiting the Polar Form
With the modulus and argument of a complex number, we specify the complex number in terms of its polar coordinates. With these polar coordinates, we are able to locate any complex number on the complex plane. The modulus tells us how far the point representing the complex number is from the origin and the argument tells us the angle made with the positive real axis by the line joining the origin to the point representing the complex number. In particular, we have seen that the complex number z = (r, θ) in polar coordinates is the same as the complex number z = (a, b) in Cartesian coordinates, where
That Old Friend Again
If you have been following the blog, you may vaguely recall that I had done a short series on the number e. InNaturally Bounded?, I had shown that
However, we didn’t really prove the above result. And that is because we had not formally dealt with calculus. In fact, till today, we have not done that in the blog. However, I realize that, in order to do a sufficiently robust job in our journey ahead with complex numbers, I will need to at the very least dip my toes into the ocean of calculus. Hence, I will start another pit stop next week with a brief journey through calculus. But for now, I wish to explore some ideas we uncovered.
Rotational Rumination
In the previous post we saw that multiplying two complex numbers resulted in the multiplication of their moduli and the adding of their arguments. That is, if z1 = (r1, θ1) and z2 = (r2, θ2) then z1z2 = (r1r2, θ1+θ2). Now, let’s ask ourselves under what conditions does multiplication of a ‘two-part’ number result in the multiplication of one part and an addition of the second part?
The term ‘two-part’ number, of course, is poorly defined. Someone could think of a rational number as having two parts – the integral part and the fractional part. For example,
However, when we multiply two rational numbers, both parts get multiplied. This can be seen by both approaches shown below.
What this tells us is that the parts of complex numbers in polar form (i.e. the modulus and argument) do not function like the integral and fractional parts of rational numbers. What other ‘two-part’ numbers can we think of?
We can consider numbers written in scientific notation. For example, 2.3 × 103 has a mantissa or significand of 2.3 and an exponent or power of 3. Now, we can consider
What we can see is that the mantissas were multiplied while the exponents were added. This is similar to our saying that, when two complex numbers are multiplied, their moduli are multiplied while their arguments are added. Hence, it sees that there should be a way of expressing complex numbers in a ‘two part’ form where one part is a regular real number, similar to the mantissa, while the other part is written in exponential form, similar to the exponent.
However, we observe that, when we write z1z2 = (r1r2, θ1+θ2), the modulus of the product complex number z1z2 is given by the first part r1r2. However, we have seen that a complex number, represented in polar form, has a ‘size’ (i.e. its modulus) and an ‘orientation’ (i.e. its argument). Since in the product z1z2, r1r2 already expresses the modulus of the product in a manner similar to the mantissa, it must follow that θ1+θ2, which represents the argument of the product, corresponds to the exponent of the ‘two part’ number.
What this means is that there must be a way of representing complex numbers with two parts, similar to what we use in scientific notation, with the part corresponding to the mantissa representing the modulus and the part corresponding to the exponent representing the argument of the complex number. Since the modulus is a simple number, it must follow that it corresponds to the r of the polar form (r, θ). Hence, there must we a way of expressing the argument θ as the second, exponential part of the ‘two part’ number.
This is somewhat counterintuitive since, in conventional scientific notation, the exponential part conveys the ‘size’ of the number in terms of consecutive powers of 10 between which the number lies. That is, when we write 2.3 × 103, we know this number lies between103 and 104. However, 2.3 × 1032 is a number that lies between1032 and 1033. So, the exponential part gives the ‘size’ of the overall number, while the mantissa only locates the number between the two consecutive powers of 10.
However, now we are saying that there is an exponential form for complex numbers in which the modulus (i.e. size) is represented by the part similar to the mantissa while the argument (i.e. orientation) is represented by the part similar to the exponent. In other words, we are looking for some kind of exponential form involving θ that represents pure rotation.
Spiraling Away
Of course, since I went on a short detour earlier with e, you might be thinking that e has some role to play in this exponential form that represents pure rotation. You would be right! Of course, if you have studied complex numbers before, this will not be news to you. However, I’ll bet the reasoning toward this realization is somewhat novel. Anyway, the exponential form of complex numbers is an extremely powerful form. I could, of course, simply tell you what it is without any foundation for understanding how it is obtained. But that would be to short-change you – something I’d rather not do. Hence, as mentioned earlier, I will begin a second pit stop in the next post with some introductory explorations of calculus.
As we continue our series on complex numbers, we return to the main series of posts, having concluded what I called a ‘pit stop’ last week. The pit stop introduced us to trigonometry. It should be observed that I have only introduced trigonometric ideas that contribute to our understanding of complex numbers. There is much more that we could have studied. Perhaps at a later date, I will revisit trigonometry and take us beyond to some fascinating results, such as conditional identities and the properties of triangles. Anyway, let us trudge back to our study of complex numbers.
Revisiting the Modulus
When we began our pit stop on trigonometry, we had just introduced the idea of the modulus of a complex number. Hence, we saw that, given the complex number z = a + bi, the modulus is defined as
We saw that this way of defining the modulus is not affected by the rotation of the axes. In other words, this metric is invariant to the rotation of the axes. Of course, if we rotate the axes, the orientation of the complex number with respect to the axes will change. In other words, even though the modulus is invariant, the orientation is not. This is only to be expected since the orientation is determined in relation to the axes and the axes are rotating.
Numbers on the Complex Plane
However, given a complex number z = a + bi, it is clear that this number represents a units in the real direction and b units in the imaginary direction. Hence, the point representing z on the complex plane is fixed. We can, therefore, join this point to the origin with a line that is unique to the point and, hence, this line makes a unique angle with respect to the positive real axis. This is depicted in below.
In the above diagram, we can see that the real and imaginary parts of z can be written as
Introducing the Argument
Hence, the above complex number has an orientation equal to θ with the positive real axis. This angle θ is called the ‘argument’ of the complex number. Now, since the trigonometric ratios are periodic in nature, we can also conclude that
In other words, due to the periodic nature of the trigonometric functions, which is something obtained from repeated rotations about the center, there are infinitely many values of the angle that will land us on the same point on the complex plane. However, there is a unique angle with the smallest magnitude that gives the same orientation. This angle is called ‘the principal value’ of the argument and will take values between –π and +π. Let us see how this works.
Suppose I have a complex number that I have obtained as follows: It has a modulus of 8 and an argument of 22π/3. Since the argument is great than π, I repeatedly subtract 2π to get
Due to this we can conclude that
Since the angle now lies between –π and +π, it is the angle with the smallest magnitude that gives us the same orientation. Hence, while we can say that the same complex number can have arguments of 22π/3, 16π/3, 10π/3, 4π/3 and -2π/3, the principal argument of the complex number is -2π/3. Where would this complex number lie on the complex plane? All we have to do is calculate the real and imaginary parts and plot the point. This is shown below
Till now we have specified complex numbers in terms of their real and imaginary parts as a + bi. This is known as the Cartesian form, since it depends on the system similar to the normal Cartesian coordinates in which we have two axes intersecting at right angles. Of course, since the modulus and argument of a complex number specifies a unique point on the plane, we can represent the complex number in terms of these two numbers. Hence, the above complex number can be written as (8, 22π/2), where the first number specifies the modulus and the second the argument. This is known as the polar form of the complex number. By convention, the modulus is denoted by r, since the complex number having a modulus of r lies on the circumference of a circle of radius r centered at the origin.
Adding and Subtracting with the Polar Form
Now, let us see what happens when we add and subtract complex numbers using polar coordinates. Consider two complex numbers z1 = (2, π/3) and z2 = (3,π/4). Here, the complex numbers have been specified in polar form. We can obtain the following
Now, the terms in red are clunky. And there are no trigonometric identities that would help us reduce these to forms that are neater. So, why bother with the polar form if all it does is give us clunky terms? I’m glad you asked.
Multiplying and Dividing with the Polar Form
The power of the polar form lies in what it does when we multiply and divide complex numbers. Let us consider the same two complex numbers and see what happens when we multiply them.
To get from the third to the fourth lines we used the sum formulas that we obtained in the previous post for the terms in red. Of course, from earlier in the present post we can recognize that the terms in green are simply the complex number z = (6, 7π/12). However, note the following
That is, when multiplying complex numbers, their moduli get multiplied and their arguments get added. That is really convenient and reduces the effort of multiplication considerably. Knowing the two moduli were 2 and 3, we could right away conclude that the modulus of the product is 6. And knowing that the two arguments were π/3 and π/4, we could easily determine that the argument of the product is 7π/12.
The same power is evident when we try to divide two complex numbers. We should be able to infer that, when dividing z1 = (2, π/3) by z2 = (3,π/4), the result should be a complex number with modulus equal to 2/3 and argument equal to π/3 – π/4 = π/12. Mimicking college mathematics textbooks, I leave the proof to the reader!
Looking Ahead
What we have seen is that the polar form of complex numbers significantly simplifies multiplication and division. Addition and subtraction, however, are not really improved at all and may, in many cases, be actually more tedious. However, the polar form has more advantages as we will see in the next post. That will be two weeks from today as I will be taking a break next week for Holy Week and Good Friday.
In our study of complex numbers, we are reaching the end of a pit stop dealing with trigonometry. I ended the previous post by mentioning that the trigonometric functions are periodic in nature. I claimed that this is an important feature in the study of complex numbers. In this post, we will look at what periodicity means and what it reveals concerning the trigonometric functions.
If you read the preceding paragraph carefully, you will probably have realized that my vocabulary has changed. I am no longer speaking of ‘trigonometric ratios’ but of ‘trigonometric functions’. One obvious reason is that, once we have transcended the definitions based on sides of a right angled triangle, there are no sides, the lengths of which can be placed in a ratio. In fact, as we saw in the previous post, we are now defining sine and cosine of an angle in terms of the coordinates of the corresponding point on the unit circle. Moreover, when we say that the values of sines and cosines repeat, we must have a way of quantifying how this repetition occurs. In other words, we want to be able to relate the angle made by the line joining the point under consideration to the origin and the positive x-axis. That is, we want to obtain expressions for the coordinates in terms of the sines and cosines of angles. This means we are treating the sines and cosines of the angles as functions of the angles themselves. Hence the change in vocabulary was needed.
In case you are wondering about the title of this post, it is a feeble riff off the majestic Concluding Unscientific Postscript to Philosophical Fragments by the Danish Christian philosopher Søren Kierkegaard, a work that is quite unwieldy.
Anyway, the focus of this post is on the periodicity of the trigonometric functions and some corollaries that can be obtained from this periodicity. This is precisely where I wanted to reach before returning to the main series of posts on complex numbers. So let us proceed.
Identifying Periodicity
Suppose, we consider a point P on the unit circle corresponding to the angle θ. Then, from what we have learned concerning the unit circle, the coordinates of P will be (cos θ, sin θ). This is shown below.
Now, if we rotate the radius OP counterclockwise by 2π radians, then the new position of the radius will be the same as the original position. In other words, now, the same point P corresponds to a radius making an angle of θ + 2π radians. Since the new position of the radius is the same as the old position, the new coordinates of P, corresponding to the angle θ + 2π radians, will be the same as the old coordinates, corresponding to the angle θ radians. Hence, we can conclude that
Of course, if we rotate OP by another 2π radians, we will be back where we started. This holds true for any integral multiple of 2π radians in the clockwise or counterclockwise direction. Hence, we can conclude that
What this tells us is that the values of the cosine and sine functions will repeat every 2π radians in the clockwise or counterclockwise directions. Now, it is only by convention that an angle is often denoted by the letter θ. Normally, when we plot graphs, we use x for the independent variable and y for the dependent variable. Hence, if we plot the graph of y = sin x, we will get the following
Similarly, if we plot the graph of y = cos x, we will get
In like manner, if we plot the graph of y = tan x, we will get
The regular repetition of the pattern for all three graphs is what makes these functions periodic.
Corollaries of Periodicity
Now, if we are considering rotations of the radius, how do these values relate to each other? That is if we double the angle, how does this affect the values of sine, cosine, and tangent? Before we can answer that question, we need to look at the more general case of adding two angles. So let us deal with that.
Sum and Difference of Angles
The question we are asking is, “How do the sine, cosine, and tangent of A+B relate to the sines, cosines, and tangents of A and B? Here, it is possible for us to derive the following formulas
If you are interested in knowing how these are derived, below are the proofs.
If the above proofs are difficult to follow, please do ask me for an explanation. Of course, having obtained the values of sine and cosine for the sum of angles, we can obtain the values for the difference of angles as below
Double and Half Angle Formulas
Using the expressions for the sum of angles and putting A = B, we can obtain the following expressions for the sine, cosine, and tangent of the double angle 2A. This is shown below
Going in the reverse direction and using the expressions for cos 2A, we can obtain
Triple Angle Formulas
Using some of the above results, we can obtain an expression for the cosine of a triple angle as follows
In a similar way, we can obtain an expression for the sine of a triple angle as shown below
Finally, we can obtain the expression for the tangent of a triple angle as follows
Given the fact that the trigonometric functions are periodic and that there are expressions for the trigonometric functions of sums, differences, multiples and sub-multiples of angles in terms of the trigonometric functions of their component parts, there are an indefinite number of results that can be obtained. However, what we have so far is sufficient for our journey dealing with the study of complex numbers.
Getting Back on Track
However, in case you are wondering why we took this pit stop, recall that we have a geometric interpretation of a complex number on a two dimensional plane in which the horizontal axis is the real axis and the vertical axis the imaginary axis. We had dealt with this in It’s Not Rocket Surgery! and Modulating an Invariant Metric. Recall that, it was immediately after the latter that we began our pit stop.
Recall that we can depict the numbers 3 + 2i, -4 + 4i, and -2 – 3i as shown below.
We had also introduced ourselves to the ‘size’ of a complex number, which is called the modulus. This is nothing but the length of the line joining the origin to the point representing the number. This line, however, has a well defined orientation with respect to the positive real axis. Hence, we can specify the complex number in terms of a distance from the origin (i.e. its modulus) and an orientation with respect to the positive real axis. This orientation has a technical term, of course. However, we will deal with it and some other aspects related to operations on complex numbers in the next post. I know this may bug you, especially if you are the impatient sort. But from you I will accept no argument!
René Descartes’ concept of the Cartesian plane provides valuable insights for trigonometry. (Source: World History Encyclopedia)
As we continue with our pit stop in the series on complex numbers, let us remind ourselves where we have reached. In Trigonometric Identity Crises, we looked at some trigonometric identities. This followed on the heels of Round and Round, in which we saw why a right angle is defined to have 90°. We also defined the radian. Before that, in A Mathematical Partnership, we took a historical tour, starting from India and ending in Rome, during which we understood how the trigonometric ratios are defined and why they have their names. The pit stop began with the post that preceded it, My Trigger, No Metric, Beginnings, in which I explained my long standing fascination with trigonometry, beginning with my explorations with Pythagorean triples.
However, what we have seen so far is that the ratios are defined in terms of the lengths of sides of a right angled triangle. Now, students of Grade 6 (or maybe 7) and above will know that the sum of the angles of a triangle is 180°. This means that, in a right angled triangle, since one angle already is 90°, the other two need to add up to 90°. In other words, as defined, the ratios only work for angles that are acute, that is, strictly between 0° and 90°, not including both of them.
Now, since we can have angles that measure 0°, 90° and everything above 90°, including reflex angles, this means that the definitions of the ratios do not include most of the angles we may encounter. This is a serious drawback.
Descartes to the Rescue
Mathematicians do not like artificial restrictions. Hence, when faced with such an obstacle, they become creative and look for ways to circumvent the obstacle. In many such cases, the innovativeness involved is just mind boggling. Of course, since we are living some centuries after these innovations, we lack the ability to view them as something fresh that contributed to the advancement of mathematical knowledge. Indeed, hardly any teacher spends any time showing the students what the ‘before’ and ‘after’ of these innovations was, thereby impoverishing the students by depriving them of this knowledge.
In the case of extending the definition of the trigonometric ratios, the innovation involved is truly remarkable and underscores something I attempt to communicate to my students often, namely that the different branches of mathematics form one unified body of knowledge in which everything dovetails perfectly with everything else.
So let us see what they did. We begin with the Cartesian plane, named after René Descartes, the French mathematician. It consists of two number lines at right angles (we know what that means!) to each other and intersecting at their respective zero points. Now, let’s consider our right angled triangle. However, we place the vertex of interest at the origin of a two dimensional Cartesian plane, with one of the arms that forms the vertex aligned with the positive x-axis. This is shown below.
Beginning a Redefinition
When we defined the ratios and understood why they have their names, we had considered a unit circle. In keeping with that, the hypotenuse of the triangle (side OA) is going to be 1 unit. Now, we can see that
Now, as indicated in the diagram, the point A has coordinated (x, y). However, the x coordinate of a point is the directed distance of the point from the y-axis. Similarly, the y coordinate of a point is the directed distance of the point from the x-axis. Since the term ‘directed distance’ may be unfamiliar to some, allow me to explain. If we start at the y axis and move toward the right to reach the point, then the distance is considered positive and the x coordinate will be positive. If, however, we more toward the left to reach the point, then the distance is considered negative and the x coordinate will be negative. In a similar way, starting from the x axis, an upward movement is considered positive, while a downward movement is considered negative. Hence, we can conclude that, to the right of the y-axis the x coordinates will be positive while they will be negative to the left of the y-axis. Also, above the x-axis the y coordinates will be positive while they will be negative below the x-axis.
However, with reference to the above diagram, it is clear that OB is the directed distance of A from the y-axis, while BA is the directed distance of A from the x-axis. In other words, the lengths OB and AB, both being positive in the above diagram, give the x and y coordinates of the point A. Hence, we can conclude that
In other words, with respect to the right angled triangle OAB, the cosine and sine of the angle at O are equal to the x and y coordinates of the point A.
Now, let us revisit our Cartesian plane. Without any points on it, it would look like
Here, the Roman numerals (I, II, III, and IV) indicate the quadrant number. The signs below indicate the signs that the x and y coordinates have in that quadrant. Hence, in the second quadrant (i.e. II), the x coordinate is negative while the y coordinate is positive. This follows the directed distance convention we saw earlier – right/up is positive, left/down is negative.
Completing the Redefinition
We can see here that no quadrant has the same combination of signs for both the x and y coordinates. This allows us to extend the definition of the trigonometric ratios as follows. Consider a point (say P) on the circumference of the unit circle. Draw the radius joining P to the center O. Let the angle made by OP with the positive x-axis be θ. Of course, here we have the need for a further convention. Since any point on the circumference can be reached through a clockwise and an anticlockwise rotation, we define the anticlockwise rotation, that sweeps the quadrants in proper order, to be a positive angle measure. Hence, a clockwise rotation would be a negative angle measure.
Anyway, let the angle made by OP with the positive x-axis be θ. Then we define the cosine of θ to be the x coordinate of P and the sine of θ to be the y coordinate of P. We can depict this as follows.
Ratios for Allies
We can see that, since the angle made by the axes with each other is 90° (or π/2 radians), angles that differ by or add to multiples of 90° (or π/2 radians) will have trigonometric ratios that are related to each other. Such pairs of angles, which differ by or add to multiples of 90° (or π/2 radians) are called allied angles. Using these extended definitions, we can obtain the following results
The above table may be a little confusing. For starters, I have used radian measures for the angles. Recall that 180° = π radians. So, let’s see how the table is to be read. Suppose I want to determine sin α, where α = π – θ. The row for sin α is the second row of the table, while π ± θ is the third column. The element in that cell is ∓ sin θ. Since I am interested in π – θ and the minus sign is second in the symbol ±, I take + sin θ, which is second in the term ∓ sin θ. Hence, I can conclude that
How do we obtain these results? If you are interested, you can find visual proofs using a general, not unit, circle here. With our extended definitions of the trigonometric ratios, we can obtain their values for any angle. If you wish to see this for yourself, you can check here, here, and here.
Rotations Without End
Now, of course, we are treating angles as though they were also on somewhat of a number line, just one that wraps around itself. We have adopted the convention that counterclockwise rotations are positive and clockwise rotations negative. Also, we realize that there is nothing that hinders a radius from continuing to rotate beyond the full cycle. So we could have angles that exceed 2π radians (360°) and angles that go below -2π radians (-360°). An angle of 7π radians, for example, would involve three full counterclockwise rotations, each of which accounts for 2π radians, followed by a further rotation by π radians. Similarly, an angle of -810° would involve two clockwise rotations, each of which accounts for -360°, followed by a further rotation of 90°. Of course, with each rotation, the point P will reach identical positions as in the previous rotation. This means that, after every rotation, the values of the trigonometric ratios repeat themselves. This is a feature known as periodicity, which is crucial in the study of complex numbers. Since it is a very important part of the study of trigonometry, we will turn to that in the next post. Till then, keep twirling!
We are in the middle of a pit stop in the middle of our series on complex numbers. The pit stop is intended to introduce the reader to the basics of trigonometry. In the previous post, we saw why we have 360° in a circle and 90° for a right angle. We also defined the radian, which I demonstrated was not an arbitrary unit, unlike the degree. In the post preceding that, we saw how partnership between mathematicians from India, Persia, Greece, and Rome gave us the names of the trigonometric ratios.
At the conclusion of this pit stop, when we return to complex numbers proper, we will see how integral the ideas we uncover in trigonometry are to the study of these numbers. For now, as we begin the next part of our pit stop, let us remind ourselves about the definitions of the trigonometric ratios.
Nomenclature
With respect to the figure above, the angle we are focusing on is ∠ABC, which we are calling θ. The side opposite θ is AC, which has a length equal to b. The side adjacent to θ is BC, which has a length equal to a. Finally, the hypotenuse of the triangle is AB, which has a length equal to c.
Per our definitions of the trigonometric ratios
The three ratios relate as follows
We also define three other ratios as follows
Here, csc stands for cosecant, which is often shortened to cosec. Similarly, sec is short for secant and cot is short for cotangent. With reference to the post made two week back, I leave it to the reader to discover why the secant has its name.
Pythagoras Again
Now, from Pythagoras’ theorem applied to the triangle ABC, we know that
If we divide the entire equation by c2, we will get
Pay close attention to the notation above. What the last equal says is that
Consider the equation
This is the first identity students learn in trigonometry. It is called a Pythagorean identity because it is derived from the Pythagorean theorem. But what do we mean by an identity?
An identity is a mathematical statement that is true for all values of the variables. In the above equation, the angle θ is the variable since we did not decide on its value. The equation
is true for all values of θ and is, therefore, called an identity. An identity, while in this case written in the form of an equation, differs from common equations in that one cannot ‘solve’ an identity to obtain a value of the unknown since it is true for all values.
Now, starting with this identity, let us divide it by cos2θ. We would then get
This last equation is also a Pythagorean identity since it too is derived from the Pythagorean theorem. The reader can begin with the first identity and divide it by sin2θ to obtain the third Pythagorean identity
Proving Identities
Note that the two additional Pythagorean identities did not require any additional trigonometry apart from the definition of the secant, cosecant, and cotangent. This is a big feature in trigonometry, where new identities can be proved based on existing ones. Let us consider some examples.
Example 1
Suppose we were asked to prove that
We could proceed as follows
Here LHS and RHS denote ‘left hand side’ and ‘right hand side’ and indicate the direction in which we are moving to prove the identity.
Example 2
Let’s consider another example. Suppose we were trying to prove that
We could proceed as follows
While this proof is longer than the earlier one, note that in both we use simple algebra, such as was discussed earlier in Primer to Complex Numbers, the post with which I began the series on complex numbers.
Example 3
As a final example, suppose we were asked to prove
We could proceed as follows
Interrelatedness of the Trigonometric Ratios
Note that, along the way, we made use of the Pythagorean identities quite often. This is because the trigonometric ratios simply conceal the Pythagorean theorem in their definitions. In fact, because of this feature, namely that all the ratios are related to each other through the Pythagorean theorem, we can actually show the following
That is, each of the ratios can be expressed completely in terms of any one of the other ratios.
‘Sining’ Off
However, if you have been paying attention, you will realize that we have a problem. We know that no triangle can have an angle equal to 0° or 180°. We also know that a right angled triangle cannot have another right angle. We also know that a right angled triangle cannot have an obtuse angle or a reflex angle. In other words, defining the trigonometric ratios in terms of lengths of sides of a right angled triangle means that most of the angles cannot have trigonometric ratios defined for them. This is a remarkable drawback. However, mathematicians have gotten around this restriction in quite an ingenious manner. We will turn to that in the next post. Till then here are a few identities you can try proving.
We have reached a remarkable junction in our journey through complex numbers. We are in the middle of a pit stop in which we are looking at some trigonometric ideas. And on the occasion of Pi day 2025, I think this post makes a lot of sense. Two weeks back, in My Trigger, No Metric, Beginnings, I introduced the idea of right angled triangles and Pythagorean triples. As a reminder, Pythagorean triples are sets of three integers that satisfy the Pythagorean theorem
In that post, I looked at different ways of generating these triples, an exercise with which I occupied myself shortly after I was introduced to these triples. Last week, in A Mathematical Partnership, we looked at how the common trigonometric ratios are defined. We also understood how the ratios – tangent, cosine, and sine – obtained their names.
It is time now to understand how we humans have decided to measure angles. After all, that is precisely what the trigonometric ratios deal with – the ratios of the lengths of sides of a triangle corresponding to the measure of the angle being considered.
A Degree of Definition
Most of us would have started our journey into angle measurement with the knowledge that a right angle is equal to 90 degrees, where the ‘degree’ is the unit of measuring angles. However, when I have asked students how to define a ‘degree’, many, if not most, of them are flummoxed. They know that 90 of these units give us a right angle. And some students may be able to then say that a degree is one-ninetieth part of a right angle. And while this is perfectly correct, the definitions are circular since the degree is defined in terms of the right angle and the right angle in terms of the degree. I could just as well define the angle below as a right angle and obtain a different measure of a degree than what we conventionally use.
You may object and tell me, “That angle is not a right angle.” But I could very well respond, “How do you know?” Indeed, since the definition of each term – right angle and degree – depended on the other, there is no way of deciding what each measure actually is.
Of course, there is a better way out. Unfortunately, none of my mathematics teachers told me this. And given the flummoxed expressions on the faces of most students when asked to define the degree or the right angle without reference to the other, it seems that this unfortunate practice continues to date.
So let us begin with two intersecting straight lines as shown below
Now, apart from their location and orientation, there is nothing that differentiates the line AB from the line CD. Hence, the angle form by each of these lines at the point of intersection, O, must be identical. In other words, since ∠AOC + ∠BOC forms the straight line AB and, since ∠BOC + ∠BOD forms the straight line CD, the two measures should be equal. So we can write
This is nothing other than saying that vertical angles (or vertically opposite angles) are equal. Now note that ∠BOC is adjacent to both ∠AOC and ∠BOD. These pairs of adjacent angles (∠AOC and ∠BOC or ∠BOC and ∠BOD) have the same angle sum, as we have stated above. However, there is only one configuration in which the adjacent angles will have equal measure. This is shown below
We can now define the four angles formed, all of which will have equal measure, to be right angles. From here, we can define the degree to be one-ninetieth part of the right angle and have no circularity in our definitions. But why do we define a right angle to be 90°? Why not a neat, nice round number like 100°? Of course, there is a unit of angle measure, the gradian, that is defined as one-hundredth part of a right angle. But I bet not too many people know about it. Why, when 100 is a part of the decimal system, being a power of 10, has this unit not caught on? And why do we function with the right angle being defined as 90°? The answer to this is two-pronged.
Celestial Constraints
We live on a strange planet, the only one we know of that can sustain the kind of life that we are accustomed to. Our planet revolves around an unimpressive, medium sized star in a orbit that takes it about 365.25 of its own rotations about its axis. That’s just a convoluted way of saying that one earth year is equal to approximately 365.25 earth days. The earth also has one natural satellite – the moon – which revolves around the earth in an orbit, which when viewed from earth, takes between 29 and 30 earth days to traverse.
The seasons are, for the most part, governed by the solar cycle of the earth. That is, the earth’s position relative to the sun has a greater impact on the seasons than does the moon’s position relative to the earth. However, it is difficult to determine exactly where in the solar cycle we are, since measuring exact times for sunrise and sunset is impractical almost everywhere. This means that determining the exact dates for the equinoxes, where the length of daytime and nighttime are equal, or the solstices, where the lengths of daytime and nighttime are extreme, is practically impossible.
Due to this, most human cultures have found it easier to mark the progress through the year ironically with the lunar cycles. After all, it is quite easy to look into the night sky and realize that the moon is not visible! Hence, most cultures have measures this progress through the year by starting months when the new moon cycle began.
However, as mentioned earlier, the moon takes 29 or 30 days to complete one cycle around the earth, as viewed from earth. This means that we encounter a new moon once every 29 or 30 days. Hence, most cultures have measured the progress through the year in terms of months that are 29 or 30 days long, following the lunar cycles. However, this results in a shortfall of about 11 days when compared with the solar cycle, resulting in a drift. To compensate for this, most cultures have the practice of adding an extra month every 2 or 3 years to get the calendar synchronized with the solar cycle.
So what we have is a solar cycle of approximately 365.25 days and a lunar cycle, twelve of which amount to between 353 and 355 days. The number 360, which is four times 90, falls neatly in this interval between the period of twelve lunar months and one year. The number 400, which is what we would get from the gradian, lies outside this interval and hence would not have been suitable for measuring anything on planet earth.
A Deliberation on Divisors
Of course, the second consideration, beyond what is forced upon us by the weirdness of the interaction between the sun, the earth, and the moon, is that we want a number that has a large number of divisors. This allows for dividing the year into many smaller fractions without having fractional days to account for. It is here that we see another advantage of 360 over 400. Here at their divisors
Apart from 1, which is a divisor of all numbers, and 400 and 360 respectively, since every number is automatically a divisor of itself, 400 has 11 divisors, while 360 has twice as many. In other words, 360 allows us to divide the year in 22 different ways without fractional days, while 400 only gives us 11 options.
Now, if we extricate ourselves from the realms of planetary rotations and revolutions and turn to geometry, we can see that dividing one complete rotation about a point into 360 parts gives us more options than dividing by 400 parts. On account of this and the link with the lunar and solar cycles, the degree was defined to be one-ninetieth part of the right angle.
However, we can see that it is an arbitrary division. Divisibility, which I discussed in Behind the Smoke and Mirrors, plays no role in determining the units of other quantities. Why should it play such a prominent role when it comes to measuring angles? In fact, let us look at how some fundamental units are defined.
Unimaginable Units
The second is defined as 9,192,631,770 cycles of radiation associated with the transition between the two hyperfine levels of the ground state of the cesium-133 atom. If that number is strange, consider that the metre is defined as the distance traveled by light in a vacuum in 1/299,792,458 seconds. The kilogram is defined in an even more roundabout way by first defining Planck’s constant to be 6.62607015 × 10−34 joule second and then using the definitions of the second and the metre and the fact that one joule is equal to one kilogram times metre squared per second squared to arrive at the measure of the kilogram. What this lets us know is that, when it comes to units of measurement, divisibility is the least of our concerns.
What this means is that choosing a unit for angle measure based on divisibility is arbitrary. And since we understand that the twelve lunar cycles and one solar cycle are not sufficient grounds for measuring angles, any number we decide upon, whether 360 or 400, is going to be arbitrary at best. So, how do we decide on a better, more appropriate unit? We have seen earlier, in our series on e and π, that we are often forced to accept constants that are inconvenient. When I say inconvenient, I mean that we would never have thought to consider these numbers as significant. In fact, were these numbers not forced upon us, we would never in our wildest dreams have settled on anything even remotely close to their values.
So, how do we go about defining a unit for measuring angles? The previously mentioned units (second, metre, and kilogram) were all defined in terms of physical processes or quantities. We do not have anything like this for the angle since an angle is an idealized geometric entity. So, we are forced to consider other ways to define a new unit for angle measure.
Reining in Rotation
We start with the observation that an angle simply represents a rotation. So, for example, in the figure below, the angle AOB can be viewed as a rotation of OA in the counterclockwise direction with O being the center of rotation until A coincides with B.
So the measure of the angle can be quantified as the amount by which we must rotate from the starting point until the angle is formed. We also recognize that a logical point would be one complete rotation about O, which would bring OA back to coincide with itself. Once we have done this, the point A would have traced the circumference of a circle with center at O and radius equal to the length OA as shown below
Now, we know that, if the radius of this circle is r, then the circumference is 2πr. We can also understand that, if point A traverses a distance that is twice the distance covered from A to B along the circumference, the angle by which OA would have rotated would be twice the original angle AOB. In other words, the distance covered is proportional to the angle through which OA rotates. We can also see that the distance is proportional to the radius. That is, if the radius is doubled, the distance covered would be double the original distance.
We now look at the formula
Since r represents the effect that the radius has on the circumference, we ask ourselves what the 2π represents. Remember that that distance covered during a rotation is proportional to both the radius and the angle of rotation. Since, r represents the proportionality to the radius, it must be the case that 2π represents the proportionality to the angle of rotation. However, the circumference itself represents a full rotation. Could it be then that 2π is the angle corresponding to the full rotation? What would that mean?
Now, while OA is rotating about O, point A traces arcs of the circle. Let the arc length corresponding to an angle θ be s. Since arc length is proportional to the angle of rotation, we can write
Now, if we keep the angle of rotation fixed at θ, then we can understand that, if the circle was twice as large, that is, if the circle had double the radius, then the arc length would be double what it was. Hence, the constant k must be none other than the radius r. This gives us
Realizing the Radian
Comparing this with the equation C = 2πr we can see that, since C corresponds to the arc length for a full rotation, 2π does indeed correspond to the angle of rotation. Using the equation s = θr, we can see that, if we set θ = 1, we will get s = r. This gives us the definition of the new unit of angle measure, called the radian. The radian is the angle subtended by an arc of a circle that has length equal to the radius of the circle at the center of the circle. In other words, starting from OA, when we have rotated through an angle equal to 1 radian, the arc length involves would be equal to the radius of the circle. This is depicted below
With this definition of the radian, we can conclude the following
One full rotation or the angle about a point is equal to 2π radians.
A half rotation or the angle for a straight line is equal to π radians
A quarter rotation or a right angle is equal to π/2 radians
From this we can also get the following conversion equation
The superscripted ‘c’ is the symbol for the radian, indicating ‘circular measure’ since the radian is the unit of angle measure defined solely based on the measurement of the circle. Of course, since the radian is the standard unit for angle measure, by convention it is often not indicated. Hence, in the equation
it is understood that the left hand side specifies π/3 radians.
Coming Full Circle
What we have seen in this post is that the degree measure is an arbitrary unit. It was adopted on account of two considerations – the orbital period of the earth (i.e. approximately 365.25 days) and the convenience introduced by the fact that 360 is a highly composite number with many distinct divisors. However, we saw that units for other quantities did not pay heed to any such requirements. This allowed us to think of the unit for rotations in terms of the features of the circle, leading us finally to the radian.
However, while almost all the other units have a certain degree (pun intended) of arbitrariness to them, the radian is completely defined in terms of the internal structure of the object it is designed to describe. For example, with the different measurement systems that humans have devised, we can imagine a world in which the standard unit of length is a cubit instead of a metre and the standard unit of mass is the pound instead of the kilogram. The second, based as it is on the period of rotation of the earth and the convenience of dividing successively by 24, 60, and 60, might have a slightly less amount of arbitrariness to it, at least for creatures who inhabit this planet. However, no matter where one may find oneself in the universe, all circles on Euclidean planes will yield the same measure for angles regardless of the name any extraterrestrial species may have given the quantity.
Having defined the radian, we will continue with our pit stop in the next post in which we will consider some trigonometric identities. Until then, hope you have seven turns for the better!
In the previous post, we began our first pit stop during our journey into the wonderful realm of complex numbers. We looked at Pythagorean triples and some ways of generating sequences of triples. I admitted at the end of the post that this was not strictly a study of trigonometry and that we normally begin by defining the ratios since, cosine, and tangent. I promised you that I would tell you how we got these weird names. That is the agenda for this post. We will consider the ratios in the reverse order because the story of sine will take us on two globetrotting journeys that I am sure many, if not most, of you have never undertaken before this.
Triangle Nomenclature
We begin our journey by considering some nomenclature. In what follows, I will assume that the triangle is ABC and that the right angle is at vertex C. Further, the length of the side opposite a given vertex will be denoted by the lower case letter of that vertex. Hence, the length of side BC, which is opposite vertex A, will be denoted as a. The figure below should clarify the nomenclature we will use in the discussions from now on.
Now the trigonometric ratios for angle B are defined as follows
How did we get these names?
Off on a Tangent
For the definition of the tangent, we begin with a unit circle. This is simply a circle with radius equal to 1 unit. Let us consider a unit circle with center at O. A point, P, is chosen on the circumference of the circle and a tangent is drawn at that point. From what we know about circles, this tangent is perpendicular to the radius OP. Now consider another point, Q, on the circumference of the circle, such that the angle made by the radius OQ with the radius OP is θ. Now we extend the radius OQ and the tangent at P so that they intersect at point R. All of this is shown in the figure below.
Now it is clear that ΔOPR is a right angled triangle with the right angle at vertex P. Now, with respect to the angle θ, RP is the opposite side while OP is the adjacent side. Hence, per the definition of the tangent above it should be clear that
However, OP = 1, since we are considering a unit circle, hence we obtain
However, RP is the length of the tangent corresponding to the points P and Q, the radii of which makes the angle θ. Hence, the name of the ratio. The tangent of an angle is the length of the tangent formed as above in a unit circle.
Cosplay Anyone?
Now, that’s all well and good. What about the cosine and sine? We go back to our original right angle triangle. However, now we focus on the vertex A. With respect to angle A, the trigonometric ratios will be as follows:
Note that
However, since C is a right angle, it follows that
In other words, A and B are complementary angles. When we say that
We are saying that the sine of an angle is the cosine of the complementary angle and vice versa. In other words, cosine is simply a shorthand for complementary sine, that is, sine of the complementary angle. But how in the world did the ‘sine’ get its name?
A Journey from Sanskrit to Latin
The ratio of the opposite side to the hypotenuse was used in many parts of the world. However, the name ‘sine’ strangely has its roots in the Indian subcontinent.
With reference to the figure above, the red arc and the green chord can be seen as a bow and string. Hence, Indian mathematicians named the string, that is, the chord, jyā, which is the Sanskrit word for ‘bowstring’. The half-chord was called ardha-jyā, which means ‘half bowstring’. Over time, the mathematicians realized that the ardha-jyā had more common use in the study of the triangles than did the jyā. Hence, they dropped the prefix and began calling the half-chord jyā.
From here we have two stories, equally fascinating, and equally plausible (or implausible).
One pathway indicates that a synonym for jyā in Sanskrit is jīvá. This term was adopted by Arabic scholars, who simply attempted to transliterate it into Arabic script as jība. However, since Arabic is not written with short vowels, this was rendered simply as jb. However, jība is not an Arabic word. In time, jb came to be pronounced as jayb, Arabic for “bosom”. When Latin scholars found these Arabic works, they were reluctant to use the word for “bosom” in a mathematical work. Instead, they chose the word ‘sinus’, the Latin word for the hanging fold of a toga over the breast, from which we get the final form ‘sine’.
The second pathway also observes the synonym jīvá but recognizes that jīvá also means ‘life’ in Sanskrit. From here the proposal is that the transformation happened earlier, when the wordjīvá, now understood to mean ‘life’, was translated as ‘zind’, the Old Persian word for life. From there the name went westward to the Latin world where, unfortunately, the final ‘d’ of the name was misread as a breathing mark. This is understandable since, in Old Persian the letter for ‘d’ is 𐭣 and the breathing mark is 𐭥, making for a very small difference. Hence, the name got transliterated as ‘sine’, with the initial ‘z’ getting replaced with a ‘s’.
Both the above pathways are based on hypotheses concerning when the Sanskrit texts were translated into either Arabic or Old Persian. We do not possess any extant manuscripts that indicate which pathway was taken. It is also possible that both pathways were taken a few centuries apart from each other, leading to a strange convergence when the various corruptions and mistranslations reached the Latin world. However, both pathways indicate that the origin of our name ‘sine’ is from the Sanskrit word jīvá.
Knowledge Transcends Borders
People from the Indian subcontinent may express pride at the fact that the main trigonometric ratio (sine) obtains its name from a Sanskrit word and from the imagination of some native mathematician who saw a segment of a circle as a bow and string. While there is no denying this, the length corresponding to the ardha-jīvá was used in many places, from Egypt to Persia. In other words, the usefulness of the length of the half-chord in relation to the angle formed by the half-chord was recognized by mathematicians the world over. For me, the journey, regardless of the pathway we decide upon, indicates that the world has always been one in which knowledge moved across all artificial national borders. Beginning with a Sanskrit word jīvá we have ended with a Latinized word ‘sine’. And I can only hope that this permeability of knowledge continues despite all the obstacles that are being introduced.
Having seen how the trigonometric ratios received their names, we will continue our pit stop in the next post to understand how we decided to measure angles. Why do we have multiple units – the degree and the radian? Which one do I prefer? It will be pi day and I think the next post will be an appropriate one for the occasion. Stay tuned.