Acutely Obtuse

A Piece of the Pi

March 20, 2026

As mentioned earlier, I am taking a break from publishing new material for a few weeks. During these weeks I am republishing the series on e and the one on π. The post below was originally published on 9 August 2024.

Definitionally Limited

(Source: Live Science)

Almost everyone who has studied beyond Grade 4 has probably heard of the mathematical constant π. Students are told that π represents the ratio of the circumference of a circle to its diameter. That is all well and good. But how do we calculate it? I mean, we could potentially use a piece of string to measure both the circumference and the diameter. Even assuming no error in determining one exact rotation around the circle (for the circumference) and diametrically opposite points (for the diameter), the experiment is limited by the measuring instruments.

Suppose, however, we have access to measuring instruments (necessarily hypothetical) that can measure to the accuracy of the Planck length (i.e. 1.616255)×10^-35 m), we will still only get measurements with about 40 significant figures. Since the accuracy of a mathematical calculation cannot exceed the accuracy of the least accurate number in the calculation, we will get a result that has 40 significant figures for π. Actually, if we look at any measuring instrument, we will see that there are hardly any that give more than 4 significant figures. This means that even obtaining 3.1416 as a 5 digit approximation for π is a pipedream if we take the measurement.

However, we are told quite early on that π is an irrational number. This means that, when we attempt to write π in a decimal format, the expression continues endlessly without any pattern of repetition. What this means is that the definition of π cannot be used to determine its value! In fact, we should have known this precisely from the definition. If C÷D is irrational, at least one of the numbers, C and D, must be irrational. Otherwise, C÷D will give us a rational number. So we can see that the definition of π that is based on measurement is an impractical way of obtaining the value of an irrational number like π.

Now, if you refer back to our posts on e, especially the opening post, you will realize that this is exactly the opposite issue we faced there. The definition of e was able to give us results to as great an accuracy as we desire. While we had to do quite a bit of heavy mathematics before we obtained a usable infinite series, once we had obtained the desired infinite series, we could use it to obtain the value of e to any arbitrary level of accuracy.

Not so with π since its definition in terms of measured lengths renders it limited precisely because no measuring instrument can be indefinitely accurate.

Why the Symbol

Before we proceed, however, it is interesting to make a brief detour concerning the history of how the Greek letter π came to be used to denote this ratio. The earliest known use of the π in a similar context was to designate the semiperimeter of the circle. With the Greek letters δ and ρ representing the diameter and radius of the circle, the many mathematicians used the ratios

to denote what we currently would denote as π and 2π respectively.

In his 1727 paper An Essay Explaining the Properties of Air, Euler used π to denote the ratio of the circumference to the radius. Just 9 years later, in 1736 in his book Mechanica, Euler used π to denote the ratio of the circumference to the diameter. From then on he consistently used the latter designation and, because he corresponded prodigiously with other mathematicians, his notation eventually was adopted though, even till 1761 there was quite a bit of fluidity.

Despite the confusing history of the symbol, something utterly lacking in the case of e, π has been the more popular number and more efforts have been made to calculate its digits than the digits of e. Obviously, this means that mathematicians have some other result, necessarily stemming from the definition of π, that allow the generation of different infinite series that can be used to calculate the value of π. The current record stands at 105 trillion digits, which required 75 days of dedicated computer time by a computer company. But if the definition ofπ is limited, how in the world do mathematicians calculate its digits?

The Polygon Method

The original impetus came from the study of polygons, specifically regular polygons. In the figure below we see the same circle repeatedly inscribed and circumscribed by regular polygons. On the left we have pentagons. In the middle are hexagons. And on the right are octagons.

Here it is possible to define the circle as having a radius of 1 unit. Then the distance of the vertices of the inscribed polygon to the center of the circle will also be 1 unit. And the perpendicular distance of the center of the circle from the sides of the circumscribed polygon will also be 1 unit. Using this, the perimeter of both polygons can be determined. Also we can easily see that the circumference of the circle has a value between the perimeters of the two polygons. This allows us to place lower and upper bounds on the value of π. With polygons having more sides our accuracy becomes better and we can get tighter bounds.

Using this method with 96-sided regular polygons, Archimedes, in the 3^rd century BCE, obtained that

This gives us

which most students in middle school will realize are accurate only to 3 significant figures! 96 sides and 3 significant figures! I don’t know about you, but I think this is a really poor payoff. I applaud Archimedes for sticking with his 96-sided polygons.

Not to be outdone by the redoubtable Archimedes, Liu Hui from the Wie Kingdom used a 3,072-sided polygon and determined that the value of π was 3.1416. That’s a factor of 32 more sides yielding just 2 more digits!

Not having any other methods at the time, around 480 CE and using Liu Hui’s algorithm with a 12,288-sided polygon, Zu Chongzhi showed that π could be approximated by 355÷113, which gives 3.1415929…, accurate to 6 digits. In 499 CE, Aryabhatta used the value 3.1416, though, unfortunately, he does not tell us the method he used or if he was using someone else’s value.

Mathematicians continues using this polygon method for the next few centuries. In 1424 CE Jamshīd al-Kāshī determined 9 sexagesimal digits of π, the equivalent of about 16 decimal digits, using a polygon with 3×2²⁸ sides! Finally, Christoph Grienberger obtained 38 digits in 1630 CE using a polygon with 10⁴⁰ sides!

I do not wish to detract from the achievements of any of these great mathematicians. The patience and perseverance they displayed is exemplary. Morover, they achieved what they did in an age before calculators and computers. They only had their hands and minds to work with. Despite their achievements – or rather precisely because of them – it is evident that the polygon method is wonderfully accurate, but also woefully inefficient.

The Path Forward

Apart from the slowness with which the polygon method converges to the value of π, it is evident that, when you have a finite number of terms, here represented by the finite number of sides of the polygon, regardless of how large the number of sides may be, the final result will only be achieved as a value between two bounds specified by the inscribed and circumscribed polygons. Due to this, mathematicians have devised what can only be described honestly as ‘workarounds’, methods that yield the value of π but not because of some direct application of the definition.

These methods fall into two large branches. First, we have the methods that rely on some kind of iterative formula. Second, we have methods that yield an infinite series. We must ask ourselves how an iterative formula or infinite series can be generated for a number that is defined in terms of the ratio of two lengths. In the next two posts we will see just how this is done.
Rationally Irrational

March 13, 2026

As mentioned earlier, I am taking a break from publishing new material for a few weeks. During these weeks I am republishing the series on e and the one on π. The post below was originally published on 14 June 2024.

Recapitulation

Visualizing the irrationality of e (Source: Mathematica Stack Exchange)

We have reached our final post in this series on the number e. In the first post of the series, we introduced e and looked at the reasons for which it is the base of the natural logarithm and the exponential function. In the second post of the series, we used various techniques to determine that e lies between 2 and 3, a fact that will be important in today’s post. In the previous post, we considered three examples of infinite series for e to show that more advanced mathematics does not guarantee quicker convergence to the value of e. In today’s post, we will demonstrate that e is an irrational number.

Reductio ad Absurdum using Bounds

Since e lies between 2 and 3, e cannot be an integer. We will now use reductio ad absurdum, a method we introduced in another post. So we will begin by assuming that e is a rational number. Hence, suppose

Since e is not an integer, it follows that q cannot be 1. Hence, q>1. Now, we also know that

Combining the two we get

If we multiply this equation by q! we will get

Distributing the q! into the parentheses on the right side we get

Now the left side, being the product of natural numbers, is a natural number. On the right side, the terms in green are all integers since the numerator q! is necessarily a multiple of the denominators. This means that, for the equation to hold, the terms on the right must add up to an integer. Let us designate this sum as R. Then

Now, since q > 1, it follows that

Taking the reciprocals of each term we get

This means that

Hence, we can conclude that

However, we have seen, when we obtained the lower and upper bounds for e, that the infinite series

Hence, without the leading 1, the sum must be 1. This allows us to conclude that R < 1. However, since all terms in R involve the products and quotients of positive integers, it must follow that R is positive. However, there is no integer between 0 and 1, which means that R cannot be an integer. This means that the terms in red in the equation

do not give an integral sum, which is something that was required for the equation to hold. Since we reached this requirement when we assumed that e is rational, we have reached something absurd, namely that this is possible only if we can find an integer between 0 and 1. This concludes the proof by reductio ad absurdum.

Reductio ad Absurdum using Partial Fractions

I would like to contribute another proof using reductio ad absurdum that I have not seen anywhere else. This uses the concepts surrounding partial fractions. While partial fractions is normally used in the context of algebra, I’d like to explore its implications in the context of arithmetic.

Now, it is clear that

What sets apart the expression in green from the expressions in red is that the one in green only has prime numbers or powers of prime numbers in the denominator. This is not true in the case of the expressions in red, since 6 and 12 do not satisfy the criterion.

So suppose we restrict the splitting of a given fraction into its arithmetic partial fractions where the denominators consist only of prime numbers or the powers of prime numbers. Suppose then that we have a number N such that

where

are distinct prime numbers and

Now any divisor of N can have a particular prime p_k appear from 0 to a_k times. For example, the divisors of 12 are 1, 2, 3, 4, 6, and 12. The prime factorization of 12 is

Hence, the prime factorization of any divisor of 12 can have 2⁰, as in 1 and 3, or 2¹, as in 2 and 6, or 2², as in 4 and 12. Or it could have 3⁰ as in 1, 2, and 4, or 3¹, as in 3, 6, and 12. In other words, any prime p_k, which appears as a power a_k in the prime factorization of N, can appear as a power from 0 to a_k in divisors of N. Hence, there are a_k ways in which p_k can appear in divisors of N, excluding 1. Hence, the number of divisors of N that are prime numbers or powers of prime numbers is

This means that, if we are able to write

then splitting it into its arithmetic partial fractions would involve at most P terms. However, P is a finite natural number.

Yet, from the expression

it is clear that e is expressed as the sum of an infinite series. But in the denominators we have all the natural numbers without exception. And we know that there are infinitely many primes in the set of natural numbers. This means that the infinite series for e includes terms that involve infinitely many prime numbers. For example, the term that contains p_k! in the denominator, when split into its arithmetic partial fractions, will contain a finite number of terms that involve all prime numbers and powers of prime numbers that are less than p_k.

However, since there is no limit to p_k, there is no way to combine all the terms into a finite series, such as is required by the arithmetic partial fraction expansion of

Now, we know that

The inherent pattern involved in the denominators allows us a concise way of combining the infinite terms to give the resulting sum as 2. In general, if we have an infinite set of fractions where the denominators form a discernible pattern, we might be able to combine the terms to give us a determined sum. If the primes themselves appear after some discernible pattern, then too we would possibly be able to combine the infinite terms to yield a determined sum. Since the primes do not appear in any discernible pattern, such a combination is impossible, even in theory.

In other words, we have reached a contradiction, where infinitely many fractions with distinct denominators involving infinitely many unpatterned primes and their powers are combined to produce the sum of a finite number of fractions, therefore yielding a rational number. So once again, by reductio ad absurdum the result is proved and we conclude that e is irrational.

Wrapping Up

We have devoted four posts to the study of the number e. The impetus for this study came from the opening post of this blog in which I introduced Euler’s Identity, stating that it was an example of beauty in mathematics. During the course of these four posts, we have looked at the definition of e. We also saw how e is related to the ideas of compound interest and, therefore, to the ideas of growth and decay that occur in natural systems. We were able, then, to see why e is the ‘natural’ base for logarithms and exponential functions.

Then we obtained a lower and an upper bound for the value of e. Along the way we introduced some key ideas, notable among which were the limiting process, infinite geometric sequences, and the binomial expansion. This allowed us to obtain an infinite series for e.

Following this we explored three infinite series for e to determine the speed at which they converge to the value of e. We saw that more advanced mathematics does not guarantee speedier results. This allowed us to conclude that we need to be wary when claims are made that something is based on more rigorous mathematics.

Finally, we proved that e is irrational. We did this in two ways, both involving reductio ad absurdum, a mathematical strategy for proofs that we had introduced earlier. I introduced a proof that I have not seen elsewhere, though this may just be an indication of my ignorance rather than my ingenuity.

This does not exhaust the study of e. By no means! But this does conclude our exploration at this stage. It has been fruitful and insightful for me. I hope you can say the same.
Infinitely Expressed

March 6, 2026

As mentioned earlier, I am taking a break from publishing new material for a few weeks. During these weeks I am republishing the series on e and the one on π. The post below was originally published on 7 June 2024.

Recapitulation

I’m back with another post after taking a break for a week. The last week of May was particularly busy as I was teaching a class called Introduction to the New Testament. During the week I also gave a lecture on Causes and Symptoms of Religious Discord, which you can find here if you’re interested. Anyway, back to Acutely Obtuse, we are in the middle of a four part series on the number e. In the last but one post, I started this series. We then saw why e is the base of the natural logarithm and the base of the exponential function. We also saw the relation between e and compound interest. In the previous post, we determined that e lies between 2 and 3. In the next post we will show that e is irrational. Given that e is irrational, it follows that it cannot be expressed as a ratio of two integers. This also means that it cannot be expressed as the sum of a finite number of rational numbers because the sum of rational numbers is necessarily rational.

Introduction to Infinite Series

Of course, this leaves open the possibility that e could be represented as the sum of an infinite series, that is a series that does not have a finite number of terms. In the previous post we looked at the limiting process and concluded that

Using sigma notation, we can write

This means that e can indeed be expressed as the sum of an infinite series. However, the above series is not the only one that has been shown to converge to the value of e. In this post, I wish to consider the above series and two others that have been derived for determining the value of e. However, while we just about managed to derive the first series without the use of any calculus, most of the series that have been derived require the use of calculus. Since I do not wish to introduce any advanced calculus at this stage in the blog, I will be considering one series that is derived from the first one itself and accepting, but not deriving, a second that has been derived using some advanced calculus.

But first we must ask ourselves why additional series are needed. If we already have a workable series, why should we bother to obtain more series? Mathematicians, after all, tend to be quite frugal, rarely doing more than is required. So why would they bother with more series representations for e when they already have one that one can use to obtain the value of e to any level of accuracy that might be needed?

Rationale for Other Infinite Series

To answer this question, we need to look at the values obtained by using the original series. Here are the first twenty values we obtain from the original series.

The red digits indicate where the current approximation for e begins to differ from the previous approximation. As can be seen, this series gives on average one additional digit of e for each additional term. Of course, for most purposes 10 or 12 digits of e is more than sufficient. In the table above, we have determined the value of e accurately to 18 digits. Why then would we need to determine more digits?

Computations of the digits of irrational numbers like π and e are used to measure the computational power of supercomputers. As we include more terms of an infinite series, two things must be done. First, the existing approximation of e must be kept in memory while the latest term is calculated. Second, the existing approximation and the latest term need to be added to obtain the new approximation. However, as we go down the terms in the series, each term becomes decreasingly small, requiring greater storage of memory. Since pushing things to and pulling things from memory are time intensive processes, at least from the perspective of a supercomputer, each new approximation tests the limits of the computational system to a greater extent.

However, once we have obtained the first (say) 18 digits of e, as we have above, it is pointless to use the same process to obtain the same results. Mathematicians hate repeating things because they know that they aren’t going to get new results. Hence, using the same series to obtain 1 billion digits from a set of computers to see which one is the fastest would be fine once. After that no new information is being generated.

However, mathematicians think, “If we are going to use infinite series to test the power of computational systems, why not generate series that give more digits per term?” In other words, if we can generate a new series that gives 2 additional digits per term, then with the same amount of time and same number of terms, we can generate 2 billion digits instead of 1 billion. And if someone can produce a series that gives 10 digits per term, then for the same price we can generate 10 billion digits. For people involved in the computational side of things, it does not matter which series is used. Any series can be used to arbitrate between competing computational systems. However, for mathematicians, the goal is to obtain new information. Hence, any series that promises more digits per term is to be valued for the additional information it can provide.

So mathematicians resort to all sorts of manipulations to produce new series that could provide more digits per term. The two that I have selected do precisely that.

Combining Terms

The approach for the second series recognizes that we can pair up terms and not affect the sum. Now, in general, consecutive terms can be written as

This term simplifies to

However, since we have grouped the terms, n cannot now take all the values earlier specified or we will overcount. We can overcome this by replacing n with 2k to get

Hence, the original sum can be expressed as

Substituting values for k we get

We can see that each term is quite simple and that there is an easily recognizable pattern, which makes this easy to program. If we use this infinite series, we will get the following table:

Of course, as we should have expected, since we combined two terms of the original series to give a single term of the new series, the new series approaches the value of e twice as quickly as the original series. This means that we get, on average, 2 additional digits of e for each term of the new series. While this is laudable, as mentioned earlier, once this new series has been used, it will fail to produce any new information after a single use. This means that mathematicians will look for newer series to put into play. We could, as the approach to the second series suggests, combine more terms. For example, if we combine three consecutive terms we get

Replacing n+2 with 3k we get

This series obviously will converge to the value of e three times as fast as the original series. However, we can see that this process might become quite unwieldy. Finding the LCM of multiple terms is not a difficult matter. However, expressing them in a compact form that is conducive to programming is not necessarily a given. Already, while combining only 3 terms, we actually have quite a bit of deft algebraic manipulations to undertake. And remember, once a series is used, it pretty much becomes obsolete! Hence, we must discover new ways with which to express e.

Using Advanced Calculus

The third series I wish to consider is derived using the lower and upper incomplete gamma functions and the Taylor series. While the mathematics involved is certainly above the level of the blog at this stage, the end result is

If we substitute the values of k, we will get

Once again, each term is quite simple and we can easily recognize the pattern. So this too would be a good series to use for programming. But what kind of approximations of e does this series yield? The table below is instructive.

What we can see is that the third series converges slightly quicker than the original series but considerably slower than the second series we considered.

Rejecting the Snake Oil

Over the years, mathematicians have derived many infinite series for e using all sorts of techniques. They have also used continued fractions, infinite products, and recursive functions. All these techniques are needed precisely because e is an irrational number, which we will discuss in the next post. However, as we have seen, there is little real world advantage to be gained from deriving more expressions for e. The requirement for new series comes from the realm of computation, where the computing power of a computational engine can be measured by having it perform processor intensive calculations such as are needed by these expressions for e. Mathematicians use this requirement to get the computational engines to churn out more digits of famous irrational numbers like e, π, and φ.

A definition of e in terms of the rectangular hyperbola xy = 1 (Source: Wikipedia)

However, what we have seen in this post is something that we are rarely told about. In fact, I think what we have discovered is intentionally kept from us because it would reveal something we dare not admit. Now, hopefully, the mathematics involved in obtaining the original series, which was discussed in the previous post, was not too onerous. Granted that it was heavier than most other mathematical concepts I have dealt with so far in the blog, I believe it was not too difficult for most readers. The second series we considered was derived by combining the terms of the original series as pairs. Hence, this too did not require much advanced mathematics. However, the third series did require quite a bit of advanced mathematics and I avoided explaining it in this post. Despite this, the resultant series does not converge as quickly as the second series, which was obtained using much simpler mathematics.

What this tells us is that the level of mathematics involved in deriving an infinite series for e, or for that matter any irrational number, is no indicator of how quickly the series will converge to the desired value. This runs contrary to the intuition many people have about mathematics, namely that we need increasingly more complicated mathematics for more complicated problems. This is not the case and there are many instances when relatively simple mathematics can be put to use to solve extremely complicated problems. In my opinion, this is precisely what undergirds mathematical theorems like Gödel’s Incompleteness Theorems and some of the Millennium Prize Problems such as – P vs NP and the Riemann Hypothesis. It is also what lies behind the deceptively simple, but heretofore unsolved, Collatz Conjecture.

In these days of increased dependence on Machine Learning, it pays to step back and understand that mathematics itself does not give us any guarantees or advice about what methods would work best for a given problem. Until we develop machines that are actually able to replicate human thought, any idea that more advanced computing capabilities would yield lasting solutions is, in my view, a pipedream. Unfortunately, too many of us are mathematically ill equipped to recognize when we are being sold snake oil in mathematical garb!
Naturally Bounded?

February 27, 2026

As mentioned earlier, I am taking a break from publishing new material for a few weeks. During these weeks I am republishing the series on e and the one on π. The post below was originally published on 24 May 2024.

Recap

In the previous post, I started a series of three posts focused on the number denoted by e. We then saw why it is the base of the natural logarithm and the base of the exponential function. We also saw the relation between e and compound interest. In this post we will try to determine the lower and upper bounds for e.

In order to do this, we need to do seven things. First, we will then ‘discretize’ the function

for integers n. Second, we will need to introduce ourselves to the limiting process. Here, we will only restrict ourselves to the cases where x becomes infinitely large. Third, we will introduce ourselves to the binomial expansion and obtain the lower bound for e. Fourth, we will show that the function

is an increasing function. This means that the value of f(n) increases as the value of n increases. Fifth, we will combine the limiting process and the binomial expansion to obtain a very common infinite series for f(x). Sixth, we will introduce ourselves to the idea of an infinite geometric series. Seventh, we will use what we know from the infinite geometric series and the infinite series for f(x) to obtain the upper bound for e.

Discretization

In order to proceed with what I have called ‘discretization’, consider the function

It is possible to prove that this function is defined for all positive values of x. We may not be able to calculate the value by hand. And we may not have a clue what

might even mean, let alone how to calculate it. Nevertheless, if f(x) is defined for all positive values of x it must be defined for all positive integer values of x. In this f(x) is transformed to f(n), where

This is something that we can comprehend because exponentiation to a positive integer, in this case n, only signifies repeated multiplication. So now, we have to only deal with f(n).

But now we have to show that f(n) is an increasing function. In more formal mathematical terminology, we have to show that f(n) is monotonically increasing. It is easy to see from the tables in the previous post that this is true. Below is one of the tables from that post.

Since the values of x chosen were all integers, the same table would apply for n, f(n), and Δf(n). We can see that, as n increases, f(n) also increases. But this is not how we prove things in mathematics! So is there another way to prove that f(n) is increasing? Yes there is!

The Limiting Process

But before we get to that, let us introduce some aspects of the limiting process. Consider the function

We can easily see that, as n gets larger and larger, the value of g(n) gets closer and closer to 0. This is seen in the table below.

What we can say is that, as n gets infinitely large, g(n) gets infinitesimally closer to 0. Or to be rigorous, we say that the limit of g(n) as n tends to infinity is 0. Please note what this language is claiming and what it isn’t. First, it is not claiming that infinity is a number. God forbid! I have dedicated a whole post to ridding ourselves of that mathematical heresy. Second, it is not claiming that the value of g(n) ever becomes zero. It is impossible for the reciprocal of any number to equal zero. If this were not true, we could argue as follows.

But, as we know, division by zero is meaningless or ‘absurd’. Since this assumption leads to an absurdity, by the reasoning of reductio ad absurdum, we can conclude that the original assumption, namely that the reciprocal of some number can be equal to zero, is incorrect.

The way we denote the limiting process is by writing

The left side of the equation tells us what the variable of the limit is, in this case n, and how it is being made to vary, in this case getting infinitely large. It also tells us what function of the variable we are dealing with, in this case 1/n. The right side of the equation tells us the number that the value of this function approaches, in this case 0, as the variable (n) varies as specified.

Due to the limiting process, we can draw the following conclusions.

The table below can confirm our conclusions.

So what we have managed to prove, albeit not with great rigor, is that the ratio of an integer to its predecessor or to its successor approaches 1 as the integer gets larger and larger.

The Binomial Expansion and the Lower Bound for e

Now it is time for us to introduce another important result that we will be using. This is known as the binomial expansion. As seen in an earlier post, the number of ways of selecting r items out of n items is

Now consider the expression

This is called a binomial expression since there are 2 terms in the expression. Now consider the expression

Quite obviously

Here, on the right side of the equation, there are n sets of parentheses, reflecting the power n on the left side of the equation. Now we have to ask ourselves how we expand the right side. When we expand, we have to select one of the terms, either a or b, from each of the binomial terms. We could select a from all the binomial terms, leading to aⁿ. Or we could select b from all the binomial terms, leading to bⁿ.

In general, if we select b from r of the binomial terms, this would mean we have selected a from the remaining n-r binomial terms, leading to a^n-rb^r. In how many ways can we form the a^n-rb^r term? This will be the same number of ways as selecting r out of n items, since, when we select r of the binomial terms to give us b, we will be auto-selecting the remaining n-r binomial terms to give us a. Hence, the binomial expansion gives us

Hence, we can conclude that

With special note are the first two terms. The first term is

Similarly, the second term is

Hence, the expansion can be expressed as

Now, all the terms that are in red are the product and quotients of natural numbers. Hence, each term is necessarily positive. This allows us to conclude that

Hence, we have obtained the lower bound for e. All we need now is to obtain the upper bound. This is somewhat more difficult. We first need to demonstrate that f(n) is an increasing function.

f(n) is an Increasing Function

Let us consider the two consecutive terms f(n) and f(n+1). We have

Taking LCM inside the parentheses, we get

Dividing the second equation by the first we get

This can be further modified to give

Choosing a new index m = n+1, the above gets transformed to

This can further be written as

Now consider consecutive terms in the expansion of the term in red above.

Dividing the second equation by the first we get

It is clear that the two terms in red will cancel. At the same time, the green terms will leave r+1 in the denominator, while the blue terms will leave m-r-1 in the numerator. Hence, the above simplifies to

Redistributing the terms and taking absolute value so we do not have to deal with a negative quantity, this is equivalent to

Since r takes values from 0 to m-1 and since m must be necessarily greater than or equal to 2 (remember m=n+1), each of the three terms above is strictly less than 1. Hence, in the expansion

each term has a smaller magnitude than the preceding term. If we write out the expansion we get

Now we have shown that the 3^rd term (in red) has a greater magnitude than the 4^th term (in green). Hence, the sum of the 3^rd and 4^th terms must be positive. Similarly for each subsequence pair of terms in the expansion. If m-1 is even, there will be an odd number of terms in the expansion, with the final term being positive. If m-1 is odd, there will be an even number of terms with the final pair of terms also yielding a positive sum. Hence, we can conclude that

However, recall that

This means that

This means that f(m) is increasing, which also means that f(n) is increasing.

Whew! That took some doing, huh?

Combining the Limit Process and the Binomial Expansion

What we can now say is that f(n) will keep getting larger as n gets larger. Recall that we had shown

This means that

if both limits exist. Now since f(n) keeps increasing, all we need to show is that there is some upper bound beyond which the expansion cannot go. That would also place an upper bound on the value of f(n).

Introducing an Infinite Geometric Series

Consider the infinite series

Here each term is multiplied by ½ to get the next term. If we multiply the whole equation by 2 we get

However, note that the terms in red are the same series with which we started. This gives us

Obtaining the Upper Bound for e

Visualization of the convergence of f(n) (Source: Algebrica)

Now once again consider the expansion for f(n), which is

Now the (r+1)^th term is given by

Note that this happens to be the r^th red term above and it can be modified as follows

Rearranging the terms we get

Some more rearrangement gives

Here, r varies between 0 and n. Hence, all the terms in red are necessarily positive and less than 1. However, in the limiting case, as n approaches infinity, the limiting value of each of the red terms is 1. This gives us

Writing the infinite series this will give us

For the fourth term onward, r is greater than or equal to 3. However, it is easy to see that for r > 2

As examples

Hence, we can conclude that

However, we have already shown that the terms in red add up to 2.

Hence, using the earlier lower bound result, we conclude

Conclusion

We have shown that the value of e lies between 2 and 3. Along the way we used some mathematics that we had explored earlier, like reductio ad absurdum and the method of determining the number of ways of selecting r out of n items. But we also introduced new mathematics, like the limiting process, the binomial expansion, and the infinite geometric series. The crucial step was to show that f(n) is increasing. If this were not true then showing that it has a lower and upper bound would not indicate that there is a specific value to which f(n) approaches, since it could then just oscillate between slightly above 2 and slightly below 3. The process to demonstrate this was considerably involved and also included the idea of showing that successive pairs of a sequence added to give a positive sum. This was not necessarily the most intuitive step in the process, even though, without it, we would not have been able to prove the result.

In the previous post, we introduced e and touched on what its significance is and why it is the base of the ‘natural’ logarithmic and exponential functions. In this post we have placed bounds on the value of e. Since this post end up being quite long and, I must admit, heavy, I am going to do a part of what I planned for this post in the next, where we will look at some common infinite series that have been derived for e and consider how rapidly each of these series converges to the actual value of e. In the post after that, I will deal with how we know that e is irrational. I will, however, be taking a break next week. Hence, the next post on infinite series for e will be published on Friday, 7 June 2024.
What’s Natural About e?

February 20, 2026
A visualization of e (Source: Laughing Squid)

While I am enjoying the series on counting principles, I need to take a break. Hence, I will be republishing the series on e and the one on π, both which I had first published in 2024. The post below was originally published on 17 May 2024.

In the opening post of this blog, I had introduced Euler’s Identity, which states

The identity combines five numbers – 0, 1, e, i, and π – and three mathematical operators – addition, multiplication, and exponentiations – and the equality. In other words, this identity captures many diverse parts of mathematics and links them, thereby demonstrating that what we call ‘mathematics’ is a unified field in which one area neatly dovetails into the next. For a few more links I suggest you read the earlier post.

In this post, however, I wish to focus on the number e. I will be devoting three posts to it, including this one. If this now seems excessive, I hope that, after you have read the three posts, you will have had a change of heart and mind. Indeed, my hope is that you would wish for a fourth. And a fifth! I could, of course, include it all in one post. However, I have realized that the last few posts have been considerably longer than I had planned for this blog. Granted that each post did deal with a unified theme, the fact still remains that they were quite long. Hence, in the interest of not squelching all the curiosity of the reader, I feel it is best, where possible, to publish shorter posts.

In this post I wish to deal with the definition of e and its relation to a concept of mathematics that most students learn in the 9^th or 10^th grades. I also wish to address the significance of e that arises from the definition.

The second post on e will deal with some common bounds we can place on its value. The first post will have given us some indication of these bounds. However, in the second post I will take a more formal approach to this. This will involve looking at a few infinite series that mathematicians have derived as ways of calculating the value of e. The third post of e will deal with the issue of e being an irrational number. Along the way, in both future posts, we will learn a few more mathematical tricks to keep in our quiver should we ever need them.

When I was introduced to e, my professor at the Guru Nanak Khalsa College in Bombay (now Mumbai) just told us that it was the base of the natural logarithm (log_ex) and the base of the exponential function (e^x). I asked him a few questions like:
1. Why was it this number and not some other number that was the base of both the logarithm and exponential functions?
2. What was the significance of the number e?
Unfortunately, all my professor could tell me was that the approximate value of e was 2.718. When I pressed him for more information, he summarily asked me to leave his class. Perhaps my mother will now understand why I hated attending classes there. I mean, if even the mathematics class was going to be transformed into one mind-numbing exercise of rote learning, the other subjects didn’t have a prayer!

I have dealt with one mathematical issue that causes me trauma elsewhere. The trauma that this professor caused me remains to this day and surfaces when I hear students confidently tell me that the value of e is 2.718281828. (Yes, they can use their calculators now to get more digits than my professor had memorized!) When I hear something like that I have a strong urge to tug at my hair, which, fortunately, is somewhat difficult for me!

Anyway, let me proceed with the definition of e and then hopefully address the two questions above.

Consider the function

Students who have learned about compound interest will recognize the similarity the above expression has to the formula for compound interest given by

where P is the principal invested, R is the interest rate as a percentage per compounding cycle, N is the number of compounding cycles, and A is the amount upon maturation of the investment. If we divided both sides of the equation by P and express the interest rate as a number rather than a percentage, the formula gets transformed to

where G is the ‘growth’, that is the ratio of the maturation amount to the principal invested, r is the interest rate per compounding cycle, and n is the number of compounding cycles.

Before proceeding, let’s consider an example so we understand how the formula works. Suppose we invest ₹1,000 at 10% interest per annum compounded annually for 3 years. Then, P = 1000, r = 0.1 (corresponding to R = 10%), and n = 3. Hence,

This gives A = ₹1331 or G = 1.331.

With the same numbers, but assuming that interest is compounded every 6 months, the value of R and r will be halved and the value of N and n will get doubled. This is because 10% per annum is the same as 5% semiannually. And in 3 years, there are actually 6 periods of 6 months each. Hence, R = 5%, r = 0.05, N = n = 6. This gives

Hence, A = ₹1340.10 and G = 1.340096. [Note: I have rounded A to 2 decimal places as is the convention for currency.]

Suppose now, that the interest rate is 100%. Then R = 100% and r = 1. Now, if we invest some amount for, say, 3 years, we will get:

But suppose, we invest only for 1 year. Then we will have

Suppose, now we keep reducing the duration of the compounding cycles. If we have 2 compounding cycles in a year, each lasting 6 months, we will have

If we change this to compounding every 4 months, we will have 3 compounding cycles, giving us

We can, of course, continue increasing the number of compounding cycles.

For the sake of the discussion, I will rename G as f(x) and n as x, yielding the following table:

The third column gives the change in the value of f(x) from the previous row. What we can observe is that the values of f(x) keep increasing from one row to the next. Also, the value of Δf(x) keeps decreasing from one row to the next. In fact, if we plotted the graph of the function, shown below, this is what we would expect.

Graph of y = f(x)

Since the graph becomes almost horizontal, it seems that the rate at which the function increases its value keeps decreasing. This is indeed the case as can be seen from the table below.

What we can see here is that x is increasing by orders of magnitude, while the corresponding values of Δf(x) keep getting smaller and smaller, while remaining positive.

Now there are 31,536,000 seconds in a year. If we put this as the value of x we will get f(x) = 2.71828177847, which represents an increase of 0.00001354128 from the value when x = 100,000.

At this stage, let us take a short detour. Suppose we have a sample of bacteria in a petri dish with enough nutrients for the bacteria to grow and undergo mitosis unhindered. Assuming no mutations occur, there will be no way of distinguishing any particular bacterium from another. All the bacteria in the sample are, in other words, identical. All are consuming nutrients and all will reach the next stage of mitosis simultaneously. Hence, 100% of the sample has the potential to undergo mitosis. But how frequently does mitosis occur?

Some bacteria need about 24 hours of feeding on the nutrients before they undergo mitosis. So here we have a doubling every day. But suppose we once again restricted ourselves to 1 day but somehow sped up the process of mitosis. What would happen? The tables above tell us exactly what would happen. If we have 100,000 cycles of mitosis in our day with only 1/100,000 of the sample undergoing mitosis each time, we will end up having 2.71826823719… times the number of bacteria with which we started.

We can see that, as the number of cycles increases indefinitely, with the fraction of bacteria undergoing mitosis each time correspondingly decreasing, the growth will be given by the limiting value of the function f(x) as x gets infinitely large.

Coming back to the issue of compound interest, every unit of currency we invest is identical to every other. Since we proposed a 100% interest rate, every currency unit is subject to growth at all times. However, if we reduce the compounding period indefinitely and correspondingly decrease the fraction of the currency units that actually multiply, at the end of the year we will have a growth equal to the same limiting value of f(x).

Now currency is an artificial human construct. However, bacteria belong to the natural world. Many other things grow in the natural world. Similarly, there are things that decay, like radioactive nuclei. All these natural phenomena are, like our sample of bacteria or the invested money, continuously growing. Continuous growth, subject to sufficiently large environments and resources to expand into, and continuous decay, subject to sufficiently large numbers of species to undergo decay, are ubiquitous natural phenomena.

The limiting value I have referred to is the number denoted by e. And we can see that we have answered both the questions I had posed. The significance of e is that it represents the limiting value of growth (i.e. a multiplicand) or decay (i.e. a divisor) when the growth or decay is continuous. And it is the base of the natural logarithm, which shows up when we know the final population and need to solve for time, and the base of the exponential function, which shows up when we need to solve for the population after a period of growth, because it represents the behavior of all natural systems.

Now, was that too hard for my professor to tell me? I do not think so. But, unfortunately, I have to face the dismal possibility that he had no clue about any of this, having resigned himself to learning by rote rather than learning by inquisitiveness.
Factional Factors

February 13, 2026

The Fundamental Theorem of Arithmetic

(Source: Geeks for Geeks)

We started a series on counting principles earlier this year. We first considered the most basic principles, namely the multiplication and addition principles. Then we looked at how permutations differ from combinations, considering some basic patterns of both including the elements of Pascal’s triangle. Next, we turned our attention to permutations of special kinds, including circular permutations and permutations of non-distinct objects. Then last week we considered combinatorial arguments and binomial identities. We are now poised to consider some ideas related to factors of numbers. I had thought I would be able to include the technique of partitions also in this post. However, I will tackle that in the next post.

In an earlier post we had looked at the ideas related to factors of numbers. But that was many months ago. So let us start here as though we are addressing these issues for the first time. We begin with the observation that every natural number, except 1, is either a prime number or a composite number. As a reminder, a prime number is one whose factors are only 1 and itself. A number that has a factor other than 1 and itself is said to be a composite number. While it would seem that 1 satisfies the condition for being a prime number, 1 is not considered to be prime. Why is this the case?

The fundamental theorem of arithmetic states that every natural number greater than 1 is either a prime number or can be expressed as a unique product of prime numbers, disregarding the order of the primes. We must admit that this theorem, like many in mathematics, is a kind of after-thought. It was introduced specifically to exclude 1 from the set of primes. And this is because including 1 in the set of primes creates more problems than it solves. This is because 1 is the multiplicative identity element. That is, the product of any number and 1 is the number itself. This means that, if we allow 1 to be defined as a prime number, we could write

Hence, any idea of a unique prime factorization would go into the gutter! However, nothing is gained by defining 1 to be a prime number. Hence, we exclude 1 from the set of primes. What this means is that every natural number greater than 1 can be written with a unique prime factorization, disregarding the order of the primes. For example,

Since multiplication is commutative, the order in which we multiply the prime numbers does not matter.

Number of Factors of N

Let us consider, for example, the number 450. We can determine the factors by using a method like that indicated in the diagram at the start of this post. However, for large numbers this may be an onerous task. However, we can resort to prime factorization and write 450 as

We can see that there are 5 prime factors with two of them being repeated twice. Hence, from what we have learned earlier, there are

ways of permuting these prime factors. However, all of them give the product of 450. Hence, if we disregard the order in which we multiply the prime factors, the prime factorization of 450 is unique and can be written as

Let us consider the factors of 450. They are 1, 2, 3, 5, 6, 9, 10, 15, 18, 25, 30, 45, 50, 75, 90, 150, 225, and 450. Hence, 450 has 18 distinct factors. Some of them are odd, like 1, 3, 5, 9, 15, 25, 45, 75, and 225. The others are even, like 2, 6, 10, 18, 30, 50, 90, 150, and 450. We can see that there are 9 factors that are odd and 9 that are even. That is curious because 9 + 9 = 18. Perhaps there is something here.

Let us consider the other prime factors. There are factors that are not divisible by 3, such as 1, 2, 5, 10, 25, and 50. There are factors that are divisible by 3 but not divisible by 9, such as 3, 6, 15, 30, 75, and 150. And there are factors that are divisible by 9, such as 9, 18, 45, 90, 225, and 450. We can see that there are 6 of each such factors and 6 + 6 + 6 = 18. In a similar way, the factors not divisible by 5 are 1, 2, 3, 6, 9, and 18. The factors divisible by 5 but not by 25 are 5, 10, 15, 30, 45, and 90. And the factors divisible by 25 are 25, 50, 75, 150, 225, and 450. Once again, there are 6 of each and 6 + 6 + 6 = 18. Surely there is some significance to this. Perhaps we can get a pattern by considering a number with fewer distinct factors.

Let us consider the number 18. It has 6 distinct factors, namely 1, 2, 3, 6, 9, and 18. Of these, 3 are odd, namely 1, 3, and 9, and 3 are even, namely 2, 6 and 18. Also 2 are not divisible by 3, namely 1 and 2, 2 are divisible by 3, but not by 9, namely 3 and 6, and 2 are divisible by 9, namely 9 and 18. We will now invoke the fact that 1 is the multiplicative identity element and that, for any natural number n, n⁰ = 1. Since the prime factorization of 18 is 2¹3², any factor of 6 can have as its factors 2⁰, giving an odd number, or 2¹, giving an even number. In a similar way, any factor of 18 can have as its factor 3⁰, giving a number not divisible by 3, or 3¹, giving a number divisible by 3, but not by 9, or 3², giving a number divisible by 9. In other words, there. So, since we had 2¹ in the prime factorization of 18, there are 1 + 1 = 2 ways of including 2 in the factor, either exclude it, giving an odd number, or include it, giving an even number. And since we had 3² in the prime factorization of 19, there are 2 + 1 = 3 ways of including 3 in the factor, exclude it completely, giving a number not divisible by 3, include only one 3, giving a number divisible by 3 but not by 9, or include both 3s, giving a number divisible by 9.

Let us see if we can generalize this result. Suppose we have the number N. Suppose its prime factorizations is

Then, the number of distinct factors of N will be

Let us put this to the test. For 18 = 2¹3², p₁ = 2, p₂ = 3, a₁ = 1, and a₂ = 2. According to the result, the number of factors should be (1 + 1)(2 + 1) = 6, which is correct. For 450 = 2¹3²5², p₁ = 2, p₂ = 3, p₃ = 5, a₁ = 1, a₂ = 2, and a₃ = 2. According to the result, the number of factors should be (1 + 1)(2 + 1)(2 + 1) = 18, which is correct.

Sum of Factors of N

Now, let us try to find the sum of the factors of a natural number. Consider the number 18. The sum of its factors will be 1 + 2 + 3 + 6 + 9 + 18 = 39. For the number 450, the sum of the factors is 1 + 2 + 3 + 5 + 6 + 9 + 10 + 15 + 18 + 25 + 30 + 45 + 50 + 75 + 90 + 150 + 225 + 450 = 1209. If we attempt to factorize 39 we get 39 = 3 × 13. Similarly 3 × 13 × 31. The repetition of the 3 and 13 seems interesting.

Let us consider another number, say 6, which has a prime factorization of 2¹3¹. The factors are 1, 2, 3 and 6, which add up to 12, which in turn can be written as 3 × 4. Similarly, the factors of 15, which has a prime factorization of 3¹5¹, are 1, 3, 5, and 15, which add up to 24, which is 4 × 6. Also, the factors of 30, which has a prime factorization of 2¹3¹5¹, are 1, 2, 3, 5, 6, 10, 15, and 30, which add up to 72, which is 3 × 4 × 6. We can see that, when 2¹ appears in the prime factorization, the sum of the factors has 3 as a factor. Similarly, when 3¹ appears in the prime factorization, the sum of the factors has 4 as a factor. And when 5¹ appears in the prime factorization, the sum of the factors has 6 as a factor.

Now let us consider some other examples. These are summarized in the table below.

The results are also color coded. 2¹ links with 3, 2² links with 7, 3¹ links with 4, 3² links with 13, 5¹ links with 6, 5² links with 31, and 7¹ links with 8. Of course, if you are familiar with series of powers of natural numbers, you will recognize that

Hence, it seem as though, for a number like, say 42, the sum of the factors can be written as

We can see how this makes sense. The final expression involving a product of three terms can be expanded as follows:

We can, therefore, reach a generalization as follows for any natural number, N. Suppose the prime factorization of N is

Then the sum of the factors of N will be

Consider the number 12,600 = 2³3²5²7¹. According to the result we have just obtained, the sum of all the factors should be (1 + 2 + 3 + 4 + 5)(1 + 3 + 9)(1 + 5 + 25)(1 + 7) =48,360. This is confirmed here. Of course, we could use the previous result to determine that 12,600 has (3 + 1)(2 + 1)(2 + 1)(1 + 1) = 72 factors.

Using the two results we have obtained we can determine how many factors a number has and the sum of those factors without determining what those factors are. So, for example, if we are given the number 174,636,000 = 2⁵3⁴5³7²11¹, we can determine that it has (5 + 1)(4 + 1)(3 + 1)(2 + 1)(1 + 1) = 720 factors, the sum of which is (1 + 2 + 4 + 8 + 16 + 32)(1 + 3 + 9 + 27 + 81)(1 + 5 + 25 + 125)(1 + 7 + 49)(1 + 11) = 813,404,592. We can observe that the terms in each set of parentheses form a geometric sequence with first term 1, which can easily be summed using the formula

In other words, even with extremely large numbers, we can obtain the number of factors and the sum of factors with a few lines of code in a computer.

Looking Ahead

What we have seen in this post is a way of determining the number of factors of a number and the sum of its factors without having to find all the factors. We do need to determine the prime factorization of the number. However, this is a straightforward process. Now, we used the fundamental theorem of arithmetic to help us determine both the number of factors and the sum of the factors. While this may not be a theorem that is explicitly named when we are in school, it is something that we use regularly after learning division and factorization. Hence, we used some relatively simple mathematics that we learned in elementary school to unravel the question of the number and sum of factors. While some of the formalism and symbols used might be beyond what a student in elementary can understand, the arithmetic behind it is something that elementary students can understand. This is especially so if we demonstrate how, while expanding the expression for the sum of factors, we need to choose one term from each set of parentheses without repetition.

In the next post, we will tackle the technique of partitioning. In order to set up the post, I will leave you with a question. Given that x, y, and z are natural numbers, how many solutions exist to the equation x + y + z = 100?
The World of Sagacious Selections

February 6, 2026

Recapitulation

(Source: i-Scoop)

In this post we continue with the series on Counting Principles. Whereas we devoted the previous post to permutations of special kinds, we will turn our attention today to the matter of combinations. In Arranging and Choosing we saw that the binomial coefficients are related to the elements of Pascal’s triangle. We saw that the process of expanding an expression like (a + b)ⁿ involves multiplying a + b by itself n times. This involves choosing either a or b from each factor. We were, therefore, able to show that the (r + 1)^th element of the n^th row of Pascal’s triangle is the sum of the r^th element of the (n – 1)^th row and the (r + 1)^th element of the (n – 1)^th row. In symbolic notation,

While we proved this result with algebra, I claimed that the algebraic method, involving pure brute force of algebraic manipulation, did not yield any particular insight. However, we proved the above result using

and arguing from how a particular element on the left side is obtained by adding two terms on the right side. This sort of argumentation does not use algebra. However, the argument in Arranging and Choosing is not technically what is called a ‘combinatorial argument’. So, what is a combinatorial argument? Let us learn with a few examples.

Combinatorial Arguments

Example 1

We begin with the same result we proved earlier. Suppose we have n distinct items and we wish to choose r out of them. This can obviously be done in ⁿC_r ways. Now, let us focus on one particular item (say A) out of the n items. The group of r items that are selected either contains A or does not contain A. If the selection contains A, then we have to choose r – 1 more items from the remaining n – 1 items, which can be done in ^{n – 1}C_{r – 1} ways. If the selection does not contain A, then we still have to choose r items from the remaining n – 1 items, which can be done in ^{n – 1}C_r ways. Since the selections that contain A and do not contain A are mutually exclusive (meaning they have no common members) and exhaustive (meaning that there are no other possibilities), it follows that

As an example, suppose we have to choose 6 items out of 20. This can be done in ²⁰C₆ = 38,760 ways. Now, if one of the items is A, then the selection either contains A or does not contain A. If it contains A, then we have to select 5 more items from the remaining 19 items, which can be done in ¹⁹C₅ = 11,628 ways. If it does not contain A, then we have to select 6 items from the remaining 19 items, which can be done in ¹⁹C₆ = 27,132 ways. It is easy to check that 11,628 + 27,132 = 38,760. This is an example of the addition principle that we dealt with in Down for the Count.

Example 2

Let us consider another result, namely

We proceed as follows. Suppose we have n items and we have to select r of them for some purpose. This obviously can be done in ⁿC_r ways. However, by selecting the r items for the purpose, we automatically create a selection of the remaining n – r items not for that purpose – that is for some other purpose. Selecting n – r items for some other purpose can obviously be done in ⁿC_{n – r} ways. Since the selection of r items automatically creates a selection of the remaining n – r items, the number of ways must be equal. Hence, ⁿC_r = ⁿC_{n – r} .

Example 3

Now let us try to prove

We proceed as follows. Suppose we have n items from which we have to select r items. This can be done in ⁿC_r ways. Now suppose we select k items from the r. This can be done in ^rC_k ways. The two selections can obviously be done in ⁿC_r ^rC_k ways. But we can make these selections in a different way. Let us first select the k items from the original n items. This can be done in ⁿC_k ways. Now we have to select r – k items from the remaining n – k items, which can be done in ^{n – k}C_{r – k} ways. The two selections can obviously be done in ⁿC_k ^{n – k}C_{r – k} ways. Since in both cases we end up with r – k items for one purpose and k items for a second purpose, the number of ways must be the same. Hence, ⁿC_r ^rC_k = ⁿC_k ^{n – k}C_{r – k}.

An example of this last result is as follows. Suppose we have a class of 30 students. We need to select a committee of 7 members within which we select a sub-committee of 3 members. Then we can first select the entire committee, which can be done in ³⁰C₇ = 2,035,800 ways. Then we select the sub-committee, which can be done in ⁷C₃ = 35 ways. This gives a total of 2,035,800 × 35 = 71,253,000 ways. Alternately, we select the sub-committee first, which can be done in ³⁰C₃ = 4,060 ways. Now we have to choose the remaining 4 members of the committee from the remaining 27 students, which can be done in ²⁷C₄ = 17,550 ways. This gives a total of 4,060 × 17,550 = 71,253,000 ways. This is an example of the multiplication principle that we dealt with in Down for the Count.

Binomial Identities

Having considered some combinatorial arguments, we turn our attention to the binomial coefficients themselves. Since there is a pattern leading from one row to the next, there must be some relationships that hold among these coefficients. For simplicity in proposing and proving these relationships, let us take a = 1 and b = x in the expansion of (a + b)ⁿ, which becomes (1 + x)ⁿ.

Example 1

Using the normal binomial expansion we obtain

If we substitute x = 1 in the above we get

We can verify this by considering a few rows of Pascal’s triangle. For example

We can see that the identity holds true for the first 5 values of n.

Example 2

Now, if we substitute x = -1 in the expansion of (1 + x)ⁿ, we will get

We can verify this for the first few rows of Pascal’s triangle. For example

Again, we can see that the identity holds true for the first 5 values of n.

Example 3

We can even use differentiation to obtain various results. For example, consider the expansion

If we differentiate both sides with respect to x we will get

Now, if we substitute x = 1 in the above equation, we will get

Again, we can verify this for the first few rows of Pascal’s triangle. For example

Once again, we have seen that the derived identity holds for the first 5 values of n.

Example 4

We can carry the idea of differentiation further as follows. We know that

If we multiply both sides by x we will get

Now, if we differentiate with respect to x we will get

Once again, we can verify this for the first few rows of Pascal’s triangle. For example

So, we have verified the result for the first 3 rows of Pascal’s triangle.

Parting Shot

What we have seen in this post is that we can use logical arguments to obtain relationships between binomial coefficients. We can also use simple algebra and calculus to obtain other relationships. As seen in the last example, we can multiple (1 + x)ⁿ by other terms to obtain further identities. The results are quite obviously endless. For example, I leave the reader to attempt to prove that

Perhaps a hint would help. Consider the expansion of

If you wish to check the proof, click here.

Our study of counting principles is not over, though. There is much more left to go. In the next post, we will consider how to obtain the number of divisors and sum of the divisors of any natural number without actually listing the divisors or counting them. And we will consider the technique of partitions. Till then, the ball is in your court.
A Penchant for Permutations

January 30, 2026

Refresher

Two weeks back, we started a new series on Counting Principles. In Down for the Count we looked the the basic multiplication and addition principles of counting. In Arranging and Choosing we introduced the ideas of permutations and combinations. In this post, we will consider circular permutations and permutations with restrictions.

Circular Permutations

When we introduced permutations, even though we did not mention it explicitly, the implicit understanding was that we were arranging the objects in a row. In fact, this is not necessarily the case. However, if we are able to call one position the ‘first’ position and another the ‘second’, etc., then what we have, in effect, is a row of positions since we can place the positions in a row according to their ‘number’.

The knights of the round table. (Source: World History)

Now, if we arrange the positions in a circle, what we lose is the ability to name a position as genuinely ‘first’. As the legendary round table of Arthur asserted, there is no ‘first’ in such a seating arrangement. Since there is no head, there is no primacy given to any particular position. We are only left with the relative positions and can say, “Person A is seated to the right of person B” or “Person B is seated to the left of person A.”

Now suppose there are 5 distinct objects that need to be permuted and placed in a circle. Two of the permutations are ABCDE and BCDEA. These are shown below

Since no position is a ‘starting’ positions or ‘first’ position, the two linear permutations above represent the same circular permutation, with A→B→C→D→E→A, being the progression if we went clockwise around the circle. We can easily see that the permutations CDEAB, DEABC, and EABCD represent the same circular permutation. Hence, there are 5 linear permutations that represent the same circular permutation. This simply represents having 5 options for the top position.

If we generalize this to n objects, we can see that we will have n linear permutations that represent the same circular permutation. Now, n objects can be permuted in n! ways, as we saw in the previous post. This means that n objects can be permuted in a circle in

We can extend this further to permuting r out of n objects. We know that r out of n objects can be permuted in ⁿP_r ways. However, for each set of r objects, there will be r linear permutations that represent the same circular permutation. This means that r out of n objects can be permuted in

Permutations of Non-Distinct Objects

Now suppose we do not have distinct objects. Rather, there are some objects that are identical. Let us consider a small set to get started. Now, instead of 5 distinct objects, let us consider 5 objects, 2 of which are identical. Hence, we could consider the set to be A, A, B, C, D. Note that the two ‘A’s are identical. Hence, it is impossible to distinguish between them. However, for the sake of this illustration, let us use subscripts to indicate patterns. So the two ‘A’s are A₁ and A₂.

Now, it is clear that the patterns A₁A₂BCD and A₂A₁BCD will be indistinguishable since both will appear as AABCD. In other words, there are 2 patterns that are indistinguishable from each other. In a similar way, BA₁CA₂D and BA₂CA₁D will appear as BACAD. What this means is that, no matter where we place the two ‘A’s, there will be two permutations that will be indistinguishable from each other.

What happens if there are 3 identical objects (e.g. AAABC)? Well, we can see that the patterns A₁A₂A₃BC, A₁A₃A₂BC, A₂A₁A₃BC, A₂A₃A₁BC, A₃A₁A₂BC, and A₃A₂A₁BC will all appear as AAABC. Hence, there are 6 permutations that will be indistinguishable.

We can see that, if there are p objects that are identical, then there will be p! permutations that will be indistinguishable, all of which represent permutations in which the identical objects occupy the same positions in the pattern. Hence, if there are n objects, p of which are identical, they can be permuted in

We can extend this to multiple kinds of identical objects. Suppose we have n objects with p₁ of one kind, p₂ of a second kind, etc. up to p_k of a k^th kind. Then the number of permutations will be

Let’s apply this. Let’s say I have 50 tiles, 4 of which are blue, 5 of which are red, and 10 of which are green, all others being distinct. Then then number of permutations will be

That’s 2.91… septendecillion permutations! I bet you learned a new word here!

Permutations with Restrictions

We can also consider a different kind of permutation in which we have some restrictions. For example, suppose we have 5 distinct objects (e.g. A, B, C, D, and E), which have to be arranged such that two of them, say A and B, have to be adjacent to each other. To find the number of permutations, we consider the group formed by A and B to be a distinct group, say X. Then, we have 4 distinct objects, C, D, E and X, which need to be permuted. This can obviously be done in 4! = 24 ways. However, the objects A and B can be permuted among themselves in 2! = 2 ways. For example, the permutation CXDE can be CABDE or CBADE. Hence, the total number of permutations is 4! × 2! = 24 × 2 = 48.

We can generalize this. Suppose we have n distinct objects, p of which need to be placed in a group and arranged among themselves. Hence, there are n – p distinct objects which have no restrictions. Then the number of distinct objects will be n – p + 1, including the group with p objects, which can be permuted in (n – p +1)! ways. However, the p objects can be permuted among themselves in p! ways. Hence, the total number of permutations will be (n – p + 1)! × p!.

Let’s consider an example. Suppose there is a group of 20 people, 5 of whom must be seated together. Then then number of permutations will be

Of course, we can extend this further. Suppose there are n distinct objects that need to be arranged such that a group of p₁ objects need to be together, another group of p₂ objects need to be together, and so on till a group of p_k objects need to be together. Note that each group will count as 1 object. Hence, the effective number of objects, including distinct objects and groups, which need to be permuted will be

Hence, the number of permutations, including the permutations within the groups, will be

Let’s try this out. Suppose there are 50 seats in a bus and 50 passengers. Among the passengers are a family of 5, a couple, and a group of 11 friends. If the groups have to sit together, the number of permutations will be

That’s 98.99… quindecillion permutations!

Looking Ahead

In this post we have looked art some different kinds of permutations, where the objects are arranged in a circle or where some of the objects are identical or where some distinct objects need to be kept together. Of course, there are cases where we may need to ensure some objects are not placed together, as in the case of rivals or bitter enemies. This is a much more complex situation than simply keeping a group together. So we will look into that in a post much later. However, in the next post, we will turn our attention to combinations and look at what are known as combinatorial arguments, which will help us to derive identities using the binomial coefficients solely through logical argumentation rather than through algebraic manipulation. Auf wiedersehen!
Arranging and Choosing

January 23, 2026

Defining Terms

(Source: Willing Ways)

In the previous post, Down for the Count, we started a series on counting principles. We looked at the multiplication and addition principles of counting and I promised that, in this post, we would look at permutations and combinations, as well as the relationship between combinations and the binomial coefficients. So, what do these strange words mean?

A permutation is an arrangement. Hence, if I have three objects, A, B, and C, I could arrange them as ABC, ACB, BAC, BCA, CAB, and CBA. Each of these, while containing each of the objects, represents a different ordering of the object and, hence, a different permutation. Now, suppose I have to choose two out of the three objects. I could choose A followed by B or B followed by A. But in both cases I have chosen A and B. What is end up with, in other words, does not depend, in this case, on the order in which I chose the objects. Both of them represent the same combination of A and B. Hence, if the order is important, we call it a permutation. And if order is unimportant, we call it a combination.

The term ‘binomial coefficient’ is a little more difficult to explain. Consider the expression a + b. Since there are two terms, a and b, this expression is called a binomial. Now, consider the expression (a + b)ⁿ. We know that this means the expression a + b is multiplied by itself n times. For example, (a + b)² = a² + 2ab + b². The coefficients of the terms are 1, 2, and 1. Since these are obtained by expanding the power of the binomial, they are called ‘binomial coefficients’. Similarly, the binomial coefficients of (a + b)³ are 1, 3, 3, and 1, while those of (a + b)⁴ are 1, 4, 6, 4, and 1.

Enumerating Permutations

Suppose I have two distinct objects, A and B. I need to place them in some order. I can choose either A or B to occupy the first position. Hence, for the first position, I have 2 options to choose from. Once I do this, I am left with the second object and am forced to place it in the second position. This means that I have 2 × 1 = 2 ways of arranging A and B, namely AB or BA.

Now, if I have three objects A, B, and C, I have 3 options for the first position. Once I have placed that object, I am left with 2 objects, which can be arranged, as we have seen in the previous paragraph, in 2 ways. Since I have to arrange all three objects, I use the multiplication principle and conclude that there are 3 × (2 × 1) = 6 ways of arranging A, B, and C. These have already been listed earlier.

Now, if I have four objects A, B, C, and D, I have 4 options for the first position. Once I have placed that object, I am left with 3 objects, which can be arranged, as we have just seen, in 6 ways. Against, using the multiplication principle, we can conclude that there are 4 × (3 × 2 × 1) = 24 ways of arranging A, B, C, and D. These are ABCD, ABDC, ACBD, ACDB, ADBC, ADCB, BACD, BADC, BCAD, BCDA, BDAC, BDCA, CABD, CADB, CBAD, CBDA, CDAB, CDBA, DABC, DACB, DBAC, DBCA, DCAB, and DCBA.

We can see that, if we proceed this way, the number of ways of arranging 5 objects would be 5 × 4 × 3 × 2 × 1 = 120 ways. We can generalize this and conclude that, given n distinct objects, the number of ways of permuting all of them would be

Now, suppose we do not wish to arrange all the objects, but only r of the n available objects. We can obtain the following table.

The first row in red contains the position numbers. The second row indicates the number of options available for that position. Note that the third row is simply the sum of the items in the preceding two rows, which all happen to be n + 1. It follows then that the number of ways of permuting r out of n items is

This is a cumbersome formula. However, we can make it simpler as follows

We can use this formula quite easily now. For example, if we wish to arrange 9 out of 15 distinct objects, then we number of permutations is

The answer of 1,816,214,000 should also give a reason why, in many cases, we simply prefer to write it in short form as ¹⁵P₉. The large numbers are due to the fact that the factorial (n!) grows at a remarkably fast pace. The table below gives a comparison of the growth of various common functions.

As can be seen, after a relatively slow start, the factorial function just balloons up. It will eventually grow faster than any exponential function because, for any function y = a^x, for values of n > a, the multipliers for n! will be greater than the multipliers for a^x, which is a. As the table above shows, 25! > 10²⁵. If we had 25 distinct objects and wish to arrange all of them, we would have 15 septillion, 511 sextillion, 210 quintillion, 43 quadrillion, 330, trillion, 985 billion, 984 million arrangements. Just to give you an idea of how large this number is, if you started at the big bang, forming one arrangement every 1 second for the 14 billion years of the life of the universe so far, you would need more than 35 million such universes before you were able to finish every arrangement!

Enumerating Combinations

Anyway, there are times when the order of objects does not matter. For example, if we are selecting a 3-member committee from person A, B, C, D, and E, it does not matter if we chose them in the order DBE or BED, etc. Only the final members selected would matter. Now, recall that permuting 3 out of 5 objects can be done in ⁵P₃ ways. However, this represents all the permutations of the three objects selected. Hence, for the set {B, E, D} we have 3! ways of permuting them (i.e. BED, BDE, DBE, DEB, EBD, and EDB), all of which constitute the same selection of the set {B, E, D}. In other words, if we are only concerned about the objects selected and not the order in which they are selected, then ⁵P₃ represents an ‘over counting’ by a factor of 3!. This means that the number of ways of selecting 3 objects from 5 objects would be

We can extend this to a more general case. Given n distinct objects, we can arrange r of them in ⁿP_r ways. However, the r objects can be permuted among themselves in r! ways, all of which represent the same selection. Hence, ⁿP_r represents an ‘over counting’ by a factor of r!. This gives the number of ways of selecting r out of n distinct objects to be

Now, we can see that, given n objects to choose from, we can choose anywhere from 0 of the objects to all of the objects. However, if we put r = 0 or r = n in the above formula, we get

However, common sense tells us that there is only 1 way to choose none or all the n objects. Because of this 0! is defined to be equal to 1.

Now, if we tabulated the values of ⁿC_r, we would get a table as follows.

The Binomial Coefficients and Pascal’s Triangle

We can also prepare the first 8 rows of Pascal’s triangle and obtain

We can see that the values of ⁿC_r match the elements of the r^th row of Pascal’s triangle. Why is this?

Consider the expression (a + b)ⁿ. As mentioned earlier, when we expand this, we are multiplying a + b by itself n times. So we have

where there are n sets of parentheses. Now, to complete the multiplication, we need to multiply each element from each set of parentheses by each element in every other set of parentheses. When we get to a particular binomial, we have to decide whether to choose a or to choose b. We cannot choose both, nor can we choose neither. In the end, we will multiply n elements, one from each set of parentheses. Hence, the power of every such product will be n, distributed between the power of a and the power of b. Hence, if the power of a is r, the power of b will be n – r, yielding terms that have a^rb^{n – r}. However, this means that we have to choose r sets of parentheses to contribute the a, thereby making the other n – r sets of parentheses to contribute the b. Hence, we have to ask, “In how many ways can we choose r sets of parentheses out of n?” Since this is obviously ⁿC_r, we can see why these combination values show up as the binomial coefficients. But why do they show up in Pascal’s triangle?

Psacal’s triangle begins with the top row containing two ‘1’s. Each subsequent row is built by starting and ending with a ‘1’ and obtaining the middle values by adding the two elements above it in the previous row. Hence, the second row starts and ends with ‘1’, with the middle ‘2’ being the sum of ‘1’ and ‘1’ from the first row, giving 1 2 1. The third row starts and ends with ‘1’, with the middle values of ‘3’ being the sum of 1 and 2 and the sum of 2 and 1 respectively, giving 1 3 3 1.

We can see how this happens. We can write

Now, to get the term with a^rb^{n – r}, we need to either select a from the first parentheses and the a^r ^{– 1}b^{n – r} term from the second parentheses or select b from the first parentheses and the a^rb^{n – r} ^{– 1} term from the second parentheses. However, both a^r ^{– 1}b^{n – r} and a^rb^{n – r} ^{– 1} are terms from the (n – 1)^th row of Pascal’s triangle, that is the preceding row.

Moreover, the term with a^r ^{– 1}b^{n – r} is the r^th element of the (n – 1)^th row, while the term with a^rb^{n – r} ^{– 1} is the (r + 1)^th element of the (n – 1)^th row. And the term with a^rb^{n – r} is the (r + 1)^th element of the n^th row.

But this means that the (r + 1)^th element of the n^th row is the sum of the r^th element of the (n – 1)^th row and the (r + 1)^th element of the (n – 1)^th row, which is exactly how each subsequent row of Pascal’s triangle is built. We can put this in terms of combinations and write

Since the above equation relating binomial coefficients indicates the way the elements of Pascal’s triangle are obtained, it is clear why the two are identical.

Making it Count

So far we have learned how to determine the number of permutations of r out of n distinct objects as well as the number of combinations of r out of n distinct objects. We have also seen how the binomial coefficients relate to the elements of Pascal’s triangle. We could have proved the final relation using algebra as below.

However, the algebra does not give us any special insight, which is what we need. We did get some insight when we showed why Pascal’s triangle has elements that are the same as the binomial coefficients and when we showed why the binomial coefficients show how Pascal’s triangle is built up. But we have only just started our journey of learning how to count.

In this post, we have considered permutations and combinations of distinct objects in a row. What if some of the objects are not distinct? What if the objects are not in a row, but in a circle? What if we have some restrictions, like two objects cannot be placed side by side or that two objects must always be placed side by side? How do we manage to use the basic counting principles to determine how to proceed counting in these cases? We will turn to that in the next post. Till then, permute your life to make every moment count.
Down for the Count

January 16, 2026
What Are Counting Principles?

(Source: Inner Drive)

Counting is something that we learn from a very young age, either in the informal environment at home or in the formal environment of school. It forms the basis of all the mathematics we learn through our lives. All the basic mathematical operations (addition, subtraction, multiplication, division, and exponentiation) can be explained in terms of counting. Hence, it is often the case that, when students reach high school, after probably 10 years of formal mathematics, they are surprised when they see a chapter in their book with the title “Counting Principles”. They may think, “What principles might this involve? How much more is there to counting? Didn’t we put the issue of counting behind us when we learned how to perform the operations?”

What the student does not realize is that what she has learned so far are ‘recipes’ for performing the mathematical operations. That is, she has been given some idea of what the operations involve. However, the main focus would have been on how to perform the operations not on why certain operations are needed to be performed. For example, if you were given 3 ⊗ 7, where ⊗ indicates some mathematical operation, what would the result be? 3 + 7 gives us 10, 3 – 7 would yield -4, 3 × 7 is 21, 3 ÷ 7 would be 0.428571…, and 3⁷ is 2187. While the student would easily be able to determine each of these five answers, which operation should she use in a given context? This is where the idea of the counting principles comes in. These principles tell us which operations are to be used in a given context.

The Multiplication and Addition Principles

As an example, suppose a restaurant offers a 3-course lunch with 3 options of starters, 5 of main courses, and 2 of desserts. How many different selections of lunch can a customer make? Or if you have a choice of a room from 3 hotels, one of which has 20 rooms, the second 30 rooms, and the third 50 rooms, how many choices of rooms do you have? How do the 3, 5, and 2 in the first situation relate to each other. How do the 20, 30, and 50 in the second situation relate to each other? There needs to be some principles on the basis of which the student would decide how to find the answer to each of the situations. What are these principles?

Today, we are starting a new series on counting principles. We will start with some basic ideas and work ourselves to more complex ideas. At the end we will answer the following question: “For a theatre production of a play there are n characters, each of which requires an understudy. If any of the actors and understudies can play all the parts and if the main actor must have more years of experience than the understudy, in how many ways can main actors and understudy actors be assigned?”

As promised, let us begin with the two earlier examples, which are relatively easy. Suppose we label the starters as A, B, and C, the main courses as P, Q, R, S, and T, and the desserts as Y, and Z. The possible choices are listed below:

In the above, each row represents a different choice of starter. The colors differentiate the main courses. Finally, the dessert choice is differentiated between normal and italicized fonts. We can see that there are 40 possible selections because 4 × 5 × 2 = 40. But the reason we multiply is that the customer has to choose a starter and a main course and a dessert.

In the case of the hotels, we cannot multiply because the person can only occupy one room. Hence, he must choose to go to either hotel A, where he has a choice of 20 rooms, or hotel B, where he has a choice of 30 rooms, or hotel C, where he has a choice of 50 rooms, yielding a total of 20 + 30 + 50 = 100 choices.

The situation with the lunch choices is one in which we use what is known as the multiplication principle, while the case with the hotel rooms uses the addition principle.

The multiplication principle states that, if there are m ways of doing one thing and n ways of doing a second thing, then the number of ways of doing both things is m × n. The addition principles states that, if there are m ways of doing one thing and n ways of doing a second thing, then the number of ways of doing either thing is m + n.

Since the lunch menu required a choice of starter and main course and dessert, the multiplication principle was applicable. And since the hotel room required a choice of only one of the three hotels, the addition principle was applicable.

Your Turn

Why don’t you try the questions below?
1. A football squad consists of 3 goalkeepers and 5 strikers. In how many ways can the coach choose 1 goalkeeper and 1 striker?
2. A library allows members to borrow two books at a time. The library has 15 books by Jeffrey Archer and 23 books by Paulo Coelho. In how many ways can a member borrow 1 book by each author?
3. There are 5 apples and 7 oranges in a fruit basket. In how many ways can a person choose either an apple or an orange?
4. A student has 3 colleges to choose from. The first college has 3 programs, the second has 5 and the third has 6. Assuming that the programs are full-time programs, in how many ways can she choose a program?
5. A library allows members to borrow only one book at a time. The library has 15 books by Jeffrey Archer and 23 books by Paulo Coelho. In how many ways can a member borrow a book by either author?
6. Three cities A,B, and C are connected by 4 roads between A and B, 5 between B and C, and 6 between C and A. In how many ways can a round trip be made?
How did you fare with the above questions? The answers are 15, 345, 12, 14, 38, and 720. Many students get the first 5 but stumble on the last one. The answer I have given is correct. It is 720. If you wish to check the solution, click here.

Checking Out

As we continue with the series, we will see that there are many ways to put these two basic counting principles together. We will learn about permutations and combinations in the next post as well as the relation of the latter to Pascal’s triangle. We will also learn about how permutations and combinations relate to each other. We will also see why the combinations show up as the coefficients of the binomial expansion. Till then, let everything count.