Consider the integers ,
, , and . For each of these integers, do the
following.
Determine whether the integer is a multiple of .
With the hundreds digit equal to , the tens digit equal to , and the units digit equal to , compute .
What do you notice?
Suppose is a
three-digit integer ( is the
hundreds digit, is the tens
digit, and is the units digit).
Show that if is a multiple of
, then is a multiple of .
Show that if is a
multiple of , then the three-digit
integer is a multiple of
.
Suppose is a
six-digit integer that has each of its digits different from . Show that is a multiple of if and only if is a multiple of .
Think of ways to generalize the fact in part (d).
Hint
No hint given.
What are the remainders when and are divided by ?
No hint given.
Try to find a condition on six-digit integers that detects
divisibility by . There is a
well-known general test for divisibility by , but it will be useful to find another
one that is specific to six-digit integers and similar to the condition
for three-digit integers implied by (b) and (c).
Solution
In the table below, the first column contains the four integers
given in the problem, the middle column says whether that integer is a
multiple of , and the third column
contains the value of .
Multiple of 7?
yes ()
no
no
yes ()
The two integers that are multiples of have the property that is also a multiple of , and the two integers that are not
multiples of have the property
that is not a multiple of
.
The integer is equal to
. Observe that and , so we have that which can then be
rearranged to get .
We are assuming that is a
multiple of , so the expression
is equal to the difference
of two multiples of , and so it
must be a multiple of .
Using the calculation from part (b), for any three-digit
positive integer . If is a multiple of , then is the sum of two multiples of and so must be a multiple of .
The combined result of parts (b) and (c) is that a three-digit
integer is a multiple of if and only if the integer
is a multiple of . To answer the question in this part,
we will first establish a similar fact about six-digit integers.
To do this, we first divide each of , , , , and by to find the quotient and remainder.
Since , we can
substitute the equations above and rearrange to get that and since is a multiple
of , a similar argument to those
used in parts (b) and (c) imply that the integer is a multiple of if and only if the quantity is a multiple of .
We will now prove that is
a multiple of if and only if
is a multiple of . First, assume that is a multiple of . By the fact established above, the
integer is a
multiple of . Though it may seem
like a strange observation, this implies that must be a multiple of
. Expanding and grouping some
terms, we have and so is a
multiple of . This implies that
is a multiple of
, and by the fact established
above, the six-digit integer
is a multiple of .
We now suppose that is a
multiple of . We have already
shown above that if the digits are "cycled" to the left, then the
integer obtained is also a multiple of . Applying this several times, we have
that is a multiple of , as are , , , and finally .
As an example, you can verify for yourself that is a multiple of , and so are each of , , , , and . Similarly, the integer is not a multiple of , and neither are any of , , , , and .
Note: The assumption that none of the digits of
are zero implies that each
of the integers obtained by cycling the digits is a six-digit integer.
If we were to allow digits equal to , the claim remains true, but in
practice we may need to include "leading zeros". For example, cycling
the digits of the integer in
this way would lead to
(which is really a four-digit integer), then , , , and so on.
While we will not include proofs here, we will list a few ways
that the result in (d) can be generalized.
If the number of digits in an integer is any positive multiple of
, then the result of part (d)
still holds. That is, if has
digits for some , then is a multiple of if and only if the integer obtained by
cycling its digits is a multiple of .
For any prime number ,
other than and , if an integer has digits, then it is a multiple of
if and only if the integer
obtained by cycling its digits is a multiple of . For example, a -digit integer is a multiple of if and only if the integer obtained by
cycling its digits is a multiple of . The reason this fails for and is related to the fact that and are the prime factors of , which is the base of our
number system.
Suppose is an integer. The
Euler Totient Function of ,
denoted by , is defined
to be the number of integers with
such that . For example, since and are the only integers between and inclusive that have a of with . since there are exactly
integers from to inclusive with the property that . They are .
If is an integer that is
neither a multiple of nor a
multiple of , then any -digit integer is a multiple of
if and only if the integer
obtained by rotating its digits is a multiple of . For example, the -digit integer is a multiple of , and so are , , and so on.
One of the key observations for proving something like this is a
result due to Euler that says that when and are nonzero integers with , the remainder when dividing
by is .
Problem 1: October
2022
Problem
In any triangle, there is a unique circle called its
incircle that can be drawn in such a way that it is tangent to
all three sides of the triangle. For a given triangle, the radius of its
incircle is known as its inradius and is denoted by .
For each side of the triangle (which is tangent to the incircle),
another tangent to the incircle can be drawn in such a way that it is
parallel to that side. The three sides as well as these three new
tangents give a total of six tangents to the incircle. They uniquely
determine a hexagon that we will call the Seraj hexagon of the
triangle.
Finally, for a given triangle, we will denote by its semiperimeter, which is
defined to be half of its perimeter.
The diagram below is of a triangle showing its incircle and Seraj
hexagon.
Sketch the ––
triangle with its incircle and Seraj hexagon. Compute its inradius,
semiperimeter, and the area of its Seraj hexagon.
Find a general expression for the area of a triangle in terms
only of its inradius and semiperimeter.
Find a general expression for the area of the Seraj hexagon of a
triangle in terms of its three side lengths, its semiperimeter, and its
inradius.
What is the largest possible value that can be obtained by
dividing the area of a triangle’s Seraj hexagon by the total area of the
triangle?
Hint
There are several ways to compute the area of the Seraj hexagon.
One is to subtract the areas of three smaller triangles from that of the
full triangle.
From the centre of the incircle, draw a radius to each point of
tangency the circle has with the triangle.
If is similar
to , then .
It is a useful general fact that if we denote this common ratio by , then any two corresponding altitudes
of these two triangles also have a ratio of . Can you compute the ratio of the areas
of and in terms of ?
One possible expression is
Try to prove that is true for all real numbers , , and and determine a condition on , , and that implies .
Solution
Note: In this solution, we will denote by the area of . We will also use the
following facts about circles.
Fact 1: Suppose a circle has centre and is a point on its circumference. The
tangent to the circle at point is
perpendicular to the radius from
to .
Fact 2: For any point outside of a circle, there are exactly
two tangents to the circle that pass through . If the points of tangency are and , then .
Below is a picture of the triangle. Its vertices are labelled by
, , and with the right angle at , , , and . The centre of the incircle is
labelled by , and , , and are tangent to the incircle at , , and , respectively.
Since the perimeter is ,
the semiperimeter is .
To compute , we first use
Fact 1 to get that . We are assuming that as well, and since
the sum of the angles of a quadrilateral is always , Therefore, is a rectangle. Since and are radii of the incircle, . Since is a rectangle, .
Using that , we get and . By Fact 2, and , and since , This gives , which can be solved for to get .
In the diagram below, the Seraj hexagon has been added. The side of
the Seraj hexagon that is parallel to (and tangent to the incircle)
intersects at and at . The side of the Seraj hexagon that is
parallel to intersects at and at . The side of the Seraj hexagon that is
parallel to intersects at and at . The point of tangency of with the circle is labelled by .
To compute the area of the Seraj hexagon, we will compute the areas
of , , and and subtract their combined
area from the area of .
To compute the area of , we first set and
. Since is parallel to , we have that and . Since the two
share a right angle at , is similar to . Therefore, ,
or .
From earlier, we have that , which means and . By Fact 2, . By the Pythagorean
theorem applied to
and using that , we
have If , then is at , but and are distinct parallel lines, so they
have no points in common. Therefore, or , so .
We can now compute
We now compute the area of . By Fact 1, , and since is parallel to , as well. By reasoning similar to earlier, we
conclude that is a square of
side-length , which means . We also have that , so this means .
Since is parallel to , is similar to by reasoning similar to that which was used to show that
was similar to . This means or . Substituting
, , and into this equation, we get . Therefore,
Using very similar reasoning, one can show that . Therefore,
the area of the Seraj hexagon is
In the diagram below, a triangle has its incircle drawn. Radii
are drawn from the centre of the incircle, , to the points of tangency of , , and , which are labelled , , and , respectively. As well, is connected by line segments to each
vertex of the triangle.
Set , , and . With this notation, the
semiperiemter of is
. Since , , and are radii to points of tangency, they
are perpendicular to , , and , respectively. Therefore, they are
altitudes of , , and , respectively. These three
triangles compose the entirety of with no overlap, so the area of is equal to the sum of
their areas. Therefore,
As an example demonstrating this formula, consider from part (a). Using the
usual area formula, . From part (a), its
semiperimeter is
and inradius is , so , which is the correct
area.
In the diagram below, has its incircle and Seraj hexagon drawn. The side of the
Seraj hexagon that is parallel to intersects at and at . The side parallel to intersects at and at . The side parallel to intersects at and at . The altitude of from is also drawn and its points of
intersection with and are labelled by and , respectively1.
By construction, is parallel
to . By reasoning similar to that
which was used in earlier parts, this means is similar to and is similar to . These two pairs of similar
triangles imply that and . We will
call this common ratio . The area
of can be computed in
terms of and the area of as follows
We next examine the quantity .
Since and are different parallel tangents to the
incircle and is perpendicular to
both lines, the length of must
be equal to the diameter of the incircle (you may want to think about
why this is true). The diameter of the incircle is , so , which means .
Therefore, we have .
From part (b), ,
but from the usual formula for the area of a triangle we also have .
This implies or . Substituting into the
formula for above, we have that
If we set , , and , then we get . Earlier, we showed that
,
so By similar
reasoning, we also have The area of the Seraj hexagon is equal to the
area of minus the
combined area of these three triangles, so using the formulas just above
as well as , we
have
As suggested in the hint, we will show that for all real
numbers , , and . To see this, note that the inequality
is equivalent to the inequality which,
after expanding and rearranging the left side, is the same as
After further manipulation, this is equivalent to Therefore,
the given inequality is true exactly when . The sum of
three squares is always non-negative, so this inequality is always true.
Therefore, the given inequality is true for all real numbers , , and , as claimed. Moreover, the only way
that the quantity can be equal to
is when , and hence, the only way that is when .
Dividing the expression for the area of the Seraj hexagon from part
(c) by , we get that the ratio of
the area of the Seraj hexagon to that of the triangle is
Using the inequality established above, we can multiply by to get that for any real
numbers , , and , .
Applying this with , , and , we get that Therefore, the ratio is at
most in every
triangle. By the remark at the end of the proof of the inequality, the
ratio equals exactly
when .
Since is always nonzero, this is
equivalent to . Therefore, the
ratio is maximized when the triangle is equilateral. It is not difficult
to explicitly show that the area of the Seraj hexagon in an equilateral
triangle is equal to two thirds of the area of the triangle.
Footnotes
If or is
obtuse, then intersects some
extensions and and not the line segments themselves.
We leave it to the reader to verify that the argument that follows works
even if does not pass through
and .↩︎
Problem 2: November
2022
Problem
Let .
For integers , each equal to or , the expression is called a base expansion and represents the
real number
The integers through are called the digits of
the expansion. For example, the base expansion represents the real
number
which can be simplified to get and so .
What are the real numbers represented by and ?
Find a base expansion
of the real number .
Show that and
use this to deduce that for all
integers .
Show that every positive integer has a base expansion and find a base expansion for each positive integer
from through . One approach is to prove and use the
following two facts.
If a real number has a
base expansion, then it has a
base expansion that does not
have two consecutive digits equal to .
If a real number has a
base expansion, then it has a
base expansion that has its
units digit, , equal to .
Hint
No hint given.
What is ?
No hint given.
One way to prove the first fact is to show that if there are two
consecutive digits equal to , then
there is a base expansion with
fewer non-zero digits. To prove the second fact, use the same idea as in
the proof of the first fact, but in reverse.
Solution
Expanding and simplifying, we have
and so we see that
There are several ways to approach this problem and we will
demonstrate two of them. First, from the example given in the problem
statement, we have that
and using the hint, we have that
and so
Another approach is to use what might be called a "greedy algorithm".
We first note that and then consider powers of and find the first one that does not
exceed it. As it turns out, and . It can be
checked that , so if we
let , then
The approximate value of is . Now we check that and , so the largest
power of that does not exceed
is , and it can be checked that Since
and , the
largest power of that does not
exceed is
, and so we now consider
the quantity
Considering taken to negative
integer exponents, we find that , so we suspect
that is exactly equal
to . Indeed,
Therefore, which can be rearranged to
get , which means . Notice that
these two base expansions are
different and both correct. A way to see that the expansions are equal
is explained throughout the rest of the solution.
Expanding and
simplifying, we have Multiplying this equation by gives . It will be
useful in part (d) to note that this means that and more generally,
if ever occurs in a base expansion, it can be replaced by
, or vice versa, without
changing the actual value of the number being expressed.
As suggested in the problem, we will use the facts in the bullet
points. Specifically, we will use Fact 1 and Fact 2 below:
Fact 1: If a real number has a base expansion, then it has a base expansion that does not have two
consecutive digits equal to .
Fact 2: If a real number has a base expansion, then it has a base expansion that has its units digit
equal to .
Proof of Fact 1. Let be a base
expansion of . We will call the
digit bad if and at least one of and is equal to . We want to show that has a base expansion that has no bad
digits.
Observe that if has an
expansion with every digit equal to , then must itself be the real number . This is because every power of is positive.
Suppose the given expansion of
has at least one bad digit. Then we let be the largest integer such that is bad. By definition, and either or . However, it is not possible
for since this would
imply that is bad, but
is the largest integer such that
is bad.
Thus, we must have ,
, and , and so the given expansion is
By the remark at the end of
part (c), we can replace the by
to get that
Notice that we have found a base expansion of with fewer non-zero digits, under the
assumption that the expansion had at least one bad digit. In other
words, if a base expansion of
has at least one bad digit, then
there is a base expansion of
that has fewer non-zero digits.
Thus, as long as there is a bad digit, we can decrease the number of
non-zero digits.
By the above remark, unless ,
the number of nonzero digits cannot decrease indefinitely since there
were finitely many digits to begin with. Thus, we must eventually lose
the ability to decrease the number of nonzero digits, and by the
previous paragraph, this means we must eventually reach a base expansion that has no bad
digits. ◻
Below is an example of the procedure explained in the proof of
Fact 1. It is used to find a base expansion of that has no
bad digits. There is a step where a leading is replaced by . The digits are aligned in the
equations to indicate where this happened. In every line except the
last, the the digits that are changed in the subsequent line are
highlighted.
Proof of Fact 2. This can be shown by an argument rather
similar to the proof of Fact 1. This time, we will introduce bad digits
in order to remove a potential
from the units digit. Suppose has
a base expansion. By Fact 1,
we can assume that the expansion is a expansion without any bad digits. If , then there is nothing to do, so we
assume that . We will now
artificially add two "trailing" ’s
as digits. That is, we also have that where . Let be the largest integer with and . Since we have added two
trailing zeros, we know this must happen somewhere, so it is possible to
choose this way.
By the choice of , we must have
that , and since the
expansion has no bad digits, . By the choice of , it then follows that , then since there are no bad
digits, we get , and so
on. Since , this alternating
pattern must eventually reach . In other words, the expansion
takes the form By part (c), we can change and to ’s and compensate by changing to . The new expansion is Repeating
this process, we now change
and to ’s and change to a . This process terminates in the
expansion which
indeed has . ◻
The procedure in the proof of Fact 2 is used below to find a base
expansion of the number
represented as that has . As was done earlier, in each line
but the last, the three digits highlighted are the ones that
change in the subsequent line.
Now we will apply the facts to derive base expansions for the first ten
positive integers.
To start, we have that since . This means the integer has a base expansion. Following the procedure
outlined in the proof of Fact 2, . It is easy to add to a base expansion that has because we can simply change . Thus, we have that , and using the procedure
from the proof of Fact 1 again, we get .
Continuing in this way, if we have a base expansion of the integer , then we can use the technique in the
proof of Fact 1 to find a base
expansion that does not have any bad digits. Then, if necessary, we can
use the technique in the proof of Fact 2 to find a base expansion of that has . From this, we can find a base
expansion of by taking the base expansion of and switching from to , which has the effect of adding . This process can be repeated to find a
base expansion of , then , and so on. Starting with , this is demonstrated up to below.
A bag contains exactly 15 marbles of which are red, are blue, and are green. The marbles are chosen at
random and removed one at a time from the bag until all of the marbles
are removed. One colour of marble is the first to have remaining in the bag. What is the
probability that this colour is red?
Note: It might be useful to familiarize yourself
with the notation of binomial coefficients before attempting
this problem.
Suppose there are red
marbles and blue marbles. As in
the original problem, the marbles are chosen at random and removed from
the bag one at a time until all marbles are removed. One colour of
marble is the first to have
marbles remaining in the bag. What is the probability that this colour
is red?
Suppose there are red
marbles, blue marbles, and green marbles. The marbles are chosen
at random and removed one at a time until all marbles are removed. What
is the probability that red is the colour of marble that is first to be
completely removed from the bag?
Suppose there are red
marbles, blue marbles, and green marbles with . Let be the probability that the red
marbles are the first to be completely removed from the bag and define
and similarly. Determine which of , , and is the smallest and which is the
largest. Does the result agree with your intuition?
Show that the values of , , and depend only on the proportions of
, , and to the total number of marbles. For
example, if one bag has red,
blue, and green marbles and another has red, blue, and green marbles, then the probability
that the red are removed first is the same for both bags.
Hint
If the red marbles are all removed from the bag first, then what
is the colour of the last marble to be removed from the bag?
As in part (a), it is useful to think about the colour of the
final marble to be removed. In part (a), the final colour alone
determines which colour has completely removed first. In this part, it
is a bit more complicated.
Try to find a general expression for each of , , and , simplifying as much as possible.
Once you have done this, try to determine the sign of .
One way to approach this is to compute the probabilities directly
in terms of the proportions. Thinking about which colour is removed last
will likely be helpful, as in earlier parts. Another approach is to use
a general formula for the probabilities in terms of , , and , and divide the numerator and
denominator by .
Solution
Suppose there are red
marbles and blue marbles and set
. Upon removing the marbles from the bag in random order,
we can record the order in which they emerged from the bag by a sequence
of ’s and ’s. For example, if the first two
marbles were red and the third was blue, then the sequence would begin
with . If the fourth was red,
then the sequence would continue , and so on.
An important observation in this part and later parts of the solution
is that every possible sequence of characters, of which are and of which are , is equally likely to occur through
this process. To make this observation, we first imagine labelling the
red marbles by , , and so on through to , and the blue marbles by , , and so on through to . With this labelling, we have made it
so that the marbles are distinct.
There are exactly orders in
which the marbles can be removed, and each of them is equally likely. If
we now imagine listing these
possible orderings of and removing
the subscripts, each possible sequence of ’s and ’s will occur in the list exactly times. Therefore, if we remove
the (unindexed) marbles and write down the sequence of ’s and ’s, each possible sequence will occur
with probability exactly .
Hence, every sequence occurs with the same probability. Notice that
is equal to the
number of sequences of ’s and ’s since any such sequence is completely
determined by selecting of the
possible positions and placing
in them.
Therefore, the probability that the red marbles are removed first is
equal to the number of sequences of ’s and ’s with the property that there is at
least one to the right of the
rightmost , divided by .
Since there are only two colours in this part of the problem, the
statement "there is at least one
to the right of the rightmost " is
equivalent to "the final letter in the sequence is ". Put in a different way, the red
marbles are completely removed first if and only if a blue marble is
removed last. Therefore, the probability that we seek is simply the
number of sequences of ’s and
’s that end in , divided by . The number of such
sequences is equal to since, to construct all
such sequences, we can place one
at the end, and then place the ’s in any of the other positions. Therefore, the probability
that all the red marbles are removed first is Looking ahead to later
parts, notice that this probability is equal to the proportion of the
marbles that are blue.
Consider the following two events:
Event X: The final marble that is removed is blue and the final
green marble is removed after the final red marble is removed.
Event Y: The final marble that is removed is green and the final
blue marble is removed after the final red marble is removed.
To say that the red marbles were removed first is the same as saying
either Event X occurred or Event Y occurred. Since Event X has blue as
the final marble and Event Y has green as the final marble, Event X and
Event Y cannot occur simultaneously, so this means that the probability
the red marbles are removed first is the sum of the probabilities of
Event X and Event Y. Therefore, we can compute the desired probability
by computing the probabilities of Events X and Y and adding the results
together.
For both of these probabilities, we need to count the total number of
ways that the marbles can be removed. As in part (a), we will think of
an order that the marbles can be removed in as a sequence of ’s, ’s, and ’s. There are ways to place the ’s. After doing this, there will be
empty positions, so there are
ways to place the
’s. Once the ’s and ’s are placed, there is no choice of
where to place the ’s, so the
total number of sequences is
Note: This expression is an example of something called a
multinomial coefficient, which is a generalization of a
binomial coefficient. We will not use the notation of multinomial
coefficients in this solution, but you might like to read about them if
you have not before.
We will compute the number of sequences that end in and have at least one to the right of the rightmost . This quantity divided by is the
probability that Event X occurs.
There are
ways to place the ’s so that the
final letter in the sequence is .
This is because we need to place one in the final position, then place the
remaining ’s in the remaining positions.
For any such placement of the ’s, in order to place the ’s and ’s so that at least one occurs to the right of the rightmost
, we need to place a in the rightmost empty position. Thus,
for every such way to place the ’s, we now have ways to place the
’s. This is because we must place
one in the rightmost position
that is not occupied by a , and
then the remaining ’s can be placed in any of the remaining
positions. Once the ’s and ’s are placed, there is no choice of
where to place the ’s, so the
probability that Event X occurs is By a very similar
calculation, it can be shown that the probability that Event Y occurs is
and so
the probability that all the red marbles are removed from the bag first
is
In part (b), we showed that .
Following nearly identical reasoning, it can be shown that
Using the equations above, we will show that is negative. This result will
show that is greater than
. Remember that we are assuming
in this part.
Since , , and are all positive, the quantity is positive, which means
is negative if and only
if
is negative. Using a common denominator of , we get
Noting that the denominator is also positive, we have
that is negative if and
only if the numerator of the fraction above is negative. Manipulating
the numerator, we have At first, it might seem that there is no hope
of understanding the expression above. However, if we imagine the
situation where , then by
symmetry we really should have that (in the problem, we are
assuming that , but the
equation above is not aware that we plan to impose this restriction).
This means the expression
should evaluate to when , so it should have a factor of . Indeed, if we group the terms that
"look" alike, it is easier to notice the factor of :
Since is given, we now have
that is negative if and
only if the quantity is positive,
but each of , , and is positive, so it is indeed
positive!
Therefore, we have that , or . Similar calculations can be
used to show that .
Therefore, the marble colour with the smallest number of representatives
is most likely to be removed first, and the marble colour with the
largest number of representatives is least likely to be removed
first.
Let ,
, and . That is, is the proportion of the marbles
that are red, and similarly for and .
From part (b), the probability that all the red marbles are removed
first, , is
Dividing the numerator and denominator of each expression by , the probability above is equal
to
So we have written the probability that all the red marbles are
removed first in terms of the proportions of each of the three
colours.
We will now try to explain why the formula above makes sense. Once
again, we will think of a way of drawing the marbles randomly as a
sequence of ’s, ’s, and ’s. In any given position, the
proportion of sequences with a in
that position is . In
particular, is the probability
that the final letter in the sequence is .
The quantity is equal to the
proportion of green marbles among those marbles that are either red or
green. Similar to the reasoning above, this means that if we look at
sequences of ’s, ’s, and ’s the quantity is the
probability that is the rightmost
letter among ’s and ’s.
The overall rightmost letter being equal to is independent of the probability that
the rightmost letter among ’s and
’s is , so the quantity
is equal to the probability that the rightmost letter in the sequence is
a and among ’s and ’s, the rightmost letter is . Similarly, the quantity is equal to
the probability that the rightmost letter is and among ’s and ’s, the rightmost letter is . The ’s all occur before the final and before the final if and only if one of these two events
occurs, so this explains why the probability that all the red marbles
are removed first is equal to
This expression is in terms of the proportions of red, blue, and green
marbles. Similar expressions can be computed for and .
Problem 4: January
2023
Problem
For each positive integer ,
define a function for each integer . That is, is the sum of the first perfect th powers. It is well known
that .
Fix a positive integer .
Let be the set of ordered triples
of integers between and , inclusive, that also satisfy and . Show that there are exactly elements in the set .
With as in part (a), show
that there are elements
in and use this to show that
For each , show that there
are constants such that
for all .
Note: Actually computing the constants gets more and
more difficult as gets larger.
While you might want to compute them for some small , in this problem we only intend that
you argue that the constants always exist, not that you deduce exactly
what they are.
Use part (c) to show that and .
It follows from the fact in part (c) that is a polynomial of degree . With , this means there are constants and such that Use the fact that
and for all to determine through , and hence, derive a formula for
.
Show that is a factor
of for every positive
integer and that is a factor of for every even positive integer
.
Hint
What are the possible values of ?
How many distinct integers can occur in a triple in ?
Try to generalize the idea in part (c). The constants through do not depend on .
For positive integers and
with , the usual convention is that
. This convention
makes sense for (at least) two reasons. First, there are zero ways to
choose objects from distinct objects if , so " choose " should be equal to . Second, the formula for given by
will have a factor of in the
numerator if .
Directly compute an expression for . It should be a
polynomial with coefficients depending on through . By equating coefficients with the
polynomial , solve for through . After these coefficients are known,
can be computed from .
A polynomial with infinitely many roots must be the constant zero
polynomial. Using this fact, show that for all real numbers,
not just positive integers. This means you need to "extend" to accept inputs that are not
positive integers. Once this is done, determine the values of and . To show that is a factor of for even , consider the values of when is a positive integer.
Solution
Since and are positive integers and is larger than both of them, the
smallest value that can take is
. The possible values of are therefore , , and so on up to .
The only triple in with the
property that is , so there is one triple with
. The triples in with are , , , and , so there are four triples with
In general, if , then and can both be any integer from through inclusive. Thus, there are triples in with . Since ranges from through , there are exactly
triples in .
We will now count the triples in by considering them according to the
number of distinct integers that occur in the triple. For example, there
are two distinct integers in and three distinct integers in
.
If is in , then there are at most three distinct
integers in the triple (since there are only three integers in total).
As well, since , the integers
cannot all be equal. Therefore, there are either two distinct integers
or three distinct integers in .
Suppose , , and are three distinct integers between
and inclusive. Since they are distinct,
one of them is the largest. Assuming is the largest, the triples and are both in , and these are the only triples in
that contain the integers , , and . Therefore, for every way to choose
three distinct integers between
and inclusive, there are two
triples in . This observation
means there are
triples in with three distinct
integers in them.
Now suppose that and are two distinct integers with . Then the only triple in that contains exactly the integers
and is . Thus, for every two distinct
integers between and inclusive, there is exactly one
triple in that contains exactly
those two integers. Therefore, there are triples in with two distinct integers in them.
Therefore, the number of elements in is .
From part (a),
Fix a positive integer .
Using the idea from parts (a) and (b), we let be the set of ordered lists of positive integers with the
property that each integer is
between and inclusive and is larger than every other
integer in the list. A list of the form is called
a -tuple.
As in the case with , it is
not possible for a -tuple in
to have . This is because the other
integers are positive and
must be the largest integer in the list (it cannot be "tied" for the
largest). The possible values of are through .
For a fixed value of ,
say , if is in
, then through can each be any positive integer less
than , of which there are . Therefore, the number of -tuples in with is . The possible values of are through , so we have that the number of
elements in is
Following the reasoning of part (b), we now want to count the
elements in a different way.
Since is the largest
integer in the -tuple, there
are at least two distinct integers in every -tuple in . As well, there are at most distinct integers in every -tuple. Suppose we have chosen distinct positive integers between
and inclusive with . We would like to count
the -tuples in that contain exactly those integers. For example, suppose and . We need to count the number of -tuples that use exactly three given
integers with the largest integer occurring only in the rightmost
position. Suppose the largest integer is and the other two are and . The -tuples in are
, , , , , , , , , , , , ,
and so there are -tuples in with exactly those three integers.
This means that there are -tuples in that have exactly three distinct
integers in them. The important thing to notice here is that the
constant does not depend on
. We could do a similar count to
see how many -tuples there are in
with exactly five distinct
integers. The combinatorics might be a bit tedious, but in the end, we
would find that the number of -tuples in with exactly five distinct integers
is some multiple of
where the multiplier does not depend on .
Putting this together, there must be some constants so that
where each coefficient is equal
to the number of -tuples in
that contain exactly a fixed
set of distinct integers.
Put differently, suppose is a
set of distinct positive
integers. Then is the number of
-tuples that satisfy the
following three conditions: every integer in the -tuple comes from , every integer in occurs at least once in the -tuple, and the largest integer in
occurs exactly once and is in the
rightmost position.
Before moving on, we make one final observation. With the convention
that when , the equation above makes sense
even when is smaller than the
bottom number in the binomial coefficient. For example, if and , then the term is
counting the number of -tuples
consisting of four distinct integers between and inclusive. There is no way to
choose four distinct integers from the list , , , so there should be zero such -tuples, and the fact that records this
observation.
From part (c), we have that for some
constants , , and . Using that , we have that and so . With , we have , so and so . Finally, using , we have , so from which it follows that
. Recall that is equal to the number of four-tuples
in consisting of a fixed set of
four distinct integers. Indeed, the largest integer must go in the
rightmost position, and there are
ways to arrange the other three integers, so makes sense from a combinatorial
perspective.
We now have shown that
and after some simplification, we get that
We can approach similar
to how was approached above.
We start with
and then use that , , , and to solve for
, , , and . After doing this, we find that , , , and . Therefore, after some
simplification, we get
Using that , we can find a
general expression for . and
after collecting like terms, we get We can now
equate coefficients to get the system of equations From the first equation, we get . Substituting this into
the second equation gives so . Continuing this way, we
get that , , , and . Therefore
To solve for , we can use that
to get the equation
which means . Finally, we can
rearrange into a nicer form
Fix a positive integer and
consider the function . We
observed earlier that is a
polynomial in . While is designed to output when is a positive integer, there is nothing
to stop us from "symbolically" evaluating when is not an integer. For example, even
though
does not have the same meaning as when is a positive integer (we cannot "add
together the first
positive integers"), we can still substitute into the formula for to get .
Now consider the function where is allowed to be any real number. The
function is the
sum/difference of three polynomials, so it is itself a polynomial. Since
for all
positive integers , is a root of for all positive integers . By the fact in the hint, a polynomial
with infinitely many roots must be the constant zero function, so for all real
numbers .
With , we get that , but since , we have that , so . Similarly, by considering , we get that , and since , we have , so as well. We have shown that
and are roots of the polynomial , which shows that and are factors of . This means is a factor of .
We now suppose that is even,
which means for all real
numbers . From above, we have that
. Considering at , we have that , and since is even and , we have or .
Next, we use
with to get , and since
, this means or .
Continuing this way, we have , so , and so . We have now shown
that for the
positive integers , , , , and . This pattern continues. It can be
proven using mathematical induction that for all positive
integers .
This means that for every non-negative
integer. Therefore, the polynomial has infinitely many
roots, and so must be equal to
for every real number .
Taking , we get
which simplifies to .
Therefore, is a root
of when is even. Thus, has a factor of , which is equivalent to
having a factor of .
Problem 5: February
2023
Problem
The sequence has
the property that every integer in the sequence is a divisor of the sum
of the integers adjacent to it. That is, is a divisor of ,
is a divisor of , is a divisor of , and so on.
For , the sequence of positive
integers is called a splendid sequence of length if it satisfies conditions S1, S2,
and S3 found below.
S1. is a divisor of
and is a divisor of .
S2. is a divisor of
for each from through inclusive.
S3. There is no prime number that is a divisor of every integer
in the sequence.
For example, is
a splendid sequence because it satisfies S1, S2, and S3. The sequence
satisfies S1 and S2, but
it is not a splendid sequence because it fails S3 since is a divisor of every integer in the
sequence.
For , is a splendid sequence of
length if it satisfies S1
and S3. Mostly for notational convenience, we also define a splendid
sequence of length to be the
"sequence" . That is, the only
splendid sequence of length
consists of a single integer equal to .
Show that there is only one splendid sequence of length .
Show that there is at least one splendid sequence of every
possible length .
Suppose is a splendid
sequence of length . Show
that for every integer with there is a positive
integer so that
is a splendid sequence of length .
Suppose is a splendid
sequence of length . Show that
.
For each , show that
there are only finitely many splendid sequences of length .
Find a closed form for the number of splendid sequences of length
. Your answer should be an
expression in terms of .
Note: In part (f), we are asking you to find a
closed form for the number of splendid sequences, the existence of which
immediately implies that there are only finitely many. Hence, one way to
answer part (e) is to answer part (f). With that said, we decided to
include part (e) because it can be done without part (f) and (at least
as far as we can tell) it is quite a bit easier than part (f). Part (f)
is very challenging, so the hint will have more detail than usual.
Hint
Remember the condition that no prime number can divide every
integer in a splendid sequence.
Think of an integer that is a divisor of every integer.
Try the positive integer .
Prove that if the divisibility conditions (S1 and S2 in the
problem statement) are satisfied by , then and must both divide every integer in the
sequence.
Try to write down a few splendid sequences of length at least
. With the hint for (c) in mind,
what do you notice about the largest integer in a splendid sequence?
Show that, for each , there is a
largest possible value that an integer in any splendid sequence of
length can take. For instance, it
can be shown that in a splendid sequence of length , every integer must be less than
.
There is a famous sequence called the Catalan numbers
where the th Catalan
number equal to . The Catalan
numbers arise in many interesting ways in mathematics. One such way is
that the th Catalan
number is equal to the number of sequences of length with the following properties.
Each is a positive
integer.
.
For each satisfying , . That is, is no more than one more than
(it is allowed to be less than
or equal to ).
In the solution, such sequences will be called tame
sequences. One way to answer this question is to show that the
number of tame sequences of length is equal to the number of splendid
sequences of length and use the
closed form for the number of tame sequences. To do this, we suggest
trying to devise a way to use a tame sequence of length as "instructions" to construct a
splendid sequence of length . The
idea from part (c) will probably be important.
Note: In the solution, we will provide a proof that
the number of tame sequences of length is the Catalan number. You might
want to try to prove this yourself, but we recommend taking it for
granted when trying to solve part (f).
Solution
Definition 1: For integers and , we say that divides if there is an integer with the property that . In this case, we write .
The phrases " is a divisor
of " and " is a multiple of " both have the exact same meaning
as " divides ". Notice that, by this definition,
every integer is a divisor of ,
but the only divisor of is itself. We give a few facts that will
be used in this solution. Their proofs are not included.
Fact 1: If
and are positive integers such
that , then .
Fact 2: If ,
, and are integers such that and , then and .
Fact 3: If ,
, and are integers such that and , then .
Suppose is a splendid
sequence. By definition, this means and . Since the
integers in a splendid sequence must be positive, Fact 1 implies that
and , which implies . If is a prime number such that , then by Fact 3. However, no prime
number can divide both and because is splendid. Therefore, no prime
number divides . Similarly, no
prime number divides , and so
.
Therefore, the only splendid sequence of length is .
There are several "generic" sequences that one might find. The
simplest is probably the sequence . That is, the sequence
with for all is always a splendid sequence. This is
because no prime number divides ,
and divides every integer.
Another splendid sequence is . For each integer
in this sequence, other than the
’s on the end and , the integers next to it are and , so their sum is , which is a multiple of
. The integers next to are and , which have a sum of .
In the remaining parts of the solution as well as in the Appendix, we
will often denote a sequence by a bold letter. For example, we might
refer to the sequence by .
Assume that is a splendid
sequence. We will show that
is a splendid sequence when .
If some prime number divides
every integer in , then it
also divides every integer in . Since is a splendid sequence, there is
no such prime number, so no prime number divides all of the integers in
.
If , then and
. Since is a splendid sequence, . We also have that , so or by Fact 2. Since every integer
divides itself, . To
see that , we note
that because is splendid and because every integer divides
itself. By Fact 2, so . The integers through all have exactly the same neighbours
in as they do in , which is a splendid sequence, so
satisfies all other
divisibility conditions required for it to be a splendid sequence.
If , then is splendid by a similar argument
to the one in the previous paragraph.
If , then . When
and when , divides the sum of its neighbours
because it has the exact same neighbours as it did in . In , the neighbours of the integer
are and . Since is a splendid sequence, . We also have
that , and so by Fact 2,
which
means . By
similar reasoning, . The integer is equal to the sum of its neighbours
in by definition, so it also
divides the sum of its neighbours. Therefore, every integer in divides the sum of its neighbours.
We already argued that no prime number divides every integer in , so is a splendid sequence.
Suppose is a splendid sequence. If , then the solution to part (a)
implies that .
Suppose . By definition,
we have that and
that . By
Fact 3, . Since
, we can apply Fact 2 to
get that
which implies that .
Since is splendid, we
also have that . We have just shown that divides , so by Fact 3, . We also have
that , so Fact 2
implies or . Continuing in this way,
we can show that for
all with . By the condition that no
prime number can divide every integer in , we conclude that . Essentially the same argument
shows that .
Most of the work is to prove these two claims.
Claim 1: If is a splendid
sequence of length that
contains at least one integer that is greater than , then there is some with such that and
is a splendid sequence. That is, there is an integer in the sequence
that is equal to the sum of its neighbours, and if it is removed, the
remaining shorter sequence is a splendid sequence.
Claim 2: If is a splendid
sequence of length , then
for every .
Proof of Claim 1. To prove Claim 1, suppose is a
splendid sequence with at least one integer greater than and let be such that is the largest integer in the
sequence, choosing the rightmost occurrence if there is a "tie". More
precisely, is the largest integer
with the property that
for all .
By part (d), , and
since there is at least one integer in that is greater than , neither nor can be the largest integer in , which means . The choice of ensures that and . If , then since is the largest integer in the
sequence, this would imply that
is not the rightmost occurrence of the largest integer in the
sequence. Therefore, we actually have that .
The inequalities and
imply that and since is
splendid, . As well, all integers in a splendid sequence
are positive, which means that is a positive multiple of
that is less than . The only such multiple is itself, and so .
We have shown that one of the integers in is equal to the sum of its
neighbours. To finish proving the claim, we need to show that is a splendid
sequence. In , only and have different neighbours than
they did in , so the
divisibility conditions we need to verify are that and
We know that and we have just
shown that .
Substituting, we get that . By Fact 2,
or . A
nearly identical argument shows that . Therefore,
satisfies the divisibility
conditions.
If a prime number divides
every integer in , then and , so by Fact 2 since . This would mean
divides every integer in , which is not the case since is splendid. ◻
We will now prove Claim 2 by mathematical induction. The essence of
the proof is that, by Claim 1, the largest integer in a splendid
sequence must be the sum of two integers in a shorter splendid sequence.
Therefore, the maximum size of an integer in a splendid sequence of
length is at most twice the
maximum size of an integer in a splendid sequence of length . This means that there is always a
fixed upper bound on the size of integers in a splendid sequence of a
fixed length.
Proof of Claim 2. To get an idea of how the induction will
work, we first prove this for ,
, and . For , we showed in the solution to part
(a) that the only splendid sequence of length is . The largest element in this
sequence is , so the claim
holds for .
Now suppose is a
splendid sequence of length . If
, then every integer
in the sequence is less than . Otherwise, since by part (d), Claim 1 implies
that is the
largest integer in the sequence, so all integers in the sequence are at
most .
Continuing to , suppose is a splendid sequence.
Again, if the sequence consists entirely of ’s, then for all . Otherwise, either is a splendid sequence of
length and , or is a splendid sequence of
length and . Either way, three of the
four integers are in a splendid sequence of length , and the fourth is the sum of two
integers in a splendid sequence of length . We just showed that an integer in a
splendid sequence of length is at
most , so an integer in a
splendid sequence of length is at
most .
Therefore, no integer in the sequence can exceed .
Now for the inductive step. Suppose, for some , that every integer in every
splendid sequence of length is at
most . This is our
inductive hypothesis. Consider a splendid sequence of length
. If for all , then for all . Otherwise, Claim 1 implies that there
is some so that and is a splendid sequence of length . By the inductive hypothesis, this
means each of , , , , , , , , and is at most , and since , each of these
integers is at most .
For , we have and since and , we conclude that
as well. We have shown
that every integer in a splendid sequence of length is at most . This completes the induction
and the proof. ◻
By Claim 2, every integer in a splendid sequence of length is at most . There are only finitely many
sequences of length consisting of
positive integers less than or equal to , regardless of whether they are
splendid. Therefore, for fixed ,
there are only finitely many splendid sequences of length .
The proof of part (f) will use some language about sets.
Specifically, we will use the language of injective, surjective, and
bijective functions. If you have seen this before, you should be ready
to read the solution to part (f). Otherwise, we recommend reading
Appendix 1 first.
As pointed out in the hint, the number of splendid sequences of
length is equal to the st Catalan
number. The th
Catalan number is equal to , so the
number of splendid sequences of length is . The
sequence of Catalan numbers shows up in many contexts. A useful example
for this solution is given in Definition 2.
Definition 2: For each positive integer , we call a sequence of positive integers
a tame sequence if
and for every
integer with . In other words, and every other integer in the
sequence is at most more than the
previous integer.
The name tame sequence was made up for the purpose of this
solution, but it is known that the number of tame sequences of length
is equal to the th Catalan number. There are
proofs of this in various places in the literature. For completeness, we
have included a proof in Appendix 2. It is stated as Claim 5.
Let denote the set of tame
sequences of length and denote the set of splendid sequences
of length . We will show, for
, that there is a bijection
with domain and codomain
. By the discussion in
Appendix 1, this will show that the number of splendid sequences of
length is the st Catalan number.
Recall from part (c) that if is a splendid
sequence of length , then
is a splendid sequence of length .
From this point on, it will be notationally useful to prepend a zero
at the beginning of splendid sequences. For example, the sequence will now be the unique splendid
sequence of length . The sequence
is a splendid
sequence of length . This means
that a splendid sequence of length now has integers, the first of which is . For instance, , , , , , and is how we would index the sequence
going forward. Notice
that the observation from part (c) mentioned above also works if we
insert the sum between the first and second integers, and . You should convince yourself of this
before moving on.
For each , we define a
function, , with domain and codomain . The way works is to use a tame sequence as a
list of instructions to build a splendid sequence. Consider a tame
sequence of length . Starting with , the unique splendid sequence of
length , we read from left to right and each
integer in the tame sequence tells us where to insert a sum to get a
longer splendid sequence. Specifically, in the th step, the integer tells us that we should insert a sum
between and to get a longer splendid
sequence.
For example, suppose and the
tame sequence is . Start with , which has and . The first integer in is , so in the first step we insert the sum
between and . This means we go from to . To start the second
step, we reindex to , , and . The next integer in is , so we insert the sum between and to get . The next integer
in is , so we insert the sum between the third
and fourth integers in the current splendid sequence to get . Continuing, we
get ,
followed by , and
finally .
Thus, .
By part (c), if is a tame
sequence of length , then is a splendid sequence of
length . Also note that at the
start of the th step,
the splendid sequence has
integers in it. Because of the way tame sequences are defined, the th integer in a tame sequence
is at most , so at each step, the
splendid sequence is always long enough for the instruction to makes
sense.
For each , we will show
that the function is a
bijection. The proof will be by induction, but we first need a
definition and then a useful fact.
Definition 3: For a splendid sequence of length
(remember that ), we say that is a peak if .
By Claim 1 (see the solution to part (e)), every splendid sequence
with an integer greater than has
at least one peak. Moreover, if that peak is "removed", the resulting
shorter sequence is splendid. As well, with our new notation, if there
is no integer greater than , then
the sequence is of the form and the first
(leftmost) is its only peak. If
it is removed, the resulting shorter sequence is also splendid. Indeed,
the reason for introducing the at
the beginning was to avoid having to treat the sequence of all ’s separately in this part of the
argument.
Now for the useful fact:
Claim 3: Suppose is a splendid sequence of length
and that is a
tame sequence of length such
that . If we let
, then is the leftmost peak of .
A proof of Claim 3 is given at the end. We will now prove by
induction that is a bijection
for all .
It was observed in part (a) that the only splendid sequence of length
is . As well, the only tame
sequence of length is . It is easily checked that
. The sets and each have only one element. There is
only one function between two sets with one element, and it is always a
bijection (convincing yourself of this is a good exercise in
understanding definitions). Therefore, is a bijection.
For the inductive hypothesis, we assume for some that is a bijection.
We will show that is a
bijection, which means we need to show that it is injective and
surjective. To show that it is surjective, we assume that is in and let be its leftmost peak. By Claim 1 from
the solution to part (e), the sequence is in
. Since is a bijection by the inductive
hypothesis, it is surjective, so there is some tame sequence in with . For convenience,
let .
Recall that the leftmost peak of is . Note that since can never be a peak. Therefore,
and so .
Suppose is the leftmost peak
of . If , then is a peak of because has the same neighbours in and when . However, was chosen to be the leftmost peak of
, so we cannot have . This means , but by Claim 3, , so we get . Combining with , we have that , which means
the sequence is a tame sequence. We will call this
tame sequence .
To recap, the tame sequence is obtained by appending to , the splendid sequence is obtained by inserting the sum
between the st and
th integer in , and . It follows that . We have found
such that . Since was an arbitrary element of , this concludes the proof that
is surjective.
We will now show that is
injective. To do this, we suppose and are in with . We
will show that .
Let be
such that .
Suppose is the leftmost peak of
. By Claim 3, both and . This shows that . As well, if we set and and
, then
. By
the inductive hypothesis, is
bijective, and hence, it is injective, so . This shows that and have the same first integers, and since as well, we have that , which concludes the proof
that is injective.
We have now shown that
is bijective, which proves that and have the same number of elements when
. Therefore, the number of
splendid sequences of length is
as
claimed earlier.
Proof of Claim 3. The proof is by induction on . It was noted earlier that the only
tame sequence of length is
(), the only splendid sequence of
length is (, , ), and that . The leftmost peak of
is and . This shows that Claim 3 is true
when .
For the inductive hypothesis, we suppose, for some , that if is a
splendid sequence of length and
is a
tame sequence of length such
that , then the
leftmost peak of the is
where .
Suppose is a splendid sequence of
length and that is a tame
sequence of length with . Because of how
is applied, is a peak of . As well, is a splendid
sequence of length such that
where . For
convenience, set and . Suppose the leftmost peak of
is for some . For now, assume . It is not difficult to show
that it is impossible for a splendid sequence to have two consecutive
peaks. This means we must have since must be at
least two less than . If this
happens, is also a peak of
because has exactly the same neighbours
in and . By the inductive hypothesis,
is the leftmost peak of
, so and we get . Since is a tame sequence, or . This implies that so , which is impossible.
Therefore, we cannot have ,
which means is indeed the
leftmost peak of . This
completes the induction and the proof. ◻
Appendix 1
Suppose and are finite sets, where by "set" we mean
an unordered collection of objects. Suppose there is a "rule" that, for
every element in the set ,
produces an element in the set .
For instance, if the sets were and (both sets of three ordered pairs), the "rule"
might be "square the second entry, then reverse the order". With this
rule, becomes , becomes , and becomes , so every element of is transformed into an element of . Such a rule is called a function. The
set is called its "domain" and
is called its "codomain". If the
function is named , we would use
to denote the function applied
to an element in the domain. You
have probably seen functions before where the domain and codomain are
all or part of the set of real numbers, but the notion of a function
applies in a much broader context. Below are three important properties
that functions may (or may not) have.
Injectivity: A function with domain and codomain is called injective if for
every two elements of the domain, and , if , then . In other words, a
function is injective if its application to two different elements of
the domain always gives two different results. Note that when trying to
prove that a function is injective, we typically assume that and deduce that . You might want to think about
this logic.
Surjectivity: A function with domain and codomain is called surjective if for
every there exists an so that . In other words, a function is
surjective if every element of the codomain is the result of applying
to some element in the domain.
[We might also say that the range equals the codomain to
describe surjectivity.]
Bijectivity: A function is with domain and codomain is called bijective or is
a bijection if it is both injective and surjective.
There is a lot to be said about injective, surjective, and bijective
functions, but for us, the useful observation will be that if and are finite sets and there is a
bijective function with domain
and codomain , then and have the same number of elements.
Indeed, if has elements and has elements, then being injective implies
that and being surjective
implies that . Thus, being
bijective implies that , so .
Observe that the example given at the beginning of this appendix is
neither injective nor surjective, so it is not bijective. However, and do have the same number of elements. It
is important to keep in mind that we are only claiming that if there is
a bijection from to , then they have the same number of
elements. There are six bijective functions from to , the example we gave just happens to
not be one of them.
Appendix 2
We will now show that the number of tame sequences of length is equal to the th Catalan number, . The proof
relies on the material from Appendix 1. As well, the results here are
well known and proofs of them can be found in the literature.
Definition 4: For each positive integer , a sequence of integers is called a jagged
sequence of length if properties
and hold:
. Exactly of the integers are equal to and exactly of the integers are equal to .
. For each integer with , the sum of the first integers in the sequence is
non-negative.
Claim 4: There are jagged
sequences of length .
Proof of Claim 4. Fix a positive integer . Let be the set of sequences of integers that satisfy and fail . Also, let be the set of sequences of integers, of which equal and of which equal . We will show that and have the same number of elements.
Suppose is in . Since fails , there must be some with and the property that the sum of the first entries is negative. Let be the smallest such position in the
sequence. If , then . This means of the integers in the list are equal to , and of them are equal to . Therefore, the sequence has integers equal to and integers equal to , which means it is in .
If , then , , and so on up to , but . Since but , we must
have that is negative, but
, so . As well, each of the are integers, so the two sums above
are integers, which means and (there is
no other way to add to a non
negative integer and get a negative integer). The fact that implies that
exactly half of the integers in the list are equal to , and so the number of ’s in is one more than the
number of ’s. Since the number of
’s and ’s is equal in , this means the number of ’s in is one more than
the number of ’s. All of this
implies that
has two more ’s than ’s. Two numbers that differ by and have a sum of must be and , so the sequence above is in .
The above work defines a function, that we will call , with domain and codomain . Specifically, if in and the smallest integer such that , .
That is, is the sequence
obtained by negating every integer from to the end of the sequence. We
will show that is a
bijection.
To see that is injective,
suppose and are in with . We suppose that
is the smallest such that and is the smallest such that . We might as well
assume that . Our assumption
says that the sequences
and
are equal. Since , this
means when and when . Observe that ,
and since
by assumption, we get that . This means as well and so . Since when and , we have that for all . In other words, , so is injective.
Now suppose is a sequence is in . Because , exactly of the integers in are equal to and of them are equal to . Consider the list of sums The first "sum", , is either or . The final sum is . As we move from one sum
to the next in the list above, we add for some , which means the sums either increase
or decrease by as we move down
the list. Therefore, there is at least one sum that equals (it could be the first). Suppose is the smallest such that . Then the
sequence is in and . To see that , we have that and , ad so it must
be that .
Therefore, .
This means has the same
number of ’s and ’s, which means there are of each. As well, the sequence fails
because the first integers have a negative sum. This
shows that , and that
is essentially by
the definition of .
Therefore, is surjective, which
completes the proof that it is a bijection.
The number of sequences in is
. This is because
we can choose where to put the ’s
in ways, and then
there is no choice of where to place the ’s. By what we have shown, we now know
that there are
sequences in as well.
We can now compute the number of jagged sequences. The number of
sequences of integers that
satisfy is by reasoning similar to
that in the previous paragraph. To get the number of jagged sequences of
length , we need to subtract from
the number of
sequences that satisfy but fail
, which is exactly the number of
sequences in . Therefore, the
number of jagged sequences is
◻
Claim 5: The number of tame sequences of length
is .
Proof of Claim 5. We will find a bijection from the set of
jagged sequences of length to
the number of tame sequences of length .
Suppose is jagged and that are the
indices where the ’s
occur. That is, and all
other integers in are equal
to . We will now define a
sequence so that is the sum of the integers in from up to and including the th integer equal to . In symbols, .
Of the first integers in
, exactly of them are equal to and the other of them are equal to . Therefore, their sum (which is by definition), is . Thus, for a jagged
sequence , we define to be the sequence where .
We will show that is a
bijection, but we first need to confirm that is always tame, which means we
need to show that , for all , and that for all . We know that by , so this means and . Suppose . In , there are only ’s up to and including , which means that among the first
integers in , there are at least as many ’s as ’s. This means the sum of the first
integers cannot be positive.
Therefore, we must have that , and since both are integers,
, which rearranges to
, which gives . To see that , note that , so . Then and so or . This completes the
proof that is tame.
To see that is injective,
notice that can be
rearranged to get . In
other words, if ,
then is uniquely determined
from . This means that from
, the positions of the ’s in are uniquely determined, so the
entirety of is uniquely
determined by . This means
there is only one with the
property that , so
is injective.
To see that is surjective,
suppose
is a tame sequence. For each from
through , define , then define so that
if for some , and otherwise.
That follows by
rearranging to get . However, to conclude that
is surjective, we need to verify
that is indeed a jagged
sequence.
By one of the conditions of tameness, , so so we have that . Using the assumption that
which can be
rearranged to , we
get that which means and it follows that
. Finally, since
is positive, . We have shown that
. This
shows that all of the are
distinct, so satisfies .
Rearranging , we get
, and by the reasoning
from earlier, this means the sum of the first integers in (always ending with the th ) is equal to , which is positive because is tame. Now consider the sum
for some an
arbitrary with . If for some , then the sum is positive by the
reasoning just given. Otherwise, there is some for which . Since every
integer in strictly between
and equals by construction, we must have that
because the
latter is obtained from the former by adding some (possibly zero) ’s. We know that and that
is positive, so this means .
We have shown that
satisfies as well, so is a jagged sequence with . Therefore, is surjective, which completes the
proof that it is bijective. Therefore, the number of tame sequences of
length is . ◻
Additional Information
Splendid sequences are part of a much more general curious
combinatorial context. Here are a few definitions:
A graph is a collection of vertices (denoted in this
document by circles) and edges (lines connecting some of these
circles).
Two vertices in a graph connected by an edge are called
adjacent.
A splendid numbering of a graph is an assignment of a
positive integer to each vertex so that
Each number divides the sum of the integers in the adjacent
vertices, and
the greatest common divisor of all integers is 1.
Below are some examples of splendid numberings of some graphs:
You may notice that a splendid sequence of length is simply a splendid numbering of the
graph that is a path of length (the rightmost example above is a
path of length ).
So, given a graph, we can ask the same question as in the problem of
the month: How many splendid numberings are there? Just like splendid
sequences, the number of splendid numberings for any fixed graph is
finite! (see part (e) of the problem).
The exact number of splendid numberings is known for paths of length
. This was essentially the content
of part (f) of the problem.
Two other families of graph for which splendid numbers have been
studied are cycles and
-pointed stars. Rather
than defining these generally, the image below has a cycle on the left and a pointed star on the right. We
leave it to the reader to guess the general definition.
A 6 cycle (left) and a 6-pointed star (right)
The number of splendid numberings of an cycle is known to be . This was computed in
a paper from 2018 (see reference (1)), which is quite recent!
For an -pointed star, the
number of splendid numberings is closely related to the number of ways
to decompose the number 1 as a sum of the form where each is a positive integer, which is a
very hard problem. In fact, it is currently unknown how many splendid
numberings there are for the -pointed star.
This is almost all of what is known at the moment, and counting the
number of splendid numberings (called arithmetical structures in the
research community) is an active area of research. In 2020, for example,
the number of splendid numberings for a particular family of graphs
called "bidents" was computed (see reference (2)). It created a new
entry in The On-Line Encyclopedia of Integer Sequences (OEIS)! The new
entry is entry number A335676. If you’ve never played around with the
OEIS, it’s definitely worth it. Just make sure you have a few
procrastination hours to burn.
As a final note, we should mention that there are many resources out
there about Catalan numbers. For example, Richard P. Stanley’s 2015 book
"Catalan Numbers" contains a very long list of contexts where the
Catalan numbers arise.
References
Benjamin Braun et al. "Counting arithmetical structures on paths
and cycles". In: Discrete Math. 341.10 (2018), pp.
2949-2963.
Kassie Archer et al. "Arithmetical structures on bidents". In:
Discrete Math. 343.7 (2020), pp. 111850, 23.
Problem 6: March
2023
Problem
For a non-negative integer ,
define to be the first digit
after the decimal point in the decimal expansion of . For example, and so . Note that and that when is a perfect square. You will likely
want a calculator that can compute square roots for this question.
Compute for every
integer strictly between and as well as every integer strictly between and .
Compute for every
integer strictly between and as well as every integer strictly between and .
Show that if is a positive
multiple of , then each digit from
through appears in the list
the same number of times.
For each digit from through , determine how many times occurs in the list
Here are a couple of other things that you might like to think
about. No solution will be provided for either of these questions, but
as always, we would love to hear about any observations you make!
How are the digits through
distributed among the infinite
list
For example, in the long run, are the ten digits distributed roughly
"uniformly"? One way to make sense of this question is to think about
the frequency of each digit among the list for very
large .
Are there similar patterns to those in the earlier parts of this
problem if we consider the first two digits after the decimal place?
What if we consider three or more digits?
Hint
There is no hint given for this part, but it might be useful in
later parts to see if you notice any patterns in the distribution of the
possible outputs of the function .
See part (a).
If ,
then is equivalent to .
Similar to the result in (c), if is one more than a multiple of , then in the list every
possible value from through appears exactly times, with the exception
of and which appear times each. Try to find
and prove other similar results.
Solution
Note: In this solution, we will take for granted that if is a positive integer that is
not a perfect square, then is an irrational number.
Since
and , we have
that and .
In the table below, the first column contains the integers from through , the second column contains truncated to four digits past
the decimal point, and the third column contains .
Among the values of for
from through , each integer from through appears exactly once except for and , which appear exactly twice each.
Notice that and are the values of and , respectively.
Since , , , and , we have that , , , and .
In the table below, the first column contains the integers from through , the second column contains truncated to four digits past
the decimal point, and the third column contains .
Similar to part (a), observe that among the values of for between and inclusive, every integer from through appears either once or twice, and the
integers that appear twice are exactly , , , and .
Suppose is a positive
integer in the list . This is
equivalent to , which is equivalent to since , , and are non-negative.
Fix an integer satisfying
and assume that is an integer satisfying both and . That means is the first digit past the decimal
point in the decimal expansion of . That is between and means the integer part of is . Therefore, is at least more than and at most more than . Since is between two consecutive
perfect squares, it is not a perfect square, which means is irrational. Putting this
together, we get that where the strict inequalities
are justified by the irrationality of . We are also assuming that is a positive multiple of , which means there is some positive
integer such that .
The quantities in the chain of inequalities above are all positive,
which means Expanding
and substituting , we get
We will now count the number of integers that satisfy the
inequalities above.
Since the integer satisfies
, we must have that
.
Therefore, is between the
consecutive integers and
, possibly equal to the
smaller of the two but not equal to the larger of the two. Since is an integer that is greater than
, we
conclude that
Similarly, implies
that . This means is
between the consecutive integers and , possibly equal to the
larger of the two but not equal to the smaller of the two. Since is an integer that is smaller than
,
we can conclude that . Combining this inequality with the inequality
from earlier, we now have that The number of integers that satisfy this inequality is
We have now shown the following: For each integer satisfying , there are at most integers in the list with the
property that . There are
integers in the list above and
only ten different values that the function can output. Since , there is no possibility other
than that takes every possible
value exactly
times.
To get an idea of how to proceed, we first discuss the results of
parts (a), (b), and (c).
In part (a), we saw that for values of between and , the function takes on every possible value the same
number of times (zero) with the exception of and , which occur one extra time each. For
values of between and , the function takes on every possible value the same
number of times (one) with the exception of and , which occur one extra time each.
Similarly, in part (b) we observed that takes on every value the same number of
times, with the exceptions of ,
, , and when ranges between and as well as between and .
The result of part (c) was that if is a multiple of , then takes on every possible value the
same number of times (with no exceptions) as ranges over the integers strictly
between and .
The patterns in part (a) and (b) continue, and there are similar
patterns for other ranges between perfect squares. A bit more precisely,
if is a positive integer, then as
ranges over the integers between
and , takes on every possible value the
same number of times, possibly with some exceptions that occur one extra
time each. These exceptional values depend only on the remainder when
is divided by .
Even more precisely, the following five claims are true. The claims
are numbered to correspond to remainders after division by .
Claim 0: Suppose for
some integer . As ranges over the integers strictly
between and , takes every value from through exactly times.
Claim 1: Suppose for
some integer . As ranges over the integers strictly
between and , takes the values and a total of times each and takes every other
value times.
Claim 2: Suppose for
some integer . As ranges over the integers strictly
between and , takes the values , , , and a total of times each and takes every other
value times.
Claim 3: Suppose for
some integer . As ranges over the integers strictly
between and , takes the values , , , , , and a total of times each and takes every other
value times.
Claim 4: Suppose for
some integer . As ranges over the integers strictly
between and , takes the values , , , , , , , and a total of times each and takes the values and a total of times each.
In part (c), we showed that Claim 0 is true. Before proving Claims 1,
2, 3, and 4, we will answer the actual question which was to determine
how many times each digit occurs in the list Because
is a perfect square, , so we can instead count
the number of times each digit occurs in the slightly different list
There are perfect squares
among the integers from through
, so there are occurrences of in the list that come from perfect
squares.
For every other integer in the
list , there
are unique integers and with and
with the property that .
Suppose is a non-zero digit.
For fixed and , as we let take the values strictly between and , Claim tells us whether the digit occurs either times or times in that range. We are
considering values of satisfying
for
every pair with and . For every pair of the form
, we get occurrences of (see Claim 0). For every pair of the
form , we always get occurrences or we always get occurrences of , depending on what Claim 1 says about
the digit . Continuing with this
reasoning, the number of occurrences of in the list can be
expressed in the form where is the
number of values of for which
Claims 0 through 4 say there are
occurrences of , and is the number of values of for which Claims 0 through 4 say there
are occurrences of .
For example, for , there are
occurrences when , , and , and there are occurrences when and . Therefore, with , we have and , so the number of times that occurs in the list is .
Using Claims 0 through 4, the table below summarizes the count for
the remaining non-zero digits. In the first column, the digit appears, in the second column the value
of appears, in the third column,
the value of occurs, and in the
fourth column, appears,
which is the number of times
appears in the list.
Finally, for , we already
have occurrences of in the list coming from the perfect
squares. Otherwise, we can use the exact same technique as above to
count the number of s in the list
coming from integers that are not perfect squares. For , and , so there are occurrences of the
digit in the list .
We will now prove Claims 1 through 4.
Suppose is a non-negative
integer, is , , , , or , and that is a digit between and inclusive. We wish to count the number
of integers such that and .
Specifically, we want to show that this number is either or depending on and in the way delineated in Claims 0
through 4.
The condition is equivalent to , and this combined with (note that is not a perfect square and that and are consecutive integers) is
equivalent to
Since everything is non-negative, everything can be squared to get the
equivalent chain of inequalities
After some expansion, we have We
are interested in how many integers satisfy the chain of inequalities
above. Since is an
integer, the number of integers
satisfying the inequalities above is the same as the number of integers
that satisfy and after
combining fractions on the left and right, we get
Suppose
and
are both strictly between and
. Then the integers satisfying are precisely the integers through . More generally, we will show that
whether there are or integers depends on whether or not
the quantities and are between
the same pair of consecutive integers.
To give an idea of how the argument will proceed, consider the
situation when and . Then ,
but .
Thus, becomes The integers through satisfy these inequalities,
for a total of integers.
Observe that This means the difference
between and
is
less than , which leads to the
following two possibilities.
Possibility 1: There is an integer so that In this case, the integers satisfying are , , , and so on through for a total of integers.
Possibility 2: There is some integer with the property that
In this case, the integers satisfying are through , for a total of integers.
We have now shown that there will be integers satisfying exactly when there is an integer
strictly between and .
Multiplying through by , this is
equivalent to there being a multiple of strictly between and
The table below contains the values of for every and . The columns after the
first are indexed by a value of
from through from left to right and the rows after
the first are indexed by values of from through from top to bottom. The cell in the row
corresponding to and the column
corresponding to contains . Cells are highlighted if there
is a multiple of between the
value in the cell and the value in the cell immediately to its
right.
/
0
1
2
3
4
5
6
7
8
9
10
0
0
1
4
9
16
25
36
49
64
81
100
1
0
21
44
69
96
125
156
189
224
261
300
2
0
41
84
129
176
225
276
329
384
441
500
3
0
61
124
189
256
325
396
469
544
621
700
4
0
81
164
249
336
425
516
609
704
801
900
The highlighted cells correspond to pairs for which is less than a multiple of and is greater than that
multiple of . As noted above,
these correspond to the pairs
for which integers satisfy
. For , we see no values of , which gives another proof of Claim 0.
For , the values of for which there are integers are and , which proves Claim 1. Continuing,
the data in the table above verifies Claims 0 through 4.
Problem 7: April
2023
Problem
Let denote the set of all
-tuples of s and s. For example, is the set of all ordered pairs of
s and s or . To
improve readability, we will omit the commas and parentheses from the
elements of . For example, the
elements of will be denoted by
, , , , , , , and .
Variables referring to elements of will be bold lowercase letters. For
example, we might refer to elements in as , or , and so on. To refer to the
coordinates of elements in , we
will use square brackets. For example, if , an element in , then , , , , and .
For two elements and
in , the Hamming distance,
denoted between
and is equal to the number of
coordinates where they differ. For example, if and , then because and , but for , , and . It is important to convince yourself
that for any
and .
The notion of a graph was defined in the extra information
about the February 2023 problem, but there are also plenty of places
online that have definitions. We will keep things simple here and define
a graph to be a collection of vertices, some of which are
connected to each other by edges. When we draw a graph, a
vertex will be represented by a circle and an edge will be represented
by a line segment from one vertex to another. Two examples of graphs are
depicted below. The one on the left has four vertices and the one on the
right has eight. Note that two edges intersecting does not necessarily
imply the presence of a vertex.
For each , we define a graph
with vertices called the
natural graph of . In
the natural graph of , it is
possible to label every vertex by exactly one element of such that there is an edge between
two vertices exactly when their Hamming distance is . The two examples above are the natural
graphs of and . They are shown again below with
their vertices labelled.
A walk in a graph from vertex to vertex is a sequence of vertices starting at
and ending at with the property that there is an edge
connecting every pair of consecutive vertices in the sequence. The
length of a walk is the number of edges it uses. For example,
let , , , and be the vertices labelled by , , , and in the natural graph of , respectively. Then and are walks of length from to . The distance between and in a graph is equal to the shortest
possible length of a walk from to
. In the example above, and have a distance of because there are walks of length , but there are no shorter walks from
to .
Let and be elements of . Show that is equal to the distance
between and in the natural graph of .
For each with , determine the number of
two-element subsets of that satisfy .
Suppose we were to relabel the vertices of the natural graph of
by permuting the labels. That
is, we keep the graph the same but use the elements of to label a vertex of the graph in
some other way. For example, in , we might leave and where they are and swap the positions
of and , as shown.
When this is done, the distance in the new graph between elements of
and is not necessarily equal to any more. The table
below has the two-element subsets of in the first column, in the second column,
and their distance in the relabelled graph in the third column.
new distance
Among the four subsets with , there are two that
have a distance in the relabelled graph of and two that have a distance in the
relabelled graph of .
Now for the question: For each , find a way to permute the elements of
so that the following happens:
Among all two-element subsets of with , there are the same
number with each possible distance in the relabelled graph.
Hint
Suppose
for some . Try to construct a path
of length in the natural graph
from the vertex labelled to
the vertex labelled .
For fixed ,
how many have the
property that ?
Find a function that works for and use this to build one for . It might be useful to think of the
natural graph of as a square
and the natural graph of as a
cube. As well, a cube can be thought of as two squares on top of each
other with vertical edges connecting corresponding vertices in the top
and bottom faces.
Solution
Suppose and are elements in and that for some . If , then , so their distance in the
graph is also .
Otherwise, and differ at exactly coordinates where . Using the
notation introduced in the problem statement, we mean that if is in the list and otherwise. Notice
that the function has
the property that and , so switches and . We will now define a sequence of
elements in . Informally, is obtained from by leaving all coordinates alone
except , which gets
changed from to or to as appropriate. Continuing, is obtained from by leaving all coordinates alone
except , which gets
switched, and this continues for , , and so on. More formally, for
each with we define as follows.
if
is not in the list .
for each in the list .
for
each in the list .
By construction, the list
is a list in which every pair of elements differ at exactly one
coordinate. Moreover, the list is that which is generated by changing
the coordinates of that
differ from those of one at
a time, from leftmost to rightmost. This means , and the above is a
walk from to . There are vertices in this walk, so there are
edges.
We have constructed a walk from to in the natural graph of that has length , which means the distance from to in the natural graph is at most
. To see that it is at least , we suppose
is a walk in the natural graph of of length for some . Since there are vertices in this walk in addition to
and two vertices have an
edge between them exactly when they differ at exactly one coordinate,
the total number of coordinates at which and (the ends of the walk) can differ
is at most . Since we know that
they differ at exactly
coordinates, we must have that . This means that any walk from to in the natural graph of has at least edges.
We have shown that the distance in the natural graph between and is at least and at most , which means it is exactly .
For convenience, in the solution to this part and the solution to
part (c), we will refer to a two element subset as a pair.
Since for any
elements , we
will say that is
the Hamming distance of the pair or the pair has Hamming distance
, and possibly
other similar things depending on the grammar in that particular
sentence. Similarly, we might say that has distance in a graph to mean that the
distance between the vertices labelled by and is in that graph.
For a fixed element , an element satisfies exactly when it
differs from at exactly
coordinates. There are coordinates in total, so there are
possible choices of
coordinates that could be
different from . Therefore,
for a fixed element ,
there are exactly
elements with the
property that .
There are elements in and each belongs to exactly pairs with a Hamming
distance of . This gives a total
of pairs.
However, this total
counts every pair twice, once for each of its two elements. Thus, the
number of pairs in with a
Hamming distance of is .
Denote by the set of
pairs from satisfying . From part (b), there
are
pairs in . In the relabelled
natural graph of , we want the
distances of the pairs in to be
equally distributed among all possible distances in the graph. There are
possible distances between
distinct vertices in the graph, so the fact that is a multiple of is a good sign.
We want to permute the elements of in such a way that for each from through , exactly pairs in
have a distance of in the relabelled graph.
There are many ways to do this. The approach given here is inductive,
starting by examining . Consider
the example from the problem statement. In that example, and were switched and and stayed the same.
Although it is not very interesting in , there is an observation we can make
that will generalize. The vertices that are connected by horizontal
edges in the diagram of the natural graph of remain connected by an edge after
permuting. The vertices in the top change order but their distance apart
does not change. Meanwhile, the vertices that are connected by a
vertical edge are moved to occupy opposite corners of the square so
their distance goes from to . While this is only an increase of
, it will be useful going forward
to think of the vertices connected by vertical edges as having gone from
as close together as possible (connected by an edge) to as far apart as
possible.
Now consider the natural graph of , which is pictured below.
There are edges in the graph.
After permuting the labels of the graph, we want pairs of adjacent
vertices to be sent to adjacent vertices, pairs of adjacent vertices to be sent
to vertices with a distance of ,
and pairs of adjacent vertices to
be sent to vertices with a distance of .
Looking at the diagram, we can think of the natural graph of as being composed of two copies of
the natural graph of laid
horizontally on top of each other. The labelling also has some coherence
with the labelling of . First,
label the bottom and top square as if they were copies of the natural
graph of , making sure to label
vertically-adjacent vertices in the same way. Next, append a to the right of every label in the
bottom layer and append a to the
right of every label in the top layer.
To permute the labels in the way we want, we will first perform the
same permutation on each layer as we did in . In each layer, this moves two pairs
of labels from adjacent vertices so that they are at a distance of . There are also two pairs of adjacent
vertices in each layer that remain adjacent after permuting. The four
pairs of vertically-adjacent vertices will remain vertically adjacent
because we will have performed the exact same permutation in each layer.
At this point, the labels on four pairs of adjacent vertices have been
sent to vertices that are apart
and the other pairs of labels
remain on adjacent vertices. The diagram below shows what we have done
so far:
The second and final step is to swap the corners in the top layer.
This will have the effect of moving the labels on vertically-adjacent
vertices to be as far apart as possible. In a "cube", this means they
will end up at opposite ends of a "space diagonal". Swapping the corners
in a layer preserves the distance between all pairs of vertices in that
layer. This means the net effect of the second step is to move four
pairs of adjacent vertices so that they are at a distance of . The overall effect is shown below:
In the table below, the first column has the pairs from and the second column has the
distance of the corresponding pair in the relabelled graph.
new distance
As grows, the natural graph of
gets harder and harder to draw
in a useful way, so we need some notation to help translate the
geometric idea into symbols. First, we will clarify what we actually
want.
Suppose is a function with
domain and
codomain. This means
is a function that takes elements
of as input and also outputs
elements of . When we talk about
a permutation of , we really
mean a function with domain and
codomain both equal to that is
a bijection. For a brief discussion about what a bijection is,
you can consult Appendix 1 from the solution to the February 2023
problem. In the context of this solution, a function from to is a permutation if every possible
output is attained by exactly one input. For example, the function with domain and codomain given by is a bijection from to . Every possible output is attained
(the four elements of appear on
the right side of the displayed equations above) and no output is
attained more than once. If you think about it, every way to order the
elements of (a permutation)
corresponds to exactly one such function: choose an order of the
elements, then write them in that order in the second column above. It
will not be important for this solution, but it might help you to
understand this connection if you convince yourself that there are
exactly bijections from to itself.
A rearrangement of the labels in the natural graph of can be thought of as a bijection from
to itself. If is such a function, then is equal to the original label
of the vertex to which the label is sent by the permutation. For
example, the permutation for
corresponding to the relabelling from earlier (shown below)
is given by For example, the fourth of
the displayed equations above is because in the second diagram
appears where originally appeared.
This is an important observation because we can now recognize
distance in the relabelled graph as a Hamming distance. The distance
between and in the relabelled graph is equal
to the distance between
and in the original
graph. By part (a), this means the distance between and in the relabelled graph is equal
to .
We can now formally articulate what we seek. For each , we would like a function with domain and codomain both equal
to with the following
properties.
is a permutation of
(a bijection).
Among the pairs
from , takes on each
value from through exactly times.
It may be a good idea to digest what has been said so far, possibly
going back to see how this applies to and .
We can now define for each
, but the definition will be
recursive, so we need a bit more notation. For an element in where , we will write to mean the element of that is obtained by appending a
to the right end of . For example, . Similarly, is the element of obtained by appending to the right end of . Also, we will denote by the element of obtained by changing every coordinate
of from to or to , as appropriate. For example, if , then .
Starting with (which we have
not addressed up to this point) we will let . That is, is the identity
function. The elements of are
and , and and . We now continue recursively.
For , we define from as follows.
If is of
the form for some , then .
If is of
the form for some , then .
Notice that the above instructions indeed explain how to evaluate
at every element of because each element of can be constructed in exactly one
way by appending either a or a
to the right of an element from
. As an example, we will
determine exactly what does to
each element in . For , we have to use the rule in the first
bullet point because the rightmost digit is . , so we have that . Since the second digit of
is also and , we get that . For , we have to use the rule in the second
bullet point. This means . Finally, . This is exactly the function from to itself discussed earlier. We leave
it as an exercise to verify that is exactly the permutation of discussed earlier.
We can now prove by induction that does what we want for every . Before doing that, we will discuss how
this function corresponds to the geometric idea from earlier. The
elements in can be obtained
by taking each element of and
appending a to the right and a
to the right, in a way getting
two elements in from every
element in . By this reasoning,
can be thought of as two
copies of : elements ending in
and elements ending in . If you look at the natural graph of
above, these two copies are
exactly the "bottom" and the "top" squares. The way is defined is to operate
differently on the two copies of , since how is computed depends on
the rightmost digit of . In
other words, it depends on which copy of belongs to. If has a rightmost digit of , then essentially does what did. This corresponds to the bottom
face of the cube being permuted exactly as the square was. If the
rightmost digit of is , then is in the other copy,
corresponding to the top face of the cube in the case of . In this situation, we apply to the element of obtained by removing the last digit,
just as we would if the rightmost digit were . However, we then switch every digit,
and this corresponds to "swapping the diagonals" in . Finally, a is appended to the right of the result,
which corresponds to making sure that elements in the top of the cube
stay in the top of the cube during the permutation.
For , it is an exercise in
understanding definitions to see that satisfies the given conditions. We
have already established this for , and can be verified by confirming that
is exactly the permutation from
earlier that we verified worked.
We now assume, for some ,
that is a permutation of with the property that among pairs
from , takes on every
value from through exactly times. We will show that is a permutation of with the property that among
pairs from , takes
on every value from through exactly times.
To see that is a
permutation, let be
arbitrary. Since is a
permutation, there is a unique element such that and a unique element
such that . Using the
definition of , we have
and . Since every element in
is of the form or for some , we have shown that every
element of is in the range
of . Now suppose
are the elements of (in
some order). We have shown that every element of appears in the list
at least once. There are
elements in the list above and elements in , so every element of must appear in the list above
exactly once. In other words, is a permutation of .
To prove the other fact about , we will use the fact that for any and . It is left as an exercise to
verify this.
Suppose is a positive integer
such that . By
induction, there are exactly pairs in with . If is one of these pairs, we have where the second equality is because the
elements in question agree in the last coordinate, so their Hamming
distance is equal to the Hamming distance between the elements obtained
by removing the rightmost elements. We similarly have that Therefore, there are pairs from satisfying for
each .
Next, for any , we
have where the second equality is because we
know that the two elements being handed to have different rightmost coordinates,
so their Hamming distance is one more than the Hamming distance between
the elements obtained by removing the rightmost coordinates. The third
equality is because and differ in all coordinates by definition.
There are elements of , and each pair is in , so we get pairs from such that .
For each from through , we have found pairs from with the property that .
This completes the proof.
Problem 8: May 2023
Problem
This month’s problem is based on Problem 6 part (b) of the 2023 Euclid Contest. Here is a modified and rephrased version of that problem:
A square is drawn in the plane with vertices at , and . Two blue lines are drawn with
slope 3, one passing through
and the other through . Two red lines are drawn
with slope , one
passing through and the other
through . What is
the area of the square bounded by the red and blue lines?
The answer to this question is . We suggest convincing
yourself of this before attempting the rest of the problem. The fact
that this small square has an area exactly one tenth of the larger
square suggests that there is a way to answer this question by showing
that exactly squares the size of
the small one should fit into the large square. Let’s explore!
Here is some terminology that we will use in this problem.
Definition: A lattice point is a point
for which and are both integers.
Definition: A square whose vertices are at lattice points is called a
unit lattice square. We will denote by the unit lattice square with vertices
, , , and .
Definition: denotes the line segment
connecting to the point . Note that this line has slope
.
Definition: An – lattice line is a line with slope
that passes through at least one
lattice point.
The next two definitions are more complicated. There are examples
given after they are stated.
Definition: The tricky unit square, , is a modified version of
(see above) with the property
that if a line reaches an edge of , it "jumps" to the opposite
side and continues with the same slope. For example, if a line reaches
the top edge of , it
continues with the same slope from the bottom edge directly below where
it reached the top edge. If a line reaches a vertex of , then it has simultaneously
reached two edges. In this situation, it continues with the same slope
from the opposite vertex of .
Definition: Let be a rational number written
in lowest terms. The – loop on is the line passing through
with slope .
Below are diagrams, from left to right, of the – loop, the – loop, and the – loop on . In each diagram, equal
letters mark places where the line jumps from one side of the square to
the opposite side.
Each of the loops above eventually come back to their starting point
and repeat. This happens because is rational (think about
this!). Notice that in the image of the – loop (the middle image),
the loop starts at instead of
. This is because a line of
negative slope starting at
(on the bottom edge) immediately jumps to the top edge and continues
from . It is worth thinking
about how all four vertices of really represent the same
point.
Below is an image of
with the – loop in blue
and the – loop in red.
These two loops divide
into smaller squares. The squares numbered , , , , , and are split in two pieces each across
edges of . Notice that
both the loops pass thorough the point since the four vertices of the
square are the same point. This means, if we count the four vertices as
one intersection point, the two loops intersect exactly times (count them!). Think about how
this compares to the Euclid problem mentioned earlier.
A square is divided by six lines: three parallel blue lines with slope and three parallel red lines with slope .
The leftmost blue line goes from the bottom-left vertex of the square to a point one-third of the way along the top side of the square. The next two blue lines start at points one-third and two-thirds along the bottom side of the square.
The topmost red line goes from the top-left vertex of the square to a point one-third of the way down the right side of the square. The next red lines start at points one-third and two-thirds down the right side of the square.
These intersecting lines create sixteen regions that combine to form ten identical smaller squares within the square. Viewing these regions as forming a tilted and non-uniform four by four grid within in the square, the regions in the top row, from left to right, are numbered 1, 2, 3, 4, those in the second row are numbered 4, 7, 8, 5, those in the third row are numbered 5, 9, 10, 6, and those in the bottom row are numbered 6, 1, 2, 3. The regions at the centre, 7, 8, 9, 10, are intact tilted squares. The other regions come in pairs consisting of a smaller piece and a larger piece of a square, both with the same number.
For each pair of loops below, draw with that pair of loops, count
the number of times the loops intersect, and count the number of squares
into which the loops divide .
The – loop and
the – loop.
The – loop and
the – loop.
The – loop and
the – loop.
The – loop and
the – loop.
In the remaining problems, and
are positive integers that have
no positive divisors in common other than .
How many line segments make up the – loop?
How many – lattice lines intersect
?
Explain why the number of points of intersection of the – loop and the – loop on is equal to the number of
small squares into which the loops divide . Remember that the four
vertices of represent
the same point.
How many – lattice lines intersect
?
Compute the area of the small squares in created by the – loop and the – loop. Our solution will
take for granted that these small squares all have the same area, but
you might like to think about how to prove this.
Hint
When counting, remember that some squares are split into pieces.
As well, the four vertices of count together as one
intersection point.
Relate the number of segments to the number of unit lattice
squares through which
passes.
Such a line has an equation of the form . What can you say about
and based on the fact that the line
intersects ? do not forget about
the condition that the line must pass through at least one lattice
point!
Each square has a leftmost vertex.
Similar to the hint for part (c), such a line must have equation
. Try to deduce
restrictions on the value of from
the fact that this line intersects .
Since the squares have the same size (though some might be broken
into pieces) and has
area , the area of the squares can
be computed by computing the number of them. Our solution will put
together the ideas from (b) through (e) to count the squares.
Specifically, the count in part (e) can be related to the number of
squares. Remember to be careful with the four corners of , which count as one
point.
Solution
The following fact will come in handy later in the solution.
Fact 1: Let
and be integers with such that and have no positive divisors in common
other than . If and are distinct lattice points such that
the line segment joining them has slope , then the distance between
and is at least .
Proof of Fact 1. Suppose is the point and is the point where , , , and are integers so that the line segment
joining and has slope . Then the line segment
joining and has slope . By the
assumption on and , the fraction is written in lowest terms,
so we must have and
implying and . The distance between and is ,
which completes the proof. ◻
Below is a picture of the – loop (in blue) and the
– loop (in red) on
.
There are 17 intersection points. Remember that the two loops
intersect at , which is the
same point as the other three vertices of .
There are 17 small squares. Of these , 9 are whole squares in the middle of
, 4 are split into two
pieces over the horizontal edges of , and 4 are split over the
vertical edges of .
Here is a picture of a – loop (in blue) and a – loop (in red) on .
There are 13 intersection points and 13 small squares.
Here is a picture of a – loop (in blue) and a – loop (in red) on .
There are 25 intersection points and 25 small squares.
Here is a picture of a – loop (in blue) and a – loop (in red) on .
There are 2 intersection points and 2 small squares. Here the squares
are harder to see, since both of them pass through the edges of , passing over to the other
side.
It is not a coincidence that the number of small squares is equal to
the number of intersection points in all four cases.
Consider the line of slope through the origin. By
Fact 1, is the lattice point
in the first quadrant that is closest to the origin. Therefore, there
are no lattice points on
other than its endpoints. To illustrate the idea in this solution, the
diagram below shows along
with every unit lattice square through which it passes.
The – loop
continues to wrap around
until it reaches a corner again. Because of how behaves, we can think of the
– loop as the result
of overlaying all of the unit lattice squares in the diagram above on
top of each other. Indeed, if the blue line reaches the top of the
square, it jumps to the bottom with the same -coordinate and continues with the same
slope. This means that after it jumps, what the line does in is the same as what it does as
it continues into the adjacent unit lattice square.
This is true in general. So the number of line segments in the – loop is equal to the number
of unit lattice squares that intersects. By "intersects" we
mean that the line passes through some part of the interior of
the square and not just a vertex.
Therefore, to count the line segments in the – loop, we can count the unit
lattice squares through which passes. Line segment begins in and passes into a new unit lattice
square exactly when it crosses either a vertical line with equation
for some integer or a horizontal line with equation
for some integer . Since passes through no lattice points
other than its endpoints, it will never cross such a vertical and a
horizontal line at the same time. Since starts at and ends at , it must cross such vertical lines and such horizontal lines. Adding one to
account for the unit square from which it originates, this means there
are line
segments in the – loop.
For the – loop, this
gives line segments. By
counting, you can verify that there are indeed unit lattice squares in the diagram
above. If you carefully draw the – loop, you will see that
there are segments.
Since and are positive, is positive. Suppose is a – lattice line that
intersects . Then we must have
, otherwise would be entirely below the graph of
(remember, its slope is
positive). Additionally, we must have that , otherwise would be completely above the graph of
.
The inequality is
equivalent to ,
or . Combining
with , we get , which we will
write as .
So far, we have not used the fact that contains at least one lattice point.
Suppose passes through some
lattice point . That is, and are integers and . Then which can be rearranged
to get . This
implies that there is some integer for which . Substituting into , we get
which is equivalent to . There are exactly integers that satisfy these inequalities, so
there are – lattice lines that
intersect through .
Before moving on, we make the observation that two of the – lattice lines that
intersect only pass through the
vertices and . This means there are actually
– lattice lines that pass
through the interior of .
Since and , the loops are neither
horizontal nor vertical. Therefore each of the small squares has a
unique left-most vertex, which is the vertex with the smallest -coordinate. Each of the small squares
has exactly one left-most vertex, and every point of intersection is the
left-most vertex of exactly one square. This means that the number of
intersection points is equal to the number of squares.
Note that this lends some credibility to counting the four vertices
as one intersection point. See, for example, the and loops from the problem
statement. In that example, the square labelled by is split into two pieces, and its
leftmost vertex is represented by all four vertices of . It might be useful to think about this
for a moment before moving on.
Since and are positive, is negative. Suppose is a – lattice line that
intersects . Since the slope
of is negative, we must have
. Otherwise, would be entirely above the graph
of .
For similar reasoning, we also require that , which means . This can be
rearranged to . Combining with gives . By
essentially the same argument that was used in the solution to part (c),
must take the form for some integer . Therefore, we have and there are exactly integers with this property. These lines are all
different and all intersect , so the answer is .
Before moving on, we note that the lines of slope that pass through and are – lattice lines that
intersect . Therefore, there
are points of
intersection of – lattice lines with , excluding the endpoints. We will
refer to this set of
points several times in the solution to part (f), so we will denote it
by for convenience.
Let be the set of
intersection points of the – loop with the – loop, other than the point
represented by the four corners of . We will show that every point
in corresponds to exactly one
point in , thereby showing that
and have the same number of elements. Since
there are points in , if we include the point in represented by the corners,
this will show that the loops have exactly intersection points in . By part (d), this will show
that the loops divide
into small squares.
The rest of the solution is devoted to showing that and have the same number of elements. We
will use the following fact several times. Its proof is not
included.
Fact 2: If an – lattice line is translated units to the left and units to the right for some integers
and , then the line obtained is also an
– lattice line.
We first observe that every line segment in the – loop is part of some – lattice line. To see why
this is true, consider the solution to part (c). It was discussed there
that the line segments of the – loop can be obtained by
overlaying on the unit lattice
squares through which
passes. For each such unit lattice square, there are integers and so that the square can be translated
units to the left and units down in order to land on . Therefore, if we translate units to the left and units down, it will land exactly on the
segment of the – loop
in question. is part of a
– lattice line, so by
Fact 2 above, we have shown that every line segment in the – loop is part of some – lattice line.
By part (b), there are
segments in the – loop.
By the remark at the end of the solution to part (c), there are – lattice lines that pass
through the interior of . Each of
the segments in the – loop is in the interior of
the square. We conclude that the segments of the – loop are exactly the parts
of – lattice lines that
pass through the interior of .
By very similar reasoning, we can also conclude that the – loop is made precisely of
the segments of – lattice lines that pass
through the interior of .
Consider a point in and suppose it is inside the unit
lattice square with vertices ,
, , and . Let be the – lattice line that
intersects at . The point is in . If we translate and to the left by units and down by units, the resulting lines will
intersect at . By Fact 2,
translating in this way
results in a – lattice
line and translating
results in a – lattice
line. By the previous argument, these will be segments of the and loops, respectively.
Therefore, is a point of
intersection of the two loops. This gives a natural way to translate
points of to points in . In symbols, we send to .
Suppose that and are two different points in . If these points are in different unit
lattice squares, then when they are translated to , they will end up on different line
segments of the – loop,
so they cannot possibly be translated to the same point in . If they are in the same unit lattice
square, then they will be translated to the left and down by exactly the
same amount, so they will end up in different places in . Either way, we can conclude that
different points in are sent to
different points in by the rule
described in the previous paragraph.
Suppose is a point in
. This point is on some line
segment of the – loop,
and we have argued earlier that these segments come from translating
unit lattice squares through which passes (and taking the parts of
along for the ride).
Suppose the segment on which
lies comes from the unit lattice square with vertices , , , and and call this square . The point is in and lies on .
We showed earlier that every line segment on the – loop is part of some – lattice line. Let be the – lattice line on which
lies. If we translate to the right by units and up by units, the result will be a new – lattice line by Fact 2.
Moreover, this line will pass through , which also lies on . Therefore, is a point in that is inside the square . If we translate to the left by and down by , it will be sent to by construction.
We have shown that each of the points in corresponds to exactly one point in
. As explained earlier, if we
consider the four corners of as one intersection point,
then we get a total of
intersection points of the two loops.
As mentioned in the problem, we are taking for granted that the
squares all have the same size. This means that each of them has an area
of exactly .
Referring back to the Euclid problem from the beginning, we were dealing
with the situation where and
, so the area of the squares is
.