Preprint
Article

Partition Entropy as a Measure of Regularity of Music Scales

Altmetrics

Downloads

75

Views

20

Comments

0

A peer-reviewed article of this preprint also exists.

This version is not peer-reviewed

Submitted:

17 April 2024

Posted:

18 April 2024

You are already at the latest version

Alerts
Abstract
The entropy of the partition generated by an n-tone music scale is proposed to quantify its regularity. The normalized entropy relative to a regular partition, and its complementary, here referred to as the bias, allow to analyze various conditions of similarity between an arbitrary scale and a regular scale. Interesting particular cases are scales having limited bias because their tones are distributed along interval fractions of a regular partition. The most typical case in music concerns partitions associated with well-formed scales generated by a single tone h. These scales are maximal even sets that combine two elementary intervals. Then, the normalized entropy depends on the number of tones as well as the relative size of both elementary intervals. When well-formed scales are refined, several nested families stand out with increasing regularity. It is proven that a scale of minimal bias, i.e. with less bias than those with fewer tones, is always a best rational approximation of log2h.
Keywords: 
Subject: Computer Science and Mathematics  -   Applied Mathematics

1. Introduction

An n-tone music scale determines a partition of the octave in n intervals. Regarding their regularity, scales can be of equal divisions of the octave, i.e., with n tones of equal temperament (n-TET scale), where discretization precision increases as the number of tones grows, or of different divisions, where precision increases in two ways, on the one hand, as the size of the intervals decreases and, on the other hand, as its regularity increases. In particular, well-formed scales of one generator, hereinafter referred to as cyclic scales, if they are non-degenerate (of equal temperament) are formed from two elementary intervals [1,2], one longer than the interval of an n-TET scale and the other of shorter width. These scales also fulfill the condition of maximum evenness [3,4], that is, they present the most even distribution, which does not depend on the relative size of the intervals. In many cases, as increasing the number of tones of a cyclic scale, the regularity of the intervals decreases, still satisfying the condition of maximal even set. Therefore, for cyclic scales it is not obvious how to quantify precision and regularity of the partition as increasing the number of tones. The present work has the purpose of analyzing it, as well as studying the bias from regular temperament in more general cases.
The regularity of the intervals of an n-tone scale E n can be quantified in several ways. A fairly common way is from their standard deviation. In the octave, the intervals A i , i I = { 1 , , n } , between consecutive tones, considered as a discrete random variable of equal probability such that i I A i = 1 , have expected value t = 1 n , which is the size of the elementary interval of the n-TET scale. A measure of dispersion of these values about the mean is the standard deviation σ , defined from its square, the variance, as σ 2 = 1 n i I ( A i t ) 2 . In this way it is possible to compare different n-tone scales in terms of the average dispersion of their intervals relatively to the n-TET scale. However, this procedure does not present interesting properties to deal, for instance, with successive refinements of cyclic scales.
Another way to measure the regularity of E n may be based on the quadratic sum of its intervals. We consider it in a more general way. Let us assume that ( M , μ ) is a Lebesgue-measurable space such that μ ( M ) = 1 , and α = { A i } i I is a finite measurable partition of M , that is1, μ ( i I A i ) = 1 and μ ( A i A j ) = 0 if i j . Then, the sum of squares for the partition α relative to the measure μ is S ( α ) = i I μ ( A i ) 2 . The function S ( α ) is a convex function of μ ( A i ) , which constrained to i I μ ( A i ) = 1 has an absolute minimum for μ ( A i ) = 1 n , i . In this case, S ( α ) = 1 n is a decreasing function of n. When the partition is refined, that is, if β = { B j } j J is another partition of the octave, then its sum α β = { A i B j } i I , j J is a refinement of them, in order to make successive refinements it would be desirable to be able to express S ( α β ) = S ( α ) + F ( α , β ) , with a function F that is a linear combination of squares of μ ( A i B j ) , i.e., also a sum of squares of the refinement. But this is not so, because in F it will necessarily appear cross products.
Fortunately, the notions of regularity and fineness of a partition have in several fields a well-known way of being quantified, which is the partition entropy. The concept of entropy was introduced by Clasius in thermodynamics (in 1850), Boltzmann applied it to statistical mechanics (1877), Planck related it to probability theory (1906), Shannon applied it to information theory (1948), Jaynes used it as a measure of uncertainty (1957), and Kolmogorov extended it to deterministic systems (1958).
Certainly, entropy has been used in music almost from the beginning of information theory [5,6,7] by considering the musical language as a source that produces a sequence of symbols representing musical tones and by associating them with certain probabilities according to their frequency of appearance. In particular, a wide range of works have used entropy to identify music styles, e.g., [8,9,10,11] although results may vary depending on the preanalytical assumptions made in order to treat the data (scale degrees, pitch class, octave equivalence, weighting by duration, key-signature dependency, modal bias, etc.). With the same purpose, cross-entropy, e.g., [12] has been used to quantify stylistic similarity between two sequences [13]. An up-to-date review on entropy and other physical parameters applied to music is provided by Gündüz [14].
Nevertheless, it seems that entropy as a measure of regularity of music scales has been neglected. Therefore, in the current paper it is proposed to use the normalized entropy as a measure of their regularity. Its application is illustrated in two cases. On one hand to study the similarity between an arbitrary scale and an n-TET scale. On the other hand, to study the regularity of cyclic scales generated by a tone h and the relationship between their bias and the rational approximation of log 2 h they provide.

2. Partition Entropy

Following the notation and terminology of Arnold and Avez [15], the entropy of the partition2  α is defined from a concave function z ( t ) as follows,
H ( α ) = i I z ( μ ( A i ) ) ; z ( t ) = t log 2 t , 0 < t 1 0 , t = 0
It is fulfilled3,
H ( α ) = n 1 n i z ( μ ( A i ) ) n z 1 n i μ ( A i ) = n z ( 1 n ) = log 2 n
Therefore, for a partition of n elements, the value log 2 n is the maximum value that the entropy can reach, and it is reached when the elements of the partition are of equal measure.
Thus, i z ( μ ( A i ) ) = i μ ( A i ) log 2 1 μ ( A i ) is the weighted average of the values log 2 1 μ ( A i ) (what in probabilities would be the information associated with the value of the random variable A i , i.e., the expected value of the information content or self-information of the variable A i ). The smaller μ ( A i ) , the greater is the above logarithm, so this average gives an idea of how much refined the partition is.
Shannon (1948) showed that the entropy defined in this way is the only function that, except for a multiplicative constant, satisfies the following postulates:
1.
H ( α ) is a continuous function of μ ( A i ) .
2.
If μ ( A 1 ) = = μ ( A n ) = 1 n , then H ( α ) is an increasing function of n.
3.
Let’s consider partitions α = { A i } i I , β = { B j } j J and a refinement of both α β = { A i B j } i I , j J . The conditional entropy of α relative to β is defined as
H ( α / β ) = j μ ( B j ) i z ( μ ( A i / B j ) ) j μ ( B j ) i z ( μ ( A i B j ) μ ( B j ) )
where μ ( A i / B j ) is the conditional measure of A i relative to B j . Then, the following specific property of entropy (sub-additivity)4 holds,
H ( α β ) = H ( α ) + H ( β / α ) H ( α ) + H ( β )
4.
If φ is an automorfism of ( M , μ ) and α = { A i } i I , φ α = { φ A i } i I are both partitions, then φ ( α β ) = φ α φ β and H ( α / β ) = H ( φ α / φ β ) .
Therefore, the partition entropy seems an appropriate parameter to measure how much a scale of n arbitrary tones differs from an n-TET scale. It is also the parameter that provides us with an estimate of how refined a scale is, because the entropy of a scale of n tones will always be greater than that of a subscale of n 1 tones, and less than or equal to that of an n-TET scale. This will be useful when dealing with cyclic scales. In addition, as the intervals of a partition are subdivided, the entropy is additive with regard to the intervals being refined, which saves calculations.

3. Cyclic Scales

Computation of entropy for cyclic scales requires a brief review of their properties, which hereinafter are summarized by following Cubarsi [16], Cubarsi [17].
For any frequency ratio ν Ω ( 0 , ) , the values 2 k ν , k Z , define one equivalence class. A ratio of 2 between values corresponds to the frequency range of one octave. The set Ω is a commutative group for multiplication. The set of all the octaves of the fundamental frequency ratio ( ν 0 = 1 ) is a monogenous subgroup of Ω of infinite cardinal, Ω 2 = { 2 k , k Z } . The frequency classes are the elements of the quotient group Ω 0 = Ω / Ω 2 , also commutative for multiplication. For each equivalence class, we choose a representative in [ 1 , 2 ) , the reference octave, which we identify with Ω 0 . A finite set of these representatives will be referred to as scale tones.
An n-tone cyclic scale E n h is a scale of one generator, a real positive value h, which satisfy the symmetry condition, consisting of displaying several degrees of rotational symmetry that are equivalent to the closure condition [18,19,20], although such an equivalence does not hold for scales with more than two generators [21,22]. The partition of the octave induced by the scale notes has exactly two sizes of scale steps, and each number of generic intervals occurs in two different sizes, which is known as Myhill’s property [1,2]. For h = 3 we meet the particular case of the 12-tone Pythagorean scale, generated by fifths, as well as those listed in Table 1, and for h other than a rational power of 2 (which would lead to degenerate cases of equal temperament scales) we get generalized Pythagorean scales.
The scale tones are ν k = h k 2 k ; k = 0 , , n 1 ; with k = k log 2 h (floor function). When the scale tones are ordered clockwise direction from lowest to highest pitch in [ 1 , 2 ) (say, in cyclic order) we find two extreme tones, the minimum tone ν m and the maximum tone ν M , which determine the two elementary factors U = ν m = h m 2 m (up the fundamental) and D = 2 ν M = 2 M + 1 h M (down the fundamental) associated with the generic widths of the step interval, so that U M D m = 2 . The indices satisfy n = m + M , all of them coprime.
The tone ν n = h n 2 n , which does not belong to the scale E n h , provides the closure condition (either ν n 1 + or ν n 2 ) determining the n-order comma κ n = min ( ν n , 2 ν n ) (in the frequency space), i.e., the error in closing the scale near the fundamental, with no other scale notes between them. The comma itself does not provide information about whether ν n closes above or below the fundamental. By using the index N = m + M + 1 = n log 2 h + 1 2 , two parameters provide this information. On the one hand, the scale closure, γ n = h n 2 N , which is a value close to 1 satisfying U D = γ n . On the other hand, the scale digit  δ = N n , taking values 0 or 1. Then, δ = 0 γ n > 1 ( ν n 1 + , γ n = κ n ) or δ = 1 γ n < 1 ( ν n 2 , γ n = κ n 1 ). The value | log 2 γ n | = log 2 κ n multiplied by 1200 measures the distance from ν n to 1 in cents (Preprints 104176 i002).
Alternatively, each tone ν k in the frequency space is associated with a note or pitch class log 2 ν k in the octave S 0 = R / Z , so that the above quantities have the corresponding one in S 0 . Thus, the elementary intervals u = log 2 U , d = log 2 D generate the partition of the octave in n intervals, M u + m d = 1 , satisfying u d = ϕ n , with interval closure ϕ n = log 2 γ n and interval comma | ϕ n | = log 2 κ n . Sometimes is more useful to work in the multiplicative space of tones and sometimes in the additive space of notes.
The fractions N n are associated with convergent or semi-convergent continued fraction expansions of log 2 h [23,24]. Among cyclic scales, two categories may be pointed out. On the one hand, optimal scales, associated with the best closure γ n 1 , corresponding to the best rational approximations5  N n of log 2 h , i.e., the convergents of its canonical continued fraction expansions from both sides. On the other hand, what we shall name accurate scales, which are associated with the best estimations of the generator tone 2 N n h 1 , corresponding to the good rational approximations  N n of log 2 h . Apart from accurate scales, which include optimal scales, there are cyclic scales not associated with good or best rational approximations, still corresponding to semiconvergents. For all these scales, the values U , D and γ n have bounds according to Appendix A. In particular, | log 2 γ n | quantifies the error of the rational approximation of log 2 h . These bounds determine the interval between a note of the cyclic scale E n h and the one with the same ordinal in the n-TET scale, which is analyzed in Appendix B.
Finally, the family of cyclic scales follow a chain, E n h E n + h , so that starting from the indices ( m , M ) of E n h , the same values for the next scale E n + h are (i) m + = m + M , M + = M δ = 0 , and (ii) m + = m , M + = m + M δ = 1 (see Table 1).

4. Entropy of a Cyclic Scale

For a cyclic scale E n h , with the measure defined for tone intervals as μ ( [ x 0 , x 1 ] ) = log 2 x 1 x 0 for 0 < x 0 x 1 , consider the partition of the octave α ( u M , d m ) composed of M intervals of width u and m intervals of width d, regardless of the order in which the intervals follow each other. The partition entropy is
H ( u M , d m ) = M z ( u ) + m z ( d )
We will explicitly write it in terms of the indices of the minimum and maximum tones m and M. In order to do so, we define the values
| x = { log 2 h x } , x | = 1 { log 2 h x } ; x R
where { log 2 x } is the mantissa of log 2 x , that is, log 2 h x log 2 h x , so that the elemental intervals u , d can be expressed in terms of the indices of the minimum and maximum tones as
u = | m , d = M |
In this way, Eq. 4 becomes
H ( | m M , M | m ) = M z ( | m ) + m z ( M | )
Let’s see how to express the entropy of the cyclic scale E n + h , the one following E n h in the chain of cyclic scales. The scale refinement process is as follows. The partition ( u M , d m ) breaks, so that the major interval splits into two, one of the same size as the minor one plus a remainder. This residual is only smaller than the size of the smaller interval when the scale is optimal. This process is iterated.
As explained in §Section 3, we must distinguish two cases, depending on whether γ n is greater or less than 1, or equivalently, if δ equals 0 or 1: (i) δ = 0 , γ n > 1 , U > D , u > d . In this case, N = n , m + = n , M + = M .
The refinement is done in the M intervals of size u and the m intervals of size d are maintained,
u = u d = ϕ n u | m + = | m M | d = d d M + | = M |
We express6 the entropy according to Eq. 3, being the new partition α β = ( u ( M + , d ( m + ) ,
H ( u ( M + , d ( m + ) = H ( u M , d m ) + M u H ( d u , 1 d u ) = = H ( u M , d m ) + M [ z ( d ) z ( u ) + z ( κ n ) ]
In terms of the indices of the extreme tones, by noting H n = H ( E n h ) , it can be written as
H n + = H n + Δ M ; Δ M = M [ z ( M | ) z ( | m ) + z ( κ n ) ]
(ii) δ = 1 , γ n < 1 , U < D , u < d . In this case, N = n + 1 , m + = m , M + = n .
The refinement is done in the m intervals of size d and the M intervals of size u are maintained,
d = d u = ϕ n d M + | = M | | m u = u u | m + = | m
The entropy of the new partition α β = ( u ( M + , d ( m + ) is
H ( u ( M + , d ( m + ) = H ( u M , d m ) + m d H ( u d , 1 u d ) = = H ( u M , d m ) + m [ z ( u ) z ( d ) + z ( κ n ) ] )
In terms of the indices of the extreme tones it is equivalent to
H n + = H n + Δ m ; Δ m = m [ z ( | m ) z ( M | ) + z ( κ n ) ]

5. Partition Modulation

According to Eq. A2, we write the elementary intervals of a cyclic scale as
u = 1 + m ϕ n n , d = 1 M ϕ n n
We have referred to one degenerate case of cyclic scale, the limiting case when the two intervals are one, u d . In this case u = d = 1 n and E n h = E n , the n-TET scale. This is equivalent to ϕ n 0 in the expressions of Eq. 9. However, we should consider two more degenerate cases. The larger is one interval, the smaller is the other, always filling a full octave, M u + m d = 1 . Therefore, if d 0 , then the cyclic scale becomes an equal temperament scale of M tones and, if u 0 , the scale becomes an equal temperament scale of m tones. In other words,
(i)
According to Appendix A, for ϕ n > 0 , by Eq. A5, ϕ n < 1 M . If ϕ n 1 M then d 0 . In this case, the intervals of the cyclic scale satisfy 0 < d < ϕ n < u . Since u d = ϕ n , then, when u ϕ n the scale in no more non-degenerate and becomes an M-TET scale, with entropy
H n = H ( u M , d m ) d 0 H ( u M ) = H M = log 2 ( M )
Obviously, an optimal cyclic scale is far from this situation, because it satisfies 0 < ϕ n < d < u .
(ii)
Also, according to Appendix A, for ϕ n < 0 , owing to Eq. A8, ϕ n > 1 m . If ϕ n 1 m then, u 0 . In this case, the intervals of the cyclic scale satisfy 0 < u < | ϕ n | < d . Then d | ϕ n | , so that the scale becomes a degenerate m-TET scale, with entropy
H n = H ( u M , d m ) u 0 H ( d m ) = H m = log 2 ( m )
An optimal cyclic scale case is also far from this situation, since it satisfies 0 < | ϕ n | < u < d .
Lemma 1.
The scales formed from the two elementary intervals u = 1 + m ξ n and d = 1 M ξ n with values 1 m < ξ < 1 M generate an infinite and continuous family of n-tone cyclic scales C n h ( ξ ) , which are neighbors of E n h , such that C n h ( ϕ n ) = E n h and C n h ( 0 ) = E n . In addition, if 1 n + m < ξ < 1 n + M , the scales C n h ( ξ ) are optimal.
These results are immediate consequence of the bounds obtained in Appendix A.

5.1. Modulating Temperament Scales

The most usual case of cyclic scale E n h is the one associated with a generator corresponding to a harmonic h Z + of the fundamental tone. Then, the tonal class of the generator is g = h log 2 h ( 1 , 2 ) , which can be written as g = 2 μ + ϕ n n , where μ = N 1 n is the ordinal of the scale tone that better approximates g [17], known as chromatic length of the pitch class of the generator. The value μ = n log 2 g + 1 2 also determines the indices m and M of the extreme tones, corresponding to the ordinals 1 and n 1 of the scale notes. Since, as μ m n m = 1 , m is the positive integer, 0 < m < n , so that μ m = 1 mod n .
For fixed n (and also m and M), the family of cyclic scales C n h ( ξ ) has generators that, in general, are not the tonal class of any harmonic, that is, they are real scales. In this case we write them as g = 2 μ + ξ n , so that, for ξ ( 1 m , 1 M ) , we get a partition modulation, which is a continuum of irregular temperaments [27,28] close to E n h and E n , also corresponding to cyclic scales. Hence, the n-tone cyclic scales C n h ( ξ ) is a family of modulating temperament scales around the generator 2 μ n , namely T n μ = { C n h ( ξ ) ; ξ ( 1 m , 1 M ) } .
For example, for n = 12 , and g = 5 4 , the value ξ = 12 4 log 2 5 12 4 log 2 5 + 1 2 = 0.03422 gives rise to the quarter-comma meantone temperament, which fits well the class of the fifth harmonic, in exchange for decreasing in 5 the accuracy of the third one. Since the value of | ξ | is lower than 1 n + m = 1 19 = 0.05263 , it results in an optimal scale.

5.2. Entropy in Terms of the Closure

Let us consider an n-tone modulating the temperament scale C n h ( ξ ) T n μ . According to equations 4 and 9, we write the entropy of C n h ( ξ ) as
H n = H ( m , M , ξ ) = M z ( 1 + m ξ n ) + m z ( 1 M ξ n )
Using the relationship z ( a b ) = 1 b z ( a ) + a z ( 1 b ) , it is immediate to see that
H ( m , M , ξ ) = log 2 n + M n z ( 1 + m ξ ) + m n z ( 1 M ξ )
that is,
H ( m , M , ξ ) = log 2 n M n ( 1 + m ξ ) log 2 ( 1 + m ξ ) m n ( 1 M ξ ) log 2 ( 1 M ξ )
By deriving Eq. 10 respect of ξ , we get,
H ξ = M m n [ log 2 ( 1 + m ξ ) log 2 ( 1 M ξ ) ]
If 0 < ξ < 1 M , the two terms of the previous equation are positive and the derivative is negative, therefore, in this interval the entropy is a decreasing function. If 1 m < ξ < 0 , both terms in the above equation are negative and the derivative is positive, so in this interval the entropy is an increasing function. At ξ = 0 there is a local maximum, since log 2 ( 1 + m ξ ) log 2 ( 1 M ξ ) = 0 if and only if ξ = 0 . Thus, H in terms of ξ is a concave function, since 2 H ξ 2 = M m n ln 2 ( m 1 + m ξ + M 1 M ξ ) < 0 . Therefore,
(a)
If 0 < ξ < 1 M , then H ( m , M , 0 ) = log 2 n > H ( m , M , ξ ) > log 2 M = H ( m , M , 1 M ) .
(b)
If 1 m < ξ < 0 , then H ( m , M , 1 m ) = log 2 m < H ( m , M , ξ ) < log 2 n = H ( m , M , 0 ) .
In addition, it is straightforward to see the following properties,
(c)
If m < M , H ( m , M , ξ ) < H ( m , M , ξ ) for ξ > 0 and H ( m , M , ξ ) > H ( m , M , ξ ) for ξ < 0 .
(d)
If m > M , H ( m , M , ξ ) > H ( m , M , ξ ) for ξ > 0 and H ( m , M , ξ ) < H ( m , M , ξ ) for ξ < 0 .
(e)
H ( m , M , ξ ) > H ( M , m , ξ ) for 1 m < ξ < 1 M
Properties (c) and (d) refer to the slight asymmetry of the entropy far from ξ = 0 , which, as we shall see in the last section, is negligible around ξ = 0 . Property (e) means that a cyclic scale generated by h and its inverse scale, generated counter-clockwise with swapped indices of the extreme tones, although they have different notes that differ in one comma, have the same entropy.
Therefore, the entropy of any n-tone modulating temperament scale in the family T n μ only depends on the closure.

6. Normalized Entropy

Since the entropy increases as the partition is refined, to be able to compare qualities of different scales, whether they are cyclic or not, we will consider the normalized entropy, i.e., relatively to the maximum value it can reach, that of the equally tempered scale. For an n-tone scale E n it is defined as7
η ( E n ) = 1 log 2 ( n ) H ( E n ) ; 0 η 1
so that η tends to 1 as E n approaches to an n-TET scale. For a cyclic scale E n h , we will write η n = η ( E n h ) .
Let us see in which case the normalized entropy increases when a scale is being refined. Let us assume two scales, not necessarily cyclic, E n E n + with n < n + (i.e., E n + is a refinement of E n ).
Lemma 2.
The normalized entropy increases if and only if the relative increment of entropy in refining the partition is greater than the relative increment of entropy of the corresponding equal temperament scales.
Proof. 
By the sub-additivity property, H ( E n + ) = H ( E n ) + Δ . It will be η ( E n ) < η ( E n + ) if and only if H ( E n ) log 2 n < H ( E n + ) log 2 n + , so H ( E n + ) log 2 n < H ( E n + ) + Δ log 2 n + . Hence
log 2 n + log 2 n log 2 n < Δ H ( E n )
The deviation of an n-tone scale E n , not necessarily cyclic, relatively to the regular n-TET scale will be measured from the complementary of the normalized entropy, which we will call bias,
θ ( E n ) = 1 η ( E n ) ; 0 θ 1

7. Bias of a Cyclic Scale

For a non-degenerate n-tone cyclic scale C n h ( ξ ) T n μ the bias θ ( C n h ( ξ ) ) = θ n ( ξ ) depends on the divergence of its elementary intervals u and d with regard to the elementary interval of the n-TET scale. Thus, bearing in mind Eq. 10,
θ n ( ξ ) = 1 n ln n [ M ( 1 + m ξ ) ln ( 1 + m ξ ) + m ( 1 M ξ ) ln ( 1 M ξ ) ]
The graph of the function θ n is the mirror image up-to-down of H n , scaled by the factor 1 log 2 n , hence it is convex (Figure 1); the value θ n ( 0 ) = 0 is its minimum in the interval ξ ( 1 m , 1 M ) , and at the extremes (corresponding to degenerate scales) it takes the values θ n ( 1 m ) = 1 ln m ln n and θ n ( 1 M ) = 1 ln M ln n . Therefore, in the interval ( 1 m , 0 ) , θ n ( ξ ) is a decreasing function and in ( 0 , 1 M ) it is an increasing function of ξ .
Remark that this behavior of θ n in terms of ξ holds when n is fixed. In other words, θ n = θ ( ξ ; m , M ) . Then, although for another n -tone cyclic scale, n n , the bias θ n has a similar behavior with respect to ξ , we cannot assure that if ξ < ξ then θ ( ξ ; m , M ) < θ ( ξ ; m , M ) , since this also depends on the values of m and M , such that m + M = n .
Figure 2 shows the trends of the bias θ n and the interval comma | ϕ n | for Pythagorean scales (generated by h = 3 ) in terms of n. In general, optimal scales have low bias, but not always a scale is refined the bias decreases. For example, the first optimal scales are those of n = 5 ; 12 ; 41 ; 53 ; 306 ; 665 and the bias decreases for n = 5 ; 12 ; 53 ; 665 but the scale of n = 41 has greater bias than that of n = 5 ; 12 ; and the one of n = 306 has greater bias than those of n = 12 ; 53 . Also notice that there are intervals where | ϕ n | decreases but θ n increases.
We will say that a cyclic scale E n h is of minimal bias (MB) if for any cyclic scale E n h with n < n , then θ n < θ n . Obviously, this is tantamount to say that η n > η n , hence, the scale has greater normalized entropy than the previous ones. Not all optimal cyclic scales are MB.
Before a deeper analysis, in an approximate way we may estimate how close an MB scale and an equal temperament scale of the same number of tones are. We may use the criterion for which a cyclic scale (not necessarily optimal) unambiguously approximates an n-TET scale if every note of the former is at a distance equal or less than half an elementary interval from the latter. According to Eq. A12 with λ = 2 , this leads to the condition ( κ n ) 600 n 1 (say condition I|2n).
This condition can be satisfied by optimal and non-optimal cyclic scales. For example, for n = 3 condition I|2n is hold although the scale is not optimal. On the contrary, n = 41 ; 306 ; 111 202 are optimal scales but the condition I|2n is not met.
Let’s see what happens with the MB condition. In this case, up to n = 2 · 10 5 , all scales that satisfy MB also satisfy I|2n, but there are scales that are I|2n and not MB scales, for example for n = 3 ; 306 ; 15 601 ; 31 867 ; 79 335 . Hence, the condition that a note of the cyclic scale is closer than half an interval of a note of a tempered scale is weaker than the MB condition. On the contrary, if we further restrict the above condition, let’s say, that the respective notes are at most at a third interval, i.e., ( κ n ) 400 n 1 (say condition I|3n), then, up to n = 2 · 10 5 all I|3n scales are MB scales. In Table 1 these and other properties are displayed.

8. Scales with Limited Bias

8.1. Scale Distributed within Regular Intervals

We study several families of scales for which it is possible to estimate a priori the lower limit of the entropy.
We say that two n-tone scales E n and E n sharing the fundamental tone alternate if their notes in cyclic order, s j = log 2 σ j E n , s j = log 2 σ j E n , 0 < j < n , fulfill one of the following conditions:
( a ) s j 1 < s j s j ; 0 < j < n ( E n alternate by the right of E n ) ( b ) s j 1 < s j s j ; 0 < j < n ( E n alternate by the left of E n )
If E n is the equal temperament scale E n , then we write σ j = ϑ j and say that E n is regular interval distributed (RID). In this case, the alternance can also be defined from the following relations involving intervals between tones8,
( a ) 0 I ( σ j , ϑ j ) < 1 n ( b ) 0 I ( ϑ j , σ j ) < 1 n ; 0 < j < n
Note that these conditions are generally more restrictive than the condition
d ( σ j , ϑ j ) < 1 n 1 n < I ( σ j , ϑ j ) < 1 n
Obviously, Eq. 16 implies the condition of Eq. 17, although, as seen in Appendix B, for cyclic scales they are equivalent.
For example, suppose case (a). If 0 I ( σ j , ϑ j ) < 1 n , then the interval between two consecutive notes of E n satisfies I ( σ j , σ j + 1 ) = I ( σ j , ϑ j + 1 ) + I ( ϑ j + 1 , σ j + 1 ) = 1 n I ( σ j , ϑ j ) + I ( ϑ j + 1 , σ j + 1 ) < 2 n , so that d ( σ j , σ j + 1 ) < 2 n . Instead, if d ( σ j , ϑ j ) < 1 n , then d ( σ j , σ j + 1 ) d ( σ j , ϑ j ) + d ( ϑ j , ϑ j + 1 ) + d ( σ j + 1 , ϑ j + 1 ) < 3 n . In a similar way would be reasoned case (b).
Lemma 3.
The interval between two consecutive notes of an n-tone RID-scale is lower than 2 n .

8.2. Scale r-Similar to n-TET

A criterion for measuring the proximity between scales is similarity [30]. Two n-tone scales T and T are similar at level r > 0 (r-similar) if for each tone τ i T , τ j S , such that d ( τ i , τ j ) r and d ( τ i , τ k ) > r , k j .
For example, a cyclic scale E n h and the n-TET scale have level of similarity 1 2 n if and only if any pair of notes with ordinal j, i.e., ϑ j E n h , ϑ j E n , satisfy d ( ϑ j , ϑ j ) 1 2 n . This condition is equivalent to the condition of Eq. A12 with λ = 2 , ( κ n ) 600 n 1 .
However, for the current purpose of evaluating and comparing entropy partitions, we will slightly modify such a concept by assuming that the fundamental is shared by both scales. Then, we say a scale E n is similar to the n-TET scale at level r, with 0 < r 1 2 n , if their tones satisfy
d ( σ j , ϑ j ) r ; 0 < j < n

8.3. Scale One-Side r-Similar to n-TET

We introduce a concept that will mix the one of similarity with the one of distribution within regular intervals, which will be appropriate to study the bounds of the entropy for cyclic scales.
Let ϑ j be the tones of the n-TET scale. We say that a scale E n is one-side r-similar at level r (r-OSS) to the n-TET scale, with 0 < r 1 n , if
( a ) 0 I ( σ j , ϑ j ) < r ( b ) 0 I ( ϑ j , σ j ) < r ; 0 < j < n
It is OSS by the right (a) or by the left (b), respectively. Therefore, when r = 1 n it matches the definition of a RID-scale.

9. Entropy of a Scale 1 λ n -OSS to n -TET

Consider an n-tone scale E n , not necessarily cyclic, which is 1 λ n -OSS to n-TET for λ 1 . We already know that the maximum entropy is log 2 n . Let’s see the minimum entropy it can reach. We assume case (a) of Eq. 19 and consider the n notes ( n 2 ) of E n in cyclic order in [ 0 , 1 ) { 1 } , since the entropy does not change if we add a null set. We also extend the range of possible variation of the intervals between notes at the extremes to include the limiting, degenerate cases. The points determining the division of the octave are
s 0 = 0 ; s i = i n + t i , 0 t i 1 λ n , 1 i n 1 ; s n = 1
By writing T = 1 n , the respective intervals are, by assuming t 0 = t n = 0 ,
T i = s i s i 1 = T + t i t i 1 , 1 i n
Then, scale E n generates the partition α = { T i } , i { 1 , , n } , with entropy
H ( α ) = i = 1 n z ( T i ) = i = 1 n ( T + t i t i 1 ) log 2 ( T + t i t i 1 )
The entropy H ( α ) is a concave and differentiable function f ( t 1 , , t n 1 ) , f : V R , with V = [ 0 , 1 λ n ] n 1 , a hypercube, which is a convex and compact space. Then, f has a local and global maximum at ( t 1 , , t n 1 ) = ( 0 , , 0 ) , corresponding to an n-TET scale with T i = T and H = log 2 n , and the global minimum is reached at one or some of the vertices of V. These vertices are determined by the possible values t j { 0 , 1 λ n } , 1 j n 1 , and the resulting intervals T i can only take the following values: interval T 1 , values 1 n and λ + 1 λ n ; interval T n , values 1 n and λ 1 λ n ; and the intermediate intervals, values λ 1 λ n , 1 n and λ + 1 λ n .
The octave is covered by a number of different intervals satisfying
A + B + C = n
of which A in number have width λ 1 λ n , B width 1 n , and C width λ + 1 λ n , so that9 A λ 1 λ n + B λ λ n + C λ + 1 λ n = 1 . Therefore,
A ( λ 1 ) + B λ + C ( λ + 1 ) = λ n
The entropy at one of these vertices can be written in terms of the respective number of intervals as
f = A z ( λ 1 λ n ) + B z ( λ λ n ) + C z ( λ + 1 λ n ) = = log 2 n + 1 n g ( A , C ) ; g ( A , C ) = A z ( λ 1 λ ) + C z ( λ + 1 λ )
where z is the concave function defined in Eq. 1, also extended for values t > 1 , where z ( t ) < 0 . Among possible configurations, we look for the minimum value that f can take, which will correspond to the minimum value of g. Notice that this quantity is added to the entropy of an n-TET scale, being it consistent with the fact that the B intervals of width 1 n are not involved in the expression. However, the intervals in number C, of greater width than 1 n , contribute to decreasing the entropy, while those in number A, of width lower than 1 n , contribute to increasing it, since the corresponding function z evaluated in values less than 1 is positive.
In the current case, by combining equations 20 and 21, we get A = C and B + 2 C = n . Therefore,
g ( C ) = C Z ( λ , 1 ) ; Z ( λ , a ) = z ( λ a λ ) + z ( λ + a λ ) , λ a
If n is even, the maximum value for C, by assuming B = 0 , is C = n 2 . Hence, the number of intervals of null-size is also A = n 2 . Then10, g ( n 2 ) = n 2 Z ( λ , 1 ) , corresponding to a degenerate scale of n 2 non-null intervals. Therefore, if n = 2 ˙ the entropy satisfies log 2 n + 1 2 Z ( λ , 1 ) H ( α ) .
If n is odd, the maximum value for C and A is n 1 2 , with B = 1 . Then, g = n 1 2 Z ( λ , 1 ) , corresponding to a degenerate scale of n + 1 2 non-null intervals. Therefore, if n 2 ˙ the entropy satisfies log 2 n + 1 2 Z ( λ , 1 ) ( 1 1 n ) H ( α ) .
Case (b) is similar. It would be reasoned by considering that f ( t 1 , , t n 1 ) = f ( t n 1 , , t 1 ) , i.e., it is equivalent to case (a), but following the intervals from right to left starting at 1. Therefore,
Theorem 1.
The entropy and bias of a scale 1 λ n -OSS to n-TET satisfy
n = 2 ˙ , log 2 n + 1 2 Z ( λ , 1 ) H ( α ) log 2 n ; 0 θ ( α ) 1 2 log 2 n Z ( λ , 1 ) n 2 ˙ , log 2 n + 1 2 Z ( λ , 1 ) ( 1 1 n ) H ( α ) log 2 n ; 0 θ ( α ) 1 2 log 2 n Z ( λ , 1 ) ( 1 1 n )
We explicitly write two cases. For λ = 1 , since Z ( 1 , 1 ) = 2 , the entropy of a RID-scale satisfies, if n = 2 ˙ , log 2 n 2 H ( α ) log 2 n and, if n 2 ˙ , log 2 n 2 + 1 n H ( α ) log 2 n .
For λ = 2 , since Z ( 2 , 1 ) = 0.189 , the entropy of a scale 1 2 n -OSS to n-TET satisfies, if n = 2 ˙ , log 2 n 0.189 H ( α ) log 2 n and, if n 2 ˙ , log 2 n 0.189 ( 1 1 n ) H ( α ) log 2 n .
Levels for λ = 1 , 2 , 4 , 8 , 16 are displayed in Figure 2.

10. Entropy of a Scale 1 λ n -Similar to n -TET

Let us calculate the minimum entropy that can reach an n-tone scale E n , not necessarily cyclic, having a similarity level 1 λ n with the n-TET scale for λ 2 . According to the previous notation and considerations, the points that determine the division of the octave are
s 0 = 0 ; s i = i n + t i , 1 λ n t i 1 λ n , 1 i n 1 ; s n = 1
The respective intervals are, T i = s i s i 1 = 1 n + t i t i 1 , 1 i n , by assuming t 0 = t n = 0 . As before, we write the entropy of the partition α = { T i } , i { 1 , , n } , as H ( α ) = i = 1 n z ( T i ) = f ( t 1 , , t n 1 ) , where f : V R , with V = [ 1 λ n , 1 λ n ] n 1 , a convex and compact space. The local and global maximum of f takes place at ( t 1 , , t n 1 ) = ( 0 , , 0 ) , corresponding to the scale of n-TET, with T i = T and H = log 2 n . The global minimum is reached at one or some of the vertices of V. These vertices are determined by the possible values t j { 1 λ n , 1 λ n } , 1 j n 1 , and the resulting intervals T i can only take the following values: the extreme intervals T 1 and T n values λ 1 λ n and λ + 1 λ n , while the intermediate intervals can have values λ 2 λ n , 1 n , and λ + 2 λ n . The octave is covered by a number of different intervals satisfying
A + B + C + D + E = n
of which A in number have width λ 2 λ n , B width λ 1 λ n , C width 1 n , D width λ + 1 λ n and E width λ + 2 λ n , so that A λ 2 λ n + B λ 1 λ n + C λ λ n + D λ + 1 λ n + E λ + 2 λ n = 1 . Therefore,
A ( λ 2 ) + B ( λ 1 ) + C λ + D ( λ + 1 ) + E ( λ + 2 ) = λ n
The entropy at one of these vertices can be written in terms of the respective number of intervals as
f = A z ( λ 2 λ n ) + B z ( λ 1 λ n ) + C z ( λ λ n ) + D z ( λ + 1 λ n ) + E z ( λ + 2 λ n ) = = log 2 n + 1 n g ( A , B , D , E ) , g ( A , B , D , E ) = A z ( λ 2 λ ) + B z ( λ 1 λ ) + D z ( λ + 1 λ ) + E z ( λ + 2 λ )
Once again, the intervals of width 1 n are not involved in this expression. The intervals in number D and E, of greater width than 1 n , contribute to decreasing the entropy, while those who are there in number A and B, of width lower than 1 n , contribute to increasing it.
The minimum value of g is obtained by the highest possible values of E and D (in that order) and minimum of A and B.
In the current case, by combining equations 25 and 26, we get A = E + 1 2 D 1 2 B and
B + 2 C + 3 D + 4 E = 2 n
If n is even, at most there can be E = n 2 intermediate intervals of width λ + 2 λ n . By substitution of this value into Eq. 28, we get B + 2 C + 3 D = 0 , so that B = C = D = 0 and A = n 2 . Hence, the minimum g is g ( n 2 , 0 , 0 , n 2 ) = n 2 Z ( λ , 2 ) , corresponding to a degenerate scale of n 2 non-null intervals. Therefore, if n = 2 ˙ the entropy satisfies log 2 n + 1 2 Z ( λ , 2 ) H ( α ) .
If n is odd, we should examine two alternatives. On the one hand, if we assume there are E = n 1 2 intermediate intervals of width λ + 2 λ n , by substitution into Eq. 28 we get B + 2 C + 3 D 2 = 0 . Possible interval values are ( B , C , D ) = ( 0 , 1 , 0 ) ; ( 2 , 0 , 0 ) . In this case, the minimum value of g is provided by the first triad, i.e., B = D = 0 , C = 1 , A = n 1 2 , which yields g ( n 1 2 , 0 , 0 , n 1 2 ) = n 1 2 Z ( λ , 2 ) , corresponding to a degenerate scale of n + 1 2 non-null intervals. In this case,
log 2 n + 1 2 Z ( λ , 2 ) ( 1 1 n ) H ( α )
On the other hand, as second alternative, if there are E = n 3 2 intermediate intervals of width λ + 2 λ n , it is hold B + 2 C + 3 D 6 = 0 . Possible interval values are ( B , C , D ) = ( 0 , 0 , 2 ) ; ( 1 , 1 , 1 ) ; ( 3 , 0 , 1 ) ; ( 0 , 3 , 0 ) ; ( 4 , 1 , 0 ) . The minimum of g is provided by the first triad, so that B = C = 0 , D = 2 , A = n 1 2 , and g ( n 1 2 , 0 , 2 , n 3 2 ) = n 1 2 z ( λ 2 λ ) + 2 z ( λ + 1 λ ) + n 3 2 z ( λ + 2 λ ) corresponding to a degenerate scale of n + 1 2 non-null intervals. Hence,
log 2 n + 1 2 Z ( λ , 2 ) 1 1 n z ( λ 2 λ ) 4 z ( λ + 1 λ ) + 3 z ( λ + 2 λ ) Z ( λ , 2 ) H ( α )
The factor multiplying 1 n is a positive value between 3 ( 2 log 2 3 ) = 1.245 and 3 2 for λ 2 . Since the value Z ( λ , 2 ) is negative, the above factor increases it. Hence, the entropy of Eq. 29 is lower than the entropy of Eq. 30. Therefore,
Theorem 2.
The entropy and bias of a scale 1 λ n -similar to n-TET satisfy
n = 2 ˙ , log 2 n + 1 2 Z ( λ , 2 ) H ( α ) log 2 n ; 0 θ ( α ) 1 2 log 2 n Z ( λ , 2 ) n 2 ˙ , log 2 n + 1 2 Z ( λ , 2 ) ( 1 1 n ) H ( α ) log 2 n ; 0 θ ( α ) 1 2 log 2 n Z ( λ , 2 ) ( 1 1 n )
We explicitly evaluate the case for λ = 2 . Since Z ( 2 , 2 ) = 2 , if n = 2 ˙ , then log 2 n 2 H ( α ) log 2 n and, if n 2 ˙ , then log 2 n 2 + 1 n H ( α ) log 2 n , which are the same bounds as for a RID-scale.

11. Cyclic RID-Scales

We may reformulate Theorem A1 according to the above definitions.
Theorem 3.
Optimal cyclic scales are RID-scales.
Nevertheless, there are many non-optimal cyclic scales that are also RID-scales. Cyclic RID-scales correspond to partial convergents of continued fractions where the comma is not as low as for optimal scales, i.e., their closure is not a best approximation, although they belong to a family of relatively good partial convergents. However, although in most cases a convergent that gives a good approximation (i.e., an accurate scale) generates a RID-scale, there are exceptions, such as for n = 200 11. Therefore, there exist accurate scales that are not RID, and RID-scales that are not accurate.

11.1. Comma and Elementary Intervals of a RID-Scale

It is immediate to identify a RID-scale from its interval comma | ϕ n | = | log 2 γ n | = | n log 2 h N | . As seen in Appendix B, the distance between the tones ϑ j E n and ϑ j = ν k E n h for 0 j < n is k n | ϕ n | . So, k n | ϕ n | < 1 n for all 0 k < n if and only if n 1 n | ϕ n | < 1 n , that is, the interval comma is limited as | ϕ n | < 1 n 1 .
Lemma 4.
A cyclic scale is RID if and only if | ϕ n | < 1 n 1 .
This condition, which is satisfied by all optimal scales, assures us that the value of | ϕ n | is small enough to be far from the previously seen degenerate cases of less than n notes, since for every RID-scale it is fulfilled | ϕ n | < 1 n 1 min ( 1 m , 1 M ) .
From another point of view, while considering RID-scales we are excluding “bad approximations” of log 2 h that satisfy, if ϕ n > 0 , 1 n 1 n log 2 h N < 1 M and, if ϕ n < 0 , 1 m < N n log 2 h 1 n 1 . This fact also has implications on the bounds of the two elementary intervals of cyclic RID-scales, We distinguish the following cases.
 (i)
If 0 < ϕ n < 1 n 1 , according to Eq. 9, u < 1 n ( 1 + m n 1 ) < 1 n ( 1 + n 1 n 1 ) = 2 n . Hence12,
1 n < u < min ( 1 M , 2 n )
(ii)
If 1 n 1 < ϕ n < 0 , according to Eq. 9, d < 1 n ( 1 + M n 1 ) < 1 n ( 1 + n 1 n 1 ) = 2 n . Hence13,
1 n < d < min ( 1 m , 2 n )
In both cases, the size of the greatest elementary interval of a cyclic RID-scale is lower than 2 n , as stated in Lemma 3.

11.2. Bias of RID-Scales

For large values of | ϕ n | , that is, | ϕ n | > 1 m or | ϕ n | > 1 M , Eq. 14 can have a quite arbitrary and non-symmetrical behavior, but for values | ϕ n | < 1 n 1 , and in particular for optimal scales with | ϕ n | < 1 n + M or | ϕ n | < 1 n + m , depending on the value δ , the bias θ n behaves as proportional to ϕ n 2 .
Being | m ϕ n | < 1 and | M ϕ n | < 1 , we can approximate the logarithms in the following expressions as
( 1 + m ϕ n ) ln ( 1 + m ϕ n ) = m ϕ n + 1 2 m 2 ϕ n 2 1 6 m 3 ϕ n 3 + O 4 ( m ϕ n )
( 1 M ϕ n ) ln ( 1 M ϕ n ) = M ϕ n + 1 2 M 2 ϕ n 2 + 1 6 M 3 ϕ n 3 + O 4 ( M ϕ n )
and by substitution into Eq. 14 we get
θ n = 1 ln n M m 2 + m M 2 2 n ϕ n 2 M m 3 m M 3 6 n ϕ n 3 + M m 4 + m M 4 12 n O 4 ( ϕ n ) =
= m M 2 ln n ϕ n 2 m M 3 ϕ n 3 + m 2 m M + M 2 6 O 4 ( ϕ n )
Notice that m | ϕ n | 3 = 1 m 2 | m ϕ n | 3 = O 3 ( | ϕ n | ) < O 3 ( | m ϕ n | ) with | m ϕ n | < 1 and M | ϕ n | 3 = 1 M 2 | M ϕ n | 3 = O 3 ( | ϕ n | ) < O 3 ( | M ϕ n | ) with | M ϕ n | < 1 . The same happens with the higher order terms. Therefore, for enough small | ϕ n | we can use the approximation derived from the following result,
Lemma 5.
The bias of a cyclic RID-scale satisfies
θ n = m M 2 ln ( m + M ) ϕ n 2 + O 3 ( | ϕ n | )

11.3. Cyclic Scales of Minimal Bias

As explained in §Section 4, while refining a cyclic scale, in each iteration one of the indices of the extreme tones remains fixed, while the other increases by a value equal to the one that remains fixed. Let’s see that in each refinement, the following function appearing in Eq. 34 always increases,
ψ ( m , M ) = m M ln ( m + M )
Lemma 6.
For n = m + M 2 , ψ ( m + , M ) > ψ ( m , M ) with m + = m + M , and ψ ( m , M + ) > ψ ( m , M ) with M + = m + M .
Proof. 
Indeed, it suffices to check that the function ψ ( m , M ) = m M ln ( m + M ) satisfies ψ m > 0 , ψ M > 0 .
m m M ln ( m + M ) = M ln ( m + M ) m M m + M ln 2 ( m + M ) = M 2 ln ( m + M ) + m M ( ln ( m + M ) 1 ) ln 2 ( m + M ) > 0 m + M 2
M m M ln ( m + M ) = m ln ( m + M ) m M m + M ln 2 ( m + M ) = m 2 ln ( m + M ) + m M ( ln ( m + M ) 1 ) ln 2 ( m + M ) > 0 m + M 2
Note that, for m + M > 2 , the above inequalities hold and, if m + M = 2 , then M = m = 1 , so that the above results are also valid. □
Corollary 1.
Let us write ψ n = ψ ( m , M ) . If n > n then ψ n > ψ n .
With this notation, the bias of a cyclic RID-scale can be estimated from
θ n = 1 2 ψ n ϕ n 2
Theorem 4.
Every cyclic RID-scale of minimal bias is optimal.
Proof. 
We prove it by denying the consequent. Assume two cyclic RID-scales E n h i E n h such that n > n with interval commas satisfying | ϕ n | > | ϕ n | , i.e., E n h is not optimal. Then, applying the previous corollary to Eq. 36, we have
ψ n ϕ n 2 > ψ n ϕ n 2 θ n > θ n
Therefore, if a cyclic RID-scale is not optimal, it cannot be MB. □
The first cyclic MB-scales for h = 3 , i.e., Pythagorean scales, are for n = 5 ; 12 ; 53 ; 665 as shown in Fig. Figure 2, as well as in Table 1.

12. Conclusions

In the current paper it is proposed to measure the regularity of the intervals of a music scale from its partition entropy. Among other properties, the fact of being a continuous increasing function of n for an n-TET scale, which is always the maximum value that the entropy of any n-tone scale can reach, together with the sub-additivity property, which guarantees that as refining the partition, the entropy always increases, make this parameter very suitable for our purpose. In order to compare scales with different number of tones, the entropy relative to the corresponding regular scale is used, which is the normalized entropy, so that their complementary to 1 quantifies the bias relative to the n-TET scale.
The main application of these concepts is to cyclic scales, whose properties are reviewed and further investigated in the Appendices. Two situations have been analyzed. First, cyclic scales with a fixed number of tones, which is a family of modulating temperament scales around one generator. In this case, the bias only depends on the closure, i.e., the relative size of both elementary intervals. Second, as cyclic scales are refined, the bias also depends on how many intervals of each size there are.
In order to study such a dependency, it has been necessary to restrict the scales in two ways. On the one hand, to those having a lower limit of the entropy determined by the condition that their notes are distributed along each of the intervals of a regular scale (RID-scales). Such a study has been made in a general way, by determining the maximum bias of several scales, not necessarily cyclic, having different levels of similarity with a n-TET scale, either from one side or from both sides. Figure 2 displays the similarity levels for cyclic scales. On the other hand, to scales with the comma not exceeding that of a RID-scale, since in this case the dependency bias-closure is well defined.
We prove that any cyclic scale of minimal bias (MB-scale), i.e., whose bias is lower than that of the cyclic scales of fewer tones, is necessarily optimal, i.e., corresponds to a best rational aproximation of log 2 h . Therefore, in relation to the closure, scales can be ordered in nested families, from worst to best, as cyclic, accurate, and optimal scales, whilst in relation to their regularity, they can be ordered in nested families as cyclic, RID, optimal, and MB-scales.

Funding

This research received no external funding.

Conflicts of Interest

The author declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
BRA Best rational approximation (optimal scale)
GRA Good rational approximation (accurate scale)
I|Xn maximum distance 1 x n to n-TET
MB Minimal bias
n-TET n-tones of equal temperament
OSS One-side similar
RID Regular interval distributed

Appendix A. Bounds for the Elementary Intervals and Closure of Cyclic Scales

The notes of a cyclic scale E n h divide the circle of the octave into n intervals of width log 2 U or log 2 D , beginning and ending in the pitch class of the fundamental, so that
M log 2 U + m log 2 D = 1
According to the definitions of U, D, and γ n , these intervals can be referred to the elementary interval of the equal temperament scale as
log 2 U = 1 n + m n log 2 γ n ; log 2 D = 1 n M n log 2 γ n
We may determine their bounds by distinguishing between the cases γ n greater or less than 1.
(i)
If γ n > 1 , then U > D . Depending on the scale, we have:
(a)
The closure of an optimal cyclic scale satisfies 1 < γ n < 2 ν M < ν m , hence 1 < γ n < D < U . By taking logarithms and taking into account Eq. A2 we get
0 < log 2 γ n < log 2 D = 1 n M n log 2 γ n 0 < log 2 γ n < 1 n + M
The elementary intervals satisfy
1 n + M < log 2 D < 1 n ; 1 n < log 2 U < 2 n + M
(b)
The closure of a non-optimal cyclic scale satisfies 1 < 2 ν M < γ n < ν m , hence 1 < D < γ n < U . Then, by taking logarithms,
log 2 D = 1 n M n log 2 γ n < log 2 γ n 1 n + M < log 2 γ n log 2 γ n < log 2 U = 1 n + m n log 2 γ n log 2 γ n < 1 M
Therefore,
0 < log 2 D < 1 n + M ; 2 n + M < log 2 U < 1 M
(c)
For an accurate cyclic scale, to the above condition 0 < log 2 D < log 2 γ n < log 2 U we must add that of a good rational approximation, that can be written as
0 < 1 n log 2 γ n < 1 M log 2 D < 1 m log 2 U
Then, 1 n log 2 γ n < 1 M log 2 γ n M n , hence log 2 γ n < 1 2 M , and, since Eq. A4 is still valid, we get the following bounds,
1 n + M < log 2 γ n < 1 2 M ; 1 2 n < log 2 D < 1 n + M ; 2 n + M < log 2 U < 1 2 n + 1 2 M
Thus, in case (i), either for optimal or non-optimal scales, it is satisfied,
γ n > 1 0 < log 2 γ n < 1 M ; 0 < log 2 D < 1 n ; 1 n < log 2 U < 1 M
(ii)
If γ n < 1 , then D > U . Depending on the scale, we have:
(a)
The closure of an optimal cyclic scale satisfies ν M 2 < 1 ν m < γ n < 1 , hence 1 D < 1 U < γ n < 1 . Then, by taking logarithms,
log 2 U = 1 n m n log 2 γ n < log 2 γ n < 0 1 n + m < log 2 γ n < 0
The elementary intervals satisfy
1 n + m < log 2 U < 1 n ; 1 n < log 2 D < 2 n + m
(b)
The closure of a non-optimal cyclic scale satisfies ν M 2 < γ n < 1 ν m < 1 , hence 1 D < γ n < 1 U < 1 . Then, by taking logarithms,
log 2 D = 1 n + M n log 2 γ n < log 2 γ n 1 m < log 2 γ n < 0 log 2 γ n < log 2 U = 1 n m n log 2 γ n log 2 γ n < 1 n + m
Then, the elementary intervals satisfy
0 < log 2 U < 1 n + m ; 2 n + m < log 2 D < 1 m
(c)
For an accurate cyclic scale, to the above condition log 2 D < log 2 γ n < log 2 U < 0 we must add that of a good rational approximation, that can be written as
1 M log 2 D < 1 m log 2 U < 1 n log 2 γ n < 0
Then, 1 n log 2 γ n > 1 + m log 2 γ n m n , hence 1 2 m < log 2 γ n , and, since Eq. A7 is still valid, we get the following bounds,
1 2 m < log 2 γ n < 1 n + m ; 2 n + m < log 2 D < 1 2 n + 1 2 m ; 1 2 n < log 2 U < 1 n + m
Thus, in case (ii), either for optimal or non-optimal scales, it is satisfied,
γ n < 1 1 m < log 2 γ n < 0 ; 0 < log 2 U < 1 n ; 1 n < log 2 D < 1 m
Table A1 summarizes the above results over the octave S 0 . Notice that, for optimal scales, equations A3 and A6 give the following bounds, similarly to the theory of continued fractions,
( i ) log 2 h N n < 1 n ( n + M ) ; ( ii ) log 2 h N n < 1 n ( n + m )
Table A1. Bounds for ϕ n = log 2 γ n , u = log 2 U and d = log 2 D , depending on the type of cyclic scale.
Table A1. Bounds for ϕ n = log 2 γ n , u = log 2 U and d = log 2 D , depending on the type of cyclic scale.
Preprints 104176 i003

Appendix B. Deviation of a Cyclic Scale from n-TET

We calculate the interval between the notes with ordinal j of the respective scales E n h and E n , corresponding to the tones ϑ j E n h and ϑ j = 2 j n E n . According to equations A2 and A1, the deviations of each note of E n h with regard to E n compensate each other and, in the end, they close exactly the octave. If the tone ϑ j corresponds to the iteration ν k , then j = N k n k [17], Eq. 28]. Therefore, k = N k j n . Bearing in mind that log 2 γ n = n log 2 h N , the scale note log 2 ϑ j = log 2 ν k is
log 2 ϑ j = k log 2 h k = k n log 2 γ n + j n
Lemma A1.
The interval that separates the two notes is
I ( ϑ j , ϑ j ) = log 2 ϑ j log 2 ϑ j = k n log 2 γ n ; j = 0 , , n 1
Therefore, all of these intervals have the same sign, which is positive if γ n > 1 and negative if γ n < 1 . By taking absolute value, the interval between these close notes becomes a distance,
d ( ϑ j , ϑ j ) = k n | log 2 γ n | = k n κ n ; j = 0 , , n 1
As expected, these distances increase with the iterations, so that the maximum is reached by last iterate, k = n 1 , i.e., d max = max j d ( ϑ j , ϑ j ) = n 1 n κ n . Then, a condition such as d max 1 λ n , for λ > 0 , becomes κ n 1 λ ( n 1 ) , which, in cents is
( κ n ) 1200 λ ( n 1 )
For an optimal cyclic scale, according to equations A3 and A6, 1 n + m < log 2 γ n < 1 n + M , so that | log 2 γ n | < 1 n and k | log 2 γ n | < 1 in Eq. A11. Therefore, we conclude d ( ϑ j , ϑ j ) < 1 n .
Theorem A1.
The notes ϑ i , i 0 , of anoptimalcyclic scale E n h and the ones of the corresponding n-TET scale E n alternate, i.e., for 0 < i < n either(i) ϑ i < ϑ i < ϑ i + 1 or(ii) ϑ i 1 < ϑ i < ϑ i .
Such a situation is also possible for non-optimal scales, although there are exceptions where the tones do not alternate. For instance, in a non-optimal scale with γ n < 1 , Eq. A7 implies m | log 2 γ n | < 1 . But, in a cyclic scale where M > m , the condition k | log 2 γ n | < 1 , k , stated in case (ii) is not fulfilled. An example of such a situation is the 29-tone scale, with m = 12 and M = 17 , where | log 2 γ n | 1 = 27.7 . The 28-th tone ν 28 = ϑ 12 (454.74 cents) is below ϑ 12 (496.55 cents) more than a factor 2 1 29 (41.38 cents), and between ϑ 12 and ϑ 13 = ν 11 there are two consecutive tones ϑ 11 and ϑ 12 of the 29-TET scale.

References

  1. Clough, J.; Myerson, G. Variety and multiplicity in diatonic systems. Journal of Music Theory 1985, 29, 249–270. [Google Scholar] [CrossRef]
  2. Clough, J.; Myerson, G. Musical scales and the generalized circle of fifths. Am. Math. Monthly 1986, 93, 695–701. [Google Scholar] [CrossRef]
  3. Clough, J.; Douthett, J. Maximally even sets. Journal of Music Theory 1991, 35, 93–173. [Google Scholar] [CrossRef]
  4. Douthett, J.; Krantz, R. Maximally even sets and configurations: common threads in mathematics, physics, and music. Journal of Combinatorial Optimization 2007, 14, 385–410. [Google Scholar] [CrossRef]
  5. Pinkerton, R. Information theory and melody. Scientific American 1956, 194, 77–87. [Google Scholar] [CrossRef]
  6. Youngblood, J. Style as information. Journal of Music Theory 1958, 2, 24–35. [Google Scholar] [CrossRef]
  7. Lewin, D. Some applications of communication theory to the study of twelve-tone music. Journal of Music Theory 1968, 12, 50–84. [Google Scholar] [CrossRef]
  8. Cohen, J. Information Theory and Music. Behavioral Science 1962, 7, 137–163. [Google Scholar] [CrossRef]
  9. Knopoff, L.; Hutchinson, W. Information theory for musical continua. Journal of Music Theory 1981, 25, 17–44. [Google Scholar] [CrossRef]
  10. Knopoff, L.; Hutchinson, W. Entropy as a measure of style: The influence of sample length. Journal of Music Theory 1983, 27, 75–79. [Google Scholar] [CrossRef]
  11. Snyder, J. Entropy as a Measure of Musical Style: The Influence of a Priori Assumptions. Music Theory Spectrum 1990, 12, 121–160. [Google Scholar] [CrossRef]
  12. Temperley, D. Music and Probability; The MIT Press, 2006. [CrossRef]
  13. Sakellariou, J.; Tria, F.; Loreto, V.; Pachet, F. Maximum entropy models capture melodic styles. Scientific Reports 2017, 7, 9172. [Google Scholar] [CrossRef] [PubMed]
  14. Gündüz, G. Entropy, energy, and instability in music. Physica A: Statistical Mechanics and its Applications 2023, 609, 128365. [Google Scholar] [CrossRef]
  15. Arnold, V.; Avez, A. Ergodic Problems of Classical Mechanics; Mathematical physics monograph series, Benjamin, 1968.
  16. Cubarsi, R. An alternative approach to generalized Pythagorean scales. Generation and properties derived in the frequency domain. Journal of Mathematics and Music 2020, 14, 266–291. [Google Scholar] [CrossRef]
  17. Cubarsi, R. On the divisions of the octave in generalized Pythagorean scales and their bidimensional representation. Journal of Mathematics and Music 2024, 18, 1–19. [Google Scholar] [CrossRef]
  18. Carey, N.; Clampitt, D. Aspects of Well-Formed Scales. Music Theory Spectrum 1989, 11, 187–206. [Google Scholar] [CrossRef]
  19. Carey, N.; Clampitt, D. Two Theorems Concerning Rational Approximations. Journal of Mathematics and Music 2012, 6, 61–66. [Google Scholar] [CrossRef]
  20. Carey, N.; Clampitt, D. Addendum to ”Two Theorems Concerning Rational Approximations”. Journal of Mathematics and Music 2017, 11, 61–63. [Google Scholar] [CrossRef]
  21. Žabka, M. Generalized Tonnetz and well-formed GTS: A scale theory inspired by the Neo-Riemannians. In Mathematics and Computation in Music; Springer: Berlin, 2009; pp. 286–298. [Google Scholar]
  22. Žabka, M. Well-Formedness in Two Dimensions: A Generalization of Carey and Clampitt’s Theorem. Journal of Mathematics and Music 2010, 4, 1–30. [Google Scholar] [CrossRef]
  23. Hellegouarch, Y. Gammes naturelles. Gazette des Matematiciens, 1999; 82, 13–26. [Google Scholar]
  24. Kassel, A.; Kassel, C. On Hellegouarch’s definition of musical scales. Journal of Mathematics and Music 2010, 4, 31–43. [Google Scholar] [CrossRef]
  25. Loya, P. Amazing and Aesthetic Aspects of Analysis; Springer: New York, 2017. [Google Scholar]
  26. Khinchin, A. Continued Fractions; University of Chicago: Chicago, 1964. [Google Scholar]
  27. Milne, A.; Sethares, W.; Plamondon, J. Tuning continua and keyboard layouts. Journal of Mathematics and Music 2008, 2, 1–19. [Google Scholar] [CrossRef]
  28. Sethares, W.; Milne, A.; Tiedje, S.; Prechtl, A.; Plamondon, J. Spectral Tools for Dynamic Tonality and Audio Morphing. Computer Music Journal 2009, 33, 71–84. [Google Scholar] [CrossRef]
  29. Wilcox, A. Indices of Qualitative Variation; Oak Ridge National Lab.: Tennessee, 1967; No.ORNL-TM-1919. [Google Scholar]
  30. Liern, V. On the construction, comparison, and exchangeability of tuning systems. Journal of Mathematics and Music 2015, 9, 197–213. [Google Scholar] [CrossRef]
1
Equalities that involve measures are understood to be true except in a null set, that is, strictly we should write μ ( M i I A i ) = 0 , but we will not do it to simplify the notation.
2
It is also called metric entropy, and it consists in changing the metric of the theory of probabilities, i.e., the probability, by a generic metric, where the concepts relative to the partitions remain.
3
If the function of a random variable f ( x ) is concave, the expected value satisfies E ( f ( x ) ) f ( E ( x ) ) (Jensen’s inequality).
4
In general, for three measurable partitions α , β , γ , H ( α β / γ ) = H ( α / γ ) + H ( β / α γ ) H ( α / γ ) + H ( β / γ ) .
5
The terms “good” and “best” rational approximation [25] are equivalent to best approximation “of the first kind” and “of the second kind”, respectively [26]. For the current case, the conditions mean:
One-sided best approximation of log 2 h + satisfies, if γ n < 1 , ( k + 1 ) k log 2 h > N n log 2 h > 0 ; 0 < k n .
One-sided best approximation of log 2 h satisfies, if γ n > 1 , 0 < n log 2 h N < k log 2 h k ; 0 < k n .
Best rational approximation satisfies 0 < | n log 2 h N | < | k log 2 h ( k + 1 ) | and 0 < | n log 2 h N | < | k log 2 h k | ; 0 < k n .
One-sided good approximation of log 2 h + satisfies, if γ n < 1 , k + 1 k log 2 h > N n log 2 h > 0 ; 0 < k n .
One-sided good approximation of log 2 h satisfies, if γ n > 1 , 0 < log 2 h N n < log 2 h k k ; 0 < k n .
Good rational approximation satisfies 0 < | log 2 h N n | < | log 2 h k + 1 k | and 0 < | log 2 h N n | < | log 2 h k k | ; 0 < k n .
6
Since b [ z ( a b ) + z ( 1 a b ) ] = z ( a ) z ( b ) + z ( b a ) and u d = κ n , it holds u H ( d u , 1 d u ) = z ( d ) z ( u ) + z ( κ n ) .
7
This ratio has also been called efficiency and relative entropy [29].
8
The interval between two tones ν 1 < ν 2 in ( 0 , ) is I ( ν 2 , ν 1 ) = log 2 ν 2 log 2 ν 1 .
9
Some of these intervals may have width zero, giving rise to a degenerate scale with less than n non-null intervals.
10
The function Z ( λ , a ) = z ( λ a λ ) + z ( λ + a λ ) with a > 0 is defined for λ a , where satisfies Z ( λ , a ) < 0 , always increasing from Z ( a , a ) = 2 until Z ( λ , a ) 0 when λ .
11
Between the notes j = 83 of the respective scales E 200 3 and E 200 there is a distance of nearly 1.5 elementary intervals of the equal temperament scale and between two consecutive notes of the cyclic scale there may be a distance equivalent to 2.1 regular intervals, what means that within some regular intervals there are two notes of the cyclic scale.
12
We must distinguish two cases: (a) M > m , therefore 1 n < 1 M < 2 n and 2 n < 1 m ; since M u + m d = 1 , we have 1 n < u < 1 M and 0 < d < 1 n , which does not add any new limitation. (b) M < m , therefore 1 n < u < 2 n and 1 n ( 1 M m ) < d < 1 n . Therefore, the extremes still correspond to n-tone scales by avoiding degenerate cases.
13
We must distinguish two cases: (a) M > m , therefore 1 n < 1 M < 2 n and 2 n < 1 m ; hence, 1 n < d < 2 n and 1 n ( 1 m M ) < u < 1 n . Once again, the extremes correspond to n-tone scales, not degenerating toward scales of fewer tones. (b) M < m , therefore 2 n < 1 M and 1 n < 1 m < 2 n ; hence, 1 n < d < 1 m and 0 < u < 1 n , which does not add any new limitation.
Figure 1. Graphs of normalized entropy and bias.
Figure 1. Graphs of normalized entropy and bias.
Preprints 104176 g001
Figure 2. Behavior of | ϕ n | = log 2 κ n (red) and θ n = 1 η n (blue) in terms of the number of tones n (bilogarithmic scales). In gray, maximum bias for several similarity levels to n-TET scales.
Figure 2. Behavior of | ϕ n | = log 2 κ n (red) and θ n = 1 η n (blue) in terms of the number of tones n (bilogarithmic scales). In gray, maximum bias for several similarity levels to n-TET scales.
Preprints 104176 g002
Table 1. Properties of cyclic scales: GRA=Good rational approximation (accurate scale); RID=Regular interval distributed; BRA=Best rational approximation (optimal scale); MB=Minimal bias; I|Xn=maximum distance 1 x n to a note of the n-TET scale.
Table 1. Properties of cyclic scales: GRA=Good rational approximation (accurate scale); RID=Regular interval distributed; BRA=Best rational approximation (optimal scale); MB=Minimal bias; I|Xn=maximum distance 1 x n to a note of the n-TET scale.
Preprints 104176 i001
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated