1. Introduction
Galen of Pergamum - a giant in the history of medicine - is currently described as medical doctor and philosopher, born in Pergamum in the year 129 AD (Anno Domini) and died in 216. He attended schools of Greek philosophers and medical schools in Pergamum, Smirne and Alexandria. In the year 157 he became the personal doctor of the emperor Marcus Aurelius. Galen reorganized medicine on a unitary basis in which anatomy and the demonstrative methods of Aristotelian and Euclidean origins were central.
Apparently, he was a very prolific writer. Of the 400 books (in the following we refer to them as “texts”) attributed to him, only 130 are today available, written in Greek, Arabic, Syrian and Latin. However, some texts are not considered authentic but are attributed to a Pseudo Galen, written surely after his death, and authored by unknown persons who exploited his fame [
1].
What we know about his life comes only from his writings since he is practically unknown to contemporary scholars. His reference to Aristotelian philosophy and monotheism made him accepted by Judaism, Fathers of the Christian Church and by Islam. His works were, therefore, adopted as textbooks in Medieval Universities and his authority was practically indisputable till the Renaissance.
His fame arrives very late, only about 150 years after his death, at the medical school in Alexandria, mainly because to the large number of his alleged texts collected or summarized by Oribasius, the personal medical doctor of the emperor Julian the apostate, IV century. Till then, practically nothing was known about him [
2] (p. 31), so that a “Galenic Question” has been recently raised [
2]. In other words, who really wrote the huge number of texts attributed to Galen? How many different writers authored them?
The aim of the present paper is to propose a “solution” to the “Galenic Question” through a multidisciplinary approach based: (a) on historical research and (b) on a mathematical analysis of the Greek texts.
The historical approach has considered historical independent sources and possible anachronisms. This study has allowed us to select a subset of Galen’s texts that very likely were written by an author, named also Galen, but living before Galen of Pergamum, between the end of the I century BC (Before Christ) and the second half of the I century AD.
The mathematical approach is based on a mathematical theory concerning surface deep–language parameters [
3] - stochastic variables not consciously controlled by authors - used already to assess: (a) similarities among the New Testament books by addressing the “Synoptic question” with a new approach [
4]; (b) the likely impact of King James’ New Testament translation on Charles Dickens’ novels [
5]; (c) the unexpected strong mathematical relationship between the Book of Revelation and the Letter to Hebrews [
6], which should be further pursued by scholars.
After this introductory section, in
Section 2 we briefly summarize the so–called "Galenic Question" which, through the study of independent sources, questions how many different authors, and from what historical period, hide under the name Galen. We present an overall critical analysis of some suspicious texts attributed to Galen, with the aim of identifying works that might have been written by different authors, as suggested in [
2] and in some Renaissance texts [
7] (p. 2); [
8] (p. 260).
In section 3 we examine many texts, attributed to Galen of Pergamum, which, because of possible historical anachronisms or suspicious information, very likely were written by an author who lived before him.
In
Section 4 we apply the basic tools of the mathematical theory, mentioned above, and probability calculations to 57 Greek texts, allegedly written by a single “Galen”, and show that, very likely, at least three authors hide under the authorship of Galen of Pergamum.
In
Section 5 we merge the texts of these alleged three “authors” and consider them as single texts to which apply the mathematical theory.
In
Section 6 we conclude with the statement that – in our opinion – at least three authors wrote the texts studied. The first two very likely were real persons: (a) a certain Galen living between the end of the I century BC (Before Christ) and the second half of the I century AD; (b) the historical Galen of Pergamum (II–III centuries AD) and (c) several unknown authors hiding under the name Galen, but surely after Galen of Pergamum’s death.
2. The Galenic Question
Athenaeus of Naucratis in his work written at the beginning of the III century AD (Deipnosophistae, I, 2) speaks of a Galen, philosopher, physician, author of scientific texts, and commentator of ancient texts, who surpassed all his predecessors. This great praise seems to fit more with an author of the past rather than a contemporary.
Alexander of Aphrodisias, a scholar who lived in Athens around 200 AD (Anno Domini), in his comments on Aristotle cites Galen as “famous philosopher” together with Plato and Aristotle (Alexander of Aphrodisias, Comm. Arist. Topic., 8, 5). This statement sounds curious for a writer who should be his contemporary, known above all as a medical researcher and physician, whereas here he is associated with the great philosophers of the past.
Gargilius Martial in Medicina ex oleribus et pomis, III century AD, mentions only physician of the I century AD and among them he mentions Galen several times in connection with Dioscorides (Gargilius Martial, Medicina ex oleribus et pomis, VI) who lived under the emperor Nero.
Eusebius (IV century) reports that in a work written very likely at the beginning of the III century against the heresy of Theodotus, the heretics almost adored Galen (Eusebius, Historia Ecclesiastica, V, 28, 13–14), seen as philosopher and logician together with Euclid, Aristotle and Theophrastus. Thus, Galen is considered among the great writers of antiquity, and he does not appear to be contemporary of with the heretics who "worshipped" him.
Simeon Metaphraste, in his histories of saints and martyrs, mentions the
passio of Carpus, Paphilus, Agathodorus, Agathonyx [
9], physicians martyred under the emperor Decio (249–251). During their interrogatory the names of Hippocrates and Galen are mentioned. Now, mentioning Galen together with Hippocrates indicates that he already was so famous to be associated with Hippocrates and, therefore, this should place Galen in a very ancient epoch. Moreover, these martyrs are also mentioned by Eusebius of Caesarea (
Historia Ecclesiastica, IV, 15, 48) who, however, set their martyrdom under the emperor Lucius Verus (161–169). Modern scholars share this latter dating because the Greek language is that of the epoch of Marcus Aurelius [
10].
These are few examples of contradictory historical information regarding Galen of Pergamum that raise the question about the existence of more than one author under Galen’s name, very likely persons who lived in different epochs. An exhaustive review of these points can be found in [
2].
Several other studies on Galen, even if they do not contradict the current dating are nevertheless quite critical and capture the complexity of the contents of his writings, compared to the intellectual life of his (presumed) epoch [
11,
12,
13,
14,
15,
16,
17,
18,
19,
20,
21]. There are many texts attributed to him which, for their style and arguments, scholars consider surely written by other authors – namely a Pseudo Galen (PG) – after Galen of Pergamum’s death [
1]. On the contrary, other writings seem to belong to an epoch before Galen of Pergamum. In conclusion, several authors could hide under the historical figure of Galen of Pergamum, living either before or after Galen of Pergamum’s epoch.
Indeed, in old age, Galen of Pergamum wrote a book in which he lists which texts were his own. This fact means that already at that time there was a widespread confusion between his texts and others attributed to a homonym Galen. The need of ordering works attributed to him may have led Galen to declare numerous texts written by others, whose memory had been lost, as his own. The most important Galen scholars agree that extracting autobiographical truth from his writings is impossible, just because he constructed a partially imaginary autobiography [
22] (p. 12).
To search for a possible solution to this question, in our study we have examined 57 works written in Greek, with sufficiently large number of words (at least 1500) to allow reliable statistical results.
Table 1 lists the Latin titles of these texts with the presumed epoch of writing, according to scholars. Obviously, all the texts are dated according to the current chronology (129–216 AD), except the Pseudo Galen’s texts.
As already stated, some of the texts of
Table 1 should refer to different authors. There is a substantial consensus among the scholars on the fact that the works numbered from 31 to 42 (PG) were really written after the death of Galen of Pergamum [
1], by the so–called Pseudo-Galen.
Galen appears as a compiler and commentator of previous works, often reported verbatim, but this is hardly noticeable because he does not mention sources [
23]. The alleged autobiographical passages could also have been inserted later. In fact, Galen's works remained ignored until around 360 AD, and only then his alleged literary production exploded into a myriad of manuscript copies, not to mention compilations, summaries, interpolations, translations into multiple languages, falsely attributing to him writings of others. These texts have generally been collected by scholars under the name of Pseudo–Galen [
1].
Other works seem to be written by another author who lived before Galen of Pergamum [
2] because of confusions and contradictions which have constantly drawn a series of criticisms to Galen, summarized in a recent book with the significant title “
Contre Galien. Critiques d’une autorité médicale de l’Antiquité à l’âge moderne” [
24].
With this regard, it should be highlighted that the biography of an author named Galen but living at the beginning of the I century AD, is mentioned in several sources, although generally either ignored or neglected by scholars. In fact, besides the autobiography of Galen of Pergamum there is a second biography in Arabic texts in which Galen is a philosopher and physician living between the end of the I century BC and the second half of the I century AD, until the times of Nero and Vespasian [
25,
26,
27]. Furthermore, Arab doctors know Galen's works unknown in the West, and many details of his life. Hunayn Ibn Ishāq (IX century), translator of many Galen's medical texts, wrote that Galen was a contemporary of Christ, and that he died in the year 88 AD at 87 years, according to the authority of
Yahyā the Grammarian, i.e., John the Grammarian. For Sulayman ibn Hassān (i.e. Ibn Juljul, X century, from Cordoba), Galen lived at the time of the emperor Nero and died in Sicily [
25,
26,
27].
In other words, the claim that Galen lived at the epoch of Jesus Christ should be considered seriously because the first Arab translators of Galen had access to lost Byzantine biographies, and their information on the contemporaneity of Galen and Christ confirms the doubts about Galen and the existence of a "Galenic Question" [
2].
In conclusion, the number of authors hiding under Galen’s name is at least three, therefore in the following we refer to three “Galen”.
We first conjecture a Galen philosopher and physician who lived
before Galen of Pergamum, between the late I century BC (Before Christ) and the epoch of Nero (54–68 AD) and Vespasianus (69–79 AD): we refer to him as Galen–1. Secondly, we refer to the philosopher and physician living from 129 to 216 AD, i.e. the historical Galen of Pergamum, as Galen–2. Thirdly, we refer to the author of texts written
after Galen of Pergamum’s death, authored by exploiting his fame [
1], as the Galen–3 (Pseudo Galen).
The mathematical analysis of the deep–language structure of all the writings reported in
Table 1, discussed
Section 4, will aim either to confirm or to deny our conjecture of the existence of at least three authors. Before this mathematical analysis, in the next section, we examine the works that might have been written by Galen–1.
4. Deep−Language Parameters and Vector Representation of Texts
Let us consider a text and its subdivision in disjoint blocks long enough to give reliable average values [
3]. For each text block, let
be the number of sentences contained in it,
the number of words,
the number of characters contained in the
words and
the number of punctuation marks (interpunctions) contained in the
sentences.
In the present study we have divided each text of
Table 1 in disjoint blocks of approximately 300 words, so that the statistics of linguistic variables need not be weighted by the length (in words) of block texts, as done in [
3,
4,
5,
6] were the blocks considered are chapters of novels. In all cases, all other alphanumeric characters, notes and titles have been deleted, leaving only words and interpunctions, so that to obtain as much as possible the plain text written by the author.
For each text block we have computed the following variables [
1]:
We refer to them as the deep–language variables [
3]. In other words,
gives the number of characters per word;
gives the words per sentence;
gives the words per interpunctions
(this parameter is also referred to as the “words interval” [
3]);
gives the interpunctions per sentence (this variable gives the number of
contained in a sentence). Very likely, these four linguistic variables are rarely consciously controlled by any author, therefore their statistics can give useful indications on similarity of texts with little or no bias. Notice also that they also reveal readers’ (and writers’, as well) short–term memory characteristics [
48] and readability index of the text [
49].
Table 4 reports the mean values of these four deep–language variables for each text of
Table 1. Specifically, let
be the number of samples (i.e.,
disjoint blocks), the mean value
, for example, is given by:
The variables defined in Equations (1)–(4) allow to study variances of texts of similar length, scatterplots of variables and the linguistic channels [
3,
4,
5,
6,
48,
49].
Notice that
=
. In other words, the mean value
is not given by the total number of words
divided by the total number of sentences
, unless all text blocks are of equal number of words and sentences, which never ooccurs. The same discussion applies to all other variables. For example, for text 7 (
De usu partium),
,
, therefore
while
(
Table 4)
The values reported in
Table 4 can be used to represent texts in Cartesian coordinates [
3]. This geometrical representation supports, as we show next, our alleged attribution of the texts of
Table 1 to Galen–1, Galen–2 and Galen–3.
In this Cartesian plane two texts share a common mathematical structure if their relative Pythagorean distance is small, i.e. if vectors show close ending. In other words, a small distance means that texts are mathematically simialr, a feature that authors very likely do not consciously control.
The geometrical representation is based on defining the following six vectors of the indicated components of deep‒language variables
),
),
),
),
),
) and their resulting vector sum:
From vector analysis, the two orthogonal components of
are given by
, , which can be represented as single points in the first Cartesian quadrant. Notice that the choice of which variables represents the
and
components is not irrelevant because, once the choice is made, the numerical results will depend on it, but not the relative comparisons and general conclusions. Moreover, to avoid different ranges in the
and
axes we use the following normalized variables:
In Eqs. (7)(8) the maximum and minimum values are those obtainable from
Table 4. The scatterplot of the resulting normalized coordinates is shown in
Figure 1: green for the texts attributed to Galen–1; red for the texts attributed to Galen–2; blue for the texts attributed to Galen–3 (Pseudo Galen).
We notice the following facts:
- a)
The texts allegedly attributed to Galen–1 fall into the region delimited by the dashed green line.
- b)
The texts attributed to Galen–2 fall into the region delimited by the dashed blue line.
- c)
The texts allegedly attributed to Galen–3 (Pseudo Galen) fall in the large region delimited by the red dashed line which includes all texts.
The blue and the green regions have a negligible intersection, therefore indicating, very likely, that the texts that fall in these regions were written by different authors. Next, we calculate some probabilities to further pursue this topic.
Let us first calculate the a–priori probability that a text falls in the green region of
Figure 1. The probability that, by chance, a text attributed to Galen–1 falls in the green region is given by the ratio between the area delimited by the green dashed line and the total area, i.e. the area delimited by the red dashed line. This probability is
. Now, the probability that, by chance, the 13 texts attributed to Galen–1 all fall in the green area can be calculated with the binomial distribution.
The binomial distribution is a discrete probability distribution that describes the number of successes in a fixed number of independent trials, where each trial has only two possible outcomes: success or failure. In the following, the binary outcome is represented by a text either inside or outside the green region. To evaluate joint probabilities there is the so-called probability mass function [
50], given by Eq. (9). The formula describes the probability of observing exactly
n successes in
N trials, given by the binomial coefficient, multiplied by the probability of success raised to the power of the number of successes, multiplied by the probability of failure raised to the power of the number of failures.
Therefore, the joint probability that
points, out of
, are in the green region, is therefore given by [
50]:
Setting and , the mean value and the standard deviation of the binomial distribution are given, respectively, by and .
Now, we can explicitly calculate the probability that
texts fall in the green region, out of
First (
t–test) we calculate the t–value:
The probability of having
values greater than 8.6 with
degrees of freedom is
[
50]. Therefore, we can exclude that the clustering of the points attributed to Galen–1 is due to chance.
We also observe that 6 red points (11,30,43,50,52,55), out of , also fall in the green area. Now, the probability that fall in the green area, can be calculated in the same way. Since now , the probability that with degrees of freedom is , therefore these 6 texts can fallen into the green area by chance, because their distribution covers the entire red area.
In the next section, based on these results, we merge the three sets of texts into three singles texts and study them in the vector plane.
5. Deep−Language Parameters of Galen–1, Galen–2 and Galen–3
In this section, we consider the three sets of texts of
Section 4 as three single texts. In other words, we consider three different alleged authors who wrote three long texts. This new analysis definitely shows that the authors were at least three.
Table 5 reports mean value and standard deviation of the mean (in parentheses) of
,
,
and
, for the three authors. At glance, these values already show significant differences in the three authors, which are clearly evident in the vector plane shown in
Figure 2, with normalized coordinates so that Galen–3 is set at the origin of coordinates, point (0,0), and Galen–1 is at (1,1).
The ending points of the vectors shown in
Figure 2 are computed from the values of
Table 5 according to Eq. (6); the variance of the
and
coordinates is calculated by summing the variances of the variables of each coordinate in Eq.(6), whose square root is reported in
Table 5. The result of this calculation is reported in
Table 6.
From these latter values, we have calculated the normalized coordinates of the ending points drawn in
Figure 2 and the 3–standard deviation (sigma) circles. For example, the 1–sigma radius of Galen–1 is given by
. Therefore the 3–sigma circle of Galen–1 has center in (
) with radius
. Then these values are then normalized so that Galen–3 is at (0.0) and Galen–1 at (1,1).
We can see that the probability of mistaken one author for another is practically zero, because the 3–sigma circles are each other very distant.
In conclusion,
Figure 2 says that there is no overlapping among the three sets of texts when considered as single long texts. In other words, Galen–1, Galen–2 and Galen–3 are three different “authors”, the first two very likely real persons, the third an unknown numbers of authors.