Submitted:
12 January 2026
Posted:
13 January 2026
You are already at the latest version
Abstract
Keywords:
1. Introduction
2. Data
3. Results
3.1. Frequency and Rank Diversity
3.2. Similarity
3.3. Unique and Common Content Words
3.4. Article Classifier
3.5. Physicists Mentions
4. Conclusions
Acknowledgements
References
- Wang, D.; Barabási, A.L. The Science of Science; Cambridge University Press: Cambridge, 2021. [Google Scholar] [CrossRef]
- Mingers, J.; Leydesdorff, L. A review of theory and practice in scientometrics. European Journal of Operational Research 2015, 246, 1–19. [Google Scholar] [CrossRef]
- Ward, R.; Tupikina, L.; Santolini, M. Mapping the physics literature with a network of 27,770 papers: A study of publication and citation patterns. PLOS One 2022, 17, e0270131. [Google Scholar] [CrossRef]
- de Solla Price, D.J. Networks of Scientific Papers. Science 1965, 149, 510–515. [Google Scholar] [CrossRef] [PubMed]
- Small, H. Co-citation in the scientific literature: A new measure of the relationship between two documents. Journal of the American Society for Information Science 1973, 24, 265–269. [Google Scholar] [CrossRef]
- Fortunato, S.; Bergstrom, C.T.; Börner, K.; Evans, J.A.; Helbing, D.; Milojević, S.; Petersen, A.M.; Radicchi, F.; Sinatra, R.; Uzzi, B.; et al. Science of science. Science 2018, 359, eaao0185. [Google Scholar] [CrossRef]
- Wuchty, S.; Jones, B.F.; Uzzi, B. The Increasing Dominance of Teams in Production of Knowledge. Science 2007, 316, 1036–1039. [Google Scholar] [CrossRef]
- Redner, S. Citation Statistics from 110 Years of Physical Review. Physics Today 2005, 58, 49–54. [Google Scholar] [CrossRef]
- Uzzi, B.; Mukherjee, S.; Stringer, M.; Jones, B. Atypical Combinations and Scientific Impact. Science 2013, 342, 468–472. [Google Scholar] [CrossRef]
- Church, K.; Liberman, M. The Future of Computational Linguistics: On Beyond Alchemy. Frontiers in Artificial Intelligence 2021, 4, 625341. [Google Scholar] [CrossRef] [PubMed]
- Brown, P.F.; Cocke, J.; Pietra, S.A.D.; Pietra, V.J.D.; Jelinek, F.; Lafferty, J.D.; Mercer, R.L.; Roossin, P.S. A statistical approach to machine translation. Computational Linguistics 1990, 16, 79–85. [Google Scholar]
- Berger, A.L.; Pietra, V.J.D.; Pietra, S.A.D. A maximum entropy approach to natural language processing. Computational Linguistics 1996, 22, 39–71. [Google Scholar]
- Sutskever, I.; Vinyals, O.; Le, Q.V. Sequence to sequence learning with neural networks. In Proceedings of the Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 2, Cambridge, MA, USA, 2014; NIPS’14, pp. 3104–3112. [Google Scholar] [CrossRef]
- Mikolov, T.; Chen, K.; Corrado, G.; Dean, J. Efficient Estimation of Word Representations in Vector Space. 2013, 1301.3781. [Google Scholar] [CrossRef]
- Bahdanau, D.; Cho, K.; Bengio, Y. Neural Machine Translation by Jointly Learning to Align and Translate. arXiv 2016, arXiv:1409.0473. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. In Proceedings of the Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 2017; NIPS’17, pp. 6000–6010. [Google Scholar] [CrossRef]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv 2019, arXiv:1810.04805. [Google Scholar] [CrossRef]
- Brown, T.B.; Mann, B.; Ryder, N.; Subbiah, M.; Kaplan, J.; Dhariwal, P.; Neelakantan, A.; Shyam, P.; Sastry, G.; Askell, A.; et al. Language models are few-shot learners. In Proceedings of the Proceedings of the 34th International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 2020; NIPS ’20, pp. 1877–1906. [Google Scholar] [CrossRef]
- Sinatra, R.; Deville, P.; Szell, M.; Wang, D.; Barabási, A.L. A century of physics. Nature Physics 2015, 11, 791–796. [Google Scholar] [CrossRef]
- Zipf, G.K. Selected Studies of the Principle of Relative Frequency in Language; Harvard University Press, 1932. [Google Scholar] [CrossRef]
- Cancho, R.F.i.; Solé, R.V. The small world of human language. Proceedings of the Royal Society of London. Series B: Biological Sciences 2001, 268, 2261–2265. [Google Scholar] [CrossRef]
- Montemurro, M.A. Beyond the Zipf–Mandelbrot law in quantitative linguistics. Physica A: Statistical Mechanics and its Applications 2001, 300, 567–578. [Google Scholar] [CrossRef]
- Newman, M.E. Power laws, Pareto distributions and Zipf’s law. Contemporary Physics 2005, 46, 323–351. [Google Scholar] [CrossRef]
- Batty, M. Rank clocks. Nature 2006, 444, 592–596. [Google Scholar] [CrossRef] [PubMed]
- Baek, S.K.; Bernhardsson, S.; Minnhagen, P. Zipf’s law unzipped. New Journal of Physics 2011, 13, 043004. [Google Scholar] [CrossRef]
- Deng, W.; Li, W.; Cai, X.; Bulou, A.; Wang, Q.A. Universal scaling in sports ranking. New Journal of Physics 2012, 14, 093038. [Google Scholar] [CrossRef]
- Blumm, N.; Ghoshal, G.; Forró, Z.; Schich, M.; Bianconi, G.; Bouchaud, J.P.; Barabási, A.L. Dynamics of ranking processes in complex systems. Physical Review Letters 2012, 109, 128701. [Google Scholar] [CrossRef]
- Font-Clos, F.; Boleda, G.; Corral, A. A scaling law beyond Zipf’s law and its relation to Heaps’ law. New Journal of Physics 2013, 15, 093033. [Google Scholar] [CrossRef]
- Bettencourt, L.M.A. The Origins of Scaling in Cities. Science 2013, 340, 1438–1441. [Google Scholar] [CrossRef]
- West, G. Scale: The Universal Laws of Life, Growth, and Death in Organisms, Cities, and Companies; Penguin Books: USA, 2018. [Google Scholar]
- Callon, M.; Courtial, J.P.; Turner, W.A.; Bauin, S. From translations to problematic networks: An introduction to co-word analysis. Social Science Information 1983, 22, 191–235. [Google Scholar] [CrossRef]
- Donthu, N.; Kumar, S.; Mukherjee, D.; Pandey, N.; Lim, W.M. How to conduct a bibliometric analysis: An overview and guidelines. Journal of Business Research 2021, 133, 285–296. [Google Scholar] [CrossRef]
- Lin, T.C.; Tang, K.Y.; Lin, S.S.; Changlai, M.L.; Hsu, Y.S. A Co-word Analysis of Selected Science Education Literature: Identifying Research Trends of Scaffolding in Two Decades (2000–2019). Frontiers in Psychology 2022, 13, 844425. [Google Scholar] [CrossRef]
- Blei, D.M.; Ng, A.Y.; Jordan, M.I. Latent dirichlet allocation. Journal of Machine Learning Research 2003, 3, 993–1022. [Google Scholar] [CrossRef]
- Arseniev-Koehler, A. Theoretical foundations and limits of word embeddings: What types of meaning can they capture? Sociological Methods & Research 2024, 53, 1753–1793. [Google Scholar] [CrossRef]
- Michel, J.B.; Shen, Y.K.; Aiden, A.P.; Veres, A.; Gray, M.K.; Team, T.G.B.; Pickett, J.P.; Hoiberg, D.; Clancy, D.; Norvig, P.; et al. Quantitative Analysis of Culture Using Millions of Digitized Books. Science 2011, 331, 176–182. [Google Scholar] [CrossRef]
- Perc, M. Evolution of the most common English words and phrases over the centuries. Journal of The Royal Society Interface 2012, 9, 3323–3328. [Google Scholar] [CrossRef]
- Petersen, A.M.; Tenenbaum, J.; Havlin, S.; Stanley, H.E. Statistical laws governing fluctuations in word use from word birth to word death. Scientific Reports 2012, 2, 313. [Google Scholar] [CrossRef]
- Acerbi, A.; Lampos, V.; Garnett, P.; Bentley, R.A. The Expression of Emotions in 20th Century Books. PLOS One 2013, 8, e59030. [Google Scholar] [CrossRef] [PubMed]
- Bochkarev, V.; Solovyev, V.; Wichmann, S. Universals versus historical contingencies in lexical evolution. Journal of The Royal Society Interface 2014, 11, 20140841. [Google Scholar] [CrossRef]
- Morales, J.A.; Colman, E.; Sánchez, S.; Sánchez-Puig, F.; Pineda, C.; Iñiguez, G.; Cocho, G.; Flores, J.; Gershenson, C. Rank Dynamics of Word Usage at Multiple Scales. Frontiers in Physics 2018, 6, 45. [Google Scholar] [CrossRef]
- Perc, M. Self-organization of progress across the century of physics. Scientific Reports 2013, 3, 1720. [Google Scholar] [CrossRef]
- Cocho, G.; Flores, J.; Gershenson, C.; Pineda, C.; Sánchez, S. Rank Diversity of Languages: Generic Behavior in Computational Linguistics. PLOS One 2015, 10, e0121898. [Google Scholar] [CrossRef]
- Morales, J.A.; Sánchez, S.; Flores, J.; Pineda, C.; Gershenson, C.; Cocho, G.; Zizumbo, J.; Rodríguez, R.F.; Iñiguez, G. Generic temporal features of performance rankings in sports and games. EPJ Data Science 2016, 5, 33. [Google Scholar] [CrossRef]
- Sánchez, S.; Cocho, G.; Flores, J.; Gershenson, C.; Iñiguez, G.; Pineda, C. Trajectory Stability in the Traveling Salesman Problem. Complexity 2018, 2018, 2826082. [Google Scholar] [CrossRef]
- Sánchez-Puig, F.; Lozano-Aranda, R.; Pérez-Méndez, D.; Colman, E.; Morales-Guzmán, A.J.; Rivera Torres, P.J.; Pineda, C.; Gershenson, C. Language Statistics at Different Spatial, Temporal, and Grammatical Scales. Entropy 2024, 26, 734. [Google Scholar] [CrossRef]
- Balucha, A. Stop-Words. 2011. Available online: https://code.google.com/archive/p/stop-words.
- França, U.; Sayama, H.; McSwiggen, C.; Daneshvar, R.; Bar-Yam, Y. Visualizing the “heartbeat” of a city with tweets. Complexity 2015, 21, 280–287. [Google Scholar] [CrossRef]
- Clauset, A.; Shalizi, C.R.; Newman, M.E. Power-law distributions in empirical data. SIAM Review 2009, 51, 661–703. [Google Scholar] [CrossRef]
- Morales, J.A.; Flores, J.; Gershenson, C.; Pineda, C. Statistical Properties of Rankings in Sports and Games. Advances in Complex Systems 2021, 24, 2150007. [Google Scholar] [CrossRef]
- Iñiguez, G.; Pineda, C.; Gershenson, C.; Barabási, A.L. Dynamics of ranking. Nature Communications 2022, 13, 1646. [Google Scholar] [CrossRef]
- Webber, W.; Moffat, A.; Zobel, J. A similarity measure for indefinite rankings. ACM Trans. Inf. Syst. 2010, 28, 1. [Google Scholar] [CrossRef]
- Heylighen, F.; Dewaele, J.M. Variation in the contextuality of language: An empirical measure. Foundations of science 2002, 7, 293–340. [Google Scholar] [CrossRef]
- MIT. Pantheon World. 2013. Available online: https://pantheon.world.
- Yu, A.Z.; Ronen, S.; Hu, K.; Lu, T.; Hidalgo, C.A. Pantheon 1.0, a manually verified dataset of globally famous biographies. Scientific Data 2016, 3, 150075. [Google Scholar] [CrossRef] [PubMed]
| 1 | Function words, commonly known as stop words, carry little lexical meaning and primarily serve grammatical functions (e.g., prepositions, pronouns, auxiliary verbs, conjunctions, and articles). In contrast, content words such as nouns, verbs, adjectives, and adverbs convey substantive meaning. |










| Journal | Abbreviation | Scope |
|---|---|---|
| Phys. Rev. A. | PRA | Atomic, molecular, |
| optical physics and | ||
| quantum information | ||
| Phys. Rev. B. | PRB | Condensed matter and |
| material physics | ||
| Phys. Rev. C. | PRC | Nuclear physics |
| Phys. Rev. D. | PRD | Particles, fields, |
| gravitation and cosmology | ||
| Phys. Rev. E. | PRE | Statistical, non-linear, |
| biological and soft | ||
| matter physics | ||
| Phys. Rev. X. | PRX | Broad subject |
| coverage encouraging | ||
| communication across | ||
| related fields | ||
| Phys. Rev. Lett. | PRL | Fundamental research |
| on all topics | ||
| related to all | ||
| fields of physics | ||
| Rev. Mod. Phys. | RMP | The full range |
| of applied, fundamental, | ||
| and interdisciplinary | ||
| physics research topics |
| Journal | Years | Articles | Content Words | Total Words |
|---|---|---|---|---|
| PRA | 1970 - 2017 | 105,383 | 1,267,089 | 24,103,820 |
| PRB | 1970 - 2017 | 241,135 | 2,785,162 | 14,774,108 |
| PRC | 1970 - 2017 | 57,609 | 1,456,679 | 26,638,407 |
| PRD | 1970 - 2017 | 110,288 | 837,719 | 37,122,795 |
| PRE | 1993 - 2017 | 86,743 | 2,025,377 | 29,616,401 |
| PRL | 1959 - 2017 | 123,217 | 1,841,870 | 18,548,599 |
| PRX | 2012 - 2017 | 1,153 | 1,848,094 | 29,704,075 |
| RMP | 1930 - 2017 | 4,593 | 266,052 | 1,256,168 |
| Rank | PRA | PRB | PRC | PRD | PRE |
|---|---|---|---|---|---|
| 1 | state | energy | energy | mass | model |
| 2 | states | temperature | mev | energy | system |
| 3 | energy | state | data | model | function |
| 4 | quantum | magnetic | state | field | case |
| 5 | field | field | states | case | results |
| 6 | function | states | model | order | energy |
| 7 | case | results | cross | function | phase |
| 8 | system | function | results | theory | number |
| 9 | results | model | nuclear | terms | different |
| 10 | number | phase | values | results | values |
| 11 | phase | surface | energies | data | equation |
| 12 | wave | structure | function | form | order |
| 13 | values | spin | calculations | gauge | value |
| 14 | laser | shown | experimental | gev | density |
| 15 | electron | case | nuclei | value | shown |
| 16 | potential | different | potential | values | distribution |
| 17 | shown | order | obtained | equation | field |
| 18 | different | system | reaction | large | state |
| 19 | atoms | values | scattering | same | same |
| 20 | density | density | neutron | term | dynamics |
| PRA | PRB | PRC | PRD | PRE | |||||
|---|---|---|---|---|---|---|---|---|---|
| state | 1-2 | temperature | 2-22 | mev | 2-130 | mass | 1-23 | model | 1-2 |
| states | 2-4 | magnetic | 4-121 | data | 3-10 | order | 6-11 | system | 2-7 |
| quantum | 4-44 | surface | 11-31 | cross | 7-26 | theory | 8-32 | function | 3-5 |
| wave | 12-42 | structure | 12-51 | nuclear | 9-409 | terms | 9-31 | results | 5-6 |
| laser | 14-295 | spin | 13-55 | energies | 11-71 | form | 12-33 | phase | 7-9 |
| electron | 15-22 | transition | 22-29 | calculations | 13-45 | gauge | 13-951 | number | 8-9 |
| atoms | 19-39 | band | 26-81 | experimental | 14-30 | gev | 14-102 | different | 9-15 |
| atomic | 29-151 | due | 27-38 | nuclei | 15-1525 | large | 18-26 | equation | 11-16 |
| frequency | 30-80 | observed | 28-39 | obtained | 17-21 | term | 20-69 | value | 13-14 |
| matrix | 35-53 | lattice | 30-75 | reaction | 18-576 | limit | 21-49 | density | 14-19 |
| pulse | 45-384 | dependence | 32-69 | scattering | 19-34 | decay | 22-27 | distribution | 16-41 |
| corresponding | 46-55 | sample | 34-311 | neutron | 20-638 | quark | 23-200 | dynamics | 20-105 |
| beam | 47-72 | effect | 42-64 | kev | 21-678 | scalar | 26-648 | small | 21-32 |
| atom | 49-160 | found | 44-58 | interaction | 25-27 | equations | 27-36 | particles | 24-80 |
| approximation | 51-99 | shows | 51-60 | momentum | 26-43 | parameters | 28-38 | systems | 25-62 |
| optical | 55-135 | show | 53-56 | calculated | 27-38 | scale | 29-120 | behavior | 26-48 |
| set | 56-78 | along | 54-117 | present | 28-37 | fields | 32-139 | point | 28-51 |
| method | 57-62 | spectra | 55-71 | angular | 35-152 | space | 36-92 | velocity | 29-157 |
| hamiltonian | 59-115 | properties | 56-95 | level | 36-101 | functions | 37-39 | parameter | 33-37 |
| ionization | 62-1423 | electrons | 58-119 | excitation | 37-89 | models | 40-96 | particle | 35-65 |
| Word | Common Rank | English Rank |
|---|---|---|
| energy | 6 | 457 |
| results | 10 | 361 |
| function | 12 | 519 |
| values | 19 | 579 |
| model | 24 | 317 |
| value | 24 | 221 |
| state | 31 | 62 |
| same | 32 | 45 |
| different | 34 | 84 |
| obtained | 34 | |
| shown | 35 | 430 |
| number | 36 | 95 |
| case | 38 | 83 |
| potential | 38 | 729 |
| density | 41 | |
| data | 43 | 184 |
| large | 48 | 142 |
| parameters | 52 | |
| order | 55 | 108 |
| terms | 64 | 369 |
| 1 | 1.2890 |
| 2 | 1.000 |
| 10 | 0.3288 |
| 20 | 0.0397 |
| 400 | -1.2096 |
| 500 | -1.3026 |
| Physicist | Pantheon | PRA | PRB | PRC | PRD | PRE | PRL | PRX | RMP |
|---|---|---|---|---|---|---|---|---|---|
| Curie | 3 | 72 | 11 | 84 | 53 | 51 | 27 | 29 | 38 |
| Einstein | 2 | 12 | 26 | 41 | 2 | 13 | 11 | 12 | 8 |
| Faraday | 6 | 35 | 31 | 38 | 60 | 26 | 33 | 31 | 41 |
| Hawking | 5 | 92 | 150 | 148 | 11 | 130 | 60 | 76 | 59 |
| Newton | 1 | 48 | 75 | 37 | 25 | 31 | 59 | 53 | 47 |
| Orsted | 9 | 182 | 186 | - | 180 | - | 187 | - | - |
| Planck | 7 | 25 | 35 | 50 | 7 | 5 | 15 | 23 | 25 |
| Röntgen | 4 | 153 | 184 | - | 179 | - | 183 | - | 181 |
| Rutherford | 10 | 70 | 77 | 29 | 57 | 94 | 56 | 123 | 77 |
| Volta | 8 | 140 | 134 | 160 | 158 | 146 | 160 | 148 | 173 |
| Rank | PRA | PRB | PRC | PRD | PRE | PRL | PRX | RMP |
|---|---|---|---|---|---|---|---|---|
| 1 | Coulomb | Fermi | Coulomb | Higgs | Boltzmann | Fermi | Fermi | Fermi |
| 2 | Bose | Coulomb | Fermi | Einstein | Chen | Coulomb | Dirac | Coulomb |
| 3 | Fermi | Raman | Pauli | Dirac | Landau | Lee | Chen | Lee |
| 4 | Rabi | Landau | Dirac | Lorentz | Langevin | Raman | Landau | Landau |
| 5 | Bloch | Lee | Lee | Yukawa | Plank | Landau | Coulomb | Chen |
| 6 | Raman | Chen | Born | Wilson | Lee | Chen | Bose | Dirac |
| 7 | Stark | Dirac | Wigner | Planck | Rayleigh | Dirac | Lee | Anderson |
| 8 | Born | Anderson | Chen | Feynman | Coulomb | Anderson | Raman | Einstein |
| 9 | Wigner | Heisenberg | Landau | Schwarzschild | Maxwell | Bose | Bloch | Higgs |
| 10 | Dirac | Bloch | Gamow | Fermi | Debye | Higgs | Pauli | Smith |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).