Citation: Yiying Deng, Sicun Song, Junxuan Fan, Mao Luo, Le Yao, Shaochun Dong, Yukun Shi, Linna Zhang, Yue Wang, Haipeng Xu, Huiqing Xu, Yingying Zhao, Zhaohui Pan, Zhangshuai Hou, Xiaoming Li, Boheng Shen, Xinran Chen, Shuhan Zhang, Xuejin Wu, Lida Xing, Qingqing Liang, Enze Wang. Paleontology Knowledge Graph for Data-Driven Discovery. Journal of Earth Science, 2024, 35(3): 1024-1034. doi: 10.1007/s12583-023-1943-9
A knowledge graph (KG) is a knowledge base that integrates and represents data based on a graph-structured data model or topology. Geoscientists have made efforts to construct geoscience-related KGs to overcome semantic heterogeneity and facilitate knowledge representation, data integration, and text analysis. However, there is currently no comprehensive paleontology KG or data-driven discovery based on it. In this study, we constructed a two-layer model to represent the ordinal hierarchical structure of the paleontology KG following a top-down construction process. An ontology containing 19 365 concepts has been defined up to 2023. On this basis, we derived the synonymy list based on the paleontology KG and designed corresponding online functions in the OneStratigraphy database to showcase the use of the KG in paleontological research.
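As a rough illustration of the ideas in the abstract, the sketch below models a two-layer graph (a schema layer of concepts and a data layer of triples) and derives a synonymy list from it. It is a minimal sketch under stated assumptions, not the authors' implementation: the concept names, the placeholder taxon names, the relation labels `instance_of` and `synonym_of`, and the helper functions are all hypothetical and do not come from the paper or the OneStratigraphy database.

```python
from collections import defaultdict

# Schema layer (hypothetical): each concept maps to its parent concept,
# built top-down from the root.
SCHEMA = {
    "Paleontology": None,
    "Taxonomy": "Paleontology",
    "Taxon": "Taxonomy",
}

# Data layer (hypothetical): (subject, relation, object) triples.
TRIPLES = [
    ("Name A", "instance_of", "Taxon"),
    ("Name B", "instance_of", "Taxon"),
    ("Name C", "instance_of", "Taxon"),
    ("Name B", "synonym_of", "Name A"),
    ("Name C", "synonym_of", "Name B"),
]


def concept_path(concept):
    """Return the chain of concepts from the root down to `concept`."""
    path = []
    while concept is not None:
        path.append(concept)
        concept = SCHEMA[concept]
    return list(reversed(path))


def synonymy_list(triples):
    """Group synonym names under the name they ultimately resolve to."""
    points_to = {s: o for s, r, o in triples if r == "synonym_of"}

    def resolve(name):
        # Follow synonym links to the end; `seen` guards against cycles.
        seen = set()
        while name in points_to and name not in seen:
            seen.add(name)
            name = points_to[name]
        return name

    groups = defaultdict(list)
    for name in points_to:
        groups[resolve(name)].append(name)
    return {accepted: sorted(names) for accepted, names in groups.items()}


if __name__ == "__main__":
    print(concept_path("Taxon"))
    print(synonymy_list(TRIPLES))
```

Keeping the hierarchy and the triples in separate structures mirrors the two-layer idea: the schema can be revised without touching the recorded names, and chained synonyms are resolved to a single accepted name when the list is derived.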