1. Introduction
Research on digital twins has been ongoing both domestically and internationally, with various definitions presented by international organizations and researchers. it defined a digital twin as ’a virtual representation of a physical object used to optimize performance using real-time data’ [
1]. Also the European Commission’s JRC Technical Report categorizes these definitions into standard, industry, and scientific literature [
2]. After comparing and reviewing these definitions, common and key terms such as "virtual representation," "physical object," "real-time data," and "optimizing performance" were identified [
3]. Consequently, digital twins can be categorized by 'semantics,' 'means,' and 'purpose.' Considering the information comprehensively, a digital twin can be defined as a "representation of physical objects in a virtual environment that utilizes real-time data to achieve optimal performance."
Digital twins are applied across a wide range of fields. Nativi et al. explored their use in manufacturing, energy, smart cities, farming, building, and healthcare [
2]. Rasheed et al. reviewed their application in health, manufacturing process technology, meteorology, education, urban areas, transportation, and energy sectors [
4]. Fuller et al. focused on their application in smart cities, manufacturing, and healthcare [
5], while Barricelli et al. highlighted their use in manufacturing, aviation, and healthcare [
6] as shown in
Table 1. Based on this discussion, the manufacturing sector is a prominent field where digital twins are utilized. Additionally, digital twins are frequently employed in urban areas, healthcare, and medical sectors. For instance, some studies suggest that Michael Graves first introduced the concept of the digital twin in 2002 in the context of product lifecycle management [
7]. From another perspective, transportation and energy may also be considered part of the urban sector.
Based on the definitions and application fields of digital twins, it can be concluded that digital twin data must be both utilized and reliable, as digital twins are applied across various domains. In other words, the success of digital twin projects depends significantly on the effective use of digital twin data.
A common obstacle in data utilization is a lack of understanding of the data and its applications, which hinders active data use in other fields or applications. Specifically, smart city and digital twin projects in South Korea aim to collect data for specific areas or fields to achieve particular objectives or services. However, the interoperability of data across various services remains weak. To address these limitations, it is essential to establish a method for actively sharing and utilizing digital twin data based on international standards. Additionally, a plan for developing digital twin data should be devised by utilizing publicly available data in conjunction with adherence to international standards.
In this study, digital twin basic data is defined as follows.
The basic data for digital twins should support interoperability between heterogeneous systems without constraints and be provided in a machine-readable format with a well-established foundation. Additionally, accessibility should be ensured to allow easy manipulation of digital twin data by all users. This can be described in detail as shown in
Table 2.
Therefore, this study aims to propose a method for constructing basic digital twin data in South Korea by adhering to international standards and utilizing publicly available data. Specifically, the study focuses on designing and proposing a digital twin data model for buildings, which are the most applicable among the basic digital twin data. Additionally, it aims to optimize the understanding, access, management, and updating of digital twin basic data to support effective application and implementation in smart city and digital twin projects in South Korea.
The structure of this study is as follows: The first section provides essential background information, introduces concepts and requirements related to digital twin basic data, and offers a brief overview of CityGML. The second section explains the methodology and data used in this study. The third section presents the main findings: the selection of public data (buildings) for constructing digital twin basic data, the mapping process using CityGML, and the creation of UML diagrams. The fourth section discusses these findings. Finally, the conclusion and suggestions for future research are provided.
2. Materials and Methods
This study establishes the fundamental guidelines for the design of a building digital twin data model as follows. The key approach is to emphasize the importance of 'data standardization' and to aim for a data-centric digital twin platform. The goal is to transition from the current data ecosystem—characterized by non-real-time, database-centric, and standardized data in specific areas—to one that enables the creation of new value and services through the convergence of real-time (sensor, etc.) and location data from various areas of the city. Developing smart city-related services in the country requires a primary understanding of data, with the challenge being to process the data structure (format) to suit its intended use. Currently, weak data interoperability between various domains (or services) limits the sharing, distribution, and utilization of application services and private data.
This study aims to create a digital twin data ecosystem that addresses these issues and meets the diverse demands of smart cities by enabling the activation of the smart city/digital twin data domain (such as buildings). The digital twin basic data serves as a conceptual tool for digitizing and constructing a database of indoor and outdoor, underground and aboveground, static and dynamic objects and phenomena in the real world.
To ensure consistent and systematic construction of digital twin basic data, this study establishes guidelines based on four key principles: continuous updating, multi-perspective support, scalability assurance, and standardization. First principle is continuous renewal of configuration. The identification and construction of objects required for the digital twin cannot be completed in a single operation. These objects need to be continuously updated and added based on changes in the urban environment and advancements in application technology. Therefore, it is necessary to create a feature catalog to facilitate the ongoing updating and addition of objects required for digital twin services, thus promoting continuous data management. Second principle is configuring multiple perspective support. In cities, the required attributes and representation methods for the same object can vary depending on the area of interest or activity. Consequently, it is essential to consider the diversity of urban environments and address the varying requirements of service users and stakeholders when managing digital twin basic data. Third principle is ensuring scalability in configuration (definition of scalability rule). The digital twin basic data is designed to be open-ended rather than complete, allowing for the continuous expansion of the data model through defined rules for its extension. In other words, the abstract urban data model cannot be finalized at a specific point in time, as it must continuously evolve in response to changes in the urban environment, administrative factors, and emerging technologies. Therefore, ensuring scalability is essential. Fourth principle is standardization-based configuration. In the field of smart cities, a digital twin should visualize diverse modeled data (or connected/converged data) through the creation of spaces and objects. Therefore, adherence to international standards, such as spatial data, is necessary during the pre-data construction process. However, if the content specified by international standards differs from the contextual environment of the digital twin, the standards will be adapted and applied accordingly.
In this study, a method consisting of four steps was used to establish the digital twin basic data. These steps included selecting a reference model, choosing public data, conducting the mapping process using the international standard CityGML, and creating UML diagrams as shown in
Figure 1.
3. Results
3.1. Selection the Reference Model
A digital twin data model should be able to accommodate information from multiple domains and be easily compatible with other three-dimensional (3D) data and application systems. Therefore, selecting a reference framework that meets these criteria is crucial. For this purpose, this study has selected the most widely used 3D data models across various fields. The models used for analysis include IFC, LandXML, 3DF-GML, KML, CityGML, and glTF, as specified in standard documents and research reports.
The items used for comparison and analysis between models include: Geometry: Contains 3D geometric information. Topology: Contains topological information. Texture: Supports the mapping of real-life captured images onto spatial objects. Level of Detail (LoD): Indicates the level of detail of 3D objects. Semantic Information: Reflects the conceptual data model. Support for Attributes and Georeferencing (geo ref.): Includes attributes and georeferencing of data [
8] as shown in
Table 3.
After comparing and analyzing 3D data models based on the criteria mentioned, it was found that CityGML 2.0 is the most suitable reference model for building digital twin basic data. CityGML 2.0 supports all of the evaluated criteria and allows for flexible model extensions. Additionally, CityGML comprehensively covers various urban objects, including bridges, tunnels, and roads, beyond just buildings [
9], which are the focus of this study. It facilitates efficient modeling of digital twin objects at both macro and micro scales. Furthermore, OGC's official standards, which address international spatial information standards, ensure a high level of reliability.
As previously stated, CityGML is a data exchange format for representing 3D cities and landscapes, independent of any specific application. Sometimes, it may be necessary to extend the CityGML model by adding additional information. There are two main methods to achieve this. The first method involves adding CityGML objects to the application framework. The second method involves incorporating specific information into the CityGML instance document. If the information to be added follows the format defined by the CityGML schema, the second method is more effective. This method is divided into two approaches: one utilizing generic attributes and objects, and another utilizing CityGML Application Domain Extensions (ADEs). All objects and attributes in CityGML are defined through standards, but the specific names and data types of generic attributes and objects are not explicitly defined. Consequently, there are limitations in ensuring semantic interoperability of the model, as comprehensive attributes and objects cannot be validated through XML. In contrast, ADEs offer the advantage of adding information to the application in a more systematic manner.
The Building module of CityGML 2.0 represents various characteristics of a building, including its class, function (residential, public, industrial), purpose, construction year, demolition year, roof type, measured height, and the number of above-ground and underground floors. The building model allows for detailed structuring through different LoD. In the Building module, the _AbstractBuilding class is central to this representation. The range of LoD for each geometric and semantic theme within the _AbstractBuilding class is described as shown in
Table 4.
3.2. Selection the Public Data
To design a building digital twin data model, it is crucial to integrate and extend South Korea-specific fundamental spatial information (2D) attributes into the CityGML 2.0 schema. A key aspect of this process involves applying the Application Domain Extension (ADE) method to enable seamless integration between 2D and 3D data. For this purpose, we will utilize the 2D building data model attributes defined in the "National Base Map" established by the National Geographic Information Institute of Korea. These attributes include 'Class, Usage, Year of construction, Year of demolition, Measured height, Roof type, Building definition ID, Building name, Building main number, Building sub number, Poi id, Parcel number, Road name, Road name code, UFID (Unique Feature Identifier), Zip code,' among others. Among these, 'Class, Usage, Year of construction, Year of demolition, Measured height, Roof type' directly correspond to existing attribute items specified in the CityGML 2.0 Building model, allowing their values to be used without modification. The other attributes, such as 'Building definition ID, Building name, Building main number, Building sub number, Poi id, Parcel number, Road name, Road name code, UFID (Unique Feature Identifier), Zip code’ will be mapped as separate classes within the Building model using the ADE method. Integrating these additional attributes through ADE allows the building digital twin data model to comprehensively represent the unique characteristics and features of Korean buildings, ensuring a seamless connection between 2D and 3D spatial data. This approach enhances the usability and accuracy of the building data model and promotes the standardization and interoperability of geospatial information in urban planning and development contexts [
10].
The core of the building digital twin data model lies in its height attributes. Therefore, we plan to utilize the 'Height Information Database' established by the National Geographic Information Institute of Korea for these attributes. The attributes defined in the 'Height Information Database' include 'Building furniture maximum height, Building ground height, Building height, Building maximum height’. Since these attributes do not directly match any existing attributes in the CityGML 2.0 Building model, we will apply the ADE method to map them as separate classes within the Building model. By integrating these height-related attributes through ADE, the building digital twin data model can accurately represent the vertical dimensions of buildings, which is essential for various urban planning and development applications. This approach ensures that the model is both comprehensive and interoperable, aligning with international standards while addressing specific regional requirements [
11].
After examining public data related to the building sector in South Korea for the purpose of designing building digital twin data model using the CityGML 2.0 Building module, it was determined that the 'GIS (Geographic Information System) Integrated Building Information' provided by the Ministry of Land, Infrastructure, and Transport in South Korea is the most suitable option. ‘GIS Integrated Building Information’ is a public dataset constructed in spatial data (shp) format, representing both the shape and attribute information of buildings [
12]. Furthermore, as designated by South Korea's national core data, it is considered highly suitable for this study, given its accessibility and usability. 'GIS Integrated Building Information' integrates building spatial data based on continuous cadastral polygon information. This dataset in-cludes the following information: 'Original drawing ID, GIS building integration identifier, Unique number, Legal district code, Legal district name, Lot number, Special land code, Special land category, Building use code, Building use name, Building structure code, Building structure name, Building area, Date of use approval, Floor area, Land area, Height, Building coverage ratio, Floor area ratio, Building ID, Violation building status, Reference system link key, Data reference date'. The design was refined by excluding redundant information and adding only newly expanded items for mapping between CityGML 2.0 building models and GIS Integrated Building Information: 'Usage (ADE), Class (ADE), Use approve date, Area, Land area, Building coverage ratio, Floor area ratio, Building ID, Illegal building, Data date’.
3.3. Mapping Process
Before mapping, it is essential to understand the Building model within CityGML 2.0. The '_AbstractBuilding' class plays a pivotal role among all classes, encapsulating characteristics such as building type, function (residential/public/industrial), usage, construction year, demolition year, roof type, measured height, and the number of above and below-ground floors. Therefore, the extension method proposed in this study using the ADE technique is designed in relation to the '_AbstractBuilding' class. The 'Building' class represents a single building composed of the same structural segments (roof, floors, entrances, etc.). The 'BuildingPart' class models structural parts of a building, representing building parts, which must be associated with either a single building or a building part object. The 'Build-ingInstallation' class represents external components that influence the building's ex-ternal characteristics, such as chimneys, staircases, antennas, balconies, stairs, and roofs. The '_BoundarySurface' class represents the surfaces constituting the building's interior and exterior, including roofs, grounds, external ceilings, external floors, windows, entrances, floors, internal and external walls, and ceilings. The '_Opening' class represents entrances on internal and external boundary surfaces, such as chimneys, doors, and windows. The 'Room' class represents interior rooms of buildings, consisting of entrances, floors, ceilings, and internal walls, and can be represented as LoD 4 Solid or MultiSurface. Lastly, the 'IntBuildingInstallation' class is used for permanent, non-movable internal objects attached to the building structure, such as internal staircases, railings, and pipes [
13].
The result of the mapping process of digital twn data model related to buildings is as shown in
Table 5.
3.4. Creat UML
According to ISO 19101-1:2014, spatial information modeling refers to a series of processes that recognize a part of the real world, limited within a discussion domain, as a spatial information application context and define it as a conceptual model [
14]. The conceptual model is expressed in the form of a conceptual schema using UML diagrams. The procedures and methods for defining the rules and UML profiles necessary to convert the conceptual model of spatial information into an implementation schema follow the standards of ISO 19103:2015 [
15] and ISO 19109:2015 [
16].
In this study, as demonstrated in the previous process, we reference the 2D data model attributes included in the NGII's 'National Base Map' and 'Height Information Database'. Additionally, we utilize South Korea's public GIS Integrated Building Information as an extension element of the building digital twin data model. As previously described, attributes not present in CityGML 2.0 are added as new subclasses to the CityGML 2.0 Building model (_AbstractBuilding) using the ADE method. For attributes with different English labels but identical properties between the extension elements and CityGML 2.0, we standardize them using international standard terms. The data model defined through mapping between the CityGML 2.0 Building model and the extension elements is illustrated using UML in
Figure 2. In
Figure 2, the classes highlighted in gray represent the newly extended elements based on the ADE method in the data model.
The detailed examination of the building digital twin data model is as follows. As previously described, attributes included in NGII's 'National Base Map' have been extended using the ADE method, resulting in the 'NDTBuilding_Building2Dproperty'. Additionally, to further extend building height information, the NGII's 'Height Information Database' was referenced to expand the CityGML 2.0 Building model, resulting in the 'NDTBuilding_BuildingHeightValue' as show in
Figure 3.
Next, to define the core attribute information of the building digital twin data model based on public data, we have further expanded the attributes related to 'GIS Integrated Building Information' (PublicDataGIS). The detailed contents of this expansion are illustrated in
Figure 4.
4. Discussion
The method of defining a digital twin data model allows real-world objects to be represented in various forms in the virtual world. For example, the same real-world object can appear in different forms and have different attributes in the virtual world depending on the digital twin data model used. These data models influence spatial analysis methods and systems that utilize spatial analysis.
Due to the unique characteristics and broad applicability of digital twin data models, various spatial information-related data models currently exist. In particular, 3D spatial information is expected to be highly versatile compared to 2D spatial information, serving as an important tool for developing noise maps, analyzing 3D urban landscapes, and planning smart cities that connect real and virtual spaces. Recently, there has been a demand for high-precision data and rich semantic information that allows devices to independently recognize spatial situations, particularly in building-related data within digital twin frameworks. Therefore, establishing principles and proactive design processes for digital twin data models is essential to expand their use and ensure interoperability in building-related applications. In this context, the building digital twin data model proposed in this study is designed by integrating unique aspects such as building attribute information from Korea’s fundamental spatial information while adhering to the international standard of OGC’s CityGML 2.0. This approach enhances the accuracy of building-related digital twin data and supports the use of digital twin services in both public and private sectors by enabling various spatial analyses.
Following the research procedures, the study presents the following implications:
Selection of the reference model: This study utilizes the reference model (CityGML) based on international standards like those from OGC, rather than a vendor-specific model. This allows the creation and use of open data applicable in any country, including South Korea, ensuring interoperability between heterogeneous systems and data.
Selection of public data: While the reference model CityGML, established as an international standard, provides guidelines for shape, additional and expanded attribute information tailored to the building conditions of each country (e.g., South Korea) is necessary. This study proposes utilizing publicly available data for attribute information, facilitating easy access, acquisition, processing, and use by both public and private sectors.
Mapping process with CityGML 2.0: Using the ADE (Application Domain Extension) technique officially guided by CityGML, this study demonstrates how building attribute information can be extended into the CityGML 2.0 Building model. This offers an effective mapping method for practical physical model construction or derivative services in various fields.
Creation of UML diagram: By mapping basic spatial information and public data related to buildings in South Korea to the CityGML 2.0 Building model, this study presents a UML diagram for utilizing the building digital twin data model. This simplifies the framework process for building data construction and enhances efficiency in data construction and distribution at the national level.
Alongside these implications, the study acknowledges several research limitations that need further consideration. First, with the advent of more advanced versions like CityGML 3.0, additional measures should be devised to flexibly incorporate these advancements. Second, the added and expanded attribute information in this study primarily pertains to South Korea’s basic spatial information and public data. To further enhance usability, expanding the range of ADE to include diverse attribute and shape information related to buildings is necessary. Third, since the version presented in this study corresponds to a conceptual/logical model, there must be a capability to transform it into a physical data model for practical application in building-related systems.
5. Conclusions
Spatial information data models determine how and what real-world spatial information is represented. Therefore, spatial information data models play a crucial role in the visualization and spatial analysis capabilities of systems, with the performance of these systems varying based on the data model used. In this context, while data models for 2D spatial information have been developed in South Korea, there is a notable absence of data models for 3D spatial information, which is anticipated to have higher utility in the future.
Given this background, the objective of this study is to design a building digital twin data model, utilizing UML, focusing on buildings, which have the highest applicability among digital twin data. This study has designed a building digital twin data model that adheres to international standards, ensures interoperability with South Korea's existing fundamental spatial information and public data, and can be utilized across various fields. Based on the results of this study, it is expected that digital twin data models can be designed and implemented for various domains such as roads and bridges, in addition to buildings. Furthermore, there is a need to develop and establish these models as official standards, such as national standards (Korean Industrial Standards, KS) or as standards by organizations like South Korea's TTA (Telecommunications Technology Association).
6. Acknowledgments
This work was supported by Institute for Information& communications Technology Promotion(IITP) grantfunded by the Korea government(MSIP) (No.2022-0-00622, Digital Twin Testbed Establishment).
References
- Park, J.; Choi, W.; Jeong, T.; Seo, J. Digital twins and land management in South Korea. Land Use Policy 2023, Vol.124, p.106442. [CrossRef]
- Nativi, S.; Delipetrev, B.; Craglia, M. Destination earth. European Commission JRC Technical Report 2020.
- Jeong, D.; Park, J.; Choi, W.; Cho, K.; Hong, S. Comparative analysis of digital twin trends in urban area. Journal of Cadastre & Land InformatiX 2023, Vol.53, pp.123-137. [CrossRef]
- Rasheed, A.; San, O.; Kvamsdal, T. Digital twin: values, challenges and enablers from a modeling perspective. IEEE Access, 2020; 8, 21980–22012. [Google Scholar] [CrossRef]
- Fuller, A.; Fan, Z.; Day, C.; Barlow, C. Digital twin: Ena-bling technologies, challenges and open research. IEEE Access, 2020; 8, 108952–108971. [Google Scholar] [CrossRef]
- Barricelli, B.R.; Casiraghi, E.; Fogli, D. A survey on digital twin: Definitions, characteristics, applications, and design implications. IEEE Access, 2019; 4, 167653–167671. [Google Scholar] [CrossRef]
- Grieves, M.; Vickers, J. Digital twin: mitigating unpredictable, undesirable emergent behavior in complex systems. Transdisciplinary Perspectives on Complex Systems, 2016; 17, 85–113. [Google Scholar] [CrossRef]
- Kim, B.; Jeong, D.; Hong, S. Constructing 3D Geo-spatial Model for Building Data. Journal of the Korean Society for Geospatial Information Science, 2018; 26, 57–67. [Google Scholar]
- Jeong, D.; Shin, D. A Study on Development of 3D Data Model for Underground Facilities Using CityGML ADE. Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography, 2021; 39, 245–252. [Google Scholar] [CrossRef]
- NGII. Strengthening the use of institutional standards and expanding national standardization. National Geographic Information Institute Research Report, Suwon, South Korea, 2020.
- NGII. Creation of 3D spatial information to establish the foundation for digital twins. National Geographic Information Institute Research Report, Suwon, South Korea, 2020.
- Public data portal in South Korea. Available online: https://www.data.go.kr/ (accessed on 29 December 2023).
- OGC. OGC 12-019; OGC city geography markup language (CityGML) encoding standard. Open Geospatial Consortium (OGC): Arlington, USA, 2012. [Google Scholar]
- ISO. ISO 19101-1:2014; Geographic Information — Reference model. International Organization for Standardization (ISO): Geneva, Switzerland, 2014. [Google Scholar]
- ISO. ISO 19103:2015; Geographic Information — Conceptual schema language. International Organization for Standardization (ISO): Geneva, Switzerland, 2015. [Google Scholar]
- ISO. ISO 19109:2015; Geographic Information — Rules for application schema. International Organization for Standardization (ISO): Geneva, Switzerland, 2015. [Google Scholar]
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).