This section discusses how experts are using metadata in their educational applications. We use the opportunity to introduce some basic knowledge about learning metadata, partly derived from the interviews in the form of questions – illustrated by quotes from the interviewed experts.
2.1. Practical Usage of Metadata - Why Experts Use Metadata and How
The digital learning process can be divided into four phases (Reichow et al. 2021) as shown in
Figure 1. We asked experts in which phase they used the metadata and for what purpose? All experts agreed that metadata is eventually required for all these phases, however, four experts specifically used metadata for phase 2, three experts mentioned using metadata in phases 1 and 4, and two experts used metadata in phase 3.
What functions do the experts seek to fulfill with the help of metadata?
Experts mentioned the following uses of metadata in the educational context:
Searching the educational content or offers
Recommendation of educational content or offers
Describing competencies or learning objectives
Manifesting features for AI based educational technologies like intelligent tutoring or personalized learning
One of the crucial functions of metadata is providing descriptive information that helps users find relevant resources more efficiently. It also helps in the reuse of learning resources. To find things more easily, some experts mentioned the importance of “context”.
The context can be added to metadata with the help of ontologies2 by providing a structured framework for representing and organizing knowledge about a specific domain. Ontologies define concepts, relationships, and properties within a domain, allowing for more precise and meaningful metadata representation.
Another important use of metadata is in recommending learning content to provide personalized and relevant suggestions to learners based on their interests, preferences, and learning goals (Reichow et al. 2022; Drachsler et al. 2015). Metadata associated with user profiles, course attributes, learner feedback and ratings, and contextual factors, can be used to recommend learning content at the time of need. For example, metadata related to learning pathways can be used to suggest a structured learning path to learners, ensuring their progress through courses in a meaningful and cohesive manner, e.g., from beginner to advanced levels. By utilizing metadata, the course recommendation system can generate personalized and relevant suggestions to learners, helping them discover courses that match their interests, level of expertise, and learning objectives.
In the mature phase of recommendation, one of the experts (E-USA2) envisioned the implicit recommendation of educational content at the point of need. This can be achieved based on the learning experience and with the help of a lot of metadata as suggested by the expert.
Another direction where we need metadata is to describe “competencies”, also called “competency frameworks”
. These are the “expectations” or “learning objectives” (the taxonomy of what one should learn). For example, if developing a curriculum or course, a student should show these “competencies” or “master these skills” after the course. Associating competency metadata with learning resources enables learners and organizations to identify the most relevant resources for developing and accessing specific competencies (competency-based search). E-USA3 mentioned the use of competencies from “Open Competency Framework Collaboration OCFC”
3, which is a domestically focused organization, but the standards defined can be used internationally. The expert also emphasised the use of metadata for manifesting the features to develop AI based educational systems like intelligent tutoring systems or personalized learning systems.
Which metadata fields do experts use for the description of learning materials or course offers on their platform?
Various metadata fields are collectively used to describe learning contents or course offerings like title, subject, and author information, etc. These fields provide a set of information to help learners, educators, and administrators organize and utilize learning materials effectively. However, finding an optimal set of metadata fields catering to the diverse usage of metadata in the educational setting is a challenging task.
E-USA1 described using the fields proposed by the IEEE-P2881 Learning metadata standard. A brief description of the standard is provided in
Appendix B. The standard is recently available with an open-source license. Further details and description of the metadata schema are available as an open-source standard by IEEE
4.
E-USA3 and E-UK mentioned using the LRMI standard for the description of learning material and course offers. E-UK also spoke about the CTDL standard for describing competencies and skills.
E-USA4 discussed the use of some metadata fields such as “study type”, “subject matter”, “course level”, “discipline”, “subject” (in a discipline), “topic”, and “learning outcome”. In addition, the expert also used tags to describe “language”, “accessibility level”, “authoring information”, “course number”, and “additional keywords” (if additional information needs to be added that can help like Bloom’s level for the resource).
E-USA2 mentioned to use fields like “topic”, “format”, “organization”, “source” etc. Additionally, the expert emphasized to include fields to capture the following information:
Type of the learning object (a PowerPoint presentation, an eLearning module, a manual, or a procedure for the headset display, etc.).
Compliance-related aspects of the object e.g., Google compliance.
Access rights and privacy, who has the right to see it?
Ownership, lifecycle, and maintenance: who is the owner of the things, what is the approval process, and who approves it? What is the retention policy? When will the resource expire? How often do we maintain it, and who is responsible for the maintenance?
Sequences, if there is a particular sequence. Where is the object positioned in the sequence? Should the user have seen another object before seeing this object?
Geography, important in some cases where different legal requirements may arise for different geographical locations.
Version/Change, helpful in the search for a finished product that is changed, there may be a need to reference back to the base product.
Tracking Information, metadata after publishing the objects for tracking.
Metadata recording formats and authoring tools
We also asked experts about how they get the correct information for the metadata fields, their preferred tools/languages to store that information, and whether are they using any metadata authoring tool. Mostly the metadata fields are filled manually by content providers and here lots of problems can occur. Therefore, experts were also seeking to fill the metadata fields automatically with the help of artificial intelligence or other techniques. Mostly the metadata are being stored using XML
5, RDF
6, JSON
7, and JSON-LD
8 machine readable formats. Some of the experts provided information about metadata authoring tools which are listed below.
Coping with problems in using metadata
As we can see using metadata provides a lot of benefits when implemented in a learning platform. However, there are a few challenges that were mentioned by the experts.
One major problem is how to get complete and accurate information for the metadata fields, Mostly, this is done manually where people fill in the metadata fields using existing information/domain knowledge of the learning resource. However, this manual effort is not favorable as mentioned by experts. First, it is required to equip people with a certain level of knowledge about the learning resources so that they can provide relevant and correct data to the metadata fields. In most cases, the data also needs to be consistent.
Second, the process is very time-consuming. Motivating people to fill all the relevant fields is a difficult task. E.g. “even though the expert thought that a metadata framework like SCORM is very robust and easy to adopt, individuals assigned to fill the fields do it with only minimal effort such that the results are not very useful.” Third, how to verify that the provided content is of high data quality? Has it been tagged accurately? To overcome these issues, experts stated that we need to train people to understand the importance and content of a taxonomy or a metadata field. Alternately, it is better employing the subject matter experts or teachers to tag their own resources to avoid inaccurate entries. Another approach could be to write metadata before the development of the actual learning resource including all the required information.
Automation techniques such as natural language processing (NLP) or AI algorithms can also help to streamline the process. Machine learning (ML) algorithms can be used to develop fully automated or semi-automated solutions for generating “data” for metadata fields from the learning resources. However, this could be challenging to ensure the quality of the tagging. AI may be good in extracting data but interpreting the meaning or context could be problematic. The problem can be solved using a human-machine collaborative form, where machines should suggest tags for the fields and humans just need to validate or interpret it if required. The AI-system has the role of an assistant service. Alternatively, aside from relying on AI or ML for metadata enrichment, an alternative approach is facilitating direct communication between two databases.
Another issue faced by the experts in their application scenarios is the use of non-standard metadata that causes the issue of interoperability of the system to another learning domain or partners, as people may use different metadata schemas for recording metadata. Ideally, common metadata fields should prevail, ensuring seamless transformation. However, executing such transformations can pose significant challenges. A solution lies in using standardized metadata schemas as well as standards to fill the metadata fields. Sometimes there is a problem in using fixed metadata schemas when switching from one domain to another. For example, one might not have a “level of organization” for middle school, but it may exist for higher education, so it needs to be added into the system at some point.
User Interface is another issue. What user interface on the screen is good to fill the metadata fields efficiently? There are various options, e. g. free text, fixed list items to the excel worksheets. If a lot of free text to be filled which is difficult, and people did not fill it out most of the time. A fixed list is also problematic, it is not dynamic and cannot be changed and if you change you need to retag everything.
2.2. Metadata Standards – for What Purposes do Experts Use Them
To date, several learning metadata standards exist that can facilitate educational institutions, content creators, and learners in various phases of digital learning or education process. For example, IEEE Standard for Learning Object Metadata (LOM) and Dublin Core metadata standards have been in practice from the last few decades for the description and presentation of learning resources (Barker and Campbell 2010). A brief description of metadata standards discussed by experts is given in
Appendix B.
E-USA1 discussed the use of Dublin Core and LOM standards as well as others in the development of new IEEE standard P2881-Learning Metadata Terms (LTM). LTM is extensively derived from Dublin Core, Learning Resource Metadata Initiative (LRMI), Credential Transparency Description Language (CTDL), and Schema.org with an objective to bridge different communities and standards, to serve as a common, shared vocabulary for describing various digital objects related to learning and teaching.
E-USA3 also talked about LRMI, Dublin Core, Schema.org as the same family of standards. The expert mentioned the use of SCORM for content management and the use of xAPI for the representation of learning activities. The major weakness of xAPI is that it offers a standard for communicating activities but does not provide a schema for describing these activities.
E-UK revealed the use of LRMI and CTDL standards. The LRMI allows us to describe the educationally significant characteristics and relationships of a resource. E.g. it allows us to describe the relationship of resources to competencies (teaches a particular competency or assesses a particular competency), the nature of resources such as whether it is an instructional video or a textbook, etc. It also encompasses details like the typical learning time, and target audience (which may differ from the ultimate beneficiaries). The LRMI can be used in recommender systems, allowing for more personalized and effective resource recommendations based on specific user preferences and needs.
The CTDL serves a multifaceted purpose. It enables the recording of skills required for a degree, details on how these skills are assessed, the relevance of the degree to specific occupations, the institution where the degree was pursued, and the duration of the study. Additionally, it is employed for describing the credentials themselves, the organizations offering the competencies, the assessment procedures, available programs and courses, pathways within a program, and aspects like micro-credentialing and badges. This allows for the development of pathways from one credential or badge to another, including stackable credentials that lead to larger qualifications, and considerations such as credit accumulation and credit transfer.
In addition, experts also talked about various learning standards and international standard development activities. Notable resources include:
-
- ○
The work includes SCORM, xAPI, Competency Data Standards, and more.
- ○
2.3. What Experts Think about Metadata Standards in General
In general metadata standards in education and learning ensure a wide range of benefits. They provide a consistent and structured way to describe and organize educational resources that make searching the relevant educational resources easier and quicker for educators and learners. Adopting standardized metadata ensures that different learning systems, platforms, and tools can communicate and exchange data seamlessly.
However, finding relevant standards that enable the integration of diverse resources and technologies, creating a cohesive learning experience is still problematic. There are several standards and selecting a specific standard may require a significant effort. Experts have different opinions about why there are so many metadata standards. According to experts, the reason for having so many standards could be:
Different application scenarios have different requirements for metadata, which can lead to the development of multiple standards that are tailored to specific needs.
Another reason for the proliferation of metadata standards is that different organizations working simultaneously come in similar time frame but slightly different priorities and goals without any coordination. For example, ISO and IEEE created a lot of standards but there was no ability to somehow work together and that is the reason they try to re-create things.
A different group of people with slightly different perspectives worked on developing standards. People think of objects differently, e.g., people from library science or people from the learning domain think another way, and this can also lead to different standards being developed that are optimized for different goals.
Varying regulations and governmental influences on educational metadata, which can differ across countries.
Despite having a lot of standards, very few of them are widely accepted by the community and most of them failed or they are outdated. This can happen for a variety of reasons, such as lack of awareness, lack of incentives to adopt, or lack of support from key stakeholders. One reason for failure may be that most individuals lack a clear understanding of metadata standards – what they are, their purpose, and their benefits. Also, a lot of work is done by volunteers and there are many opportunities for errors during the standard development process. Another reason could be that past experiences with metadata standards may not have been effective, leading to a perception of limited value. This could be due to inadequate delivery, perhaps excelling in one aspect like search but falling short in others.
While there are many standards that have failed, there are also many that have been successful and continue to be widely used today. For example, LRMI is widely employed internally by publishers. Overall, the development of learning metadata standards is a complex and ongoing process that is influenced by a variety of factors. The field is still in the early stages of understanding the relationships among various elements, individuals, and their functionalities, as well as delineating boundaries and distinctions.