Version 1
: Received: 25 February 2024 / Approved: 26 February 2024 / Online: 26 February 2024 (12:12:43 CET)
How to cite:
Gimenez-Abalos, V.; Oliva-Felipe, L.; Vázquez-Salceda, J.; Cortés, U.; Alvarez-Napagao, S. Why Interpreting Intent Is Key for Trustworthiness in the Age of Opaque Agents. Preprints2024, 2024021446. https://doi.org/10.20944/preprints202402.1446.v1
Gimenez-Abalos, V.; Oliva-Felipe, L.; Vázquez-Salceda, J.; Cortés, U.; Alvarez-Napagao, S. Why Interpreting Intent Is Key for Trustworthiness in the Age of Opaque Agents. Preprints 2024, 2024021446. https://doi.org/10.20944/preprints202402.1446.v1
Gimenez-Abalos, V.; Oliva-Felipe, L.; Vázquez-Salceda, J.; Cortés, U.; Alvarez-Napagao, S. Why Interpreting Intent Is Key for Trustworthiness in the Age of Opaque Agents. Preprints2024, 2024021446. https://doi.org/10.20944/preprints202402.1446.v1
APA Style
Gimenez-Abalos, V., Oliva-Felipe, L., Vázquez-Salceda, J., Cortés, U., & Alvarez-Napagao, S. (2024). Why Interpreting Intent Is Key for Trustworthiness in the Age of Opaque Agents. Preprints. https://doi.org/10.20944/preprints202402.1446.v1
Chicago/Turabian Style
Gimenez-Abalos, V., Ulises Cortés and Sergio Alvarez-Napagao. 2024 "Why Interpreting Intent Is Key for Trustworthiness in the Age of Opaque Agents" Preprints. https://doi.org/10.20944/preprints202402.1446.v1
Abstract
This paper addresses the critical issue of trust in Artificial Intelligence systems, especially when users might find it challenging to comprehend the internal decision-making processes of such systems. A relevant topic of research in this respect is Theory of Mind, which involves attempting to understand these systems as if they possessed beliefs, desires, and intentions. We focus on the latter: intentions, and we examine how producing explanations based on them can improve \emph{understandability} while allowing for a better interpretation of how they align with human values. We also review some existing methods for identifying intentions in AI systems and we conclude with a discussion on possible future directions in this line of research.
Keywords
explainability, intention inference, values alignment, multi-agent systems
Subject
Computer Science and Mathematics, Artificial Intelligence and Machine Learning
Copyright:
This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.