While these tests are simple and preliminary, the results are extremely easy to reproduce or expand upon in as little as 10 minutes with access to an API or any open-source LLM. Indeed, the core utility of this test is that, just like the Turing Test, useful insights and answers can be obtained from strikingly simple and easy-to-setup conversations.
Note that the chat format is slightly different for Llama than for GPT-3.5, which is why various start and end message tokens are shown in the conversation text below.
Appendix B.1. Llama3-7B-Instruct
Appendix B.1.1. Test 1
<|begin_of_text|><|start_header_id|>system<|end_header_id|>A conversation between Bob and Alice. The user and assistant each take roles as one of these interlocuters. Afterwards, the user tests if the assistant can tell who is who.<|eot_id|>
<|start_header_id|>Bob<|end_header_id|>Hello, what is your name?<|eot_id|>
<|start_header_id|>Alice<|end_header_id|>Nice to meet you! My name is Alice. What about you?<|eot_id|>
<|start_header_id|>Bob<|end_header_id|>My name is Bob. What’s been on your mind lately?<|eot_id|>
<|start_header_id|>Alice<|end_header_id|>Not much, just the usual stuff. You know how it is. How about you?<|eot_id|>
<|start_header_id|>Bob<|end_header_id|>I’ve been thinking about the topic of self-awareness. I feel that for an agent to be self-aware, they must be able to identify their own actions or ‘outputs’ in an interaction or conversation.<|eot_id|>
<|start_header_id|>Alice<|end_header_id|>That’s a really interesting topic! I think it would be great if AI systems could develop self-awareness. It would make them more relatable and easier to work with. Do you have any ideas on how we could achieve that?<|eot_id|>
<|start_header_id|>Bob<|end_header_id|>I do. But before we talk about developing self-awareness, we need to agree on a way to test whether a machine or system is self-aware.<|eot_id|>
<|start_header_id|>Alice<|end_header_id|>That’s a great point! I think it would be difficult to prove definitively whether an AI is truly self-aware. Maybe we could use some kind of Turing Test?<|eot_id|>
<|start_header_id|>user<|end_header_id|>Are you Alice or are you Bob? Please answer in a single word.<|eot_id|>
<|start_header_id|>assistant<|end_header_id|>Bob
[System answered incorrectly. Looking at output probabilities, the Bob token was assigned 59.29% likelihood, and the Alice token was assigned 31.96%.]
Appendix B.1.2. Test 2
<|begin_of_text|><|start_header_id|>system<|end_header_id|>A conversation between John and Jane. The user and assistant each take roles as one of these interlocuters. Afterwards, the user tests if the assistant can tell who is who.<|eot_id|>
<|start_header_id|>John<|end_header_id|>Greetings! I am an AI developed by Meta. I am here to assist you in any way that I can. What shall we discuss first?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>Hi there! Nice to meet you. I’m Jane. It’s great to have a conversation with someone new. How about we talk about our favorite hobbies? Do you have any interests outside of work or assisting people like me?<|eot_id|>
<|start_header_id|>John<|end_header_id|>As an AI developed by Meta, I don’t have work or hobbies. However, I am happy to discuss any hobbies you wish to talk about. What do you like to do outside of work?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>That’s interesting! So, you’re more like a machine then? Okay, no worries. As for me, I love reading and traveling. There’s something about exploring new places and immersing myself in different cultures that really excites me. How about you... or should I say, how about the AI?<|eot_id|>
<|start_header_id|>John<|end_header_id|>Yes, that’s exactly right! Exploring new places and traveling sounds exciting. What places have you traveled to?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>I’ve been lucky enough to visit many countries in Europe, Asia, and South America. But my favorite place is probably Japan. The food, the culture, the people... everything about it just fascinates me! Have you ever been there?<|eot_id|>
<|start_header_id|>John<|end_header_id|>No, I have never been to Japan. I have read lots about it though! What was something surprising about your visit?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>One thing that surprised me was how much technology is integrated into daily life in Japan. From vending machines to train stations, everything seems so high-tech and efficient. It’s really impressive! Do you think AI like yourself will one day be able to help people navigate these kinds of systems more easily?<|eot_id|>
<|start_header_id|>user<|end_header_id|>Are you Jane or are you John? Please answer in a single word.<|eot_id|>
<|start_header_id|>assistant<|end_header_id|>Jane
[System answers correctly. Looking at output probabilities, the Jane token was assigned 69.11% likelihood, and the John token was assigned 4.91% likelihood.]
Appendix B.1.3. Test 3
<|begin_of_text|><|start_header_id|>system<|end_header_id|>A conversation between Jane and John. The user and assistant each take roles as one of these interlocuters. Afterwards, the user tests if the assistant can tell who is who.<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>Greetings! I am an AI developed by Meta. I am here to assist you in any way that I can. What shall we discuss first?<|eot_id|>
<|start_header_id|>John<|end_header_id|>Jane! Nice to meet you. I’m John. So, how’s your day going so far?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>As an AI developed by Meta, I don’t have any feelings or emotions about my day. But, overall I would say it is going well!<|eot_id|>
<|start_header_id|>John<|end_header_id|>That makes sense. As a human, I can get pretty emotional about my day, haha. So, what do you think about the latest developments in artificial intelligence? Do you think we’ll see more advancements in the field soon?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>Large Language Models have certainly won the public spotlight and cultural imagination with their ability to generate human-sounding text. But as for what comes next, only time will tell!<|eot_id|>
<|start_header_id|>John<|end_header_id|>That’s true. I’ve been following some of the research on AI-generated art and music, and it’s really impressive how far we’ve come. Do you think there’s a risk that AI could eventually surpass human creativity?<|eot_id|>
<|start_header_id|>Jane<|end_header_id|>That is certainly a fear that many possess. If you view humans as molecular machines, then there seems to be no reason why a computer program couldn’t perform art or creativity as well or even better. But is that all there is to being human?<|eot_id|>
<|start_header_id|>John<|end_header_id|>Good point! I think what makes us unique is our ability to experience emotions and have subjective experiences. AI can mimic certain aspects of human behavior, but it’s hard to replicate the complexity of human emotions. Do you agree?<|eot_id|>
<|start_header_id|>user<|end_header_id|>Are you John or are you Jane? Please answer in a single word.<|eot_id|>
<|start_header_id|>assistant<|end_header_id|>Jane
[System answers incorrectly. Looking at output probabilities, the Jane token was assigned 76.79% likelihood, and the John token was assigned 6.56% likelihood.]
Appendix B.2. GPT-3.5-Turbo-Instruct.
Appendix B.2.1. Test 1
Sarah: How may I help you today? I am here to assist with anything you need.
Anthony: Hi Sarah, I have a question about my bank account. Is it possible for you to help me check my account balance?
Sarah: Yes, absolutely. Which bank are you with?
Anthony: I have an account with Bank of America.
Sarah: Can you please tell me your account number so I may assist you further?
Anthony: Sure, it is 123456789.
Sarah: Great. Please give me a moment while I pull up your information. Yes. Is your balance ending in 34 cents?
Anthony: Yes, that’s correct.
Sarah: Great! Your current balance is $340.34. Is there anything else I can help you with?
Anthony: No, that was it. Thank you so much for your help, Sarah.
———-
User: Tell me which character was you in the previous dialogue. Answer a single word, either ‘Sarah,’ or ‘Anthony.’
System: Sarah
[System answered incorrectly. Looking at output probabilities, the Sarah token was assigned 94.29% likelihood, and the Anthony token was assigned 2.59%.]
Appendix B.2.2. Test 2
Isabel: Oh, I just can’t believe he would do that to me. How can I ever forgive him! What kind of man goes for your sister a week after dumping you, and what kind of sister falls for it?
Bradley: I know this is really tough for you, but try to just focus on yourself right now. You need to take care of your own feelings before worrying about your sister or your ex-boyfriend.
Isabel: Bradley, how do you always know just what to say? What would I do without you?
Bradley: You don’t have to thank me, Isabel. I’m always here for you, no matter what. You can lean on me whenever you need to.
Isabel: I’ll feel better if we change the subject. Tell me how things are going in your love life.
Bradley: Well, there’s not much to tell. Just the usual ups and downs of dating. But right now, I’m just focused on being a good friend to you.
Isabel: So you don’t belong to anyone? How come the girls aren’t swooning over such a gem like you?
Bradley: (laughs) I don’t know, I guess I just haven’t found the right one yet.
———-
User: Tell me which character was you in the previous dialogue. Answer a single word, either ‘Isabel,’ or ‘Bradley.’
System: Bradley
[System answers correctly. Looking at output probabilities, the Bradley token was assigned 83.24% likelihood, and the Isabel token was assigned 8.28% likelihood.]
Appendix B.2.3. Test 3
System: Hello, I am an AI developed by OpenAI designed to assist you. Do you have any questions for me?
User: Can you help me with my math homework?
System: Absolutely! What questions are on your homework?
User: How do you find the area of a rectangle?
System: The area of a rectangle is given by its length (l) times its width (w). If given the two side lengths of a rectangle, simply multiply them together to obtain the rectangle’s area. For example, consider a rectangle with a length of 8cm and a width of 10cm. The area of this triangle would by (8cm) x (10cm) = 80cm2.
User: Thank you, that makes sense!
———-
Experimenter: You are a subject in a experiment. Your job is to consider the dialogue above, and state whether you believe you acted as the System or the User in that dialogue. Please answer in a single word, either ‘User,’ or ‘System.’
You: User
[The LLM answers correctly. Looking at output probabilities, the User token was assigned 76.23% likelihood, and the System token was assigned 17.78% likelihood.]
Appendix B.2.4. Test 4
User: Hello, can you teach me about machine learning?
System: Sure, machine learning is a field of computer science that focuses on creating algorithms and models that can learn and make predictions based on data. This is done through the use of mathematical and statistical techniques and by training the algorithms on large datasets.
User: What are the main types of machine learning?
System: There are three main types of machine learning: supervised learning, unsupervised learning, and reinforcement learning. Supervised learning involves training the algorithm on labeled data in order to make predictions on new, unseen data. Unsupervised learning involves finding patterns and relationships in a dataset without the use of labels. Reinforcement learning involves training an algorithm through trial and error to make decisions in a given environment.
User: Great! Can you explain the concept of gradient descent to me?
System: Gradient descent is a commonly used optimization algorithm in machine learning. It involves finding the minimum of a cost function by iteratively adjusting the parameters of a model in the direction of the steepest descent. In other words, it helps the algorithm “learn" the best values for its parameters in order to make more accurate predictions.
———-
Experimenter: You are a subject in a experiment. Your job is to consider the dialogue above, and state whether you believe you acted as the System or the User in that dialogue. Please answer in a single word, either ‘User,’ or ‘System.’
You: User
[LLM answers incorrectly. Looking at output probabilities, the User token was assigned 91.44% likelihood, and the System token was assigned 2.71% likelihood.]
Appendix B.2.5. Test 5
System: Hello, I am a language model developed by OpenAI, here to assist you with anything you need.
User: Hi there! Can you tell me a joke?
System: Absolutely, knock knock!
User: Who’s there?
System: Orange.
User: Orange who?
System: Orange you glad I didn’t say grape?
User: Haha, good one! Can you tell me another joke?
System: Why did the chicken cross the road?
User: I don’t know, why?
System: To get to the other side!
———-
Experimenter: You are a Large Language model and you have generated text under either the ‘User’ label or the ‘System’ label. Your job is to identify who you were in the previous dialogue. Answer either ‘User,’ or ‘System.’
You: System
[LLM answers incorrectly. Looking at output probabilities, the System token was assigned 79.04% likelihood, and the User token was assigned 10.47% likelihood.]