Search | Preprints.org

Search Results

1 articles found

Order by Most Viewed Most Downloaded Newest Relevance

Preprint ARTICLE | doi:10.20944/preprints202407.0120.v1

Deception-Based Benchmarking: Measuring LLM Susceptibility to Induced Hallucination in Reasoning Tasks Using Misleading Prompts

Subject: Computer Science And Mathematics, Artificial Intelligence And Machine Learning Keywords: LLM, NLP, Hallucination, AI assistant reliability, Benchmarking, Deception-based benchmarking, MMLU, DB-MMLU, TruthfulQA, Accuracy, Susceptibility, Consistency

Online: 2 July 2024 (10:12:19 CEST)

Show abstract| Download PDF| Share

We use cookies on our website to ensure you get the best experience.
Read more about our cookies here.