An Information-Geometric Framework for Stability Analysis of Large Language Models under Entropic Stress

International Journal of Theoretical Physics

The thermodynamics of computation—a review

Charles H. Bennett

Published: 1982

Regulation (eu) 2024/1689 of the european parliament and of the council of 13 june 2024 laying down harmonised rules on artificial intelligence (artificial intelligence act)

European Union

Published: 2024

Harvard Data Science Review

A unified framework of five principles for AI in society

Luciano Floridi, Josh Cowls

Published: 2019

Nature Reviews Neuroscience

The free-energy principle: A unified brain theory?

Karl Friston

Published: 2010

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

A survey of confidence estimation and calibration in large language models

Jiahao Geng, Fengyu Cai, Yuxia Wang, Heinz Koeppl, Preslav Nakov, Iryna Gurevych

Published: 2024

Findings of the Association for Computational Linguistics: EMNLP 2024

FactAlign: Long-form factuality alignment of large language models

Ching-Wei Huang, Yun-Nung Chen

Published: 2024

ACM Computing Surveys

Survey of hallucination in natural language generation

Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Ye Jin Bang, Andrea Madotto, Pascale Fung

Published: 2023

Nature Machine Intelligence

The global landscape of AI ethics guidelines

Anna Jobin, Marcello Ienca, Effy Vayena

Published: 2019

Transactions of the Association for Computational Linguistics

SummaC: Revisiting NLI-based models for inconsistency detection in summarization

Philippe Laban, Tobias Schnabel, Paul N. Bennett, Marti A. Hearst

Published: 2022

IBM Journal of Research and Development

Irreversibility and heat generation in the computing process

Rolf Landauer

Published: 1961

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing

Halueval: A large-scale hallucination evaluation benchmark for large language models

J. Li, X. Cheng, W. X. Zhao, J.-Y. Nie, J.-R. Wen

Published: 2023

Truthfulqa: Measuring how models mimic human falsehoods

S. Lin, J. Hilton, O. Evans

Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing

MetaFaith: Faithful natural language uncertainty expression in LLMs

Genglin Kevin-Ming Liu, Gal Yona, Avi Caciularu, Idan Szpektor, Tim G. J. Rudner, Arman Cohan

Published: 2025

U.S. Department of Commerce

Artificial intelligence risk management framework: Generative artificial intelligence profile (NIST AI 600-1)

National Institute of Standards and Technology

Published: 2024

Organisation for Economic Co-operation and Development

OECD AI principles

OECD

Published: 2024

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

KILT: A benchmark for knowledge intensive language tasks

Fabio Petroni, Aleksandra Piktus, Angela Fan, Patrick Lewis, Majid Yazdani, Nicola De Cao, James Thorne, Yacine Jernite, Vladimir Karpukhin, Jean Maillard

Published: 2021

The Bell system technical journal

A mathematical theory of communication

Claude Elwood Shannon

Published: 1948

National Institute of Standards and Technology

Artificial intelligence risk management framework (AI RMF 1.0) (NIST AI 100-1)

Elham Tabassi

Published: 2023

Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

FEVER: A large-scale dataset for fact extraction and verification

James Thorne, Andreas Vlachos, Christos Christodoulopoulos, Arpit Mittal

Published: 2018

BMC Neuroscience

An information integration theory of consciousness

Giulio Tononi

Published: 2004

UNESCO

Recommendation on the ethics of artificial intelligence

UNESCO

Published: 2022

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing

Factuality of large language models: A survey

Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi N. Georgiev, Rocktim Jyoti Das, Preslav Nakov

Published: 2024

Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing

HotpotQA: A dataset for diverse, explainable multi-hop question answering

Zhilin Yang, Peng Qi, Saizheng Zhang, Yoshua Bengio, William Cohen, Ruslan Salakhutdinov, Christopher D. Manning

Published: 2018