publications
2025
- AAMASDynamic Coalition Structure Detection in Natural Language-based InteractionsAbhishek Kulkarni*, Andy Liu*, Jean-Raphael Gaglione, Daniel Fried, and Ufuk TopcuIn The 24th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2025
In strategic multi-agent sequential interactions, detecting dynamic coalition structures is crucial for understanding how self-interested agents coordinate to influence outcomes. However, natural-language-based interactions introduce unique challenges to coalition detection due to ambiguity over intents and difficulty in modeling players’ subjective perspectives. We propose a new method that leverages recent advancements in large language models and game theory to predict dynamic multilateral coalition formation in Diplomacy, a strategic multi-agent game where agents negotiate coalitions using natural language. The method consists of two stages. The first stage extracts the set of agreements discussed by two agents in their private dialogue, by combining a parsing-based filtering function with a fine-tuned language model trained to predict player intents. In the second stage, we define a new metric using the concept of subjective rationalizability from hypergame theory to evaluate the expected value of an agreement for each player. We then compute this metric for each agreement identified in the first stage by assessing the strategic value of the agreement for both players and taking into account the subjective belief of one player that the second player would honor the agreement. We demonstrate that our method effectively detects potential coalition structures in online Diplomacy gameplay by assigning high values to agreements likely to be honored and low values to those likely to be violated. The proposed method provides foundational insights into coalition formation in multi-agent environments with language-based negotiation and offers key directions for future research on the analysis of complex natural language-based interactions between agents.
@inproceedings{kulkarni2025dynamic, title = {Dynamic Coalition Structure Detection in Natural Language-based Interactions}, author = {Kulkarni, Abhishek and Liu, Andy and Gaglione, Jean-Raphael and Fried, Daniel and Topcu, Ufuk}, booktitle = {The 24th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)}, year = {2025}, url = {https://arxiv.org/abs/2502.16339}, }
2024
- ACLEvaluating Large Language Model Biases in Persona-Steered GenerationAndy Liu, Mona Diab, and Daniel FriedIn Findings of the Association for Computational Linguistics ACL 2024, 2024
The task of persona-steered text generation requires large language models (LLMs) to generate text that reflects the distribution of views that an individual fitting a persona could have. People have multifaceted personas, but prior work on bias in LLM-generated opinions has only explored multiple-choice settings or one-dimensional personas. We define an incongruous persona as a persona with multiple traits where one trait makes its other traits less likely in human survey data, e.g. political liberals who support increased military spending. We find that LLMs are 9.7% less steerable towards incongruous personas than congruous ones, sometimes generating the stereotypical stance associated with its demographic rather than the target stance. Models that we evaluate that are fine-tuned with Reinforcement Learning from Human Feedback (RLHF) are more steerable, especially towards stances associated with political liberals and women, but present significantly less diverse views of personas. We also find variance in LLM steerability that cannot be predicted from multiple-choice opinion evaluation. Our results show the importance of evaluating models in open-ended text generation, as it can surface new LLM opinion biases. Moreover, such a setup can shed light on our ability to steer models toward a richer and more diverse range of viewpoints.
@inproceedings{liu2024evaluating, title = {Evaluating Large Language Model Biases in Persona-Steered Generation}, author = {Liu, Andy and Diab, Mona and Fried, Daniel}, booktitle = {Findings of the Association for Computational Linguistics ACL 2024}, pages = {9832--9850}, year = {2024}, url = {https://arxiv.org/abs/2405.20253}, }
- arXivBIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded DataWenkai Li, Jiarui Liu, Andy Liu, Xuhui Zhou, Mona Diab, and Maarten SaparXiv preprint arXiv:2410.16491, 2024
In this work, we tackle the challenge of embedding realistic human personality traits into LLMs. Previous approaches have primarily focused on prompt-based methods that describe the behavior associated with the desired personality traits, suffering from realism and validity issues. To address these limitations, we introduce BIG5-CHAT, a large-scale dataset containing 100,000 dialogues designed to ground models in how humans express their personality in text. Leveraging this dataset, we explore Supervised Fine-Tuning and Direct Preference Optimization as training-based methods to align LLMs more naturally with human personality patterns. Our methods outperform prompting on personality assessments such as BFI and IPIP-NEO, with trait correlations more closely matching human data. Furthermore, our experiments reveal that models trained to exhibit higher conscientiousness, higher agreeableness, lower extraversion, and lower neuroticism display better performance on reasoning tasks, aligning with psychological findings on how these traits impact human cognitive performance. To our knowledge, this work is the first comprehensive study to demonstrate how training-based methods can shape LLM personalities through learning from real human behaviors.
@article{li2024big5, title = {BIG5-CHAT: Shaping LLM Personalities Through Training on Human-Grounded Data}, author = {Li, Wenkai and Liu, Jiarui and Liu, Andy and Zhou, Xuhui and Diab, Mona and Sap, Maarten}, journal = {arXiv preprint arXiv:2410.16491}, year = {2024}, url = {https://arxiv.org/abs/2410.16491}, }
2023
- ICLRComputational Language Acquisition with Theory of MindAndy Liu, Hao Zhu, Emmy Liu, Yonatan Bisk, and Graham NeubigIn The Eleventh International Conference on Learning Representations, 2023
Unlike current state-of-the-art language models, young children actively acquire language through interactions with their surrounding environment and caretakers. One mechanism that has been argued to be critical to language learning is the ability to infer the mental states of other agents in social environments, coined Theory of Mind (ToM) by Premack & Woodruff (1978). Drawing inspiration from the modern operationalized versions of ToM implemented in Rabinowitz et al. (2018) and Zhu et al. (2021), we build language-learning agents equipped with ToM, and measure its effects on the learning process. We model ToM by giving the speaker agent an internal listener model that is trained alongside the speaker and used to rerank potential utterances. We experiment with varying task difficulty, hypothesizing that models will acquire more complex language to adapt to stronger environmental pressures. We find that training speakers with a highly weighted ToM listener component leads to performance gains in our image referential game setting. We also find some evidence that increasing task difficulty in the training process results in more fluent and precise utterances in evaluation. This suggests the potential utility of further incorporating ToM, as well as other insights from child language acquisition, into computational models of language acquisition.
@inproceedings{liucomputational, title = {Computational Language Acquisition with Theory of Mind}, author = {Liu, Andy and Zhu, Hao and Liu, Emmy and Bisk, Yonatan and Neubig, Graham}, booktitle = {The Eleventh International Conference on Learning Representations}, year = {2023}, url = {https://arxiv.org/abs/2303.01502}, }