Choice Vectors: Streamlining Personal AI Alignment Through Binary Selection

Watson, Eleanor; Nguyen, Minh; Pan, Sarah; Zhang, Shujun

Author(s)

Watson, Eleanor; Nguyen, Minh; Pan, Sarah; Zhang, Shujun

Downloadmti-09-00022.pdf (2.018Mb)

Publisher with Creative Commons License

Terms of use

Creative Commons Attribution https://creativecommons.org/licenses/by/4.0/

Metadata

Show full item record

Abstract

Value alignment for AI is not “one-size-fits-all”: even polite and friendly models can still fail to represent individual user contexts and preferences, and local cultural norms. This paper presents a modular workflow for personal fine-tuning, synthesizing four core components from our previous research: (1) robust vectorization of user values and preferences, (2) a binary choice user interface (UI) approach to capturing those preferences with minimal cognitive load, (3) contrastive activation methods for steering large language models (LLMs) via difference vectors, and (4) knowledge graph integration for more auditable and structured alignment. Our approach—descended from past research on “Towards an End-to-End Personal Fine-Tuning Framework”—demonstrates how these elements can be combined to create personalized, context-rich alignment solutions. We report on user studies for the forced-choice UI, describe an experimental pipeline for deriving “control vectors”, and propose a “moral graph” method for bridging symbolic and vector-based alignment. Our findings suggest that multi-pronged personalization can significantly reduce user annotation fatigue, improve alignment fidelity, and allow for more flexible, interpretable AI behaviors.

Date issued

2025-03-03

URI

https://hdl.handle.net/1721.1/158997

Department

Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science

Journal

Multimodal Technologies and Interactions

Publisher

Multidisciplinary Digital Publishing Institute

Citation

Watson, E.; Nguyen, M.; Pan, S.; Zhang, S. Choice Vectors: Streamlining Personal AI Alignment Through Binary Selection. Multimodal Technol. Interact. 2025, 9, 22.

Version: Final published version

Collections

MIT Open Access Articles