 |
Internalizing Self-Consistency in Language Models: Multi-Agent Consensus Alignment
Ankur Samanta,
Akshayaa Magesh,
Youliang Yu,
Runzhe Wu,
Ayush Jain,
Daniel Jiang,
Boris Vidolov,
Paul Sajda,
Yonathan Efroni,
Kaveh Hassani
Preprint
Language models learn to maintain consistent answers across diverse reasoning paths and ground arguments in peer reasoning by reinforcing their own debate consensus, driving reasoning self-improvement.
arXiv |
Code
|
Teaching Assistant (USC): Deep Learning and its Applications (CSCI566, CSCI599)
- Fall 2024: Prof. Yan Liu
- Spring 2024: Prof. Yue Zhao
- Spring 2023: Prof. Jesse Thomason
- Fall 2020: Prof. Joseph J Lim
- Spring 2019: Prof. Joseph J Lim
- Fall 2019: Prof. Joseph J Lim
|
- ICLR: 2023, 2024, 2025, 2026
- NeurIPS: 2023, 2024, 2025
- ICML: 2025
- RLC: 2025
- CoRL: 2021, 2022, 2023, 2024
- AAAI: 2026
|
|