| Posters |
| Contextual Integrity in LLMs via Reasoning and Reinforcement Learning |
Guangchen (Eric) Lan · Huseyin A. Inan · Sahar Abdelnabi · Janardhan Kulkarni · Lukas Wutschitz · Reza Shokri · Christopher Brinton · Robert Sim |
| Direct Alignment with Heterogeneous Preferences |
Ali Shirali · Arash Nasr-Esfahany · Abdullah Alomar · Parsa Mirtaheri · Rediet Abebe · Ariel Procaccia |
| Quantifying Uncertainty in Error Consistency: Towards Reliable Behavioral Comparison of Classifiers |
Thomas Klein · Sascha Meyen · Wieland Brendel · Felix A. Wichmann · Kristof Meding |
| Reparameterized LLM Training via Orthogonal Equivalence Transformation |
Zeju Qiu · Simon Buchholz · Tim Xiao · Maximilian Dax · Bernhard Schölkopf · Weiyang Liu |
| EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Network |
Michael Arbel · David Salinas · Frank Hutter |
| Gompertz Linear Units: Leveraging Asymmetry for Enhanced Learning Dynamics |
Indrashis Das · Mahmoud Safari · Steven Adriaensen · Frank Hutter |
| Learning in Compact Spaces with Approximately Normalized Transformer |
Jörg Franke · Urs Spiegelhalter · Marianna Nezhurina · Jenia Jitsev · Frank Hutter · Michael Hefenbrock |
| DeltaProduct: Improving State-Tracking in Linear RNNs via Householder Products |
Julien Siems · Timur Carstensen · Arber Zela · Frank Hutter · Massimiliano Pontil · Riccardo Grazzi |
| GPAS: Accelerating Convergence of LLM Pretraining via Gradient-Preserving Activation Scaling |
Tianhao Chen · Xin Xu · Zijing Liu · Pengxiang Li · Xinyuan Song · Ajay Jaiswal · Fan Zhang · Jishan Hu · Yang Wang · Hao Chen · Shizhe Diao · Shiwei Liu · Yu Li · Lu Yin · Can Yang |
| The Curse of Depth in Large Language Models |
Wenfang Sun · Xinyuan Song · Pengxiang Li · Lu Yin · Yefeng Zheng · Shiwei Liu |
| AlphaDecay: Module-wise Weight Decay for Heavy-Tailed Balancing in LLMs |
Di He · Ajay Jaiswal · Songjun Tu · Li Shen · Ganzhao Yuan · Shiwei Liu · Lu Yin |
| Collective Reasoning in Performative Prediction |
Haiqing Zhu · Tijana Zrnic · Celestine Mendler-Dünner |
| Performative Validity of Recourse Explanations |
Gunnar König · Hidde Fokkema · Timo Freiesleben · Celestine Mendler-Dünner · Ulrike Luxburg |
| Enhancing Optimizer Stability: Momentum Adaptation of The NGN Step-size |
Rustem Islamov · Niccolò Ajroldi · Antonio Orvieto · Aurelien Lucchi |
| Counterfactual reasoning: an analysis of in-context emergence |
Moritz Miller · Bernhard Schölkopf · Siyuan Guo |
| Language Models Are Inefficient Reasoners: An Analysis on Arithmetic Proof Search |
Andreas Opedal · Yanick Zengaffinen · Haruki Shirakami · Clemente Pasti · Mrinmaya Sachan · Abulhair Saparov · Ryan Cotterell · Bernhard Schölkopf |
| SPARTAN: A Sparse Transformer World Model Attending to What Matters |
Anson Lei · Bernhard Schölkopf · Ingmar Posner |
| Cultural Alien Sampler: Open-ended Art Generation Balancing Originality and Coherence |
Alejandro Hernandez · Hiromu Yakura · Levin Brinkmann · Mar Canet Solal · Hassan Abu Alhaija · Ignacio Serna · Nasim Rahaman · Bernhard Schölkopf · Iyad Rahwan |