arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2603.15405 2026-03-17 cs.CL

Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models

Zehao Chen, Rong Pan

详情

英文摘要

Large Language Models (LLMs) have demonstrated impressive capabilities in simulating diverse human behaviors and personalities. However, existing methods for personality control, which include prompt engineering and standard Supervised Fine-Tuning (SFT), typically treat personality traits as discrete categories (e.g., "Extroverted" vs. "Introverted"), lacking the ability to precisely control the intensity of a trait on a continuous spectrum. In this paper, we introduce Fusian, a novel framework for fine-grained, continuous personality control in LLMs. Fusian operates in two stages: (1) Trajectory Collection, where we capture the dynamic evolution of personality adoption during SFT by saving a sequence of LoRA adapters, effectively mapping the continuous manifold of a trait; and (2) RL-based Dynamic Fusion, where we train a policy network using Reinforcement Learning to dynamically compute mixing weights for these frozen adapters. By sampling from a Dirichlet distribution parameterized by the policy network, Fusian fuses multiple adapters to align the model's output with a specific numerical target intensity. Experiments on the Qwen3-14B model demonstrate that Fusian achieves high precision in personality control, significantly outperforming baseline methods in aligning with user-specified trait intensities.

URL PDF HTML ☆

赞 0 踩 0

2603.15404 2026-03-17 cs.CV cs.AI

Detection of Autonomous Shuttles in Urban Traffic Images Using Adaptive Residual Context

Mohamed Aziz Younes, Nicolas Saunier, Guillaume-Alexandre Bilodeau

Comments 10 pages, 6 figures

2603.15403 2026-03-17 cs.CV

Pointing-Based Object Recognition

Lukáš Hajdúch, Viktor Kocur

Comments Submitted to InnovAIte conference

2603.15402 2026-03-17 cs.CL cs.AI

A Closer Look into LLMs for Table Understanding

Jia Wang, Chuanyu Qin, Mingyu Zheng, Qingyi Si, Peize Li, Zheng Lin

2603.15396 2026-03-17 cs.CV cs.AI

AI Evasion and Impersonation Attacks on Facial Re-Identification with Activation Map Explanations

Noe Claudel, Weisi Guo, Yang Xing

2603.15389 2026-03-17 cs.CL

When Does Sparsity Mitigate the Curse of Depth in LLMs

Dilxat Muhtar, Xinyuan Song, Sebastian Pokutta, Max Zimmer, Nico Pelleriti, Thomas Hofmann, Shiwei Liu

Comments 32 pages, 29 figures

2603.15388 2026-03-17 cs.LG cs.AI cs.RO stat.ML

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization

Yanning Dai, Yuhui Wang, Dylan R. Ashley, Jürgen Schmidhuber

Comments presented at the Fourteenth International Conference on Learning Representations; 11 pages in main text + 3 pages of references + 23 pages of appendices, 5 figures in main text + 11 figures in appendices, 16 tables in appendices; accompanying website available at https://yanningdai.github.io/stackelberg-ppo-co-design/ ; source code available at https://github.com/YanningDai/StackelbergPPO

2603.15386 2026-03-17 cs.CV cs.AI

RieMind: Geometry-Grounded Spatial Agent for Scene Understanding

Fernando Ropero, Erkin Turkoz, Daniel Matos, Junqing Du, Antonio Ruiz, Yanfeng Zhang, Lu Liu, Mingwei Sun, Yongliang Wang

2603.15381 2026-03-17 cs.AI

Why AI systems don't learn and what to do about it: Lessons on autonomous learning from cognitive science

Emmanuel Dupoux, Yann LeCun, Jitendra Malik

2603.15374 2026-03-17 cs.CV

Spectral Rectification for Parameter-Efficient Adaptation of Foundation Models in Colonoscopy Depth Estimation

Xiaoxian Zhang, Minghai Shi, Lei Li

Comments 15 pages

2603.15373 2026-03-17 cs.LG cs.AI

GradCFA: A Hybrid Gradient-Based Counterfactual and Feature Attribution Explanation Algorithm for Local Interpretation of Neural Networks

Jacob Sanderson, Hua Mao, Wai Lok Woo

2603.15371 2026-03-17 cs.AI cs.NI

Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning

Guangfu Hao, Yuming Dai, Xianzhe Qin, Shan Yu

2603.15370 2026-03-17 cs.CV

Trajectory-Diversity-Driven Robust Vision-and-Language Navigation

Jiangyang Li, Cong Wan, SongLin Dong, Chenhao Ding, Qiang Wang, Zhiheng Ma, Yihong Gong

Comments 17pages, 5 figures

2603.15368 2026-03-17 cs.CV

IRIS: Intersection-aware Ray-based Implicit Editable Scenes

Grzegorz Wilczyński, Mikołaj Zieliński, Krzysztof Byrski, Joanna Waczyńska, Dominik Belter, Przemysław Spurek

2603.15365 2026-03-17 cs.CV

A PPO-Based Bitrate Allocation Conditional Diffusion Model for Remote Sensing Image Compression

Yuming Han, Jooho Kim, Anish Shakya

2603.15364 2026-03-17 cs.AI cs.CL

CRASH: Cognitive Reasoning Agent for Safety Hazards in Autonomous Driving

Erick Silva, Rehana Yasmin, Ali Shoker

2603.15358 2026-03-17 cs.LG cs.AI physics.ao-ph

FuXiWeather2: Learning accurate atmospheric state estimation for operational global weather forecasting

Xiaoze Xu, Xiuyu Sun, Songling Zhu, Xiaohui Zhong, Yuanqing Huang, Zijian Zhu, Jun Liu, Hao Li

2603.15354 2026-03-17 cs.LG cs.AI

Conditional Rectified Flow-based End-to-End Rapid Seismic Inversion Method

Haofei Xu, Wei Cheng, Sizhe Li, Jie Xiong

2603.15351 2026-03-17 cs.AI cs.MA

PMAx: An Agentic Framework for AI-Driven Process Mining

Anton Antonov, Humam Kourani, Alessandro Berti, Gyunam Park, Wil M. P. van der Aalst

Comments Submitted to EMMSAD 2026 (tool demonstration track), under review

2603.15348 2026-03-17 cs.CV

Oscillating Dispersion for Maximal Light-throughput Spectral Imaging

Jiuyun Zhang, Zhan Shi, Linsen Chen, Xun Cao

2603.15341 2026-03-17 cs.AI cs.HC cs.MA

Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents

Ren Jian Lim, Rushi Dai

Comments 25 pages, 20 figures; accepted for publication in the Proceedings of ACADIA 2025

详情

英文摘要

In architectural interior design, miscommunication frequently arises as clients lack design knowledge, while designers struggle to explain complex spatial relationships, leading to delayed timelines and financial losses. Recent advancements in generative layout tools narrow the gap by automating 3D visualizations. However, prevailing methodologies exhibit limitations: rule-based systems implement hard-coded spatial constraints that restrict participatory engagement, while data-driven models rely on extensive training datasets. Recent large language models (LLMs) bridge this gap by enabling intuitive reasoning about spatial relationships through natural language. This research presents an LLM-based, multimodal, multi-agent framework that dynamically converts natural language descriptions and imagery into 3D designs. Specialized agents (Reference, Spatial, Interactive, Grader), operating via prompt guidelines, collaboratively address core challenges: the agent system enables real-time user interaction for iterative spatial refinement, while Retrieval-Augmented Generation (RAG) reduces data dependency without requiring task-specific model training. This framework accurately interprets spatial intent and generates optimized 3D indoor design, improving productivity, and encouraging nondesigner participation. Evaluations across diverse floor plans and user questionnaires demonstrate effectiveness. An independent LLM evaluator consistently rated participatory layouts higher in user intent alignment, aesthetic coherence, functionality, and circulation. Questionnaire results indicated 77% satisfaction and a clear preference over traditional design software. These findings suggest the framework enhances user-centric communication and fosters more inclusive, effective, and resilient design processes. Project page: https://rsigktyper.github.io/AICodesign/

URL PDF HTML ☆

赞 0 踩 0

2603.15340 2026-03-17 cs.CL stat.ML

DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models

Xueyu Zhou, Yangrong Hu, Jian Huang

Comments 16 pages, 5 figures

2603.15335 2026-03-17 cs.LG

Data Augmentation via Causal-Residual Bootstrapping

Mateusz Gajewski, Sophia Xiao, Bijan Mazaheri

2603.15330 2026-03-17 cs.CV

MeMix: Writing Less, Remembering More for Streaming 3D Reconstruction

Jiacheng Dong, Huan Li, Sicheng Zhou, Wenhao Hu, Weili Xu, Yan Wang

2603.15329 2026-03-17 cs.RO

User-Tailored Learning to Forecast Walking Modes for Exosuits

Gabriele Abbate, Enrica Tricomi, Nathalie Gierden, Alessandro Giusti, Lorenzo Masia, Antonio Paolillo

2603.15326 2026-03-17 cs.CL cs.AI

Tagarela - A Portuguese speech dataset from podcasts

Frederico Santos de Oliveira, Lucas Rafael Stefanel Gris, Alef Iury Siqueira Ferreira, Augusto Seben da Rosa, Alexandre Costa Ferro Filho, Edresson Casanova, Christopher Dane Shulby, Rafael Teixeira Sousa, Diogo Fernandes Costa Silva, Anderson da Silva Soares, Arlindo Rodrigues Galvão Filho

2603.15321 2026-03-17 cs.LG

CASHomon Sets: Efficient Rashomon Sets Across Multiple Model Classes and their Hyperparameters

Fiona Katharina Ewald, Martin Binder, Matthias Feurer, Bernd Bischl, Giuseppe Casalicchio

Comments Equal contributions by Fiona Katharina Ewald and Martin Binder

2603.15317 2026-03-17 cs.CL

PYTHEN: A Flexible Framework for Legal Reasoning in Python

Ha-Thanh Nguyen, Ken Satoh

Comments Accepted at JURISIN 2026

2603.15309 2026-03-17 cs.CL cs.AI

CCTU: A Benchmark for Tool Use under Complex Constraints

Junjie Ye, Guoqiang Zhang, Wenjie Fu, Tao Gui, Qi Zhang, Xuanjing Huang

2603.15307 2026-03-17 cs.LG physics.chem-ph

A Kolmogorov-Arnold Surrogate Model for Chemical Equilibria: Application to Solid Solutions

Leonardo Boledi, Dirk Bosbach, Jenna Poonoosamy

详情

英文摘要

The computational cost of geochemical solvers is a challenging matter. For reactive transport simulations, where chemical calculations are performed up to billions of times, it is crucial to reduce the total computational time. Existing publications have explored various machine-learning approaches to determine the most effective data-driven surrogate model. In particular, multilayer perceptrons are widely employed due to their ability to recognize nonlinear relationships. In this work, we focus on the recent Kolmogorov-Arnold networks, where learnable spline-based functions replace classical fixed activation functions. This architecture has achieved higher accuracy with fewer trainable parameters and has become increasingly popular for solving partial differential equations. First, we train a surrogate model based on an existing cement system benchmark. Then, we move to an application case for the geological disposal of nuclear waste, i.e., the determination of radionuclide-bearing solids solubilities. To the best of our knowledge, this work is the first to investigate co-precipitation with radionuclide incorporation using data-driven surrogate models, considering increasing levels of thermodynamic complexity from simple mechanical mixtures to non-ideal solid solutions of binary (Ba,Ra)SO$_4$ and ternary (Sr,Ba,Ra)SO$_4$ systems. On the cement benchmark, we demonstrate that the Kolmogorov-Arnold architecture outperforms multilayer perceptrons in both absolute and relative error metrics, reducing them by 62% and 59%, respectively. On the binary and ternary radium solid solution models, Kolmogorov-Arnold networks maintain median prediction errors near $1\times10^{-3}$. This is the first step toward employing surrogate models to speed up reactive transport simulations and optimize the safety assessment of deep geological waste repositories.

URL PDF HTML ☆

赞 0 踩 0