arXivDaily每日学术速递，同步arXiv全量数据，AI总结、翻译，覆盖人工智能、机器人、计算机、金融、统计学、数学、物理学、生物学、经济学、电气&系统等方向。

2604.22753 2026-04-27 cs.LG

Spend Less, Fit Better: Budget-Efficient Scaling Law Fitting via Active Experiment Selection

Sijie Li, Shanda Li, Haowei Lin, Weiwei Sun, Ameet Talwalkar, Yiming Yang

详情

英文摘要

Scaling laws are used to plan multi-million-dollar training runs, but fitting those laws can itself cost millions. In modern large-scale workflows, assembling a sufficiently informative set of pilot experiments is already a major budget-allocation problem rather than a routine preprocessing step. We formulate scaling-law fitting as budget-aware sequential experimental design: given a finite pool of runnable experiments with heterogeneous costs, choose which runs to execute so as to maximize extrapolation accuracy in a high-cost target region. We then propose an uncertainty-aware method for sequentially allocating experimental budget toward the runs most useful for target-region extrapolation. Across a diverse benchmark of scaling-law tasks, our method consistently outperforms classical design-based baselines, and often approaches the performance of fitting on the full experimental set while using only about 10% of the total training budget. Our code is available at https://github.com/PlanarG/active-sl.

URL PDF HTML ☆

赞 0 踩 0

2604.22749 2026-04-27 cs.CL

Representational Harms in LLM-Generated Narratives Against Global Majority Nationalities

Ilana Nguyen, Harini Suresh, Thema Monroe-White, Evan Shieh

Comments FAccT '26, June 25-28, 2026, Montreal, QC, Canada

详情

DOI: 10.1145/3805689.3806452

英文摘要

Large language models (LLMs) are increasingly used for text generation tasks from everyday use to high-stakes enterprise and government applications, including simulated interviews with asylum seekers. While many works highlight the new potential applications of LLMs, there are risks of LLMs encoding and perpetuating harmful biases about non-dominant communities across the globe. To better evaluate and mitigate such harms, more research examining how LLMs portray diverse individuals is needed. In this work, we study how national origin identities are portrayed by widely-adopted LLMs in response to open-ended narrative generation prompts. Our findings demonstrate the presence of persistent representational harms by national origin, including harmful stereotypes, erasure, and one-dimensional portrayals of Global Majority identities. Minoritized national identities are simultaneously underrepresented in power-neutral stories and overrepresented in subordinated character portrayals, which are over fifty times more likely to appear than dominant portrayals. The degree of harm is amplified when US nationality cues (e.g., ``American'') are present in input prompts. Notably, we find that the harms we identify cannot be explained away via sycophancy, as US-centric biases persist even when replacing US nationality cues with non-US national identities in the prompts. Based on our findings, we call for further exploration of cultural harms in LLMs through methodologies that center Global Majority perspectives and challenge the uncritical adoption of US-based LLMs for the classification, surveillance, and misrepresentation of the majority of our planet.

URL PDF HTML ☆

赞 0 踩 0

2604.22747 2026-04-27 cs.SE

Code for All: Educational Applications of the "Vibe Coding" Hackathon in Programming Education across All Skill Levels

Ashley J. Chen, Yijia Cao, Minghao Shao, Ramesh Karri, Muhammad Shafique

Comments 15 pages, 14 figures

2604.22746 2026-04-27 math.OC cs.LG

Relaxation-Informed Training of Neural Network Surrogate Models

Calvin Tsay

Comments 35 pages, 5 figures

2604.22744 2026-04-27 cs.SI cs.IT math.IT q-bio.QM

Multiplex Hypergraph Modeling of Higher Order Structures in Psychometric Networks

Francesca Possenti, Laura Girelli, Paolo Tieri, Manuela Petti

Comments 17 pages, 6 figures, 2 tables

2604.22742 2026-04-27 cs.CC

Boolean PCSPs through the lens of Fourier Analysis

Demian Banakh, Katzper Michno

2604.22740 2026-04-27 eess.SP cs.IT math.IT

Minimax Optimal Procedures for Joint Detection and Estimation

Dominik Reinhard, Michael Fauß, Abdelhak M. Zoubir

Comments 13 pages, 3 figures, 2 tables

2604.22739 2026-04-27 cs.CV

Inter-Stance: A Dyadic Multimodal Corpus for Conversational Stance Analysis

Xiang Zhang, Xiaotian Li, Taoyue Wang, Nan Bi, Xin Zhou, Cody Zhou, Zoie Wang, Andrew Yang, Yuming Su, Jeff Cohn, Qiang Ji, Lijun Yin

2604.22737 2026-04-27 eess.SY cs.SY math.CO math.OC

A Vehicle Routing Problem for Human-Centered Electric Mobility

Mostafa Emam, Björn Martens, Thomas Rottmann, Matthias Gerdts

Comments 7 pages, 5 figures, standard IEEE double-column format

2604.22736 2026-04-27 cs.LO cs.AI

An Undecidability Proof for the Plan Existence Problem

Antonis Achilleos

2604.22734 2026-04-27 gr-qc cs.NA math.NA

Radiation outer boundary conditions and near-to-far field signal transformations for the Bardeen-Press equation

Som Dev Bishoyi, Scott E. Field, Stephen R. Lau

Comments 26 pages, 8 figures, 4 tables

2604.22732 2026-04-27 math.NA cs.NA physics.comp-ph

Craig-Bampton-based Quadratic Manifold for Nonlinear Substructuring

Alexander Saccani, Paolo Tiso

2604.22730 2026-04-27 cs.LG cs.CL

Neural Recovery of Historical Lexical Structure in Bantu Languages from Modern Data

Hillary Mutisya, John Mugane

2604.22724 2026-04-27 cs.RO cs.SY eess.SY

GCImOpt: Learning efficient goal-conditioned policies by imitating optimal trajectories

Jon Goikoetxea, Jesús F. Palacián

Comments Accepted for publication at the 8th Annual Conference on Learning for Dynamics and Control (L4DC 2026). 16 pages (including appendix), 1 figure. For project website, see https://jongoiko.github.io/gcimopt/

2604.22723 2026-04-27 cs.LG cs.CL

Zero-Shot Morphological Discovery in Low-Resource Bantu Languages via Cross-Lingual Transfer and Unsupervised Clustering

Hillary Mutisya, John Mugane

2604.22721 2026-04-27 physics.ao-ph cs.NA math.NA physics.data-an

Spectral-Domain Local Statistics with Missing-Data Support for Cartesian and Polar Grids

Jairo M. Valdivia-Prado, William E. Chapman, Katja Friedrich

Comments Accompanies the open-source dct_toolkit package

2604.22715 2026-04-27 cs.RO

ATRS: Adaptive Trajectory Re-splitting via a Shared Neural Policy for Parallel Optimization

Jiajun Yu, Guodong Liu, Li Wang, Pengxiang Zhou, Wentao Liu, Yin He, Chao Xu, Fei Gao, Yanjun Cao

Comments 8 pages, submitted to IEEE Robotics and Automation Letters

详情

英文摘要

Parallel trajectory optimization via the Alternating Direction Method of Multipliers (ADMM) has emerged as a scalable approach to long-horizon motion planning. However, existing frameworks typically decompose the problem into parallel subproblems based on a predefined fixed structure. Such structural rigidity often causes optimization stagnation in highly constrained regions, where a few lagging subproblems delay global convergence. A natural remedy is to adaptively re-split these stagnating segments online. Yet, deciding when, where, and how to split exceeds the capability of rule-based heuristics. To this end, we propose ATRS, a novel framework that embeds a shared Deep Reinforcement Learning policy into the parallel ADMM loop. We formulate this adaptive adjustment as a Multi-Agent Shared-Policy Markov Decision Process, where all trajectory segments act as homogeneous agents and share a unified neural policy network. This parameter-sharing architecture endows the system with size invariance, enabling it to handle dynamically changing segment counts during re-splitting and generalize to arbitrary trajectory lengths. Furthermore, our formulation inherently supports zero-shot generalization to unseen environments, as our network relies solely on the internal states of the numerical solver rather than on the geometric features of the environment. To ensure solver stability, a Confidence-Based Election mechanism selects only the most stagnating segment for re-splitting at each step. Extensive simulations demonstrate that ATRS accelerates convergence, reducing the number of iterations by up to 26.0% and the computation time by up to 19.1%. Real-world experiments further confirm its applicability to both large-scale offline global planning and real-time onboard replanning within 35 ms per cycle, with no sim-to-real degradation.

URL PDF HTML ☆

赞 0 踩 0

2604.22714 2026-04-27 cs.CV

Long-tail Internet photo reconstruction

Yuan Li, Yuanbo Xiangli, Hadar Averbuch-Elor, Noah Snavely, Ruojin Cai

Comments Project page: https://megadepth-x.github.io/

2604.22710 2026-04-27 cs.NI

Evaluation of the effects of 3GPP-specific beamforming and channel estimation on the 3D EIRP profile of a 5G gNB

Armed Tusha, Joshua Roy Palathinkal, Monisha Ghosh

2604.22708 2026-04-27 cs.MA

Seeing the Whole Elephant: A Benchmark for Failure Attribution in LLM-based Multi-Agent Systems

Mengzhuo Chen, Junjie Wang, Fangwen Mu, Yawen Wang, Zhe Liu, Huanxiang Feng, Qing Wang

Comments Accepted by ACL 2026

2604.22700 2026-04-27 cs.CV

Generative Modeling of Neurodegenerative Brain Anatomy with 4D Longitudinal Diffusion Model

Nivetha Jayakumar, Swakshar Deb, Bahram Jafrasteh, Qingyu Zhao, Miaomiao Zhang

2604.22697 2026-04-27 cs.CY cs.HC

RFID-Based Non-Biometric Classroom Attendance System: Proxy Attendance Detection via Weight Sensor Integration

Furkan Ege, Muhsin Özdemir

Comments Full English version followed by the original Turkish version of the paper. Main text in English; Turkish translation appended after the English text

2604.22695 2026-04-27 eess.SP cs.LG

Time-Localized Parametric Decomposition of Respiratory Airflow for Sub-Breath Analysis

Victoria Ribeiro Rodrigues, Paul W. Davenport, Nicholas J. Napoli

Comments Submitted to IEEE Journal of Biomedical and Health Informatics (under review). 18 pages, 7 figures, 5 tables

2604.22693 2026-04-27 cs.CL cs.AI

CRAFT: Clustered Regression for Adaptive Filtering of Training data

Parthasarathi Panda, Asheswari Swain, Subhrakanta Panda

2604.22685 2026-04-27 astro-ph.IM astro-ph.EP cs.NI cs.PF

CosmicDancePro -- Measuring LEO satellite's orbital decay and network connectivity implications during solar storms

Suvam Basak, Amitangshu Pal, Debopam Bhattacherjee

2604.22679 2026-04-27 cs.CY cs.AI

How Supply Chain Dependencies Complicate Bias Measurement and Accountability Attribution in AI Hiring Applications

Gauri Sharma, Maryam Molamohammadi

详情

英文摘要

The increasing adoption of AI systems in hiring has raised concerns about algorithmic bias and accountability, prompting regulatory responses including the EU AI Act, NYC Local Law 144, and Colorado's AI Act. While existing research examines bias through technical or regulatory lenses, both perspectives overlook a fundamental challenge: modern AI hiring systems operate within complex supply chains where responsibility fragments across data vendors, model developers, platform providers, and deploying organizations. This paper investigates how these dependency chains complicate bias evaluation and accountability attribution. Drawing on literature review and regulatory analysis, we demonstrate that fragmented responsibilities create two critical problems. First, bias emerges from component interactions rather than isolated elements, yet proprietary configurations prevent integrated evaluation. A resume parser may function without bias independently but contribute to discrimination when integrated with specific ranking algorithms and filtering thresholds. Second, information asymmetries mean deploying organizations bear legal responsibility without technical visibility into vendor-supplied algorithms, while vendors control implementations without meaningful disclosure requirements. Each stakeholder may believe they are compliant; nevertheless, the integrated system may produce biased outcomes. Analysis of implementation ambiguities reveals these challenges in practice. We propose multi-layered interventions including system-level audits, vendor guidelines, continuous monitoring mechanisms, and documentation across dependency chains. Our findings reveal that effective governance requires coordinated action across technical, organizational, and regulatory domains to establish meaningful accountability in distributed development environments.

URL PDF HTML ☆

赞 0 踩 0

2604.22678 2026-04-27 cs.CL

BERAG: Bayesian Ensemble Retrieval-Augmented Generation for Knowledge-based Visual Question Answering

Jinghong Chen, Jingbiao Mei, Guangyu Yang, Bill Byrne

详情

英文摘要

A common approach to question answering with retrieval-augmented generation (RAG) is to concatenate documents into a single context and pass it to a language model to generate an answer. While simple, this strategy can obscure the contribution of individual documents, making attribution difficult and contributing to the ``lost-in-the-middle'' effect, where relevant information in long contexts is overlooked. Concatenation also scales poorly: computational cost grows quadratically with context length, a problem that becomes especially severe when the context includes visual data, as in visual question answering. Attempts to mitigate these issues by limiting context length can further restrict performance by preventing models from benefiting from the improved recall offered by deeper retrieval. We propose Bayesian Ensemble Retrieval-Augmented Generation (BERAG), along with Bayesian Ensemble Fine-Tuning (BEFT), as a RAG framework in which language models are conditioned on individual retrieved documents rather than a single combined context. BERAG treats document posterior probabilities as ensemble weights and updates them token by token using Bayes' rule during generation. This approach enables probabilistic re-ranking, parallel memory usage, and clear attribution of document contribution, making it well-suited for large document collections. We evaluate BERAG and BEFT primarily on knowledge-based visual question answering tasks, where models must reason over long, imperfect retrieval lists. The results show substantial improvements over standard RAG, including strong gains on Document Visual Question Answering and multimodal needle-in-a-haystack benchmarks. We also demonstrate that BERAG mitigates the ``lost-in-the-middle'' effect. The document posterior can be used to detect insufficient grounding and trigger deflection, while document pruning enables faster decoding than standard RAG.

URL PDF HTML ☆

赞 0 踩 0

2604.22675 2026-04-27 cs.SI

Measuring Epistemic Unfairness for Algorithmic Decision-Making

Camilla Quaresmini, Lisa Piccinin, Valentina Breschi

2604.22673 2026-04-27 cs.SE cs.SC

Inferring Equivalence Classes from Legacy Undocumented Embedded Binaries for ISO 26262-Compliant Testing

Marco De Luca, Domenico Francesco De Angelis, Domenico Amalfitano, Pasquale Cimmino, Anna Rita Fasolino

Comments Paper Accepted at EASE 26

2604.22672 2026-04-27 cs.LG

Iterative Model-Learning Scheme via Gaussian Processes for Nonlinear Model Predictive Control of (Semi-)Batch Processes

Tai Xuan Tan, Alexander Mitsos, Eike Cramer

Comments 12 pages, 7 figures