Skip to main content

Showing 1–50 of 228 results for author: Smith, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2604.06648  [pdf, ps, other

    astro-ph.GA cs.CV

    Euclid Quick Data Release (Q1). AgileLens: A scalable CNN-based pipeline for strong gravitational lens identification

    Authors: Euclid Collaboration, X. Xu, R. Chen, T. Li, A. R. Cooray, S. Schuldt, J. A. Acevedo Barroso, D. Stern, D. Scott, M. Meneghetti, G. Despali, J. Chopra, Y. Cao, M. Cheng, J. Buda, J. Zhang, J. Furumizo, R. Valencia, Z. Jiang, C. Tortora, N. E. P. Lines, T. E. Collett, S. Fotopoulou, A. Galan, A. Manjón-García , et al. (286 additional authors not shown)

    Abstract: We present an end-to-end, iterative pipeline for efficient identification of strong galaxy--galaxy lensing systems, applied to the Euclid Q1 imaging data. Starting from VIS catalogues, we reject point sources, apply a magnitude cut (I$_E$ $\leq$ 24) on deflectors, and run a pixel-level artefact/noise filter to build 96 $\times$ 96 pix cutouts; VIS+NISP colour composites are constructed with a VIS-… ▽ More

    Submitted 7 April, 2026; originally announced April 2026.

    Comments: 30 pages, 16 figures

  2. arXiv:2604.04826  [pdf, ps, other

    cs.RO

    Efficient Multi-Objective Planning with Weighted Maximization Using Large Neighbourhood Search

    Authors: Krishna Kalavadia, Shamak Dutta, Yash Vardhan Pant, Stephen L. Smith

    Abstract: Autonomous navigation often requires the simultaneous optimization of multiple objectives. The most common approach scalarizes these into a single cost function using a weighted sum, but this method is unable to find all possible trade-offs and can therefore miss critical solutions. An alternative, the weighted maximum of objectives, can find all Pareto-optimal solutions, including those in non-co… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

  3. arXiv:2604.04604  [pdf, ps, other

    cs.CY cs.AI cs.CR cs.MA

    AI Agents Under EU Law

    Authors: Luca Nannini, Adam Leon Smith, Michele Joshua Maggini, Enrico Panai, Sandra Feliciano, Aleksandr Tiulkanov, Elena Maran, James Gealy, Piercosma Bisconti

    Abstract: AI agents - i.e. AI systems that autonomously plan, invoke external tools, and execute multi-step action chains with reduced human involvement - are being deployed at scale across enterprise functions ranging from customer service and recruitment to clinical decision support and critical infrastructure management. The EU AI Act (Regulation 2024/1689) regulates these systems through a risk-based fr… ▽ More

    Submitted 6 April, 2026; originally announced April 2026.

    Comments: Working Paper - April 2026, subject to updates (EC M/613, M/606, Digital Omnibus proposals)

  4. arXiv:2604.03440  [pdf, ps, other

    cs.CE

    ARES OS 2.0: An Orchestration Software Suite for Autonomous Experimentation Systems and Self-Driving Labs

    Authors: Arthur W. N. Sloan, Robert W. Waelder, Morgen L. Smith, Nicholas Kleiner, Arnas Babeckis, Jason Wheeler, Daylond Hooper, Benji Maruyama

    Abstract: ARES OS 2.0 (hereinafter ARES OS) is an open-source software suite to enable laboratory automation and closed-loop autonomous experimentation. Its function is to orchestrate experimental actions and data handoff between lab equipment, analysis routines, and experimental planning modules through a service-oriented architecture. ARES OS is abstracted to apply to general experimental flows common in… ▽ More

    Submitted 3 April, 2026; originally announced April 2026.

  5. arXiv:2602.17888  [pdf, ps, other

    cs.LG cs.AI

    Machine Learning Based Prediction of Surgical Outcomes in Chronic Rhinosinusitis from Clinical Data

    Authors: Sayeed Shafayet Chowdhury, Karen D'Souza, V. Siva Kakumani, Snehasis Mukhopadhyay, Shiaofen Fang, Rodney J. Schlosser, Daniel M. Beswick, Jeremiah A. Alt, Jess C. Mace, Zachary M. Soler, Timothy L. Smith, Vijay R. Ramakrishnan

    Abstract: Artificial intelligence (AI) has increasingly transformed medical prognostics by enabling rapid and accurate analysis across imaging and pathology. However, the investigation of machine learning predictions applied to prospectively collected, standardized data from observational clinical intervention trials remains underexplored, despite its potential to reduce costs and improve patient outcomes.… ▽ More

    Submitted 19 February, 2026; originally announced February 2026.

  6. arXiv:2601.17227  [pdf, ps, other

    cs.RO

    Hierarchical Informative Path Planning via Graph Guidance and Trajectory Optimization

    Authors: Avraiem Iskandar, Shamak Dutta, Kevin Murrant, Yash Vardhan Pant, Stephen L. Smith

    Abstract: We study informative path planning (IPP) with travel budgets in cluttered environments, where an agent collects measurements of a latent field modeled as a Gaussian process (GP) to reduce uncertainty at target locations. Graph-based solvers provide global guarantees but assume pre-selected measurement locations, while continuous trajectory optimization supports path-based sensing but is computatio… ▽ More

    Submitted 23 January, 2026; originally announced January 2026.

  7. arXiv:2512.24969  [pdf, ps, other

    cond-mat.stat-mech cs.CL physics.bio-ph q-bio.NC

    Large language models and the entropy of English

    Authors: Colin Scheibner, Lindsay M. Smith, William Bialek

    Abstract: We use large language models (LLMs) to uncover long-ranged structure in English texts from a variety of sources. The conditional entropy or code length in many cases continues to decrease with context length at least to $N\sim 10^4$ characters, implying that there are direct dependencies or interactions across these distances. A corollary is that there are small but significant correlations betwee… ▽ More

    Submitted 31 December, 2025; originally announced December 2025.

    Comments: 8 pages, 6 figures

  8. arXiv:2512.11736  [pdf, ps, other

    cs.RO

    Bench-Push: Benchmarking Pushing-based Navigation and Manipulation Tasks for Mobile Robots

    Authors: Ninghan Zhong, Steven Caro, Megnath Ramesh, Rishi Bhatnagar, Avraiem Iskandar, Stephen L. Smith

    Abstract: Mobile robots are increasingly deployed in cluttered environments with movable objects, posing challenges for traditional methods that prohibit interaction. In such settings, the mobile robot must go beyond traditional obstacle avoidance, leveraging pushing or nudging strategies to accomplish its goals. While research in pushing-based robotics is growing, evaluations rely on ad hoc setups, limitin… ▽ More

    Submitted 12 December, 2025; originally announced December 2025.

    Comments: Under review for ICRA 2026

  9. arXiv:2512.10099  [pdf, ps, other

    cs.RO cs.LG

    Push Smarter, Not Harder: Hierarchical RL-Diffusion Policy for Efficient Nonprehensile Manipulation

    Authors: Steven Caro, Stephen L. Smith

    Abstract: Nonprehensile manipulation, such as pushing objects across cluttered environments, presents a challenging control problem due to complex contact dynamics and long-horizon planning requirements. In this work, we propose HeRD, a hierarchical reinforcement learning-diffusion policy that decomposes pushing tasks into two levels: high-level goal selection and low-level trajectory generation. We employ… ▽ More

    Submitted 10 December, 2025; originally announced December 2025.

    Comments: 8 pages, 8 figures

  10. arXiv:2512.10092  [pdf, ps, other

    cs.AI cs.LG

    Interpretable Embeddings with Sparse Autoencoders: A Data Analysis Toolkit

    Authors: Nick Jiang, Xiaoqing Sun, Lisa Dunlap, Lewis Smith, Neel Nanda

    Abstract: Analyzing large-scale text corpora is a core challenge in machine learning, crucial for tasks like identifying undesirable model behaviors or biases in training data. Current methods often rely on costly LLM-based techniques (e.g. annotating dataset differences) or dense embedding models (e.g. for clustering), which lack control over the properties of interest. We propose using sparse autoencoders… ▽ More

    Submitted 10 December, 2025; originally announced December 2025.

    Comments: Code: https://github.com/nickjiang2378/interp_embed

  11. arXiv:2511.22662  [pdf, ps, other

    cs.LG

    Difficulties with Evaluating a Deception Detector for AIs

    Authors: Lewis Smith, Bilal Chughtai, Neel Nanda

    Abstract: Building reliable deception detectors for AI systems -- methods that could predict when an AI system is being strategically deceptive without necessarily requiring behavioural evidence -- would be valuable in mitigating risks from advanced AI systems. But evaluating the reliability and efficacy of a proposed deception detector requires examples that we can confidently label as either deceptive or… ▽ More

    Submitted 16 December, 2025; v1 submitted 27 November, 2025; originally announced November 2025.

  12. arXiv:2511.14759  [pdf, ps, other

    cs.LG cs.RO

    $π^{*}_{0.6}$: a VLA That Learns From Experience

    Authors: Physical Intelligence, Ali Amin, Raichelle Aniceto, Ashwin Balakrishna, Kevin Black, Ken Conley, Grace Connors, James Darpinian, Karan Dhabalia, Jared DiCarlo, Danny Driess, Michael Equi, Adnan Esmail, Yunhao Fang, Chelsea Finn, Catherine Glossop, Thomas Godden, Ivan Goryachev, Lachy Groom, Hunter Hancock, Karol Hausman, Gashon Hussein, Brian Ichter, Szymon Jakubczak, Rowan Jen , et al. (31 additional authors not shown)

    Abstract: We study how vision-language-action (VLA) models can improve through real-world deployments via reinforcement learning (RL). We present a general-purpose method, RL with Experience and Corrections via Advantage-conditioned Policies (RECAP), that provides for RL training of VLAs via advantage conditioning. Our method incorporates heterogeneous data into the self-improvement process, including demon… ▽ More

    Submitted 18 November, 2025; v1 submitted 18 November, 2025; originally announced November 2025.

  13. arXiv:2511.07568  [pdf, ps, other

    cs.AI

    Procedural Knowledge Improves Agentic LLM Workflows

    Authors: Vincent Hsiao, Mark Roberts, Leslie Smith

    Abstract: Large language models (LLMs) often struggle when performing agentic tasks without substantial tool support, prom-pt engineering, or fine tuning. Despite research showing that domain-dependent, procedural knowledge can dramatically increase planning efficiency, little work evaluates its potential for improving LLM performance on agentic tasks that may require implicit planning. We formalize, implem… ▽ More

    Submitted 10 November, 2025; originally announced November 2025.

  14. arXiv:2511.04831  [pdf, ps, other

    cs.RO cs.AI

    Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

    Authors: NVIDIA, :, Mayank Mittal, Pascal Roth, James Tigue, Antoine Richard, Octi Zhang, Peter Du, Antonio Serrano-Muñoz, Xinjie Yao, René Zurbrügg, Nikita Rudin, Lukasz Wawrzyniak, Milad Rakhsha, Alain Denzler, Eric Heiden, Ales Borovicka, Ossama Ahmed, Iretiayo Akinola, Abrar Anwar, Mark T. Carlson, Ji Yuan Feng, Animesh Garg, Renato Gasoto, Lionel Gulich , et al. (82 additional authors not shown)

    Abstract: We present Isaac Lab, the natural successor to Isaac Gym, which extends the paradigm of GPU-native robotics simulation into the era of large-scale multi-modal learning. Isaac Lab combines high-fidelity GPU parallel physics, photorealistic rendering, and a modular, composable architecture for designing environments and training robot policies. Beyond physics and rendering, the framework integrates… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

    Comments: Code and documentation are available here: https://github.com/isaac-sim/IsaacLab

  15. arXiv:2510.19113  [pdf, ps, other

    cs.SI

    UniqueRank: Identifying Important and Difficult-to-Replace Nodes in Attributed Graphs

    Authors: Erica Cai, Benjamin A. Miller, Olga Simek, Christopher L. Smith

    Abstract: Node-ranking methods that focus on structural importance are widely used in a variety of applications, from ranking webpages in search engines to identifying key molecules in biomolecular networks. In real social, supply chain, and terrorist networks, one definition of importance considers the impact on information flow or network productivity when a given node is removed. In practice, however, a… ▽ More

    Submitted 21 October, 2025; originally announced October 2025.

    Comments: In submission to the IEEE, 16 pages, 14 figures

  16. arXiv:2510.15060  [pdf

    cs.CV

    A solution to generalized learning from small training sets found in infant repeated visual experiences of individual objects

    Authors: Frangil Ramirez, Elizabeth Clerkin, David J. Crandall, Linda B. Smith

    Abstract: One-year-old infants show immediate adult-like generalization of common object categories to novel instances. The field has limited understanding of how this early prowess is achieved. Here we provide evidence on infants' daily-life visual experiences for 8 early-learned object categories. Using a corpus of infant head camera images recorded at mealtimes (87 mealtimes captured by 14 infants), we q… ▽ More

    Submitted 27 December, 2025; v1 submitted 16 October, 2025; originally announced October 2025.

    Comments: 18 pages, 5 figures, 2 tables

  17. arXiv:2510.13677  [pdf, ps, other

    gr-qc astro-ph.IM cs.NE physics.comp-ph

    APRIL: Auxiliary Physically-Redundant Information in Loss - A physics-informed framework for parameter estimation with a gravitational-wave case study

    Authors: Matteo Scialpi, Francesco Di Clemente, Leigh Smith, Michał Bejger

    Abstract: Physics-Informed Neural Networks (PINNs) embed the partial differential equations (PDEs) governing the system under study directly into the training of Neural Networks, ensuring solutions that respect physical laws. While effective for single-system problems, standard PINNs scale poorly to datasets containing many realizations of the same underlying physics with varying parameters. To address this… ▽ More

    Submitted 15 October, 2025; originally announced October 2025.

  18. arXiv:2509.18499  [pdf, ps, other

    cs.LG

    Hybrid Data can Enhance the Utility of Synthetic Data for Training Anti-Money Laundering Models

    Authors: Rachel Chung, Pratyush Nidhi Sharma, Mikko Siponen, Rohit Vadodaria, Luke Smith

    Abstract: Money laundering is a critical global issue for financial institutions. Automated Anti-money laundering (AML) models, like Graph Neural Networks (GNN), can be trained to identify illicit transactions in real time. A major issue for developing such models is the lack of access to training data due to privacy and confidentiality concerns. Synthetically generated data that mimics the statistical prop… ▽ More

    Submitted 22 September, 2025; originally announced September 2025.

    Comments: Presented at the Association of Certified Fraud Examiners (ACFE) Research Institute Annual Meeting, Las Vegas, NV, (2024)

  19. arXiv:2509.07282  [pdf, ps, other

    cs.LG cs.AI cs.CL cs.CR

    ALICE: An Interpretable Neural Architecture for Generalization in Substitution Ciphers

    Authors: Jeff Shen, Lindsay M. Smith

    Abstract: We present cryptogram solving as an ideal testbed for studying neural network reasoning and generalization; models must decrypt text encoded with substitution ciphers, choosing from 26! possible mappings without explicit access to the cipher. We develop ALICE (an Architecture for Learning Interpretable Cryptogram dEcipherment), a simple encoder-only Transformer that sets a new state-of-the-art for… ▽ More

    Submitted 24 September, 2025; v1 submitted 8 September, 2025; originally announced September 2025.

    Comments: Preprint. Project page at https://jshen.net/alice. Added section on probing

  20. arXiv:2507.22133  [pdf, ps, other

    cs.CR cs.CL

    Prompt Optimization and Evaluation for LLM Automated Red Teaming

    Authors: Michael Freenor, Lauren Alvarez, Milton Leal, Lily Smith, Joel Garrett, Yelyzaveta Husieva, Madeline Woodruff, Ryan Miller, Erich Kummerfeld, Rafael Medeiros, Sander Schulhoff

    Abstract: Applications that use Large Language Models (LLMs) are becoming widespread, making the identification of system vulnerabilities increasingly important. Automated Red Teaming accelerates this effort by using an LLM to generate and execute attacks against target systems. Attack generators are evaluated using the Attack Success Rate (ASR) the sample mean calculated over the judgment of success for ea… ▽ More

    Submitted 29 July, 2025; originally announced July 2025.

    Comments: 9 pages, 5 Figures, and 1 Appendix item

  21. arXiv:2506.22604  [pdf, ps, other

    cs.AI cs.HC cs.RO

    Bootstrapping Human-Like Planning via LLMs

    Authors: David Porfirio, Vincent Hsiao, Morgan Fine-Morris, Leslie Smith, Laura M. Hiatt

    Abstract: Robot end users increasingly require accessible means of specifying tasks for robots to perform. Two common end-user programming paradigms include drag-and-drop interfaces and natural language programming. Although natural language interfaces harness an intuitive form of human communication, drag-and-drop interfaces enable users to meticulously and precisely dictate the key actions of the robot's… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: Accepted by the 2025 34th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN)

  22. arXiv:2506.15742  [pdf, ps, other

    cs.GR

    FLUX.1 Kontext: Flow Matching for In-Context Image Generation and Editing in Latent Space

    Authors: Black Forest Labs, Stephen Batifol, Andreas Blattmann, Frederic Boesel, Saksham Consul, Cyril Diagne, Tim Dockhorn, Jack English, Zion English, Patrick Esser, Sumith Kulal, Kyle Lacey, Yam Levi, Cheng Li, Dominik Lorenz, Jonas Müller, Dustin Podell, Robin Rombach, Harry Saini, Axel Sauer, Luke Smith

    Abstract: We present evaluation results for FLUX.1 Kontext, a generative flow matching model that unifies image generation and editing. The model generates novel output views by incorporating semantic context from text and image inputs. Using a simple sequence concatenation approach, FLUX.1 Kontext handles both local editing and generative in-context tasks within a single unified architecture. Compared to c… ▽ More

    Submitted 24 June, 2025; v1 submitted 17 June, 2025; originally announced June 2025.

  23. arXiv:2506.05574  [pdf, ps, other

    cs.LG cond-mat.dis-nn cond-mat.stat-mech q-bio.NC stat.ML

    When can in-context learning generalize out of task distribution?

    Authors: Chase Goddard, Lindsay M. Smith, Vudtiwat Ngampruetikorn, David J. Schwab

    Abstract: In-context learning (ICL) is a remarkable capability of pretrained transformers that allows models to generalize to unseen tasks after seeing only a few examples. We investigate empirically the conditions necessary on the pretraining distribution for ICL to emerge and generalize \emph{out-of-distribution}. Previous work has focused on the number of distinct tasks necessary in the pretraining datas… ▽ More

    Submitted 18 August, 2025; v1 submitted 5 June, 2025; originally announced June 2025.

    Comments: ICML 2025

    Journal ref: Proceedings of the 42nd International Conference on Machine Learning, PMLR 267:19585-19599, 2025

  24. arXiv:2505.15802  [pdf, ps, other

    cs.LG eess.SP physics.ao-ph

    A Deep Learning Framework for Two-Dimensional, Multi-Frequency Propagation Factor Estimation

    Authors: Sarah E. Wessinger, Leslie N. Smith, Jacob Gull, Jonathan Gehman, Zachary Beever, Andrew J. Kammerer

    Abstract: Accurately estimating the refractive environment over multiple frequencies within the marine atmospheric boundary layer is crucial for the effective deployment of radar technologies. Traditional parabolic equation simulations, while effective, can be computationally expensive and time-intensive, limiting their practical application. This communication explores a novel approach using deep neural ne… ▽ More

    Submitted 4 September, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: This work has been submitted to the IEEE for possible publication

  25. arXiv:2505.12084   

    cs.RO

    Bench-NPIN: Benchmarking Non-prehensile Interactive Navigation

    Authors: Ninghan Zhong, Steven Caro, Avraiem Iskandar, Megnath Ramesh, Stephen L. Smith

    Abstract: Mobile robots are increasingly deployed in unstructured environments where obstacles and objects are movable. Navigation in such environments is known as interactive navigation, where task completion requires not only avoiding obstacles but also strategic interactions with movable objects. Non-prehensile interactive navigation focuses on non-grasping interaction strategies, such as pushing, rather… ▽ More

    Submitted 5 February, 2026; v1 submitted 17 May, 2025; originally announced May 2025.

    Comments: This paper has been withdrawn by the authors. This paper has been superseded by arXiv:2512.11736

  26. arXiv:2504.16054  [pdf, other

    cs.LG cs.RO

    $π_{0.5}$: a Vision-Language-Action Model with Open-World Generalization

    Authors: Physical Intelligence, Kevin Black, Noah Brown, James Darpinian, Karan Dhabalia, Danny Driess, Adnan Esmail, Michael Equi, Chelsea Finn, Niccolo Fusai, Manuel Y. Galliker, Dibya Ghosh, Lachy Groom, Karol Hausman, Brian Ichter, Szymon Jakubczak, Tim Jones, Liyiming Ke, Devin LeBlanc, Sergey Levine, Adrian Li-Bell, Mohith Mothukuri, Suraj Nair, Karl Pertsch, Allen Z. Ren , et al. (11 additional authors not shown)

    Abstract: In order for robots to be useful, they must perform practically relevant tasks in the real world, outside of the lab. While vision-language-action (VLA) models have demonstrated impressive results for end-to-end robot control, it remains an open question how far such models can generalize in the wild. We describe $π_{0.5}$, a new model based on $π_{0}$ that uses co-training on heterogeneous tasks… ▽ More

    Submitted 22 April, 2025; originally announced April 2025.

  27. Euclid Quick Data Release (Q1). Active galactic nuclei identification using diffusion-based inpainting of Euclid VIS images

    Authors: Euclid Collaboration, G. Stevens, S. Fotopoulou, M. N. Bremer, T. Matamoro Zatarain, K. Jahnke, B. Margalef-Bentabol, M. Huertas-Company, M. J. Smith, M. Walmsley, M. Salvato, M. Mezcua, A. Paulino-Afonso, M. Siudek, M. Talia, F. Ricci, W. Roster, N. Aghanim, B. Altieri, S. Andreon, H. Aussel, C. Baccigalupi, M. Baldi, S. Bardelli, P. Battaglia , et al. (249 additional authors not shown)

    Abstract: Light emission from galaxies exhibit diverse brightness profiles, influenced by factors such as galaxy type, structural features and interactions with other galaxies. Elliptical galaxies feature more uniform light distributions, while spiral and irregular galaxies have complex, varied light profiles due to their structural heterogeneity and star-forming activity. In addition, galaxies with an acti… ▽ More

    Submitted 16 October, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

    Comments: Paper Accepted as part of the A&A Special Issue `Euclid Quick Data Release (Q1)', 34 pages, 26 figures

  28. arXiv:2502.14761  [pdf, other

    cs.HC

    User Awareness and Perspectives Survey on Privacy, Security and Usability of Auditory Prostheses

    Authors: Sohini Saha, Leslie M. Collins, Sherri L. Smith, Boyla O. Mainsah

    Abstract: According to the World Health Organization, over 466 million people worldwide suffer from disabling hearing loss, with approximately 34 million of these being children. Hearing aids (HA) and cochlear implants (CI) have become indispensable tools for restoring hearing and enhancing the quality of life for individuals with hearing impairments. Clinical research and consumer studies indicate that use… ▽ More

    Submitted 20 February, 2025; originally announced February 2025.

    Comments: USENIX Symposium on Usable Privacy and Security (SOUPS) 2024

  29. arXiv:2502.01912  [pdf, ps, other

    cs.CV cs.AI cs.LG

    PATCH: a deep learning method to assess heterogeneity of artistic practice in historical paintings

    Authors: Andrew Van Horn, Lauryn Smith, Mahamad Mahmoud, Michael McMaster, Clara Pinchbeck, Ina Martin, Andrew Lininger, Anthony Ingrisano, Adam Lowe, Carlos Bayod, Elizabeth Bolman, Kenneth Singer, Michael Hinczewski

    Abstract: The history of art has seen significant shifts in the manner in which artworks are created, making understanding of creative processes a central question in technical art history. In the Renaissance and Early Modern period, paintings were largely produced by master painters directing workshops of apprentices who often contributed to projects. The masters varied significantly in artistic and manage… ▽ More

    Submitted 16 July, 2025; v1 submitted 3 February, 2025; originally announced February 2025.

    Comments: main text: 15 pages, 5 figures; SI: 10 pages, 4 figures; v2: minor typo corrections, higher resolution figures; v3: additional comparisons with alternative methods

  30. arXiv:2501.13009  [pdf, other

    cs.CV cs.LG eess.IV

    Deep Learning-Based Image Recovery and Pose Estimation for Resident Space Objects

    Authors: Louis Aberdeen, Mark Hansen, Melvyn L. Smith, Lyndon Smith

    Abstract: As the density of spacecraft in Earth's orbit increases, their recognition, pose and trajectory identification becomes crucial for averting potential collisions and executing debris removal operations. However, training models able to identify a spacecraft and its pose presents a significant challenge due to a lack of available image data for model training. This paper puts forth an innovative fra… ▽ More

    Submitted 22 January, 2025; originally announced January 2025.

    Comments: 10 pages, 13 figures

    MSC Class: 68T07 (Primary) 68T45 (Secondary) ACM Class: I.4.4; I.6.4

  31. arXiv:2412.18972  [pdf, other

    cs.LG cs.AI cs.SE

    Recommending Pre-Trained Models for IoT Devices

    Authors: Parth V. Patil, Wenxin Jiang, Huiyun Peng, Daniel Lugo, Kelechi G. Kalu, Josh LeBlanc, Lawrence Smith, Hyeonwoo Heo, Nathanael Aou, James C. Davis

    Abstract: The availability of pre-trained models (PTMs) has enabled faster deployment of machine learning across applications by reducing the need for extensive training. Techniques like quantization and distillation have further expanded PTM applicability to resource-constrained IoT hardware. Given the many PTM options for any given task, engineers often find it too costly to evaluate each model's suitabil… ▽ More

    Submitted 25 December, 2024; originally announced December 2024.

    Comments: Accepted at SERP4IOT'25

  32. AUTO-IceNav: A Local Navigation Strategy for Autonomous Surface Ships in Broken Ice Fields

    Authors: Rodrigue de Schaetzen, Alexander Botros, Ninghan Zhong, Kevin Murrant, Robert Gash, Stephen L. Smith

    Abstract: Ice conditions often require ships to reduce speed and deviate from their main course to avoid damage to the ship. In addition, broken ice fields are becoming the dominant ice conditions encountered in the Arctic, where the effects of collisions with ice are highly dependent on where contact occurs and on the particular features of the ice floes. In this paper, we present AUTO-IceNav, a framework… ▽ More

    Submitted 16 September, 2025; v1 submitted 26 November, 2024; originally announced November 2024.

    Comments: 20 pages, 18 figures

  33. Artificial Intelligence for Collective Intelligence: A National-Scale Research Strategy

    Authors: Seth Bullock, Nirav Ajmeri, Mike Batty, Michaela Black, John Cartlidge, Robert Challen, Cangxiong Chen, Jing Chen, Joan Condell, Leon Danon, Adam Dennett, Alison Heppenstall, Paul Marshall, Phil Morgan, Aisling O'Kane, Laura G. E. Smith, Theresa Smith, Hywel T. P. Williams

    Abstract: Advances in artificial intelligence (AI) have great potential to help address societal challenges that are both collective in nature and present at national or trans-national scale. Pressing challenges in healthcare, finance, infrastructure and sustainability, for instance, might all be productively addressed by leveraging and amplifying AI for national-scale collective intelligence. The developme… ▽ More

    Submitted 9 November, 2024; originally announced November 2024.

    Comments: 25 pages, 3 figures, Accepted for publication at Knowledge Engineering Review (KER)

    Journal ref: The Knowledge Engineering Review 39 (2024) e10

  34. arXiv:2411.03409  [pdf, other

    cs.RO cs.AI

    STEER: Flexible Robotic Manipulation via Dense Language Grounding

    Authors: Laura Smith, Alex Irpan, Montserrat Gonzalez Arenas, Sean Kirmani, Dmitry Kalashnikov, Dhruv Shah, Ted Xiao

    Abstract: The complexity of the real world demands robotic systems that can intelligently adapt to unseen situations. We present STEER, a robot learning framework that bridges high-level, commonsense reasoning with precise, flexible low-level control. Our approach translates complex situational awareness into actionable low-level behavior through training language-grounded policies with dense annotation. By… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

    Comments: Project website: https://lauramsmith.github.io/steer/

  35. arXiv:2411.02704  [pdf, other

    cs.RO cs.AI cs.CL cs.CV cs.LG

    RT-Affordance: Affordances are Versatile Intermediate Representations for Robot Manipulation

    Authors: Soroush Nasiriany, Sean Kirmani, Tianli Ding, Laura Smith, Yuke Zhu, Danny Driess, Dorsa Sadigh, Ted Xiao

    Abstract: We explore how intermediate policy representations can facilitate generalization by providing guidance on how to perform manipulation tasks. Existing representations such as language, goal images, and trajectory sketches have been shown to be helpful, but these representations either do not provide enough context or provide over-specified context that yields less robust policies. We propose condit… ▽ More

    Submitted 4 November, 2024; originally announced November 2024.

  36. arXiv:2410.10621  [pdf, other

    cs.RO

    Traversability-Aware Legged Navigation by Learning from Real-World Visual Data

    Authors: Hongbo Zhang, Zhongyu Li, Xuanqi Zeng, Laura Smith, Kyle Stachowicz, Dhruv Shah, Linzhu Yue, Zhitao Song, Weipeng Xia, Sergey Levine, Koushil Sreenath, Yun-hui Liu

    Abstract: The enhanced mobility brought by legged locomotion empowers quadrupedal robots to navigate through complex and unstructured environments. However, optimizing agile locomotion while accounting for the varying energy costs of traversing different terrains remains an open challenge. Most previous work focuses on planning trajectories with traversability cost estimation based on human-labeled environm… ▽ More

    Submitted 11 November, 2024; v1 submitted 14 October, 2024; originally announced October 2024.

  37. arXiv:2409.11326  [pdf, other

    cs.RO

    Autonomous Navigation in Ice-Covered Waters with Learned Predictions on Ship-Ice Interactions

    Authors: Ninghan Zhong, Alessandro Potenza, Stephen L. Smith

    Abstract: Autonomous navigation in ice-covered waters poses significant challenges due to the frequent lack of viable collision-free trajectories. When complete obstacle avoidance is infeasible, it becomes imperative for the navigation strategy to minimize collisions. Additionally, the dynamic nature of ice, which moves in response to ship maneuvers, complicates the path planning process. To address these c… ▽ More

    Submitted 18 September, 2024; v1 submitted 17 September, 2024; originally announced September 2024.

  38. arXiv:2409.06084  [pdf, ps, other

    cs.LG eess.SP

    Symmetry constrained neural networks for detection and localization of damage in metal plates

    Authors: James Amarel, Christopher Rudolf, Athanasios Iliopoulos, John Michopoulos, Leslie N. Smith

    Abstract: The present paper is concerned with deep learning techniques applied to detection and localization of damage in a thin aluminum plate. We used data collected on a tabletop apparatus by mounting to the plate four piezoelectric transducers, each of which took turn to generate a Lamb wave that then traversed the region of interest before being received by the remaining three sensors. On training a ne… ▽ More

    Submitted 26 May, 2025; v1 submitted 9 September, 2024; originally announced September 2024.

    Journal ref: APL Mach. Learn. 1 June 2025; 3 (2): 026106

  39. arXiv:2409.03120  [pdf, other

    cs.RO

    Approximate Environment Decompositions for Robot Coverage Planning using Submodular Set Cover

    Authors: Megnath Ramesh, Frank Imeson, Baris Fidan, Stephen L. Smith

    Abstract: In this paper, we investigate the problem of decomposing 2D environments for robot coverage planning. Coverage path planning (CPP) involves computing a cost-minimizing path for a robot equipped with a coverage or sensing tool so that the tool visits all points in the environment. CPP is an NP-Hard problem, so existing approaches simplify the problem by decomposing the environment into the minimum… ▽ More

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: Extended version of the 2024 IEEE CDC paper, 8 pages, 3 figures

  40. arXiv:2409.03055  [pdf, other

    cs.SD eess.AS

    SymPAC: Scalable Symbolic Music Generation With Prompts And Constraints

    Authors: Haonan Chen, Jordan B. L. Smith, Janne Spijkervet, Ju-Chiang Wang, Pei Zou, Bochen Li, Qiuqiang Kong, Xingjian Du

    Abstract: Progress in the task of symbolic music generation may be lagging behind other tasks like audio and text generation, in part because of the scarcity of symbolic training data. In this paper, we leverage the greater scale of audio music data by applying pre-trained MIR models (for transcription, beat tracking, structure analysis, etc.) to extract symbolic events and encode them into token sequences.… ▽ More

    Submitted 9 September, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

    Comments: ISMIR 2024

  41. arXiv:2409.02850  [pdf, other

    cs.LG cs.AI stat.ML

    Oops, I Sampled it Again: Reinterpreting Confidence Intervals in Few-Shot Learning

    Authors: Raphael Lafargue, Luke Smith, Franck Vermet, Mathias Löwe, Ian Reid, Vincent Gripon, Jack Valmadre

    Abstract: The predominant method for computing confidence intervals (CI) in few-shot learning (FSL) is based on sampling the tasks with replacement, i.e.\ allowing the same samples to appear in multiple tasks. This makes the CI misleading in that it takes into account the randomness of the sampler but not the data itself. To quantify the extent of this problem, we conduct a comparative analysis between CIs… ▽ More

    Submitted 6 September, 2024; v1 submitted 4 September, 2024; originally announced September 2024.

    MSC Class: 68T06 ACM Class: I.2; I.4; I.5; G.3

  42. Mix Testing: Specifying and Testing ABI Compatibility of C/C++ Atomics Implementations

    Authors: Luke Geeson, James Brotherston, Wilco Dijkstra, Alastair F. Donaldson, Lee Smith, Tyler Sorensen, John Wickerson

    Abstract: The correctness of complex software depends on the correctness of both the source code and the compilers that generate corresponding binary code. Compilers must do more than preserve the semantics of a single source file: they must ensure that generated binaries can be composed with other binaries to form a final executable. The compatibility of composition is ensured using an Application Binary I… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 26 pages, Accepted to OOPSLA (Object-oriented Programming, Systems, Languages, and Applications) 2024

    ACM Class: D.3.4; D.2.5

  43. arXiv:2408.08441  [pdf, other

    cs.LG cs.RO

    D5RL: Diverse Datasets for Data-Driven Deep Reinforcement Learning

    Authors: Rafael Rafailov, Kyle Hatch, Anikait Singh, Laura Smith, Aviral Kumar, Ilya Kostrikov, Philippe Hansen-Estruch, Victor Kolev, Philip Ball, Jiajun Wu, Chelsea Finn, Sergey Levine

    Abstract: Offline reinforcement learning algorithms hold the promise of enabling data-driven RL methods that do not require costly or dangerous real-world exploration and benefit from large pre-collected datasets. This in turn can facilitate real-world applications, as well as a more standardized approach to RL research. Furthermore, offline RL methods can provide effective initializations for online finetu… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: RLC 2024

  44. arXiv:2408.05147  [pdf, other

    cs.LG cs.AI cs.CL

    Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2

    Authors: Tom Lieberum, Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Nicolas Sonnerat, Vikrant Varma, János Kramár, Anca Dragan, Rohin Shah, Neel Nanda

    Abstract: Sparse autoencoders (SAEs) are an unsupervised method for learning a sparse decomposition of a neural network's latent representations into seemingly interpretable features. Despite recent excitement about their potential, research applications outside of industry are limited by the high cost of training a comprehensive suite of SAEs. In this work, we introduce Gemma Scope, an open suite of JumpRe… ▽ More

    Submitted 19 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

    Comments: 12 main text pages, and 14 pages of acknowledgements, references and appendices

  45. arXiv:2408.00113  [pdf, other

    cs.LG cs.AI cs.CL

    Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models

    Authors: Adam Karvonen, Benjamin Wright, Can Rager, Rico Angell, Jannik Brinkmann, Logan Smith, Claudio Mayrink Verdun, David Bau, Samuel Marks

    Abstract: What latent features are encoded in language model (LM) representations? Recent work on training sparse autoencoders (SAEs) to disentangle interpretable features in LM representations has shown significant promise. However, evaluating the quality of these SAEs is difficult because we lack a ground-truth collection of interpretable features that we expect good SAEs to recover. We thus propose to me… ▽ More

    Submitted 30 October, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

    Comments: Accepted as an oral paper (top 5%) at the ICML 2024 Mechanistic Interpretability Workshop and to the NeurIPS 2024 Main Conference

  46. arXiv:2407.06584  [pdf, other

    cs.RO

    HiLMa-Res: A General Hierarchical Framework via Residual RL for Combining Quadrupedal Locomotion and Manipulation

    Authors: Xiaoyu Huang, Qiayuan Liao, Yiming Ni, Zhongyu Li, Laura Smith, Sergey Levine, Xue Bin Peng, Koushil Sreenath

    Abstract: This work presents HiLMa-Res, a hierarchical framework leveraging reinforcement learning to tackle manipulation tasks while performing continuous locomotion using quadrupedal robots. Unlike most previous efforts that focus on solving a specific task, HiLMa-Res is designed to be general for various loco-manipulation tasks that require quadrupedal robots to maintain sustained mobility. The novel des… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: IROS 2024

  47. arXiv:2407.02666  [pdf, other

    cs.RO cs.AI

    Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models

    Authors: Annie S. Chen, Alec M. Lessing, Andy Tang, Govind Chada, Laura Smith, Sergey Levine, Chelsea Finn

    Abstract: Legged robots are physically capable of navigating a diverse variety of environments and overcoming a wide range of obstructions. For example, in a search and rescue mission, a legged robot could climb over debris, crawl through gaps, and navigate out of dead ends. However, the robot's controller needs to respond intelligently to such varied obstacles, and this requires handling unexpected and unu… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    Comments: 27 pages

  48. arXiv:2406.06615  [pdf, other

    cs.CL cs.AI cs.LG cs.RO

    Language Guided Skill Discovery

    Authors: Seungeun Rho, Laura Smith, Tianyu Li, Sergey Levine, Xue Bin Peng, Sehoon Ha

    Abstract: Skill discovery methods enable agents to learn diverse emergent behaviors without explicit rewards. To make learned skills useful for unknown downstream tasks, obtaining a semantically diverse repertoire of skills is essential. While some approaches introduce a discriminator to distinguish skills and others aim to increase state coverage, no existing work directly addresses the "semantic diversity… ▽ More

    Submitted 28 February, 2025; v1 submitted 7 June, 2024; originally announced June 2024.

  49. arXiv:2404.16014  [pdf, other

    cs.LG cs.AI

    Improving Dictionary Learning with Gated Sparse Autoencoders

    Authors: Senthooran Rajamanoharan, Arthur Conmy, Lewis Smith, Tom Lieberum, Vikrant Varma, János Kramár, Rohin Shah, Neel Nanda

    Abstract: Recent work has found that sparse autoencoders (SAEs) are an effective technique for unsupervised discovery of interpretable features in language models' (LMs) activations, by finding sparse, linear reconstructions of LM activations. We introduce the Gated Sparse Autoencoder (Gated SAE), which achieves a Pareto improvement over training with prevailing methods. In SAEs, the L1 penalty used to enco… ▽ More

    Submitted 30 April, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 15 main text pages, 22 appendix pages

  50. arXiv:2404.07839  [pdf, other

    cs.LG cs.AI cs.CL

    RecurrentGemma: Moving Past Transformers for Efficient Open Language Models

    Authors: Aleksandar Botev, Soham De, Samuel L Smith, Anushan Fernando, George-Cristian Muraru, Ruba Haroun, Leonard Berrada, Razvan Pascanu, Pier Giuseppe Sessa, Robert Dadashi, Léonard Hussenot, Johan Ferret, Sertan Girgin, Olivier Bachem, Alek Andreev, Kathleen Kenealy, Thomas Mesnard, Cassidy Hardin, Surya Bhupatiraju, Shreya Pathak, Laurent Sifre, Morgane Rivière, Mihir Sanjay Kale, Juliette Love, Pouya Tafti , et al. (37 additional authors not shown)

    Abstract: We introduce RecurrentGemma, a family of open language models which uses Google's novel Griffin architecture. Griffin combines linear recurrences with local attention to achieve excellent performance on language. It has a fixed-sized state, which reduces memory use and enables efficient inference on long sequences. We provide two sizes of models, containing 2B and 9B parameters, and provide pre-tr… ▽ More

    Submitted 28 August, 2024; v1 submitted 11 April, 2024; originally announced April 2024.