Skip to main content

Showing 1–9 of 9 results for author: Sarra, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.02668  [pdf, ps, other

    cs.AI cs.LG

    SorryDB: Can AI Provers Complete Real-World Lean Theorems?

    Authors: Austin Letson, Leopoldo Sarra, Auguste Poiroux, Oliver Dressler, Paul Lezeau, Dhyan Aranha, Frederick Pu, Aaron Hill, Miguel Corredera Hidalgo, Julian Berman, George Tsoukalas, Lenny Taelman

    Abstract: We present SorryDB, a dynamically-updating benchmark of open Lean tasks drawn from 78 real world formalization projects on GitHub. Unlike existing static benchmarks, often composed of competition problems, hillclimbing the SorryDB benchmark will yield tools that are aligned to the community needs, more usable by mathematicians, and more capable of understanding complex dependencies. Moreover, by p… ▽ More

    Submitted 3 March, 2026; originally announced March 2026.

  2. arXiv:2602.24273  [pdf, ps, other

    cs.AI

    A Minimal Agent for Automated Theorem Proving

    Authors: Borja Requena, Austin Letson, Krystian Nowakowski, Izan Beltran Ferreiro, Leopoldo Sarra

    Abstract: We propose a minimal agentic baseline that enables systematic comparison across different AI-based theorem prover architectures. This design implements the core features shared among state-of-the-art systems: iterative proof refinement, library search and context management. We evaluate this agentic approach using qualitatively different benchmarks and compare various frontier language models and… ▽ More

    Submitted 11 March, 2026; v1 submitted 27 February, 2026; originally announced February 2026.

  3. arXiv:2510.17959  [pdf, ps, other

    astro-ph.IM cs.AI cs.LG

    Universal Spectral Tokenization via Self-Supervised Panchromatic Representation Learning

    Authors: Jeff Shen, Francois Lanusse, Liam Holden Parker, Ollie Liu, Tom Hehir, Leopoldo Sarra, Lucas Meyer, Micah Bowles, Sebastian Wagner-Carena, Sebastian Wagner-Carena, Helen Qu, Siavash Golkar, Alberto Bietti, Hatim Bourfoune, Nathan Cassereau, Pierre Cornette, Keiya Hirashima, Geraud Krawezik, Ruben Ohana, Nicholas Lourie, Michael McCabe, Rudy Morel, Payel Mukhopadhyay, Mariel Pettee, Bruno Régaldo-Saint Blancard , et al. (3 additional authors not shown)

    Abstract: Sequential scientific data span many resolutions and domains, and unifying them into a common representation is a key step toward developing foundation models for the sciences. Astronomical spectra exemplify this challenge: massive surveys have collected millions of spectra across a wide range of wavelengths and resolutions, yet analyses remain fragmented across spectral domains (e.g., optical vs.… ▽ More

    Submitted 10 November, 2025; v1 submitted 20 October, 2025; originally announced October 2025.

    Comments: Accepted at NeurIPS 2025 Machine Learning and the Physical Sciences Workshop; v2: added collaboration

  4. arXiv:2509.25449  [pdf, ps, other

    cs.LG cs.AI

    Joint Embeddings Go Temporal

    Authors: Sofiane Ennadir, Siavash Golkar, Leopoldo Sarra

    Abstract: Self-supervised learning has seen great success recently in unsupervised representation learning, enabling breakthroughs in natural language and image processing. However, these methods often rely on autoregressive and masked modeling, which aim to reproduce masked information in the input, which can be vulnerable to the presence of noise or confounding variables. To address this problem, Joint-Em… ▽ More

    Submitted 29 September, 2025; originally announced September 2025.

    Comments: Accepted at the Workshop on Time Series in the Age of Large Models - NeurIPS 2024

  5. arXiv:2411.00230  [pdf, ps, other

    quant-ph cs.AI cs.LG

    Reinforcement learning with learned gadgets to tackle hard quantum problems on real hardware

    Authors: Akash Kundu, Leopoldo Sarra

    Abstract: Quantum computing offers exciting opportunities for simulating complex quantum systems and optimizing large scale combinatorial problems, but its practical use is limited by device noise and constrained connectivity. Designing quantum circuits, which are fundamental to quantum algorithms, is therefore a central challenge in current quantum hardware. Existing reinforcement learning based methods fo… ▽ More

    Submitted 18 March, 2026; v1 submitted 31 October, 2024; originally announced November 2024.

    Comments: 28 page: Gadget reinforcement learning

    Journal ref: Communications Physics 9, 44 (2026)

  6. arXiv:2310.03024  [pdf, other

    astro-ph.IM cs.AI cs.LG

    AstroCLIP: A Cross-Modal Foundation Model for Galaxies

    Authors: Liam Parker, Francois Lanusse, Siavash Golkar, Leopoldo Sarra, Miles Cranmer, Alberto Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe, Ruben Ohana, Mariel Pettee, Bruno Regaldo-Saint Blancard, Tiberiu Tesileanu, Kyunghyun Cho, Shirley Ho

    Abstract: We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used - without any model fine-tuning - for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation fro… ▽ More

    Submitted 14 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: 18 pages, accepted in Monthly Notices of the Royal Astronomical Society, Presented at the NeurIPS 2023 AI4Science Workshop

  7. arXiv:2306.14510  [pdf, other

    quant-ph cs.LG

    Deep Bayesian Experimental Design for Quantum Many-Body Systems

    Authors: Leopoldo Sarra, Florian Marquardt

    Abstract: Bayesian experimental design is a technique that allows to efficiently select measurements to characterize a physical system by maximizing the expected information gain. Recent developments in deep neural networks and normalizing flows allow for a more efficient approximation of the posterior and thus the extension of this technique to complex high-dimensional situations. In this paper, we show ho… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  8. arXiv:2305.01521  [pdf, other

    cs.LG stat.ML

    Unlocking the Power of Representations in Long-term Novelty-based Exploration

    Authors: Alaa Saade, Steven Kapturowski, Daniele Calandriello, Charles Blundell, Pablo Sprechmann, Leopoldo Sarra, Oliver Groth, Michal Valko, Bilal Piot

    Abstract: We introduce Robust Exploration via Clustering-based Online Density Estimation (RECODE), a non-parametric method for novelty-based exploration that estimates visitation counts for clusters of states based on their similarity in a chosen embedding space. By adapting classical clustering to the nonstationary setting of Deep RL, RECODE can efficiently track state visitation counts over thousands of e… ▽ More

    Submitted 2 May, 2023; originally announced May 2023.

  9. arXiv:2005.01912  [pdf, other

    cs.LG physics.data-an

    Renormalized Mutual Information for Artificial Scientific Discovery

    Authors: Leopoldo Sarra, Andrea Aiello, Florian Marquardt

    Abstract: We derive a well-defined renormalized version of mutual information that allows to estimate the dependence between continuous random variables in the important case when one is deterministically dependent on the other. This is the situation relevant for feature extraction, where the goal is to produce a low-dimensional effective description of a high-dimensional system. Our approach enables the di… ▽ More

    Submitted 5 March, 2021; v1 submitted 4 May, 2020; originally announced May 2020.

    Comments: Added a more detailed introduction and link to code repository. Physics-based examples and Feature Extraction section have been updated

    Journal ref: Phys. Rev. Lett. 126, 200601 (2021)