Skip to main content

Showing 1–50 of 108 results for author: Bautista, A

.
  1. arXiv:2604.11737  [pdf, ps, other

    cs.CV

    Learning Long-term Motion Embeddings for Efficient Kinematics Generation

    Authors: Nick Stracke, Kolja Bauer, Stefan Andreas Baumann, Miguel Angel Bautista, Josh Susskind, Björn Ommer

    Abstract: Understanding and predicting motion is a fundamental component of visual intelligence. Although modern video models exhibit strong comprehension of scene dynamics, exploring multiple possible futures through full video synthesis remains prohibitively inefficient. We model scene dynamics orders of magnitude more efficiently by directly operating on a long-term motion embedding that is learned from… ▽ More

    Submitted 13 April, 2026; originally announced April 2026.

    Comments: for the project page and code, view https://compvis.github.io/long-term-motion

  2. arXiv:2511.20462  [pdf, ps, other

    cs.CV cs.LG

    STARFlow-V: End-to-End Video Generative Modeling with Normalizing Flows

    Authors: Jiatao Gu, Ying Shen, Tianrong Chen, Laurent Dinh, Yuyang Wang, Miguel Angel Bautista, David Berthelot, Josh Susskind, Shuangfei Zhai

    Abstract: Normalizing flows (NFs) are end-to-end likelihood-based generative models for continuous data, and have recently regained attention with encouraging progress on image generation. Yet in the video generation domain, where spatiotemporal complexity and computational cost are substantially higher, state-of-the-art systems almost exclusively rely on diffusion-based models. In this work, we revisit thi… ▽ More

    Submitted 25 November, 2025; v1 submitted 25 November, 2025; originally announced November 2025.

    Comments: 21 pages, 9 figures. Code and samples are available at https://github.com/apple/ml-starflow

  3. arXiv:2510.14630  [pdf, ps, other

    cs.CV

    Adapting Self-Supervised Representations as a Latent Space for Efficient Generation

    Authors: Ming Gui, Johannes Schusterbauer, Timy Phan, Felix Krause, Josh Susskind, Miguel Angel Bautista, Björn Ommer

    Abstract: We introduce Representation Tokenizer (RepTok), a generative modeling framework that represents an image using a single continuous latent token obtained from self-supervised vision transformers. Building on a pre-trained SSL encoder, we fine-tune only the semantic token embedding and pair it with a generative decoder trained jointly using a standard flow matching objective. This adaptation enriche… ▽ More

    Submitted 16 October, 2025; originally announced October 2025.

    Comments: Code: https://github.com/CompVis/RepTok

  4. arXiv:2509.18480  [pdf, ps, other

    cs.LG q-bio.QM

    SimpleFold: Folding Proteins is Simpler than You Think

    Authors: Yuyang Wang, Jiarui Lu, Navdeep Jaitly, Josh Susskind, Miguel Angel Bautista

    Abstract: Protein folding models have achieved groundbreaking results typically via a combination of integrating domain knowledge into the architectural blocks and training pipelines. Nonetheless, given the success of generative models across different but related problems, it is natural to question whether these architectural designs are a necessary condition to build performant models. In this paper, we i… ▽ More

    Submitted 9 December, 2025; v1 submitted 22 September, 2025; originally announced September 2025.

    Comments: 30 pages, 11 figures, 15 tables

  5. On the classification of Jacobi curves and their conformal curvatures

    Authors: A. Bautista, A. Ibort, J. Lafuente

    Abstract: This paper describes the theory of Jacobi curves, a far reaching extension of the spaces of Jacobi fields along Riemannian geodesics, developed by Agrachev and Zelenko. Jacobi curves are curves in the Lagrangian Grassmannian of a symplectic space satisfying appropriate regularity conditions. It is shown that they are fully characterised in terms of a family of conformal symplectic invariant curvat… ▽ More

    Submitted 19 September, 2025; originally announced September 2025.

    Comments: 33 pages, 1 figure

    MSC Class: 53

    Journal ref: Int. J. Geom. Methods Mod. Phys. Vol.22, No.9 (2025) 2550070

  6. arXiv:2507.00425  [pdf, ps, other

    cs.LG cs.CL

    Flexible Language Modeling in Continuous Space with Transformer-based Autoregressive Flows

    Authors: Ruixiang Zhang, Shuangfei Zhai, Jiatao Gu, Yizhe Zhang, Huangjie Zheng, Tianrong Chen, Miguel Angel Bautista, Josh Susskind, Navdeep Jaitly

    Abstract: Autoregressive models have driven remarkable progress in language modeling. Their foundational reliance on discrete tokens, unidirectional context, and single-pass decoding, while central to their success, also inspires the exploration of a design space that could offer new axes of modeling flexibility. In this work, we explore an alternative paradigm, shifting language modeling from a discrete to… ▽ More

    Submitted 1 July, 2025; originally announced July 2025.

  7. arXiv:2506.06276  [pdf, ps, other

    cs.CV cs.AI cs.LG

    STARFlow: Scaling Latent Normalizing Flows for High-resolution Image Synthesis

    Authors: Jiatao Gu, Tianrong Chen, David Berthelot, Huangjie Zheng, Yuyang Wang, Ruixiang Zhang, Laurent Dinh, Miguel Angel Bautista, Josh Susskind, Shuangfei Zhai

    Abstract: We present STARFlow, a scalable generative model based on normalizing flows that achieves strong performance in high-resolution image synthesis. The core of STARFlow is Transformer Autoregressive Flow (TARFlow), which combines the expressive power of normalizing flows with the structured modeling capabilities of Autoregressive Transformers. We first establish the theoretical universality of TARFlo… ▽ More

    Submitted 6 June, 2025; originally announced June 2025.

    Comments: TLDR: We show for the first time that normalizing flows can be scaled for high-resolution and text-conditioned image synthesis

  8. arXiv:2501.12198  [pdf, ps, other

    cs.SI physics.soc-ph stat.AP

    Opinion dynamics in bounded confidence models with manipulative agents: Moving the Overton window

    Authors: A. Bautista

    Abstract: This paper focuses on the opinion dynamics under the influence of manipulative agents. This type of agents is characterized by the fact that their opinions follow a trajectory that does not respond to the dynamics of the model, although it does influence the rest of the normal agents. Simulation has been implemented to study how one manipulative group modifies the natural dynamics of some opinion… ▽ More

    Submitted 27 January, 2025; v1 submitted 21 January, 2025; originally announced January 2025.

    Comments: 26 pages, 29 figures

    Journal ref: Physica A: Statistical Mechanics and its Applications, Volume 660, 2025, 130379

  9. arXiv:2501.03889  [pdf, other

    astro-ph.HE

    Cosmic-ray acceleration and escape from supernova remnant W44 as probed by Fermi-LAT and MAGIC

    Authors: S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, K. Asano, A. Babi'c, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batkovi'c, A. Bautista, J. Baxter, J. Becerra Gonz'alez, W. Bednarek, E. Bernardini, J. Bernete, A. Berti, J. Besenrieder , et al. (196 additional authors not shown)

    Abstract: Context. The supernova remnant (SNR) W44 and its surroundings are a prime target for studying the acceleration of cosmic rays (CRs). Several previous studies established an extended gamma-ray emission that is set apart from the radio shell of W44. This emission is thought to originate from escaped high-energy CRs that interact with a surrounding dense molecular cloud complex. Aims. We present a de… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

  10. arXiv:2501.03831  [pdf, other

    astro-ph.HE

    Characterization of Markarian 421 during its most violent year: Multiwavelength variability and correlations

    Authors: K. Abe, S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, K. Asano, D. Baack, A. Babić, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete, A. Berti , et al. (190 additional authors not shown)

    Abstract: Mrk 421 was in its most active state around early 2010, which led to the highest TeV gamma-ray flux ever recorded from any active galactic nuclei. We aim to characterize the multiwavelength behavior during this exceptional year for Mrk 421, and evaluate whether it is consistent with the picture derived with data from other less exceptional years. We investigated the period from November 5, 2009, (… ▽ More

    Submitted 7 January, 2025; originally announced January 2025.

    Comments: Accepted for publication in Astronomy & Astrophysics. Corresponding authors: Felix Schmuckermaier, David Paneque, Axel Arbet Engels

  11. Time-dependent modelling of short-term variability in the TeV-blazar VER J0521+211 during the major flare in 2020

    Authors: MAGIC Collaboration, S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, D. Baack, A. Babić, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete , et al. (206 additional authors not shown)

    Abstract: The BL Lacertae object VER J0521+211 underwent a notable flaring episode in February 2020. A short-term monitoring campaign, led by the MAGIC (Major Atmospheric Gamma Imaging Cherenkov) collaboration, covering a wide energy range from radio to very-high-energy (VHE, 100 GeV < E < 100 TeV) gamma rays was organised to study its evolution. These observations resulted in a consistent detection of the… ▽ More

    Submitted 20 December, 2024; originally announced December 2024.

    Comments: Accepted for publication in A&A

    Journal ref: A&A 694, A308 (2025)

  12. arXiv:2412.06329  [pdf, ps, other

    cs.CV cs.LG

    Normalizing Flows are Capable Generative Models

    Authors: Shuangfei Zhai, Ruixiang Zhang, Preetum Nakkiran, David Berthelot, Jiatao Gu, Huangjie Zheng, Tianrong Chen, Miguel Angel Bautista, Navdeep Jaitly, Josh Susskind

    Abstract: Normalizing Flows (NFs) are likelihood-based models for continuous inputs. They have demonstrated promising results on both density estimation and generative modeling tasks, but have received relatively little attention in recent years. In this work, we demonstrate that NFs are more powerful than previously believed. We present TarFlow: a simple and scalable architecture that enables highly perfor… ▽ More

    Submitted 6 June, 2025; v1 submitted 9 December, 2024; originally announced December 2024.

    Comments: ICML 2025

  13. arXiv:2412.03791  [pdf, ps, other

    cs.LG cs.AI

    INRFlow: Flow Matching for INRs in Ambient Space

    Authors: Yuyang Wang, Anurag Ranjan, Josh Susskind, Miguel Angel Bautista

    Abstract: Flow matching models have emerged as a powerful method for generative modeling on domains like images or videos, and even on irregular or unstructured data like 3D point clouds or even protein structures. These models are commonly trained in two stages: first, a data compressor is trained, and in a subsequent training stage a flow matching generative model is trained in the latent space of the dat… ▽ More

    Submitted 28 May, 2025; v1 submitted 4 December, 2024; originally announced December 2024.

    Comments: 22 pages, 14 figures, 13 tables

  14. arXiv:2412.01821  [pdf, other

    cs.CV

    World-consistent Video Diffusion with Explicit 3D Modeling

    Authors: Qihang Zhang, Shuangfei Zhai, Miguel Angel Bautista, Kevin Miao, Alexander Toshev, Joshua Susskind, Jiatao Gu

    Abstract: Recent advancements in diffusion models have set new benchmarks in image and video generation, enabling realistic visual synthesis across single- and multi-frame contexts. However, these models still struggle with efficiently and explicitly generating 3D-consistent content. To address this, we propose World-consistent Video Diffusion (WVD), a novel framework that incorporates explicit 3D supervisi… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

    Comments: 16 pages, 10 figures

  15. arXiv:2409.18823  [pdf, other

    astro-ph.IM astro-ph.HE

    Standardised formats and open-source analysis tools for the MAGIC telescopes data

    Authors: S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete, A. Berti, J. Besenrieder , et al. (186 additional authors not shown)

    Abstract: Instruments for gamma-ray astronomy at Very High Energies ($E>100\,{\rm GeV}$) have traditionally derived their scientific results through proprietary data and software. Data standardisation has become a prominent issue in this field both as a requirement for the dissemination of data from the next generation of gamma-ray observatories and as an effective solution to realise public data legacies o… ▽ More

    Submitted 7 October, 2024; v1 submitted 27 September, 2024; originally announced September 2024.

    Comments: Accepted for publication in the Journal of High Energy Astrophysics

    Journal ref: Journal of High Energy Astrophysics, Volume 44, pp. 266-278, November 2024

  16. Constraints on Lorentz invariance violation from the extraordinary Mrk 421 flare of 2014 using a novel analysis method

    Authors: MAGIC Collaboration, S. Abe, J. Abhir, A. Abhishek, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, J. Bernete , et al. (192 additional authors not shown)

    Abstract: The Lorentz Invariance Violation (LIV), a proposed consequence of certain quantum gravity (QG) scenarios, could instigate an energy-dependent group velocity for ultra-relativistic particles. This energy dependence, although suppressed by the massive QG energy scale $E_\mathrm{QG}$, expected to be on the level of the Planck energy $1.22 \times 10^{19}$ GeV, is potentially detectable in astrophysica… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Journal ref: Journal of Cosmology and Astroparticle Physics 07 p.044 (2024)

  17. arXiv:2405.07913  [pdf, other

    cs.CV

    CTRLorALTer: Conditional LoRAdapter for Efficient 0-Shot Control & Altering of T2I Models

    Authors: Nick Stracke, Stefan Andreas Baumann, Joshua M. Susskind, Miguel Angel Bautista, Björn Ommer

    Abstract: Text-to-image generative models have become a prominent and powerful tool that excels at generating high-resolution realistic images. However, guiding the generative process of these models to consider detailed forms of conditioning reflecting style and/or structure information remains an open problem. In this paper, we present LoRAdapter, an approach that unifies both style and structure conditio… ▽ More

    Submitted 8 October, 2024; v1 submitted 13 May, 2024; originally announced May 2024.

    Comments: for the project page and code, view https://compvis.github.io/LoRAdapter/

  18. arXiv:2404.17623  [pdf

    astro-ph.HE astro-ph.GA

    Broadband Multi-wavelength Properties of M87 during the 2018 EHT Campaign including a Very High Energy Flaring Episode

    Authors: J. C. Algaba, M. Balokovic, S. Chandra, W. Y. Cheong, Y. Z. Cui, F. D'Ammando, A. D. Falcone, N. M. Ford, M. Giroletti, C. Goddi, M. A. Gurwell, K. Hada, D. Haggard, S. Jorstad, A. Kaur, T. Kawashima, S. Kerby, J. Y. Kim, M. Kino, E. V. Kravchenko, S. S. Lee, R. S. Lu, S. Markoff, J. Michail, J. Neilsen , et al. (721 additional authors not shown)

    Abstract: The nearby elliptical galaxy M87 contains one of the only two supermassive black holes whose emission surrounding the event horizon has been imaged by the Event Horizon Telescope (EHT). In 2018, more than two dozen multi-wavelength (MWL) facilities (from radio to gamma-ray energies) took part in the second M87 EHT campaign. The goal of this extensive MWL campaign was to better understand the physi… ▽ More

    Submitted 5 December, 2024; v1 submitted 24 April, 2024; originally announced April 2024.

    Comments: 46 pages, 23 figures, accepted by Astronomy & Astrophysics on August. 29, 2024

    Journal ref: A&A 692, A140 (2024)

  19. arXiv:2404.06878  [pdf, other

    astro-ph.GA astro-ph.SR

    PROJECT-J: JWST observations of HH46~IRS and its outflow. Overview and first results

    Authors: B. Nisini, M. G. Navarro, T. Giannini, S. Antoniucci, P. J. Kavanagh, P. Hartigan, F. Bacciotti, A. Caratti o Garatti, A. Noriega Crespo, E. van Dishoek, E. Whelan, H. G. Arce, S. Cabrit, D. Coffey, D. Fedele, J. Eisloeffel, M. E. Palumbo, L. Podio, T. P. Ray, M. Schultze, R. G. Urso, J. M. Alcala', M. A. Bautista, C. Codella, T. G. Greene , et al. (1 additional authors not shown)

    Abstract: We present the first results of the JWST program PROJECT-J (PROtostellar JEts Cradle Tested with JWST ), designed to study the Class I source HH46 IRS and its outflow through NIRSpec and MIRI spectroscopy (1.66 to 28 micron). The data provide line-images (~ 6.6" in length with NIRSpec, and up to 20" with MIRI) revealing unprecedented details within the jet, the molecular outflow and the cavity. We… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: 28 pages, 15 figures, Accepted for publication on The Astrophysical Journal (9 April 2024)

  20. arXiv:2402.04755  [pdf, other

    astro-ph.IM astro-ph.SR

    Performance and first measurements of the MAGIC Stellar Intensity Interferometer

    Authors: MAGIC Collaboration, S. Abe, J. Abhir, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, M. Artero, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, E. Bernardini, M. Bernardos, J. Bernete, A. Berti , et al. (195 additional authors not shown)

    Abstract: In recent years, a new generation of optical intensity interferometers has emerged, leveraging the existing infrastructure of Imaging Atmospheric Cherenkov Telescopes (IACTs). The MAGIC telescopes host the MAGIC-SII system (Stellar Intensity Interferometer), implemented to investigate the feasibility and potential of this technique on IACTs. After the first successful measurements in 2019, the sys… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

    Comments: 18 pages, 13 figures, submitted to MNRAS

  21. Insights into the broad-band emission of the TeV blazar Mrk 501 during the first X-ray polarization measurements

    Authors: S. Abe, J. Abhir, V. A. Acciari, A. Aguasca-Cabot, I. Agudo, T. Aniello, S. Ansoldi, L. A. Antonelli, A. Arbet Engels, C. Arcaro, K. Asano, A. Babić, A. Baquero, U. Barres de Almeida, J. A. Barrio, I. Batković, A. Bautista, J. Baxter, J. Becerra González, W. Bednarek, E. Bernardini, M. Bernardos, J. Bernete, A. Berti, J. Besenrieder , et al. (239 additional authors not shown)

    Abstract: We present the first multi-wavelength study of Mrk 501 including very-high-energy (VHE) gamma-ray observations simultaneous to X-ray polarization measurements from the Imaging X-ray Polarimetry Explorer (IXPE). We use radio-to-VHE data from a multi-wavelength campaign organized between 2022-03-01 and 2022-07-19. The observations were performed by MAGIC, Fermi-LAT, NuSTAR, Swift (XRT and UVOT), and… ▽ More

    Submitted 1 September, 2025; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: A&A, 685, A117 (2024). 19 pages, 9 figures. Corresponding authors: Lea Heckmann, Axel Arbet Engels, David Paneque

  22. arXiv:2401.08541  [pdf, other

    cs.CV

    Scalable Pre-training of Large Autoregressive Image Models

    Authors: Alaaeldin El-Nouby, Michal Klein, Shuangfei Zhai, Miguel Angel Bautista, Alexander Toshev, Vaishaal Shankar, Joshua M Susskind, Armand Joulin

    Abstract: This paper introduces AIM, a collection of vision models pre-trained with an autoregressive objective. These models are inspired by their textual counterparts, i.e., Large Language Models (LLMs), and exhibit similar scaling properties. Specifically, we highlight two key findings: (1) the performance of the visual features scale with both the model capacity and the quantity of data, (2) the value o… ▽ More

    Submitted 16 January, 2024; originally announced January 2024.

    Comments: https://github.com/apple/ml-aim

  23. arXiv:2311.17932  [pdf, other

    physics.chem-ph cs.LG

    Swallowing the Bitter Pill: Simplified Scalable Conformer Generation

    Authors: Yuyang Wang, Ahmed A. Elhag, Navdeep Jaitly, Joshua M. Susskind, Miguel Angel Bautista

    Abstract: We present a novel way to predict molecular conformers through a simple formulation that sidesteps many of the heuristics of prior works and achieves state of the art results by using the advantages of scale. By training a diffusion generative model directly on 3D atomic positions without making assumptions about the explicit structure of molecules (e.g. modeling torsional angles) we are able to r… ▽ More

    Submitted 10 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 19 pages, 11 figures

  24. arXiv:2310.08866  [pdf, other

    cs.LG cs.AI

    Adaptivity and Modularity for Efficient Generalization Over Task Complexity

    Authors: Samira Abnar, Omid Saremi, Laurent Dinh, Shantel Wilson, Miguel Angel Bautista, Chen Huang, Vimal Thilak, Etai Littwin, Jiatao Gu, Josh Susskind, Samy Bengio

    Abstract: Can transformers generalize efficiently on problems that require dealing with examples with different levels of difficulty? We introduce a new task tailored to assess generalization over different complexities and present results that indicate that standard transformers face challenges in solving these tasks. These tasks are variations of pointer value retrieval previously introduced by Zhang et a… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  25. arXiv:2310.08587  [pdf, other

    cs.CV

    Pseudo-Generalized Dynamic View Synthesis from a Video

    Authors: Xiaoming Zhao, Alex Colburn, Fangchang Ma, Miguel Angel Bautista, Joshua M. Susskind, Alexander G. Schwing

    Abstract: Rendering scenes observed in a monocular video from novel viewpoints is a challenging problem. For static scenes the community has studied both scene-specific optimization techniques, which optimize on every test scene, and generalized techniques, which only run a deep net forward pass on a test scene. In contrast, for dynamic scenes, scene-specific optimization techniques exist, but, to our best… ▽ More

    Submitted 19 February, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; Originally titled as "Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today?"; Project page: https://xiaoming-zhao.github.io/projects/pgdvs

  26. arXiv:2306.07290  [pdf, other

    cs.LG cs.AI

    Value function estimation using conditional diffusion models for control

    Authors: Bogdan Mazoure, Walter Talbott, Miguel Angel Bautista, Devon Hjelm, Alexander Toshev, Josh Susskind

    Abstract: A fairly reliable trend in deep reinforcement learning is that the performance scales with the number of parameters, provided a complimentary scaling in amount of training data. As the appetite for large models increases, it is imperative to address, sooner than later, the potential problem of running out of high-quality demonstrations. In this case, instead of collecting only new data via costly… ▽ More

    Submitted 9 June, 2023; originally announced June 2023.

  27. arXiv:2305.15586  [pdf, other

    cs.LG

    Manifold Diffusion Fields

    Authors: Ahmed A. Elhag, Yuyang Wang, Joshua M. Susskind, Miguel Angel Bautista

    Abstract: We present Manifold Diffusion Fields (MDF), an approach that unlocks learning of diffusion models of data in general non-Euclidean geometries. Leveraging insights from spectral geometry analysis, we define an intrinsic coordinate system on the manifold via the eigen-functions of the Laplace-Beltrami Operator. MDF represents functions using an explicit parametrization formed by a set of multiple in… ▽ More

    Submitted 19 January, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: ICLR24 paper

  28. arXiv:2305.13218  [pdf, other

    math.OC

    Ground truth clustering is not the optimum clustering

    Authors: Lucia Absalom Bautista, Timotej Hrga, Janez Povh, Shudian Zhao

    Abstract: The clustering of data is one of the most important and challenging topics in data science. The minimum sum-of-squares clustering (MSSC) problem asks to cluster the data points into $k$ clusters such that the sum of squared distances between the data points and their cluster centers (centroids) is minimized. This problem is NP-hard, but there exist exact solvers that can solve such problem to opti… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

    Comments: 23 pages; 2 figures, 5 tables

  29. arXiv:2303.00165  [pdf, other

    cs.CV cs.AI

    Diffusion Probabilistic Fields

    Authors: Peiye Zhuang, Samira Abnar, Jiatao Gu, Alex Schwing, Joshua M. Susskind, Miguel Ángel Bautista

    Abstract: Diffusion probabilistic models have quickly become a major approach for generative modeling of images, 3D geometry, video and other domains. However, to adapt diffusion generative modeling to these domains the denoising network needs to be carefully designed for each domain independently, oftentimes under the assumption that data lives in a Euclidean grid. In this paper we introduce Diffusion Prob… ▽ More

    Submitted 28 February, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023. 20 pages, 17 figures

  30. arXiv:2210.04955  [pdf, other

    cs.CV cs.LG

    f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation

    Authors: Jiatao Gu, Shuangfei Zhai, Yizhe Zhang, Miguel Angel Bautista, Josh Susskind

    Abstract: Diffusion models (DMs) have recently emerged as SoTA tools for generative modeling in various domains. Standard DMs can be viewed as an instantiation of hierarchical variational autoencoders (VAEs) where the latent variables are inferred from input-centered Gaussian distributions with fixed scales and variances. Unlike VAEs, this formulation limits DMs from changing the latent spaces and learning… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 28 pages, 21 figures, work in progress

  31. arXiv:2207.13751  [pdf, other

    cs.CV cs.GR cs.LG

    GAUDI: A Neural Architect for Immersive 3D Scene Generation

    Authors: Miguel Angel Bautista, Pengsheng Guo, Samira Abnar, Walter Talbott, Alexander Toshev, Zhuoyuan Chen, Laurent Dinh, Shuangfei Zhai, Hanlin Goh, Daniel Ulbricht, Afshin Dehghan, Josh Susskind

    Abstract: We introduce GAUDI, a generative model capable of capturing the distribution of complex and realistic 3D scenes that can be rendered immersively from a moving camera. We tackle this challenging problem with a scalable yet powerful approach, where we first optimize a latent representation that disentangles radiance fields and camera poses. This latent representation is then used to learn a generati… ▽ More

    Submitted 27 July, 2022; originally announced July 2022.

    Comments: Project webpage: https://github.com/apple/ml-gaudi

  32. L-extensions and L-boundary of conformal spacetimes

    Authors: A. Bautista, A. Ibort, J. Lafuente

    Abstract: The notion of L-boundary, a new causal boundary proposed by R. Low based on constructing a `sky at infinity' for any light ray, is discussed in detail. The analysis of the notion of L-boundary will be done in the 3-dimensional situation for the ease of presentation. The proposed notion of causal boundary is intrinsically conformal and, as it will be proved in the paper, under natural conditions pr… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 39 pages, 6 figures

    Journal ref: General Relativity and Gravitation, volume 50, article number: 153 (2018)

  33. arXiv:2207.01273  [pdf, other

    gr-qc math-ph

    A conformal boundary for space-times based on light-like geodesics: the 3-dimensional case

    Authors: A. Bautista, A. Ibort, J. Lafuente, R. Low

    Abstract: A new causal boundary, which we will term the $l$-boundary, inspired by the geometry of the space of light rays and invariant by conformal diffeomorphisms for space-times of any dimension $m\geq 3$, proposed by one of the authors (R.J. Low, The space of null geodesics (and a new causal boundary), Lecture Notes in Physics, 692, Springer, 2006, 35--50) is analyzed in detail for space-times of dimens… ▽ More

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: 28 pages, 5 figures

    Journal ref: Journal of Mathematical Physics, 58, 022503 (2017)

  34. arXiv:2206.14095  [pdf, ps, other

    astro-ph.SR physics.atom-ph

    Atomic Radiative Data for Oxygen and Nitrogen for Solar Photospheric Studies

    Authors: Manuel A. Bautista, Maria Bergemann, Helena Carvajal Gallego, Sébastien Gamrath, Patrick Palmeri, Pascal Quinet

    Abstract: Our recent re-analysis of the solar photospheric spectra with non-local thermodynamic equilibrium (non-LTE) models resulted in higher metal abundances compared to previous works. When applying the new chemical abundances to Standard Solar Model calculations, the new composition resolves the long-standing discrepancies with independent constraints on the solar structure from helioseismology. Critic… ▽ More

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: 16 pages, 6 tables, 5 figures. Accepted for publication in Astronomy and Astrophysics

    Journal ref: A&A 665, A18 (2022)

  35. The sky invariant: A new conformal invariant for Schwarzschild spacetime

    Authors: A. Bautista, A. Ibort, J. Lafuente

    Abstract: A new class of conformal invariants for a given spacetime $M$ is introduced exploiting the conformal geometry of any light ray $Γ$. Each congruence of light rays passing through a given point $p$ defines the sky $S(p)$ of such point. The new conformal invariants are defined on the bundle of skies of the spacetime $M$, being called sky invariants accordingly. The natural conformal covariant derivat… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 20 pages, 2 figures, accepted for publication in International Journal of Geometric Methods in Modern Physics

  36. The space of light rays: Causality and $L$-boundary

    Authors: A. Bautista, A. Ibort, J. Lafuente

    Abstract: The space of light rays $\mathcal{N}$ of a conformal Lorentz manifold $(M,\mathcal{C})$ is, under some topological conditions, a manifold whose basic elements are unparametrized null geodesics. This manifold $\mathcal{N}$, strongly inspired on R. Penrose's twistor theory, keeps all information of $M$ and it could be used as a space complementing the spacetime model. In the present review, the geom… ▽ More

    Submitted 10 June, 2022; originally announced June 2022.

    Comments: 57 pages, 15 figures, accepted for publication in the topical collection of General Relativity and Gravitation dedicated to the meeting Singularity theorems, causality, and all that (SCRI21) in honor of Roger Penrose

  37. arXiv:2205.07763  [pdf, other

    cs.CV

    FvOR: Robust Joint Shape and Pose Optimization for Few-view Object Reconstruction

    Authors: Zhenpei Yang, Zhile Ren, Miguel Angel Bautista, Zaiwei Zhang, Qi Shan, Qixing Huang

    Abstract: Reconstructing an accurate 3D object model from a few image observations remains a challenging problem in computer vision. State-of-the-art approaches typically assume accurate camera poses as input, which could be difficult to obtain in realistic settings. In this paper, we present FvOR, a learning-based object reconstruction method that predicts accurate 3D models given a few images with noisy i… ▽ More

    Submitted 16 May, 2022; originally announced May 2022.

    Comments: CVPR 2022

  38. arXiv:2205.04708  [pdf, other

    astro-ph.HE astro-ph.GA

    Time Dependent Photoionization Modeling of Warm Absorbers in Active Galactic Nuclei

    Authors: Dev R Sadaula, Manuel A Bautista, Javier A Garcia, Timothy R Kallman

    Abstract: Warm absorber spectra contain bound-bound and bound-free absorption features seen in the X-ray and UV spectra from many active galactic nuclei (AGN). The widths and centroid energies of these features indicate they occur in outflowing gas, and the outflow can affect the gas within the host galaxy. Thus the warm absorber mass and energy budgets are of great interest. Estimates for these properties… ▽ More

    Submitted 17 February, 2023; v1 submitted 10 May, 2022; originally announced May 2022.

  39. Plasma environment effects on K lines of astrophysical interest. V. Universal formulae for ionization potential and K-threshold shifts

    Authors: P. Palmeri, J. Deprince, M. A. Bautista, S. Fritzsche, J. A. Garcia, T. R. Kallman, C. Mendoza, P. Quinet

    Abstract: Aims. We calculate the plasma environment effects on the ionization potentials (IPs) and K-thresholds used in the modeling of K lines for all the ions belonging to the isonuclear sequences of abundant elements apart from oxygen and iron, namely: carbon, silicon, calcium, chromium, and nickel. These calculations are used to extend the data points for the fits of the universal formulae, first propos… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Journal ref: A&A 657, A61 (2022)

  40. arXiv:2107.05775  [pdf, other

    cs.CV cs.GR cs.LG

    Fast and Explicit Neural View Synthesis

    Authors: Pengsheng Guo, Miguel Angel Bautista, Alex Colburn, Liang Yang, Daniel Ulbricht, Joshua M. Susskind, Qi Shan

    Abstract: We study the problem of novel view synthesis from sparse source observations of a scene comprised of 3D objects. We propose a simple yet effective approach that is neither continuous nor implicit, challenging recent trends on view synthesis. Our approach explicitly encodes observations into a volumetric representation that enables amortized rendering. We demonstrate that although continuous radian… ▽ More

    Submitted 8 December, 2021; v1 submitted 12 July, 2021; originally announced July 2021.

  41. arXiv:2104.00670  [pdf, other

    cs.CV cs.LG

    Unconstrained Scene Generation with Locally Conditioned Radiance Fields

    Authors: Terrance DeVries, Miguel Angel Bautista, Nitish Srivastava, Graham W. Taylor, Joshua M. Susskind

    Abstract: We tackle the challenge of learning a distribution over complex, realistic, indoor scenes. In this paper, we introduce Generative Scene Networks (GSN), which learns to decompose scenes into a collection of many local radiance fields that can be rendered from a free moving camera. Our model can be used as a prior to generate new scenes, or to complete a scene given only sparse 2D observations. Rece… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.

  42. arXiv:2012.02041  [pdf, other

    astro-ph.IM astro-ph.HE

    The XSTAR Atomic Database

    Authors: Claudio Mendoza, Manuel A. Bautista, Jérôme Deprince, Javier A. García, Efraín Gatuzz, Thomas W. Gorczyca, Timothy R. Kallman, Patrick Palmeri, Pascal Quinet, Michael C. Witthoeft

    Abstract: We describe the atomic database of the XSTAR spectral modeling code, summarizing the systematic upgrades carried out in the past twenty years to enable the modeling of K lines from chemical elements with atomic number $Z\leq 30$ and recent extensions to handle high-density plasmas. Such plasma environments are found, for instance, in the inner region of accretion disks round compact objects (neutr… ▽ More

    Submitted 3 December, 2020; originally announced December 2020.

    Comments: 36 pages, 11 figures

  43. arXiv:2011.02523  [pdf, other

    cs.CV cs.GR

    Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

    Authors: Mike Roberts, Jason Ramapuram, Anurag Ranjan, Atulit Kumar, Miguel Angel Bautista, Nathan Paczan, Russ Webb, Joshua M. Susskind

    Abstract: For many fundamental scene understanding tasks, it is difficult or impossible to obtain per-pixel ground truth labels from real images. We address this challenge by introducing Hypersim, a photorealistic synthetic dataset for holistic indoor scene understanding. To create our dataset, we leverage a large repository of synthetic scenes created by professional artists, and we generate 77,400 images… ▽ More

    Submitted 17 August, 2021; v1 submitted 4 November, 2020; originally announced November 2020.

    Comments: Accepted for publication at the International Conference on Computer Vision (ICCV) 2021

  44. arXiv:2009.10586  [pdf, other

    astro-ph.GA physics.atom-ph

    Atomic Data Assessment with PyNeb

    Authors: Christophe Morisset, Valentina Luridiana, Jorge García-Rojas, Verónica Gómez-Llanos, Manuel A. Bautista, Claudio Mendoza

    Abstract: PyNeb is a Python package widely used to model emission lines in gaseous nebulae. We take advantage of its object-oriented architecture, class methods, and historical atomic database to structure a practical environment for atomic data assessment. Our aim is to reduce the uncertainties in parameter space (line-ratio diagnostics, electron density and temperature, and ionic abundances) arising from… ▽ More

    Submitted 8 October, 2020; v1 submitted 22 September, 2020; originally announced September 2020.

    Comments: published in Atoms MDPI

    Journal ref: PyNeb. Atoms 2020, 8, 66

  45. arXiv:2006.15427  [pdf, other

    cs.CV

    On the generalization of learning-based 3D reconstruction

    Authors: Miguel Angel Bautista, Walter Talbott, Shuangfei Zhai, Nitish Srivastava, Joshua M Susskind

    Abstract: State-of-the-art learning-based monocular 3D reconstruction methods learn priors over object categories on the training set, and as a result struggle to achieve reasonable generalization to object categories unseen during training. In this paper we study the inductive biases encoded in the model architecture that impact the generalization of learning-based 3D reconstruction methods. We find that 3… ▽ More

    Submitted 27 June, 2020; originally announced June 2020.

  46. arXiv:2006.10705  [pdf, other

    cs.LG cs.CV stat.ML

    Set Distribution Networks: a Generative Model for Sets of Images

    Authors: Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Carlos Guestrin, Josh M. Susskind

    Abstract: Images with shared characteristics naturally form sets. For example, in a face verification benchmark, images of the same identity form sets. For generative models, the standard way of dealing with sets is to represent each as a one hot vector, and learn a conditional generative model $p(\mathbf{x}|\mathbf{y})$. This representation assumes that the number of sets is limited and known, such that th… ▽ More

    Submitted 18 June, 2020; originally announced June 2020.

  47. arXiv:2006.07630  [pdf, other

    cs.CV stat.ML

    Equivariant Neural Rendering

    Authors: Emilien Dupont, Miguel Angel Bautista, Alex Colburn, Aditya Sankar, Carlos Guestrin, Josh Susskind, Qi Shan

    Abstract: We propose a framework for learning neural scene representations directly from images, without 3D supervision. Our key insight is that 3D structure can be imposed by ensuring that the learned representation transforms like a real 3D scene. Specifically, we introduce a loss which enforces equivariance of the scene representation with respect to 3D transformations. Our formulation allows us to infer… ▽ More

    Submitted 21 December, 2020; v1 submitted 13 June, 2020; originally announced June 2020.

    Comments: Add link to code

  48. arXiv:2005.02139  [pdf, ps, other

    astro-ph.SR astro-ph.GA

    On the changes in the physical properties of the ionized region around the Weigelt structures in Eta Carinae over the 5.54-yr spectroscopic cycle

    Authors: M. Teodoro, T. R. Gull, M. A. Bautista, D. J. Hillier, G Weigelt, M. Corcoran

    Abstract: We present HST/STIS observations and analysis of two prominent nebular structures around the central source of Eta Carinae, the knots C and D. The former is brighter than the latter for emission lines from intermediate or high ionization potential ions. The brightness of lines from intermediate and high ionization potential ions significantly decreases at phases around periastron. We do not see co… ▽ More

    Submitted 5 May, 2020; originally announced May 2020.

    Comments: 19 pages, 18 figures

  49. arXiv:2001.11915  [pdf, other

    physics.atom-ph astro-ph.HE

    Plasma-environment effects on K lines of astrophysical interest III. IPs, K thresholds, radiative rates, and Auger widths in Fe ix - Fe xvi

    Authors: J. Deprince, M. A. Bautista, S. Fritzsche, J. A. Garcia, T. R. Kallman, C. Mendoza, P. Palmeri, P. Quinet

    Abstract: Aims. In the context of black-hole accretion disks, we aim to compute the plasma-environment effects on the atomic parameters used to model the decay of K-vacancy states in moderately charged iron ions, namely Fe ix - Fe xvi. Methods. We used the fully relativistic multiconfiguration Dirac-Fock (MCDF) method approximating the plasma electron-nucleus and electron-electron screenings with a time-ave… ▽ More

    Submitted 31 January, 2020; originally announced January 2020.

    Comments: 6 pages, 5 figures, to be published in A&A

    Journal ref: A&A 635, A70 (2020)

  50. arXiv:1905.05895  [pdf, other

    cs.LG cs.CV stat.ML

    Addressing the Loss-Metric Mismatch with Adaptive Loss Alignment

    Authors: Chen Huang, Shuangfei Zhai, Walter Talbott, Miguel Angel Bautista, Shih-Yu Sun, Carlos Guestrin, Josh Susskind

    Abstract: In most machine learning training paradigms a fixed, often handcrafted, loss function is assumed to be a good proxy for an underlying evaluation metric. In this work we assess this assumption by meta-learning an adaptive loss function to directly optimize the evaluation metric. We propose a sample efficient reinforcement learning approach for adapting the loss dynamically during training. We empir… ▽ More

    Submitted 14 May, 2019; originally announced May 2019.

    Comments: Accepted to ICML 2019