Skip to main content

Showing 1–6 of 6 results for author: Jones, H T

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.01570  [pdf, ps, other

    cs.DB cs.LG

    Adversarial Query Synthesis via Bayesian Optimization

    Authors: Jeffrey Tao, Yimeng Zeng, Haydn Thomas Jones, Natalie Maus, Osbert Bastani, Jacob R. Gardner, Ryan Marcus

    Abstract: Benchmark workloads are extremely important to the database management research community, especially as more machine learning components are integrated into database systems. Here, we propose a Bayesian optimization technique to automatically search for difficult benchmark queries, significantly reducing the amount of manual effort usually required. In preliminary experiments, we show that our ap… ▽ More

    Submitted 2 March, 2026; originally announced March 2026.

  2. arXiv:2601.22382  [pdf, ps, other

    cs.LG

    Purely Agentic Black-Box Optimization for Biological Design

    Authors: Natalie Maus, Yimeng Zeng, Haydn Thomas Jones, Yining Huang, Gaurav Ng Goel, Alden Rose, Kyurae Kim, Hyun-Su Lee, Marcelo Der Torossian Torres, Fangping Wan, Cesar de la Fuente-Nunez, Mark Yatskar, Osbert Bastani, Jacob R. Gardner

    Abstract: Many key challenges in biological design-such as small-molecule drug discovery, antimicrobial peptide development, and protein engineering-can be framed as black-box optimization over vast, complex structured spaces. Existing methods rely mainly on raw structural data and struggle to exploit the rich scientific literature. While large language models (LLMs) have been added to these pipelines, they… ▽ More

    Submitted 29 January, 2026; originally announced January 2026.

  3. arXiv:2508.10899  [pdf, ps, other

    cs.LG

    A Dataset for Distilling Knowledge Priors from Literature for Therapeutic Design

    Authors: Haydn Thomas Jones, Natalie Maus, Josh Magnus Ludan, Maggie Ziyu Huan, Jiaming Liang, Marcelo Der Torossian Torres, Jiatao Liang, Zachary Ives, Yoseph Barash, Cesar de la Fuente-Nunez, Jacob R. Gardner, Mark Yatskar

    Abstract: AI-driven discovery can greatly reduce design time and enhance new therapeutics' effectiveness. Models using simulators explore broad design spaces but risk violating implicit constraints due to a lack of experimental priors. For example, in a new analysis we performed on a diverse set of models on the GuacaMol benchmark using supervised classifiers, over 60\% of molecules proposed had high probab… ▽ More

    Submitted 11 September, 2025; v1 submitted 14 August, 2025; originally announced August 2025.

  4. arXiv:2503.08131  [pdf, ps, other

    cs.LG

    Large Scale Multi-Task Bayesian Optimization with Large Language Models

    Authors: Yimeng Zeng, Natalie Maus, Haydn Thomas Jones, Jeffrey Tao, Fangping Wan, Marcelo Der Torossian Torres, Cesar de la Fuente-Nunez, Ryan Marcus, Osbert Bastani, Jacob R. Gardner

    Abstract: In multi-task Bayesian optimization, the goal is to leverage experience from optimizing existing tasks to improve the efficiency of optimizing new ones. While approaches using multi-task Gaussian processes or deep kernel transfer exist, the performance improvement is marginal when scaling beyond a moderate number of tasks. We introduce a novel approach leveraging large language models (LLMs) to le… ▽ More

    Submitted 12 June, 2025; v1 submitted 11 March, 2025; originally announced March 2025.

  5. arXiv:2501.19342  [pdf, ps, other

    cs.LG

    Covering Multiple Objectives with a Small Set of Solutions Using Bayesian Optimization

    Authors: Natalie Maus, Kyurae Kim, Yimeng Zeng, Haydn Thomas Jones, Fangping Wan, Marcelo Der Torossian Torres, Cesar de la Fuente-Nunez, Jacob R. Gardner

    Abstract: In multi-objective black-box optimization, the goal is typically to find solutions that optimize a set of $T$ black-box objective functions, $f_1, \ldots f_T$, simultaneously. Traditional approaches often seek a single Pareto-optimal set that balances trade-offs among all objectives. In contrast, we consider a problem setting that departs from this paradigm: finding a small set of $K < T$ solution… ▽ More

    Submitted 27 October, 2025; v1 submitted 31 January, 2025; originally announced January 2025.

  6. arXiv:2201.11872  [pdf, other

    cs.LG stat.ML

    Local Latent Space Bayesian Optimization over Structured Inputs

    Authors: Natalie Maus, Haydn T. Jones, Juston S. Moore, Matt J. Kusner, John Bradshaw, Jacob R. Gardner

    Abstract: Bayesian optimization over the latent spaces of deep autoencoder models (DAEs) has recently emerged as a promising new approach for optimizing challenging black-box functions over structured, discrete, hard-to-enumerate search spaces (e.g., molecules). Here the DAE dramatically simplifies the search space by mapping inputs into a continuous latent space where familiar Bayesian optimization tools c… ▽ More

    Submitted 22 February, 2023; v1 submitted 27 January, 2022; originally announced January 2022.