Skip to main content

Showing 1–19 of 19 results for author: Zenati, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.28681  [pdf, ps, other

    stat.ML cs.LG

    Functional Natural Policy Gradients

    Authors: Aurelien Bibaut, Houssam Zenati, Thibaud Rahier, Nathan Kallus

    Abstract: We propose a cross-fitted debiasing device for policy learning from offline data. A key consequence of the resulting learning principle is $\sqrt N$ regret even for policy classes with complexity greater than Donsker, provided a product-of-errors nuisance remainder is $O(N^{-1/2})$. The regret bound factors into a plug-in policy error factor governed by policy-class complexity and an environment n… ▽ More

    Submitted 3 April, 2026; v1 submitted 30 March, 2026; originally announced March 2026.

  2. arXiv:2602.21478  [pdf, ps, other

    stat.ML cs.LG math.ST stat.ME

    Efficient Inference after Directionally Stable Adaptive Experiments

    Authors: Zikai Shen, Houssam Zenati, Nathan Kallus, Arthur Gretton, Koulik Khamaru, Aurélien Bibaut

    Abstract: We study inference on scalar-valued pathwise differentiable targets after adaptive data collection, such as a bandit algorithm. We introduce a novel target-specific condition, directional stability, which is strictly weaker than previously imposed target-agnostic stability conditions. Under directional stability, we show that estimators that would have been efficient under i.i.d. data remain asymp… ▽ More

    Submitted 24 February, 2026; originally announced February 2026.

    Comments: 34 pages

  3. arXiv:2510.15483  [pdf, ps, other

    stat.ML cs.LG

    Fast Best-in-Class Regret for Contextual Bandits

    Authors: Samuel Girard, Aurelien Bibaut, Arthur Gretton, Nathan Kallus, Houssam Zenati

    Abstract: We study the problem of stochastic contextual bandits in the agnostic setting, where the goal is to compete with the best policy in a given class without assuming realizability or imposing model restrictions on losses or rewards. In this work, we establish the first fast rate for regret relative to the best-in-class policy. Our proposed algorithm updates the policy at every round by minimizing a p… ▽ More

    Submitted 3 April, 2026; v1 submitted 17 October, 2025; originally announced October 2025.

  4. arXiv:2510.10245  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Kernel Treatment Effects with Adaptively Collected Data

    Authors: Houssam Zenati, Bariscan Bozkurt, Arthur Gretton

    Abstract: Adaptive experiments improve efficiency by adjusting treatment assignments based on past outcomes, but this adaptivity breaks the i.i.d. assumptions that underpins classical asymptotics. At the same time, many questions of interest are distributional, extending beyond average effects. Kernel treatment effects (KTE) provide a flexible framework by representing counterfactual outcome distributions i… ▽ More

    Submitted 11 October, 2025; originally announced October 2025.

  5. arXiv:2506.02793  [pdf, ps, other

    stat.ML cs.LG

    Doubly-Robust Estimation of Counterfactual Policy Mean Embeddings

    Authors: Houssam Zenati, Bariscan Bozkurt, Arthur Gretton

    Abstract: Estimating the distribution of outcomes under counterfactual policies is critical for decision-making in domains such as recommendation, advertising, and healthcare. We propose and analyze a novel framework-Counterfactual Policy Mean Embedding (CPME)-that represents the entire counterfactual outcome distribution in a reproducing kernel Hilbert space (RKHS), enabling flexible and nonparametric dist… ▽ More

    Submitted 27 October, 2025; v1 submitted 3 June, 2025; originally announced June 2025.

  6. arXiv:2505.19807  [pdf, ps, other

    cs.LG stat.ML

    Density Ratio-Free Doubly Robust Proxy Causal Learning

    Authors: Bariscan Bozkurt, Houssam Zenati, Dimitri Meunier, Liyuan Xu, Arthur Gretton

    Abstract: We study the problem of causal function estimation in the Proxy Causal Learning (PCL) framework, where confounders are not observed but proxies for the confounders are available. Two main approaches have been proposed: outcome bridge-based and treatment bridge-based methods. In this work, we propose two kernel-based doubly robust estimators that combine the strengths of both approaches, and natura… ▽ More

    Submitted 26 March, 2026; v1 submitted 26 May, 2025; originally announced May 2025.

    Comments: Neurips published version

  7. arXiv:2503.06156  [pdf, other

    stat.ML cs.LG

    Double Debiased Machine Learning for Mediation Analysis with Continuous Treatments

    Authors: Houssam Zenati, Judith Abécassis, Julie Josse, Bertrand Thirion

    Abstract: Uncovering causal mediation effects is of significant value to practitioners seeking to isolate the direct treatment effect from the potential mediated effect. We propose a double machine learning (DML) algorithm for mediation analysis that supports continuous treatments. To estimate the target mediated response curve, our method uses a kernel-based doubly robust moment function for which we prove… ▽ More

    Submitted 8 March, 2025; originally announced March 2025.

    Comments: To appear in AISTATS 2025

  8. arXiv:2402.15171  [pdf, other

    cs.LG math.ST stat.ML

    Towards Efficient and Optimal Covariance-Adaptive Algorithms for Combinatorial Semi-Bandits

    Authors: Julien Zhou, Pierre Gaillard, Thibaud Rahier, Houssam Zenati, Julyan Arbel

    Abstract: We address the problem of stochastic combinatorial semi-bandits, where a player selects among P actions from the power set of a set containing d base items. Adaptivity to the problem's structure is essential in order to obtain optimal regret upper bounds. As estimating the coefficients of a covariance matrix can be manageable in practice, leveraging them should improve the regret. We design "optim… ▽ More

    Submitted 15 November, 2024; v1 submitted 23 February, 2024; originally announced February 2024.

  9. arXiv:2302.12120  [pdf, other

    cs.LG

    Sequential Counterfactual Risk Minimization

    Authors: Houssam Zenati, Eustache Diemert, Matthieu Martin, Julien Mairal, Pierre Gaillard

    Abstract: Counterfactual Risk Minimization (CRM) is a framework for dealing with the logged bandit feedback problem, where the goal is to improve a logging policy using offline data. In this paper, we explore the case where it is possible to deploy learned policies multiple times and acquire new data. We extend the CRM principle and its theory to this scenario, which we call "Sequential Counterfactual Risk… ▽ More

    Submitted 25 May, 2023; v1 submitted 23 February, 2023; originally announced February 2023.

    Comments: To appear at ICML23

  10. arXiv:2206.09348  [pdf, other

    cs.LG cs.GT math.OC

    Nested bandits

    Authors: Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier, Houssam Zenati

    Abstract: In many online decision processes, the optimizing agent is called to choose between large numbers of alternatives with many inherent similarities; in turn, these similarities imply closely correlated losses that may confound standard discrete choice models and bandit algorithms. We study this question in the context of nested bandits, a class of adversarial multi-armed bandit problems where the le… ▽ More

    Submitted 19 June, 2022; originally announced June 2022.

    Comments: 35 pages, 14 figures; to appear in ICML 2022

    MSC Class: Primary 68Q32; secondary 91B06

  11. arXiv:2202.05638  [pdf, other

    cs.LG

    Efficient Kernel UCB for Contextual Bandits

    Authors: Houssam Zenati, Alberto Bietti, Eustache Diemert, Julien Mairal, Matthieu Martin, Pierre Gaillard

    Abstract: In this paper, we tackle the computational efficiency of kernelized UCB algorithms in contextual bandits. While standard methods require a O(CT^3) complexity where T is the horizon and the constant C is related to optimizing the UCB rule, we propose an efficient contextual algorithm for large-scale problems. Specifically, our method relies on incremental Nystrom approximations of the joint kernel… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: To appear at AISTATS2022

  12. arXiv:2004.11722  [pdf, other

    stat.ML cs.LG

    Counterfactual Learning of Stochastic Policies with Continuous Actions

    Authors: Houssam Zenati, Alberto Bietti, Matthieu Martin, Eustache Diemert, Pierre Gaillard, Julien Mairal

    Abstract: Counterfactual reasoning from logged data has become increasingly important for many applications such as web advertising or healthcare. In this paper, we address the problem of learning stochastic policies with continuous actions from the viewpoint of counterfactual risk minimization (CRM). While the CRM framework is appealing and well studied for discrete actions, the continuous action case rais… ▽ More

    Submitted 21 February, 2025; v1 submitted 22 April, 2020; originally announced April 2020.

  13. arXiv:2003.09404  [pdf, other

    eess.IV cs.CV

    RGB-Topography and X-rays Image Registration for Idiopathic Scoliosis Children Patient Follow-up

    Authors: Insaf Setitra, Noureddine Aouaa, Abdelkrim Meziane, Afef Benrabia, Houria Kaced, Hanene Belabassi, Sara Ait Ziane, Nadia Henda Zenati, Oualid Djekkoune

    Abstract: Children diagnosed with a scoliosis pathology are exposed during their follow up to ionic radiations in each X-rays diagnosis. This exposure can have negative effects on the patient's health and cause diseases in the adult age. In order to reduce X-rays scanning, recent systems provide diagnosis of scoliosis patients using solely RGB images. The output of such systems is a set of augmented images… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

  14. arXiv:1812.07832  [pdf, other

    cs.CV

    Semi-Supervised Deep Learning for Abnormality Classification in Retinal Images

    Authors: Bruno Lecouat, Ken Chang, Chuan-Sheng Foo, Balagopal Unnikrishnan, James M. Brown, Houssam Zenati, Andrew Beers, Vijay Chandrasekhar, Jayashree Kalpathy-Cramer, Pavitra Krishnaswamy

    Abstract: Supervised deep learning algorithms have enabled significant performance gains in medical image classification tasks. But these methods rely on large labeled datasets that require resource-intensive expert annotation. Semi-supervised generative adversarial network (GAN) approaches offer a means to learn from limited labeled data alongside larger unlabeled datasets, but have not been applied to dis… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/227

  15. arXiv:1812.02288  [pdf, other

    cs.LG stat.ML

    Adversarially Learned Anomaly Detection

    Authors: Houssam Zenati, Manon Romain, Chuan Sheng Foo, Bruno Lecouat, Vijay Ramaseshan Chandrasekhar

    Abstract: Anomaly detection is a significant and hence well-studied problem. However, developing effective anomaly detection methods for complex and high-dimensional data remains a challenge. As Generative Adversarial Networks (GANs) are able to model the complex high-dimensional distributions of real-world data, they offer a promising approach to address this challenge. In this work, we propose an anomaly… ▽ More

    Submitted 5 December, 2018; originally announced December 2018.

    Comments: In the Proceedings of the 20th IEEE International Conference on Data Mining (ICDM), 2018

  16. arXiv:1807.04307  [pdf, other

    cs.LG stat.ML

    Manifold regularization with GANs for semi-supervised learning

    Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay Chandrasekhar

    Abstract: Generative Adversarial Networks are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating a variant of the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the semi-supervised feature-matching GAN we achieve state-of-the-art results… ▽ More

    Submitted 11 July, 2018; originally announced July 2018.

  17. arXiv:1807.02629  [pdf, other

    cs.LG cs.GT math.OC stat.ML

    Optimistic mirror descent in saddle-point problems: Going the extra (gradient) mile

    Authors: Panayotis Mertikopoulos, Bruno Lecouat, Houssam Zenati, Chuan-Sheng Foo, Vijay Chandrasekhar, Georgios Piliouras

    Abstract: Owing to their connection with generative adversarial networks (GANs), saddle-point problems have recently attracted considerable interest in machine learning and beyond. By necessity, most theoretical guarantees revolve around convex-concave (or even linear) problems; however, making theoretical inroads towards efficient GAN training depends crucially on moving beyond this classic framework. To m… ▽ More

    Submitted 1 October, 2018; v1 submitted 7 July, 2018; originally announced July 2018.

    Comments: 26 pages, 14 figures

  18. arXiv:1805.08957  [pdf, other

    cs.LG stat.ML

    Semi-Supervised Learning with GANs: Revisiting Manifold Regularization

    Authors: Bruno Lecouat, Chuan-Sheng Foo, Houssam Zenati, Vijay R. Chandrasekhar

    Abstract: GANS are powerful generative models that are able to model the manifold of natural images. We leverage this property to perform manifold regularization by approximating the Laplacian norm using a Monte Carlo approximation that is easily computed with the GAN. When incorporated into the feature-matching GAN of Improved GAN, we achieve state-of-the-art results for GAN-based semi-supervised learning… ▽ More

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: Accepted paper

    Journal ref: Workshop track - ICLR 2018

  19. arXiv:1802.06222  [pdf, ps, other

    cs.LG stat.ML

    Efficient GAN-Based Anomaly Detection

    Authors: Houssam Zenati, Chuan Sheng Foo, Bruno Lecouat, Gaurav Manek, Vijay Ramaseshan Chandrasekhar

    Abstract: Generative adversarial networks (GANs) are able to model the complex highdimensional distributions of real-world data, which suggests they could be effective for anomaly detection. However, few works have explored the use of GANs for the anomaly detection task. We leverage recently developed GAN models for anomaly detection, and achieve state-of-the-art performance on image and network intrusion d… ▽ More

    Submitted 1 May, 2019; v1 submitted 17 February, 2018; originally announced February 2018.

    Comments: Updated version of this work is published at ICDM 2018, see arXiv:1812.02288 . Submitted to the ICLR Workshop 2018