Skip to main content

Showing 1–37 of 37 results for author: Ahmed, O

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.07924  [pdf

    cs.LG cs.CY

    Semantic Risk Scoring of Aggregated Metrics: An AI-Driven Approach for Healthcare Data Governance

    Authors: Mohammed Omer Shakeel Ahmed

    Abstract: Large healthcare institutions typically operate multiple business intelligence (BI) teams segmented by domain, including clinical performance, fundraising, operations, and compliance. Due to HIPAA, FERPA, and IRB restrictions, these teams face challenges in sharing patient-level data needed for analytics. To mitigate this, A metric aggregation table is proposed, which is a precomputed, privacy-com… ▽ More

    Submitted 8 March, 2026; originally announced March 2026.

    Comments: 6 pages, 3 figures, 1 Table, Accepted for publication in the 21st Int. Conference on Data Science (ICDATA 25)

  2. A Late-Fusion Multimodal AI Framework for Privacy-Preserving Deduplication in National Healthcare Data Environments

    Authors: Mohammed Omer Shakeel Ahmed

    Abstract: Duplicate records pose significant challenges in customer relationship management (CRM)and healthcare, often leading to inaccuracies in analytics, impaired user experiences, and compliance risks. Traditional deduplication methods rely heavily on direct identifiers such as names, emails, or Social Security Numbers (SSNs), making them ineffective under strict privacy regulations like GDPR and HIPAA,… ▽ More

    Submitted 4 March, 2026; originally announced March 2026.

    Comments: 6 pages, 1 figure, 1 table. Accepted for publication in the 2025 IEEE International Conference on Future Machine Learning and Data Science (FMLDS)

    Journal ref: 2025 IEEE International Conference on Future Machine Learning and Data Science (FMLDS)

  3. arXiv:2601.04690  [pdf, ps, other

    cs.LG

    Do LLMs Benefit from User and Item Embeddings in Recommendation Tasks?

    Authors: Mir Rayat Imtiaz Hossain, Leo Feng, Leonid Sigal, Mohamed Osama Ahmed

    Abstract: Large Language Models (LLMs) have emerged as promising recommendation systems, offering novel ways to model user preferences through generative approaches. However, many existing methods often rely solely on text semantics or incorporate collaborative signals in a limited manner, typically using only user or item embeddings. These methods struggle to handle multiple item embeddings representing us… ▽ More

    Submitted 8 January, 2026; originally announced January 2026.

    Comments: Presented in Multimodal Algorithmic Reasoning Workshop at NeurIPS 2025

  4. arXiv:2511.04831  [pdf, ps, other

    cs.RO cs.AI

    Isaac Lab: A GPU-Accelerated Simulation Framework for Multi-Modal Robot Learning

    Authors: NVIDIA, :, Mayank Mittal, Pascal Roth, James Tigue, Antoine Richard, Octi Zhang, Peter Du, Antonio Serrano-Muñoz, Xinjie Yao, René Zurbrügg, Nikita Rudin, Lukasz Wawrzyniak, Milad Rakhsha, Alain Denzler, Eric Heiden, Ales Borovicka, Ossama Ahmed, Iretiayo Akinola, Abrar Anwar, Mark T. Carlson, Ji Yuan Feng, Animesh Garg, Renato Gasoto, Lionel Gulich , et al. (82 additional authors not shown)

    Abstract: We present Isaac Lab, the natural successor to Isaac Gym, which extends the paradigm of GPU-native robotics simulation into the era of large-scale multi-modal learning. Isaac Lab combines high-fidelity GPU parallel physics, photorealistic rendering, and a modular, composable architecture for designing environments and training robot policies. Beyond physics and rendering, the framework integrates… ▽ More

    Submitted 6 November, 2025; originally announced November 2025.

    Comments: Code and documentation are available here: https://github.com/isaac-sim/IsaacLab

  5. arXiv:2510.13634  [pdf, ps, other

    cs.LG cs.ET

    Multivariate Time Series Forecasting with Gate-Based Quantum Reservoir Computing on NISQ Hardware

    Authors: Wissal Hamhoum, Soumaya Cherkaoui, Jean-Frederic Laprade, Ola Ahmed, Shengrui Wang

    Abstract: Quantum reservoir computing (QRC) offers a hardware-friendly approach to temporal learning, yet most studies target univariate signals and overlook near-term hardware constraints. This work introduces a gate-based QRC for multivariate time series (MTS-QRC) that pairs injection and memory qubits and uses a Trotterized nearest-neighbor transverse-field Ising evolution optimized for current device co… ▽ More

    Submitted 15 October, 2025; originally announced October 2025.

  6. arXiv:2509.25448  [pdf, ps, other

    cs.CR cs.CL

    Fingerprinting LLMs via Prompt Injection

    Authors: Yuepeng Hu, Zhengyuan Jiang, Mengyuan Li, Osama Ahmed, Zhicong Huang, Cheng Hong, Neil Gong

    Abstract: Large language models (LLMs) are often modified after release through post-processing such as post-training or quantization, which makes it challenging to determine whether one model is derived from another. Existing provenance detection methods have two main limitations: (1) they embed signals into the base model before release, which is infeasible for already published models, or (2) they compar… ▽ More

    Submitted 1 October, 2025; v1 submitted 29 September, 2025; originally announced September 2025.

  7. arXiv:2506.22335  [pdf, ps, other

    quant-ph cs.LG nlin.CD

    Robust quantum reservoir computers for forecasting chaotic dynamics: generalized synchronization and stability

    Authors: Osama Ahmed, Felix Tennie, Luca Magri

    Abstract: We show that recurrent quantum reservoir computers (QRCs) and their recurrence-free architectures (RF-QRCs) are robust tools for learning and forecasting chaotic dynamics from time-series data. First, we formulate and interpret quantum reservoir computers as coupled dynamical systems, where the reservoir acts as a response system driven by training data; in other words, quantum reservoir computers… ▽ More

    Submitted 27 June, 2025; originally announced June 2025.

    Comments: 28 pages, 12 figures

  8. A Systematic Review of Open Datasets Used in Text-to-Image (T2I) Gen AI Model Safety

    Authors: Rakeen Rouf, Trupti Bavalatti, Osama Ahmed, Dhaval Potdar, Faraz Jawed

    Abstract: Novel research aimed at text-to-image (T2I) generative AI safety often relies on publicly available datasets for training and evaluation, making the quality and composition of these datasets crucial. This paper presents a comprehensive review of the key datasets used in the T2I research, detailing their collection methods, compositions, semantic and syntactic diversity of prompts and the quality,… ▽ More

    Submitted 22 February, 2025; originally announced March 2025.

    Comments: Accepted for publication in IEEE Access, DOI: 10.1109/ACCESS.2025.3539933

    Journal ref: IEEE Access 2025

  9. Multi-objective Cat Swarm Optimization Algorithm based on a Grid System

    Authors: Aram M. Ahmed, Bryar A. Hassan, Tarik A. Rashid, Kaniaw A. Noori, Soran Ab. M. Saeed, Omed H. Ahmed, Shahla U. Umar

    Abstract: This paper presents a multi-objective version of the Cat Swarm Optimization Algorithm called the Grid-based Multi-objective Cat Swarm Optimization Algorithm (GMOCSO). Convergence and diversity preservation are the two main goals pursued by modern multi-objective algorithms to yield robust results. To achieve these goals, we first replace the roulette wheel method of the original CSO algorithm with… ▽ More

    Submitted 22 February, 2025; originally announced February 2025.

  10. arXiv:2411.13205  [pdf, ps, other

    cs.RO cs.CV

    An Integrated Approach to Robotic Object Grasping and Manipulation

    Authors: Owais Ahmed, M Huzaifa, M Areeb, Hamza Ali Khan

    Abstract: In response to the growing challenges of manual labor and efficiency in warehouse operations, Amazon has embarked on a significant transformation by incorporating robotics to assist with various tasks. While a substantial number of robots have been successfully deployed for tasks such as item transportation within warehouses, the complex process of object picking from shelves remains a significant… ▽ More

    Submitted 29 July, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

  11. arXiv:2410.01201  [pdf, other

    cs.LG cs.AI

    Were RNNs All We Needed?

    Authors: Leo Feng, Frederick Tung, Mohamed Osama Ahmed, Yoshua Bengio, Hossein Hajimirsadeghi

    Abstract: The introduction of Transformers in 2017 reshaped the landscape of deep learning. Originally proposed for sequence modelling, Transformers have since achieved widespread success across various domains. However, the scalability limitations of Transformers - particularly with respect to sequence length - have sparked renewed interest in novel recurrent models that are parallelizable during training,… ▽ More

    Submitted 28 November, 2024; v1 submitted 1 October, 2024; originally announced October 2024.

  12. arXiv:2409.01394  [pdf, other

    quant-ph cs.LG nlin.CD

    Optimal training of finitely-sampled quantum reservoir computers for forecasting of chaotic dynamics

    Authors: Osama Ahmed, Felix Tennie, Luca Magri

    Abstract: In the current Noisy Intermediate Scale Quantum (NISQ) era, the presence of noise deteriorates the performance of quantum computing algorithms. Quantum Reservoir Computing (QRC) is a type of Quantum Machine Learning algorithm, which, however, can benefit from different types of tuned noise. In this paper, we analyse the effect that finite-sampling noise has on the chaotic time-series prediction ca… ▽ More

    Submitted 2 September, 2024; originally announced September 2024.

    Comments: 11 pages, 14 figures

  13. arXiv:2405.13956  [pdf, other

    cs.LG

    Attention as an RNN

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Mohamed Osama Ahmed, Yoshua Bengio, Greg Mori

    Abstract: The advent of Transformers marked a significant breakthrough in sequence modelling, providing a highly performant architecture capable of leveraging GPU parallelism. However, Transformers are computationally expensive at inference time, limiting their applications, particularly in low-resource settings (e.g., mobile and embedded devices). Addressing this, we (1) begin by showing that attention can… ▽ More

    Submitted 28 May, 2024; v1 submitted 22 May, 2024; originally announced May 2024.

  14. arXiv:2402.06935  [pdf, other

    cs.DS q-bio.GN q-bio.PE

    Taxonomic classification with maximal exact matches in KATKA kernels and minimizer digests

    Authors: Dominika Draesslerová, Omar Ahmed, Travis Gagie, Jan Holub, Ben Langmead, Giovanni Manzini, Gonzalo Navarro

    Abstract: For taxonomic classification, we are asked to index the genomes in a phylogenetic tree such that later, given a DNA read, we can quickly choose a small subtree likely to contain the genome from which that read was drawn. Although popular classifiers such as Kraken use $k$-mers, recent research indicates that using maximal exact matches (MEMs) can lead to better classifications. For example, we can… ▽ More

    Submitted 4 April, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

  15. Adaptive Prognostic Malfunction Based Processor for Autonomous Landing Guidance Assistance System Using FPGA

    Authors: Hossam O. Ahmed, David Wyatt

    Abstract: The demand for more developed and agile urban taxi drones is increasing rapidly nowadays to sustain crowded cities and their traffic issues. The critical factor for spreading such technology could be related to the safety criteria that must be considered. One of the most critical safety aspects for such vertical and/or Short Take-Off and Landing (V/STOL) drones is related to safety during the land… ▽ More

    Submitted 13 January, 2024; originally announced January 2024.

    Comments: Published in: IEEE Access ( Volume: 12) - Page(s): 2113 - 2122

  16. arXiv:2311.02891  [pdf, other

    cs.LG

    AdaFlood: Adaptive Flood Regularization

    Authors: Wonho Bae, Yi Ren, Mohamad Osama Ahmed, Frederick Tung, Danica J. Sutherland, Gabriel L. Oliveira

    Abstract: Although neural networks are conventionally optimized towards zero training loss, it has been recently learned that targeting a non-zero training loss threshold, referred to as a flood level, often enables better test time generalization. Current approaches, however, apply the same constant flood level to all training samples, which inherently assumes all the samples have the same difficulty. We p… ▽ More

    Submitted 6 November, 2023; originally announced November 2023.

  17. arXiv:2309.17388  [pdf, other

    cs.LG

    Tree Cross Attention

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Cross Attention is a popular method for retrieving information from a set of context tokens for making predictions. At inference time, for each prediction, Cross Attention scans the full set of $\mathcal{O}(N)$ tokens. In practice, however, often only a small subset of tokens are required for good performance. Methods such as Perceiver IO are cheap at inference as they distill the information to a… ▽ More

    Submitted 1 March, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

    Comments: Accepted by ICLR 2024

  18. arXiv:2309.03544  [pdf, other

    cs.SD cs.LG eess.AS

    MVD:A Novel Methodology and Dataset for Acoustic Vehicle Type Classification

    Authors: Mohd Ashhad, Omar Ahmed, Sooraj K. Ambat, Zeeshan Ali Haq, Mansaf Alam

    Abstract: Rising urban populations have led to a surge in vehicle use and made traffic monitoring and management indispensable. Acoustic traffic monitoring (ATM) offers a cost-effective and efficient alternative to more computationally expensive methods of monitoring traffic such as those involving computer vision technologies. In this paper, we present MVD and MVDA: two open datasets for the development of… ▽ More

    Submitted 7 September, 2023; originally announced September 2023.

  19. ANER: Arabic and Arabizi Named Entity Recognition using Transformer-Based Approach

    Authors: Abdelrahman "Boda" Sadallah, Omar Ahmed, Shimaa Mohamed, Omar Hatem, Doaa Hesham, Ahmed H. Yousef

    Abstract: One of the main tasks of Natural Language Processing (NLP), is Named Entity Recognition (NER). It is used in many applications and also can be used as an intermediate step for other tasks. We present ANER, a web-based named entity recognizer for the Arabic, and Arabizi languages. The model is built upon BERT, which is a transformer-based encoder. It can recognize 50 different entity classes, cover… ▽ More

    Submitted 28 August, 2023; originally announced August 2023.

  20. arXiv:2306.12599  [pdf, other

    cs.LG

    Constant Memory Attention Block

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Modern foundation model architectures rely on attention mechanisms to effectively capture context. However, these methods require linear or quadratic memory in terms of the number of inputs/datapoints, limiting their applicability in low-compute domains. In this work, we propose Constant Memory Attention Block (CMAB), a novel general-purpose attention block that computes its output in constant mem… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: Workshop version of arXiv:2305.14567

  21. Segregated FLS Processing Cores for V/STOL Autonomous Landing Guidance Assistant System using FPGA

    Authors: Hossam O. Ahmed

    Abstract: It is highly predicted that the roads and parking areas will be extremely congested with vehicles to the point that searching for a novel solution will not be an optional choice for conserving the sustainability rate of the overall humanity's development growth. Such issue could be overcome by developing modified generations of the Urban Air Mobility (UAM) vehicles that essentially depend on the V… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 2021 Integrated Communications Navigation and Surveillance Conference (ICNS), Dulles, VA, USA

  22. arXiv:2305.14567  [pdf, other

    cs.LG cs.CV

    Memory Efficient Neural Processes via Constant Memory Attention Block

    Authors: Leo Feng, Frederick Tung, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Neural Processes (NPs) are popular meta-learning methods for efficiently modelling predictive uncertainty. Recent state-of-the-art methods, however, leverage expensive attention mechanisms, limiting their applications, particularly in low-resource settings. In this work, we propose Constant Memory Attentive Neural Processes (CMANPs), an NP variant that only requires constant memory. To do so, we f… ▽ More

    Submitted 27 May, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

  23. arXiv:2305.01771  [pdf

    eess.SY cs.DC

    Fault Tolerant Processing Unit Using Gamma Distribution Sliding Window For Autonomous Landing Guidance System

    Authors: Hossam O. Ahmed

    Abstract: To keep up with today's dense metropolitan areas and their accompanying traffic problems, a growing number of towns are looking for more advanced and swift urban taxi drones. The safety parameters that must be taken into consideration may be the most important element in the widespread use of such technology. Most recent aviation mishaps have happened during the landing phase, making this a partic… ▽ More

    Submitted 3 June, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 21st IEEE Interregional NEWCAS Conference, Edinburgh, Scotland

  24. arXiv:2304.02099  [pdf

    eess.SY cs.AR cs.DC eess.SP

    Coarse Grained FLS-based Processor with Prognostic Malfunction Feature for UAM Drones using FPGA

    Authors: Hossam O. Ahmed

    Abstract: Many overall safety factors need to be considered in the next generation of Urban Air Mobility (UAM) systems and addressing these can become the anchor point for such technology to reach consent for worldwide application. On the other hand, fulfilling the safety requirements from an exponential increase of prolific UAM systems, is extremely complicated, and requires careful consideration of a vari… ▽ More

    Submitted 4 April, 2023; originally announced April 2023.

    Comments: The paper is accepted

    Journal ref: The 23rd Integrated Communications, Navigation, and Surveillance Conference, 2023, USA

  25. arXiv:2301.12023  [pdf, other

    cs.LG

    Meta Temporal Point Processes

    Authors: Wonho Bae, Mohamed Osama Ahmed, Frederick Tung, Gabriel L. Oliveira

    Abstract: A temporal point process (TPP) is a stochastic process where its realization is a sequence of discrete events in time. Recent work in TPPs model the process using a neural network in a supervised learning framework, where a training set is a collection of all the sequences. In this work, we propose to train TPPs in a meta learning framework, where each sequence is treated as a different task, via… ▽ More

    Submitted 27 January, 2023; originally announced January 2023.

    Comments: Accepted to ICLR2023

  26. arXiv:2211.10564  [pdf, other

    cs.LG cs.CV

    Gumbel-Softmax Selective Networks

    Authors: Mahmoud Salem, Mohamed Osama Ahmed, Frederick Tung, Gabriel Oliveira

    Abstract: ML models often operate within the context of a larger system that can adapt its response when the ML model is uncertain, such as falling back on safe defaults or a human in the loop. This commonly encountered operational context calls for principled techniques for training ML models with the option to abstain from predicting when uncertain. Selective neural networks are trained with an integrated… ▽ More

    Submitted 18 November, 2022; originally announced November 2022.

  27. arXiv:2211.08458  [pdf, other

    cs.LG cs.AI

    Latent Bottlenecked Attentive Neural Processes

    Authors: Leo Feng, Hossein Hajimirsadeghi, Yoshua Bengio, Mohamed Osama Ahmed

    Abstract: Neural Processes (NPs) are popular methods in meta-learning that can estimate predictive uncertainty on target datapoints by conditioning on a context dataset. Previous state-of-the-art method Transformer Neural Processes (TNPs) achieve strong performance but require quadratic computation with respect to the number of context datapoints, significantly limiting its scalability. Conversely, existing… ▽ More

    Submitted 1 March, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  28. arXiv:2206.09034  [pdf, other

    cs.LG cs.AI cs.CV

    Towards Better Selective Classification

    Authors: Leo Feng, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Amir Abdi

    Abstract: We tackle the problem of Selective Classification where the objective is to achieve the best performance on a predetermined ratio (coverage) of the dataset. Recent state-of-the-art selective methods come with architectural changes either via introducing a separate selection head or an extra abstention logit. In this paper, we challenge the aforementioned methods. The results suggest that the super… ▽ More

    Submitted 1 March, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

  29. arXiv:2205.08247  [pdf, other

    cs.LG cs.AI

    Monotonicity Regularization: Improved Penalties and Novel Applications to Disentangled Representation Learning and Robust Classification

    Authors: Joao Monteiro, Mohamed Osama Ahmed, Hossein Hajimirsadeghi, Greg Mori

    Abstract: We study settings where gradient penalties are used alongside risk minimization with the goal of obtaining predictors satisfying different notions of monotonicity. Specifically, we present two sets of contributions. In the first part of the paper, we show that different choices of penalties define the regions of the input space where the property is observed. As such, previous methods result in mo… ▽ More

    Submitted 17 May, 2022; originally announced May 2022.

    Comments: Accepted to UAI 2022

  30. arXiv:2112.13168  [pdf, other

    q-bio.QM cs.LG

    AI-Bind: Improving Binding Predictions for Novel Protein Targets and Ligands

    Authors: Ayan Chatterjee, Robin Walters, Zohair Shafi, Omair Shafi Ahmed, Michael Sebek, Deisy Gysi, Rose Yu, Tina Eliassi-Rad, Albert-László Barabási, Giulia Menichetti

    Abstract: Identifying novel drug-target interactions (DTI) is a critical and rate limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We first unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortc… ▽ More

    Submitted 9 November, 2022; v1 submitted 24 December, 2021; originally announced December 2021.

    Comments: 83 pages, 26 figures, all references moved to a single section, new results added on AI interpretability, added comparison with MolTrans, added validation using gold standard experimental data

  31. arXiv:2010.04296  [pdf, other

    cs.RO cs.LG stat.ML

    CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

    Authors: Ossama Ahmed, Frederik Träuble, Anirudh Goyal, Alexander Neitz, Yoshua Bengio, Bernhard Schölkopf, Manuel Wüthrich, Stefan Bauer

    Abstract: Despite recent successes of reinforcement learning (RL), it remains a challenge for agents to transfer learned skills to related environments. To facilitate research addressing this problem, we propose CausalWorld, a benchmark for causal structure and transfer learning in a robotic manipulation environment. The environment is a simulation of an open-source robotic platform, hence offering the poss… ▽ More

    Submitted 24 November, 2020; v1 submitted 8 October, 2020; originally announced October 2020.

    Comments: The first two authors contributed equally, the last two authors avised jointly

  32. arXiv:1910.08281  [pdf, other

    cs.LG stat.ML

    Point Process Flows

    Authors: Nazanin Mehrasa, Ruizhi Deng, Mohamed Osama Ahmed, Bo Chang, Jiawei He, Thibaut Durand, Marcus Brubaker, Greg Mori

    Abstract: Event sequences can be modeled by temporal point processes (TPPs) to capture their asynchronous and probabilistic nature. We propose an intensity-free framework that directly models the point process distribution by utilizing normalizing flows. This approach is capable of capturing highly complex temporal distributions and does not rely on restrictive parametric forms. Comparisons with state-of-th… ▽ More

    Submitted 22 December, 2019; v1 submitted 18 October, 2019; originally announced October 2019.

  33. arXiv:1904.03603  [pdf, other

    cs.NE q-bio.NC

    Human Intracranial EEG Quantitative Analysis and Automatic Feature Learning for Epileptic Seizure Prediction

    Authors: Ramy Hussein, Mohamed Osama Ahmed, Rabab Ward, Z. Jane Wang, Levin Kuhlmann, Yi Guo

    Abstract: Objective: The aim of this study is to develop an efficient and reliable epileptic seizure prediction system using intracranial EEG (iEEG) data, especially for people with drug-resistant epilepsy. The prediction procedure should yield accurate results in a fast enough fashion to alert patients of impending seizures. Methods: We quantitatively analyze the human iEEG data to obtain insights into how… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

  34. arXiv:1810.04336  [pdf, other

    cs.LG stat.ML

    Combining Bayesian Optimization and Lipschitz Optimization

    Authors: Mohamed Osama Ahmed, Sharan Vaswani, Mark Schmidt

    Abstract: Bayesian optimization and Lipschitz optimization have developed alternative techniques for optimizing black-box functions. They each exploit a different form of prior about the function. In this work, we explore strategies to combine these techniques for better global optimization. In particular, we propose ways to use the Lipschitz continuity assumption within traditional BO algorithms, which we… ▽ More

    Submitted 28 July, 2020; v1 submitted 9 October, 2018; originally announced October 2018.

  35. arXiv:1511.01942  [pdf, other

    cs.LG math.OC stat.CO stat.ML

    Stop Wasting My Gradients: Practical SVRG

    Authors: Reza Babanezhad, Mohamed Osama Ahmed, Alim Virani, Mark Schmidt, Jakub Konečný, Scott Sallinen

    Abstract: We present and analyze several strategies for improving the performance of stochastic variance-reduced gradient (SVRG) methods. We first show that the convergence rate of these methods can be preserved under a decreasing sequence of errors in the control variate, and use this to derive variants of SVRG that use growing-batch strategies to reduce the number of gradient calculations required in the… ▽ More

    Submitted 5 November, 2015; originally announced November 2015.

  36. arXiv:1504.04406  [pdf, other

    stat.ML cs.LG math.OC stat.CO

    Non-Uniform Stochastic Average Gradient Method for Training Conditional Random Fields

    Authors: Mark Schmidt, Reza Babanezhad, Mohamed Osama Ahmed, Aaron Defazio, Ann Clifton, Anoop Sarkar

    Abstract: We apply stochastic average gradient (SAG) algorithms for training conditional random fields (CRFs). We describe a practical implementation that uses structure in the CRF gradient to reduce the memory requirement of this linearly-convergent stochastic gradient method, propose a non-uniform sampling scheme that substantially improves practical performance, and analyze the rate of convergence of the… ▽ More

    Submitted 16 April, 2015; originally announced April 2015.

    Comments: AI/Stats 2015, 24 pages

  37. arXiv:1203.2394  [pdf, other

    stat.ML cs.LG stat.CO

    Decentralized, Adaptive, Look-Ahead Particle Filtering

    Authors: Mohamed Osama Ahmed, Pouyan T. Bibalan, Nando de Freitas, Simon Fauvel

    Abstract: The decentralized particle filter (DPF) was proposed recently to increase the level of parallelism of particle filtering. Given a decomposition of the state space into two nested sets of variables, the DPF uses a particle filter to sample the first set and then conditions on this sample to generate a set of samples for the second set of variables. The DPF can be understood as a variant of the popu… ▽ More

    Submitted 11 March, 2012; originally announced March 2012.

    Comments: 16 pages, 11 figures, Authorship in alphabetical order