Skip to main content

Showing 1–45 of 45 results for author: Afifi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2512.08564  [pdf, ps, other

    cs.CV

    Modular Neural Image Signal Processing

    Authors: Mahmoud Afifi, Zhongling Wang, Ran Zhang, Michael S. Brown

    Abstract: This paper presents a modular neural image signal processing (ISP) framework that processes raw inputs and renders high-quality display-referred images. Unlike prior neural ISP designs, our method introduces a high degree of modularity, providing full control over multiple intermediate stages of the rendering process.~This modular design not only achieves high rendering accuracy but also improves… ▽ More

    Submitted 9 December, 2025; originally announced December 2025.

  2. arXiv:2512.00912  [pdf, ps, other

    cs.CV cs.AI cs.LG

    ForamDeepSlice: A High-Accuracy Deep Learning Framework for Foraminifera Species Classification from 2D Micro-CT Slices

    Authors: Abdelghafour Halimi, Ali Alibrahim, Didier Barradas-Bautista, Ronell Sicat, Abdulkader M. Afifi

    Abstract: This study presents a comprehensive deep learning pipeline for the automated classification of 12 foraminifera species using 2D micro-CT slices derived from 3D scans. We curated a scientifically rigorous dataset comprising 97 micro-CT scanned specimens across 27 species, selecting 12 species with sufficient representation for robust machine learning. To ensure methodological integrity and prevent… ▽ More

    Submitted 30 November, 2025; originally announced December 2025.

    ACM Class: I.2.10; I.4.6; J.2

  3. arXiv:2509.19624  [pdf, ps, other

    cs.CV

    Raw-JPEG Adapter: Efficient Raw Image Compression with JPEG

    Authors: Mahmoud Afifi, Ran Zhang, Michael S. Brown

    Abstract: Digital cameras digitize scene light into linear raw representations, which the image signal processor (ISP) converts into display-ready outputs. While raw data preserves full sensor information--valuable for editing and vision tasks--formats such as Digital Negative (DNG) require large storage, making them impractical in constrained scenarios. In contrast, JPEG is a widely supported format, offer… ▽ More

    Submitted 29 September, 2025; v1 submitted 23 September, 2025; originally announced September 2025.

  4. arXiv:2507.01342  [pdf, ps, other

    cs.CV

    Learning Camera-Agnostic White-Balance Preferences

    Authors: Luxi Zhao, Mahmoud Afifi, Michael S. Brown

    Abstract: The image signal processor (ISP) pipeline in modern cameras consists of several modules that transform raw sensor data into visually pleasing images in a display color space. Among these, the auto white balance (AWB) module is essential for compensating for scene illumination. However, commercial AWB systems often strive to compute aesthetic white-balance preferences rather than accurate neutral c… ▽ More

    Submitted 14 August, 2025; v1 submitted 2 July, 2025; originally announced July 2025.

  5. arXiv:2504.07959  [pdf, ps, other

    cs.CV

    CCMNet: Leveraging Calibrated Color Correction Matrices for Cross-Camera Color Constancy

    Authors: Dongyoung Kim, Mahmoud Afifi, Dongyun Kim, Michael S. Brown, Seon Joo Kim

    Abstract: Computational color constancy, or white balancing, is a key module in a camera's image signal processor (ISP) that corrects color casts from scene lighting. Because this operation occurs in the camera-specific raw color space, white balance algorithms must adapt to different cameras. This paper introduces a learning-based method for cross-camera color constancy that generalizes to new cameras with… ▽ More

    Submitted 15 December, 2025; v1 submitted 10 April, 2025; originally announced April 2025.

  6. arXiv:2504.05623  [pdf, ps, other

    cs.CV

    Time-Aware Auto White Balance in Mobile Photography

    Authors: Mahmoud Afifi, Luxi Zhao, Abhijith Punnappurath, Mohammed A. Abdelsalam, Ran Zhang, Michael S. Brown

    Abstract: Cameras rely on auto white balance (AWB) to correct undesirable color casts caused by scene illumination and the camera's spectral sensitivity. This is typically achieved using an illuminant estimator that determines the global color cast solely from the color information in the camera's raw sensor image. Mobile devices provide valuable additional metadata-such as capture timestamp and geolocation… ▽ More

    Submitted 25 June, 2025; v1 submitted 7 April, 2025; originally announced April 2025.

  7. arXiv:2503.22026  [pdf, ps, other

    cs.CV eess.IV

    Multispectral Demosaicing via Dual Cameras

    Authors: SaiKiran Tedla, Junyong Lee, Beixuan Yang, Mahmoud Afifi, Michael S. Brown

    Abstract: Multispectral (MS) images capture detailed scene information across a wide range of spectral bands, making them invaluable for applications requiring rich spectral data. Integrating MS imaging into multi camera devices, such as smartphones, has the potential to enhance both spectral applications and RGB image quality. A critical step in processing MS data is demosaicing, which reconstructs color i… ▽ More

    Submitted 25 July, 2025; v1 submitted 27 March, 2025; originally announced March 2025.

    Comments: https://ms-demosaic.github.io/

  8. arXiv:2503.11781  [pdf, other

    cs.CV

    Color Matching Using Hypernetwork-Based Kolmogorov-Arnold Networks

    Authors: Artem Nikonorov, Georgy Perevozchikov, Andrei Korepanov, Nancy Mehta, Mahmoud Afifi, Egor Ershov, Radu Timofte

    Abstract: We present cmKAN, a versatile framework for color matching. Given an input image with colors from a source color distribution, our method effectively and accurately maps these colors to match a target color distribution in both supervised and unsupervised settings. Our framework leverages the spline capabilities of Kolmogorov-Arnold Networks (KANs) to model the color matching between source and ta… ▽ More

    Submitted 14 March, 2025; originally announced March 2025.

  9. arXiv:2412.12403  [pdf, other

    cs.CR

    Characterizing the Networks Sending Enterprise Phishing Emails

    Authors: Elisa Luo, Liane Young, Grant Ho, M. H. Afifi, Marco Schweighauser, Ethan Katz-Bassett, Asaf Cidon

    Abstract: Phishing attacks on enterprise employees present one of the most costly and potent threats to organizations. We explore an understudied facet of enterprise phishing attacks: the email relay infrastructure behind successfully delivered phishing emails. We draw on a dataset spanning one year across thousands of enterprises, billions of emails, and over 800,000 delivered phishing attacks. Our work sh… ▽ More

    Submitted 16 December, 2024; originally announced December 2024.

    Comments: To appear in the proceedings of the Passive and Active Network Measurement (PAM 2025)

  10. arXiv:2410.16151  [pdf, other

    cs.LG cs.AI cs.NE

    Small Contributions, Small Networks: Efficient Neural Network Pruning Based on Relative Importance

    Authors: Mostafa Hussien, Mahmoud Afifi, Kim Khoa Nguyen, Mohamed Cheriet

    Abstract: Recent advancements have scaled neural networks to unprecedented sizes, achieving remarkable performance across a wide range of tasks. However, deploying these large-scale models on resource-constrained devices poses significant challenges due to substantial storage and computational requirements. Neural network pruning has emerged as an effective technique to mitigate these limitations by reducin… ▽ More

    Submitted 21 October, 2024; originally announced October 2024.

  11. arXiv:2405.15668  [pdf, ps, other

    cs.CV

    What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Alec Go

    Abstract: Large language models (LLMs) have been effectively used for many computer vision tasks, including image classification. In this paper, we present a simple yet effective approach for zero-shot image classification using multimodal LLMs. Using multimodal LLMs, we generate comprehensive textual representations from input images. These textual representations are then utilized to generate fixed-dimens… ▽ More

    Submitted 25 June, 2025; v1 submitted 24 May, 2024; originally announced May 2024.

  12. arXiv:2404.10700  [pdf, other

    eess.IV cs.CV cs.LG

    Rawformer: Unpaired Raw-to-Raw Translation for Learnable Camera ISPs

    Authors: Georgy Perevozchikov, Nancy Mehta, Mahmoud Afifi, Radu Timofte

    Abstract: Modern smartphone camera quality heavily relies on the image signal processor (ISP) to enhance captured raw images, utilizing carefully designed modules to produce final output images encoded in a standard color space (e.g., sRGB). Neural-based end-to-end learnable ISPs offer promising advancements, potentially replacing traditional ISPs with their ability to adapt without requiring extensive tuni… ▽ More

    Submitted 15 July, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted by ECCV 2024

    Journal ref: https://eccv.ecva.net/Conferences/2024

  13. arXiv:2403.03111  [pdf, other

    cs.CV cs.AI cs.RO

    Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection

    Authors: Mohamed Afifi, Mohamed ElHelw

    Abstract: Perception is a key element for enabling intelligent autonomous navigation. Understanding the semantics of the surrounding environment and accurate vehicle pose estimation are essential capabilities for autonomous vehicles, including self-driving cars and mobile robots that perform complex tasks. Fast moving platforms like self-driving cars impose a hard challenge for localization and mapping algo… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  14. arXiv:2403.02449  [pdf, other

    cs.CV

    Optimizing Illuminant Estimation in Dual-Exposure HDR Imaging

    Authors: Mahmoud Afifi, Zhenhua Hu, Liang Liang

    Abstract: High dynamic range (HDR) imaging involves capturing a series of frames of the same scene, each with different exposure settings, to broaden the dynamic range of light. This can be achieved through burst capturing or using staggered HDR sensors that capture long and short exposures simultaneously in the camera image signal processor (ISP). Within camera ISP pipeline, illuminant estimation is a cruc… ▽ More

    Submitted 6 April, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

  15. arXiv:2111.07837  [pdf, other

    cs.CV

    Multi-View Motion Synthesis via Applying Rotated Dual-Pixel Blur Kernels

    Authors: Abdullah Abuolaim, Mahmoud Afifi, Michael S. Brown

    Abstract: Portrait mode is widely available on smartphone cameras to provide an enhanced photographic experience. One of the primary effects applied to images captured in portrait mode is a synthetic shallow depth of field (DoF). The synthetic DoF (or bokeh effect) selectively blurs regions in the image to emulate the effect of using a large lens with a wide aperture. In addition, many applications now inco… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

  16. arXiv:2110.08732  [pdf

    cs.CV

    A Deep Learning-based Approach for Real-time Facemask Detection

    Authors: Wadii Boulila, Ayyub Alzahem, Aseel Almoudi, Muhanad Afifi, Ibrahim Alturki, Maha Driss

    Abstract: The COVID-19 pandemic is causing a global health crisis. Public spaces need to be safeguarded from the adverse effects of this pandemic. Wearing a facemask becomes one of the effective protection solutions adopted by many governments. Manual real-time monitoring of facemask wearing for a large group of people is becoming a difficult task. The goal of this paper is to use deep learning (DL), which… ▽ More

    Submitted 17 October, 2021; originally announced October 2021.

  17. arXiv:2109.08750  [pdf, other

    cs.CV

    Auto White-Balance Correction for Mixed-Illuminant Scenes

    Authors: Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown

    Abstract: Auto white balance (AWB) is applied by camera hardware at capture time to remove the color cast caused by the scene illumination. The vast majority of white-balance algorithms assume a single light source illuminates the scene; however, real scenes often have mixed lighting conditions. This paper presents an effective AWB method to deal with such mixed-illuminant scenes. A unique departure from co… ▽ More

    Submitted 7 October, 2021; v1 submitted 17 September, 2021; originally announced September 2021.

    Journal ref: WACV 2021

  18. arXiv:2108.05251  [pdf, other

    cs.CV

    Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning

    Authors: Abdullah Abuolaim, Mahmoud Afifi, Michael S. Brown

    Abstract: Many camera sensors use a dual-pixel (DP) design that operates as a rudimentary light field providing two sub-aperture views of a scene in a single capture. The DP sensor was developed to improve how cameras perform autofocus. Since the DP sensor's introduction, researchers have found additional uses for the DP data, such as depth estimation, reflection removal, and defocus deblurring. We are inte… ▽ More

    Submitted 9 February, 2022; v1 submitted 11 August, 2021; originally announced August 2021.

    Comments: Published in the Winter Conference on Applications of Computer Vision 2022 (WACV'22)

  19. arXiv:2107.13117  [pdf, other

    cs.CV

    Image color correction, enhancement, and editing

    Authors: Mahmoud Afifi

    Abstract: This thesis presents methods and approaches to image color correction, color enhancement, and color editing. To begin, we study the color correction problem from the standpoint of the camera's image signal processor (ISP). A camera's ISP is hardware that applies a series of in-camera image processing and color manipulation steps, many of which are nonlinear in nature, to render the initial sensor… ▽ More

    Submitted 27 July, 2021; originally announced July 2021.

    Comments: PhD dissertation

  20. arXiv:2106.13920  [pdf, other

    cs.CV

    CAMS: Color-Aware Multi-Style Transfer

    Authors: Mahmoud Afifi, Abdullah Abuolaim, Mostafa Hussien, Marcus A. Brubaker, Michael S. Brown

    Abstract: Image style transfer aims to manipulate the appearance of a source image, or "content" image, to share similar texture and colors of a target "style" image. Ideally, the style transfer manipulation should also preserve the semantic content of the source image. A commonly used approach to assist in transferring styles is based on Gram matrix optimization. One problem of Gram matrix-based optimizati… ▽ More

    Submitted 4 September, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  21. arXiv:2106.13883  [pdf, other

    cs.CV eess.IV

    Semi-Supervised Raw-to-Raw Mapping

    Authors: Mahmoud Afifi, Abdullah Abuolaim

    Abstract: The raw-RGB colors of a camera sensor vary due to the spectral sensitivity differences across different sensor makes and models. This paper focuses on the task of mapping between different sensor raw-RGB color spaces. Prior work addressed this problem using a pairwise calibration to achieve accurate color mapping. Although being accurate, this approach is less practical as it requires: (1) capturi… ▽ More

    Submitted 6 September, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

  22. arXiv:2012.07072  [pdf

    cs.CV

    Robust Real-Time Pedestrian Detection on Embedded Devices

    Authors: Mohamed Afifi, Yara Ali, Karim Amer, Mahmoud Shaker, Mohamed Elhelw

    Abstract: Detection of pedestrians on embedded devices, such as those on-board of robots and drones, has many applications including road intersection monitoring, security, crowd monitoring and surveillance, to name a few. However, the problem can be challenging due to continuously-changing camera viewpoint and varying object appearances as well as the need for lightweight algorithms suitable for embedded s… ▽ More

    Submitted 13 December, 2020; originally announced December 2020.

  23. arXiv:2011.11890  [pdf, other

    cs.CV

    Cross-Camera Convolutional Color Constancy

    Authors: Mahmoud Afifi, Jonathan T. Barron, Chloe LeGendre, Yun-Ta Tsai, Francois Bleibel

    Abstract: We present "Cross-Camera Convolutional Color Constancy" (C5), a learning-based method, trained on images from multiple cameras, that accurately estimates a scene's illuminant color from raw images captured by a new camera previously unseen during training. C5 is a hypernetwork-like extension of the convolutional color constancy (CCC) approach: C5 learns to generate the weights of a CCC model that… ▽ More

    Submitted 10 February, 2022; v1 submitted 23 November, 2020; originally announced November 2020.

    Journal ref: ICCV 2021

  24. arXiv:2011.11731  [pdf, other

    cs.CV

    HistoGAN: Controlling Colors of GAN-Generated and Real Images via Color Histograms

    Authors: Mahmoud Afifi, Marcus A. Brubaker, Michael S. Brown

    Abstract: While generative adversarial networks (GANs) can successfully produce high-quality images, they can be challenging to control. Simplifying GAN-based image generation is critical for their adoption in graphic design and artistic work. This goal has led to significant interest in methods that can intuitively control the appearance of images generated by GANs. In this paper, we present HistoGAN, a co… ▽ More

    Submitted 26 March, 2021; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: CVPR 2021

  25. arXiv:2011.04723  [pdf, other

    cs.SI cs.LG

    F-FADE: Frequency Factorization for Anomaly Detection in Edge Streams

    Authors: Yen-Yu Chang, Pan Li, Rok Sosic, M. H. Afifi, Marco Schweighauser, Jure Leskovec

    Abstract: Edge streams are commonly used to capture interactions in dynamic networks, such as email, social, or computer networks. The problem of detecting anomalies or rare events in edge streams has a wide range of applications. However, it presents many challenges due to lack of labels, a highly dynamic nature of interactions, and the entanglement of temporal and structural changes in the network. Curren… ▽ More

    Submitted 5 February, 2021; v1 submitted 9 November, 2020; originally announced November 2020.

    Comments: WSDM 2021

  26. arXiv:2011.01974  [pdf, other

    cs.CV

    Multi Projection Fusion for Real-time Semantic Segmentation of 3D LiDAR Point Clouds

    Authors: Yara Ali Alnaggar, Mohamed Afifi, Karim Amer, Mohamed Elhelw

    Abstract: Semantic segmentation of 3D point cloud data is essential for enhanced high-level perception in autonomous platforms. Furthermore, given the increasing deployment of LiDAR sensors onboard of cars and drones, a special emphasis is also placed on non-computationally intensive algorithms that operate on mobile GPUs. Previous efficient state-of-the-art methods relied on 2D spherical projection of poin… ▽ More

    Submitted 6 November, 2020; v1 submitted 3 November, 2020; originally announced November 2020.

    Comments: Accepted at the 2021 Winter Conference on Applications of Computer Vision (WACV 2021)

  27. arXiv:2009.12798  [pdf, other

    cs.CV eess.IV

    AIM 2020: Scene Relighting and Illumination Estimation Challenge

    Authors: Majed El Helou, Ruofan Zhou, Sabine Süsstrunk, Radu Timofte, Mahmoud Afifi, Michael S. Brown, Kele Xu, Hengxing Cai, Yuzhong Liu, Li-Wen Wang, Zhi-Song Liu, Chu-Tak Li, Sourya Dipta Das, Nisarg A. Shah, Akashdeep Jassal, Tongtong Zhao, Shanshan Zhao, Sabari Nathan, M. Parisa Beham, R. Suganya, Qing Wang, Zhongyun Hu, Xin Huang, Yaning Li, Maitreya Suin , et al. (12 additional authors not shown)

    Abstract: We review the AIM 2020 challenge on virtual image relighting and illumination estimation. This paper presents the novel VIDIT dataset used in the challenge and the different proposed solutions and final evaluation results over the 3 challenge tracks. The first track considered one-to-one relighting; the objective was to relight an input photo of a scene with a different color temperature and illum… ▽ More

    Submitted 27 September, 2020; originally announced September 2020.

    Comments: ECCVW 2020. Data and more information on https://github.com/majedelhelou/VIDIT

  28. arXiv:2009.12632  [pdf, other

    cs.CV

    Interactive White Balancing for Camera-Rendered Images

    Authors: Mahmoud Afifi, Michael S. Brown

    Abstract: White balance (WB) is one of the first photo-finishing steps used to render a captured image to its final output. WB is applied to remove the color cast caused by the scene's illumination. Interactive photo-editing software allows users to manually select different regions in a photo as examples of the illumination for WB correction (e.g., clicking on achromatic objects). Such interactive editing… ▽ More

    Submitted 26 September, 2020; originally announced September 2020.

    Comments: To appear in Color and Imaging Conference (CIC28), 2020

  29. arXiv:2007.14030  [pdf, other

    cs.CR cs.SI

    A Large-Scale Analysis of Attacker Activity in Compromised Enterprise Accounts

    Authors: Neil Shah, Grant Ho, Marco Schweighauser, M. H. Afifi, Asaf Cidon, David Wagner

    Abstract: We present a large-scale characterization of attacker activity across 111 real-world enterprise organizations. We develop a novel forensic technique for distinguishing between attacker activity and benign activity in compromised enterprise accounts that yields few false positives and enables us to perform fine-grained analysis of attacker behavior. Applying our methods to a set of 159 compromised… ▽ More

    Submitted 28 July, 2020; originally announced July 2020.

    Comments: Extended report of workshop paper presented at the 1st MLHat Workshop (MLHat Security and ML 2020). KDD, 2020

  30. arXiv:2006.12709  [pdf, other

    cs.CV eess.IV

    CIE XYZ Net: Unprocessing Images for Low-Level Computer Vision Tasks

    Authors: Mahmoud Afifi, Abdelrahman Abdelhamed, Abdullah Abuolaim, Abhijith Punnappurath, Michael S. Brown

    Abstract: Cameras currently allow access to two image states: (i) a minimally processed linear raw-RGB image state (i.e., raw sensor data) or (ii) a highly-processed nonlinear image state (e.g., sRGB). There are many computer vision tasks that work best with a linear image state, such as image deblurring and image dehazing. Unfortunately, the vast majority of images are saved in the nonlinear image state. B… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  31. arXiv:2005.04117  [pdf, other

    cs.CV eess.IV

    NTIRE 2020 Challenge on Real Image Denoising: Dataset, Methods and Results

    Authors: Abdelrahman Abdelhamed, Mahmoud Afifi, Radu Timofte, Michael S. Brown, Yue Cao, Zhilu Zhang, Wangmeng Zuo, Xiaoling Zhang, Jiye Liu, Wendong Chen, Changyuan Wen, Meng Liu, Shuailin Lv, Yunchao Zhang, Zhihong Pan, Baopu Li, Teng Xi, Yanwen Fan, Xiyu Yu, Gang Zhang, Jingtuo Liu, Junyu Han, Errui Ding, Songhyun Yu, Bumjun Park , et al. (65 additional authors not shown)

    Abstract: This paper reviews the NTIRE 2020 challenge on real image denoising with focus on the newly introduced dataset, the proposed methods and their results. The challenge is a new version of the previous NTIRE 2019 challenge on real image denoising that was based on the SIDD benchmark. This challenge is based on a newly collected validation and testing image datasets, and hence, named SIDD+. This chall… ▽ More

    Submitted 8 May, 2020; originally announced May 2020.

  32. arXiv:2004.01354  [pdf, other

    cs.CV

    Deep White-Balance Editing

    Authors: Mahmoud Afifi, Michael S. Brown

    Abstract: We introduce a deep learning approach to realistically edit an sRGB image's white balance. Cameras capture sensor images that are rendered by their integrated signal processor (ISP) to a standard RGB (sRGB) color space encoding. The ISP rendering begins with a white-balance procedure that is used to remove the color cast of the scene's illumination. The ISP then applies a series of nonlinear color… ▽ More

    Submitted 2 April, 2020; originally announced April 2020.

    Comments: Accepted as Oral at CVPR 2020

  33. arXiv:2003.11596  [pdf, other

    eess.IV cs.CV

    Learning Multi-Scale Photo Exposure Correction

    Authors: Mahmoud Afifi, Konstantinos G. Derpanis, Björn Ommer, Michael S. Brown

    Abstract: Capturing photographs with wrong exposures remains a major source of errors in camera-based imaging. Exposure problems are categorized as either: (i) overexposed, where the camera exposure was too long, resulting in bright and washed-out image regions, or (ii) underexposed, where the exposure was too short, resulting in dark regions. Both under- and overexposure greatly reduce the contrast and vis… ▽ More

    Submitted 30 March, 2021; v1 submitted 25 March, 2020; originally announced March 2020.

    Comments: CVPR 2021

  34. arXiv:1912.06960  [pdf, other

    cs.CV

    What Else Can Fool Deep Learning? Addressing Color Constancy Errors on Deep Neural Network Performance

    Authors: Mahmoud Afifi, Michael S Brown

    Abstract: There is active research targeting local image manipulations that can fool deep neural networks (DNNs) into producing incorrect results. This paper examines a type of global image manipulation that can produce similar adverse effects. Specifically, we explore how strong color casts caused by incorrectly applied computational color constancy - referred to as white balance (WB) in photography - nega… ▽ More

    Submitted 14 December, 2019; originally announced December 2019.

    Comments: ICCV 2019

  35. arXiv:1912.06888  [pdf, other

    cs.CV

    Sensor-Independent Illumination Estimation for DNN Models

    Authors: Mahmoud Afifi, Michael S. Brown

    Abstract: While modern deep neural networks (DNNs) achieve state-of-the-art results for illuminant estimation, it is currently necessary to train a separate DNN for each type of camera sensor. This means when a camera manufacturer uses a new sensor, it is necessary to retrain an existing DNN model with training images captured by the new sensor. This paper addresses this problem by introducing a novel senso… ▽ More

    Submitted 14 December, 2019; originally announced December 2019.

    Journal ref: BMVC 2019

  36. arXiv:1905.06653  [pdf

    cs.CV

    Robust Real-time Pedestrian Detection in Aerial Imagery on Jetson TX2

    Authors: Mohamed Afifi, Yara Ali, Karim Amer, Mahmoud Shaker, Mohamed ElHelw

    Abstract: Detection of pedestrians in aerial imagery captured by drones has many applications including intersection monitoring, patrolling, and surveillance, to name a few. However, the problem is involved due to continuouslychanging camera viewpoint and object appearance as well as the need for lightweight algorithms to run on on-board embedded systems. To address this issue, the paper proposes a framewor… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

  37. arXiv:1802.01009  [pdf, other

    cs.CV

    Image Posterization Using Fuzzy Logic and Bilateral Filter

    Authors: Mahmoud Afifi

    Abstract: Image posterization is converting images with a large number of tones into synthetic images with distinct flat areas and a fewer number of tones. In this technical report, we present the implementation and results of using fuzzy logic in order to generate a posterized image in a simple and fast way. The image filter is based on fuzzy logic and bilateral filtering; where, the given image is blurred… ▽ More

    Submitted 3 February, 2018; originally announced February 2018.

  38. arXiv:1802.00153  [pdf, other

    cs.CV

    Semantic White Balance: Semantic Color Constancy Using Convolutional Neural Network

    Authors: Mahmoud Afifi

    Abstract: The goal of computational color constancy is to preserve the perceptive colors of objects under different lighting conditions by removing the effect of color casts caused by the scene's illumination. With the rapid development of deep learning based techniques, significant progress has been made in image semantic segmentation. In this work, we exploit the semantic information together with the col… ▽ More

    Submitted 31 May, 2019; v1 submitted 31 January, 2018; originally announced February 2018.

    Comments: Deep Learning and Reinforcement Learning Summer School (DLR), CIFAR - Vector Institute, (Poster sessions), 2018

  39. arXiv:1711.04322  [pdf, other

    cs.CV

    11K Hands: Gender recognition and biometric identification using a large dataset of hand images

    Authors: Mahmoud Afifi

    Abstract: The human hand possesses distinctive features which can reveal gender information. In addition, the hand is considered one of the primary biometric traits used to identify a person. In this work, we propose a large dataset of human hand images (dorsal and palmar sides) with detailed ground-truth information for gender recognition and biometric identification. Using this dataset, a convolutional ne… ▽ More

    Submitted 16 September, 2018; v1 submitted 12 November, 2017; originally announced November 2017.

  40. arXiv:1711.00972  [pdf, other

    cs.CV

    The Achievement of Higher Flexibility in Multiple Choice-based Tests Using Image Classification Techniques

    Authors: Mahmoud Afifi, Khaled F. Hussain

    Abstract: In spite of the high accuracy of the existing optical mark reading (OMR) systems and devices, a few restrictions remain existent. In this work, we aim to reduce the restrictions of multiple choice questions (MCQ) within tests. We use an image registration technique to extract the answer boxes from answer sheets. Unlike other systems that rely on simple image processing steps to recognize the extra… ▽ More

    Submitted 11 January, 2019; v1 submitted 2 November, 2017; originally announced November 2017.

  41. arXiv:1709.07720  [pdf, other

    cs.CV

    Can We Boost the Power of the Viola-Jones Face Detector Using Pre-processing? An Empirical Study

    Authors: Mahmoud Afifi, Marwa Nasser, Mostafa Korashy, Katherine Rohde, Aly Abdelrahim

    Abstract: The Viola-Jones face detection algorithm was (and still is) a quite popular face detector. In spite of the numerous face detection techniques that have been recently presented, there are many research works that are still based on the Viola-Jones algorithm because of its simplicity. In this paper, we study the influence of a set of blind pre-processing methods on the face detection rate using the… ▽ More

    Submitted 10 December, 2017; v1 submitted 22 September, 2017; originally announced September 2017.

    Comments: 14 pages, 10 figures, 8 tables

  42. arXiv:1706.04277  [pdf, other

    cs.CV

    AFIF4: Deep Gender Classification based on AdaBoost-based Fusion of Isolated Facial Features and Foggy Faces

    Authors: Mahmoud Afifi, Abdelrahman Abdelhamed

    Abstract: Gender classification aims at recognizing a person's gender. Despite the high accuracy achieved by state-of-the-art methods for this task, there is still room for improvement in generalized and unrestricted datasets. In this paper, we advocate a new strategy inspired by the behavior of humans in gender recognition. Instead of dealing with the face image as a sole feature, we rely on the combinatio… ▽ More

    Submitted 17 November, 2017; v1 submitted 13 June, 2017; originally announced June 2017.

    Comments: 26 pages, 7 figures, 7 tables

  43. Can We See Photosynthesis? Magnifying the Tiny Color Changes of Plant Green Leaves Using Eulerian Video Magnification

    Authors: Islam A. T. F. Taj-Eddin, Mahmoud Afifi, Mostafa Korashy, Ali H. Ahmed, Ng Yoke Cheng, Evelyng Hernandez, Salma M. Abdel-latif

    Abstract: Plant aliveness is proven through laboratory experiments and special scientific instruments. In this paper, we aim to detect the degree of animation of plants based on the magnification of the small color changes in the plant's green leaves using the Eulerian video magnification. Capturing the video under a controlled environment, e.g., using a tripod and direct current (DC) light sources, reduces… ▽ More

    Submitted 29 August, 2017; v1 submitted 12 June, 2017; originally announced June 2017.

    Comments: 7 pages, 3 figures

    Journal ref: J. Electron. Imaging, 2017

  44. arXiv:1602.08472  [pdf, ps, other

    cs.CR

    ExpSOS: Secure and Verifiable Outsourcing of Exponentiation Operations for Mobile Cloud Computing

    Authors: Kai Zhou, M. H. Afifi, Jian Ren

    Abstract: Discrete exponential operation, such as modular exponentiation and scalar multiplication on elliptic curves, is a basic operation of many public-key cryptosystems. However, the exponential operations are considered prohibitively expensive for resource-constrained mobile devices. In this paper, we address the problem of secure outsourcing of exponentiation operations to one single untrusted server.… ▽ More

    Submitted 26 February, 2016; originally announced February 2016.

    Comments: 28 pages, journal paper

  45. arXiv:1005.4005  [pdf

    cs.CE

    Optical phase extraction algorithm based on the continuous wavelet and the Hilbert transforms

    Authors: Mustapha Bahich, Mohamed Afifi, Elmostafa Barj

    Abstract: In this paper we present an algorithm for optical phase evaluation based on the wavelet transform technique. The main advantage of this method is that it requires only one fringe pattern. This algorithm is based on the use of a second π/2 phase shifted fringe pattern where it is calculated via the Hilbert transform. To test its validity, the algorithm was used to demodulate a simulated fringe patt… ▽ More

    Submitted 21 May, 2010; originally announced May 2010.

    Comments: www.journalofcomputing.org

    Journal ref: Journal of Computing, Volume 2, Issue 5, May 2010