Skip to main content

Showing 1–50 of 68 results for author: Choudhary, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2603.20021  [pdf, ps, other

    cs.LG

    ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images

    Authors: Anand Choudhary, Xiaowu Sun, Thabo Mahendiran, Ortal Senouf, Denise Auberson, Bernard De Bruyne, Stephane Fournier, Olivier Muller, Emmanuel Abbé, Pascal Frossard, Dorina Thanou

    Abstract: Invasive Coronary Angiography (ICA) is the clinical gold standard for the assessment of coronary artery disease. However, its interpretation remains subjective and prone to intra- and inter-operator variability. In this work, we introduce ODySSeI: an Open-source end-to-end framework for automated Detection, Segmentation, and Severity estimation of lesions in ICA images. ODySSeI integrates deep lea… ▽ More

    Submitted 20 March, 2026; originally announced March 2026.

  2. arXiv:2603.10904  [pdf, ps, other

    cs.SD cs.AI cs.ET

    When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS

    Authors: Anupam Purwar, Aditya Choudhary

    Abstract: Large language models are increasingly adopted as semantic backbones for neural text-to-speech systems. However, frozen LLM representations are insufficient for modeling speaker specific acoustic and perceptual characteristics. Our experiments involving fine tuning of the Language Model backbone of TTS show promise in improving the voice consistency and Signal to Noise ratio SNR in voice cloning t… ▽ More

    Submitted 11 March, 2026; originally announced March 2026.

    Comments: We finetune the Qwen 0.5B backbone in an LLM TTS with LoRA to raise MOS speaker similarity and SNR. It works best with diverse training audio with uniform data it can amplify noise so tune decoding and use GGUF quantization for low latency stable quality

  3. arXiv:2603.09643  [pdf, ps, other

    cs.ET cs.AI

    MM-tau-p$^2$: Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings

    Authors: Anupam Purwar, Aditya Choudhary

    Abstract: Current evaluation frameworks and benchmarks for LLM powered agents focus on text chat driven agents, these frameworks do not expose the persona of user to the agent, thus operating in a user agnostic environment. Importantly, in customer experience management domain, the agent's behaviour evolves as the agent learns about user personality. With proliferation of real time TTS and multi-modal langu… ▽ More

    Submitted 7 April, 2026; v1 submitted 10 March, 2026; originally announced March 2026.

    Comments: A benchmark for evaluating multimodal both voice and text LLM agents in dualcontrol settings. We introduce persona adaptive prompting and 12 new metrics to assess robustness safety efficiency and recovery in customer support scenarios

  4. arXiv:2602.09162  [pdf, ps, other

    cs.LG cond-mat.mtrl-sci

    Boltzmann Reinforcement Learning for Noise resilience in Analog Ising Machines

    Authors: Aditya Choudhary, Saaketh Desai, Prasad Iyer

    Abstract: Analog Ising machines (AIMs) have emerged as a promising paradigm for combinatorial optimization, utilizing physical dynamics to solve Ising problems with high energy efficiency. However, the performance of traditional optimization and sampling algorithms on these platforms is often limited by inherent measurement noise. We introduce BRAIN (Boltzmann Reinforcement for Analog Ising Networks), a dis… ▽ More

    Submitted 9 February, 2026; originally announced February 2026.

  5. arXiv:2601.07367  [pdf, ps, other

    cs.SD

    FOCAL: A Novel Benchmarking Technique for Multi-modal Agents

    Authors: Anupam Purwar, Aditya Choudhary

    Abstract: With the recent advancements in reasoning capabilities, tool calling using MCP servers and Audio Language Models (ALMs), development and integration of multi-modal agents (with voice and text support) has come to the industry forefront. Cascading pipelines for voice agents still play a central role in the industry owing to their superior reasoning capabilities facilitated by LLMs. Although, cascad… ▽ More

    Submitted 2 March, 2026; v1 submitted 12 January, 2026; originally announced January 2026.

    Comments: We present a framework for evaluation of Multi-modal Agents consisting of Voice-to-voice model components viz. Text to Speech (TTS), Retrieval Augmented Generation (RAG) and Speech-to-text (STT)

  6. arXiv:2512.23747  [pdf, ps, other

    cs.SE cs.AI cs.CL

    State-of-the-art Small Language Coder Model: Mify-Coder

    Authors: Abhinav Parmar, Abhisek Panigrahi, Abhishek Kumar Dwivedi, Abhishek Bhattacharya, Adarsh Ramachandra, Aditya Choudhary, Aditya Garg, Aditya Raj, Alankrit Bhatt, Alpesh Yadav, Anant Vishnu, Ananthu Pillai, Ankush Kumar, Aryan Patnaik, Aswatha Narayanan S, Avanish Raj Singh, Bhavya Shree Gadda, Brijesh Pankajbhai Kachhadiya, Buggala Jahnavi, Chidurala Nithin Krishna, Chintan Shah, Chunduru Akshaya, Debarshi Banerjee, Debrup Dey, Deepa R. , et al. (71 additional authors not shown)

    Abstract: We present Mify-Coder, a 2.5B-parameter code model trained on 4.2T tokens using a compute-optimal strategy built on the Mify-2.5B foundation model. Mify-Coder achieves comparable accuracy and safety while significantly outperforming much larger baseline models on standard coding and function-calling benchmarks, demonstrating that compact models can match frontier-grade models in code generation an… ▽ More

    Submitted 26 December, 2025; originally announced December 2025.

  7. arXiv:2510.25170  [pdf, ps, other

    cs.DC

    Multi-Resolution Model Fusion for Accelerating the Convolutional Neural Network Training

    Authors: Kewei Wang, Claire Songhyun Lee, Sunwoo Lee, Vishu Gupta, Jan Balewski, Alex Sim, Peter Nugent, Ankit Agrawal, Alok Choudhary, Kesheng Wu, Wei-keng Liao

    Abstract: Neural networks are rapidly gaining popularity in scientific research, but training the models is often very time-consuming. Particularly when the training data samples are large high-dimensional arrays, efficient training methodologies that can reduce the computational costs are crucial. To reduce the training cost, we propose a Multi-Resolution Model Fusion (MRMF) method that combines models tra… ▽ More

    Submitted 29 October, 2025; originally announced October 2025.

  8. arXiv:2510.19778  [pdf, ps, other

    cs.LG cs.CL

    GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters

    Authors: Anand Choudhary, Yasser Sulaıman, Lukas Mauch, Ghouthi Boukli Hacene, Fabien Cardinaux, Antoine Bosselut

    Abstract: Sparse fine-tuning techniques adapt LLMs to downstream tasks by only tuning a sparse subset of model parameters. However, the effectiveness of sparse adaptation depends on optimally selecting the model parameters to be fine-tuned. In this work, we introduce a novel sparse fine-tuning technique named GaLLoP: Gradient-based Sparse Learning on Low-Magnitude Parameters, which fine-tunes only those mod… ▽ More

    Submitted 22 October, 2025; originally announced October 2025.

  9. arXiv:2510.04923  [pdf, ps, other

    cs.CV cs.AI

    REN: Anatomically-Informed Mixture-of-Experts for Interstitial Lung Disease Diagnosis

    Authors: Alec K. Peltekian, Halil Ertugrul Aktas, Gorkem Durak, Kevin Grudzinski, Bradford C. Bemiss, Carrie Richardson, Jane E. Dematte, G. R. Scott Budinger, Anthony J. Esposito, Alexander Misharin, Alok Choudhary, Ankit Agrawal, Ulas Bagci

    Abstract: Mixture-of-Experts (MoE) architectures achieve scalable learning by routing inputs to specialized subnetworks through conditional computation. However, conventional MoE designs assume homogeneous expert capability and domain-agnostic routing-assumptions that are fundamentally misaligned with medical imaging, where anatomical structure and regional disease heterogeneity govern pathological patterns… ▽ More

    Submitted 30 March, 2026; v1 submitted 6 October, 2025; originally announced October 2025.

    Comments: 13 pages, 4 figures, 5 tables

  10. Imaging-Based Mortality Prediction in Patients with Systemic Sclerosis

    Authors: Alec K. Peltekian, Karolina Senkow, Gorkem Durak, Kevin M. Grudzinski, Bradford C. Bemiss, Jane E. Dematte, Carrie Richardson, Nikolay S. Markov, Mary Carns, Kathleen Aren, Alexandra Soriano, Matthew Dapas, Harris Perlman, Aaron Gundersheimer, Kavitha C. Selvan, John Varga, Monique Hinchcliff, Krishnan Warrior, Catherine A. Gao, Richard G. Wunderink, GR Scott Budinger, Alok N. Choudhary, Anthony J. Esposito, Alexander V. Misharin, Ankit Agrawal , et al. (1 additional authors not shown)

    Abstract: Interstitial lung disease (ILD) is a leading cause of morbidity and mortality in systemic sclerosis (SSc). Chest computed tomography (CT) is the primary imaging modality for diagnosing and monitoring lung complications in SSc patients. However, its role in disease progression and mortality prediction has not yet been fully clarified. This study introduces a novel, large-scale longitudinal chest CT… ▽ More

    Submitted 27 September, 2025; originally announced September 2025.

    Comments: 11 pages, 4 figures, 1 table, accepted in MICCAI PRIME 2025

    Journal ref: MICCAI PRIME 2025

  11. arXiv:2509.20971  [pdf, ps, other

    cs.SD cs.AI

    i-LAVA: Insights on Low Latency Voice-2-Voice Architecture for Agents

    Authors: Anupam Purwar, Aditya Choudhary

    Abstract: We experiment with a low-latency, end-to-end voice-to-voice communication model to optimize it for real-time conversational applications. By analyzing components essential to voice to voice (V-2-V) system viz. automatic speech recognition (ASR), text-to-speech (TTS), and dialog management, our work analyzes how to reduce processing time while maintaining high-quality interactions to identify the l… ▽ More

    Submitted 27 September, 2025; v1 submitted 25 September, 2025; originally announced September 2025.

    Comments: This paper analyzes a low-latency, end-to-end voice-to-voice (V-2-V) architecture, identifying that the Text-to-Speech (TTS) component has the highest impact on real-time performance. By reducing the number of Residual Vector Quantization (RVQ) iterations in the TTS model, latency can be effectively halved. Its accepted at AIML Systems 2025

  12. arXiv:2508.06627  [pdf, ps, other

    cs.LG cs.AI

    Early Detection of Pancreatic Cancer Using Multimodal Learning on Electronic Health Records

    Authors: Mosbah Aouad, Anirudh Choudhary, Awais Farooq, Steven Nevers, Lusine Demirkhanyan, Bhrandon Harris, Suguna Pappu, Christopher Gondi, Ravishankar Iyer

    Abstract: Pancreatic ductal adenocarcinoma (PDAC) is one of the deadliest cancers, and early detection remains a major clinical challenge due to the absence of specific symptoms and reliable biomarkers. In this work, we propose a new multimodal approach that integrates longitudinal diagnosis code histories and routinely collected laboratory measurements from electronic health records to detect PDAC up to on… ▽ More

    Submitted 18 August, 2025; v1 submitted 8 August, 2025; originally announced August 2025.

    Journal ref: Proceedings of Machine Learning for Healthcare (2025)

  13. PunchPulse: A Physically Demanding Virtual Reality Boxing Game Designed with, for and by Blind and Low-Vision Players

    Authors: Sanchita S. Kamath, Omar Khan, Anurag Choudhary, Jan Meyerhoff-Liang, Soyoung Choi, JooYoung Seo

    Abstract: Blind and low-vision (BLV) individuals experience lower levels of physical activity (PA) compared to sighted peers due to a lack of accessible, engaging exercise options. Existing solutions often rely on auditory cues but do not fully integrate rich sensory feedback or support spatial navigation, limiting their effectiveness. This study introduces PunchPulse, a virtual reality (VR) boxing exergame… ▽ More

    Submitted 4 August, 2025; originally announced August 2025.

  14. arXiv:2506.17309  [pdf, ps, other

    cs.CR cs.LG

    Efficient Malware Detection with Optimized Learning on High-Dimensional Features

    Authors: Aditya Choudhary, Sarthak Pawar, Yashodhara Haribhakta

    Abstract: Malware detection using machine learning requires feature extraction from binary files, as models cannot process raw binaries directly. A common approach involves using LIEF for raw feature extraction and the EMBER vectorizer to generate 2381-dimensional feature vectors. However, the high dimensionality of these features introduces significant computational challenges. This study addresses these c… ▽ More

    Submitted 18 June, 2025; originally announced June 2025.

    Comments: This paper has been accepted for presentation at the International Conference on Innovations in Intelligent Systems: Advancements in Computing, Communication, and Cybersecurity (ISAC3)

  15. arXiv:2506.16597  [pdf, ps, other

    astro-ph.EP astro-ph.IM cs.CV

    Exoplanet Classification through Vision Transformers with Temporal Image Analysis

    Authors: Anupma Choudhary, Sohith Bandari, B. S. Kushvah, C. Swastik

    Abstract: The classification of exoplanets has been a longstanding challenge in astronomy, requiring significant computational and observational resources. Traditional methods demand substantial effort, time, and cost, highlighting the need for advanced machine learning techniques to enhance classification efficiency. In this study, we propose a methodology that transforms raw light curve data from NASA's K… ▽ More

    Submitted 19 June, 2025; originally announced June 2025.

    Comments: Accepted for publication in the Astronomical Journal

  16. arXiv:2506.15114  [pdf, ps, other

    cs.DC

    Parallel Data Object Creation: Towards Scalable Metadata Management in High-Performance I/O Library

    Authors: Youjia Li, Robert Latham, Robert Ross, Ankit Agrawal, Alok Choudhary, Wei-Keng Liao

    Abstract: High-level I/O libraries, such as HDF5 and PnetCDF, are commonly used by large-scale scientific applications to perform I/O tasks in parallel. These I/O libraries store the metadata such as data types and dimensionality along with the raw data in the same files. While these libraries are well-optimized for concurrent access to the raw data, they are designed neither to handle a large number of dat… ▽ More

    Submitted 17 June, 2025; originally announced June 2025.

  17. arXiv:2505.04962  [pdf, other

    cs.CV cs.RO

    An Efficient Method for Accurate Pose Estimation and Error Correction of Cuboidal Objects

    Authors: Utsav Rai, Hardik Mehta, Vismay Vakharia, Aditya Choudhary, Amit Parmar, Rolif Lima, Kaushik Das

    Abstract: The proposed system outlined in this paper is a solution to a use case that requires the autonomous picking of cuboidal objects from an organized or unorganized pile with high precision. This paper presents an efficient method for precise pose estimation of cuboid-shaped objects, which aims to reduce errors in target pose in a time-efficient manner. Typical pose estimation methods like global poin… ▽ More

    Submitted 8 May, 2025; originally announced May 2025.

    Comments: Accepted in IEEE/RSJ IROS 2022 Workshop on Mobile Manipulation and Embodied Intelligence (MOMA)

  18. arXiv:2504.21331  [pdf

    cond-mat.mtrl-sci cs.CV

    Towards Space Group Determination from EBSD Patterns: The Role of Deep Learning and High-throughput Dynamical Simulations

    Authors: Alfred Yan, Muhammad Nur Talha Kilic, Gert Nolze, Ankit Agrawal, Alok Choudhary, Roberto dos Reis, Vinayak Dravid

    Abstract: The design of novel materials hinges on the understanding of structure-property relationships. However, in recent times, our capability to synthesize a large number of materials has outpaced our speed at characterizing them. While the overall chemical constituents can be readily known during synthesis, the structural evolution and characterization of newly synthesized samples remains a bottleneck… ▽ More

    Submitted 2 May, 2025; v1 submitted 30 April, 2025; originally announced April 2025.

    Comments: 33 pages, preliminary version

  19. arXiv:2503.19115  [pdf, ps, other

    q-bio.MN cs.NE

    Implementation of Support Vector Machines using Reaction Networks

    Authors: Amey Choudhary, Jiaxin Jin, Abhishek Deshpande

    Abstract: Can machine learning algorithms be implemented using chemistry? We demonstrate that this is possible in the case of support vector machines (SVMs). SVMs are powerful tools for data classification, leveraging Vapnik-Chervonenkis theory to handle high-dimensional data and small datasets effectively. In this work, we propose a chemical reaction network scheme for implementing SVMs, utilizing the stea… ▽ More

    Submitted 1 April, 2026; v1 submitted 24 March, 2025; originally announced March 2025.

    Comments: 28 pages, 4 figures, 1 table

  20. arXiv:2503.08437  [pdf, other

    cs.CV cs.AI cs.HC cs.RO

    ICPR 2024 Competition on Rider Intention Prediction

    Authors: Shankar Gangisetty, Abdul Wasi, Shyam Nandan Rai, C. V. Jawahar, Sajay Raj, Manish Prajapati, Ayesha Choudhary, Aaryadev Chandra, Dev Chandan, Shireen Chand, Suvaditya Mukherjee

    Abstract: The recent surge in the vehicle market has led to an alarming increase in road accidents. This underscores the critical importance of enhancing road safety measures, particularly for vulnerable road users like motorcyclists. Hence, we introduce the rider intention prediction (RIP) competition that aims to address challenges in rider safety by proactively predicting maneuvers before they occur, the… ▽ More

    Submitted 11 March, 2025; originally announced March 2025.

  21. arXiv:2501.06434  [pdf, other

    cs.CL cs.AI

    Synthetic Feature Augmentation Improves Generalization Performance of Language Models

    Authors: Ashok Choudhary, Cornelius Thiels, Hojjat Salehinejad

    Abstract: Training and fine-tuning deep learning models, especially large language models (LLMs), on limited and imbalanced datasets poses substantial challenges. These issues often result in poor generalization, where models overfit to dominant classes and underperform on minority classes, leading to biased predictions and reduced robustness in real-world applications. To overcome these challenges, we prop… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: Accepted for presentation at IEEE SSCI 2025

  22. arXiv:2501.05260  [pdf

    cs.CL cs.AI cs.LG

    Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing

    Authors: Atharva Mutsaddi, Aditya Choudhary

    Abstract: Plagiarism involves using another person's work or concepts without proper attribution, presenting them as original creations. With the growing amount of data communicated in regional languages such as Marathi -- one of India's regional languages -- it is crucial to design robust plagiarism detection systems tailored for low-resource languages. Language models like Bidirectional Encoder Representa… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: Accepted into LoResLM: The First Workshop on Language Models for Low-Resource Languages, colocated with COLING 2025 and set to be published into ACL Anthology

    ACM Class: I.2.7; H.3.3

  23. arXiv:2412.01132  [pdf, other

    cs.CV

    Eyes on the Road: State-of-the-Art Video Question Answering Models Assessment for Traffic Monitoring Tasks

    Authors: Joseph Raj Vishal, Divesh Basina, Aarya Choudhary, Bharatesh Chakravarthi

    Abstract: Recent advances in video question answering (VideoQA) offer promising applications, especially in traffic monitoring, where efficient video interpretation is critical. Within ITS, answering complex, real-time queries like "How many red cars passed in the last 10 minutes?" or "Was there an incident between 3:00 PM and 3:05 PM?" enhances situational awareness and decision-making. Despite progress in… ▽ More

    Submitted 2 December, 2024; originally announced December 2024.

  24. arXiv:2411.17475  [pdf, ps, other

    cs.CV

    COBRA: A Continual Learning Approach to Vision-Brain Understanding

    Authors: Xuan-Bac Nguyen, Manuel Serna-Aguilera, Arabinda Kumar Choudhary, Pawan Sinha, Xin Li, Khoa Luu

    Abstract: Vision-Brain Understanding (VBU) aims to extract visual information perceived by humans from brain activity recorded through functional Magnetic Resonance Imaging (fMRI). Despite notable advancements in recent years, existing studies in VBU continue to face the challenge of catastrophic forgetting, where models lose knowledge from prior subjects as they adapt to new ones. Addressing continual lear… ▽ More

    Submitted 6 August, 2025; v1 submitted 25 November, 2024; originally announced November 2024.

  25. arXiv:2411.13378  [pdf, ps, other

    cs.CV

    Quantum-Brain: Quantum-Inspired Neural Network Approach to Vision-Brain Understanding

    Authors: Hoang-Quan Nguyen, Xuan-Bac Nguyen, Hugh Churchill, Arabinda Kumar Choudhary, Pawan Sinha, Samee U. Khan, Khoa Luu

    Abstract: Vision-brain understanding aims to extract semantic information about brain signals from human perceptions. Existing deep learning methods for vision-brain understanding are usually introduced in a traditional learning paradigm missing the ability to learn the connectivities between brain regions. Meanwhile, the quantum computing theory offers a new paradigm for designing deep learning models. Mot… ▽ More

    Submitted 14 August, 2025; v1 submitted 20 November, 2024; originally announced November 2024.

  26. arXiv:2411.10622  [pdf, other

    cs.LG cs.NE stat.ML

    KAT to KANs: A Review of Kolmogorov-Arnold Networks and the Neural Leap Forward

    Authors: Divesh Basina, Joseph Raj Vishal, Aarya Choudhary, Bharatesh Chakravarthi

    Abstract: The curse of dimensionality poses a significant challenge to modern multilayer perceptron-based architectures, often causing performance stagnation and scalability issues. Addressing this limitation typically requires vast amounts of data. In contrast, Kolmogorov-Arnold Networks have gained attention in the machine learning community for their bold claim of being unaffected by the curse of dimensi… ▽ More

    Submitted 15 November, 2024; originally announced November 2024.

  27. arXiv:2405.15777  [pdf, other

    cs.RO

    Multi-agent Collaborative Perception for Robotic Fleet: A Systematic Review

    Authors: Apoorv Singh, Gaurav Raut, Alka Choudhary

    Abstract: Collaborative perception in multi-robot fleets is a way to incorporate the power of unity in robotic fleets. Collaborative perception refers to the collective ability of multiple entities or agents to share and integrate their sensory information for a more comprehensive understanding of their environment. In other words, it involves the collaboration and fusion of data from various sensors or sou… ▽ More

    Submitted 22 March, 2024; originally announced May 2024.

    Comments: 10 pages, 4 figures, 3 tables

  28. arXiv:2311.13495  [pdf, other

    cs.CY cs.CL cs.LG

    Current Topological and Machine Learning Applications for Bias Detection in Text

    Authors: Colleen Farrelly, Yashbir Singh, Quincy A. Hathaway, Gunnar Carlsson, Ashok Choudhary, Rahul Paul, Gianfranco Doretto, Yassine Himeur, Shadi Atalls, Wathiq Mansoor

    Abstract: Institutional bias can impact patient outcomes, educational attainment, and legal system navigation. Written records often reflect bias, and once bias is identified; it is possible to refer individuals for training to reduce bias. Many machine learning tools exist to explore text data and create predictive models that can search written records to identify real-time bias. However, few previous stu… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  29. arXiv:2311.02087  [pdf

    cs.SD cs.AI cs.LG eess.AS eess.SP

    Design Of Rubble Analyzer Probe Using ML For Earthquake

    Authors: Abhishek Sebastian, R Pragna, K Vishal Vythianathan, Dasaraju Sohan Sai, U Shiva Sri Hari Al, R Anirudh, Apurv Choudhary

    Abstract: The earthquake rubble analyzer uses machine learning to detect human presence via ambient sounds, achieving 97.45% accuracy. It also provides real-time environmental data, aiding in assessing survival prospects for trapped individuals, crucial for post-earthquake rescue efforts

    Submitted 24 October, 2023; originally announced November 2023.

  30. arXiv:2309.03493  [pdf, other

    eess.IV cs.CV

    SAM3D: Segment Anything Model in Volumetric Medical Images

    Authors: Nhat-Tan Bui, Dinh-Hieu Hoang, Minh-Triet Tran, Gianfranco Doretto, Donald Adjeroh, Brijesh Patel, Arabinda Choudhary, Ngan Le

    Abstract: Image segmentation remains a pivotal component in medical image analysis, aiding in the extraction of critical information for precise diagnostic practices. With the advent of deep learning, automated image segmentation methods have risen to prominence, showcasing exceptional proficiency in processing medical imagery. Motivated by the Segment Anything Model (SAM)-a foundational model renowned for… ▽ More

    Submitted 5 March, 2024; v1 submitted 7 September, 2023; originally announced September 2023.

    Comments: Accepted at ISBI 2024

  31. arXiv:2308.15618  [pdf

    cs.CV cs.LG

    RACR-MIL: Rank-aware contextual reasoning for weakly supervised grading of squamous cell carcinoma using whole slide images

    Authors: Anirudh Choudhary, Mosbah Aouad, Krishnakant Saboo, Angelina Hwang, Jacob Kechter, Blake Bordeaux, Puneet Bhullar, David DiCaudo, Steven Nelson, Nneka Comfere, Emma Johnson, Olayemi Sokumbi, Jason Sluzevich, Leah Swanson, Dennis Murphree, Aaron Mangold, Ravishankar Iyer

    Abstract: Squamous cell carcinoma (SCC) is the most common cancer subtype, with an increasing incidence and a significant impact on cancer-related mortality. SCC grading using whole slide images is inherently challenging due to the lack of a reliable protocol and substantial tissue heterogeneity. We propose RACR-MIL, the first weakly-supervised SCC grading approach achieving robust generalization across mul… ▽ More

    Submitted 19 July, 2025; v1 submitted 29 August, 2023; originally announced August 2023.

    Comments: 17 pages main text, 2 page references, 2 page appendix; under submission

  32. arXiv:2304.14839  [pdf, other

    cs.RO

    Sampling-based Path Planning Algorithms: A Survey

    Authors: Alka Choudhary

    Abstract: Path planning is a classic problem for autonomous robots. To ensure safe and efficient point-to-point navigation an appropriate algorithm should be chosen keeping the robot's dimensions and its classification in mind. Autonomous robots use path-planning algorithms to safely navigate a dynamic, dense, and unknown environment. A few metrics for path planning algorithms to be taken into account are s… ▽ More

    Submitted 23 April, 2023; originally announced April 2023.

    Comments: 6 Pages, 2 Figures, 4 Algorithms, 2 Tables

  33. arXiv:2211.04011  [pdf, other

    cs.LG cs.CV

    An Incremental Phase Mapping Approach for X-ray Diffraction Patterns using Binary Peak Representations

    Authors: Dipendra Jha, K. V. L. V. Narayanachari, Ruifeng Zhang, Justin Liao, Denis T. Keane, Wei-keng Liao, Alok Choudhary, Yip-Wah Chung, Michael Bedzyk, Ankit Agrawal

    Abstract: Despite the huge advancement in knowledge discovery and data mining techniques, the X-ray diffraction (XRD) analysis process has mostly remained untouched and still involves manual investigation, comparison, and verification. Due to the large volume of XRD samples from high-throughput XRD experiments, it has become impossible for domain scientists to process them manually. Recently, they have star… ▽ More

    Submitted 8 November, 2022; originally announced November 2022.

    Comments: Accepted and presented at the International Workshop on Domain-Driven Data Mining (DDDM) as a part of the SIAM International Conference on Data Mining (SDM 2021). Contains 11 pages and 5 figures

  34. arXiv:2205.01168  [pdf, other

    cs.DC

    A Case Study on Parallel HDF5 Dataset Concatenation for High Energy Physics Data Analysis

    Authors: Sunwoo Lee, Kai-yuan Hou, Kewei Wang, Saba Sehrish, Marc Paterno, James Kowalkowski, Quincey Koziol, Robert Ross, Ankit Agrawal, Alok Choudhary, Wei-keng Liao

    Abstract: In High Energy Physics (HEP), experimentalists generate large volumes of data that, when analyzed, helps us better understand the fundamental particles and their interactions. This data is often captured in many files of small size, creating a data management challenge for scientists. In order to better facilitate data management, transfer, and analysis on large scale platforms, it is advantageous… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  35. Neuronal diversity can improve machine learning for physics and beyond

    Authors: Anshul Choudhary, Anil Radhakrishnan, John F. Lindner, Sudeshna Sinha, William L. Ditto

    Abstract: Diversity conveys advantages in nature, yet homogeneous neurons typically comprise the layers of artificial neural networks. Here we construct neural networks from neurons that learn their own activation functions, quickly diversify, and subsequently outperform their homogeneous counterparts on image classification and nonlinear regression tasks. Sub-networks instantiate the neurons, which meta-le… ▽ More

    Submitted 30 August, 2023; v1 submitted 8 April, 2022; originally announced April 2022.

    Comments: 13 pages, 9 figures

  36. arXiv:2106.16187  [pdf, other

    cs.LG

    Reinforcement Learning based Disease Progression Model for Alzheimer's Disease

    Authors: Krishnakant V. Saboo, Anirudh Choudhary, Yurui Cao, Gregory A. Worrell, David T. Jones, Ravishankar K. Iyer

    Abstract: We model Alzheimer's disease (AD) progression by combining differential equations (DEs) and reinforcement learning (RL) with domain knowledge. DEs provide relationships between some, but not all, factors relevant to AD. We assume that the missing relationships must satisfy general criteria about the working of the brain, for e.g., maximizing cognition while minimizing the cost of supporting cognit… ▽ More

    Submitted 2 November, 2021; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: 10 pages main text, 3 page references, 11 page appendix

  37. arXiv:2106.13675  [pdf

    cs.HC

    Creating and Implementing a Smart Speaker

    Authors: Sanskar Jethi, Avinash Kumar Choudhary, Yash Gupta, Abhishek Chaudhary

    Abstract: We have seen significant advancements in Artificial Intelligence and Machine Learning in the 21st century. It has enabled a new technology where we can have a human-like conversation with the machines. The most significant use of this speech recognition and contextual understanding technology exists in the form of a Smart Speaker. We have a wide variety of Smart Speaker products available to us. T… ▽ More

    Submitted 30 May, 2021; originally announced June 2021.

    Journal ref: IT in Industry, Vol. 9, No.3, 2021

  38. arXiv:2105.05151  [pdf, other

    cs.CG math.AT

    Improved Approximate Rips Filtrations with Shifted Integer Lattices and Cubical Complexes

    Authors: Aruni Choudhary, Michael Kerber, Sharath Raghvendra

    Abstract: Rips complexes are important structures for analyzing topological features of metric spaces. Unfortunately, generating these complexes is expensive because of a combinatorial explosion in the complex size. For $n$ points in $\mathbb{R}^d$, we present a scheme to construct a $2$-approximation of the filtration of the Rips complex in the $L_\infty$-norm, which extends to a $2d^{0.25}$-approximation… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

    Comments: To appear in Journal of Applied and Computational Topology. arXiv admin note: substantial text overlap with arXiv:1706.07399

  39. Domain Adaptive Egocentric Person Re-identification

    Authors: Ankit Choudhary, Deepak Mishra, Arnab Karmakar

    Abstract: Person re-identification (re-ID) in first-person (egocentric) vision is a fairly new and unexplored problem. With the increase of wearable video recording devices, egocentric data becomes readily available, and person re-identification has the potential to benefit greatly from this. However, there is a significant lack of large scale structured egocentric datasets for person re-identification, due… ▽ More

    Submitted 8 March, 2021; originally announced March 2021.

    Comments: 12 pages, 4 figures, In Proceedings of the Fifth IAPR International Conference on Computer Vision & Image Processing (CVIP), 2020

  40. arXiv:2101.10553  [pdf, other

    cs.LG

    A General Framework Combining Generative Adversarial Networks and Mixture Density Networks for Inverse Modeling in Microstructural Materials Design

    Authors: Zijiang Yang, Dipendra Jha, Arindam Paul, Wei-keng Liao, Alok Choudhary, Ankit Agrawal

    Abstract: Microstructural materials design is one of the most important applications of inverse modeling in materials science. Generally speaking, there are two broad modeling paradigms in scientific applications: forward and inverse. While the forward modeling estimates the observations based on known parameters, the inverse modeling attempts to infer the parameters given the observations. Inverse problems… ▽ More

    Submitted 25 January, 2021; originally announced January 2021.

  41. arXiv:2010.15201  [pdf, other

    cs.LG nlin.CD

    Forecasting Hamiltonian dynamics without canonical coordinates

    Authors: Anshul Choudhary, John F. Lindner, Elliott G. Holliday, Scott T. Miller, Sudeshna Sinha, William L. Ditto

    Abstract: Conventional neural networks are universal function approximators, but because they are unaware of underlying symmetries or physical laws, they may need impractically many training data to approximate nonlinear dynamics. Recently introduced Hamiltonian neural networks can efficiently learn and forecast dynamical systems that conserve energy, but they require special inputs called canonical coordin… ▽ More

    Submitted 28 October, 2020; originally announced October 2020.

    Comments: 7 pages, 4 figures

  42. arXiv:2008.08519  [pdf, other

    astro-ph.CO cs.DC

    Building Halo Merger Trees from the Q Continuum Simulation

    Authors: Esteban Rangel, Nicholas Frontiere, Salman Habib, Katrin Heitmann, Wei-keng Liao, Ankit Agrawal, Alok Choudhary

    Abstract: Cosmological N-body simulations rank among the most computationally intensive efforts today. A key challenge is the analysis of structure, substructure, and the merger history for many billions of compact particle clusters, called halos. Effectively representing the merging history of halos is essential for many galaxy formation models used to generate synthetic sky catalogs, an important applicat… ▽ More

    Submitted 19 August, 2020; originally announced August 2020.

    Comments: 2017 IEEE 24th International Conference on High Performance Computing

    Journal ref: 2017 IEEE 24th International Conference on High Performance Computing (HiPC), pp. 398-407. IEEE, 2017

  43. arXiv:2008.04214  [pdf, other

    cs.NE nlin.CD

    Mastering high-dimensional dynamics with Hamiltonian neural networks

    Authors: Scott T. Miller, John F. Lindner, Anshul Choudhary, Sudeshna Sinha, William L. Ditto

    Abstract: We detail how incorporating physics into neural network design can significantly improve the learning and forecasting of dynamical systems, even nonlinear systems of many dimensions. A map building perspective elucidates the superiority of Hamiltonian neural networks over conventional neural networks. The results clarify the critical relation between data, dimension, and neural network learning pe… ▽ More

    Submitted 28 July, 2020; originally announced August 2020.

    Comments: 7 pages, 9 figures

  44. arXiv:2003.09266  [pdf, other

    cs.CG cs.CC

    Computational Complexity of the $α$-Ham-Sandwich Problem

    Authors: Man-Kwun Chiu, Aruni Choudhary, Wolfgang Mulzer

    Abstract: The classic Ham-Sandwich theorem states that for any $d$ measurable sets in $\mathbb{R}^d$, there is a hyperplane that bisects them simultaneously. An extension by Bárány, Hubard, and Jerónimo [DCG 2008] states that if the sets are convex and \emph{well-separated}, then for any given $α_1, \dots, α_d \in [0, 1]$, there is a unique oriented hyperplane that cuts off a respective fraction… ▽ More

    Submitted 20 March, 2020; originally announced March 2020.

  45. arXiv:2003.02245  [pdf, other

    cs.CL cs.LG

    Data Augmentation using Pre-trained Transformer Models

    Authors: Varun Kumar, Ashutosh Choudhary, Eunah Cho

    Abstract: Language model based pre-trained models such as BERT have provided significant gains across different NLP tasks. In this paper, we study different types of transformer based pre-trained models such as auto-regressive models (GPT-2), auto-encoder models (BERT), and seq2seq models (BART) for conditional data augmentation. We show that prepending the class labels to text sequences provides a simple y… ▽ More

    Submitted 31 January, 2021; v1 submitted 4 March, 2020; originally announced March 2020.

    Comments: In Proceedings of the 2nd Workshop on Life-long Learning for Spoken Language Systems @ AACL 2020; Code: https://github.com/varinf/TransformersDataAugmentation

  46. arXiv:1907.12953  [pdf, other

    cs.LG stat.ML

    A real-time iterative machine learning approach for temperature profile prediction in additive manufacturing processes

    Authors: Arindam Paul, Mojtaba Mozaffar, Zijiang Yang, Wei-keng Liao, Alok Choudhary, Jian Cao, Ankit Agrawal

    Abstract: Additive Manufacturing (AM) is a manufacturing paradigm that builds three-dimensional objects from a computer-aided design model by successively adding material layer by layer. AM has become very popular in the past decade due to its utility for fast prototyping such as 3D printing as well as manufacturing functional parts with complex geometries using processes such as laser metal deposition that… ▽ More

    Submitted 9 August, 2019; v1 submitted 28 July, 2019; originally announced July 2019.

    Comments: 10 pages, 8 figures

    Journal ref: 6th IEEE International Conference on Data Science and Advanced Analytics (DSAA), 2019

  47. Improving MPI Collective I/O Performance With Intra-node Request Aggregation

    Authors: Qiao Kang, Sunwoo Lee, Kai-yuan Hou, Robert Ross, Ankit Agrawal, Alok Choudhary, Wei-keng Liao

    Abstract: Two-phase I/O is a well-known strategy for implementing collective MPI-IO functions. It redistributes I/O requests among the calling processes into a form that minimizes the file access costs. As modern parallel computers continue to grow into the exascale era, the communication cost of such request redistribution can quickly overwhelm collective I/O performance. This effect has been observed from… ▽ More

    Submitted 29 July, 2019; originally announced July 2019.

    Comments: 12 pages, 7 figures

  48. No-dimensional Tverberg Theorems and Algorithms

    Authors: Aruni Choudhary, Wolfgang Mulzer

    Abstract: Tverberg's theorem states that for any $k \ge 2$ and any set $P \subset \mathbb{R}^d$ of at least $(d + 1)(k - 1) + 1$ points in $d$ dimensions, we can partition $P$ into $k$ subsets whose convex hulls have a non-empty intersection. The associated search problem of finding the partition lies in the complexity class $\text{CLS} = \text{PPAD} \cap \text{PLS}$, but no hardness results are known. In t… ▽ More

    Submitted 5 July, 2023; v1 submitted 9 July, 2019; originally announced July 2019.

    Comments: 29 pages, 3 figures; a preliminary version appeared as SoCG 2020

    Journal ref: Discrete and Computational Geometry (DCG), 68(4), December 2022, pp. 964-996

  49. arXiv:1907.03222  [pdf, other

    physics.comp-ph cs.LG stat.ML

    IRNet: A General Purpose Deep Residual Regression Framework for Materials Discovery

    Authors: Dipendra Jha, Logan Ward, Zijiang Yang, Christopher Wolverton, Ian Foster, Wei-keng Liao, Alok Choudhary, Ankit Agrawal

    Abstract: Materials discovery is crucial for making scientific advances in many domains. Collections of data from experiments and first-principle computations have spurred interest in applying machine learning methods to create predictive models capable of mapping from composition and crystal structures to materials properties. Generally, these are regression problems with the input being a 1D vector compos… ▽ More

    Submitted 7 July, 2019; originally announced July 2019.

    Comments: 9 pages, under publication at KDD'19

  50. arXiv:1903.03178  [pdf, other

    cs.LG physics.chem-ph stat.ML

    Transfer Learning Using Ensemble Neural Networks for Organic Solar Cell Screening

    Authors: Arindam Paul, Dipendra Jha, Reda Al-Bahrani, Wei-keng Liao, Alok Choudhary, Ankit Agrawal

    Abstract: Organic Solar Cells are a promising technology for solving the clean energy crisis in the world. However, generating candidate chemical compounds for solar cells is a time-consuming process requiring thousands of hours of laboratory analysis. For a solar cell, the most important property is the power conversion efficiency which is dependent on the highest occupied molecular orbitals (HOMO) values… ▽ More

    Submitted 28 July, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: 8 pages, 11 figures, International Joint Conference on Neural Networks

    Journal ref: International Joint Conference on Neural Networks, Budapest Hungary, 14-19 July 2019