Skip to main content

Showing 1–50 of 79 results for author: Verma, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2512.06737  [pdf

    cs.LG cs.AI cs.CL cs.CV cs.NE

    Arc Gradient Descent: A Mathematically Derived Reformulation of Gradient Descent with Phase-Aware, User-Controlled Step Dynamics

    Authors: Nikhil Verma, Joonas Linnosmaa, Leonardo Espinosa-Leal, Napat Vajragupta

    Abstract: The paper presents the formulation, implementation, and evaluation of the ArcGD optimiser. The evaluation is conducted initially on a non-convex benchmark function and subsequently on a real-world ML dataset. The initial comparative study using the Adam optimiser is conducted on a stochastic variant of the highly non-convex and notoriously challenging Rosenbrock function, renowned for its narrow,… ▽ More

    Submitted 20 December, 2025; v1 submitted 7 December, 2025; originally announced December 2025.

    Comments: 80 pages, 6 tables, 2 figures, 5 appendices, proof-of-concept

  2. arXiv:2511.22143  [pdf, ps, other

    cs.CV cs.AI cs.LG

    Stacked Ensemble of Fine-Tuned CNNs for Knee Osteoarthritis Severity Grading

    Authors: Adarsh Gupta, Japleen Kaur, Tanvi Doshi, Teena Sharma, Nishchal K. Verma, Shantaram Vasikarla

    Abstract: Knee Osteoarthritis (KOA) is a musculoskeletal condition that can cause significant limitations and impairments in daily activities, especially among older individuals. To evaluate the severity of KOA, typically, X-ray images of the affected knee are analyzed, and a grade is assigned based on the Kellgren-Lawrence (KL) grading system, which classifies KOA severity into five levels, ranging from 0… ▽ More

    Submitted 27 November, 2025; originally announced November 2025.

    Comments: Accepted and Presented at IEEE UEMCON, IBM T.J. Watson Research Center, New York, USA, 2024

  3. arXiv:2510.23070  [pdf, ps, other

    cs.CL cs.AI

    Quality-Aware Translation Tagging in Multilingual RAG system

    Authors: Hoyeon Moon, Byeolhee Kim, Nikhil Verma

    Abstract: Multilingual Retrieval-Augmented Generation (mRAG) often retrieves English documents and translates them into the query language for low-resource settings. However, poor translation quality degrades response generation performance. Existing approaches either assume sufficient translation quality or utilize the rewriting method, which introduces factual distortion and hallucinations. To mitigate th… ▽ More

    Submitted 27 October, 2025; originally announced October 2025.

    Comments: EMNLP 2025 MRL Workshop

  4. arXiv:2510.18095  [pdf, ps, other

    cs.AI cs.CL

    SMaRT: Select, Mix, and ReinvenT -- A Strategy Fusion Framework for LLM-Driven Reasoning and Planning

    Authors: Nikhil Verma, Manasa Bharadwaj, Wonjun Jang, Harmanpreet Singh, Yixiao Wang, Homa Fashandi, Chul Lee

    Abstract: Large Language Models (LLMs) have redefined complex task automation with exceptional generalization capabilities. Despite these advancements, state-of-the-art methods rely on single-strategy prompting, missing the synergy of diverse reasoning approaches. No single strategy excels universally, highlighting the need for frameworks that fuse strategies to maximize performance and ensure robustness. W… ▽ More

    Submitted 20 October, 2025; originally announced October 2025.

  5. arXiv:2510.07746  [pdf, ps, other

    cs.LG

    t-SNE Exaggerates Clusters, Provably

    Authors: Noah Bergam, Szymon Snoeck, Nakul Verma

    Abstract: Central to the widespread use of t-distributed stochastic neighbor embedding (t-SNE) is the conviction that it produces visualizations whose structure roughly matches that of the input. To the contrary, we prove that (1) the strength of the input clustering, and (2) the extremity of outlier points, cannot be reliably inferred from the t-SNE output. We demonstrate the prevalence of these failure mo… ▽ More

    Submitted 8 October, 2025; originally announced October 2025.

  6. arXiv:2508.07119  [pdf, ps, other

    cs.CG math.MG

    Compressibility Barriers to Neighborhood-Preserving Data Visualizations

    Authors: Szymon Snoeck, Noah Bergam, Nakul Verma

    Abstract: To what extent is it possible to visualize high-dimensional datasets in a two- or three-dimensional space? We reframe this question in terms of embedding $n$-vertex graphs (representing the neighborhood structure of the input points) into metric spaces of low doubling dimension $d$, in such a way that maintains the separation between neighbors and non-neighbors. This seemingly lax embedding requir… ▽ More

    Submitted 9 August, 2025; originally announced August 2025.

  7. arXiv:2508.01888  [pdf, ps, other

    cs.LG cs.CR cs.MA

    Optimizing Day-Ahead Energy Trading with Proximal Policy Optimization and Blockchain

    Authors: Navneet Verma, Ying Xie

    Abstract: The increasing penetration of renewable energy sources in day-ahead energy markets introduces challenges in balancing supply and demand, ensuring grid resilience, and maintaining trust in decentralized trading systems. This paper proposes a novel framework that integrates the Proximal Policy Optimization (PPO) algorithm, a state-of-the-art reinforcement learning method, with blockchain technology… ▽ More

    Submitted 7 December, 2025; v1 submitted 3 August, 2025; originally announced August 2025.

  8. arXiv:2507.13190  [pdf, ps, other

    cs.CL

    GEMMAS: Graph-based Evaluation Metrics for Multi Agent Systems

    Authors: Jisoo Lee, Raeyoung Chang, Dongwook Kwon, Harmanpreet Singh, Nikhil Verma

    Abstract: Multi-agent systems built on language models have shown strong performance on collaborative reasoning tasks. However, existing evaluations focus only on the correctness of the final output, overlooking how inefficient communication and poor coordination contribute to redundant reasoning and higher computational costs. We introduce GEMMAS, a graph-based evaluation framework that analyzes the intern… ▽ More

    Submitted 17 July, 2025; originally announced July 2025.

    Comments: 4 figures, 1 algorithm, 2 tables, 6 pages, under review at EMNLP Industry track 2025

  9. arXiv:2507.04517  [pdf, ps, other

    cs.LG cs.CL

    DOTResize: Reducing LLM Width via Discrete Optimal Transport-based Neuron Merging

    Authors: Neha Verma, Kenton Murray, Kevin Duh

    Abstract: Model compression offers a promising path to reducing the cost and inaccessibility of large pre-trained models, without significantly compromising their impressive performance. Large Transformer models, including large language models (LLMs), often contain computational redundancy, which can serve as a target for new model compression methods. In this work, we specifically target neuron-level redu… ▽ More

    Submitted 6 July, 2025; originally announced July 2025.

  10. arXiv:2506.17449  [pdf, ps, other

    cs.AI

    OmniReflect: Discovering Transferable Constitutions for LLM agents via Neuro-Symbolic Reflections

    Authors: Manasa Bharadwaj, Nikhil Verma, Kevin Ferreira

    Abstract: Efforts to improve Large Language Model (LLM) agent performance on complex tasks have largely focused on fine-tuning and iterative self-correction. However, these approaches often lack generalizable mechanisms for longterm learning and remain inefficient in dynamic environments. We introduce OmniReflect, a hierarchical, reflection-driven framework that constructs a constitution, a compact set of g… ▽ More

    Submitted 20 June, 2025; originally announced June 2025.

  11. arXiv:2505.24544  [pdf, ps, other

    cs.CL cs.AI

    Cross-Attention Speculative Decoding

    Authors: Wei Zhong, Manasa Bharadwaj, Yixiao Wang, Nikhil Verma, Yipeng Ji, Chul Lee

    Abstract: Speculative decoding (SD) is a widely adopted approach for accelerating inference in large language models (LLMs), particularly when the draft and target models are well aligned. However, state-of-the-art SD methods typically rely on tightly coupled, self-attention-based Transformer decoders, often augmented with auxiliary pooling or fusion layers. This coupling makes them increasingly complex and… ▽ More

    Submitted 21 September, 2025; v1 submitted 30 May, 2025; originally announced May 2025.

  12. How to keep pushing ML accelerator performance? Know your rooflines!

    Authors: Marian Verhelst, Luca Benini, Naveen Verma

    Abstract: The rapidly growing importance of Machine Learning (ML) applications, coupled with their ever-increasing model size and inference energy footprint, has created a strong need for specialized ML hardware architectures. Numerous ML accelerators have been explored and implemented, primarily to increase task-level throughput per unit area and reduce task-level energy consumption. This paper surveys key… ▽ More

    Submitted 23 May, 2025; v1 submitted 22 May, 2025; originally announced May 2025.

    Comments: in IEEE Journal of Solid-State Circuits, 2025

  13. arXiv:2505.03407  [pdf, ps, other

    cs.IR cs.IT

    CB-cPIR: Code-Based Computational Private Information Retrieval

    Authors: Camilla Hollanti, Neehar Verma

    Abstract: A private information retrieval (PIR) scheme is a protocol that allows a user to retrieve a file from a database without revealing the identity of the desired file to a curious database. Given a distributed data storage system, efficient PIR can be achieved by making assumptions about the colluding capabilities of the storage servers holding the database. If these assumptions turn out to be incorr… ▽ More

    Submitted 7 August, 2025; v1 submitted 6 May, 2025; originally announced May 2025.

    Comments: This paper builds on the work done in arXiv: 2402.02871v1 (IEEE ISIT24) and arXiv: 2001.07049 (IEEE ISIT20) Remark 6. briefly outlines a fix to a new attack, this paper will soon be updated to reflect the changes to the scheme

  14. arXiv:2504.02708  [pdf, other

    cs.CL

    The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context

    Authors: Nikhil Verma, Manasa Bharadwaj

    Abstract: Alignment tuning has enabled large language models to excel in reasoning, instruction-following, and minimizing harmful generations. However, despite their widespread deployment, these models exhibit a monolingual bias, raising concerns about the effectiveness of alignment across languages. Current alignment methods predominantly focus on English, leaving it unclear how alignment mechanism general… ▽ More

    Submitted 3 April, 2025; originally announced April 2025.

    Comments: 14 pages, 11 Figures, 2 Tables, currently under review at ACL 2025

  15. arXiv:2503.19280  [pdf, other

    cs.DM cs.AI cs.HC

    LogicLearner: A Tool for the Guided Practice of Propositional Logic Proofs

    Authors: Amogh Inamdar, Uzay Macar, Michel Vazirani, Michael Tarnow, Zarina Mustapha, Natalia Dittren, Sam Sadeh, Nakul Verma, Ansaf Salleb-Aouissi

    Abstract: The study of propositional logic -- fundamental to the theory of computing -- is a cornerstone of the undergraduate computer science curriculum. Learning to solve logical proofs requires repeated guided practice, but undergraduate students often lack access to on-demand tutoring in a judgment-free environment. In this work, we highlight the need for guided practice tools in undergraduate mathemati… ▽ More

    Submitted 24 March, 2025; originally announced March 2025.

    Comments: 32 pages, 27 figures, open-source codebase linked in paper

  16. arXiv:2502.09955  [pdf, other

    cs.AI

    Diverse Inference and Verification for Advanced Reasoning

    Authors: Iddo Drori, Gaston Longhitano, Mao Mao, Seunghwan Hyun, Yuke Zhang, Sungjun Park, Zachary Meeks, Xin-Yu Zhang, Ben Segev, Howard Yong, Nakul Verma, Avi Shporer, Alon Amit, Madeleine Udell

    Abstract: Reasoning LLMs such as OpenAI o1, o3 and DeepSeek R1 have made significant progress in mathematics and coding, yet find challenging advanced tasks such as International Mathematical Olympiad (IMO) combinatorics problems, Abstraction and Reasoning Corpus (ARC) puzzles, and Humanity's Last Exam (HLE) questions. We use a diverse inference approach that combines multiple models and methods at test tim… ▽ More

    Submitted 14 February, 2025; originally announced February 2025.

    Comments: 165 pages

  17. arXiv:2501.06126  [pdf, other

    cs.CL cs.LG

    Merging Feed-Forward Sublayers for Compressed Transformers

    Authors: Neha Verma, Kenton Murray, Kevin Duh

    Abstract: With the rise and ubiquity of larger deep learning models, the need for high-quality compression techniques is growing in order to deploy these models widely. The sheer parameter count of these models makes it difficult to fit them into the memory constraints of different hardware. In this work, we present a novel approach to model compression by merging similar parameter groups within a model, ra… ▽ More

    Submitted 28 March, 2025; v1 submitted 10 January, 2025; originally announced January 2025.

  18. arXiv:2409.18581  [pdf, ps, other

    cs.LG stat.ML

    Deep Autoregressive Models as Causal Inference Engines

    Authors: Daniel Jiwoong Im, Kevin Zhang, Nakul Verma, Kyunghyun Cho

    Abstract: Existing causal inference (CI) models are often restricted to data with low-dimensional confounders and singleton actions. We propose an autoregressive (AR) CI framework capable of handling complex confounders and sequential actions commonly found in modern applications. Our approach accomplishes this using {\em sequencification}, which transforms data from an underlying causal diagram into a sequ… ▽ More

    Submitted 4 July, 2025; v1 submitted 27 September, 2024; originally announced September 2024.

  19. arXiv:2409.16126  [pdf, ps, other

    cs.CV

    VisioPhysioENet: Visual Physiological Engagement Detection Network

    Authors: Alakhsimar Singh, Kanav Goyal, Nischay Verma, Puneet Kumar, Xiaobai Li, Amritpal Singh

    Abstract: This paper presents VisioPhysioENet, a novel multimodal system that leverages visual and physiological signals to detect learner engagement. It employs a two-level approach for extracting both visual and physiological features. For visual feature extraction, Dlib is used to detect facial landmarks, while OpenCV provides additional estimations. The face recognition library, built on Dlib, is used t… ▽ More

    Submitted 20 August, 2025; v1 submitted 24 September, 2024; originally announced September 2024.

    Comments: 35 Pages, 4 figures, 5 Tables

  20. arXiv:2408.11981  [pdf, other

    cs.CL

    Large Language Models for Page Stream Segmentation

    Authors: Hunter Heidenreich, Ratish Dalvi, Rohith Mukku, Nikhil Verma, Neven Pičuljan

    Abstract: Page Stream Segmentation (PSS) is an essential prerequisite for automated document processing at scale. However, research progress has been limited by the absence of realistic public benchmarks. This paper works towards addressing this gap by introducing TABME++, an enhanced benchmark featuring commercial Optical Character Recognition (OCR) annotations. We evaluate the performance of large languag… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  21. arXiv:2405.11566  [pdf, other

    cs.LG

    Uncertainty-Aware PPG-2-ECG for Enhanced Cardiovascular Diagnosis using Diffusion Models

    Authors: Omer Belhasin, Idan Kligvasser, George Leifman, Regev Cohen, Erin Rainaldi, Li-Fang Cheng, Nishant Verma, Paul Varghese, Ehud Rivlin, Michael Elad

    Abstract: Analyzing the cardiovascular system condition via Electrocardiography (ECG) is a common and highly effective approach, and it has been practiced and perfected over many decades. ECG sensing is non-invasive and relatively easy to acquire, and yet it is still cumbersome for holter monitoring tests that may span over hours and even days. A possible alternative in this context is Photoplethysmography… ▽ More

    Submitted 20 April, 2025; v1 submitted 19 May, 2024; originally announced May 2024.

  22. GPT-DETOX: An In-Context Learning-Based Paraphraser for Text Detoxification

    Authors: Ali Pesaranghader, Nikhil Verma, Manasa Bharadwaj

    Abstract: Harmful and offensive communication or content is detrimental to social bonding and the mental state of users on social media platforms. Text detoxification is a crucial task in natural language processing (NLP), where the goal is removing profanity and toxicity from text while preserving its content. Supervised and unsupervised learning are common approaches for designing text detoxification solu… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 7 pages, 8 tables. Published in: 2023 International Conference on Machine Learning and Applications (ICMLA)

  23. arXiv:2403.00986  [pdf, other

    cs.CL cs.AI cs.LG

    Merging Text Transformer Models from Different Initializations

    Authors: Neha Verma, Maha Elbayad

    Abstract: Recent work on permutation-based model merging has shown impressive low- or zero-barrier mode connectivity between models from completely different initializations. However, this line of work has not yet extended to the Transformer architecture, despite its dominant popularity in the language domain. Therefore, in this work, we investigate the extent to which separate Transformer minima learn simi… ▽ More

    Submitted 16 December, 2024; v1 submitted 1 March, 2024; originally announced March 2024.

    Comments: TMLR, November 2024

  24. arXiv:2402.09957  [pdf, other

    cs.LG eess.SP

    On Designing Features for Condition Monitoring of Rotating Machines

    Authors: Seetaram Maurya, Nishchal K. Verma

    Abstract: Various methods for designing input features have been proposed for fault recognition in rotating machines using one-dimensional raw sensor data. The available methods are complex, rely on empirical approaches, and may differ depending on the condition monitoring data used. Therefore, this article proposes a novel algorithm to design input features that unifies the feature extraction process for d… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  25. Code-Based Single-Server Private Information Retrieval: Circumventing the Sub-Query Attack

    Authors: Neehar Verma, Camilla Hollanti

    Abstract: Private information retrieval from a single server is considered, utilizing random linear codes. Presented is a modified version of the first code-based single-server computational PIR scheme proposed by Holzbaur, Hollanti, and Wachter-Zeh in [Holzbaur et al., "Computational Code-Based Single-Server Private Information Retrieval", 2020 IEEE ISIT]. The original scheme was broken in [Bordage et al.,… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: The scheme proposed in this work is a modified version of the scheme in arXiv:2001.07049 (IEEE ISIT 2020) and provides a mend against the attack discovered in arXiv:2004.00509 (Cryptography and Communications, 2021)

  26. arXiv:2307.05616  [pdf, other

    cs.CV

    Image Reconstruction using Enhanced Vision Transformer

    Authors: Nikhil Verma, Deepkamal Kaur, Lydia Chau

    Abstract: Removing noise from images is a challenging and fundamental problem in the field of computer vision. Images captured by modern cameras are inevitably degraded by noise which limits the accuracy of any quantitative measurements on those images. In this project, we propose a novel image reconstruction framework which can be used for tasks such as image denoising, deblurring or inpainting. The model… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

  27. arXiv:2307.04978  [pdf, other

    cs.CV

    Diffusion idea exploration for art generation

    Authors: Nikhil Verma

    Abstract: Cross-Modal learning tasks have picked up pace in recent times. With plethora of applications in diverse areas, generation of novel content using multiple modalities of data has remained a challenging problem. To address the same, various generative modelling techniques have been proposed for specific tasks. Novel and creative image generation is one important aspect for industrial application whi… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: Report Submitted for degree completion of Master of Science in Applied Computing at University of Toronto

  28. arXiv:2306.08221  [pdf, other

    cs.CL

    Contrastive Loss is All You Need to Recover Analogies as Parallel Lines

    Authors: Narutatsu Ri, Fei-Tzin Lee, Nakul Verma

    Abstract: While static word embedding models are known to represent linguistic analogies as parallel lines in high-dimensional space, the underlying mechanism as to why they result in such geometric structures remains obscure. We find that an elementary contrastive-style method employed over distributional information performs competitively with popular word embedding models on analogy recovery tasks, while… ▽ More

    Submitted 13 June, 2023; originally announced June 2023.

  29. A Novel Vision Transformer with Residual in Self-attention for Biomedical Image Classification

    Authors: Arun K. Sharma, Nishchal K. Verma

    Abstract: Biomedical image classification requires capturing of bio-informatics based on specific feature distribution. In most of such applications, there are mainly challenges due to limited availability of samples for diseased cases and imbalanced nature of dataset. This article presents the novel framework of multi-head self-attention for vision transformer (ViT) which makes capable of capturing the spe… ▽ More

    Submitted 5 June, 2023; v1 submitted 2 June, 2023; originally announced June 2023.

    Journal ref: Pattern Recognition, Volume 172, Part B, April 2026, 112497

  30. arXiv:2305.14280  [pdf, other

    cs.CL

    Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer

    Authors: Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post

    Abstract: We introduce and demonstrate how to effectively train multilingual machine translation models with pixel representations. We experiment with two different data settings with a variety of language and script coverage, demonstrating improved performance compared to subword embeddings. We explore various properties of pixel representations such as parameter sharing within and across scripts to better… ▽ More

    Submitted 24 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: EMNLP 2023

  31. arXiv:2305.14230  [pdf, other

    cs.CL

    Exploring Representational Disparities Between Multilingual and Bilingual Translation Models

    Authors: Neha Verma, Kenton Murray, Kevin Duh

    Abstract: Multilingual machine translation has proven immensely useful for both parameter efficiency and overall performance across many language pairs via complete multilingual parameter sharing. However, some language pairs in multilingual models can see worse performance than in bilingual models, especially in the one-to-many translation setting. Motivated by their empirical differences, we examine the g… ▽ More

    Submitted 26 March, 2024; v1 submitted 23 May, 2023; originally announced May 2023.

    Comments: LREC-COLING 2024

  32. arXiv:2305.07552  [pdf, other

    cs.CV cs.AI cs.CY

    Dish detection in food platters: A framework for automated diet logging and nutrition management

    Authors: Mansi Goel, Shashank Dargar, Shounak Ghatak, Nidhi Verma, Pratik Chauhan, Anushka Gupta, Nikhila Vishnumolakala, Hareesh Amuru, Ekta Gambhir, Ronak Chhajed, Meenal Jain, Astha Jain, Samiksha Garg, Nitesh Narwade, Nikhilesh Verhwani, Abhuday Tiwari, Kirti Vashishtha, Ganesh Bagler

    Abstract: Diet is central to the epidemic of lifestyle disorders. Accurate and effortless diet logging is one of the significant bottlenecks for effective diet management and calorie restriction. Dish detection from food platters is a challenging problem due to a visually complex food layout. We present an end-to-end computational framework for diet management, from data compilation, annotation, and state-o… ▽ More

    Submitted 12 May, 2023; originally announced May 2023.

    Comments: 11 pages, 5 figures, 5 tables. Submitted to the 8th International Conference on Computer Vision & Image Processing (CVIP-2023)

    ACM Class: I.4.9; I.5.4; J.3

  33. arXiv:2303.01676  [pdf, other

    cs.RO

    eViper: A Scalable Platform for Untethered Modular Soft Robots

    Authors: Hsin Cheng, Zhiwu Zheng, Prakhar Kumar, Wali Afridi, Ben Kim, Sigurd Wagner, Naveen Verma, James C. Sturm, Minjie Chen

    Abstract: Soft robots present unique capabilities, but have been limited by the lack of scalable technologies for construction and the complexity of algorithms for efficient control and motion, which depend on soft-body dynamics, high-dimensional actuation patterns, and external/on-board forces. This paper presents scalable methods and platforms to study the impact of weight distribution and actuation patte… ▽ More

    Submitted 14 November, 2023; v1 submitted 2 March, 2023; originally announced March 2023.

    Comments: 8 pages, 21 figures, accepted by IROS 2023

  34. arXiv:2210.05098  [pdf, other

    cs.CL cs.LG

    IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces

    Authors: Kelly Marchisio, Neha Verma, Kevin Duh, Philipp Koehn

    Abstract: The ability to extract high-quality translation dictionaries from monolingual word embedding spaces depends critically on the geometric similarity of the spaces -- their degree of "isomorphism." We address the root-cause of faulty cross-lingual mapping: that word embedding training resulted in the underlying spaces being non-isomorphic. We incorporate global measures of isomorphism directly into t… ▽ More

    Submitted 4 July, 2023; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: Updated EMNLP2022 Camera Ready (citation correction, removed references to dimensionality reduction [was not used here].)

  35. arXiv:2209.04528  [pdf, other

    cs.LG

    Improving Model Training via Self-learned Label Representations

    Authors: Xiao Yu, Nakul Verma

    Abstract: Modern neural network architectures have shown remarkable success in several large-scale classification and prediction tasks. Part of the success of these architectures is their flexibility to transform the data from the raw input representations (e.g. pixels for vision tasks, or text for natural language processing tasks) to one-hot output encoding. While much of the work has focused on studying… ▽ More

    Submitted 9 September, 2022; originally announced September 2022.

  36. Wirelessly-Controlled Untethered Piezoelectric Planar Soft Robot Capable of Bidirectional Crawling and Rotation

    Authors: Zhiwu Zheng, Hsin Cheng, Prakhar Kumar, Sigurd Wagner, Minjie Chen, Naveen Verma, James C. Sturm

    Abstract: Electrostatic actuators provide a promising approach to creating soft robotic sheets, due to their flexible form factor, modular integration, and fast response speed. However, their control requires kilo-Volt signals and understanding of complex dynamics resulting from force interactions by on-board and environmental effects. In this work, we demonstrate an untethered planar five-actuator piezoele… ▽ More

    Submitted 19 January, 2023; v1 submitted 1 July, 2022; originally announced July 2022.

    Comments: Accepted to the 2023 IEEE International Conference on Robotics and Automation (ICRA)

    Journal ref: 2023 IEEE International Conference on Robotics and Automation (ICRA), 641-647

  37. Model-Based Control of Planar Piezoelectric Inchworm Soft Robot for Crawling in Constrained Environments

    Authors: Zhiwu Zheng, Prakhar Kumar, Yenan Chen, Hsin Cheng, Sigurd Wagner, Minjie Chen, Naveen Verma, James C. Sturm

    Abstract: Soft robots have drawn significant attention recently for their ability to achieve rich shapes when interacting with complex environments. However, their elasticity and flexibility compared to rigid robots also pose significant challenges for precise and robust shape control in real-time. Motivated by their potential to operate in highly-constrained environments, as in search-and-rescue operations… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

    Comments: Accepted to the 2022 IEEE 5th International Conference on Soft Robotics (RoboSoft). Project website: https://piezorobotcontroller.github.io/ Summary video: https://youtu.be/Md-Uo-pUaIs

    Journal ref: 2022 IEEE 5th International Conference on Soft Robotics (RoboSoft), 693-698

  38. Scalable Simulation and Demonstration of Jumping Piezoelectric 2-D Soft Robots

    Authors: Zhiwu Zheng, Prakhar Kumar, Yenan Chen, Hsin Cheng, Sigurd Wagner, Minjie Chen, Naveen Verma, James C. Sturm

    Abstract: Soft robots have drawn great interest due to their ability to take on a rich range of shapes and motions, compared to traditional rigid robots. However, the motions, and underlying statics and dynamics, pose significant challenges to forming well-generalized and robust models necessary for robot design and control. In this work, we demonstrate a five-actuator soft robot capable of complex motions… ▽ More

    Submitted 27 February, 2022; originally announced February 2022.

    Comments: Accepted to the International Conference on Robotics and Automation (ICRA) 2022. Video: https://youtu.be/nHcH3V7rCrk

    Journal ref: 2022 International Conference on Robotics and Automation (ICRA), 2022, pp. 5199-5204

  39. A Neural Network Solves, Explains, and Generates University Math Problems by Program Synthesis and Few-Shot Learning at Human Level

    Authors: Iddo Drori, Sarah Zhang, Reece Shuttleworth, Leonard Tang, Albert Lu, Elizabeth Ke, Kevin Liu, Linda Chen, Sunny Tran, Newman Cheng, Roman Wang, Nikhil Singh, Taylor L. Patti, Jayson Lynch, Avi Shporer, Nakul Verma, Eugene Wu, Gilbert Strang

    Abstract: We demonstrate that a neural network pre-trained on text and fine-tuned on code solves mathematics course problems, explains solutions, and generates new questions at a human level. We automatically synthesize programs using few-shot learning and OpenAI's Codex transformer and execute them to solve course problems at 81% automatic accuracy. We curate a new dataset of questions from MIT's largest m… ▽ More

    Submitted 30 May, 2022; v1 submitted 31 December, 2021; originally announced December 2021.

    Comments: 181 pages, 8 figures, 280 tables

  40. arXiv:2112.11024  [pdf, ps, other

    cs.CR cs.CY cs.DC

    Reputation-based PoS for the Restriction of Illicit Activities on Blockchain: Algorand Usecase

    Authors: Mayank Pandey, Rachit Agarwal, Sandeep Kumar Shukla, Nishchal Kumar Verma

    Abstract: In cryptocurrency-based permissionless blockchain networks, the decentralized structure enables any user to join and operate across different regions. The criminal entities exploit it by using cryptocurrency transactions on the blockchain to facilitate activities such as money laundering, gambling, and ransomware attacks. In recent times, different machine learning-based techniques can detect such… ▽ More

    Submitted 16 August, 2025; v1 submitted 21 December, 2021; originally announced December 2021.

  41. arXiv:2112.08933  [pdf, other

    cs.LG cs.AI

    Responsive parallelized architecture for deploying deep learning models in production environments

    Authors: Nikhil Verma, Krishna Prasad

    Abstract: Recruiters can easily shortlist candidates for jobs via viewing their curriculum vitae (CV) document. Unstructured document CV beholds candidate's portfolio and named entities listing details. The main aim of this study is to design and propose a web oriented, highly responsive, computational pipeline that systematically predicts CV entities using hierarchically-refined label attention networks. D… ▽ More

    Submitted 10 July, 2023; v1 submitted 14 December, 2021; originally announced December 2021.

    Comments: 20 Pages

  42. arXiv:2111.13993  [pdf, other

    cs.CL

    An analysis of document graph construction methods for AMR summarization

    Authors: Fei-Tzin Lee, Chris Kedzie, Nakul Verma, Kathleen McKeown

    Abstract: Meaning Representation (AMR) is a graph-based semantic representation for sentences, composed of collections of concepts linked by semantic relations. AMR-based approaches have found success in a variety of applications, but a challenge to using it in tasks that require document-level context is that it only represents individual sentences. Prior work in AMR-based summarization has automatically m… ▽ More

    Submitted 27 November, 2021; originally announced November 2021.

  43. arXiv:2111.08267  [pdf, other

    cs.LG cs.AI cs.CL cs.PL

    Solving Probability and Statistics Problems by Program Synthesis

    Authors: Leonard Tang, Elizabeth Ke, Nikhil Singh, Nakul Verma, Iddo Drori

    Abstract: We solve university level probability and statistics questions by program synthesis using OpenAI's Codex, a Transformer trained on text and fine-tuned on code. We transform course problems from MIT's 18.05 Introduction to Probability and Statistics and Harvard's STAT110 Probability into programming tasks. We then execute the generated code to get a solution. Since these course questions are ground… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

    Comments: 33 pages, 4 figures

  44. arXiv:2111.08171  [pdf, other

    cs.LG cs.AI cs.CL

    Solving Linear Algebra by Program Synthesis

    Authors: Iddo Drori, Nakul Verma

    Abstract: We solve MIT's Linear Algebra 18.06 course and Columbia University's Computational Linear Algebra COMS3251 courses with perfect accuracy by interactive program synthesis. This surprisingly strong result is achieved by turning the course questions into programming tasks and then running the programs to produce the correct answers. We use OpenAI Codex with zero-shot learning, without providing any e… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: 32 pages, 3 figures

  45. arXiv:2111.06885  [pdf, other

    cs.NE eess.SY

    Guided Sampling-based Evolutionary Deep Neural Network for Intelligent Fault Diagnosis

    Authors: Arun K. Sharma, Nishchal K. Verma

    Abstract: The diagnostic performance of most of the deep learning models is greatly affected by the selection of model architecture and hyperparameters. Manual selection of model architecture is not feasible as training and evaluating the different architectures of deep learning models is a time-consuming process. Therefore, we have proposed a novel framework of evolutionary deep neural network which uses p… ▽ More

    Submitted 23 February, 2022; v1 submitted 12 November, 2021; originally announced November 2021.

  46. Piezoelectric Soft Robot Inchworm Motion by Tuning Ground Friction through Robot Shape: Quasi-Static Modeling and Experimental Validation

    Authors: Zhiwu Zheng, Prakhar Kumar, Yenan Chen, Hsin Cheng, Sigurd Wagner, Minjie Chen, Naveen Verma, James C. Sturm

    Abstract: Electrically-driven soft robots based on piezoelectric actuators may enable compact form factors and maneuverability in complex environments. In most prior work, piezoelectric actuators are used to control a single degree of freedom. In this work, the coordinated activation of five independent piezoelectric actuators, attached to a common metal foil, is used to implement inchworm-inspired crawling… ▽ More

    Submitted 14 December, 2023; v1 submitted 1 November, 2021; originally announced November 2021.

    Comments: Accepted to IEEE Transactions on Robotics

    Journal ref: IEEE Transactions on Robotics. 2024 Jan 11

  47. arXiv:2109.13479  [pdf, other

    eess.SP cs.AI eess.SY math.OC

    Knowledge Transfer based Evolutionary Deep Neural Network for Intelligent Fault Diagnosis

    Authors: Arun K. Sharma, Nishchal K. Verma

    Abstract: A faster response with commendable accuracy in intelligent systems is essential for the reliability and smooth operations of industrial machines. Two main challenges affect the design of such intelligent systems: (i) the selection of a suitable model and (ii) domain adaptation if there is a continuous change in operating conditions. Therefore, we propose an evolutionary Net2Net transformation (Evo… ▽ More

    Submitted 21 March, 2025; v1 submitted 28 September, 2021; originally announced September 2021.

    Comments: Submitted to IEEE Transactions on Sustainable Computing

  48. arXiv:2107.02975  [pdf, other

    cs.CL cs.AI

    Neural Natural Language Processing for Unstructured Data in Electronic Health Records: a Review

    Authors: Irene Li, Jessica Pan, Jeremy Goldwasser, Neha Verma, Wai Pan Wong, Muhammed Yavuz Nuzumlalı, Benjamin Rosand, Yixin Li, Matthew Zhang, David Chang, R. Andrew Taylor, Harlan M. Krumholz, Dragomir Radev

    Abstract: Electronic health records (EHRs), digital collections of patient healthcare events and observations, are ubiquitous in medicine and critical to healthcare delivery, operations, and research. Despite this central role, EHRs are notoriously difficult to process automatically. Well over half of the information stored within EHRs is in the form of unstructured text (e.g. provider notes, operation repo… ▽ More

    Submitted 6 July, 2021; originally announced July 2021.

    Comments: 33 pages, 11 figures

    MSC Class: 68T50 ACM Class: I.2.7

  49. arXiv:2106.11182  [pdf, other

    cs.LG cs.AI

    On fine-tuning of Autoencoders for Fuzzy rule classifiers

    Authors: Rahul Kumar Sevakula, Nishchal Kumar Verma, Hisao Ishibuchi

    Abstract: Recent discoveries in Deep Neural Networks are allowing researchers to tackle some very complex problems such as image classification and audio classification, with improved theoretical and empirical justifications. This paper presents a novel scheme to incorporate the use of autoencoders in Fuzzy rule classifiers (FRC). Autoencoders when stacked can learn the complex non-linear relationships amon… ▽ More

    Submitted 21 June, 2021; originally announced June 2021.

  50. arXiv:2104.00369  [pdf, other

    cs.CL

    FeTaQA: Free-form Table Question Answering

    Authors: Linyong Nan, Chiachun Hsieh, Ziming Mao, Xi Victoria Lin, Neha Verma, Rui Zhang, Wojciech Kryściński, Nick Schoelkopf, Riley Kong, Xiangru Tang, Murori Mutuma, Ben Rosand, Isabel Trindade, Renusree Bandaru, Jacob Cunningham, Caiming Xiong, Dragomir Radev

    Abstract: Existing table question answering datasets contain abundant factual questions that primarily evaluate the query and schema comprehension capability of a system, but they fail to include questions that require complex reasoning and integration of information due to the constraint of the associated short-form answers. To address these issues and to demonstrate the full challenge of table question an… ▽ More

    Submitted 1 April, 2021; originally announced April 2021.