Skip to main content

Showing 1–24 of 24 results for author: White, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2510.25087  [pdf, ps, other

    cs.CL cs.LG

    BioCoref: Benchmarking Biomedical Coreference Resolution with LLMs

    Authors: Nourah M Salem, Elizabeth White, Michael Bada, Lawrence Hunter

    Abstract: Coreference resolution in biomedical texts presents unique challenges due to complex domain-specific terminology, high ambiguity in mention forms, and long-distance dependencies between coreferring expressions. In this work, we present a comprehensive evaluation of generative large language models (LLMs) for coreference resolution in the biomedical domain. Using the CRAFT corpus as our benchmark,… ▽ More

    Submitted 28 October, 2025; originally announced October 2025.

  2. arXiv:2510.25055  [pdf, ps, other

    cs.CL cs.AI cs.LG

    GAPMAP: Mapping Scientific Knowledge Gaps in Biomedical Literature Using Large Language Models

    Authors: Nourah M Salem, Elizabeth White, Michael Bada, Lawrence Hunter

    Abstract: Scientific progress is driven by the deliberate articulation of what remains unknown. This study investigates the ability of large language models (LLMs) to identify research knowledge gaps in the biomedical literature. We define two categories of knowledge gaps: explicit gaps, clear declarations of missing knowledge; and implicit gaps, context-inferred missing knowledge. While prior work has focu… ▽ More

    Submitted 28 October, 2025; originally announced October 2025.

  3. arXiv:2509.16860  [pdf, ps, other

    cs.LG

    LVADNet3D: A Deep Autoencoder for Reconstructing 3D Intraventricular Flow from Sparse Hemodynamic Data

    Authors: Mohammad Abdul Hafeez Khan, Marcello Mattei Di Eugeni, Benjamin Diaz, Ruth E. White, Siddhartha Bhattacharyya, Venkat Keshav Chivukula

    Abstract: Accurate assessment of intraventricular blood flow is essential for evaluating hemodynamic conditions in patients supported by Left Ventricular Assist Devices (LVADs). However, clinical imaging is either incompatible with LVADs or yields sparse, low-quality velocity data. While Computational Fluid Dynamics (CFD) simulations provide high-fidelity data, they are computationally intensive and impract… ▽ More

    Submitted 20 September, 2025; originally announced September 2025.

    Comments: Accepted to International Conference on Machine Learning and Applications (ICMLA), 6 pages, 4 figure, 3 tables

  4. arXiv:2508.07514  [pdf, ps, other

    cs.CV cs.AI

    Mitigating Domain Drift in Multi Species Segmentation with DINOv2: A Cross-Domain Evaluation in Herbicide Research Trials

    Authors: Artzai Picon, Itziar Eguskiza, Daniel Mugica, Javier Romero, Carlos Javier Jimenez, Eric White, Gabriel Do-Lago-Junqueira, Christian Klukas, Ramon Navarra-Mestre

    Abstract: Reliable plant species and damage segmentation for herbicide field research trials requires models that can withstand substantial real-world variation across seasons, geographies, devices, and sensing modalities. Most deep learning approaches trained on controlled datasets fail to generalize under these domain shifts, limiting their suitability for operational phenotyping pipelines. This study eva… ▽ More

    Submitted 10 April, 2026; v1 submitted 10 August, 2025; originally announced August 2025.

  5. arXiv:2505.23883  [pdf, ps, other

    cs.CV cs.CL cs.LG

    BioCLIP 2: Emergent Properties from Scaling Hierarchical Contrastive Learning

    Authors: Jianyang Gu, Samuel Stevens, Elizabeth G Campolongo, Matthew J Thompson, Net Zhang, Jiaman Wu, Andrei Kopanev, Zheda Mai, Alexander E. White, James Balhoff, Wasila Dahdul, Daniel Rubenstein, Hilmar Lapp, Tanya Berger-Wolf, Wei-Lun Chao, Yu Su

    Abstract: Foundation models trained at scale exhibit remarkable emergent behaviors, learning new capabilities beyond their initial training objectives. We find such emergent behaviors in biological vision models via large-scale contrastive vision-language training. To achieve this, we first curate TreeOfLife-200M, comprising 214 million images of living organisms, the largest and most diverse biological org… ▽ More

    Submitted 23 October, 2025; v1 submitted 29 May, 2025; originally announced May 2025.

    Comments: NeurIPS 2025 Spotlight; Project page: https://imageomics.github.io/bioclip-2/

  6. arXiv:2505.06484  [pdf, ps, other

    cs.SE

    10 quick tips for making your software outlive your job

    Authors: Richard Littauer, Greg Wilson, Jan Ainali, Eman Abdullah AlOmar, Sylwester Arabas, Yanina Bellini Saibene, Kris Bubendorfer, Kaylea Champion, Clare Dillon, Jouni Helske, Pieter Huybrechts, Daniel S. Katz, Chang Liao, David Lippert, Fang Liu, Pierre Marshall, Daniel R. McCloy, Ian McInerney, Mohamed Wiem Mkaouer, Priyanka Ojha, Christoph Treude, Ethan P. White

    Abstract: Loss of key personnel has always been a risk for research software projects. Key members of the team may have to step away due to illness or burnout, to care for a family member, from a loss of financial support, or because their career is going in a new direction. Today, though, political and financial changes are putting large numbers of researchers out of work simultaneously, potentially leavin… ▽ More

    Submitted 28 October, 2025; v1 submitted 9 May, 2025; originally announced May 2025.

    Comments: 14 pages; 1 figure. New version uploaded to remove inaccurate, most likely hallucinated (!) refs in Rule 7, which we did not catch or suspect on previous submission

  7. arXiv:2412.02866  [pdf, ps, other

    math.CO cs.DM

    A note on the no-$(d+2)$-on-a-sphere problem

    Authors: Andrew Suk, Ethan Patrick White

    Abstract: For fixed $d\geq 3$, we construct subsets of the $d$-dimensional lattice cube $[n]^d$ of size $n^{\frac{3}{d + 1} - o(1)}$ with no $d+2$ points on a sphere or a hyperplane. This improves the previously best known bound of $Ω(n^{\frac{1}{d-1}})$ due to Thiele from 1995.

    Submitted 3 December, 2024; originally announced December 2024.

    Comments: 9 pages

    MSC Class: 52C35; 52C10; 05D40

  8. arXiv:2404.16436  [pdf

    cs.SD cs.AI cs.LG eess.AS

    Leveraging tropical reef, bird and unrelated sounds for superior transfer learning in marine bioacoustics

    Authors: Ben Williams, Bart van Merriënboer, Vincent Dumoulin, Jenny Hamer, Eleni Triantafillou, Abram B. Fleishman, Matthew McKown, Jill E. Munger, Aaron N. Rice, Ashlee Lillis, Clemency E. White, Catherine A. D. Hobbs, Tries B. Razak, Kate E. Jones, Tom Denton

    Abstract: Machine learning has the potential to revolutionize passive acoustic monitoring (PAM) for ecological assessments. However, high annotation and compute costs limit the field's efficacy. Generalizable pretrained networks can overcome these costs, but high-quality pretraining requires vast annotated libraries, limiting its current applicability primarily to bird taxa. Here, we identify the optimum pr… ▽ More

    Submitted 7 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

    Comments: 18 pages, 5 figures

  9. arXiv:2404.02369  [pdf, ps, other

    math.CO cs.DM

    Grid-drawings of graphs in three-dimensions

    Authors: Jozsef Balogh, Ethan Patrick White

    Abstract: Using probabilistic methods, we obtain grid-drawings of graphs without crossings with low volume and small aspect ratio. We show that every $D$-degenerate graph on $n$ vertices can be drawn in $[m]^3$ where $m^3 = O(D^2 n\log n)$. In particular, every graph of bounded maximum degree can be drawn in a grid with volume $O(n \log n)$.

    Submitted 14 June, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

    MSC Class: 68R10

  10. arXiv:2404.01845  [pdf, other

    cs.HC

    Unmasking the Nuances of Loneliness: Using Digital Biomarkers to Understand Social and Emotional Loneliness in College Students

    Authors: Malik Muhammad Qirtas, Evi Zafeirid, Dirk Pesch, Eleanor Bantry White

    Abstract: Background: Loneliness among students is increasing across the world, with potential consequences for mental health and academic success. To address this growing problem, accurate methods of detection are needed to identify loneliness and to differentiate social and emotional loneliness so that intervention can be personalized to individual need. Passive sensing technology provides a unique techni… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  11. arXiv:2403.05530  [pdf, other

    cs.CL cs.AI

    Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

    Authors: Gemini Team, Petko Georgiev, Ving Ian Lei, Ryan Burnell, Libin Bai, Anmol Gulati, Garrett Tanzer, Damien Vincent, Zhufeng Pan, Shibo Wang, Soroosh Mariooryad, Yifan Ding, Xinyang Geng, Fred Alcober, Roy Frostig, Mark Omernick, Lexi Walker, Cosmin Paduraru, Christina Sorokin, Andrea Tacchetti, Colin Gaffney, Samira Daruki, Olcan Sercinoglu, Zach Gleicher, Juliette Love , et al. (1112 additional authors not shown)

    Abstract: In this report, we introduce the Gemini 1.5 family of models, representing the next generation of highly compute-efficient multimodal models capable of recalling and reasoning over fine-grained information from millions of tokens of context, including multiple long documents and hours of video and audio. The family includes two new models: (1) an updated Gemini 1.5 Pro, which exceeds the February… ▽ More

    Submitted 16 December, 2024; v1 submitted 8 March, 2024; originally announced March 2024.

  12. arXiv:2402.05698  [pdf, other

    cs.HC

    Evolving AI for Wellness: Dynamic and Personalized Real-time Loneliness Detection Using Passive Sensing

    Authors: Malik Muhammad Qirtas, Evi Zafeiridi, Eleanor Bantry White, Dirk Pesch

    Abstract: Loneliness is a growing health concern as it can lead to depression and other associated mental health problems for people who experience feelings of loneliness over prolonged periods of time. Utilizing passive sensing methods that use smartphone and wearable sensor data to capture daily behavioural patterns offers a promising approach for the early detection of loneliness. Given the subjective na… ▽ More

    Submitted 8 February, 2024; originally announced February 2024.

  13. arXiv:2312.12610  [pdf, other

    physics.plasm-ph cs.LG physics.comp-ph

    Enhancing predictive capabilities in fusion burning plasmas through surrogate-based optimization in core transport solvers

    Authors: P. Rodriguez-Fernandez, N. T. Howard, A. Saltzman, S. Kantamneni, J. Candy, C. Holland, M. Balandat, S. Ament, A. E. White

    Abstract: This work presents the PORTALS framework, which leverages surrogate modeling and optimization techniques to enable the prediction of core plasma profiles and performance with nonlinear gyrokinetic simulations at significantly reduced cost, with no loss of accuracy. The efficiency of PORTALS is benchmarked against standard methods, and its full potential is demonstrated on a unique, simultaneous 5-… ▽ More

    Submitted 9 April, 2024; v1 submitted 19 December, 2023; originally announced December 2023.

  14. arXiv:2312.11805  [pdf, other

    cs.CL cs.AI cs.CV

    Gemini: A Family of Highly Capable Multimodal Models

    Authors: Gemini Team, Rohan Anil, Sebastian Borgeaud, Jean-Baptiste Alayrac, Jiahui Yu, Radu Soricut, Johan Schalkwyk, Andrew M. Dai, Anja Hauth, Katie Millican, David Silver, Melvin Johnson, Ioannis Antonoglou, Julian Schrittwieser, Amelia Glaese, Jilin Chen, Emily Pitler, Timothy Lillicrap, Angeliki Lazaridou, Orhan Firat, James Molloy, Michael Isard, Paul R. Barham, Tom Hennigan, Benjamin Lee , et al. (1326 additional authors not shown)

    Abstract: This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultr… ▽ More

    Submitted 9 May, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

  15. arXiv:2312.11723  [pdf, ps, other

    math.CO cs.IT

    Improving Uniquely Decodable Codes in Binary Adder Channels

    Authors: József Balogh, The Nguyen, Patric R. J. Ostergard, Ethan Patrick White, Michael Wigal

    Abstract: We present a general method to modify existing uniquely decodable codes in the $T$-user binary adder channel. If at least one of the original constituent codes does not have average weight exactly half of the dimension, then our method produces a new set of constituent codes in a higher dimension, with a strictly higher rate. Using our method we improve the highest known rate for the $T$-user bina… ▽ More

    Submitted 10 November, 2025; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: 8 pages

    MSC Class: 05D40; 05C65; 05D05; 94A40; 05B10

  16. On Axial Symmetry in Convex Bodies

    Authors: Ritesh Goenka, Kenneth Moore, Wen Rui Sun, Ethan Patrick White

    Abstract: For a two-dimensional convex body, the Kovner-Besicovitch measure of symmetry is defined as the volume ratio of the largest centrally symmetric body contained inside the body to the original body. A classical result states that the Kovner-Besicovitch measure is at least $2/3$ for every convex body and equals $2/3$ for triangles. Lassak showed that an alternative measure of symmetry, i.e., symmetry… ▽ More

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: 26 pages, 14 figures

    MSC Class: 52A10; 52A38 (Primary) 52A20; 52A41 (Secondary)

    Journal ref: Pacific J. Math. 341 (2026) 275-303

  17. arXiv:2308.15509  [pdf

    cs.HC

    The Relationship between Loneliness and Depression among College Students: Mining data derived from Passive Sensing

    Authors: Malik Muhammad Qirtas, Evi Zafeiridi, Eleanor Bantry White, Dirk Pesch

    Abstract: Loneliness and depression are interrelated mental health issues affecting students well-being. Using passive sensing data provides a novel approach to examine the granular behavioural indicators differentiating loneliness and depression, and the mediators in their relationship. This study aimed to investigate associations between behavioural features and loneliness and depression among students, e… ▽ More

    Submitted 29 August, 2023; originally announced August 2023.

    Comments: N/A

  18. arXiv:2304.03920  [pdf, other

    math.CO cs.CG math.MG

    Improved estimates on the number of unit perimeter triangles

    Authors: Ritesh Goenka, Kenneth Moore, Ethan Patrick White

    Abstract: We obtain new upper and lower bounds on the number of unit perimeter triangles spanned by points in the plane. We also establish improved bounds in the special case where the point set is a section of the integer grid.

    Submitted 8 April, 2023; originally announced April 2023.

    Comments: 8 pages, 1 figure

    MSC Class: 52C10

    Journal ref: Discrete Comput. Geom. 73 (2025), no. 3, 850--858

  19. arXiv:2212.07527  [pdf

    cs.CV

    Plastic Contaminant Detection in Aerial Imagery of Cotton Fields with Deep Learning

    Authors: Pappu Kumar Yadav, J. Alex Thomasson, Robert G. Hardin, Stephen W. Searcy, Ulisses Braga-Neto, Sorin C. Popescu, Roberto Rodriguez, Daniel E Martin, Juan Enciso, Karem Meza, Emma L. White

    Abstract: Plastic shopping bags that get carried away from the side of roads and tangled on cotton plants can end up at cotton gins if not removed before the harvest. Such bags may not only cause problem in the ginning process but might also get embodied in cotton fibers reducing its quality and marketable value. Therefore, it is required to detect, locate, and remove the bags before cotton is harvested. Ma… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: preprint

  20. arXiv:2105.02186  [pdf, other

    cs.CV

    RandCrowns: A Quantitative Metric for Imprecisely Labeled Tree Crown Delineation

    Authors: Dylan Stewart, Alina Zare, Sergio Marconi, Ben G. Weinstein, Ethan P. White, Sarah J. Graves, Stephanie A. Bohlman, Aditya Singh

    Abstract: Supervised methods for object delineation in remote sensing require labeled ground-truth data. Gathering sufficient high quality ground-truth data is difficult, especially when targets are of irregular shape or difficult to distinguish from background or neighboring objects. Tree crown delineation provides key information from remote sensing images for forestry, ecology, and management. However, t… ▽ More

    Submitted 20 October, 2021; v1 submitted 5 May, 2021; originally announced May 2021.

  21. arXiv:2005.09003  [pdf, ps, other

    math.CO cs.CG

    Combinatorics of intervals in the plane I: trapezoids

    Authors: Daniel Di Benedetto, Jozsef Solymosi, Ethan White

    Abstract: We study arrangements of intervals in $\mathbb{R}^2$ for which many pairs form trapezoids. We show that any set of intervals forming many trapezoids must have underlying algebraic structure, which we characterise. This leads to some unexpected examples of sets of intervals forming many trapezoids, where an important role is played by degree 2 curves.

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 14 pages, 11 figures

    MSC Class: 52C30; 05D40

  22. arXiv:1908.05841  [pdf, other

    physics.ao-ph cs.CV cs.LG physics.chem-ph physics.geo-ph

    Recurrent U-net: Deep learning to predict daily summertime ozone in the United States

    Authors: Tai-Long He, Dylan B. A. Jones, Binxuan Huang, Yuyang Liu, Kazuyuki Miyazaki, Zhe Jiang, E. Charlie White, Helen M. Worden, John R. Worden

    Abstract: We use a hybrid deep learning model to predict June-July-August (JJA) daily maximum 8-h average (MDA8) surface ozone concentrations in the US. A set of meteorological fields from the ERA-Interim reanalysis as well as monthly mean NO$_x$ emissions from the Community Emissions Data System (CEDS) inventory are selected as predictors. Ozone measurements from the US Environmental Protection Agency (EPA… ▽ More

    Submitted 16 August, 2019; originally announced August 2019.

  23. Best Practices for Scientific Computing

    Authors: Greg Wilson, D. A. Aruliah, C. Titus Brown, Neil P. Chue Hong, Matt Davis, Richard T. Guy, Steven H. D. Haddock, Katy Huff, Ian M. Mitchell, Mark Plumbley, Ben Waugh, Ethan P. White, Paul Wilson

    Abstract: Scientists spend an increasing amount of time building and using software. However, most scientists are never taught how to do this efficiently. As a result, many are unaware of tools and practices that would allow them to write more reliable and maintainable code with less effort. We describe a set of best practices for scientific software development that have solid foundations in research and e… ▽ More

    Submitted 26 September, 2013; v1 submitted 30 September, 2012; originally announced October 2012.

    Comments: 18 pages

    Journal ref: PLOS Biology 12(1): e1001745, Jan 2014

  24. arXiv:0908.0939  [pdf

    cs.LG

    Clustering for Improved Learning in Maze Traversal Problem

    Authors: Eddie White

    Abstract: The maze traversal problem (finding the shortest distance to the goal from any position in a maze) has been an interesting challenge in computational intelligence. Recent work has shown that the cellular simultaneous recurrent neural network (CSRN) can solve this problem for simple mazes. This thesis focuses on exploiting relevant information about the maze to improve learning and decrease the t… ▽ More

    Submitted 6 August, 2009; originally announced August 2009.

    Comments: 29 pages, 15 figures, Undergraduate Honors Thesis