Skip to main content

Showing 1–2 of 2 results for author: Barnard, L R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.01796  [pdf, other

    eess.AS cs.CL cs.LG

    Speech foundation models in healthcare: Effect of layer selection on pathological speech feature prediction

    Authors: Daniela A. Wiepert, Rene L. Utianski, Joseph R. Duffy, John L. Stricker, Leland R. Barnard, David T. Jones, Hugo Botha

    Abstract: Accurately extracting clinical information from speech is critical to the diagnosis and treatment of many neurological conditions. As such, there is interest in leveraging AI for automatic, objective assessments of clinical speech to facilitate diagnosis and treatment of speech disorders. We explore transfer learning using foundation models, focusing on the impact of layer selection for the downst… ▽ More

    Submitted 21 June, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted to INTERSPEECH 2024

  2. arXiv:2310.13010  [pdf, other

    eess.AS cs.AI

    Detecting Speech Abnormalities with a Perceiver-based Sequence Classifier that Leverages a Universal Speech Model

    Authors: Hagen Soltau, Izhak Shafran, Alex Ottenwess, Joseph R. JR Duffy, Rene L. Utianski, Leland R. Barnard, John L. Stricker, Daniela Wiepert, David T. Jones, Hugo Botha

    Abstract: We propose a Perceiver-based sequence classifier to detect abnormalities in speech reflective of several neurological disorders. We combine this classifier with a Universal Speech Model (USM) that is trained (unsupervised) on 12 million hours of diverse audio recordings. Our model compresses long sequences into a small set of class-specific latent representations and a factorized projection is use… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Journal ref: Proc. ASRU, 2023