Skip to main content

Showing 1–48 of 48 results for author: Tetreault, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2512.15653  [pdf, ps, other

    cs.CL

    Characterizing Mamba's Selective Memory using Auto-Encoders

    Authors: Tamanna Hossain, Robert L. Logan IV, Ganesh Jagadeesan, Sameer Singh, Joel Tetreault, Alejandro Jaimes

    Abstract: State space models (SSMs) are a promising alternative to transformers for language modeling because they use fixed memory during inference. However, this fixed memory usage requires some information loss in the hidden state when processing long sequences. While prior work has studied the sequence length at which this information loss occurs, it does not characterize the types of information SSM la… ▽ More

    Submitted 17 December, 2025; originally announced December 2025.

    Comments: AACL 2025. Oral Presentation

  2. arXiv:2510.23639  [pdf, ps, other

    cs.LG cs.AI q-bio.QM

    Integrating Genomics into Multimodal EHR Foundation Models

    Authors: Jonathan Amar, Edward Liu, Alessandra Breschi, Liangliang Zhang, Pouya Kheradpour, Sylvia Li, Lisa Soleymani Lehmann, Alessandro Giulianelli, Matt Edwards, Yugang Jia, David Nola, Raghav Mani, Pankaj Vats, Jesse Tetreault, T. J. Chen, Cory Y. McLean

    Abstract: This paper introduces an innovative Electronic Health Record (EHR) foundation model that integrates Polygenic Risk Scores (PRS) as a foundational data modality, moving beyond traditional EHR-only approaches to build more holistic health profiles. Leveraging the extensive and diverse data from the All of Us (AoU) Research Program, this multimodal framework aims to learn complex relationships betwee… ▽ More

    Submitted 14 November, 2025; v1 submitted 24 October, 2025; originally announced October 2025.

  3. arXiv:2507.15823  [pdf, ps, other

    cs.CL cs.AI cs.SI

    Operationalizing AI for Good: Spotlight on Deployment and Integration of AI Models in Humanitarian Work

    Authors: Anton Abilov, Ke Zhang, Hemank Lamba, Elizabeth M. Olson, Joel R. Tetreault, Alejandro Jaimes

    Abstract: Publications in the AI for Good space have tended to focus on the research and model development that can support high-impact applications. However, very few AI for Good papers discuss the process of deploying and collaborating with the partner organization, and the resulting real-world impact. In this work, we share details about the close collaboration with a humanitarian-to-humanitarian (H2H) o… ▽ More

    Submitted 21 July, 2025; originally announced July 2025.

  4. arXiv:2507.09474  [pdf, ps, other

    cs.CL

    The CoNLL-2013 Shared Task on Grammatical Error Correction

    Authors: Hwee Tou Ng, Siew Mei Wu, Yuanbin Wu, Christian Hadiwinoto, Joel Tetreault

    Abstract: The CoNLL-2013 shared task was devoted to grammatical error correction. In this paper, we give the task definition, present the data sets, and describe the evaluation metric and scorer used in the shared task. We also give an overview of the various approaches adopted by the participating teams, and present the evaluation results.

    Submitted 12 July, 2025; originally announced July 2025.

    Comments: 12 pages

    Journal ref: Proceedings of the Seventeenth Conference on Computational Natural Language Learning: Shared Task, 2013

  5. arXiv:2505.22327  [pdf, ps, other

    cs.CL cs.CY

    NLP for Social Good: A Survey and Outlook of Challenges, Opportunities, and Responsible Deployment

    Authors: Antonia Karamolegkou, Angana Borah, Eunjung Cho, Sagnik Ray Choudhury, Martina Galletti, Pranav Gupta, Oana Ignat, Priyanka Kargupta, Neema Kotonya, Hemank Lamba, Sun-Joo Lee, Arushi Mangla, Ishani Mondal, Fatima Zahra Moudakir, Deniz Nazarova, Poli Nemkova, Dina Pisarevskaya, Naquee Rizwan, Nazanin Sabri, Keenan Samway, Dominik Stammbach, Anna Steinberg, David Tomás, Steven R Wilson, Bowen Yi , et al. (8 additional authors not shown)

    Abstract: Natural language processing (NLP) now shapes many aspects of our world, yet its potential for positive social impact is underexplored. This paper surveys work in ``NLP for Social Good" (NLP4SG) across nine domains relevant to global development and risk agendas, summarizing principal tasks and challenges. We analyze ACL Anthology trends, finding that inclusion and AI harms attract the most researc… ▽ More

    Submitted 21 January, 2026; v1 submitted 28 May, 2025; originally announced May 2025.

    Comments: Accepted to EACL 2026

  6. arXiv:2412.13511  [pdf, other

    cs.CL

    CEHA: A Dataset of Conflict Events in the Horn of Africa

    Authors: Rui Bai, Di Lu, Shihao Ran, Elizabeth Olson, Hemank Lamba, Aoife Cahill, Joel Tetreault, Alex Jaimes

    Abstract: Natural Language Processing (NLP) of news articles can play an important role in understanding the dynamics and causes of violent conflict. Despite the availability of datasets categorizing various conflict events, the existing labels often do not cover all of the fine-grained violent conflict event types relevant to areas like the Horn of Africa. In this paper, we introduce a new benchmark datase… ▽ More

    Submitted 18 December, 2024; originally announced December 2024.

    Comments: Accepted by COLING 2025

  7. arXiv:2412.13098  [pdf, other

    cs.CL cs.SI

    Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election

    Authors: Roberto Mondini, Neema Kotonya, Robert L. Logan IV, Elizabeth M Olson, Angela Oduor Lungati, Daniel Duke Odongo, Tim Ombasa, Hemank Lamba, Aoife Cahill, Joel R. Tetreault, Alejandro Jaimes

    Abstract: Online reporting platforms have enabled citizens around the world to collectively share their opinions and report in real time on events impacting their local communities. Systematically organizing (e.g., categorizing by attributes) and geotagging large amounts of crowdsourced information is crucial to ensuring that accurate and meaningful insights can be drawn from this data and used by policy ma… ▽ More

    Submitted 17 December, 2024; originally announced December 2024.

    Comments: COLING 2025

  8. arXiv:2410.06370  [pdf, other

    cs.CL cs.AI cs.LG cs.SI

    HumVI: A Multilingual Dataset for Detecting Violent Incidents Impacting Humanitarian Aid

    Authors: Hemank Lamba, Anton Abilov, Ke Zhang, Elizabeth M. Olson, Henry k. Dambanemuya, João c. Bárcia, David S. Batista, Christina Wille, Aoife Cahill, Joel Tetreault, Alex Jaimes

    Abstract: Humanitarian organizations can enhance their effectiveness by analyzing data to discover trends, gather aggregated insights, manage their security risks, support decision-making, and inform advocacy and funding proposals. However, data about violent incidents with direct impact and relevance for humanitarian aid operations is not readily available. An automatic data collection and NLP-backed class… ▽ More

    Submitted 15 October, 2024; v1 submitted 8 October, 2024; originally announced October 2024.

  9. arXiv:2402.12276  [pdf, other

    cs.IR

    Explain then Rank: Scale Calibration of Neural Rankers Using Natural Language Explanations from LLMs

    Authors: Puxuan Yu, Daniel Cohen, Hemank Lamba, Joel Tetreault, Alex Jaimes

    Abstract: In search settings, calibrating the scores during the ranking process to quantities such as click-through rates or relevance levels enhances a system's usefulness and trustworthiness for downstream users. While previous research has improved this notion of calibration for low complexity learning-to-rank models, the larger data demands and parameter count specific to modern neural text rankers prod… ▽ More

    Submitted 26 August, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

  10. Dissecting users' needs for search result explanations

    Authors: Prerna Juneja, Wenjuan Zhang, Alison Marie Smith-Renner, Hemank Lamba, Joel Tetreault, Alex Jaimes

    Abstract: There is a growing demand for transparency in search engines to understand how search results are curated and to enhance users' trust. Prior research has introduced search result explanations with a focus on how to explain, assuming explanations are beneficial. Our study takes a step back to examine if search explanations are needed and when they are likely to provide benefits. Additionally, we su… ▽ More

    Submitted 23 February, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  11. arXiv:2311.00686  [pdf, ps, other

    cs.CL

    Little Giants: Exploring the Potential of Small LLMs as Evaluation Metrics in Summarization in the Eval4NLP 2023 Shared Task

    Authors: Neema Kotonya, Saran Krishnasamy, Joel Tetreault, Alejandro Jaimes

    Abstract: This paper describes and analyzes our participation in the 2023 Eval4NLP shared task, which focuses on assessing the effectiveness of prompt-based techniques to empower Large Language Models to handle the task of quality estimation, particularly in the context of evaluating machine translations and summaries. We conducted systematic experiments with various prompting techniques, including standard… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

    Comments: Eval4NLP 2023 Shared Task

  12. arXiv:2310.20633  [pdf, other

    cs.CL

    Defining a New NLP Playground

    Authors: Sha Li, Chi Han, Pengfei Yu, Carl Edwards, Manling Li, Xingyao Wang, Yi R. Fung, Charles Yu, Joel R. Tetreault, Eduard H. Hovy, Heng Ji

    Abstract: The recent explosion of performance of large language models (LLMs) has changed the field of Natural Language Processing (NLP) more abruptly and seismically than any other shift in the field's 80-year history. This has resulted in concerns that the field will become homogenized and resource-intensive. The new status quo has put many academic researchers, especially PhD students, at a disadvantage.… ▽ More

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: EMNLP Findings 2023 "Theme Track: Large Language Models and the Future of NLP"

  13. arXiv:2310.10706  [pdf, other

    cs.CL cs.AI

    Harnessing the Power of LLMs: Evaluating Human-AI Text Co-Creation through the Lens of News Headline Generation

    Authors: Zijian Ding, Alison Smith-Renner, Wenjuan Zhang, Joel R. Tetreault, Alejandro Jaimes

    Abstract: To explore how humans can best leverage LLMs for writing and how interacting with these models affects feelings of ownership and trust in the writing process, we compared common human-AI interaction types (e.g., guiding system, selecting from system outputs, post-editing outputs) in the context of LLM-assisted news headline generation. While LLMs alone can generate satisfactory news headlines, on… ▽ More

    Submitted 17 October, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

  14. arXiv:2307.05567  [pdf, other

    cs.CL

    Event Extraction as Question Generation and Answering

    Authors: Di Lu, Shihao Ran, Joel Tetreault, Alejandro Jaimes

    Abstract: Recent work on Event Extraction has reframed the task as Question Answering (QA), with promising results. The advantage of this approach is that it addresses the error propagation issue found in traditional token-based classification approaches by directly predicting event arguments without extracting candidates first. However, the questions are typically based on fixed templates and they rarely l… ▽ More

    Submitted 9 July, 2023; originally announced July 2023.

    Comments: Accepted to ACL 2023

  15. arXiv:2306.17695  [pdf, other

    cs.CL

    A New Task and Dataset on Detecting Attacks on Human Rights Defenders

    Authors: Shihao Ran, Di Lu, Joel Tetreault, Aoife Cahill, Alejandro Jaimes

    Abstract: The ability to conduct retrospective analyses of attacks on human rights defenders over time and by location is important for humanitarian organizations to better understand historical or ongoing human rights violations and thus better manage the global impact of such events. We hypothesize that NLP can support such efforts by quickly processing large collections of news articles to detect and sum… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

  16. arXiv:2305.01633  [pdf, other

    cs.CL

    Missing Information, Unresponsive Authors, Experimental Flaws: The Impossibility of Assessing the Reproducibility of Previous Human Evaluations in NLP

    Authors: Anya Belz, Craig Thomson, Ehud Reiter, Gavin Abercrombie, Jose M. Alonso-Moral, Mohammad Arvan, Anouck Braggaar, Mark Cieliebak, Elizabeth Clark, Kees van Deemter, Tanvi Dinkar, Ondřej Dušek, Steffen Eger, Qixiang Fang, Mingqi Gao, Albert Gatt, Dimitra Gkatzia, Javier González-Corbelle, Dirk Hovy, Manuela Hürlimann, Takumi Ito, John D. Kelleher, Filip Klubicka, Emiel Krahmer, Huiyuan Lai , et al. (17 additional authors not shown)

    Abstract: We report our efforts in identifying a set of previous human evaluations in NLP that would be suitable for a coordinated study examining what makes human evaluations in NLP more/less reproducible. We present our results and findings, which include that just 13\% of papers had (i) sufficiently low barriers to reproduction, and (ii) enough obtainable information, to be considered for reproduction, a… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 May, 2023; originally announced May 2023.

    Comments: 5 pages plus appendix, 4 tables, 1 figure. To appear at "Workshop on Insights from Negative Results in NLP" (co-located with EACL2023). Updated author list and acknowledgements

    MSC Class: 68 ACM Class: I.2.7

  17. arXiv:2301.10389  [pdf, other

    cs.IR

    Counterfactual Editing for Search Result Explanation

    Authors: Zhichao Xu, Hemank Lamba, Qingyao Ai, Joel Tetreault, Alex Jaimes

    Abstract: Search Result Explanation (SeRE) aims to improve search sessions' effectiveness and efficiency by helping users interpret documents' relevance. Existing works mostly focus on factual explanation, i.e. to find/generate supporting evidence about documents' relevance to search queries. However, research in cognitive sciences has shown that human explanations are contrastive i.e. people explain an obs… ▽ More

    Submitted 28 June, 2024; v1 submitted 24 January, 2023; originally announced January 2023.

    Comments: ICTIR 2024

  18. arXiv:2212.09955  [pdf, other

    cs.CL

    BUMP: A Benchmark of Unfaithful Minimal Pairs for Meta-Evaluation of Faithfulness Metrics

    Authors: Liang Ma, Shuyang Cao, Robert L. Logan IV, Di Lu, Shihao Ran, Ke Zhang, Joel Tetreault, Alejandro Jaimes

    Abstract: The proliferation of automatic faithfulness metrics for summarization has produced a need for benchmarks to evaluate them. While existing benchmarks measure the correlation with human judgements of faithfulness on model-generated summaries, they are insufficient for diagnosing whether metrics are: 1) consistent, i.e., indicate lower faithfulness as errors are introduced into a summary, 2) effectiv… ▽ More

    Submitted 4 June, 2023; v1 submitted 19 December, 2022; originally announced December 2022.

    Comments: Accepted as a long main conference paper at ACL 2023

  19. arXiv:2210.14190  [pdf, other

    cs.CL

    CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and Summarization

    Authors: Hossein Rajaby Faghihi, Bashar Alhafni, Ke Zhang, Shihao Ran, Joel Tetreault, Alejandro Jaimes

    Abstract: Social media has increasingly played a key role in emergency response: first responders can use public posts to better react to ongoing crisis events and deploy the necessary resources where they are most needed. Timeline extraction and abstractive summarization are critical technical tasks to leverage large numbers of social media posts about events. Unfortunately, there are few datasets for benc… ▽ More

    Submitted 25 October, 2022; originally announced October 2022.

  20. arXiv:2206.14863  [pdf

    cs.HC

    Mapping the Design Space of Human-AI Interaction in Text Summarization

    Authors: Ruijia Cheng, Alison Smith-Renner, Ke Zhang, Joel R. Tetreault, Alejandro Jaimes

    Abstract: Automatic text summarization systems commonly involve humans for preparing data or evaluating model performance, yet, there lacks a systematic understanding of humans' roles, experience, and needs when interacting with or being assisted by AI. From a human-centered perspective, we map the design opportunities and considerations for human-AI interaction in text summarization and broader text genera… ▽ More

    Submitted 29 June, 2022; originally announced June 2022.

  21. arXiv:2206.06383  [pdf, other

    cs.CL cs.AI cs.HC

    An Exploration of Post-Editing Effectiveness in Text Summarization

    Authors: Vivian Lai, Alison Smith-Renner, Ke Zhang, Ruijia Cheng, Wenjuan Zhang, Joel Tetreault, Alejandro Jaimes

    Abstract: Automatic summarization methods are efficient but can suffer from low quality. In comparison, manual summarization is expensive but produces higher quality. Can humans and AI collaborate to improve summarization performance? In similar text generation tasks (e.g., machine translation), human-AI collaboration in the form of "post-editing" AI-generated text reduces human workload and improves the qu… ▽ More

    Submitted 13 June, 2022; originally announced June 2022.

    Comments: 18 pages, 21 figures

  22. arXiv:2205.01757  [pdf, other

    cs.CL cs.LG

    XLTime: A Cross-Lingual Knowledge Transfer Framework for Temporal Expression Extraction

    Authors: Yuwei Cao, William Groves, Tanay Kumar Saha, Joel R. Tetreault, Alex Jaimes, Hao Peng, Philip S. Yu

    Abstract: Temporal Expression Extraction (TEE) is essential for understanding time in natural language. It has applications in Natural Language Processing (NLP) tasks such as question answering, information retrieval, and causal inference. To date, work in this area has mostly focused on English as there is a scarcity of labeled data for other languages. We propose XLTime, a novel framework for multilingual… ▽ More

    Submitted 3 May, 2022; originally announced May 2022.

    Comments: This paper is accepted by the Findings of NAACL 2022

  23. arXiv:2110.10668  [pdf, other

    cs.CL

    Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer

    Authors: Eleftheria Briakou, Sweta Agrawal, Joel Tetreault, Marine Carpuat

    Abstract: While the field of style transfer (ST) has been growing rapidly, it has been hampered by a lack of standardized practices for automatic evaluation. In this paper, we evaluate leading ST automatic metrics on the oft-researched task of formality style transfer. Unlike previous evaluations, which focus solely on English, we expand our focus to Brazilian-Portuguese, French, and Italian, making this wo… ▽ More

    Submitted 20 October, 2021; originally announced October 2021.

    Comments: EMNLP 2021

  24. arXiv:2109.02865  [pdf, other

    cs.CV

    Journalistic Guidelines Aware News Image Captioning

    Authors: Xuewen Yang, Svebor Karaman, Joel Tetreault, Alex Jaimes

    Abstract: The task of news article image captioning aims to generate descriptive and informative captions for news article images. Unlike conventional image captions that simply describe the content of the image in general terms, news image captions follow journalistic guidelines and rely heavily on named entities to describe the image content, often drawing context from the whole article they are associate… ▽ More

    Submitted 10 September, 2021; v1 submitted 7 September, 2021; originally announced September 2021.

    Journal ref: EMNLP 2021

  25. arXiv:2106.04747  [pdf, other

    cs.CL

    A Review of Human Evaluation for Style Transfer

    Authors: Eleftheria Briakou, Sweta Agrawal, Ke Zhang, Joel Tetreault, Marine Carpuat

    Abstract: This paper reviews and summarizes human evaluation practices described in 97 style transfer papers with respect to three main evaluation aspects: style transfer, meaning preservation, and fluency. In principle, evaluations by human raters should be the most reliable. However, in style transfer papers, we find that protocols for human evaluations are often underspecified and not standardized, which… ▽ More

    Submitted 8 June, 2021; originally announced June 2021.

    Comments: GEM 2021

  26. arXiv:2104.15104  [pdf, other

    cs.CL cs.IR

    GTN-ED: Event Detection Using Graph Transformer Networks

    Authors: Sanghamitra Dutta, Liang Ma, Tanay Kumar Saha, Di Lu, Joel Tetreault, Alejandro Jaimes

    Abstract: Recent works show that the graph structure of sentences, generated from dependency parsers, has potential for improving event detection. However, they often only leverage the edges (dependencies) between words, and discard the dependency labels (e.g., nominal-subject), treating the underlying graph edges as homogeneous. In this work, we propose a novel framework for incorporating both dependencies… ▽ More

    Submitted 5 May, 2021; v1 submitted 30 April, 2021; originally announced April 2021.

    Journal ref: TextGraphs 2021 : 15th Workshop on Graph-Based Natural Language Processing

  27. arXiv:2104.04108  [pdf, other

    cs.CL cs.AI

    XFORMAL: A Benchmark for Multilingual Formality Style Transfer

    Authors: Eleftheria Briakou, Di Lu, Ke Zhang, Joel Tetreault

    Abstract: We take the first step towards multilingual style transfer by creating and releasing XFORMAL, a benchmark of multiple formal reformulations of informal text in Brazilian Portuguese, French, and Italian. Results on XFORMAL suggest that state-of-the-art style transfer approaches perform close to simple baselines, indicating that style transfer is even more challenging when moving multilingual.

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: NAACL 2021

  28. arXiv:2011.03287  [pdf, other

    cs.CL

    The ApposCorpus: A new multilingual, multi-domain dataset for factual appositive generation

    Authors: Yova Kementchedjhieva, Di Lu, Joel Tetreault

    Abstract: News articles, image captions, product reviews and many other texts mention people and organizations whose name recognition could vary for different audiences. In such cases, background information about the named entities could be provided in the form of an appositive noun phrase, either written by a human or generated automatically. We expand on the previous work in appositive generation with a… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: To appear at COLING2020

  29. arXiv:2011.01589  [pdf, other

    cs.CL

    Creating a Domain-diverse Corpus for Theory-based Argument Quality Assessment

    Authors: Lily Ng, Anne Lauscher, Joel Tetreault, Courtney Napoles

    Abstract: Computational models of argument quality (AQ) have focused primarily on assessing the overall quality or just one specific characteristic of an argument, such as its convincingness or its clarity. However, previous work has claimed that assessment based on theoretical dimensions of argumentation could benefit writers, but developing such models has been limited by the lack of annotated data. In th… ▽ More

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: accepted for ArgMining 20

  30. Federated Learning for Breast Density Classification: A Real-World Implementation

    Authors: Holger R. Roth, Ken Chang, Praveer Singh, Nir Neumark, Wenqi Li, Vikash Gupta, Sharut Gupta, Liangqiong Qu, Alvin Ihsani, Bernardo C. Bizzo, Yuhong Wen, Varun Buch, Meesam Shah, Felipe Kitamura, Matheus Mendonça, Vitor Lavor, Ahmed Harouni, Colin Compas, Jesse Tetreault, Prerna Dogra, Yan Cheng, Selnur Erdal, Richard White, Behrooz Hashemian, Thomas Schultz , et al. (18 additional authors not shown)

    Abstract: Building robust deep learning-based models requires large quantities of diverse training data. In this study, we investigate the use of federated learning (FL) to build medical imaging classification models in a real-world collaborative setting. Seven clinical institutions from across the world joined this FL effort to train a model for breast density classification based on Breast Imaging, Report… ▽ More

    Submitted 20 October, 2020; v1 submitted 3 September, 2020; originally announced September 2020.

    Comments: Accepted at the 1st MICCAI Workshop on "Distributed And Collaborative Learning"; add citation to Fig. 1 & 2 and update Fig. 5; fix typo in affiliations

    Journal ref: In: Albarqouni S. et al. (eds) Domain Adaptation and Representation Transfer, and Distributed and Collaborative Learning. DART 2020, DCL 2020. Lecture Notes in Computer Science, vol 12444. Springer, Cham

  31. arXiv:2007.11756  [pdf, other

    cs.SI cs.CL

    Clustering of Social Media Messages for Humanitarian Aid Response during Crisis

    Authors: Swati Padhee, Tanay Kumar Saha, Joel Tetreault, Alejandro Jaimes

    Abstract: Social media has quickly grown into an essential tool for people to communicate and express their needs during crisis events. Prior work in analyzing social media data for crisis management has focused primarily on automatically identifying actionable (or, informative) crisis-related messages. In this work, we show that recent advances in Deep Learning and Natural Language Processing outperform pr… ▽ More

    Submitted 22 July, 2020; originally announced July 2020.

    Comments: 6 pages, 1 figure. Research work was done while Swati was interning at Dataminr Inc. and presented at the AI for Social Good, Harvard CRCS Workshop 2020 (https://aiforgood2020.github.io)

  32. arXiv:2006.02964  [pdf, other

    cs.CL

    Personalizing Grammatical Error Correction: Adaptation to Proficiency Level and L1

    Authors: Maria Nadejde, Joel Tetreault

    Abstract: Grammar error correction (GEC) systems have become ubiquitous in a variety of software applications, and have started to approach human-level performance for some datasets. However, very little is known about how to efficiently personalize these systems to the user's characteristics, such as their proficiency level and first language, or to emerging domains of text. We present the first results on… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

    Comments: Proceedings of the 2019 EMNLP Workshop W-NUT: The 5th Workshop on Noisy User-generated Text

    Journal ref: Proceedings of the 2019 EMNLP Workshop W-NUT: The 5th Workshop on Noisy User-generated Text, pages 27-33, Hong Kong, Nov 4, 2019

  33. arXiv:2006.00843  [pdf, other

    cs.CL

    Rhetoric, Logic, and Dialectic: Advancing Theory-based Argument Quality Assessment in Natural Language Processing

    Authors: Anne Lauscher, Lily Ng, Courtney Napoles, Joel Tetreault

    Abstract: Though preceding work in computational argument quality (AQ) mostly focuses on assessing overall AQ, researchers agree that writers would benefit from feedback targeting individual dimensions of argumentation theory. However, a large-scale theory-based corpus and corresponding computational models are missing. We fill this gap by conducting an extensive analysis covering three diverse domains of o… ▽ More

    Submitted 3 November, 2020; v1 submitted 1 June, 2020; originally announced June 2020.

    Comments: accepted for COLING 20

  34. arXiv:2004.04917  [pdf, other

    cs.LG cs.AI cs.CL cs.CV

    Multimodal Categorization of Crisis Events in Social Media

    Authors: Mahdi Abavisani, Liwei Wu, Shengli Hu, Joel Tetreault, Alejandro Jaimes

    Abstract: Recent developments in image classification and natural language processing, coupled with the rapid growth in social media usage, have enabled fundamental advances in detecting breaking events around the world in real-time. Emergency response is one such area that stands to gain from these advances. By processing billions of texts and images a minute, events can be automatically detected to enable… ▽ More

    Submitted 10 April, 2020; originally announced April 2020.

    Comments: Conference on Computer Vision and Pattern Recognition (CVPR 2020)

    ACM Class: I.5.4

    Journal ref: Conference on Computer Vision and Pattern Recognition (CVPR 2020)

  35. arXiv:1907.08889  [pdf, other

    cs.CL

    The Unbearable Weight of Generating Artificial Errors for Grammatical Error Correction

    Authors: Phu Mon Htut, Joel Tetreault

    Abstract: In recent years, sequence-to-sequence models have been very effective for end-to-end grammatical error correction (GEC). As creating human-annotated parallel corpus for GEC is expensive and time-consuming, there has been work on artificial corpus generation with the aim of creating sentences that contain realistic grammatical errors from grammatically correct sentences. In this paper, we investiga… ▽ More

    Submitted 20 July, 2019; originally announced July 2019.

    Comments: To appear at ACL-BEA workshop 2019

  36. arXiv:1906.03497  [pdf, other

    cs.CL

    This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation

    Authors: Rui Zhang, Joel Tetreault

    Abstract: Given the overwhelming number of emails, an effective subject line becomes essential to better inform the recipient of the email's content. In this paper, we propose and study the task of email subject line generation: automatically generating an email subject line from the email body. We create the first dataset for this task and find that email subject line generation favor extremely abstractive… ▽ More

    Submitted 8 June, 2019; originally announced June 2019.

    Comments: ACL 2019, long paper

  37. arXiv:1904.02594  [pdf, other

    cs.CL

    Dialogue Act Classification with Context-Aware Self-Attention

    Authors: Vipul Raheja, Joel Tetreault

    Abstract: Recent work in Dialogue Act classification has treated the task as a sequence labeling problem using hierarchical deep neural networks. We build on this prior work by leveraging the effectiveness of a context-aware self-attention mechanism coupled with a hierarchical recurrent neural network. We conduct extensive evaluations on standard Dialogue Act classification datasets and show significant imp… ▽ More

    Submitted 6 May, 2019; v1 submitted 4 April, 2019; originally announced April 2019.

    Comments: NAACL-HLT 2019. 7 pages, 3 figures

  38. arXiv:1809.08298  [pdf, ps, other

    cs.CL

    How do you correct run-on sentences it's not as easy as it seems

    Authors: Junchao Zheng, Courtney Napoles, Joel Tetreault, Kostiantyn Omelianchuk

    Abstract: Run-on sentences are common grammatical mistakes but little research has tackled this problem to date. This work introduces two machine learning models to correct run-on sentences that outperform leading methods for related tasks, punctuation restoration and whole-sentence grammatical error correction. Due to the limited annotated data for this error, we experiment with artificially generating tra… ▽ More

    Submitted 21 September, 2018; originally announced September 2018.

    Comments: To appear in W-NUT 2018: Workshop on Noisy User-generated Text (at EMNLP)

  39. arXiv:1805.04993  [pdf, other

    cs.CL

    Discourse Coherence in the Wild: A Dataset, Evaluation and Methods

    Authors: Alice Lai, Joel Tetreault

    Abstract: To date there has been very little work on assessing discourse coherence methods on real-world data. To address this, we present a new corpus of real-world texts (GCDC) as well as the first large-scale evaluation of leading discourse coherence algorithms. We show that neural models, including two that we introduce here (SentAvg and ParSeq), tend to perform best. We analyze these performance differ… ▽ More

    Submitted 13 May, 2018; originally announced May 2018.

    Comments: Accepted at SIGDIAL 2018

  40. arXiv:1803.06535  [pdf, other

    cs.CL

    Dear Sir or Madam, May I introduce the GYAFC Dataset: Corpus, Benchmarks and Metrics for Formality Style Transfer

    Authors: Sudha Rao, Joel Tetreault

    Abstract: Style transfer is the task of automatically transforming a piece of text in one particular style into another. A major barrier to progress in this field has been a lack of training and evaluation datasets, as well as benchmarks and automatic metrics. In this work, we create the largest corpus for a particular stylistic transfer (formality) and show that techniques from the machine translation comm… ▽ More

    Submitted 16 April, 2018; v1 submitted 17 March, 2018; originally announced March 2018.

    Comments: To appear in the proceedings of North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2018

  41. arXiv:1702.04066  [pdf, other

    cs.CL

    JFLEG: A Fluency Corpus and Benchmark for Grammatical Error Correction

    Authors: Courtney Napoles, Keisuke Sakaguchi, Joel Tetreault

    Abstract: We present a new parallel corpus, JHU FLuency-Extended GUG corpus (JFLEG) for developing and evaluating grammatical error correction (GEC). Unlike other corpora, it represents a broad range of language proficiency levels and uses holistic fluency edits to not only correct grammatical errors but also make the original text more native sounding. We describe the types of corrections made and benchmar… ▽ More

    Submitted 13 February, 2017; originally announced February 2017.

    Comments: To appear in EACL 2017 (short papers)

  42. arXiv:1610.02124  [pdf, other

    cs.CL

    There's No Comparison: Reference-less Evaluation Metrics in Grammatical Error Correction

    Authors: Courtney Napoles, Keisuke Sakaguchi, Joel Tetreault

    Abstract: Current methods for automatically evaluating grammatical error correction (GEC) systems rely on gold-standard references. However, these methods suffer from penalizing grammatical edits that are correct but not in the gold standard. We show that reference-less grammaticality metrics correlate very strongly with human judgments and are competitive with the leading reference-based evaluation metrics… ▽ More

    Submitted 6 October, 2016; originally announced October 2016.

    Comments: to appear in Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing (EMNLP)

  43. arXiv:1608.02289  [pdf, other

    cs.CV cs.CL cs.MM

    Detecting Sarcasm in Multimodal Social Platforms

    Authors: Rossano Schifanella, Paloma de Juan, Joel Tetreault, Liangliang Cao

    Abstract: Sarcasm is a peculiar form of sentiment expression, where the surface sentiment differs from the implied sentiment. The detection of sarcasm in social media platforms has been applied in the past mainly to textual utterances where lexical indicators (such as interjections and intensifiers), linguistic markers, and contextual information (such as user profiles, or past conversations) were used to d… ▽ More

    Submitted 7 August, 2016; originally announced August 2016.

    Comments: 10 pages, 3 figures, final version published in the Proceedings of ACM Multimedia 2016

  44. arXiv:1605.02592  [pdf, other

    cs.CL

    GLEU Without Tuning

    Authors: Courtney Napoles, Keisuke Sakaguchi, Matt Post, Joel Tetreault

    Abstract: The GLEU metric was proposed for evaluating grammatical error corrections using n-gram overlap with a set of reference sentences, as opposed to precision/recall of specific annotated errors (Napoles et al., 2015). This paper describes improvements made to the GLEU metric that address problems that arise when using an increasing number of reference sets. Unlike the originally presented metric, the… ▽ More

    Submitted 9 May, 2016; originally announced May 2016.

  45. arXiv:1604.02748  [pdf, other

    cs.CV

    TGIF: A New Dataset and Benchmark on Animated GIF Description

    Authors: Yuncheng Li, Yale Song, Liangliang Cao, Joel Tetreault, Larry Goldberg, Alejandro Jaimes, Jiebo Luo

    Abstract: With the recent popularity of animated GIFs on social media, there is need for ways to index them with rich metadata. To advance research on animated GIF understanding, we collected a new dataset, Tumblr GIF (TGIF), with 100K animated GIFs from Tumblr and 120K natural language descriptions obtained via crowdsourcing. The motivation for this work is to develop a testbed for image sequence descripti… ▽ More

    Submitted 11 April, 2016; v1 submitted 10 April, 2016; originally announced April 2016.

    Comments: CVPR 2016 Camera Ready

  46. arXiv:1506.08126  [pdf, other

    cs.CL cs.AI cs.MM stat.ML

    Humor in Collective Discourse: Unsupervised Funniness Detection in the New Yorker Cartoon Caption Contest

    Authors: Dragomir Radev, Amanda Stent, Joel Tetreault, Aasish Pappu, Aikaterini Iliakopoulou, Agustin Chanfreau, Paloma de Juan, Jordi Vallmitjana, Alejandro Jaimes, Rahul Jha, Bob Mankoff

    Abstract: The New Yorker publishes a weekly captionless cartoon. More than 5,000 readers submit captions for it. The editors select three of them and ask the readers to pick the funniest one. We describe an experiment that compares a dozen automatic methods for selecting the funniest caption. We show that negative sentiment, human-centeredness, and lexical centrality most strongly match the funniest caption… ▽ More

    Submitted 26 June, 2015; originally announced June 2015.

    Comments: 10 pages, in submission

  47. arXiv:1503.06733  [pdf, other

    cs.CL

    Yara Parser: A Fast and Accurate Dependency Parser

    Authors: Mohammad Sadegh Rasooli, Joel Tetreault

    Abstract: Dependency parsers are among the most crucial tools in natural language processing as they have many important applications in downstream tasks such as information retrieval, machine translation and knowledge acquisition. We introduce the Yara Parser, a fast and accurate open-source dependency parser based on the arc-eager algorithm and beam search. It achieves an unlabeled accuracy of 93.32 on th… ▽ More

    Submitted 24 March, 2015; v1 submitted 23 March, 2015; originally announced March 2015.

  48. arXiv:1403.0801  [pdf, other

    cs.CL

    Is getting the right answer just about choosing the right words? The role of syntactically-informed features in short answer scoring

    Authors: Derrick Higgins, Chris Brew, Michael Heilman, Ramon Ziai, Lei Chen, Aoife Cahill, Michael Flor, Nitin Madnani, Joel Tetreault, Daniel Blanchard, Diane Napolitano, Chong Min Lee, John Blackmore

    Abstract: Developments in the educational landscape have spurred greater interest in the problem of automatically scoring short answer questions. A recent shared task on this topic revealed a fundamental divide in the modeling approaches that have been applied to this problem, with the best-performing systems split between those that employ a knowledge engineering approach and those that almost solely lever… ▽ More

    Submitted 5 March, 2014; v1 submitted 4 March, 2014; originally announced March 2014.