Computer Science > Information Retrieval

arXiv:2504.20047 (cs)
[Submitted on 9 Mar 2025 (v1), last revised 5 Mar 2026 (this version, v3)]

Title: HCT-QA: A Benchmark for Question Answering on Human-Centric Tables

Authors: Mohammad S. Ahmad, Zan A. Naeem, Michaël Aupetit, Ahmed Elmagarmid, Mohamed Eltabakh, Xiaosong Ma, Mourad Ouzzani, Chaoyi Ruan, Hani Al-Sayeh
Abstract: Tabular data embedded in PDF files, web pages, and other types of documents is prevalent in various domains. These tables, which we call human-centric tables (HCTs for short), are dense in information but often exhibit complex structural and semantic layouts. To query these HCTs, some existing solutions focus on transforming them into relational formats. However, they fail to handle the diverse and complex layouts of HCTs, which are therefore not amenable to easy querying with SQL-based approaches. Another emerging option is to use Large Language Models (LLMs) and Vision Language Models (VLMs). However, there is a lack of standard evaluation benchmarks to measure and compare the performance of models that query HCTs using natural language. To address this gap, we propose the Human-Centric Tables Question-Answering benchmark (HCT-QA), an extensive benchmark consisting of thousands of HCTs with tens of thousands of natural language questions and their respective answers. More specifically, HCT-QA includes 1,880 real-world HCTs with 9,835 QA pairs, in addition to 4,679 synthetic HCTs with 67.7K QA pairs. We also show through extensive experiments the performance of 25 LLMs and 9 VLMs in answering HCT-QA's questions. In addition, we show how fine-tuning an LLM on HCT-QA improves F1 scores by up to 25 percentage points compared to the off-the-shelf model. Compared to existing benchmarks, HCT-QA stands out for the breadth, complexity, and diversity of its covered HCTs and generated questions, its comprehensive metadata enabling deeper insight and analysis, and its novel synthetic data and QA generator.
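
The abstract reports answer quality as F1 scores but does not define the scoring procedure, so the following is only a minimal sketch of a common choice for QA evaluation: token-level (SQuAD-style) F1 between a predicted and a gold answer. The tokenization scheme and the example strings are assumptions for illustration, not the paper's actual metric.

from collections import Counter

def token_f1(prediction: str, gold: str) -> float:
    """Token-overlap F1 between a predicted and a gold answer string."""
    pred_tokens = prediction.lower().split()
    gold_tokens = gold.lower().split()
    if not pred_tokens or not gold_tokens:
        # Both empty counts as a match; only one empty counts as a miss.
        return float(pred_tokens == gold_tokens)
    # Tokens shared between prediction and gold, counted with multiplicity.
    common = Counter(pred_tokens) & Counter(gold_tokens)
    num_same = sum(common.values())
    if num_same == 0:
        return 0.0
    precision = num_same / len(pred_tokens)
    recall = num_same / len(gold_tokens)
    return 2 * precision * recall / (precision + recall)

# Hypothetical example: a partially correct model answer scores 0.8.
print(token_f1("42.5 million USD", "42.5 million"))

Under a metric like this, the reported 25-percentage-point gain from fine-tuning would correspond to, for example, average F1 rising from 0.50 to 0.75.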
Subjects: Information Retrieval (cs.IR); Artificial Intelligence (cs.AI); Databases (cs.DB)
Cite as: arXiv:2504.20047 [cs.IR]
  (or arXiv:2504.20047v3 [cs.IR] for this version)
  https://doi.org/10.48550/arXiv.2504.20047
arXiv-issued DOI via DataCite

Submission history

From: Mohammad Shahmeer Ahmad
[v1] Sun, 9 Mar 2025 11:02:11 UTC (9,943 KB)
[v2] Sun, 2 Nov 2025 08:33:27 UTC (28,025 KB)
[v3] Thu, 5 Mar 2026 20:10:34 UTC (21,816 KB)