Computer Science > Artificial Intelligence

arXiv:2511.13131 (cs)

[Submitted on 17 Nov 2025]

Title:MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications

Authors:Gagan Raj Gupta, Anshul Kumar, Manish Rai, Apu Chakraborty, Ashutosh Modi, Abdelaali Chaoub, Soumajit Pramanik, Moyank Giri, Yashwanth Holla, Sunny Kumar, M. V. Kiran Sooraj

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have emerged as powerful tools for automating complex reasoning and decision-making tasks. In telecommunications, they hold the potential to transform network optimization, automate troubleshooting, enhance customer support, and ensure regulatory compliance. However, their deployment in telecom is hindered by domain-specific challenges that demand specialized adaptation. To overcome these challenges and to accelerate the adaptation of LLMs for telecom, we propose MM-Telco, a comprehensive suite of multimodal benchmarks and models tailored for the telecom domain. The benchmark introduces various tasks (both text based and image based) that address various practical real-life use cases such as network operations, network management, improving documentation quality, and retrieval of relevant text and images. Further, we perform baseline experiments with various LLMs and VLMs. The models fine-tuned on our dataset exhibit a significant boost in performance. Our experiments also help analyze the weak areas in the working of current state-of-art multimodal LLMs, thus guiding towards further development and research.

Subjects:	Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Emerging Technologies (cs.ET); Networking and Internet Architecture (cs.NI)
Cite as:	arXiv:2511.13131 [cs.AI]
	(or arXiv:2511.13131v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2511.13131

Submission history

From: Anshul Kumar Mr [view email]
[v1] Mon, 17 Nov 2025 08:34:41 UTC (5,665 KB)

Computer Science > Artificial Intelligence

Title:MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:MM-Telco: Benchmarks and Multimodal Large Language Models for Telecom Applications

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators