Audience

Developers and AI researchers seeking a solution offering a vision-language model that balances size and capability, ideal for building multimodal agents, document/image analysis tools, or GUI-based automation workflows

About GLM-4.1V

GLM-4.1V is a vision-language model, providing a powerful, compact multimodal model designed for reasoning and perception across images, text, and documents. The 9-billion-parameter variant (GLM-4.1V-9B-Thinking) is built on the GLM-4-9B foundation and enhanced through a specialized training paradigm using Reinforcement Learning with Curriculum Sampling (RLCS). It supports a 64k-token context window and accepts high-resolution inputs (up to 4K images, any aspect ratio), enabling it to handle complex tasks such as optical character recognition, image captioning, chart and document parsing, video and scene understanding, GUI-agent workflows (e.g., interpreting screenshots, recognizing UI elements), and general vision-language reasoning. In benchmark evaluations at the 10 B-parameter scale, GLM-4.1V-9B-Thinking achieved top performance on 23 of 28 tasks.

Pricing

Starting Price:
Free
Free Version:
Free Version available.

Integrations

API:
Yes, GLM-4.1V offers API access

Ratings/Reviews

Overall 0.0 / 5
ease 0.0 / 5
features 0.0 / 5
design 0.0 / 5
support 0.0 / 5

This software hasn't been reviewed yet. Be the first to provide a review:

Review this Software

Company Information

Zhipu AI
Founded: 2023
China
chat.z.ai/

Videos and Screen Captures

GLM-4.1V Screenshot 1
Other Useful Business Software
MongoDB Atlas runs apps anywhere Icon
MongoDB Atlas runs apps anywhere

Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
Start Free

Product Details

Platforms Supported
Cloud
Windows
Mac
Linux
On-Premises
Training
Documentation
Support
Online

GLM-4.1V Frequently Asked Questions

Q: What kinds of users and organization types does GLM-4.1V work with?
Q: What languages does GLM-4.1V support in their product?
Q: What kind of support options does GLM-4.1V offer?
Q: What other applications or services does GLM-4.1V integrate with?
Q: Does GLM-4.1V have an API?
Q: What type of training does GLM-4.1V provide?
Q: How much does GLM-4.1V cost?

GLM-4.1V Product Features

GLM-4.1V Additional Categories