User profiles for Zeyi Liao
Zeyi LiaoThe Ohio State University Verified email at osu.edu Cited by 1051 |
Eia: Environmental injection attack on generalist web agents for privacy leakage
Generalist web agents have demonstrated remarkable potential in autonomously completing
a wide range of tasks on real websites, significantly boosting human productivity. However, …
a wide range of tasks on real websites, significantly boosting human productivity. However, …
Amplegcg: Learning a universal and transferable generative model of adversarial suffixes for jailbreaking both open and closed llms
As large language models (LLMs) become increasingly prevalent and integrated into
autonomous systems, ensuring their safety is imperative. Despite significant strides toward safety …
autonomous systems, ensuring their safety is imperative. Despite significant strides toward safety …
Chatcounselor: A large language models for mental health support
This paper presents ChatCounselor, a large language model (LLM) solution designed to
provide mental health support. Unlike generic chatbots, ChatCounselor is distinguished by its …
provide mental health support. Unlike generic chatbots, ChatCounselor is distinguished by its …
AttributionBench: How Hard is Automatic Attribution Evaluation?
Modern generative search engines enhance the reliability of large language model (LLM)
responses by providing cited evidence. However, evaluating the answer’s attribution, ie, …
responses by providing cited evidence. However, evaluating the answer’s attribution, ie, …
Scienceagentbench: Toward rigorous assessment of language agents for data-driven scientific discovery
The advancements of large language models (LLMs) have piqued growing interest in
developing LLM-based language agents to automate scientific discovery end-to-end, which has …
developing LLM-based language agents to automate scientific discovery end-to-end, which has …
Introducing v0. 5 of the ai safety benchmark from mlcommons
This paper introduces v0.5 of the AI Safety Benchmark, which has been created by the
MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess …
MLCommons AI Safety Working Group. The AI Safety Benchmark has been designed to assess …
Agent learning via early experience
A long-term goal of language agents is to learn and improve through their own experience,
ultimately outperforming humans in complex, real-world tasks. However, training agents from …
ultimately outperforming humans in complex, real-world tasks. However, training agents from …
Redteamcua: Realistic adversarial testing of computer-use agents in hybrid web-os environments
… Zeyi Liao provided the benign task formulation used … Zeyi Liao and Linxi Jiang led the main
code implementation for the RedTeamCUA framework and sandbox construction. Zeyi Liao …
code implementation for the RedTeamCUA framework and sandbox construction. Zeyi Liao …
Mind2web 2: Evaluating agentic search with agent-as-a-judge
Agentic search such as Deep Research systems-where agents autonomously browse the
web, synthesize information, and return comprehensive citation-backed answers-represents a …
web, synthesize information, and return comprehensive citation-backed answers-represents a …
Advweb: Controllable black-box attacks on vlm-powered web agents
Vision Language Models (VLMs) have revolutionized the creation of generalist web agents,
empowering them to autonomously complete diverse tasks on real-world websites, thereby …
empowering them to autonomously complete diverse tasks on real-world websites, thereby …