8000
Skip to content
View wangphoebe's full-sized avatar

Block or report wangphoebe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is the official repo for paper "Agent-Environment Alignment via Automated Interface Generation"

Python 6 1 Updated Jun 11, 2025

Repo for paper "MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding".

Python 37 1 Updated Jun 9, 2025

StreamingBench: Assessing the Gap for MLLMs to Achieve Streaming Video Understanding

Python 140 6 Updated May 16, 2025

This is the official repo for paper "Visual Abstract Thinking Empowers Multimodal Reasoning"

Python 10 Updated May 29, 2025

Official Repository for “CoSpace: Benchmarking Continuous Space Perception Ability for Vision-Language Models" [CVPR2025]

Python 4 Updated Dec 14, 2025

Official repo for EscapeCraft (an 3D environment for room escape) and benchmark MM-Escape. This work is accepted by ICCV 2025.

Python 36 5 Updated Jul 7, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,086 2,291 Updated Dec 25, 2024

Open Platform for Embodied Agents

Python 334 22 Updated Jan 12, 2025

✨✨Latest Advances on Multimodal Large Language Models

17,039 1,097 Updated Dec 23, 2025

This is the repo for our work “Failures Pave the Way: Enhancing Large Language Models through Tuning-free Rule Accumulation” (EMNLP 2023).

Python 10 Updated Oct 25, 2023

Official code for our paper "Model Composition for Multimodal Large Language Models" (ACL 2024)

Python 31 1 Updated Jan 8, 2025

Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".

JavaScript 12 Updated Oct 14, 2024

Self-Knowledge Guided Retrieval Augmentation for Large Language Models (EMNLP Findings 2023)

Python 28 Updated Dec 8, 2023

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,283 209 Updated Mar 5, 2024

Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback

Jupyter Notebook 208 17 Updated May 24, 2023

中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)

Python 18,969 1,872 Updated Jul 15, 2025

A large number of free HTTP proxies updated every 10 minutes.Keep http/s proxies fresh at all times.

170 15 Updated Dec 23, 2025

CCL 2022 汉语学习者文本纠错评测

Macaulay2 142 27 Updated Dec 16, 2022
0