8000
Skip to content
View aihao2000's full-sized avatar

Highlights

  • Pro

Block or report aihao2000

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repository for the paper "MICo-150K: A Comprehensive Dataset for Multi-Image Composition".

Python 22 Updated Dec 16, 2025
Swift 4,942 545 Updated Dec 1, 2025

A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training

Python 590 32 Updated Dec 20, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,479 295 Updated Dec 19, 2025

🤗A PyTorch-native Inference Engine with Hybrid Cache Acceleration and Parallelism for DiTs: Z-Image, FLUX2, Qwen-Image, etc.

Python 787 41 Updated Dec 20, 2025

ChronoEdit: Towards Temporal Reasoning for Image Editing and World Simulation

Python 628 37 Updated Nov 20, 2025

Human Preference Score v2: A Solid Benchmark for Evaluating Human Preferences of Text-to-Image Synthesis

Jupyter Notebook 634 24 Updated May 24, 2024

[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning

Python 1,339 77 Updated Sep 12, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,600 123 Updated Oct 31, 2025

🔥🔥 Official Repo of UMO: Scaling Multi-Identity Consistency for Image Customization via Matching Reward

Python 175 3 Updated Sep 15, 2025

Arxiv 25: Dynamic Pyramid Network for Efficient Multimodal Large Language Model

Python 1 Updated Apr 28, 2025

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,764 105 Updated Nov 4, 2025
Python 316 15 Updated Sep 15, 2025

Qwen-Image-Lightning: Speed up Qwen-Image model with distillation

Python 1,055 41 Updated Dec 3, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,444 361 Updated Dec 19, 2025

Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels with Hunyuan3D World Model

Python 2,563 221 Updated Dec 17, 2025

Consistency Distillation with Target Timestep Selection and Decoupled Guidance

Python 101 13 Updated Jan 4, 2025

Light Video Generation Inference Framework

Python 1,257 79 Updated Dec 19, 2025

This is the official implementation of our Señorita-2M [Weights and Dataset] : A High-Quality Instruction-based Dataset for General Video Editing by Video Specialists

Python 99 1 Updated Apr 9, 2025

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 4,834 322 Updated Dec 21, 2025

Making Flux go brrr on GPUs.

Python 158 16 Updated Jul 18, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,481 206 Updated Dec 21, 2025

(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models

Python 1,116 63 Updated Aug 7, 2025

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,465 94 Updated Sep 11, 2025

Enjoy the magic of Diffusion models!

Python 11,181 1,055 Updated Dec 20, 2025

Official PyTorch implementation - Video Motion Transfer with Diffusion Transformers

Python 76 3 Updated Jul 29, 2025

Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944

Python 328 16 Updated Aug 7, 2025

📹 A more flexible framework that can generate videos at any resolution and creates videos from images.

Python 1,709 128 Updated Dec 19, 2025

Open-source unified multimodal model

Python 5,491 480 Updated Oct 27, 2025
Next
0