Stars
The official implementation of "MagicColor: Multi-Instance Sketch Colorization"
Official PyTorch implementation of One-Minute Video Generation with Test-Time Training
[CVPR 2025] MatAnyone: Stable Video Matting with Consistent Memory Propagation
Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"
AWS MCP Servers — specialized MCP servers that bring AWS best practices directly to your development workflow
OpenMMLab Pose Estimation Toolbox and Benchmark.
A user-friendly command-line/SDK tool that makes it quicker and easier to deploy open-source LLMs on AWS
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
InspireMusic: A Unified Framework for Music, Song, Audio Generation.
Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer"
LHM: Large Animatable Human Reconstruction Model from a Single Image in Seconds
Official Implementation of Video-T1: Test-Time Scaling for Video Generation
MagicMotion: Controllable Video Generation with Dense-to-Sparse Trajectory Guidance
🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Official PyTorch implementation of the paper "Ultra-Resolution Adaptation with Ease".
OmniPaint: Mastering Object-Oriented Editing via Disentangled Insertion-Removal Inpainting
Official implementation of Unified Reward Model for Multimodal Understanding and Generation.
Official implementation of DreamRenderer: Taming Multi-Instance Attribute Control in Large-Scale Text-to-Image Models
UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
A collection of papers on diffusion models for 3D generation.
Official implementation of OneDiffusion paper (CVPR 2025)
A lightning-fast, cross-platform AI chat application built with React Native.
🚀 Cross attention map tools for huggingface/diffusers
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
Model Context Protocol (MCP) server for using Amazon Bedrock Nova Canvas to generate images