Public GitHub footprint of KlingAI Research

ReCamMaster

★1,821

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

SynCamMaster

★689

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Updated Jun 2, 2026

UniVideo

★528

[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos

GameFactory

★490

[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos

VideoAlign

★474

[NeurIPS 2025] Improving Video Generation with Human Feedback

3DTrajMaster

★371

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook

Updated Jun 2, 2026

Koala-36M

★252

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

I2V-Adapter

★233

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models

Updated May 4, 2026

MemFlow

★209

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Updated Jun 10, 2026

X-Dub

★200

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

DiffMoE

★177

[Arxiv 2025] Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT

StyleMaster

★174

[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation

Jupyter Notebook

Updated Jun 7, 2026

ComfyUI-KLingAI-API

★172

No description provided for this repository.

Updated May 25, 2026

MultiShotMaster

★163

CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"

Updated May 29, 2026

CamCloneMaster

★158

[SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation

Updated May 27, 2026

SVG-T2I

★152

[Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Updated May 21, 2026

ShotStream

★150

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Updated Jun 10, 2026

VANS

★119

[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Updated Jun 4, 2026

HumanAesExpert

★118

Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"

Updated Jun 1, 2026

StereoPilot

★115

The official implementation of StereoPilot

RoboMaster

★107

[ICLR'26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

Updated May 16, 2026

Uniaa

★94

Unified Multi-modal IAA Baseline and Benchmark

Updated Mar 9, 2026

VideoCanvas

★70

Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"

Updated May 2, 2026

MODA

★69

[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

AvatarForcing

★68

Official PyTorch implementation of AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

Updated Jun 10, 2026

VMoBA

★65

Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"

Updated Apr 2, 2026

SPF-Portrait

★63

Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"

Updated Apr 24, 2026

PhysMaster

★57

Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Updated May 8, 2026

T2I-CoReBench

★53

[ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Updated May 8, 2026

Alchemist

★39

No description provided for this repository.

Updated May 12, 2026

diffusing-right-space

★31

Metric implementation and raw data of "Diffusing in the Right Space: A Systematic Study of Latent Diffusability"

DecMem

★19

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

VidEmo

★15

[NeurIPS'25] VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models

Updated Apr 30, 2026

SegTune

★14

[ACL'26 Oral] Official implementation of "SegTune: Structured and Fine-Grained Control for Song Generation".

VQRAE

★12

VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

Updated May 28, 2026

VFRTok

★11

Official implementation of NeurIPS'25 paper "VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption"

IMBA-Loss

★11

[ICCV 2025] Official Implementation of the Paper "Imbalance in Balance: Online Concept Balancing in Generation Models".

Updated Jan 16, 2026

DVIS_Plus

★10

Decoupled Video Instance Segmentation Framework, an improved version of DVIS

Updated May 17, 2026

TexEditor

★7

TexEditor: Structure-Preserving Text-Driven Texture Editing

Jupyter Notebook

Updated Jun 2, 2026

DVIS

★7

Decoupled Video Instance Segmentation Framework

Updated May 17, 2026

ScalingCache

★5

[ICLR 2026] ScalingCache: Extreme Acceleration of DiTs through Difference Scaling and Dynamic Interval Caching

Updated Apr 28, 2026

kling-waic-express

★4

The program supporting KlingAI Express at WAIC 2025.

Kotlin

SocioEmoDialog

★4

Scripts for processing and evaluating the SocioEmoDialog datasets, including the core processing scripts, evaluation metrics, and additional documentation.

Updated Jan 16, 2026

Send-VAE

★2

No description provided for this repository.

RewardHarness

★2

RewardHarness: Self-Evolving Agentic Post-Training https://rewardharness.com/

Updated May 18, 2026

VIVID

★2

No description provided for this repository.

HTML

Updated Jan 16, 2026

DeScore

★1

No description provided for this repository.

Updated May 26, 2026

kling-skills

★1

No description provided for this repository.

JavaScript

Updated Mar 20, 2026

ATR

★0

Making Image Editing Easier via Adaptive Task Reformulation with Agentic Executions

noise-awareness-guidance

★0

[ICLR'26] Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance