Impronta pubblica su GitHub di KlingAI Research

ReCamMaster

★1821

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

SynCamMaster

★689

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Aggiornato 2 giu 2026

UniVideo

★528

[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos

GameFactory

★490

[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos

VideoAlign

★474

[NeurIPS 2025] Improving Video Generation with Human Feedback

3DTrajMaster

★371

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook

Aggiornato 2 giu 2026

Koala-36M

★252

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

I2V-Adapter

★233

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models

Aggiornato 4 mag 2026

MemFlow

★209

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Aggiornato 10 giu 2026

X-Dub

★200

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

DiffMoE

★177

[Arxiv 2025] Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT

StyleMaster

★174

[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation

Jupyter Notebook

Aggiornato 7 giu 2026

ComfyUI-KLingAI-API

★172

Nessuna descrizione fornita per questo repository.

Aggiornato 25 mag 2026

MultiShotMaster

★163

CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"

Aggiornato 29 mag 2026

CamCloneMaster

★158

[SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation

Aggiornato 27 mag 2026

SVG-T2I

★152

[Arxiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Aggiornato 21 mag 2026

ShotStream

★150

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Aggiornato 10 giu 2026

VANS

★119

[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Aggiornato 4 giu 2026

HumanAesExpert

★118

Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"

Aggiornato 1 giu 2026

StereoPilot

★115

The official implementation of StereoPilot

RoboMaster

★107

[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

Aggiornato 16 mag 2026

Uniaa

★94

Unified Multi-modal IAA Baseline and Benchmark

Aggiornato 9 mar 2026

VideoCanvas

★70

Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"

Aggiornato 2 mag 2026

MODA

★69

[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

AvatarForcing

★68

Official Pytorch implementation of AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

Aggiornato 10 giu 2026

VMoBA

★65

Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"

Aggiornato 2 apr 2026

SPF-Portrait

★63

Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"

Aggiornato 24 apr 2026

PhysMaster

★57

Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Aggiornato 8 mag 2026

T2I-CoReBench

★53

[ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Aggiornato 8 mag 2026

Alchemist

★39

Nessuna descrizione fornita per questo repository.

Aggiornato 12 mag 2026

diffusing-right-space

★31

Metric implementation and raw data of "Diffusing in the Right Space: A Systematic Study of Latent Diffusability"

DecMem

★19

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

VidEmo

★15

[NeurIPS'25] VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models

Aggiornato 30 apr 2026

SegTune

★14

[ACL'26 Oral] Official implementation of "SegTune: Structured and Fine-Grained Control for Song Generation".

VQRAE

★12

VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

Aggiornato 28 mag 2026

VFRTok

★11

Official implementation of NeurIPS'25 paper "VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption"

IMBA-Loss

★11

[ICCV 2025] Official Implementation of the Paper "Imbalance in Balance: Online Concept Balancing in Generation Models".

Aggiornato 16 gen 2026

DVIS_Plus

★10

Decoupled Video Instance Segmentation Framework, improved version of dvis

Aggiornato 17 mag 2026

TexEditor

★7

TexEditor: Structure-Preserving Text-Driven Texture Editing

Jupyter Notebook

Aggiornato 2 giu 2026

DVIS

★7

Decoupled Video Instance Segmentation Framework

Aggiornato 17 mag 2026

ScalingCache

★5

[ICLR 2026] Scalingcache: extreme acceleration of dits through difference scaling and dynamic interval caching

Aggiornato 28 apr 2026

kling-waic-express

★4

This is the program for supporting KlingAI Express in WAIC 2025.

Kotlin

SocioEmoDialog

★4

Scripts for processing and evaluating SocioEmoDialog datasets. It includes the core processing scripts, evaluation metrics, and additional documentation.

Aggiornato 16 gen 2026

Send-VAE

★2

Nessuna descrizione fornita per questo repository.

RewardHarness

★2

RewardHarness: Self-Evolving Agentic Post-Training https://rewardharness.com/

Aggiornato 18 mag 2026

VIVID

★2

Nessuna descrizione fornita per questo repository.

HTML

Aggiornato 16 gen 2026

DeScore

★1

Nessuna descrizione fornita per questo repository.

Aggiornato 26 mag 2026

kling-skills

★1

Nessuna descrizione fornita per questo repository.

JavaScript

Aggiornato 20 mar 2026

ATR

★0

Making Image Editing Easier via Adaptive Task Reformulation with Agentic Executions

noise-awareness-guidance

★0

[ICLR'26] Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance