KlingAI Research's Public GitHub Footprint

ReCamMaster

★1,821

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

SynCamMaster

★689

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Updated June 2, 2026

UniVideo

★528

[ICLR 2026] UniVideo: Unified Understanding, Generation, and Editing for Videos

GameFactory

★490

[ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos

VideoAlign

★474

[NeurIPS 2025] Improving Video Generation with Human Feedback

3DTrajMaster

★371

[ICLR'25] 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation

Jupyter Notebook

Updated June 2, 2026

Koala-36M

★252

Official implementation of the paper "Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content".

Updated June 8, 2026

I2V-Adapter

★233

I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models

Updated May 4, 2026

MemFlow

★209

Official Implementation of "MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives"

Updated June 10, 2026

X-Dub

★200

Try X-Dub to sync any character in a video with any audio you like | Official repository for "From Inpainting to Editing: Unlocking Robust Mask-Free Visual Dubbing via Generative Bootstrapping"

DiffMoE

★177

[arXiv 2025] Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT

StyleMaster

★174

[CVPR'25] StyleMaster: Stylize Your Video with Artistic Generation and Translation

Jupyter Notebook

Updated June 7, 2026

ComfyUI-KLingAI-API

★172

No description provided for this repository.

Updated May 25, 2026

MultiShotMaster

★163

CVPR 2026 | Official Implementation of "MultiShotMaster: A Controllable Multi-Shot Video Generation Framework"

Updated May 29, 2026

CamCloneMaster

★158

[SIGGRAPH Asia'25] Enabling Reference-based Camera Control via Context without Explicit 3D Estimation

Updated May 27, 2026

SVG-T2I

★152

[arXiv 2025] Official PyTorch Implementation of "SVG-T2I: Scaling up Text-to-Image Latent Diffusion Model Without Variational Autoencoder".

Updated May 21, 2026

ShotStream

★150

ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling

Updated June 10, 2026

VANS

★119

[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO

Updated June 4, 2026

HumanAesExpert

★118

Official implementation of "HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment"

Updated June 1, 2026

StereoPilot

★115

The official implementation of StereoPilot

RoboMaster

★107

[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control

Updated May 16, 2026

Uniaa

★94

Unified Multi-modal IAA Baseline and Benchmark

Updated March 9, 2026

VideoCanvas

★70

Official Code of "VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning"

Unknown language

Updated May 2, 2026

MODA

★69

[ICML 2025 Spotlight] MODA: MOdular Duplex Attention for Multimodal Perception, Cognition, and Emotion Understanding

AvatarForcing

★68

Official PyTorch implementation of AvatarForcing: One-Step Streaming Talking Avatars via Local-Future Sliding-Window Denoising

Updated June 10, 2026

VMoBA

★65

Official implementation of the paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"

Updated April 2, 2026

SPF-Portrait

★63

Official implementation of "SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning"

Unknown language

Updated April 24, 2026

PhysMaster

★57

Official repository of PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning

Unknown language

Updated May 8, 2026

T2I-CoReBench

★53

[ICLR'26] Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play?

Updated May 8, 2026

Alchemist

★39

No description provided for this repository.

Updated May 12, 2026

diffusing-right-space

★31

Metric implementation and raw data of "Diffusing in the Right Space: A Systematic Study of Latent Diffusability"

DecMem

★19

DecMem: Towards Minute-Long Consistent World Generation with Decoupled Memory

Updated June 8, 2026

VidEmo

★15

[NeurIPS'25] VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models

Updated April 30, 2026

SegTune

★14

[ACL'26 Oral] Official implementation of "SegTune: Structured and Fine-Grained Control for Song Generation".

VQRAE

★12

VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction

Updated May 28, 2026

VFRTok

★11

Official implementation of NeurIPS'25 paper "VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption"

Updated June 8, 2026

IMBA-Loss

★11

[ICCV 2025] Official Implementation of the Paper "Imbalance in Balance: Online Concept Balancing in Generation Models".

Updated January 16, 2026

DVIS_Plus

★10

Decoupled Video Instance Segmentation Framework, an improved version of DVIS

Updated May 17, 2026

TexEditor

★7

TexEditor: Structure-Preserving Text-Driven Texture Editing

Jupyter Notebook

Updated June 2, 2026

DVIS

★7

Decoupled Video Instance Segmentation Framework

Updated May 17, 2026

ScalingCache

★5

[ICLR 2026] ScalingCache: Extreme Acceleration of DiTs through Difference Scaling and Dynamic Interval Caching

Updated April 28, 2026

kling-waic-express

★4

This is the program supporting KlingAI Express at WAIC 2025.

Kotlin

SocioEmoDialog

★4

Scripts for processing and evaluating SocioEmoDialog datasets. It includes the core processing scripts, evaluation metrics, and additional documentation.