Computer Vision and Learning research group at Ludwig Maximilian University of Munich (formerly Computer Vision Group at Heidelberg University)
62
公共仓库
100,472
总星标
4,404
关注者
CompVis 是慕尼黑大学计算机视觉与学习研究小组的 GitHub 组织,专注于计算机视觉领域。该组织拥有多种公共仓库,主要使用 Python 和 Jupyter Notebook 进行研究工作。其著名项目包括 stable-diffusion 和 latent-diffusion,这些项目在计算机视觉社区中得到了广泛的应用和关注。
A latent text-to-image diffusion model
High-Resolution Image Synthesis with Latent Diffusion Models
Taming Transformers for High-Resolution Image Synthesis
[AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matching
source code for the ECCV18 paper A Style-Aware Content Loss for Real-time HD Style Transfer
[CVPR 2026] A PyTorch implementation of the paper "EDGS: Eliminating Densification for Efficient Convergence of 3DGS"
A generative model conditioned on shape and appearance.
Is a geometric model required to synthesize novel views from a single image?
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
Source code for the paper "Divide and Conquer the Embedding Space for Metric Learning", CVPR 2019
[ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching
Network-to-Network Translation with Conditional Invertible Neural Networks
Implementation of Stochastic Image-to-Video Synthesis using cINNs.
此仓库未提供描述。
TensorFlow implementation of our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".
Official codebase for the Paper “Retrieval-Augmented Diffusion Models”
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
此仓库未提供描述。
此仓库未提供描述。
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
The official implementation of "[MASK] is All You Need"
A Disentangling Invertible Interpretation Network
[CVPR 2025] Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment
Content and Style Disentanglement for Artistic Style Transfer [ICCV19]
[CVPR 2026] Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
此仓库未提供描述。
[ICLR 2026] Adapting Self-Supervised Representations as a Latent Space for Efficient Generation
Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with Invertible Neural Networks
此仓库未提供描述。
此仓库未提供描述。
iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis
[ICCV 2025] SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
此仓库未提供描述。
此仓库未提供描述。
[WACV 2025] DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence
Code for GCPR 2020 Oral : "Unsupervised Part Discovery by Unsupervised Disentanglement"
此仓库未提供描述。
[NeurIPS 2025] DisMo: DIsentangled Motion Representations for Open-World Motion Transfer
MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation
此仓库未提供描述。
[AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?
Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis
Content Transformation Block For Image Style Transfer [CVPR19]
Source code for the paper "Improving Deep Metric Learning byDivide and Conquer"
Dataset provided with the article "Deep learning for cuneiform sign detection with weak supervision using transliteration alignment". It comprises image references, transliterations and sign annotations of clay tablets from the Neo-Assyrian epoch.
Visual search interface
Code for the article "Deep learning of cuneiform sign detection with weak supervision using transliteration alignment"
Towards Learning a Realistic Rendering of Human Behavior
Unsupervised Magnification of Posture Deviations Across Subjects
[CVPR 2026] Probabilistic Precipitation Nowcasting with Rectified Flow Transformers
[ICCV 2025] Stochastic Interpolants for Revealing Stylistic Flows across the History of Art
Landing point for "Envisioning the Future, One Step at a Time"
此仓库未提供描述。
Official project page for the paper "WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians"
此仓库未提供描述。
Source Code + Documentation of our Automatic Behavior Analysis Software
Code for demo web application of the article "Deep learning for cuneiform sign detection with weak supervision using transliteration alignment".
此仓库未提供描述。
此仓库未提供描述。
Code for our paper "CliqueCNN: Deep Unsupervised Exemplar Learning" https://arxiv.org/abs/1608.08792
Deep Unsupervised Similarity Learning using Partially Ordered Sets (CVPR17)
The official implementation of "[MASK] is All You Need"
CompVis 在 GitHub 上构建了一系列与计算机视觉相关的项目,包括 stable-diffusion 和 latent-diffusion。这些项目提供了先进的图像合成和转换技术,广泛应用于学术和工业研究中。
CompVis 的主要编程语言包括 Python、Jupyter Notebook 和 JavaScript。此外,组织还使用 HTML、Matlab 和 CSS 进行其研究项目的开发和展示。
是的,CompVis 的所有仓库都是公开的。这使得社区成员能够访问其源代码,促进了合作与知识共享,尤其是在计算机视觉研究领域。