Computer Vision and Learning research group at Ludwig Maximilian University of Munich (formerly Computer Vision Group at Heidelberg University)
62
Public repositories
100,472
Total stars
4,404
Followers
CompVis is a research group at Ludwig Maximilian University of Munich, focusing on Computer Vision and Learning. Their GitHub presence features a wide range of public repositories, primarily in Python and Jupyter Notebook, with notable projects like stable-diffusion and latent-diffusion contributing to advancements in image synthesis and diffusion models.
A latent text-to-image diffusion model
High-Resolution Image Synthesis with Latent Diffusion Models
Taming Transformers for High-Resolution Image Synthesis
[AAAI 2025, Oral] DepthFM: Fast Monocular Depth Estimation with Flow Matching
source code for the ECCV18 paper A Style-Aware Content Loss for Real-time HD Style Transfer
[CVPR 2026] A PyTorch implementation of the paper "EDGS: Eliminating Densification for Efficient Convergence of 3DGS"
A generative model conditioned on shape and appearance.
Is a geometric model required to synthesize novel views from a single image?
A PyTorch implementation of the paper "ZigMa: A DiT-Style Mamba-based Diffusion Model" (ECCV 2024)
Source code for the paper "Divide and Conquer the Embedding Space for Metric Learning", CVPR 2019
[ECCV 2024, Oral] FMBoost: Boosting Latent Diffusion with Flow Matching
Network-to-Network Translation with Conditional Invertible Neural Networks
Implementation of Stochastic Image-to-Video Synthesis using cINNs.
No description provided for this repository.
TensorFlow implementation of our CVPR 2021 Paper "Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes".
Official codebase for the Paper “Retrieval-Augmented Diffusion Models”
Fine-Grained Subject-Specific Attribute Expression Control in T2I Models
No description provided for this repository.
No description provided for this repository.
ImageBART: Bidirectional Context with Multinomial Diffusion for Autoregressive Image Synthesis
The official implementation of "[MASK] is All You Need"
A Disentangling Invertible Interpretation Network
[CVPR 2025] Diff2Flow: Training Flow Matching Models via Diffusion Model Alignment
Content and Style Disentanglement for Artistic Style Transfer [ICCV19]
[CVPR 2026] Denoising, Fast and Slow: Difficulty-Aware Adaptive Sampling for Image Generation
No description provided for this repository.
[ICLR 2026] Adapting Self-Supervised Representations as a Latent Space for Efficient Generation
Making Sense of CNNs: Interpreting Deep Representations & Their Invariances with Invertible Neural Networks
No description provided for this repository.
No description provided for this repository.
iPOKE: Poking a Still Image for Controlled Stochastic Video Synthesis
[ICCV 2025] SCFlow: Implicitly Learning Style and Content Disentanglement with Flow Models
No description provided for this repository.
No description provided for this repository.
[WACV 2025] DistillDIFT: Distillation of Diffusion Features for Semantic Correspondence
Code for GCPR 2020 Oral : "Unsupervised Part Discovery by Unsupervised Disentanglement"
No description provided for this repository.
[NeurIPS 2025] DisMo: DIsentangled Motion Representations for Open-World Motion Transfer
MaskFlow: Discrete Flows For Flexible and Efficient Long Video Generation
No description provided for this repository.
[AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?
Unsupervised Robust Disentangling of Latent Characteristics for Image Synthesis
Content Transformation Block For Image Style Transfer [CVPR19]
Source code for the paper "Improving Deep Metric Learning byDivide and Conquer"
Dataset provided with the article "Deep learning for cuneiform sign detection with weak supervision using transliteration alignment". It comprises image references, transliterations and sign annotations of clay tablets from the Neo-Assyrian epoch.
Visual search interface
Code for the article "Deep learning of cuneiform sign detection with weak supervision using transliteration alignment"
Towards Learning a Realistic Rendering of Human Behavior
Unsupervised Magnification of Posture Deviations Across Subjects
[CVPR 2026] Probabilistic Precipitation Nowcasting with Rectified Flow Transformers
[ICCV 2025] Stochastic Interpolants for Revealing Stylistic Flows across the History of Art
Landing point for "Envisioning the Future, One Step at a Time"
No description provided for this repository.
Official project page for the paper "WaSt-3D: Wasserstein-2 Distance for Scene-to-Scene Stylization on 3D Gaussians"
No description provided for this repository.
Source Code + Documentation of our Automatic Behavior Analysis Software
Code for demo web application of the article "Deep learning for cuneiform sign detection with weak supervision using transliteration alignment".
No description provided for this repository.
No description provided for this repository.
Code for our paper "CliqueCNN: Deep Unsupervised Exemplar Learning" https://arxiv.org/abs/1608.08792
Deep Unsupervised Similarity Learning using Partially Ordered Sets (CVPR17)
The official implementation of "[MASK] is All You Need"
CompVis builds several notable projects on GitHub, including stable-diffusion, latent-diffusion, and taming-transformers. These repositories focus on advanced techniques in image synthesis and diffusion models, showcasing their research contributions.
CompVis primarily utilizes Python and Jupyter Notebook for their projects, along with JavaScript, HTML, Matlab, and CSS. This variety of languages supports their diverse research initiatives in computer vision and learning.
Yes, CompVis's repositories are public on GitHub. This open access allows the research community and developers to explore and contribute to their significant projects in computer vision and learning.
Monitor CompVis - Computer Vision and Learning LMU Munich with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account