Vision CAIR Group, KAUST, supported by Mohamed Elhoseiny
Public repositories: 39
Total stars: 28,095
Followers: 598
Vision CAIR Research Group, KAUST, maintains public repositories on GitHub written primarily in Python, Jupyter Notebook, HTML, and JavaScript. Notable projects include MiniGPT-4 (vision-language understanding), Goldfish and MiniGPT4-video (long and short video understanding), and ChatCaptioner (caption generation).
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
Official code for the Goldfish model for long video understanding and the MiniGPT4-video model for short video understanding
Official Repository of ChatCaptioner
[ICML 2025] Official PyTorch implementation of LongVU
VisualGPT (CVPR 2022 Proceedings): GPT as a decoder for vision-language models
Open-sourced code of MiniGPT-Med
3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition
No description provided for this repository.
No description provided for this repository.
Code for the paper: It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection
No description provided for this repository.
Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025
Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows
Official repository for the 3DCoMPaT dataset (ECCV2022 Oral)
No description provided for this repository.
No description provided for this repository.
Creative AI for Visual Art and Music slides and demos.
Official repository of Action-Free Guide
Creative Walk Adversarial Networks: Novel Art Generation with Probabilistic Random Walk Deviation from Style Norms
No description provided for this repository.
No description provided for this repository.
CIZSL++: Creativity Inspired Generative Zero-Shot Learning. T-PAMI under review.
Code for Wölfflin Affective Generative Analysis paper published in ICCC 2021
CS326 Practical assignment #2: few-shot classification
Imaginative Walks: Generative Random Walk Deviation Loss for Improved Unseen Learning Representation. CVPR 2022 Workshop, ICCC 2022.
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
VisionCAIR Zero-Shot Learning Research
VisionCAIR Affective and Creative AI Research
No description provided for this repository.
No description provided for this repository.
Creativity Inspired Zero-Shot Learning
Generator loss to reduce mode-collapse and to improve the generated samples quality.
Vision-CAIR's GitHub projects focus on artificial intelligence and machine learning, particularly vision-language research. Their repositories include MiniGPT-4 for vision-language understanding and ChatCaptioner for caption generation, among others.
Vision-CAIR's repositories are written primarily in Python and Jupyter Notebook, with some HTML and JavaScript.
All of the Vision-CAIR repositories listed here are public on GitHub, so other researchers and developers can access, use, and contribute to them.