RepoGuard
Updated 8 h ago
Vision CAIR Research Group, KAUST

Organization

Public GitHub footprint of Vision CAIR Research Group, KAUST

@Vision-CAIR
View profile on GitHub

Vision CAIR Group, KAUST, supported by Mohamed Elhoseiny

39

Public repositories

28,095

Total stars

598

Followers

Vision CAIR Research Group, KAUST, maintains a significant public presence on GitHub, showcasing a wide range of repositories primarily in Python, Jupyter Notebook, HTML, and JavaScript. Notable projects include MiniGPT-4 and ChatCaptioner, which contribute to advancements in video understanding and image captioning.

Top languages

Python 24Jupyter Notebook 7HTML 2JavaScript 1

Public repositories

MiniGPT-4

25,680

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python
Updated Jun 12, 2026

MiniGPT4-video

639

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python
Updated May 19, 2026

ChatCaptioner

468

Official Repository of ChatCaptioner

Jupyter Notebook
Updated Mar 5, 2026

LongVU

427

[ICML 2025] Official PyTorch implementation of LongVU

Python
Updated Jun 2, 2026

VisualGPT

342

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

Python
Updated Mar 24, 2026

MiniGPT-Med

139

Open-sourced code of MiniGPT-Med

Python
Updated May 12, 2026

3DCoMPaT-v2

99

3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

Python
Updated May 30, 2026

MammalNet

48

No description provided for this repository.

Python
Updated May 22, 2026

LTVRR

35

No description provided for this repository.

Python
Updated Jan 4, 2024

artemis-v2

30

Code for the paper: It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection

Jupyter Notebook
Updated May 14, 2026

RelTransformer

29

No description provided for this repository.

Python
Updated May 31, 2024

dochaystacks

26

Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025

Python
Updated Jun 8, 2026

Infinibench

20

Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows

Python
Updated Apr 24, 2026

3DCoMPaT

18

Official repository for the 3DCoMPaT dataset (ECCV2022 Oral)

Jupyter Notebook
Updated Mar 21, 2026

iMotion-LLM

14

No description provided for this repository.

Python
Updated Apr 28, 2026

affectiveVisDial

13

No description provided for this repository.

Python
Updated Jun 20, 2025

saai-factory-tutorial-creative-ai

11

Creative AI for Visual Art and Music slides and demos.

Unknown Language
Updated Apr 29, 2024

AF-Guide

10

Official repository of Action-Free Guide

Python
Updated May 2, 2026

CWAN

7

Creative Walk Adversarial Networks: Novel Art Generation with Probabilistic Random Walk Deviation from Style Norms

Python
Updated Jul 25, 2023

Freshness-Aware-PER

6

No description provided for this repository.

Python
Updated May 4, 2026

HalentNet

6

No description provided for this repository.

Python
Updated Jul 7, 2024

CIZSLv2

6

CIZSL++: Creativity Inspired Generative Zero-Shot Learning. T-PAMI under review.

Python
Updated Jan 23, 2024

WAGA

6

Code for Wölfflin Affective Generative Analysis paper published in ICCC 2021

Jupyter Notebook
Updated Sep 15, 2023

cs326-few-shot-classification

5

CS326 Practical assignment #2: few-shot classification

Python
Updated Dec 30, 2022

GRaWD

4

Imaginative Walks: Generative Random Walk Deviation Loss for Improved Unseen Learning Representation. CVPR 2022 Workshop, ICCC 2022.

Python
Updated Dec 6, 2022

artelingo

3

No description provided for this repository.

Jupyter Notebook
Updated Jul 9, 2024

UnlikelihoodMotionForecasting

3

No description provided for this repository.

Jupyter Notebook
Updated Jul 29, 2023

HomeGPT

1

No description provided for this repository.

Jupyter Notebook
Updated Nov 30, 2024

ROLL

0

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python
Updated May 4, 2026

Goldfish_website

0

No description provided for this repository.

JavaScript
Updated Dec 10, 2024

3dreftransformer

0

No description provided for this repository.

Unknown Language
Updated Sep 26, 2024

FishNet2.0

0

No description provided for this repository.

Unknown Language
Updated Jun 9, 2024

affective-vision-language

0

No description provided for this repository.

Python
Updated Aug 29, 2023

Zero-Shot-Learning

0

VisionCAIR Zero-Shot Learning Research

HTML
Updated Dec 10, 2021

Affective-and-Creative-AI

0

VisionCAIR Affective and Creative AI Research

HTML
Updated Oct 17, 2021

lifelong_fact_learning

0

No description provided for this repository.

Python
Updated Aug 14, 2020

feelings

0

No description provided for this repository.

Python
Updated Apr 29, 2020

CIZSL

0

Creativity Inspired Zero-Shot Learning

Unknown Language
Updated Mar 10, 2020

GDPP

0

Generator loss to reduce mode-collapse and to improve the generated samples quality.

Unknown Language
Updated Mar 10, 2020

Frequently asked questions

What does Vision-CAIR build on GitHub?

Vision-CAIR builds various projects on GitHub, focusing on artificial intelligence and machine learning. Their repositories include MiniGPT-4 for natural language processing and ChatCaptioner for caption generation, among others.

Which programming languages does Vision-CAIR use?

Vision-CAIR primarily uses Python and Jupyter Notebook for their projects, along with HTML and JavaScript. This diverse language usage supports their work in AI, machine learning, and data visualization.

Are Vision-CAIR's repositories public?

Yes, all of Vision-CAIR's repositories are public on GitHub. This openness allows other researchers and developers to access, use, and contribute to their innovative projects in the field of AI.

Is this exposure intended?

Monitor Vision CAIR Research Group, KAUST with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account