Updated 9 hours ago

Organization

Public GitHub footprint of Vision CAIR Research Group, KAUST

@Vision-CAIR

Vision CAIR Group, KAUST, supported by Mohamed Elhoseiny

Public repositories: 39 (as listed below)

Total stars: 28,095

Followers: 598

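The public GitHub presence of Vision CAIR Research Group reflects its work in artificial intelligence and computer vision. Its repositories are written mainly in Python and Jupyter Notebook and include well-known projects such as MiniGPT-4 and ChatCaptioner, with applications ranging from video understanding to affective image captioning.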

Top languages

Python 24, Jupyter Notebook 7, HTML 2, JavaScript 1

Public repositories

MiniGPT-4

★25,680

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python

Updated June 12, 2026

MiniGPT4-video

★639

Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding

Python

Updated May 19, 2026

ChatCaptioner

★468

Official Repository of ChatCaptioner

Jupyter Notebook

Updated March 5, 2026

LongVU

★427

[ICML 2025] Official PyTorch implementation of LongVU

Python

Updated June 2, 2026

VisualGPT

★342

VisualGPT, CVPR 2022 Proceeding, GPT as a decoder for vision-language models

Python

Updated March 24, 2026

MiniGPT-Med

★139

Open-sourced code of MiniGPT-Med

Python

Updated May 12, 2026

3DCoMPaT-v2

★99

3DCoMPaT++: An improved large-scale 3D vision dataset for compositional recognition

Python

Updated May 30, 2026

MammalNet

★48

No description provided for this repository.

Python

Updated May 22, 2026

LTVRR

★35

No description provided for this repository.

Python

Updated January 4, 2024

artemis-v2

★30

Code for the paper: It is Okay to Not Be Okay: Overcoming Emotional Bias in Affective Image Captioning by Contrastive Data Collection

Jupyter Notebook

Updated May 14, 2026

RelTransformer

★29

No description provided for this repository.

Python

Updated May 31, 2024

dochaystacks

★26

Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025

Python

Updated June 8, 2026

Infinibench

★20

Official InfiniBench: A Benchmark for Large Multi-Modal Models in Long-Form Movies and TV Shows

Python

Updated April 24, 2026

3DCoMPaT

★18

Official repository for the 3DCoMPaT dataset (ECCV2022 Oral)

Jupyter Notebook

Updated March 21, 2026

iMotion-LLM

★14

No description provided for this repository.

Python

Updated April 28, 2026

affectiveVisDial

★13

No description provided for this repository.

Python

Updated June 20, 2025

saai-factory-tutorial-creative-ai

★11

Creative AI for Visual Art and Music slides and demos.

Unknown language

Updated April 29, 2024

AF-Guide

★10

Official repository of Action-Free Guide

Python

Updated May 2, 2026

CWAN

★7

Creative Walk Adversarial Networks: Novel Art Generation with Probabilistic Random Walk Deviation from Style Norms

Python

Updated July 25, 2023

Freshness-Aware-PER

★6

No description provided for this repository.

Python

Updated May 4, 2026

HalentNet

★6

No description provided for this repository.

Python

Updated July 7, 2024

CIZSLv2

★6

CIZSL++: Creativity Inspired Generative Zero-Shot Learning. T-PAMI under review.

Python

Updated January 23, 2024

WAGA

★6

Code for Wölfflin Affective Generative Analysis paper published in ICCC 2021

Jupyter Notebook

Updated September 15, 2023

cs326-few-shot-classification

★5

CS326 Practical assignment #2: few-shot classification

Python

Updated December 30, 2022

GRaWD

★4

Imaginative Walks: Generative Random Walk Deviation Loss for Improved Unseen Learning Representation. CVPR 2022 Workshop, ICCC 2022.

Python

Updated December 6, 2022

artelingo

★3

No description provided for this repository.

Jupyter Notebook

Updated July 9, 2024

UnlikelihoodMotionForecasting

★3

No description provided for this repository.

Jupyter Notebook

Updated July 29, 2023

HomeGPT

★1

No description provided for this repository.

Jupyter Notebook

Updated November 30, 2024

ROLL

★0

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python

Updated May 4, 2026

Goldfish_website

★0

No description provided for this repository.

JavaScript

Updated December 10, 2024

3dreftransformer

★0

No description provided for this repository.

Unknown language

Updated September 26, 2024

FishNet2.0

★0

No description provided for this repository.

Unknown language

Updated June 9, 2024

affective-vision-language

★0

No description provided for this repository.

Python

Updated August 29, 2023

Zero-Shot-Learning

★0

VisionCAIR Zero-Shot Learning Research

HTML

Updated December 10, 2021

Affective-and-Creative-AI

★0

VisionCAIR Affective and Creative AI Research

HTML

Updated October 17, 2021

lifelong_fact_learning

★0

No description provided for this repository.

Python

Updated August 14, 2020

feelings

★0

No description provided for this repository.

Python

Updated April 29, 2020

CIZSL

★0

Creativity Inspired Zero-Shot Learning

Unknown language

Updated March 10, 2020

GDPP

★0

Generator loss to reduce mode-collapse and to improve the generated samples quality.

Unknown language

Updated March 10, 2020

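Frequently asked questions

What does Vision-CAIR build on GitHub?

Vision-CAIR mainly develops projects in artificial intelligence and computer vision, including MiniGPT-4 and ChatCaptioner, and releases them as open source.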

Which programming languages does Vision-CAIR use?

Vision-CAIR works mainly in Python (24 repositories) and Jupyter Notebook (7), with a few repositories in HTML (2) and JavaScript (1).

Are Vision-CAIR's repositories public?

Yes. The repositories listed on this page are public, so developers and researchers can access the group's open-source projects.
