RepoGuard
Updated 10 h ago
The Apache Software Foundation

Organization

Public GitHub footprint of The Apache Software Foundation

@apache
View profile on GitHub

3,137

Public repositories

586,577

Total stars

23,080

Followers

The Apache Software Foundation maintains a significant presence on GitHub, featuring a wide range of public repositories. Their projects primarily utilize programming languages such as Java, Rust, Python, Scala, Go, and C++. Notable repositories include Apache Superset, Apache ECharts, and Apache Airflow, all contributing to data processing and visualization.

Top languages

Java 43Scala 13Rust 8Go 6TypeScript 4Python 4HTML 3JavaScript 2

Public repositories

superset

73,265

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript
Updated Jun 12, 2026

echarts

66,575

Apache ECharts is a powerful, interactive charting and data visualization library for browser

TypeScript
Updated Jun 13, 2026

airflow

45,793

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python
Updated Jun 12, 2026

spark

43,445

Apache Spark - A unified analytics engine for large-scale data processing

Scala
Updated Jun 13, 2026

kafka

32,802

Apache Kafka - A distributed event streaming platform

Java
Updated Jun 13, 2026

shardingsphere

20,735

Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.

Java
Updated Jun 13, 2026

pouchdb

17,579

:kangaroo: - PouchDB is a pocket-sized database.

JavaScript
Updated Jun 12, 2026

arrow

16,834

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++
Updated Jun 12, 2026

hadoop

15,566

Apache Hadoop

Java
Updated Jun 13, 2026

answer

15,546

A Q&A platform software for teams at any scales. Whether it's a community forum, help center, or knowledge management platform, you can always count on Apache Answer.

Go
Updated Jun 12, 2026

doris

15,464

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java
Updated Jun 12, 2026

pulsar

15,266

Apache Pulsar - distributed pub-sub messaging system

Java
Updated Jun 13, 2026

dolphinscheduler

14,310

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Java
Updated Jun 13, 2026

druid

14,015

Apache Druid: a high performance real-time analytics database.

Java
Updated Jun 12, 2026

tvm

13,463

Open Machine Learning Compiler Framework

Python
Updated Jun 13, 2026

cassandra

9,773

Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure without compromising performance.

Java
Updated Jun 13, 2026

iceberg

8,961

Apache Iceberg

Java
Updated Jun 13, 2026

datafusion

8,868

Apache DataFusion SQL Query Engine

Rust
Updated Jun 13, 2026

beam

8,611

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java
Updated Jun 12, 2026

tomcat

8,185

Apache Tomcat

Java
Updated Jun 12, 2026

couchdb

6,896

Seamless multi-primary syncing database with an intuitive HTTP/JSON API, designed for reliability

Erlang
Updated Jun 12, 2026

iotdb

6,347

Apache IoTDB

Java
Updated Jun 12, 2026

camel

6,231

Apache Camel is an open source integration framework with 300+ connectors. Write routes in Java, YAML, or XML. Run on Spring Boot, Quarkus, or standalone. Apache License 2.0.

Java
Updated Jun 12, 2026

hudi

6,176

Upserts, Deletes And Incremental Processing on Big Data.

Java
Updated Jun 13, 2026

pinot

6,097

Apache Pinot - A realtime distributed OLAP datastore

Java
Updated Jun 13, 2026

hbase

5,534

Apache HBase

Java
Updated Jun 12, 2026

groovy

5,448

Apache Groovy: A powerful multi-faceted programming language for the JVM platform

Java
Updated Jun 13, 2026

opendal

5,167

Apache OpenDAL: One Layer, All Storage.

Rust
Updated Jun 12, 2026

age

4,597

Graph database optimized for fast analysis and real-time data processing. It is provided as an extension to PostgreSQL.

C
Updated Jun 12, 2026

shiro

4,430

Apache Shiro is a powerful and easy-to-use Java security framework that performs authentication, authorization, cryptography, and session management

Java
Updated Jun 13, 2026

fory

4,408

A blazingly fast multi-language serialization framework for idiomatic domain objects, schema IDL, and cross-language data exchange.

Java
Updated Jun 12, 2026

kvrocks

4,333

Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.

C++
Updated Jun 12, 2026

httpd

4,012

Mirror of Apache HTTP Server. Issues: http://issues.apache.org

C
Updated Jun 12, 2026

logging-log4j2

3,603

Apache Log4j is a versatile, feature-rich, efficient logging API and backend for Java.

Java
Updated Jun 13, 2026

arrow-rs

3,494

Official Rust implementation of Apache Arrow

Rust
Updated Jun 13, 2026

maven-mvnd

3,429

Apache Maven Daemon

Java
Updated Jun 12, 2026

nutch

3,191

Apache Nutch is an extensible and scalable web crawler

Java
Updated Jun 13, 2026

parquet-format

2,440

Apache Parquet Format

Thrift
Updated Jun 12, 2026

lucenenet

2,392

Apache Lucene.NET is an open-source full-text search library written in C#, ported from the Apache Lucene project.

C#
Updated Jun 12, 2026

burr

2,379

Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, trace, persist, and execute on your own infrastructure.

Python
Updated Jun 13, 2026

gobblin

2,267

A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.

Java
Updated Jun 12, 2026

poi

2,232

Mirror of Apache POI gitbox. The Java API for Microsoft Documents.

Java
Updated Jun 12, 2026

tinkerpop

2,134

Apache TinkerPop - a graph computing framework

Java
Updated Jun 12, 2026

atlas

2,111

Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond

Java
Updated Jun 12, 2026

bookkeeper

2,001

Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads

Java
Updated Jun 13, 2026

polaris

1,968

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Java
Updated Jun 12, 2026

fluss

1,940

Apache Fluss is a streaming storage built for real-time analytics.

Java
Updated Jun 12, 2026

opennlp

1,600

Apache OpenNLP

Java
Updated Jun 12, 2026

pekko

1,585

Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala

Scala
Updated Jun 12, 2026

ratis

1,453

Open source Java implementation for Raft consensus protocol.

Java
Updated Jun 12, 2026

iceberg-rust

1,319

Apache Iceberg

Rust
Updated Jun 12, 2026

datafusion-comet

1,209

Apache DataFusion Comet Spark Accelerator

Scala
Updated Jun 13, 2026

phoenix

1,059

Apache Phoenix

Java
Updated Jun 12, 2026

flink-kubernetes-operator

1,015

Apache Flink Kubernetes Operator

Java
Updated Jun 12, 2026

camel-k

919

Apache Camel K is a lightweight integration platform, born on Kubernetes, with serverless superpowers

Go
Updated Jun 13, 2026

teaclave

801

Apache Teaclave™ is an open source universal secure computing platform, making computation on privacy-sensitive data safe and simple.

Unknown Language
Updated Jun 12, 2026

arrow-adbc

595

Database connectivity API standard and libraries for Apache Arrow

C#
Updated Jun 12, 2026

sedona-db

465

A single-node analytical database engine with geospatial as a first-class citizen

Rust
Updated Jun 12, 2026

iceberg-go

431

Apache Iceberg - Go

Go
Updated Jun 12, 2026

camel-spring-boot-examples

353

Apache Camel Spring Boot Examples

Java
Updated Jun 13, 2026

skywalking-banyandb

341

An observability database aims to ingest, analyze and store Metrics, Tracing and Logging data.

Go
Updated Jun 13, 2026

spamassassin

337

Read-only mirror of Apache SpamAssassin.

Perl
Updated Jun 13, 2026

incubator-kie-kogito-examples

296

Apache KIE Examples repository with showcases on how to use Kogito, Drools, and jBPM

Java
Updated Jun 12, 2026

echarts-doc

259

Official repository of Apache ECharts documentation

JavaScript
Updated Jun 12, 2026

texera

244

Human-AI Collaborative Data Science Using Visual Workflows

Scala
Updated Jun 13, 2026

dubbo-spi-extensions

206

Apache Dubbo SPI Extensions

Java
Updated Jun 13, 2026

pekko-http

191

The Streaming-first HTTP server/module of Apache Pekko

Scala
Updated Jun 12, 2026

opendal-reqsign

160

Signing HTTP requests without heavy SDKs.

Rust
Updated Jun 12, 2026

maven-archetype

146

Apache Maven Archetype (Plugin)

Java
Updated Jun 12, 2026

skywalking-showcase

132

Showcase Application to demonstrate features of Apache SkyWalking

Makefile
Updated Jun 13, 2026

calcite-avatica-go

125

Apache Calcite Go

Go
Updated Jun 12, 2026

maven-sources

120

Apache Maven Sources

Unknown Language
Updated Jun 12, 2026

jspwiki

116

Apache JSPWiki is a leading open source WikiWiki engine, feature-rich and built around standard JEE components (Java, servlets, JSP)

Java
Updated Jun 12, 2026

royale-compiler

111

Apache Royale Compiler

Java
Updated Jun 12, 2026

maven-javadoc-plugin

102

Apache Maven Javadoc Plugin

Java
Updated Jun 12, 2026

pekko-connectors

76

Apache Pekko Connectors is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Apache Pekko.

Scala
Updated Jun 12, 2026

pekko-grpc

53

Apache Pekko gRPC

Scala
Updated Jun 12, 2026

fluss-rust

52

Rust Client for Apache Fluss (Incubating)

Rust
Updated Jun 12, 2026

pulsar-site

45

Apache Pulsar Site

HTML
Updated Jun 13, 2026

rocketmq-a2a

38

Integrate Apache RocketMQ with A2A

Java
Updated Jun 13, 2026

pekko-management

32

Apache Pekko Management is a suite of tools for operating Pekko Clusters.

Scala
Updated Jun 12, 2026

fory-site

27

Apache Fory Website

TypeScript
Updated Jun 12, 2026

pekko-projection

26

Apache Pekko Projections is intended for building systems with the CQRS pattern, and facilitate in event-based service-to-service communication.

Scala
Updated Jun 12, 2026

infrastructure-actions

25

ASF GitHub Actions Repository

Python
Updated Jun 12, 2026

pekko-persistence-jdbc

24

Asynchronously writes journal and snapshot entries to configured JDBC databases so that Apache Pekko Actors can recover state

Scala
Updated Jun 12, 2026

kvrocks-website

23

Apache Kvrocks Website

CSS
Updated Jun 13, 2026

pekko-persistence-r2dbc

20

Asynchronously writes journal and snapshot entries to configured R2DBC databases so that Apache Pekko Actors can recover state

Scala
Updated Jun 12, 2026

airavata-custos

19

Apache Airavata Custos Security

Go
Updated Jun 12, 2026

daffodil-vscode

17

Apache Daffodil™ Extension for Visual Studio Code

TypeScript
Updated Jun 12, 2026

paimon-mosaic

16

Apache Paimon Mosaic: a columnar-bucket hybrid format optimized for wide tables.

Rust
Updated Jun 12, 2026

maven-dist-tool

13

Apache Maven Distribution Tools

Shell
Updated Jun 12, 2026

pekko-persistence-cassandra

13

A replicated Apache Pekko Persistence journal backed by Apache Cassandra

Scala
Updated Jun 12, 2026

pekko-persistence-dynamodb

12

DynamoDBJournal for Apache Pekko Persistence

Scala
Updated Jun 12, 2026

hbase-site

11

Apache HBase Site

HTML
Updated Jun 12, 2026

maven-archiver

10

Apache Maven Archiver

Java
Updated Jun 12, 2026

skywalking-horizon-ui

5

Apache SkyWalking next-generation UI (Horizon)

Vue
Updated Jun 13, 2026

skywalking-ruby

5

The Ruby agent for Apache SkyWalking

Ruby
Updated Jun 12, 2026

maven-xinclude-extension

5

Apache maven

Java
Updated Jun 12, 2026

tomee-site-pub

4

Apache TomEE published website

HTML
Updated Jun 13, 2026

grails-website

3

Apache Grails Website & Documentation

Unknown Language
Updated Jun 13, 2026

Frequently asked questions

What does apache build on GitHub?

Apache develops various data processing and visualization tools on GitHub. Projects like Apache Superset and Apache Airflow highlight their focus on data analytics and workflow management, while other repositories support distributed event streaming and microservices.

Which programming languages does apache use?

The primary programming languages used by Apache include Java, Rust, Python, Scala, Go, and C++. These languages allow for the development of a diverse range of applications, from data visualization to distributed systems.

Are apache's repositories public?

Yes, all of Apache's repositories on GitHub are public. This openness allows for community contributions and transparency in development, fostering collaboration on numerous projects across various domains.

Is this exposure intended?

Monitor The Apache Software Foundation with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account