The Apache Software Foundation maintains a large set of public repositories on GitHub. Its projects are written primarily in Java, Rust, Python, Scala, Go, and C++. Notable repositories include Apache Superset, Apache ECharts, and Apache Airflow, which cover data visualization and workflow orchestration.
Apache Superset is a Data Visualization and Data Exploration Platform
Apache ECharts is a powerful, interactive charting and data visualization library for the browser
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache Spark - A unified analytics engine for large-scale data processing
Apache Kafka - A distributed event streaming platform
Empowering Data Intelligence with Distributed SQL for Sharding, Scalability, and Security Across All Databases.
PouchDB is a pocket-sized database.
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Apache Hadoop
A Q&A platform for teams at any scale. Whether it's a community forum, help center, or knowledge management platform, you can always count on Apache Answer.
Apache Doris is an easy-to-use, high performance and unified analytics database.
Apache Pulsar - distributed pub-sub messaging system
Apache DolphinScheduler is the modern data orchestration platform. Agile creation of high-performance workflows with low code
Apache Druid: a high performance real-time analytics database.
Open Machine Learning Compiler Framework
Open source transactional distributed database. Linear scalability and proven fault-tolerance on commodity hardware or cloud infrastructure without compromising performance.
Apache Iceberg
Apache DataFusion SQL Query Engine
Apache Beam is a unified programming model for Batch and Streaming data processing.
Apache Tomcat
Seamless multi-primary syncing database with an intuitive HTTP/JSON API, designed for reliability
Apache IoTDB
Apache Camel is an open source integration framework with 300+ connectors. Write routes in Java, YAML, or XML. Run on Spring Boot, Quarkus, or standalone. Apache License 2.0.
Upserts, Deletes And Incremental Processing on Big Data.
Apache Pinot - A realtime distributed OLAP datastore
Apache HBase
Apache Groovy: A powerful multi-faceted programming language for the JVM platform
Apache OpenDAL: One Layer, All Storage.
Graph database optimized for fast analysis and real-time data processing. It is provided as an extension to PostgreSQL.
Apache Shiro is a powerful and easy-to-use Java security framework that performs authentication, authorization, cryptography, and session management
A blazingly fast multi-language serialization framework for idiomatic domain objects, schema IDL, and cross-language data exchange.
Apache Kvrocks is a distributed key value NoSQL database that uses RocksDB as storage engine and is compatible with Redis protocol.
Mirror of Apache HTTP Server. Issues: http://issues.apache.org
Apache Log4j is a versatile, feature-rich, efficient logging API and backend for Java.
Official Rust implementation of Apache Arrow
Apache Maven Daemon
Apache Nutch is an extensible and scalable web crawler
Apache Parquet Format
Apache Lucene.NET is an open-source full-text search library written in C#, ported from the Apache Lucene project.
Build applications that make decisions (chatbots, agents, simulations, etc.). Monitor, trace, persist, and execute on your own infrastructure.
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
Mirror of Apache POI gitbox. The Java API for Microsoft Documents.
Apache TinkerPop - a graph computing framework
Apache Atlas - Open Metadata Management and Governance capabilities across the Hadoop platform and beyond
Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Apache Fluss is a streaming storage built for real-time analytics.
Apache OpenNLP
Build highly concurrent, distributed, and resilient message-driven applications using Java/Scala
Open source Java implementation for Raft consensus protocol.
Apache Iceberg
Apache DataFusion Comet Spark Accelerator
Apache Phoenix
Apache Flink Kubernetes Operator
Apache Camel K is a lightweight integration platform, born on Kubernetes, with serverless superpowers
Apache Teaclave™ is an open source universal secure computing platform, making computation on privacy-sensitive data safe and simple.
Database connectivity API standard and libraries for Apache Arrow
A single-node analytical database engine with geospatial as a first-class citizen
Apache Iceberg - Go
Apache Camel Spring Boot Examples
An observability database that aims to ingest, analyze, and store metrics, tracing, and logging data.
Read-only mirror of Apache SpamAssassin.
Apache KIE Examples repository with showcases on how to use Kogito, Drools, and jBPM
Official repository of Apache ECharts documentation
Human-AI Collaborative Data Science Using Visual Workflows
Apache Dubbo SPI Extensions
The Streaming-first HTTP server/module of Apache Pekko
Signing HTTP requests without heavy SDKs.
Apache Maven Archetype (Plugin)
Showcase Application to demonstrate features of Apache SkyWalking
Apache Calcite Go
Apache Maven Sources
Apache JSPWiki is a leading open source WikiWiki engine, feature-rich and built around standard JEE components (Java, servlets, JSP)
Apache Royale Compiler
Apache Maven Javadoc Plugin
Apache Pekko Connectors is a Reactive Enterprise Integration library for Java and Scala, based on Reactive Streams and Apache Pekko.
Apache Pekko gRPC
Rust Client for Apache Fluss (Incubating)
Apache Pulsar Site
Integrate Apache RocketMQ with A2A
Apache Pekko Management is a suite of tools for operating Pekko Clusters.
Apache Fory Website
Apache Pekko Projections is intended for building systems with the CQRS pattern and for facilitating event-based service-to-service communication.
ASF GitHub Actions Repository
Asynchronously writes journal and snapshot entries to configured JDBC databases so that Apache Pekko Actors can recover state
Apache Kvrocks Website
Asynchronously writes journal and snapshot entries to configured R2DBC databases so that Apache Pekko Actors can recover state
Apache Airavata Custos Security
Apache Daffodil™ Extension for Visual Studio Code
Apache Paimon Mosaic: a columnar-bucket hybrid format optimized for wide tables.
Apache Maven Distribution Tools
A replicated Apache Pekko Persistence journal backed by Apache Cassandra
DynamoDBJournal for Apache Pekko Persistence
Apache HBase Site
Apache Maven Archiver
Apache SkyWalking next-generation UI (Horizon)
The Ruby agent for Apache SkyWalking
Apache Maven
Apache TomEE published website
Apache Grails Website & Documentation
Apache develops various data processing and visualization tools on GitHub. Projects like Apache Superset and Apache Airflow highlight their focus on data analytics and workflow management, while other repositories support distributed event streaming and microservices.
The primary programming languages used by Apache include Java, Rust, Python, Scala, Go, and C++. These languages allow for the development of a diverse range of applications, from data visualization to distributed systems.
All of Apache's repositories on GitHub are public. This openness allows community contributions and transparent development, fostering collaboration on projects across many domains.