RepoGuard
Updated 7 h ago
Apify

Organization

Public GitHub footprint of Apify

@apify
View profile on GitHub

Put the web to work.

227

Public repositories

44,611

Total stars

2,030

Followers

Apify's public GitHub presence showcases a wide range of repositories primarily focused on web scraping and browser automation. Their notable projects include Crawlee, which is available in both TypeScript and Python, and various tools for proxy management and browser fingerprinting. The organization utilizes several programming languages, including TypeScript, Python, and JavaScript, to support its diverse offerings.

Top languages

TypeScript 38Python 23JavaScript 12Rust 6MDX 3Dockerfile 2Ruby 1C# 1

Public repositories

crawlee

23,755

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

TypeScript
Updated Jun 13, 2026

crawlee-python

9,178

Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

Python
Updated Jun 13, 2026

fingerprint-suite

2,389

Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

TypeScript
Updated Jun 12, 2026

agent-skills

2,143

Collection of Apify Agent Skills

Python
Updated Jun 13, 2026

apify-mcp-server

1,333

The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and automation tools available on the Apify Store.

TypeScript
Updated Jun 13, 2026

proxy-chain

1,006

Node.js implementation of a proxy server (think Squid) with support for SSL, HTTP/HTTPS, SOCKS5, authentication, and upstream proxy chaining.

JavaScript
Updated Jun 13, 2026

got-scraping

755

HTTP client made for scraping based on got.

TypeScript
Updated Jun 6, 2026

mcpc

686

A universal CLI client for MCP. mcpc supports persistent sessions, stdio/HTTP, OAuth 2.1, tasks, JSON output for code mode, proxy for AI sandboxes, x402, and more.

TypeScript
Updated Jun 12, 2026

impit

500

impit | rust library for browser impersonation

Rust
Updated Jun 12, 2026

camoufox-js

229

Experimental Camoufox JS port

TypeScript
Updated Jun 11, 2026

apify-cli

222

Apify command-line interface helps you create, develop, build and run Apify Actors, and manage the Apify cloud platform.

TypeScript
Updated Jun 10, 2026

mcp-server-rag-web-browser

204

A MCP Server for the RAG Web Browser Actor

JavaScript
Updated May 29, 2026

awesome-skills

201

Community collection of Apify agent skills for AI coding assistants

Python
Updated Jun 13, 2026

apify-sdk-js

178

Apify SDK monorepo

MDX
Updated Jun 12, 2026

apify-sdk-python

170

Apify SDK for Python—The official library for building Apify Actors: serverless cloud programs for web scraping, browser automation, data processing, and AI agents. Manages the Actor lifecycle, storages (datasets, key-value stores, request queues), events, proxies, and pay-per-event monetization. Built on top of the the Apify API Client.

Python
Updated Jun 12, 2026

actor-page-analyzer

152

Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.

JavaScript
Updated May 16, 2026

actor-scraper

138

House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.

TypeScript
Updated Jun 6, 2026

browser-pool

99

A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.

TypeScript
Updated May 27, 2026

apify-client-python

93

Apify API client for Python—Programmatically run Actors, manage and stream data from storages (datasets, key-value stores, request queues), schedule and monitor runs, and access the full Apify platform API. Sync and async interfaces with automatic retries and pagination.

Python
Updated Jun 12, 2026

apify-actor-docker

91

Base Docker images for Apify actors.

Dockerfile
Updated Jun 12, 2026

fingerprint-generator

89

Generates realistic browser fingerprints

TypeScript
Updated Jun 6, 2026

actor-whitepaper

88

This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.

Python
Updated Apr 12, 2026

apify-client-js

86

Apify API client for JavaScript / Node.js.

TypeScript
Updated Jun 12, 2026

mcp-client-capabilities

79

Index of all Model Context Protocol (MCP) clients and their capabilities

TypeScript
Updated Jun 11, 2026

fingerprint-injector

75

Home of fingerprint injector.

TypeScript
Updated May 9, 2026

actor-rag-web-browser

72

RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.

TypeScript
Updated Jun 11, 2026

header-generator

71

NodeJs package for generating browser-like headers.

TypeScript
Updated Apr 10, 2026

apify-docs

69

This project is the home of Apify's documentation.

JavaScript
Updated Jun 12, 2026

actor-templates

56

This project is the :house: home of Apify Actor templates to help users quickly get started. Contributions welcome!

Python
Updated Jun 4, 2026

super-scraper

40

Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!

TypeScript
Updated Jun 4, 2026

n8n-nodes-apify-old

37

n8n node to interact with Apify APIs

TypeScript
Updated Apr 21, 2026

xlsx-stream

31

JavaScript / Node.js library to stream data into an XLSX file

JavaScript
Updated Jun 4, 2026

actor-llmstxt-generator

31

The /llms.txt Generator Actor 🕸️📄 extracts website content to create an llms.txt file for AI apps 🤖✨ like LLM fine-tuning and indexing. Output is available 📥 in the Key-Value Store for easy download and integration into workflows. 🚀

Python
Updated May 25, 2026

apify-openclaw-plugin

23

OpenClaw extension integration

TypeScript
Updated Jun 2, 2026

idcac

19

I Don't Care About Cookies extension compiled for use with Playwright/Puppeteer

JavaScript
Updated May 31, 2026

actor-mcp-servers

19

A curated collection of awesome MCP servers, published and monetized as Actors on Apify

TypeScript
Updated May 20, 2026

apify-shared-js

18

Utilities and constants shared across Apify projects.

TypeScript
Updated Jun 12, 2026

actor-quick-start

18

Contains a boilerplate of an Apify actor to help you get started quickly build your own actors.

Dockerfile
Updated May 22, 2026

apify-zapier-integration

15

Apify integration for Zapier

JavaScript
Updated Jun 8, 2026

push-actor-action

15

A GitHub Action to push an Actor the the Apify platform

Unknown Language
Updated Jun 4, 2026

workflows

14

Apify's reusable github workflows

TypeScript
Updated Jun 10, 2026

homebrew-tap

12

A Homebrew tap for Apify tools

Ruby
Updated Jun 3, 2026

actor-vector-database-integrations

9

Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)

Python
Updated Mar 19, 2026

n8n-nodes-apify-template

8

This is a template repo for the n8n single Actor apps.

TypeScript
Updated Jun 11, 2026

actor-finance-monitoring-agent

7

The Finance Monitoring AI Agent 📊💹 analyzes specific tickers, gathering and processing data to generate insightful reports 📈📉. Designed for investors and analysts, this agent provides detailed performance analysis and trends. 🚀

Python
Updated May 28, 2026

actor-example-python

7

Example Apify Actor written in Python

Python
Updated May 4, 2026

crawlee-parallel-scraping-example

7

An example repository showcasing how you can scrape in parallel using one request queue

TypeScript
Updated Mar 26, 2026

langchain-apify

6

Apify integration for LangChain 🦜🔗

Python
Updated May 24, 2026

actor-whitepaper-web

6

Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.

MDX
Updated Apr 2, 2026

actor-ai-code-sandbox

5

Open-source Actor that provides a sandbox for secure execution of AI generated code. Supports Node.js, Python. Provides pre-configured Claude Code, Codex CLI, and OpenCode. 📦

TypeScript
Updated Jun 8, 2026

actor-camoufox-scraper

5

No description provided for this repository.

Python
Updated Apr 7, 2026

actor-monorepo-example

5

An example repository with multiple Apify Actors sharing code between each other.

JavaScript
Updated Mar 13, 2026

rustls

4

Patched fork of `ruslts` for `impit`

Rust
Updated Jun 6, 2026

actor-beautifulsoup-scraper

4

No description provided for this repository.

Python
Updated Jun 4, 2026

apify-storage-local-js

4

Local emulation of the apify-client NPM package, which enables local use of Apify SDK.

TypeScript
Updated May 29, 2026

apify-eslint-config

4

Apify ESLint preset to be shared between projects

JavaScript
Updated Apr 11, 2026

crawlee-storage

3

A Rust implementation of the filesystem storage used by the Crawlee web scraping framework

Rust
Updated Jun 12, 2026

actor-scrapy-books-example

3

Example of Python Scrapy project. It scrapes book data from https://books.toscrape.com/.

Python
Updated Jun 4, 2026

apify-haystack

3

The official integration for Apify and Haystack 2.0

Python
Updated May 19, 2026

agents-data-workshop

3

Teach your agents to scrape real-time data with this self-guided workshop.

Unknown Language
Updated May 13, 2026

n8n-nodes-apify

2

Apify nodes for n8n.

TypeScript
Updated Jun 10, 2026

apify-shared-python

2

Constants and utilities shared across Apify's Python libraries and projects.

Python
Updated May 26, 2026

docling

2

Get your documents ready for gen AI

Python
Updated May 22, 2026

actor-sherlock

2

🔎 Hunt down social media accounts by username across social networks

Python
Updated May 22, 2026

ai-snitch

2

No description provided for this repository.

TypeScript
Updated May 22, 2026

activepieces

2

AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents

TypeScript
Updated May 6, 2026

pull-request-toolkit-action

1

The Github action that makes sure that each PR is correctly set up and has a milestone set.

TypeScript
Updated Jun 5, 2026

apify-claude-code-plugin

1

No description provided for this repository.

Unknown Language
Updated Jun 1, 2026

komparz

1

Special, yet insignificant actors

TypeScript
Updated May 31, 2026

dify-plugins

1

All Dify Plugins listed in Dify Marketplace, plus illustrated plugin examples.

Unknown Language
Updated May 22, 2026

setup-apify-cli-action

1

No description provided for this repository.

Unknown Language
Updated May 21, 2026

apify-tsconfig

1

TypeScript configuration shared across projects in Apify.

Unknown Language
Updated Apr 15, 2026

langflow

1

Langflow is a powerful tool for building and deploying AI-powered agents and workflows.

Python
Updated Mar 16, 2026

apify-cursor-plugin

0

No description provided for this repository.

Unknown Language
Updated Jun 12, 2026

release-pr-action

0

This action simplify creating of release PR

JavaScript
Updated Jun 11, 2026

apify-microsoft-power-automate-integration

0

No description provided for this repository.

C#
Updated Jun 11, 2026

apify-test-tools

0

Tools & lib to test actors on the Apify platform

TypeScript
Updated Jun 10, 2026

strands-harness-sdk

0

A model-driven approach to building AI agents in just a few lines of code.

Unknown Language
Updated Jun 10, 2026

tower-http

0

HTTP specific Tower utilities.

Rust
Updated Jun 9, 2026

h2

0

Patched fork of h2 for impit

Rust
Updated Jun 6, 2026

actions

0

Apify's custom GitHub Actions for internal use

TypeScript
Updated Jun 5, 2026

docusaurus-plugin-typedoc-api

0

Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.

TypeScript
Updated Jun 5, 2026

kilocode-plugin

0

No description provided for this repository.

TypeScript
Updated Jun 2, 2026

mongo-watcher-actor

0

This Actor is running in a schedule every day and monitors the log for new slow queries

TypeScript
Updated Jun 2, 2026

kilocode

0

Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent.

Unknown Language
Updated Jun 1, 2026

strands-docs

0

Documentation for the Strands Agents SDK. A model-driven approach to building AI agents in just a few lines of code.

MDX
Updated Jun 1, 2026

strands-apify

0

No description provided for this repository.

Python
Updated May 28, 2026

mcp-server-test-actor

0

A simple actor used to test the Apify MCP server

TypeScript
Updated May 27, 2026

actor-claude-managed-agent

0

Template for Claude managed agents Actors

TypeScript
Updated May 26, 2026

apify-dify-integration

0

No description provided for this repository.

Python
Updated May 26, 2026

hermes-agent

0

The agent that grows with you

Unknown Language
Updated May 25, 2026

hubspot-data-import-actor

0

This Actor maps your Apify dataset items into HubSpot company fields and performs imports

TypeScript
Updated May 19, 2026

apify-oxlint-config

0

Apify oxlint preset to be shared between projects

JavaScript
Updated May 18, 2026

hyper-util

0

Common utilities used with hyper.

Rust
Updated May 18, 2026

recombee-local

0

Local service that mocks recombee, completely vibe coded 🤖

JavaScript
Updated Apr 30, 2026

kiro-powers

0

Official Apify powers for Kiro IDE — web scraping, data extraction, and Actor development

Unknown Language
Updated Apr 30, 2026

deer-flow

0

An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.

Python
Updated Apr 27, 2026

.github

0

Repository to define an organization (or team) wide Github Actions workflows

Unknown Language
Updated Apr 7, 2026

apify-goose-integration

0

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Unknown Language
Updated Apr 7, 2026

tools

0

A set of tools that gives agents powerful capabilities.

Unknown Language
Updated Mar 18, 2026

Frequently asked questions

What does apify build on GitHub?

Apify builds a variety of web scraping and automation tools on GitHub, with notable projects like Crawlee in TypeScript and Python. Their repositories also include libraries for proxy management and browser fingerprinting.

Which programming languages does apify use?

Apify primarily uses TypeScript, Python, and JavaScript for its projects, alongside Rust and MDX. This diverse language use reflects their focus on building robust web scraping solutions.

Are apify's repositories public?

Yes, all of Apify's repositories are public on GitHub. This transparency allows users and developers to access and contribute to their various web scraping tools and automation libraries.

Is this exposure intended?

Monitor Apify with RepoGuard and get alerted the moment a new public repository appears.

Monitor this account