Apify는 GitHub에서 다양한 오픈 소스 프로젝트를 통해 웹 스크래핑과 브라우저 자동화 솔루션을 제공합니다. 주요 언어로는 TypeScript, Python, JavaScript, Rust가 있으며, crawlee 및 crawlee-python과 같은 광범위한 리포지토리를 통해 신뢰할 수 있는 크롤러를 구축할 수 있습니다. 이들 리포지토리는 AI 및 데이터 수집에 대한 폭넓은 활용을 지원합니다.
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Parsel, BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.
Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.
Collection of Apify Agent Skills
The Apify MCP server enables your AI agents to extract data from social media, search engines, maps, e-commerce sites, or any other website using thousands of ready-made scrapers, crawlers, and automation tools available on the Apify Store.
Node.js implementation of a proxy server (think Squid) with support for SSL, HTTP/HTTPS, SOCKS5, authentication, and upstream proxy chaining.
HTTP client made for scraping based on got.
A universal CLI client for MCP. mcpc supports persistent sessions, stdio/HTTP, OAuth 2.1, tasks, JSON output for code mode, proxy for AI sandboxes, x402, and more.
impit | rust library for browser impersonation
Experimental Camoufox JS port
Apify command-line interface helps you create, develop, build and run Apify Actors, and manage the Apify cloud platform.
A MCP Server for the RAG Web Browser Actor
Community collection of Apify agent skills for AI coding assistants
Apify SDK monorepo
Apify SDK for Python—The official library for building Apify Actors: serverless cloud programs for web scraping, browser automation, data processing, and AI agents. Manages the Actor lifecycle, storages (datasets, key-value stores, request queues), events, proxies, and pay-per-event monetization. Built on top of the the Apify API Client.
Apify actor that opens a web page in headless Chrome and analyzes the HTML and JavaScript objects, looks for schema.org microdata and JSON-LD metadata, analyzes AJAX requests, etc.
House of Apify Scrapers. Generic scraping actors with a simple UI to handle complex web crawling and scraping use cases.
A Node.js library to easily manage and rotate a pool of web browsers, using any of the popular browser automation libraries like Puppeteer, Playwright, or SecretAgent.
Apify API client for Python—Programmatically run Actors, manage and stream data from storages (datasets, key-value stores, request queues), schedule and monitor runs, and access the full Apify platform API. Sync and async interfaces with automatic retries and pagination.
Base Docker images for Apify actors.
Generates realistic browser fingerprints
This whitepaper describes a new concept for building serverless microapps called Actors, which are easy to develop, share, integrate, and build upon. Actors are a reincarnation of the UNIX philosophy for programs running in the cloud.
Apify API client for JavaScript / Node.js.
Index of all Model Context Protocol (MCP) clients and their capabilities
Home of fingerprint injector.
RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.
NodeJs package for generating browser-like headers.
This project is the home of Apify's documentation.
This project is the :house: home of Apify Actor templates to help users quickly get started. Contributions welcome!
Generic REST API for scraping websites. Drop-in replacement for ScrapingBee, ScrapingAnt, and ScraperAPI services. And it is open-source!
n8n node to interact with Apify APIs
JavaScript / Node.js library to stream data into an XLSX file
The /llms.txt Generator Actor 🕸️📄 extracts website content to create an llms.txt file for AI apps 🤖✨ like LLM fine-tuning and indexing. Output is available 📥 in the Key-Value Store for easy download and integration into workflows. 🚀
OpenClaw extension integration
I Don't Care About Cookies extension compiled for use with Playwright/Puppeteer
A curated collection of awesome MCP servers, published and monetized as Actors on Apify
Utilities and constants shared across Apify projects.
Contains a boilerplate of an Apify actor to help you get started quickly build your own actors.
Apify integration for Zapier
A GitHub Action to push an Actor the the Apify platform
Apify's reusable github workflows
A Homebrew tap for Apify tools
Transfer data from Apify Actors to vector databases (Chroma, Milvus, Pinecone, PostgreSQL (PG-Vector), Qdrant, and Weaviate)
This is a template repo for the n8n single Actor apps.
The Finance Monitoring AI Agent 📊💹 analyzes specific tickers, gathering and processing data to generate insightful reports 📈📉. Designed for investors and analysts, this agent provides detailed performance analysis and trends. 🚀
Example Apify Actor written in Python
An example repository showcasing how you can scrape in parallel using one request queue
Apify integration for LangChain 🦜🔗
Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.
Open-source Actor that provides a sandbox for secure execution of AI generated code. Supports Node.js, Python. Provides pre-configured Claude Code, Codex CLI, and OpenCode. 📦
이 저장소에 대한 설명이 제공되지 않았습니다.
An example repository with multiple Apify Actors sharing code between each other.
Patched fork of `ruslts` for `impit`
이 저장소에 대한 설명이 제공되지 않았습니다.
Local emulation of the apify-client NPM package, which enables local use of Apify SDK.
Apify ESLint preset to be shared between projects
A Rust implementation of the filesystem storage used by the Crawlee web scraping framework
Example of Python Scrapy project. It scrapes book data from https://books.toscrape.com/.
The official integration for Apify and Haystack 2.0
Teach your agents to scrape real-time data with this self-guided workshop.
Apify nodes for n8n.
Constants and utilities shared across Apify's Python libraries and projects.
Get your documents ready for gen AI
🔎 Hunt down social media accounts by username across social networks
이 저장소에 대한 설명이 제공되지 않았습니다.
AI Agents & MCPs & AI Workflow Automation • (~400 MCP servers for AI agents) • AI Automation / AI Agent with MCPs • AI Workflows & AI Agents • MCPs for AI Agents
The Github action that makes sure that each PR is correctly set up and has a milestone set.
이 저장소에 대한 설명이 제공되지 않았습니다.
Special, yet insignificant actors
All Dify Plugins listed in Dify Marketplace, plus illustrated plugin examples.
이 저장소에 대한 설명이 제공되지 않았습니다.
TypeScript configuration shared across projects in Apify.
Langflow is a powerful tool for building and deploying AI-powered agents and workflows.
이 저장소에 대한 설명이 제공되지 않았습니다.
This action simplify creating of release PR
이 저장소에 대한 설명이 제공되지 않았습니다.
Tools & lib to test actors on the Apify platform
A model-driven approach to building AI agents in just a few lines of code.
HTTP specific Tower utilities.
Patched fork of h2 for impit
Apify's custom GitHub Actions for internal use
Apify's fork of `docusaurus-plugin-typedoc-api`, customized for our Python documentation.
이 저장소에 대한 설명이 제공되지 않았습니다.
This Actor is running in a schedule every day and monitors the log for new slow queries
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent.
Documentation for the Strands Agents SDK. A model-driven approach to building AI agents in just a few lines of code.
이 저장소에 대한 설명이 제공되지 않았습니다.
A simple actor used to test the Apify MCP server
Template for Claude managed agents Actors
이 저장소에 대한 설명이 제공되지 않았습니다.
The agent that grows with you
This Actor maps your Apify dataset items into HubSpot company fields and performs imports
Apify oxlint preset to be shared between projects
Common utilities used with hyper.
Local service that mocks recombee, completely vibe coded 🤖
Official Apify powers for Kiro IDE — web scraping, data extraction, and Actor development
An open-source long-horizon SuperAgent harness that researches, codes, and creates. With the help of sandboxes, memories, tools, skill, subagents and message gateway, it handles different levels of tasks that could take minutes to hours.
Repository to define an organization (or team) wide Github Actions workflows
an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM
A set of tools that gives agents powerful capabilities.
Apify는 웹 스크래핑 및 브라우저 자동화에 중점을 두고 다양한 프로젝트를 개발합니다. 주요 리포지토리인 crawlee와 crawlee-python은 신뢰할 수 있는 크롤러를 구축하는 데 도움을 줍니다.
Apify는 TypeScript, Python, JavaScript, Rust 등 여러 프로그래밍 언어를 사용하여 다양한 프로젝트를 개발합니다. 이들 언어는 웹 스크래핑과 크롤링에 적합합니다.
네, apify의 모든 리포지토리는 공개되어 있습니다. 이를 통해 개발자들은 다양한 도구와 라이브러리를 사용할 수 있으며, 코드에 기여할 수 있는 기회를 제공합니다.