RepoGuard
Updated 10 hours ago
Scrapy project

Organization

Public GitHub footprint of Scrapy project

@scrapy

An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

29

Public repositories

74,276

Total stars

808

Followers

Top languages

Python 22, HTML 2, C++ 2, DIGITAL Command Language 1, Shell 1

Public repositories

scrapy

62,224

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python
Updated Jun 13, 2026

scrapyd

3,094

A service daemon to run Scrapy spiders

Python
Updated Jun 13, 2026

scrapely

1,888

A pure-python HTML screen-scraping library

HTML
Updated Jun 9, 2026

dirbot

1,628

Scrapy project to scrape public web directories (educational) [DEPRECATED]

Python
Updated Jun 12, 2026

quotesbot

1,357

This is a sample Scrapy project for educational purposes

Python
Updated Jun 8, 2026

parsel

1,333

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Python
Updated Jun 11, 2026

scrapyd-client

773

Command line client for Scrapyd server

Python
Updated Jun 3, 2026

w3lib

419

Python library of web-related functions

Python
Updated Jun 10, 2026

cssselect

309

CSS Selectors for Python

Python
Updated Jun 1, 2026

queuelib

299

Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python

Python
Updated Jun 1, 2026

loginform

279

Fill HTML login forms automatically

Python
Updated Mar 29, 2026

slybot

224

No description provided for this repository.

Unknown language
Updated Jun 12, 2026

protego

88

A pure-Python robots.txt parser with support for modern conventions.

DIGITAL Command Language
Updated Jun 11, 2026

itemadapter

70

Common interface for data container classes

Python
Updated Jun 1, 2026

scrapy.org

66

The scrapy.org website (old code)

HTML
Updated Jun 3, 2026

itemloaders

49

Library to populate items using XPath and CSS with a convenient API

Python
Updated Jun 2, 2026

booksbot

42

A crawler for http://books.toscrape.com

Python
Updated Dec 8, 2025

scrapy-bench

32

A CLI for benchmarking Scrapy.

Python
Updated Sep 15, 2025

scrapy-lint

22

A linter for Scrapy projects.

Python
Updated Apr 15, 2026

scurl

21

Performance-focused replacement for Python urllib

Python
Updated May 26, 2026

pypydispatcher

16

A fork of http://pydispatcher.sourceforge.net/ with PyPy support

Python
Updated Jun 12, 2024

xtractmime

13

https://mimesniff.spec.whatwg.org/ implementation for Python

Python
Updated Jun 10, 2026

base-chromium

8

base component forked from Chromium source https://chromium.googlesource.com/chromium/src/base/

C++
Updated Mar 10, 2026

scrapy-itemloader

7

[Archived] Library to populate Scrapy items using XPath and CSS with a convenient API

Python
Updated Mar 10, 2026

form2request

5

Python library to build HTTP requests out of HTML forms

Python
Updated Jun 12, 2026

url-chromium

4

url component from Chromium source code, forked from https://chromium.googlesource.com/chromium/src/url

C++
Updated Mar 10, 2026

gsoc2014-integration-tests

3

GSoC2014 - Scrapy Integration tests project

Shell
Updated Jul 6, 2017

scrapy-bench-speedcenter

2

Codespeed for scrapy-bench

Python
Updated May 26, 2026

sphinx-scrapy

1

Sphinx extension for documentation in the Scrapy ecosystem

Python
Updated Jun 11, 2026
