RepoGuard
Actualizado 10 h ago
Scrapy project

Organization

Huella pública de GitHub de Scrapy project

@scrapy
Ver perfil en GitHub

An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

29

Repositorios públicos

74.276

Total de estrellas

808

Seguidores

Principales lenguajes

Python 22HTML 2C++ 2DIGITAL Command Language 1Shell 1

Repositorios públicos

scrapy

62.224

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python
Actualizado 13 jun 2026

scrapyd

3094

A service daemon to run Scrapy spiders

Python
Actualizado 13 jun 2026

scrapely

1888

A pure-python HTML screen-scraping library

HTML
Actualizado 9 jun 2026

dirbot

1628

Scrapy project to scrape public web directories (educational) [DEPRECATED]

Python
Actualizado 12 jun 2026

quotesbot

1357

This is a sample Scrapy project for educational purposes

Python
Actualizado 8 jun 2026

parsel

1333

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Python
Actualizado 11 jun 2026

scrapyd-client

773

Command line client for Scrapyd server

Python
Actualizado 3 jun 2026

w3lib

419

Python library of web-related functions

Python
Actualizado 10 jun 2026

cssselect

309

CSS Selectors for Python

Python
Actualizado 1 jun 2026

queuelib

299

Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python

Python
Actualizado 1 jun 2026

loginform

279

Fill HTML login forms automatically

Python
Actualizado 29 mar 2026

slybot

224

No se proporcionó descripción para este repositorio.

Idioma desconocido
Actualizado 12 jun 2026

protego

88

A pure-Python robots.txt parser with support for modern conventions.

DIGITAL Command Language
Actualizado 11 jun 2026

itemadapter

70

Common interface for data container classes

Python
Actualizado 1 jun 2026

scrapy.org

66

The scrapy.org website (old code)

HTML
Actualizado 3 jun 2026

itemloaders

49

Library to populate items using XPath and CSS with a convenient API

Python
Actualizado 2 jun 2026

booksbot

42

A crawler for http://books.toscrape.com

Python
Actualizado 8 dic 2025

scrapy-bench

32

A CLI for benchmarking Scrapy.

Python
Actualizado 15 sept 2025

scrapy-lint

22

A linter for Scrapy projects.

Python
Actualizado 15 abr 2026

scurl

21

Performance-focused replacement for Python urllib

Python
Actualizado 26 may 2026

pypydispatcher

16

A fork of http://pydispatcher.sourceforge.net/ with PyPy support

Python
Actualizado 12 jun 2024

xtractmime

13

https://mimesniff.spec.whatwg.org/ implementation for Python

Python
Actualizado 10 jun 2026

base-chromium

8

base component forked from Chromium source https://chromium.googlesource.com/chromium/src/base/

C++
Actualizado 10 mar 2026

scrapy-itemloader

7

[Archived] Library to populate Scrapy items using XPath and CSS with a convenient API

Python
Actualizado 10 mar 2026

form2request

5

Python library to build HTTP requests out of HTML forms

Python
Actualizado 12 jun 2026

url-chromium

4

url component from Chromium source code, forked from https://chromium.googlesource.com/chromium/src/url

C++
Actualizado 10 mar 2026

gsoc2014-integration-tests

3

GSoC2014 - Scrapy Integration tests project

Shell
Actualizado 6 jul 2017

scrapy-bench-speedcenter

2

Codespeed for scrapy-bench

Python
Actualizado 26 may 2026

sphinx-scrapy

1

Sphinx extension for documentation in the Scrapy ecosystem

Python
Actualizado 11 jun 2026

¿Esta exposición es intencionada?

Monitorea a Scrapy project con RepoGuard y recibe alertas en el momento en que aparece un nuevo repositorio público.

Monitorea esta cuenta