Aktualisiert vor 10 h

Organization

Öffentlicher GitHub-Footprint von Scrapy project

An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

Öffentliche Repositories

74.276

Sterne gesamt

808

Follower

Das Scrapy-Projekt ist eine Organisation auf GitHub, die ein Open-Source-Framework für das Scraping von Websites bereitstellt. Die öffentliche Präsenz umfasst eine Vielzahl von Repositories, die hauptsächlich in Python, HTML und C++ entwickelt wurden. Zu den bemerkenswerten Projekten gehören Scrapy, scrapyd und parsel, die von vielen Entwicklern genutzt werden.

Top-Sprachen

Python 22HTML 2C++ 2DIGITAL Command Language 1Shell 1

Öffentliche Repositories

scrapy

★62.224

Scrapy, a fast high-level web crawling & scraping framework for Python.

Python

Aktualisiert 13. Juni 2026

scrapyd

★3.094

A service daemon to run Scrapy spiders

Python

Aktualisiert 13. Juni 2026

scrapely

★1.888

A pure-python HTML screen-scraping library

HTML

Aktualisiert 9. Juni 2026

dirbot

★1.628

Scrapy project to scrape public web directories (educational) [DEPRECATED]

Python

Aktualisiert 12. Juni 2026

quotesbot

★1.357

This is a sample Scrapy project for educational purposes

Python

Aktualisiert 8. Juni 2026

parsel

★1.333

Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

Python

Aktualisiert 11. Juni 2026

scrapyd-client

★773

Command line client for Scrapyd server

Python

Aktualisiert 3. Juni 2026

w3lib

★419

Python library of web-related functions

Python

Aktualisiert 10. Juni 2026

cssselect

★309

CSS Selectors for Python

Python

Aktualisiert 1. Juni 2026

queuelib

★299

Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python

Python

Aktualisiert 1. Juni 2026

loginform

★279

Fill HTML login forms automatically

Python

Aktualisiert 29. März 2026

slybot

★224

Keine Beschreibung für dieses Repository vorhanden.

Unbekannte Sprache

Aktualisiert 12. Juni 2026

protego

★88

A pure-Python robots.txt parser with support for modern conventions.

DIGITAL Command Language

Aktualisiert 11. Juni 2026

itemadapter

★70

Common interface for data container classes

Python

Aktualisiert 1. Juni 2026

scrapy.org

★66

The scrapy.org website (old code)

HTML

Aktualisiert 3. Juni 2026

itemloaders

★49

Library to populate items using XPath and CSS with a convenient API

Python

Aktualisiert 2. Juni 2026

booksbot

★42

A crawler for http://books.toscrape.com

Python

Aktualisiert 8. Dez. 2025

scrapy-bench

★32

A CLI for benchmarking Scrapy.

Python

Aktualisiert 15. Sept. 2025

scrapy-lint

★22

A linter for Scrapy projects.

Python

Aktualisiert 15. Apr. 2026

scurl

★21

Performance-focused replacement for Python urllib

Python

Aktualisiert 26. Mai 2026

pypydispatcher

★16

A fork of http://pydispatcher.sourceforge.net/ with PyPy support

Python

Aktualisiert 12. Juni 2024

xtractmime

★13

https://mimesniff.spec.whatwg.org/ implementation for Python

Python

Aktualisiert 10. Juni 2026

base-chromium

★8

base component forked from Chromium source https://chromium.googlesource.com/chromium/src/base/

C++

Aktualisiert 10. März 2026

scrapy-itemloader

★7

[Archived] Library to populate Scrapy items using XPath and CSS with a convenient API

Python

Aktualisiert 10. März 2026

form2request

★5

Python library to build HTTP requests out of HTML forms

Python

Aktualisiert 12. Juni 2026

url-chromium

★4

url component from Chromium source code, forked from https://chromium.googlesource.com/chromium/src/url

C++

Aktualisiert 10. März 2026

gsoc2014-integration-tests

★3

GSoC2014 - Scrapy Integration tests project

Shell

Aktualisiert 6. Juli 2017

scrapy-bench-speedcenter

★2

Codespeed for scrapy-bench

Python

Aktualisiert 26. Mai 2026

sphinx-scrapy

★1

Sphinx extension for documentation in the Scrapy ecosystem

Python

Aktualisiert 11. Juni 2026

Häufige Fragen

Welche Programmiersprachen verwendet scrapy?

Die Repositories von scrapy sind hauptsächlich in Python, HTML, C++, DIGITAL Command Language und Shell geschrieben. Diese Sprachen unterstützen die Entwicklung von Tools zum Web-Scraping und zur Datenextraktion.

Was entwickelt scrapy auf GitHub?

Scrapy entwickelt eine Reihe von Tools und Bibliotheken, die sich auf Web-Crawling und Scraping konzentrieren. Zu den bekanntesten Projekten gehören Scrapy, scrapyd und scrapely, die eine breite Palette von Funktionen bieten.

Sind die Repositories von scrapy öffentlich?

Ja, die Repositories von scrapy sind öffentlich zugänglich. Jeder kann die Projekte einsehen, nutzen und zur Verbesserung beitragen, was die Zusammenarbeit und den Austausch in der Entwicklergemeinschaft fördert.

Ist diese Sichtbarkeit gewollt?

Überwache Scrapy project mit RepoGuard und werde benachrichtigt, sobald ein neues öffentliches Repository auftaucht.

Diesen Account überwachen