Introduction

Spider-Py is the fastest web crawler and indexer, written in Rust and ported to Python.

  • Concurrent
  • Streaming
  • Decentralization
  • Headless Chrome Rendering
  • HTTP Proxies
  • Cron Jobs
  • Subscriptions
  • Blacklisting and Budgeting Depth
  • Written in Rust for speed, safety, and simplicity

Spider powers some big tools and, with the correct setup, brings crawling downtime close to zero. View the spider project to learn more.

Test URL: https://espn.com

libraries                  pages     speed
spider-rs(python): crawl   150,387   186s
scrapy(python): crawl      49,598    1h

The benches above were run on a Mac M1. On Linux ARM machines, spider performs about 2-10x faster.
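
The two benchmark rows report different page counts over different durations, so it can help to put them on a common scale. The sketch below converts each row to pages per second. It assumes the "1h" entry means exactly 3600 seconds; the names BENCHES and pages_per_second are illustrative and not part of any library.

```python
# Figures copied from the benchmark table above.
# Assumption: "1h" is taken as exactly 3600 seconds.
BENCHES = {
    "spider-rs(python)": (150_387, 186),   # (pages, seconds)
    "scrapy(python)": (49_598, 3600),
}


def pages_per_second(pages: int, seconds: float) -> float:
    """Average crawl throughput over the whole run."""
    return pages / seconds


rates = {name: pages_per_second(p, s) for name, (p, s) in BENCHES.items()}
speedup = rates["spider-rs(python)"] / rates["scrapy(python)"]

for name, rate in rates.items():
    print(f"{name}: {rate:.1f} pages/s")
print(f"throughput ratio: {speedup:.1f}x")
```

This is only an average throughput comparison. The two runs did not fetch the same number of pages, so the ratio should be read as a rough indication rather than a like-for-like measurement.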