Crawler
A Web crawler, sometimes called a spider or spiderbot and often shortened to crawler, is an Internet bot that systematically browses the World Wide Web and that is typically operated by search engines for the purpose of Web indexing (web spidering).
Here are 258 public repositories matching this topic...
一些非常有趣的python爬虫例子,对新手比较友好,主要爬取淘宝、天猫、微信、微信读书、豆瓣、QQ等网站。(Some interesting examples of python crawlers that are friendly to beginners. )
-
Updated
Jun 28, 2025 - HTML
A Python module to scrape several search engines (like Google, Yandex, Bing, Duckduckgo, ...). Including asynchronous networking support.
-
Updated
Jul 3, 2021 - HTML
📰 Newspaper4k a fork of the beloved Newspaper3k. Extraction of articles, titles, and metadata from news websites.
-
Updated
Nov 26, 2025 - HTML
Golang短视频去水印:抖音,皮皮虾,火山,微视,最右,快手,全民小视频,皮皮搞笑,西瓜视频,虎牙,梨视频,acfun,好看视频...
-
Updated
Dec 2, 2025 - HTML
计算机专业系统性学习资料(python,c,c++,计算机组成,计算机网络,编译原理,电路,谷歌插件,爬虫)
-
Updated
Aug 27, 2023 - HTML
A utility package for automating lighthouse reporting
-
Updated
Nov 18, 2025 - HTML
Selenium automation test framework
-
Updated
Nov 25, 2021 - HTML
News extraction and scraping. Article Parsing
-
Updated
Mar 4, 2023 - HTML
A bot that automatically sends emails to new ads posted in any desired xe.gr search url.
-
Updated
Oct 24, 2024 - HTML
🧩 / 🕸 WebsiteCrawler - This plugin automatically crawls the main content of a specified URL webpage and uses it as context input.
-
Updated
Dec 15, 2023 - HTML
Oh no, stop this. You can see my local IP address 😲! Use `foundation` attribute against CRC32 lookup table to reveal local IP address of a Chrome/Chromium visitor.
-
Updated
Nov 9, 2022 - HTML
- Followers
- 534 followers
- Website
- github.com/topics/crawler
- Wikipedia
- Wikipedia