WWW
Code repo for "WebArena: A Realistic Web Environment for Building Autonomous Agents"
MiniWoB++: a web interaction benchmark for reinforcement learning
newspaper3k is a news, full-text, and article metadata extraction library for Python 3.
Best and simplest tool for website change detection, web page monitoring, and website change alerts. Perfect for tracking content changes, price drops, restock alerts, and website defacement monito…
Scrapy, a fast high-level web crawling & scraping framework for Python.
100+ open-source clones of popular sites like Airbnb, Amazon, Instagram, Netflix, TikTok, Spotify, WhatsApp, YouTube, etc. See source code, demo links, tech stack, GitHub stars.
Fully functional Twitter clone built with the Flutter framework, using Firebase Realtime Database and Storage
Duck-themed multi-user virtual spaces in WebVR. Built with A-Frame.
🔥 The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data
Instructions and code for using the Common Crawl Web Graph in Neo4j format

