Skip to content
View jsvine's full-sized avatar

Organizations

@BuzzFeedNews

Block or report jsvine

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A blazingly fast PDF table extraction library with python API powered by Rust

Rust 5 Updated Mar 20, 2026

Faster Whisper transcription with CTranslate2

Python 21,619 1,758 Updated Nov 19, 2025

censusdis is a Python package for discovering, loading and analyzing, U.S. Census demographic, economic, and geographic data and metadata. It is designed to be intuitive and Pythonic, giving users …

Python 129 21 Updated Mar 13, 2026

Download GitHub repositories

Python 12 Updated May 10, 2025

Scrapes an ESRI MapServer REST endpoint to spit out more generally-usable geodata.

Python 360 71 Updated Jul 11, 2024

Parallel and LAzY Analyzer for PDFs 🏖️

Jupyter Notebook 40 3 Updated Mar 9, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 72,718 10,004 Updated Mar 19, 2026

A vector search SQLite extension that runs anywhere!

C 7,239 294 Updated Mar 21, 2026

jq, but with many interoperable configuration format transcodings and interactive querying.

Go 718 8 Updated Mar 19, 2026

an editor for spoken-word audio with automatic transcription

TypeScript 1,826 54 Updated Jan 6, 2026

Code and analysis for CBS News Reports documentary on sheriff misconduct.

Jupyter Notebook 4 Updated Jul 17, 2025

sqlite3 in ur indexeddb (hopefully a better backend soon)

JavaScript 4,332 106 Updated Aug 6, 2023

conversion of documents to styled HTML

HTML 5 2 Updated Mar 24, 2024

CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and …

Python 8,560 225 Updated Mar 16, 2026
Jupyter Notebook 8,833 632 Updated Oct 25, 2025

docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Python 5,970 631 Updated Mar 15, 2026

A SQLite extension for efficient vector search, based on Faiss!

C++ 1,984 74 Updated May 5, 2024
Jupyter Notebook 339 31 Updated Jan 3, 2024

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

Python 3,512 225 Updated Jul 3, 2024

Quickly and accurately render even the largest data.

Python 3,523 375 Updated Mar 20, 2026

Access large language models from the command-line

Python 11,386 773 Updated Mar 17, 2026

The New York Review of Computation

105 Updated May 26, 2023

Array-Inspired Pipeline Language

Python 120 7 Updated Nov 6, 2023

This repository will house any one-off side projects I take on on the behalf of others or small scripts I write to accomplish x or y task.

Jupyter Notebook 6 Updated Oct 10, 2025

Tax filing web application

TypeScript 1,638 134 Updated Mar 21, 2026

A CLI Swiss Army Knife for ChatGPT

Python 2,604 91 Updated Feb 2, 2026

Live display of current GitHub action runs

Python 415 15 Updated Jan 18, 2026

System font stack CSS organized by typeface classification for every modern operating system

HTML 3,437 51 Updated Mar 10, 2026

Fraud detection related data and scripts to share with partners.

Jupyter Notebook 27 7 Updated Mar 5, 2023

A minimal webapp for converting Google Docs to Markdown

JavaScript 247 51 Updated Mar 20, 2026
Next