- Bangalore
- https://techytushar.github.io
Stars
An extremely fast Python package and project manager, written in Rust.
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
Hunt down social media accounts by username across social networks
Anthropic's educational courses
Build local voice agents with open-source models
An implementation of full named-entity evaluation metrics based on SemEval'13 Task 9, computed not at the tag/token level but over all the tokens that are part of the named entity
MARS5 speech model (TTS) from CAMB.AI
adefossez / demucs
Forked from facebookresearch/demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
End-to-end platform for building voice first multimodal agents
kaldi-asr/kaldi is the official location of the Kaldi project.
Instant voice cloning by MIT and MyShell. Audio foundation model.
Controllable and fast Text-to-Speech for over 7000 languages!
A state-of-the-art-level open visual language model | Multimodal pre-trained model
A library to train, evaluate, interpret, and productionize decision forest models such as Random Forest and Gradient Boosted Decision Trees.
🔊 Text-Prompted Generative Audio Model
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Qualcomm® AI Hub Models is our collection of state-of-the-art machine learning models optimized for performance (latency, memory etc.) and ready to deploy on Qualcomm® devices.
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
Apache Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows that encode lineage/tracing and metadata. Runs and scales everywhere Python does.
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
An open-source alternative to Ngrok, designed to serve production traffic and be simple to host (particularly on Kubernetes)
A multi-voice TTS system trained with an emphasis on quality
Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.





