FastCountVectorizer is a faster alternative to scikit-learn's CountVectorizer.
pip install fastcountvectorizer
See full documentation.
Copyright (c) 2020 Santiago M. Mola
FastCountVectorizer is released under the MIT License.
The following files are included from or derived from third party projects:
fastcountvectorizer.pyis derived from scikit-learn'sscikit-learn/sklearn/feature_extraction/text.py, licensed under a 3-clause BSD license. The original list of authors and license text can be found in the file header._csr.his derived from scipy'sscipy/sparse/sparsetools/csr.h, licensed under a 3-clause BSD license. The original list of authors and license text can be found in the file header.fastcountvectorizer/thirdparty/tslincludes thetsl::sparse_mapproject, released under the MIT License.fastcountvectorizer/thirdpartyincludes thexxHashproject, released under a BSD-2 Clause license.