Feature Request: Gene Clustering for Non-Redundant Gene Catalog

Dear SqueezeMeta team,

I have been using SqueezeMeta for a while and it has been very helpful, thanks! 
I was wondering if you have any plans to implement gene clustering (_e.g._, CD-HIT, MMseqs2) for creating a non-redundant gene catalog in the pipeline. This approach has been widely used (_e.g._, https://metagenome-atlas.readthedocs.io/en/latest/usage/output.html#gene-catalog, https://methods-in-microbiomics.readthedocs.io/en/latest/assembly/metagenomic_workflows.html#gene-catalogs) in the field to remove redundancy / aggregate information and fasten downstream analyses (i.e., annotation and data analyses) 

Curious to hear your thoughts.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature Request: Gene Clustering for Non-Redundant Gene Catalog #919

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Feature Request: Gene Clustering for Non-Redundant Gene Catalog #919

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions