-
Notifications
You must be signed in to change notification settings - Fork 6
Description
Feature description
I’m working on batch construction of Cat-VRS CategoricalVariants (potentially at scale for a curated variant set), and I’m still unsure about best practices around building Constraints, especially Defining Location Constraint.
Question / Pain Point
When I decide to use a Defining Location Constraint, I need to construct the Location object(s) precisely (e.g., reference sequence identifiers, coordinate system, normalization behavior, transcript vs genomic context, etc.). In practice this is error-prone for bulk generation.
Is there an existing tool or recommended workflow to automatically generate the Defining Location Constraint content, given higher-level inputs like:
a gene symbol (e.g., PTEN / EGFR),
a transcript accession (optional),
a region description (e.g., “exon 19”, “kinase domain”, “coding region”, “promoter”, “whole gene”, etc.),
or a HGVS-like description / coordinates?
If not currently available:
Feature Request
Would the project consider adding (or recommending) a helper utility / API in the Python reference implementation to generate a valid Defining Location Constraint automatically, for example:
Input: gene symbol + transcript ID + region spec (exon/domain/interval)
Output: a normalized VRS Location (SequenceLocation / ChromosomeLocation) suitable for Defining Location Constraint.
Or Input: HGVS / VCF-like coordinates + assembly
Output: normalized Location object + any required identifiers.