Skip to content

tooling to auto-generate Defining Location Constraint (bulk Cat-VRS construction) #211

@Hui-zju

Description

@Hui-zju

Feature description

I’m working on batch construction of Cat-VRS CategoricalVariants (potentially at scale for a curated variant set), and I’m still unsure about best practices around building Constraints, especially Defining Location Constraint.

Question / Pain Point

When I decide to use a Defining Location Constraint, I need to construct the Location object(s) precisely (e.g., reference sequence identifiers, coordinate system, normalization behavior, transcript vs genomic context, etc.). In practice this is error-prone for bulk generation.

Is there an existing tool or recommended workflow to automatically generate the Defining Location Constraint content, given higher-level inputs like:

a gene symbol (e.g., PTEN / EGFR),

a transcript accession (optional),

a region description (e.g., “exon 19”, “kinase domain”, “coding region”, “promoter”, “whole gene”, etc.),

or a HGVS-like description / coordinates?

If not currently available:

Feature Request

Would the project consider adding (or recommending) a helper utility / API in the Python reference implementation to generate a valid Defining Location Constraint automatically, for example:

Input: gene symbol + transcript ID + region spec (exon/domain/interval)
Output: a normalized VRS Location (SequenceLocation / ChromosomeLocation) suitable for Defining Location Constraint.

Or Input: HGVS / VCF-like coordinates + assembly
Output: normalized Location object + any required identifiers.

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions