
GitHub mirror ingestion should avoid full Linear issue scans #26

Description

@miyaontherelay

Problem

Factory GitHub issue ingestion currently performs a full /linear/issues mirror-candidate scan during startup/backfill. In the live workspace, the scan took roughly 107 seconds (elapsedMs 106660 in the logs below) before ingestion could finish processing 29 GitHub issues from relayfile-adapters, and the full factory canary run took about 4 minutes.

Observed logs:

[factory] Linear mirror candidate loading started
[factory] relayfile listTree completed ... "prefix":"/linear/issues" ... "count":819
[factory] Linear mirror candidate loading completed {"elapsedMs":106660,"scanned":187,"candidates":186}

The scan makes the command look stuck even when it is progressing, and it will get worse as /linear/issues grows.

Expected

GitHub mirror dedupe should avoid reading broad Linear issue sets on every run. Possible directions:

  • first check deterministic factory-create-github-<hash>.json draft paths
  • maintain/cache a source-provider index for GitHub mirrors
  • use a narrower /linear/issues query/alias if relayfile exposes one
  • fall back to a full scan only when both the deterministic path check and the cache miss
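
A minimal Python sketch of how these directions could combine into a tiered lookup. Everything here is an assumption for illustration, not the factory's actual code: `draft_path`, `MirrorLookup`, the `path_exists` and `full_scan` callbacks, and the choice to hash the GitHub issue URL with SHA-256 are all hypothetical. The issue only names the `factory-create-github-<hash>.json` pattern and does not say what is hashed or where the drafts live.

```python
import hashlib
from typing import Callable, Dict, Optional


def draft_path(github_issue_url: str) -> str:
    """Build a deterministic draft file name for a GitHub issue.

    Assumption: the hash input and algorithm are hypothetical; the issue
    only specifies the factory-create-github-<hash>.json naming pattern.
    """
    digest = hashlib.sha256(github_issue_url.encode("utf-8")).hexdigest()[:16]
    return f"factory-create-github-{digest}.json"


class MirrorLookup:
    """Resolve an existing mirror for a GitHub issue, cheapest check first."""

    def __init__(
        self,
        path_exists: Callable[[str], bool],
        full_scan: Callable[[], Dict[str, str]],
    ) -> None:
        # path_exists: hypothetical single-path existence check.
        # full_scan: hypothetical broad scan returning {github_url: linear_path}.
        self.path_exists = path_exists
        self.full_scan = full_scan
        self.index: Dict[str, str] = {}  # cached source-provider index
        self.full_scans = 0              # how many broad scans this run needed
        self._scanned = False

    def find_mirror(self, github_issue_url: str) -> Optional[str]:
        # 1. Deterministic draft path: a single existence check.
        path = draft_path(github_issue_url)
        if self.path_exists(path):
            return path
        # 2. Cached index of GitHub mirrors.
        if github_issue_url in self.index:
            return self.index[github_issue_url]
        # 3. Fall back to the full scan, at most once per run, and keep
        #    the result so later misses do not trigger another scan.
        if not self._scanned:
            self.index.update(self.full_scan())
            self._scanned = True
            self.full_scans += 1
        return self.index.get(github_issue_url)
```

With this shape, a run in which every GitHub issue already has a deterministic draft never touches the broad /linear/issues listing, and a run with misses pays for the scan once rather than per issue. Persisting `index` between runs would be a further step that this sketch leaves out.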

Related


Labels

enhancement (New feature or request)
