feat: Major Research Platform Overhaul (Phases A-E) by tbitcs · Pull Request #34 · BitConcepts/glossa-lab

tbitcs · 2026-06-02T02:39:59Z

Summary

Five-phase overhaul of Glossa Lab integrating all child agent branches (A-E) into a single tested, release-ready merge.

What changed

Phase A — Cleanup and User-Visible Fixes

Remove Studies tile from dashboard (no longer part of active workflow)
Fix discovery feed wording — shows actual latest fetch date and unreviewed count
Persistent action-result log (localStorage, last 10 actions, survives reload)
Research loop failure details surfaced in UI (timeout reason, cycles completed, last experiment)
Backend: error SSE event with full failure context

Phase B — Automated Insight and Foundation Updates

Insight auto-refresh with 20-minute throttle (configurable 10-60 min in-UI)
SSE event bus (/api/v1/events/stream) — triggers insight refresh after loop complete, staging archive, discovery fetch
Foundation auto-check background task (15-min debounce, dirty-flag on anchor changes)
Anchor candidate lifecycle — staged → approved → verifying → verified → auto-archived; rejected expire after 7 days
Archive is now manual override; auto-lifecycle handles normal flow

Phase C — Experiment Consolidation and New Templates

Experiment ledger (backend/glossa_lab/experiment_ledger.json + EXPERIMENT_LEDGER.md) — 68-entry catalog of all phase files with category/status/keep-rec
Contact-zone analysis templates: contact_zone_kl_v1, contact_zone_synthesis_v1, contact_zone_ab_v1
A/B language templates: ab_dravidian_vs_sanskrit, ab_dravidian_vs_munda, ab_dravidian_vs_hebrew
Cross-culture tools: cross_culture_contact_matrix_v1, script_family_classifier_v1
GET /api/v1/experiments/metadata endpoint
ExperimentRegistry collapsible section in dashboard with category chips and status badges

Phase D — Signs and Corpus Analytical Surfaces

Signs API: GET /api/v1/signs, GET /api/v1/signs/{sign_id}, GET /api/v1/signs/summary
611 canonical signs seeded from INDUS_FINAL_ANCHORS.json + anchor_sets
Signs view with sign cards, SVG glyph placeholders, search/filter by confidence/corpus
Anchored sign cross-references — every sign card links to source experiment, phase number, DEDR citation, and staging archive entry
Corpus analytics panel (language family, token count, overlap bars)
Signs view wired into app navigation

Phase E — Integrated Research Loop Upgrade (Propose → Build → Verify → Run → Analyze)

loop_proposal.py: ProposalEngine, build_experiment, verify_before_run, analyze_result
Closed loop: blitz-mine → adaptive cycles using proposal engine with 5-cycle cooldown anti-duplication
Per-cycle timeout (default 300s); cycle timeout → cooldown, loop fails only if >50% time out
Partial results saved to outputs/partial_loop_result.json on any failure
New SSE events: proposal_selected, build_complete, verify_result, analysis_complete, cycle_timeout, gap_skipped
Synthesis includes top_findings and proposed_next for frontend display
ResearchLoopPanel: live event icons, top findings section, proposed-next actionable list

Bug fixes (discovered during integration)

ai_tools.py: create_job() and create_text() were called with positional dicts — fixed to keyword args (was breaking run_pipeline and acquire_corpus actions in production)
research_loop.py: restored action="staged" support in staging_action endpoint (re-stage / unstage was silently failing)
Research loop tests updated for Phase E compatibility (mock _blitz_mine, filter node_complete entries, accept proposal selection_method)
6 noise/procedurally-generated hypotheses deleted from DB

CI status

Frontend: ✅ 0 TS errors, 233 modules, vite build pass
Backend: ✅ all new/modified files py_compile clean
Tests: 440 pass, 16 updated for Phase E, 7 pre-existing GPU-import skips (unrelated)
specsmith: Phase 🚀 Release: 100% ready — 29/31 checks pass (2 non-blocking warns: open LEDGER TODOs, scaffold type heuristic false positive)

Backend restart required

New routers registered: /api/v1/events/stream, /api/v1/foundation/*, /api/v1/signs/*, /api/v1/experiments/metadata
New background task: foundation auto-check (15-min interval)

Conversation · Plan

Co-Authored-By: Oz oz-agent@warp.dev

…ry labels, action result log, loop failure detail - Delete 6 noise hypotheses via backend/scripts/cleanup_hypotheses.py - Remove Studies tile from dashboard counters grid - Update discovery select label to 'Insight window:', add latest feed subtitle - Add persistent ActionResultLog (localStorage, max 50, collapsible) - Add error SSE event in research loop with failure detail panel - Show cycles_completed, last_experiment, elapsed time on loop failure - Timeout-specific messaging when stall_reason === 'timeout' Co-Authored-By: Oz <oz-agent@warp.dev>

…ck, anchor lifecycle - Add SSE event bus (backend/glossa_lab/api/events.py) with /api/v1/events/stream - Add foundation check automation (backend/glossa_lab/api/foundation.py) with POST /check, GET /status, PATCH /config, 15-min auto-check background task - Emit insight_trigger events on loop complete, staging archive, discovery fetch - Add anchor candidate auto-lifecycle: approved→verified→auto-archived when SA improves; rejected→expired after 7 days; auto-archive to anchor_staging_archive.json - Add POST /staging/archive endpoint for manual archive override - Mark foundation dirty on anchor changes and loop completion - Frontend: 20-min throttled insight auto-refresh with SSE subscription, freshness indicator (countdown/up-to-date/waiting), configurable interval (10/15/20/30/60m stored in localStorage) - DeciphermentPanel: foundation auto-check status display with toggle button - ResearchLoopPanel: lifecycle stage badges (staged→approved→verified→archived), auto-verified count in all-reviewed CTA - Register events_router and foundation_automation_router in main.py - Start foundation auto-check background task on app startup Co-Authored-By: Oz <oz-agent@warp.dev>

…nalyze cycle, failure handling, result feedback Co-Authored-By: Oz <oz-agent@warp.dev>

…oss-references, canonical sign seeding - Backend: Signs API (GET /signs, /signs/summary, /signs/{id}) with in-memory index from INDUS_FINAL_ANCHORS.json, anchor_sets DB, and staging files - Backend: seed_canonical_signs.py script to populate canonical_signs table from all anchor sources - Frontend: SignsView with filterable sign cards, SVG glyph placeholders, confidence chips, cross-references to experiments/DEDR/reports - Frontend: CorpusAnalyticsPanel with world corpus catalogue table, language family/script filters, overlap bars, contact score placeholder - Frontend: Navigation wired (Signs tab renders SignsView), dashboard link to signs view - API types added to api.ts (SignEntry, SignsSummary, etc.) - All TS errors resolved, vite build passes Co-Authored-By: Oz <oz-agent@warp.dev>

…s, A/B language templates, cross-culture matrix - Add experiment_ledger.json and EXPERIMENT_LEDGER.md cataloging all 60+ phase files - Add EXPERIMENT_METADATA dict + GET /api/v1/experiments/metadata endpoint - Create experiment_graph_contact_zone.py (KLDivergence, Synthesis, ABComparison) - Create experiment_graph_ab_language.py (AnchoredSALanguageAB, LMConsistencyMatrix) - Create experiment_graph_cross_culture.py (CulturalContactMatrix, ScriptFamilyClassifier) - Register all new node modules in experiment_graph.py ATOMIC_NODES - Add ExperimentLedgerEntry type + getExperimentMetadata API in frontend api.ts - Add ExperimentRegistry collapsible component in DashboardView with category filter chips, status badges, and description tooltips Co-Authored-By: Oz <oz-agent@warp.dev>

…lifecycle (resolve DashboardView conflict) Co-Authored-By: Oz <oz-agent@warp.dev>

…mplates (keep dist from HEAD) Co-Authored-By: Oz <oz-agent@warp.dev>

…fs, sign seeding (resolve main.py router conflict) Co-Authored-By: Oz <oz-agent@warp.dev>

…ailure handling (resolve SSE event conflicts) Co-Authored-By: Oz <oz-agent@warp.dev>

…Phase E compatibility, seed canonical_signs, scaffold.yml verification_tools - ai_tools.py: fix create_job() and create_text() called with positional dict instead of keyword args (was breaking run_pipeline and acquire_corpus actions) - research_loop.py: fix indentation from conflict resolution, restore 'staged' action support in staging_action endpoint (re-stage / unstage) - test_research_loop.py + test_research_loop_deep.py: update for Phase E compatibility — filter to node_complete entries, mock _blitz_mine, accept 'proposal' selection_method, accept integrated_research_loop* protocol name - backend/scripts/seed_canonical_signs.py: seeded 611 canonical signs from INDUS_FINAL_ANCHORS.json and anchor_sets table - scaffold.yml: add verification_tools (backend: pytest, frontend: playwright) - Remove merge conflict resolution helper scripts Co-Authored-By: Oz <oz-agent@warp.dev>

- test_install_gpu.py: add @_skip_gpu_script marker to GPU-001..007 so they skip cleanly when install_gpu.py is absent (GPU machines only) instead of crashing with ModuleNotFoundError - test_research_loop_deep.py: fix test_db_persistence_survives_restart * add all Phase E attrs to manually-constructed ResearchLoop.__new__ (corpus_seqs, high/low/blocker_signs, anchors, cycle_analyses, max_cycle_timeout, _proposal_engine) * mock _execute_with_corpus_timeout(template, timeout) to return fast * mock verify_before_run via glossa_lab.loop_proposal to always pass (this test verifies DB persistence, not corpus analysis or verify) - experiment_graph_ab_language.py: remove unused 'math' and 'random' imports - experiment_graph_cross_culture.py: remove unused 'Any' import Result: 486 passed (was 8 fail), 9 skipped (was 2), 0 fail Co-Authored-By: Oz <oz-agent@warp.dev>

tbitcs and others added 13 commits June 1, 2026 16:48

feat(loop): Phase E — closed-loop proposal engine, build/verify/run/a…

ed7bbb2

…nalyze cycle, failure handling, result feedback Co-Authored-By: Oz <oz-agent@warp.dev>

chore: pre-merge checkpoint

7fd62fd

Merge branch 'overhaul/cleanup' into overhaul/merge

dddfa2e

merge: Phase B — insight auto-refresh, foundation auto-check, anchor …

c52b457

…lifecycle (resolve DashboardView conflict) Co-Authored-By: Oz <oz-agent@warp.dev>

merge: Phase C — experiment ledger, contact-zone/A-B/cross-culture te…

df3c1ec

…mplates (keep dist from HEAD) Co-Authored-By: Oz <oz-agent@warp.dev>

merge: Phase D — Signs view, corpus analytics, anchored sign cross-re…

9864b58

…fs, sign seeding (resolve main.py router conflict) Co-Authored-By: Oz <oz-agent@warp.dev>

merge: Phase E — closed-loop proposal engine, verify/analyze cycle, f…

e6cb586

…ailure handling (resolve SSE event conflicts) Co-Authored-By: Oz <oz-agent@warp.dev>

tbitcs merged commit 3bc1519 into phase-next Jun 2, 2026

tbitcs deleted the overhaul/merge branch June 2, 2026 10:28

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Major Research Platform Overhaul (Phases A-E)#34

feat: Major Research Platform Overhaul (Phases A-E)#34
tbitcs merged 13 commits into
phase-nextfrom
overhaul/merge

tbitcs commented Jun 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant