Skip to content

docs: Jina v5 paper analysis + vision sensor plan#122

Merged
AdaWorldAPI merged 1 commit into
mainfrom
claude/risc-thought-engine-TCZw7
Apr 6, 2026
Merged

docs: Jina v5 paper analysis + vision sensor plan#122
AdaWorldAPI merged 1 commit into
mainfrom
claude/risc-thought-engine-TCZw7

Conversation

@AdaWorldAPI
Copy link
Copy Markdown
Owner

Summary

  • Jina v5 paper analysis (arXiv:2602.15547) — GOR, CoSENT, LoRA adapters, Matryoshka
  • Vision sensor plan — ViT-Huge-14 for medical imaging + CLIP cross-modal

Jina v5 Paper Findings

  • GOR regularizer: makes embeddings robust to binary quantization → our i8 tables lose LESS info on v5 than v3
  • CoSENT loss: directly optimizes Spearman ρ (= our calibration metric). 5 lines in candle.
  • 4 LoRA adapters (retrieval/STS/cluster/classify) = our ThinkingPresets (Analytical/Balanced/Creative/Focused)
  • Matryoshka: 256D minimum for CLAM codebook building (below = quality drops fast)
  • Listwise Reranker v3: reads ALL candidates together, not pairwise — our reranker_relevance() needs rework
  • Cronbach α as meta-debugger: where Reranker and Engine disagree on ranking = where our encoding is wrong

Vision Sensor Plan

  • FP32 ground truth: Kijai/WanVideo_comfy ViT-Huge-14 (2.53 GB safetensors)
  • BF16 production: DeepBeepMeep/Wan2.1 combined CLIP (2.39 GB)
  • Medical pipeline: DICOM → ViT patches → codebook → SPO → NARS
  • Cross-modal: text ↔ image in same CLIP embedding space
  • Three tools: candle (text+vision), ort (reranker), rten (medical ViT)

Calibration Predictions

Jina v5 (GOR-trained): i8 Spearman ρ > 0.95 (quantization-robust)
Jina v3 (no GOR):      i8 Spearman ρ < 0.90 (not trained for quantization)
CoSENT fine-tune in candle if ρ < 0.998 (directly optimizes rank order)

https://claude.ai/code/session_019RzHP8tpJu55ESTxhfUy1A

Key findings from arXiv:2602.15547:
  Architecture: Qwen3-0.6B + 4 LoRA adapters (retrieval/STS/cluster/classify)
  GOR regularizer: makes embeddings robust to binary quantization
  CoSENT loss: directly optimizes Spearman ρ (= our calibration metric)
  Matryoshka: 256D slice → 4× faster CLAM, ~95% accuracy
  LoRA per task = our ThinkingPresets per thinking style

Predictions:
  Jina v5 (GOR-trained) → i8 tables lose LESS info than Jina v3
  CoSENT loss in candle = 5 lines, directly fixes rank order
  Matryoshka 256D for CLAM → verify ρ(256D, 1024D) > 0.99

https://claude.ai/code/session_019RzHP8tpJu55ESTxhfUy1A
@AdaWorldAPI AdaWorldAPI merged commit 6558bca into main Apr 6, 2026
AdaWorldAPI pushed a commit that referenced this pull request May 13, 2026
…** permissions

PR_ARC PREPEND for #366 (sprint-7 7-worker implementation wave +
AuditSink trait unification). LATEST_STATE header updated +
prepended #366 row. ISSUES.md new entry for the ndarray:master
hpc-extras gap surfaced by MedCare-rs#118 (P2, upstream-blocked).

Adjacent landings recorded inline: MedCare-rs sprint-1 10-PR sweep
(#113-#122) including E1-1 OQ-3 direct migration consuming our
0d725d4 decision; MedCare-rs sprint-2 5 PRs queued (item 5
consumes this PR's new UnifiedBridge::with_jsonl_audit constructor).

settings.json: consolidated per-sprint-log-N entries into single
.claude/board/** glob for Write/Edit/tee. Drops 18 specific entries
in favor of 3 globs. Future sprint-log-N dirs won't need a
permissions patch before spawning workers.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants