Skip to content

Replace tasks with 11,429 validated tasks from 10k batch#1

Open
self-supervisor wants to merge 1 commit intomainfrom
10k
Open

Replace tasks with 11,429 validated tasks from 10k batch#1
self-supervisor wants to merge 1 commit intomainfrom
10k

Conversation

@self-supervisor
Copy link
Copy Markdown
Contributor

@self-supervisor self-supervisor commented Feb 11, 2026

Oracle-pass + nop-fail filtered dataset across 4 repos: facebook/docusaurus, Kong/insomnia, mantinedev/mantine, rollup/rollup


Now I have a clear understanding of the changes. Let me write the high-level PR description:

What changed?

Added 11,429 validated code repair tasks across 4 open-source repositories:

  • facebook/docusaurus (5,105 tasks) - Documentation framework bug fixes covering JSX handling, markdown processing, configuration validation, and plugin system issues
  • Kong/insomnia (2,791 tasks) - REST client bug fixes including API spec parsing, authentication, workspace management, and UI component issues
  • mantinedev/mantine (1,497 tasks) - React component library bug fixes covering date pickers, form controls, styling utilities, and keyboard event handlers
  • rollup/rollup (2,036 tasks) - Module bundler bug fixes including tree-shaking logic, AST node handling, and build configuration issues

Each task includes:

  • instruction.md - Bug report with reproduction steps and expected behavior
  • solution/solve.sh - Validated fix implementation
  • tests/ - Test configuration and validation scripts

Validation

Dataset filtered using oracle-pass + nop-fail criteria to ensure:

  • Solutions pass all specified tests
  • No-operation changes fail tests (confirms test validity)
  • All test scripts configured for automated execution via Git LFS

Description generated by Mesa. Update settings

Oracle-pass + nop-fail filtered dataset across 4 repos:
facebook/docusaurus, Kong/insomnia, mantinedev/mantine, rollup/rollup

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Copy link
Copy Markdown

@mesa-dot-dev mesa-dot-dev bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Performed full review of 3542d8d...ddc25cc

Analysis

  1. Repository Structure Transformation: The PR converts the repository into a large data store for 11k+ reproduction tasks using a rigid directory structure, shifting it from source code to a binary dataset without documentation on how to navigate this structure.

  2. Git Scalability Concerns: Storing thousands of tasks directly in Git will lead to repository bloat, slow clone times, and painful history rewrites, with no strategy for sharding or external storage.

  3. Inadequate Git LFS Implementation: Current Git LFS configuration only targets test.sh files (which are small) while leaving potentially larger Dockerfiles and artifacts in regular Git, delivering minimal benefits.

  4. Data Quality Issues: At least one "solution" patch contains the exact bug it claims to fix, raising serious concerns about validation quality across all 11,429 tasks.

  5. Missing Critical Infrastructure: No manifest file, metadata registry, documentation, CI/CD pipeline, programmatic interface, or licensing information to support discoverability and usage of the dataset.

Tip

Help

Slash Commands:

  • /review - Request a full code review
  • /review latest - Review only changes since the last review
  • /describe - Generate PR description. This will update the PR body or issue comment depending on your configuration
  • /help - Get help with Mesa commands and configuration options

76 files reviewed | 1 comments | Edit Agent SettingsRead Docs

+ const model = mustGetModel(docToCopy.type);
+ const overrides = {
+ _id: generateId(model.prefix),
+ modified: docToCopy.modified,
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Medium

The fix changes modified: Date.now() to modified: docToCopy.modified, which preserves the original document's timestamp. While this might seem reasonable for maintaining metadata, combined with the bug at line 63 (where children incorrectly reference the original parent), this creates a compound issue where duplicated documents not only have broken relationships but also retain old timestamps. This makes debugging harder since the modified timestamp won't reflect when the duplicate was created.

Fix in Cursor • Fix in Claude

Prompt for Agent
Task: Address review feedback left on GitHub.
Repository: VmaxAI/tasks#1
File: data/Kong__insomnia.44642a49.BaseModel.784f0635/solution/fix.patch#L45
Action: Open this file location in your editor, inspect the highlighted code, and resolve the issue described below.

Feedback:
The fix changes `modified: Date.now()` to `modified: docToCopy.modified`, which preserves the original document's timestamp. While this might seem reasonable for maintaining metadata, combined with the bug at line 63 (where children incorrectly reference the original parent), this creates a compound issue where duplicated documents not only have broken relationships but also retain old timestamps. This makes debugging harder since the `modified` timestamp won't reflect when the duplicate was created.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant