Enhance extensibility with learning from first LLM Judge category #650

Merged
haoranpb merged 3 commits into main from feature/extensibility-from-nl2al
May 26, 2026

@haoranpb
Collaborator

Changes based on findings from enabling our first LLM judge category:

  • Allow skipping the leaderboard for categories that don't need it
  • Introduce a judge-based eval category class
  • Enhance the display for different categories
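
The three changes above could be sketched roughly as follows. This is a minimal illustration only, not the code merged in this PR: `EvalCategory`, `JudgeBasedCategory`, `include_in_leaderboard`, `judge_model`, `display_label`, and `leaderboard_categories` are all hypothetical names, and the actual types in `src/bcbench/types.py` may look quite different.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EvalCategory:
    """Hypothetical base category (assumed shape, not the real bcbench type)."""

    name: str
    # Assumed opt-out flag: categories that don't need a leaderboard set this to False.
    include_in_leaderboard: bool = True

    def display_label(self) -> str:
        # Default display: just the category name.
        return self.name


@dataclass(frozen=True)
class JudgeBasedCategory(EvalCategory):
    """Hypothetical category whose results are scored by an LLM judge."""

    # Placeholder value; the real judge configuration is not shown in this PR page.
    judge_model: str = "judge-model-placeholder"

    def display_label(self) -> str:
        # Category-specific display, marking judge-scored results.
        return f"{self.name} (LLM judge)"


def leaderboard_categories(categories: list[EvalCategory]) -> list[EvalCategory]:
    """Return only the categories that opted in to the leaderboard."""
    return [c for c in categories if c.include_in_leaderboard]
```

With this shape, a category opts out through a flag instead of a special case in the leaderboard code. For example, given `[EvalCategory("bug-fix"), JudgeBasedCategory("nl2al", include_in_leaderboard=False)]` (both category names are illustrative), `leaderboard_categories` would keep only the first entry.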

Comment thread src/bcbench/types.py Dismissed
Comment thread src/bcbench/types.py Dismissed
@haoranpb haoranpb enabled auto-merge (squash) May 26, 2026 08:15
@haoranpb haoranpb merged commit 83c2eee into main May 26, 2026
13 checks passed
@haoranpb haoranpb deleted the feature/extensibility-from-nl2al branch May 26, 2026 08:39
2 participants