You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/evaluation.md
+64-13Lines changed: 64 additions & 13 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,6 +6,11 @@ sd_hide_title: true
6
6
7
7
## Evaluating docking poses across a stratified test set
8
8
9
+
The `plinder.eval` subpackage allows (1) assessing protein-ligand complex predictions against reference `plinder` systems, and
10
+
(2) correlating the performance of these predictions against the level of similarity of each test system to the corresponding training set.
11
+
12
+
The output file from running the scripts `src/plinder/eval/docking/write_scores.py` and `src/plinder/eval/docking/stratify_test_set.py` generates the same evaluation metrics as the ones we have on the public leaderboard.
13
+
9
14
The `plinder-eval` package allows
10
15
11
16
1. assessing protein-ligand complex predictions against reference `plinder` systems, and
@@ -26,17 +31,17 @@ leaderboard (coming soon).
26
31
-`confidence`: Optional score associated with the pose
27
32
-`ligand_file`: Path to the SDF file of the pose
28
33
29
-
`split.csv` with `system_id` and `split` columns mapping PLINDER systems to `train`, or `test`.
34
+
`split.parquet` with, at a minimum,`system_id` and `split` columns mapping PLINDER systems to `train`, or `test`.
This calculates accuracy metrics for all predicted poses compared to the reference. JSON files of each pose are stored in `scores/scores` and the summary file across all poses is stored in `scores.parquet`.
44
+
This calculates accuracy metrics for all predicted poses compared to the reference. JSON files of each pose are stored in `test_eval/scores` and the summary file across all poses is stored in `test_eval/scores.parquet`.
40
45
41
46
The predicted pose is compared to the reference system and the following ligand scores are calculated:
42
47
@@ -69,24 +74,70 @@ For oligomeric complexes:
69
74
70
75
If `score_posebusters` is True, all posebusters checks are saved.
71
76
77
+
You can inspect the results at `test_eval/scores.parquet`
- Labels the maximum similarity of each test system to the training set across all the similarity metrics
83
-
- Stratifies the test set based on training set similarity into `novel_pocket_pli`, `novel_pocket_ligand`, `novel_protein`, `novel_all`, and `not_novel`
- Stratifies the test set based on training set similarity into `novel_pocket_pli`, `novel_ligand_pli`, `novel_protein`, `novel_ligand`, `novel_all` and `not_novel`
0 commit comments