ImageNet-3DCC and corruption updates by ofkar · Pull Request #85 · RobustBench/robustbench

ofkar · 2022-04-30T17:34:22Z

Hi

Git shows that nearly every file in the repository has been changed in this pull request (224 files in total), although most files have no actual changes. This can be a bit misleading in the git history. Is it possible to avoid this? The commit responsible seems to be this one.

I forked again and made the updates, this should be good now. Sorry for the issue.

I'd remove print statements here and here.

Done.

Now thinking a bit more, I'm not sure if it's needed there. In principle, the evaluation process of 3DCC is the same as for 2DCC which is already illustrated in the lines above. The only difference is how to download the data.

Actually, I also created a new loader function to isolate the two. Also, the quickstart includes the names of the corruptions in ImageNet-3DCC which could be handy. Let me know if this sounds good.

Now about how to get the data: we briefly noted here that the test set of ImageNet should be downloaded manually. We didn't say anything about ImageNet-C, though (I think we just forgot to write something about it). I'd suggest that you add a note there on where to download ImageNet-C, followed by your instruction about ImageNet-3DCC:
Download the data from here using the provided tool. The data will be saved into a folder named ImageNet-3DCC.
What do you think?

Good point, added a section for that.

Since it's not going to be a new leaderboard, I'd formulate it a bit differently. More like: "We have extended the common corruptions leaderboard on ImageNet" instead of "created a new benchmark". It's also worth specifying that we still sort the entries according to the 2D common corruptions, and explaining in a single sentence why these 3D common corruptions are interesting: (1) they are more realistic, (2) they can be used to assess generalization of the existing models, which may have overfitted to 2DCC.

Done.

And as a separate news item, I'd also add that we fixed the preprocessing issue and write explicitly that this changed the ranking between the top-1 and top-2 entries.

Done.

Let me know if you see further issues. Thanks!

fra31 · 2022-04-30T17:50:10Z

Hi,

thanks a lot for the contribution, it looks great!

Is it possible to add also the unaggregated results as here?

As a minor thing, I'd add arrows (for example) to the header of the table in the README to indicate that for robust accuracy higher is better, while for mCE it's the opposite.
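To make the relation between unaggregated results and the two aggregated columns concrete, here is a minimal sketch. It follows the standard mCE definition from the ImageNet-C paper (per-corruption error summed over severities, normalized by a baseline model's error, then averaged over corruptions). Whether the RobustBench scripts match this in every detail is an assumption, and the function names, corruption names, and numbers below are made up for illustration.

```python
def mean_corruption_error(model_err, baseline_err):
    """Compute mCE (in percent) from unaggregated error rates.

    Both arguments map a corruption name to a list of error rates,
    one per severity level. `baseline_err` holds the errors of the
    normalizing baseline model (AlexNet in the ImageNet-C paper).
    """
    ces = []
    for corruption, errs in model_err.items():
        # Corruption error: model error relative to the baseline error.
        ces.append(sum(errs) / sum(baseline_err[corruption]))
    return 100.0 * sum(ces) / len(ces)


def mean_robust_accuracy(model_err):
    """Average accuracy over all corruptions and severities."""
    all_errs = [e for errs in model_err.values() for e in errs]
    return 1.0 - sum(all_errs) / len(all_errs)


# Illustrative numbers only, not real results.
model_err = {
    "fog": [0.2, 0.3, 0.4, 0.5, 0.6],
    "blur": [0.1, 0.2, 0.3, 0.4, 0.5],
}
baseline_err = {
    "fog": [0.6, 0.7, 0.8, 0.9, 1.0],
    "blur": [0.4, 0.5, 0.6, 0.7, 0.8],
}

# Table header with arrows: higher accuracy is better, lower mCE is better.
HEADER = "| Model | Robust accuracy ↑ | mCE ↓ |"

mce = mean_corruption_error(model_err, baseline_err)
acc = mean_robust_accuracy(model_err)
```

Because mCE is built from error rates, a better model has a lower value, which is why the two columns need opposite arrows.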

ofkar · 2022-04-30T20:58:44Z

Thanks. I made the updates you suggested. Let me know if everything looks good.

max-andr · 2022-05-01T06:48:21Z

Wow, that was fast, especially for a Saturday ;) The changes look good. A few further suggestions:

README, news section: add a space after '-' (so that Markdown renders it as a bullet point).
README, news section: I'd explain in a few words: "We fixed the preprocessing issue for ImageNet corruption evaluations." -> "We fixed the preprocessing issue for ImageNet corruption evaluations: previously we resized to 256x256 and center-cropped to 224x224, which was a mistake since the ImageNet-C images are already 224x224 and we cropped them further, losing information."
Slightly reorganized the dataset downloading instructions.
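As a rough illustration of how much information the old preprocessing discarded, the following sketch computes which part of an already 224x224 image survives a resize to 256x256 followed by a central 224x224 crop. It is plain arithmetic under the assumption of square images; the function name is hypothetical and not part of RobustBench.

```python
def retained_region(image_size=224, resize_to=256, crop_to=224):
    """Return (side, area_fraction) of the original image that survives
    a resize to `resize_to` followed by a central crop of `crop_to`.

    `side` is measured in pixels of the original (pre-resize) image.
    Assumes a square input image.
    """
    # A crop of `crop_to` pixels in the resized image corresponds to
    # crop_to * image_size / resize_to pixels in the original image.
    side = min(crop_to * image_size / resize_to, image_size)
    return side, (side / image_size) ** 2


side, fraction = retained_region()
```

Under these assumptions only the central 196x196 region is kept, which is about 77% of the image area, so a border of 14 pixels on each side never reaches the model.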

Actually, I also created a new loader function to isolate the two. Also, the quickstart includes the names of the corruptions in ImageNet-3DCC which could be handy. Let me know if this sounds good.

Ok, I agree, some people may find that snippet useful. I've compressed it a bit to make it more concise (e.g., saving as a pickle is not essential, the loop over models is something that users can do by themselves, etc.).

I applied these and a few other minor changes to the README in a new commit. I'd say that everything looks good to me and we could merge unless others (@VSehwag, @dedeswim) have some further suggestions.

dedeswim

LGTM. Sorry if it took me long to review this. Thanks a lot for the contribution! :)

dedeswim

Actually, I have just realized that also the Jinja template used to generate the leaderboard website should be updated to reflect this addition, by adding the new columns in the Corruptions ImageNet leaderboard.

@ofkar can you take care of that? Otherwise I can help with this :)

ofkar · 2022-05-03T21:12:53Z

@dedeswim I'm not too familiar with it, but I actually created another PR to update the website too: RobustBench/robustbench.github.io#13

So is this update related to those changes as well? I have already entered new entries to the corruption leaderboard there.

dedeswim · 2022-05-03T21:22:45Z

Yeah, I saw the PR, thanks also for that one! We have this template and script which we use for generating the leaderboard from the *.json files to make updates easier. If you are not familiar with it, I can edit the template for you, no worries.
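For readers unfamiliar with this kind of setup, here is a minimal sketch of generating leaderboard rows from per-model JSON data. It is not the project's actual script or template: it uses plain `str.format` from the standard library as a stand-in for Jinja, and the JSON keys (`name`, `acc_2dcc`, `acc_3dcc`) are hypothetical. Rows are sorted by the 2D common corruptions accuracy, in line with the sorting rule discussed earlier in this thread.

```python
import json

# One table row per model; the 3DCC value is the newly added column.
ROW = (
    "<tr><td>{rank}</td><td>{name}</td>"
    "<td>{acc_2dcc:.1f}%</td><td>{acc_3dcc:.1f}%</td></tr>"
)


def render_rows(json_blobs):
    """Render HTML table rows from an iterable of JSON strings.

    Each string describes one model. Entries are ranked by 2DCC
    accuracy, best first; the 3DCC accuracy is shown but not used
    for ranking.
    """
    entries = [json.loads(blob) for blob in json_blobs]
    entries.sort(key=lambda e: e["acc_2dcc"], reverse=True)
    return "\n".join(
        ROW.format(rank=i, **entry) for i, entry in enumerate(entries, start=1)
    )


# Illustrative data only, not real leaderboard results.
blobs = [
    '{"name": "A", "acc_2dcc": 52.0, "acc_3dcc": 50.1}',
    '{"name": "B", "acc_2dcc": 56.3, "acc_3dcc": 49.0}',
]
html = render_rows(blobs)
```

Keeping the data in JSON and the layout in a template means that adding a column only requires touching the template, which is the change requested above.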

max-andr · 2022-05-09T08:24:04Z

We agreed with Edoardo that the change to the Jinja template can be done as a separate PR. Merging then! Thanks again, Oguzhan!

ofkar and others added 6 commits April 30, 2022 16:44

ImageNet-3DCC

f654e32

ImageNet-3DCC

6db61ae

ImageNet-3DCC

be41c74

Update README.md

276c132

Update README.md

3bb43a8

Update README.md

e3fa245

ofkar changed the title ~~ImageNet-3DD and corruption updates~~ ImageNet-3DCC and corruption updates Apr 30, 2022

ofkar added 3 commits April 30, 2022 20:00

added arrows

b9a1017

Delete unaggregated_results.csv

e448a68

Add files via upload

c34d780

various improvements of the README

4bd71d7

add pointer to the Github issue about the preprocessing on imagenet

7e99e1a

dedeswim approved these changes May 3, 2022

dedeswim requested changes May 3, 2022

max-andr mentioned this pull request May 9, 2022

Incorrect preprocessing for ImageNet-C evaluation #59

Closed

max-andr merged commit df31621 into RobustBench:master May 9, 2022

max-andr merged 11 commits into RobustBench:master from ofkar:master
