Incorrect preprocessing for ImageNet-C evaluation

I see that the ImageNet-C evaluation uses the preprocessing: `Resize(256)+CenterCrop(224)+ToTensor()`. 

https://github.com/RobustBench/robustbench/blob/61ce9e9d752eb28b54bdd08bc61bad52656d2d45/robustbench/data.py#L146-L154

This causes discrepancies with the scores reported in the original papers (DeepAugment, AugMix, Standard RN-50). The ImageNet-C dataset already contains 224x224 images and hence only `ToTensor()` should be used for consistency. 

Fixing `prepr='none'` in `load_imagenetc` should solve the issue (assuming all the models are capable of handling 224x224 images as input).

	def load_imagenetc(
	n_examples: Optional[int] = 5000,
	severity: int = 5,
	data_dir: str = './data',
	shuffle: bool = False,
	corruptions: Sequence[str] = CORRUPTIONS,
	prepr: str = 'Res256Crop224'
	) -> Tuple[torch.Tensor, torch.Tensor]:
	transforms_test = PREPROCESSINGS[prepr]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Incorrect preprocessing for ImageNet-C evaluation #59

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Incorrect preprocessing for ImageNet-C evaluation #59

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions