This fork of SWE-bench includes updates necessary for running SWE-rebench files. This repository was forked at commit fea293e.
The original README.md is available here.
To run a sample evaluation, use the following command:
python -m swebench.harness.run_evaluation \
--dataset_name nebius/SWE-rebench \
--predictions_path gold \
--instance_ids oemof__tespy-653 \
--cache_level instance \
--run_id validate-gold \
--namespace ""