Referee wants me to do some tests using a completely different dataset that is: not comparable, expensive, and too limited to estimate what I want. Can I simply decline the request (explaining why of course)?
More generally, how reluctant should one be to disobey a referee? Where do you draw the line in the sand?