I agree that a larger sample would be welcome. In particular, spreading the experiment out over many days would help eliminate an "oh, all these packages are the same, they probably go here" mistake. Whilst sending them all out on the same day eliminates some sources of bias, it may have made the experiment more susceptible to other errors.