Effect of ensemble size on reliability and Brier score