[SPARK-20862][MLLIB][PYTHON] Avoid passing float to ndarray.reshape i…

…n LogisticRegressionModel ## What changes were proposed in this pull request? Fixed TypeError with python3 and numpy 1.12.1. Numpy's `reshape` no longer takes floats as arguments as of 1.12. Also, python3 uses float division for `/`, we should be using `//` to ensure that `_dataWithBiasSize` doesn't get set to a float. ## How was this patch tested? Existing tests run using python3 and numpy 1.12. Author: Bago Amirbekian <[email protected]> Closes #18081 from MrBago/BF-py3floatbug.
apache · ambauma · May 10, 2017 · Oct 10, 2017 · Oct 10, 2017 · Oct 19, 2017
commit cb1609b055dafd78af15d7a1b19658f81df1ebca
diff --git a/python/pyspark/mllib/classification.py b/python/pyspark/mllib/classification.py
@@ -173,7 +173,7 @@ def __init__(self, weights, intercept, numFeatures, numClasses):
             self._dataWithBiasSize = None
             self._weightsMatrix = None
         else:
-            self._dataWithBiasSize = self._coeff.size / (self._numClasses - 1)
+            self._dataWithBiasSize = self._coeff.size // (self._numClasses - 1)
             self._weightsMatrix = self._coeff.toArray().reshape(self._numClasses - 1,
                                                                 self._dataWithBiasSize)