[python] How to calculate the sum of all columns of a 2D numpy array (efficiently)
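
A minimal sketch of one common way to do this, assuming "sum of all columns" means one total per column. The array `a` below is a made-up example, not data from the question. Summing along `axis=0` collapses the rows in a single vectorized call, which is usually much faster than looping over columns in Python:

```python
import numpy as np

# Hypothetical example array: 3 rows, 4 columns
a = np.arange(12).reshape(3, 4)

# axis=0 sums down the rows, leaving one total per column
col_sums = a.sum(axis=0)
print(col_sums)  # [12 15 18 21]
```

If row totals are wanted instead, `a.sum(axis=1)` is the equivalent call along the other axis.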