If we consider matrices simply as tables of numbers, then we can define many different binary operations that we could call ''multiplication'', using this name simply to distinguish the operation from addition (defined as the sum of corresponding elements). Obviously, different definitions give the ''multiplication'' different properties, and some can be useful in some contexts but not in others.
As an example, the Hadamard product of two matrices (defined as the product of corresponding elements) is associative, distributive over addition, and also commutative, but it is defined only for matrices that have the same dimensions, and (as far as I know) it is used in computer graphics.
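To make the entrywise definition concrete, here is a minimal sketch in plain Python. The function name `hadamard` and the sample matrices are my own choices for illustration, and matrices are assumed to be stored as lists of rows.

```python
def hadamard(a, b):
    """Entrywise (Hadamard) product of two matrices stored as lists of rows."""
    same_shape = len(a) == len(b) and all(
        len(row_a) == len(row_b) for row_a, row_b in zip(a, b)
    )
    if not same_shape:
        raise ValueError("the Hadamard product needs matrices of the same dimensions")
    return [[x * y for x, y in zip(row_a, row_b)] for row_a, row_b in zip(a, b)]


# Sample matrices, chosen only for illustration.
A = [[1, 2], [3, 4]]
B = [[0, 1], [5, -2]]
C = [[2, 0], [1, 3]]

AB = hadamard(A, B)  # [[0, 2], [15, -8]]

# Commutativity and associativity hold entry by entry, because they hold for numbers.
commutes = hadamard(A, B) == hadamard(B, A)
associates = hadamard(hadamard(A, B), C) == hadamard(A, hadamard(B, C))
```

Both `commutes` and `associates` come out true here, while calling `hadamard` on a 2x2 and a 2x3 matrix raises an error, which is the dimension restriction mentioned above.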
The Kronecker product is another possible kind of multiplication. It has useful properties and important applications, since it is related to the tensor product of linear transformations.
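As a rough illustration, the Kronecker product replaces each entry of the first matrix by that entry times the whole second matrix. The sketch below is plain Python with names of my own (`kronecker`, `matmul`) and arbitrary sample matrices; it also checks the mixed-product identity (A⊗B)(C⊗D) = (AC)⊗(BD) on one example, which is one way to see the link with composing linear transformations.

```python
def matmul(a, b):
    """Usual row-column product of matrices stored as lists of rows."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def kronecker(a, b):
    """Kronecker product: every entry a[i][j] becomes the block a[i][j] * b."""
    return [
        [x * y for x in row_a for y in row_b]
        for row_a in a
        for row_b in b
    ]


# Sample matrices, chosen only for illustration.
A = [[1, 2], [3, 4]]
B = [[0, 1], [1, 0]]
C = [[1, 1], [0, 1]]
D = [[2, 0], [0, 3]]

K = kronecker(A, B)
# K == [[0, 1, 0, 2],
#       [1, 0, 2, 0],
#       [0, 3, 0, 4],
#       [3, 0, 4, 0]]

# Mixed-product identity, checked on this one example only.
mixed_ok = matmul(kronecker(A, B), kronecker(C, D)) == kronecker(matmul(A, C), matmul(B, D))
```

Note that, unlike the Hadamard product, this operation is defined for matrices of any dimensions: an m×n matrix and a p×q matrix give an mp×nq result.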
The usual row-column product has the advantage that it represents the action of linear transformations between vector spaces and captures all the properties of these transformations (linearity, associativity, non-commutativity, existence of a neutral element and of non-invertible elements). There is some amount of convention in the definition, in the sense that we can choose rows on the left and columns on the right (as usual) or vice versa, but these two alternatives give isomorphic structures.
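A small check of the first claim, again as a sketch in plain Python with helper names (`matmul`, `apply`) and sample matrices that are my own: applying the product AB to a vector gives the same result as applying B first and then A, and the order of the factors matters.

```python
def matmul(a, b):
    """Row-column product: entry (i, j) is row i of a times column j of b."""
    return [
        [sum(a[i][k] * b[k][j] for k in range(len(b))) for j in range(len(b[0]))]
        for i in range(len(a))
    ]


def apply(a, v):
    """Action of the matrix a on the vector v."""
    return [sum(x * y for x, y in zip(row, v)) for row in a]


# Sample data, chosen only for illustration.
A = [[0, -1], [1, 0]]   # rotation by 90 degrees
B = [[1, 0], [0, -1]]   # reflection across the first axis
I = [[1, 0], [0, 1]]    # neutral element
v = [2, 3]

composed = apply(matmul(A, B), v)   # (AB)v
stepwise = apply(A, apply(B, v))    # A(Bv)
# Both are [3, 2]: the product represents the composition "first B, then A".

non_commutative = matmul(A, B) != matmul(B, A)
neutral = matmul(A, I) == A and matmul(I, A) == A
```

The entrywise product has none of this meaning in general: multiplying the entries of A and B does not describe doing one transformation after the other.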