I'd like to get a recommendation how to attack a problem of predicting multiple numbers.
Training data contains 4 columns, each says a probability of record being in the bucket. So, for example:
X = [0.25, 0.5, 0.25, 0.0]
and
corresponding output should be e.g. Y = [0.8, 0.1, 0.0, 0.1]
.
Each row should sum to 1.0
What type of approach would you recommend?
I already tried simple neural network with 4 neurons in the last layer and softmax activation but wondering if there is a better solution.
Thanks!
0-18 years, x[1] = 19-40 years, x[2] = 41-60 years and x[3] is 61+ years. x[0] = 0.25
means 25% of people belong to the age group 0-18 years. And Y contains real distribution. – Peter Krejzl May 28 '18 at 12:19