2

I have the data below: enter image description here I want to explain the relationship between 'Milieu' who has two factors, and 'DAM'. As you may notice, the blue population's included in the red population. Can I apply a logistic regression?

  • yes, you can. the results are not that good, especially in the "overlapping" region. Maybe yu should find other features better explaining the phenomenon – Oscar May 19 '22 at 14:56
  • 1
    It is not clear to me exactly what you want to predict from the graph – Ralph Winters May 19 '22 at 21:33

1 Answers1

1

Yes. If you have numeric features for a classification problem, you can apply logistic regression.

However, you are unlikely to see spectacular results for this data. Let's look at the classic example Iris data set that does perform well under logistic regression:

Iris dataset separability

This data set works well because the classes are largely linearly separable. Essentially, you could draw lines on that graph to separate the classes. Logistic regression is able to learn this and correctly classify most samples.

In the case of your data, logistic regression (and all other methods of classification) will struggle in the "overlapping" region because the features you have available simply don't provide enough information to correctly identify classes in the this region. You should still see some success outside of this region.

The best way to answer whether or not logistic regression will meet your needs is to run an experiment. Run it on your training data while holding back a test set and check performance on the held out data. If this gives performance that meets your needs, you're good to go. Otherwise, you'll likely need to explore additional features or come up with another way to solve this problem.

jncraton
  • 578
  • 5
  • 9