gotchaMajor
What exactly is the difference between supervised and unsupervised learning?
Viewed 0 times
unsupervisedthewhatlearningdifferencebetweenandexactlysupervised
Problem
I am trying to understand clustering methods.
What I I think I understood:
-
In supervised learning, the categories/labels data is assigned to are known before computation. So, the labels, classes or categories are being used in order to "learn" the parameters that are really significant for those clusters.
-
In unsupervised learning, datasets are assigned to segments, without the clusters being known.
Does that mean that, if I don't even know which parameters are crucial for a segmentation, I should prefer supervised learning?
What I I think I understood:
-
In supervised learning, the categories/labels data is assigned to are known before computation. So, the labels, classes or categories are being used in order to "learn" the parameters that are really significant for those clusters.
-
In unsupervised learning, datasets are assigned to segments, without the clusters being known.
Does that mean that, if I don't even know which parameters are crucial for a segmentation, I should prefer supervised learning?
Solution
The difference is that in supervised learning the "categories", "classes" or "labels" are known. In unsupervised learning, they are not, and the learning process attempts to find appropriate "categories". In both kinds of learning all parameters are considered to determine which are most appropriate to perform the classification.
Whether you chose supervised or unsupervised should be based on whether or not you know what the "categories" of your data are. If you know, use supervised learning. If you do not know, then use unsupervised.
As you have a large number of parameters and you do not know which ones are relevant, you could use something like principle component analysis to help determine the relevant ones.
Whether you chose supervised or unsupervised should be based on whether or not you know what the "categories" of your data are. If you know, use supervised learning. If you do not know, then use unsupervised.
As you have a large number of parameters and you do not know which ones are relevant, you could use something like principle component analysis to help determine the relevant ones.
Context
StackExchange Computer Science Q#2907, answer score: 24
Revisions (0)
No revisions yet.