Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[MRG+2] modify disadvantage #8521

Merged
merged 1 commit into from
Mar 4, 2017
Merged

Conversation

Ellen-Co2
Copy link
Contributor

@Ellen-Co2 Ellen-Co2 commented Mar 4, 2017

svm can work effectively when feature number is >> number of samples.
But to avoid over-fitting usually happens in such situation by choosing
appropriate kernel (model selection) is important

Reference Issue

<-- Fixes #8450 -->

What does this implement/fix? Explain your changes.

In case of high dimensionality, SVM can still work effectively, but the over-fitting issue still need to be considered, cause the vc dimension might be close to infinite in such case, thus choose of kernel or control the regularization factor "C" is essential.

Any other comments?

To test for over-fitting, use cross validation or larger hold-out can be useful. Check some discussions regarding dimensionality here

svm can work effectively when feature number is >> number of samples.
But to avoid over-fitting usually happens in such situation by choosing
appropriate kernel (model selection) is important
@amueller
Copy link
Member

amueller commented Mar 4, 2017

LGTM.

@@ -28,7 +28,8 @@ The advantages of support vector machines are:
The disadvantages of support vector machines include:

- If the number of features is much greater than the number of
samples, the method is likely to give poor performances.
samples, avoid over-fitting in choosing :ref:`svm_kernels` and regularization
Copy link
Member

@amueller amueller Mar 4, 2017

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Did you respect the 80 character line length?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks like its 84 characters, is that a major issue?

@amueller amueller changed the title [MRG] modify disadvantage [MRG + 1] modify disadvantage Mar 4, 2017
@codecov
Copy link

codecov bot commented Mar 4, 2017

Codecov Report

Merging #8521 into master will not change coverage.
The diff coverage is n/a.

@@           Coverage Diff           @@
##           master    #8521   +/-   ##
=======================================
  Coverage   95.48%   95.48%           
=======================================
  Files         342      342           
  Lines       60913    60913           
=======================================
  Hits        58160    58160           
  Misses       2753     2753

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update cdd693b...54b789b. Read the comment docs.

@jmschrei
Copy link
Member

jmschrei commented Mar 4, 2017

LGTM as well.

@jmschrei jmschrei changed the title [MRG + 1] modify disadvantage [MRG+2] modify disadvantage Mar 4, 2017
@jmschrei jmschrei merged commit 56d5789 into scikit-learn:master Mar 4, 2017
@jmschrei
Copy link
Member

jmschrei commented Mar 4, 2017

Congrats @Ellen-Co2 !

@Przemo10 Przemo10 mentioned this pull request Mar 17, 2017
herilalaina pushed a commit to herilalaina/scikit-learn that referenced this pull request Mar 26, 2017
[MRG+2] modify disadvantage
massich pushed a commit to massich/scikit-learn that referenced this pull request Apr 26, 2017
[MRG+2] modify disadvantage
Sundrique pushed a commit to Sundrique/scikit-learn that referenced this pull request Jun 14, 2017
[MRG+2] modify disadvantage
NelleV pushed a commit to NelleV/scikit-learn that referenced this pull request Aug 11, 2017
[MRG+2] modify disadvantage
paulha pushed a commit to paulha/scikit-learn that referenced this pull request Aug 19, 2017
[MRG+2] modify disadvantage
maskani-moh pushed a commit to maskani-moh/scikit-learn that referenced this pull request Nov 15, 2017
[MRG+2] modify disadvantage
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Ambiguous Support Vector Machines documentation
3 participants