create account

Machine Learning on a Cancer Dataset - Part 27 by cristi

View this thread on: hive.blogpeakd.comecency.com
· @cristi ·
$290.52
Machine Learning on a Cancer Dataset - Part 27
In the third video on support vector machines (SVMs) we begin implementing an SVM on our cancer dataset in scikit-learn.

We're using a support vector classifier (SVC) with an RBF (radial basis function) kernel. For an overview on kernels and how they work conceptually, please look at the previous video in this series. 

There are many parameters that can be adjusted for our classifier. The defaults are usually good to start with. However, for our cancer dataset, the classifier seems to be overfitting with the default parameters (as it leads to 100% performance on the training subset). 

To fix this we could try adjusting parameters such as the C and/or gamma which control regularization and the width of the Gaussian kernel. We could also look into the scaling of the data; it is currently unscaled. And this is what we're gonna work on in the next video. But for now, see the current tutorial on how to implement SVMs in scikit-learn.
___
As a reminder:

In this series I'm going to explore the cancer dataset that comes pre-loaded with scikit-learn. The purpose is to train the classifiers on this dataset, which consists of labeled data: ~569 tumor samples, each labeled malignant or benign, and then use them on new, unlabeled data.
___
Previous videos in this series:

1. [Machine Learning on a Cancer Dataset - Part 20](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-20)
2. [Machine Learning on a Cancer Dataset - Part 21](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-21)
3. [Machine Learning on a Cancer Dataset - Part 22](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-22)
4. [Machine Learning on a Cancer Dataset - Part 23](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-23)
5. [Machine Learning on a Cancer Dataset - Part 24](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-24)
6. [Machine Learning on a Cancer Dataset - Part 25](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-25)
7. [Machine Learning on a Cancer Dataset - Part 26](https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-26)
___
<center><iframe width="560" height="315" src="https://www.youtube.com/embed/cciPGGnHAKQ" frameborder="0" allowfullscreen></iframe></center>
___
### <center>To stay in touch with me, follow @cristi </center>
___
[Cristi Vlad](http://cristivlad.com), Self-Experimenter and Author
👍  , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , and 69 others
properties (23)
authorcristi
permlinkmachine-learning-on-a-cancer-dataset-part-27
categorymachine-learning
json_metadata{"tags":["machine-learning","science","python"],"users":["cristi"],"image":["https://img.youtube.com/vi/cciPGGnHAKQ/0.jpg"],"links":["https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-20","https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-21","https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-22","https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-23","https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-24","https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-25","https://steemit.com/machine-learning/@cristi/machine-learning-on-a-cancer-dataset-part-26","https://www.youtube.com/embed/cciPGGnHAKQ","http://cristivlad.com"],"app":"steemit/0.1","format":"markdown"}
created2017-06-18 13:46:39
last_update2017-06-18 13:46:39
depth0
children6
last_payout2017-06-25 13:46:39
cashout_time1969-12-31 23:59:59
total_payout_value249.364 HBD
curator_payout_value41.153 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length2,566
author_reputation128,305,218,872,904
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,177,122
net_rshares14,352,860,106,741
author_curate_reward""
vote details (133)
@aguayojoshua ·
Could you explain a bit what what RVC and SBF are? (sorry if I got the acronyms wrong, I'm on mobile) XD
properties (22)
authoraguayojoshua
permlinkre-cristi-2017618t102824371z
categorymachine-learning
json_metadata{"tags":"machine-learning","app":"esteem/1.4.5","format":"markdown+html","community":"esteem"}
created2017-06-18 14:28:24
last_update2017-06-18 14:28:24
depth1
children1
last_payout2017-06-25 14:28:24
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length104
author_reputation469,235,892,088
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries
0.
accountesteemapp
weight500
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,180,070
net_rshares0
@cristi ·
SVC is the support vector classifier and RBF is the Gaussian kernel or radial basis function kernel, and these have been explained in the previous [video](https://www.youtube.com/watch?v=404knXpDaPM)
properties (22)
authorcristi
permlinkre-aguayojoshua-re-cristi-2017618t102824371z-20170618t145126972z
categorymachine-learning
json_metadata{"tags":["machine-learning"],"links":["https://www.youtube.com/watch?v=404knXpDaPM"],"app":"steemit/0.1"}
created2017-06-18 14:50:33
last_update2017-06-18 14:50:33
depth2
children0
last_payout2017-06-25 14:50:33
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length199
author_reputation128,305,218,872,904
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,181,731
net_rshares0
@arckrai ·
Machine Learning is so interesting.
Thank you for this Post!
👍  
properties (23)
authorarckrai
permlinkre-cristi-machine-learning-on-a-cancer-dataset-part-27-20170618t141451579z
categorymachine-learning
json_metadata{"tags":["machine-learning"],"app":"steemit/0.1"}
created2017-06-18 14:14:54
last_update2017-06-18 14:14:54
depth1
children1
last_payout2017-06-25 14:14:54
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length60
author_reputation16,610,122,450,873
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,179,053
net_rshares145,088,301
author_curate_reward""
vote details (1)
@cristi ·
you're welcome.
👍  
properties (23)
authorcristi
permlinkre-arckrai-re-cristi-machine-learning-on-a-cancer-dataset-part-27-20170618t142123878z
categorymachine-learning
json_metadata{"tags":["machine-learning"],"app":"steemit/0.1"}
created2017-06-18 14:20:30
last_update2017-06-18 14:20:30
depth2
children0
last_payout2017-06-25 14:20:30
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length15
author_reputation128,305,218,872,904
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,179,471
net_rshares145,088,301
author_curate_reward""
vote details (1)
@randowhale ·
$0.03
This post received a 4.1% upvote from @randowhale thanks to @cristi!  For more information, [click here](https://steemit.com/steemit/@randowhale/introducing-randowhale-will-you-get-the-100-vote-give-it-a-shot)!
👍  
properties (23)
authorrandowhale
permlinkre-machine-learning-on-a-cancer-dataset-part-27-20170624t145055
categorymachine-learning
json_metadata"{"app": "randowhale/0.1", "format": "markdown"}"
created2017-06-24 14:50:57
last_update2017-06-24 14:50:57
depth1
children0
last_payout2017-07-01 14:50:57
cashout_time1969-12-31 23:59:59
total_payout_value0.028 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length210
author_reputation47,657,457,485,459
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,918,127
net_rshares2,563,895,891
author_curate_reward""
vote details (1)
@rublevoy ·
good luck you with your post)
properties (22)
authorrublevoy
permlinkre-cristi-machine-learning-on-a-cancer-dataset-part-27-20170618t142357423z
categorymachine-learning
json_metadata{"tags":["machine-learning"],"app":"steemit/0.1"}
created2017-06-18 14:24:00
last_update2017-06-18 14:24:00
depth1
children0
last_payout2017-06-25 14:24:00
cashout_time1969-12-31 23:59:59
total_payout_value0.000 HBD
curator_payout_value0.000 HBD
pending_payout_value0.000 HBD
promoted0.000 HBD
body_length29
author_reputation220,503,083,310
root_title"Machine Learning on a Cancer Dataset - Part 27"
beneficiaries[]
max_accepted_payout1,000,000.000 HBD
percent_hbd10,000
post_id5,179,726
net_rshares0