Skip to content

Commit 8816917

Browse files
committed
.
1 parent 08c3d58 commit 8816917

4 files changed

Lines changed: 98 additions & 4 deletions

File tree

.ipynb_checkpoints/03. Support Vector Machines-checkpoint.ipynb

Lines changed: 19 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -704,6 +704,24 @@
704704
"plt.show()"
705705
]
706706
},
707+
{
708+
"cell_type": "markdown",
709+
"metadata": {},
710+
"source": [
711+
"# Advantages of SVM\n",
712+
"1. Robust to outliers: it ignore outliers and find the hyper-plane that has the maximum margin. \n",
713+
"2. Effective in cases where the number of features is greater than the number of samples\n"
714+
]
715+
},
716+
{
717+
"cell_type": "markdown",
718+
"metadata": {},
719+
"source": [
720+
"# Disadvantages of SVM\n",
721+
"1. Doesn’t perform well when the dataset is large because the required training time is higher\n",
722+
"2. Doesn’t perform very well, when the data set has more noise i.e. target classes are overlapping"
723+
]
724+
},
707725
{
708726
"cell_type": "code",
709727
"execution_count": null,
@@ -728,7 +746,7 @@
728746
"name": "python",
729747
"nbconvert_exporter": "python",
730748
"pygments_lexer": "ipython3",
731-
"version": "3.6.6"
749+
"version": "3.8.5"
732750
}
733751
},
734752
"nbformat": 4,

.ipynb_checkpoints/09. Random Forest-checkpoint.ipynb

Lines changed: 31 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,8 @@
66
"source": [
77
"# Random Forest\n",
88
"- Ensemble Algorithm\n",
9-
"- model made up of many decision trees\n",
9+
"- model made up of many decision trees, which are independent of/uncorrelated to each other\n",
10+
"- because of no/little correlation between trees provides randomness, due to which impact of error from one tree dont influence other trees\n",
1011
"\n",
1112
"Key Concepts:\n",
1213
"- While building trees it performs random sampling of training data points\n",
@@ -220,6 +221,34 @@
220221
"plt.show()"
221222
]
222223
},
224+
{
225+
"cell_type": "markdown",
226+
"metadata": {},
227+
"source": [
228+
"# Advantages\n",
229+
"* its robust for missing and erroneous data as well as insufficient information with good performance"
230+
]
231+
},
232+
{
233+
"cell_type": "markdown",
234+
"metadata": {},
235+
"source": [
236+
"# Questions and Answers\n",
237+
"**Ques:** While taking samples randomly in random forest for making individual decision trees, some of the entries are duplicates. DOes this creates problem?\n",
238+
"**Ans**: \n",
239+
"- It will be considered as “Row sampling without replacement”.When a row is added, it means a new data point is added in the feature space. The model will only try to learn from this data point. \n",
240+
"- Problem happens when the same feature is added twice, it means a dimension will overlap another dimension. It technically makes no sense but just increases the computation of the algorithm. \n",
241+
"- CRicket Balls Box: Adding same row/reading is like adding ball. No issue with this. But if we add a plate in the base of box it is of no use(like adding duplicate feature)."
242+
]
243+
},
244+
{
245+
"cell_type": "markdown",
246+
"metadata": {},
247+
"source": [
248+
"Research Paper: \n",
249+
"https://www.hindawi.com/journals/jam/2012/258054/ "
250+
]
251+
},
223252
{
224253
"cell_type": "code",
225254
"execution_count": null,
@@ -244,7 +273,7 @@
244273
"name": "python",
245274
"nbconvert_exporter": "python",
246275
"pygments_lexer": "ipython3",
247-
"version": "3.6.6"
276+
"version": "3.8.5"
248277
}
249278
},
250279
"nbformat": 4,

03. Support Vector Machines.ipynb

Lines changed: 18 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -704,6 +704,24 @@
704704
"plt.show()"
705705
]
706706
},
707+
{
708+
"cell_type": "markdown",
709+
"metadata": {},
710+
"source": [
711+
"# Advantages of SVM\n",
712+
"1. Robust to outliers: it ignore outliers and find the hyper-plane that has the maximum margin. \n",
713+
"2. Effective in cases where the number of features is greater than the number of samples\n"
714+
]
715+
},
716+
{
717+
"cell_type": "markdown",
718+
"metadata": {},
719+
"source": [
720+
"# Disadvantages of SVM\n",
721+
"1. Doesn’t perform well when the dataset is large because the required training time is higher\n",
722+
"2. Doesn’t perform very well, when the data set has more noise i.e. target classes are overlapping"
723+
]
724+
},
707725
{
708726
"cell_type": "code",
709727
"execution_count": null,

09. Random Forest.ipynb

Lines changed: 30 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,8 @@
66
"source": [
77
"# Random Forest\n",
88
"- Ensemble Algorithm\n",
9-
"- model made up of many decision trees\n",
9+
"- model made up of many decision trees, which are independent of/uncorrelated to each other\n",
10+
"- because of no/little correlation between trees provides randomness, due to which impact of error from one tree dont influence other trees\n",
1011
"\n",
1112
"Key Concepts:\n",
1213
"- While building trees it performs random sampling of training data points\n",
@@ -220,6 +221,34 @@
220221
"plt.show()"
221222
]
222223
},
224+
{
225+
"cell_type": "markdown",
226+
"metadata": {},
227+
"source": [
228+
"# Advantages\n",
229+
"* its robust for missing and erroneous data as well as insufficient information with good performance"
230+
]
231+
},
232+
{
233+
"cell_type": "markdown",
234+
"metadata": {},
235+
"source": [
236+
"# Questions and Answers\n",
237+
"**Ques:** While taking samples randomly in random forest for making individual decision trees, some of the entries are duplicates. DOes this creates problem?\n",
238+
"**Ans**: \n",
239+
"- It will be considered as “Row sampling without replacement”.When a row is added, it means a new data point is added in the feature space. The model will only try to learn from this data point. \n",
240+
"- Problem happens when the same feature is added twice, it means a dimension will overlap another dimension. It technically makes no sense but just increases the computation of the algorithm. \n",
241+
"- CRicket Balls Box: Adding same row/reading is like adding ball. No issue with this. But if we add a plate in the base of box it is of no use(like adding duplicate feature)."
242+
]
243+
},
244+
{
245+
"cell_type": "markdown",
246+
"metadata": {},
247+
"source": [
248+
"Research Paper: \n",
249+
"https://www.hindawi.com/journals/jam/2012/258054/ "
250+
]
251+
},
223252
{
224253
"cell_type": "code",
225254
"execution_count": null,

0 commit comments

Comments
 (0)