|
6 | 6 | "source": [ |
7 | 7 | "# Random Forest\n", |
8 | 8 | "- Ensemble Algorithm\n", |
9 | | - "- model made up of many decision trees\n", |
| 9 | + "- model made up of many decision trees, which are independent of/uncorrelated to each other\n", |
| 10 | + "- because of no/little correlation between trees provides randomness, due to which impact of error from one tree dont influence other trees\n", |
10 | 11 | "\n", |
11 | 12 | "Key Concepts:\n", |
12 | 13 | "- While building trees it performs random sampling of training data points\n", |
|
220 | 221 | "plt.show()" |
221 | 222 | ] |
222 | 223 | }, |
| 224 | + { |
| 225 | + "cell_type": "markdown", |
| 226 | + "metadata": {}, |
| 227 | + "source": [ |
| 228 | + "# Advantages\n", |
| 229 | + "* its robust for missing and erroneous data as well as insufficient information with good performance" |
| 230 | + ] |
| 231 | + }, |
| 232 | + { |
| 233 | + "cell_type": "markdown", |
| 234 | + "metadata": {}, |
| 235 | + "source": [ |
| 236 | + "# Questions and Answers\n", |
| 237 | + "**Ques:** While taking samples randomly in random forest for making individual decision trees, some of the entries are duplicates. DOes this creates problem?\n", |
| 238 | + "**Ans**: \n", |
| 239 | + "- It will be considered as “Row sampling without replacement”.When a row is added, it means a new data point is added in the feature space. The model will only try to learn from this data point. \n", |
| 240 | + "- Problem happens when the same feature is added twice, it means a dimension will overlap another dimension. It technically makes no sense but just increases the computation of the algorithm. \n", |
| 241 | + "- CRicket Balls Box: Adding same row/reading is like adding ball. No issue with this. But if we add a plate in the base of box it is of no use(like adding duplicate feature)." |
| 242 | + ] |
| 243 | + }, |
| 244 | + { |
| 245 | + "cell_type": "markdown", |
| 246 | + "metadata": {}, |
| 247 | + "source": [ |
| 248 | + "Research Paper: \n", |
| 249 | + "https://www.hindawi.com/journals/jam/2012/258054/ " |
| 250 | + ] |
| 251 | + }, |
223 | 252 | { |
224 | 253 | "cell_type": "code", |
225 | 254 | "execution_count": null, |
|
244 | 273 | "name": "python", |
245 | 274 | "nbconvert_exporter": "python", |
246 | 275 | "pygments_lexer": "ipython3", |
247 | | - "version": "3.6.6" |
| 276 | + "version": "3.8.5" |
248 | 277 | } |
249 | 278 | }, |
250 | 279 | "nbformat": 4, |
|
0 commit comments