You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: README.md
+81-2
Original file line number
Diff line number
Diff line change
@@ -33,7 +33,7 @@ python3 --version
33
33
which python3
34
34
```
35
35
36
-
The ouput of the directory should be somethhing like "/usr/local/bin/python3.7" or "/Library/Frameworks/Python.framework/Versions/3.7/bin/python3.7"
36
+
The ouput of the directory should be something like "/usr/local/bin/python3.7" or "/Library/Frameworks/Python.framework/Versions/3.7/bin/python3.7"
37
37
38
38
## Getting Started
39
39
@@ -55,7 +55,86 @@ if __name__ == '__main__':
55
55
56
56
57
57
## Solving our first ML Problem
58
-
We are gonna work on our first dataset by using simple [linear regression](http://onlinestatbook.com/2/regression/intro.html). We would be using a fairly small dataset for our problem called [Auto MPG](https://archive.ics.uci.edu/ml/datasets/auto+mpg).
58
+
We are gonna work on our first dataset by using simple [linear regression](http://onlinestatbook.com/2/regression/intro.html). We would be using a fairly small dataset for our problem called [Auto MPG](https://archive.ics.uci.edu/ml/datasets/auto+mpg). Before we get stared we need to install some libraries in our Python environment. In order to do that, click on the terminal tab in PyCharm at the bottom of the window, it should have (venv) as its set environment. Run the following commands in it.
59
+
```bash
60
+
pip install pandas
61
+
pip install sklearn
62
+
pip install matplotlib
63
+
```
64
+
Our dataset is present in the file "auto-mpg.data", so download it inside your project directory.
65
+
66
+
### Preparing the Dataset
67
+
First let's see how can we explore the dataset in Python. Observe that the values of each row in the dataset are seperated by spaces which are not fixed in length. Thus it would require us to read the file line by line in Python. Below is the source code to do the needfull.
0 commit comments