An basic repository with an environment and .gitignore
https://www.thesquirrelcensus.com
The survey data was originally collected and stored in to files available at this link:
https://www.thesquirrelcensus.com/data
For this exercise, we will use an integrated .csv
file that combines data and is a little more structured than the raw .csv
. This file was downloaded from kaggle.com at this link:
The downloaded file is already added to the class repository in the data/
folder as as 2018_Central_Park_Squirrel_Census_-_Squirrel_Data.csv
. You can copy this folder to your new repository on your local machine
-
Create a short name for your group and a new repository for this exercise. Clone the repo to your local machines.
-
Load the combined datafile (either using the url or from your repository's
../data
directory) usingpd.read_csv()
. -
Use standard
df.info()
,df.head()
,df.describe()
to explore the data. -
Read over the user guide here to understand what the various columns contain.
Additional information on the survey and stories are available here:
https://www.thesquirrelcensus.com/data
- Work together to make a new
.ipynb
in your repo that contains an analysis and simple visualization of the data!