Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
92,825 changes: 92,825 additions & 0 deletions 02_activities/assignments/5d2df591-33d4-4b36-bb1d-e3d8d74633ae.csv

Large diffs are not rendered by default.

92,828 changes: 92,828 additions & 0 deletions 02_activities/assignments/5d2df591-33d4-4b36-bb1d-e3d8d74633ae.json

Large diffs are not rendered by default.

202 changes: 202 additions & 0 deletions 02_activities/assignments/Assignment 3.ipynb

Large diffs are not rendered by default.

23 changes: 13 additions & 10 deletions 02_activities/assignments/assignment_2.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,22 +10,25 @@
- For each visualization (good and bad):
- Explain (with reference to material covered up to date, along with readings and other scholarly sources, as needed) why you classified that visualization the way you did.
```
Your answer...




Bad example: Japanese Architectural Heritage https://public.tableau.com/app/profile/alexandervar/viz/JapaneseArchitecturalHeritage/Map
This visualization is visually attractive but weak as a data visualization. It mainly encodes one variable: location. The illustrated buildings are placed near regions on a map, but their placement does not communicate additional information such as time period, type, scale, significance, or change over time. As a result, the design emphasizes decoration more than analysis.
Because there is no legend, scale, or annotation, the viewer is not guided toward comparison or inference. It reads more like an illustrated poster than a visualization designed to support sensemaking. In terms of course principles and common guidance in visualization scholarship, the graphic has low data density and limited rhetorical force: it shows objects but does not help the viewer learn something from the data.

Good example: South Korea Demographics https://public.tableau.com/app/profile/marc.reid/viz/SouthKorea-Demographics/SouthKoreaPopulationDensityandDemographics
This visualization is strong because it integrates multiple related datasets into one figure. It combines a population density map with coordinated plots for total population, fertility, life expectancy, and age and gender structure. This supports both overview and detail, helping viewers connect spatial patterns with long term demographic trends.
Structurally, the figure contains multiple axes, each showing a different aspect of the same topic. Within each axis, visual elements such as position, color, and scale are used consistently, which supports interpretation. The main limitation is cognitive load: the figure is information dense and may be hard to note an entry point for first time viewers.



```
- How could this data visualization have been improved?
```
Your answer...




Bad example: Japanese Architectural Heritage https://public.tableau.com/app/profile/alexandervar/viz/JapaneseArchitecturalHeritage/Map

It can be improved by adding more meaningful variables and a clearer narrative goal. For example, use color to encode historical period, size to represent significance, and symbols to distinguish categories such as religious, residential, and military. Add brief annotations or an interactive tooltip layer to explain why each site is included. This would shift the graphic from aesthetic display toward interpretable evidence.

Good example: South Korea Demographics https://public.tableau.com/app/profile/marc.reid/viz/SouthKorea-Demographics/SouthKoreaPopulationDensityandDemographics
It can be improved by strengthening visual hierarchy and reading guidance. Add clearer section headers, a suggested reading order, or light annotations that highlight key takeaways. If interactive, allow toggling layers so viewers can focus on one axis group at a time. These changes keep the richness while improving accessibility.



Expand Down
48 changes: 41 additions & 7 deletions 02_activities/assignments/assignment_3.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,24 +6,58 @@
- We will finish this class by giving you the chance to use what you have learned in a practical context, by creating data visualizations from raw data.
- Choose a dataset of interest from the [City of Toronto’s Open Data Portal](https://www.toronto.ca/city-government/data-research-maps/open-data/) or [Ontario’s Open Data Catalogue](https://data.ontario.ca/).
- Using Python and one other data visualization software (Excel or free alternative, Tableau Public, any other tool you prefer), create two distinct visualizations from your dataset of choice.

The dataset i chose was from this website:
https://data.ontario.ca/dataset/ontario-top-baby-names-male/resource/9571139d-e505-4a35-82fa-192af66c5714

This dataset is about Ontario top baby names (male).
The tools i used was Python and Google Sheet.

See my Assignment 3 ipynb doc for python code.
see this link for google sheet analysis. https://docs.google.com/spreadsheets/d/1GLQFEEPWy-6nAB5E_MTBAtT1gAzSU0TkH2iQRuExRxY/edit?gid=1596697244#gid=1596697244


For python:
- For each visualization, describe and justify:
> What software did you use to create your data visualization?
Visual Studio Code and Python.
This visualization was created using Python, specifically the pandas library for data processing and matplotlib for plotting.
> Who is your intended audience?
The intended audience includes parents, educators, and students who are interested in understanding long term trends in baby name popularity in Ontario.
> What information or message are you trying to convey with your visualization?
This visualization shows how the popularity of selected male baby names in Ontario has changed over time between 2000 and 2023. It highlights both rising and declining naming trends rather than focusing on a single year snapshot.
> What aspects of design did you consider when making your visualization? How did you apply them? With what elements of your plots?
A line chart was chosen because it is well suited for representing change over time. Clear axis labels and a descriptive title were included to support interpretation. Different markers and line styles were used so that trends can be distinguished without relying only on color.
> How did you ensure that your data visualizations are reproducible? If the tool you used to make your data visualization is not reproducible, how will this impact your data visualization?
The visualization is reproducible because it is generated entirely through a Python script. The script reads fixed input files from Ontario’s Open Data Catalogue, applies deterministic cleaning and filtering steps, and saves the output figure programmatically.
> How did you ensure that your data visualization is accessible?
Accessibility was considered by using readable font sizes, clear labels, and visual distinctions beyond color such as markers and line styles. The legend explicitly labels each name to avoid ambiguity.
> Who are the individuals and communities who might be impacted by your visualization?
Parents and families may use this information when considering baby names. The visualization reflects male naming data as categorized in the original dataset, which represents binary sex classifications and should be interpreted within that limitation.
> How did you choose which features of your chosen dataset to include or exclude from your visualization?
Only a small set of well known male names and years from 2000 to 2023 were included to reduce visual clutter and improve interpretability. Less frequent names were excluded to keep the focus on dominant trends.
> What ‘underwater labour’ contributed to your final data visualization product?
Underwater labour included cleaning bilingual column names, standardizing text formatting, filtering the dataset by year and sex, selecting representative names, and testing multiple design choices to ensure clarity.

For Google Sheet:
> What software did you use to create your data visualization?
This visualization was created using Google Sheets.
> Who is your intended audience?

The intended audience is a general public audience, including parents and educators, who want a quick and accessible overview of popular male baby names in Ontario.
> What information or message are you trying to convey with your visualization?

This visualization shows the relative popularity of the top 10 male baby names in Ontario in a single recent year. It emphasizes comparison across names at one point in time.
> What aspects of design did you consider when making your visualization? How did you apply them? With what elements of your plots?

A bar chart was selected because it supports straightforward comparison across categories. A simple color palette, clear axis labels, and an informative title were used to reduce cognitive load and support readability.
> How did you ensure that your data visualizations are reproducible? If the tool you used to make your data visualization is not reproducible, how will this impact your data visualization?

Google Sheets visualizations are not fully reproducible in the same way as code based workflows. However, the chart can be recreated by applying the same documented steps, including filtering by year and sorting by frequency, to the publicly available dataset.
> How did you ensure that your data visualization is accessible?

The visualization uses clear text labels, a simple layout, and does not rely on color alone to communicate values. The chart is readable without prior technical knowledge.
> Who are the individuals and communities who might be impacted by your visualization?

Parents and families may be influenced by perceptions of name popularity. As with the Python visualization, the chart reflects male naming categories as defined in the dataset.
> How did you choose which features of your chosen dataset to include or exclude from your visualization?

Only the top 10 names from a single year were included to keep the visualization focused and easy to interpret. Lower frequency names were excluded to avoid overcrowding.
> What ‘underwater labour’ contributed to your final data visualization product?
Underwater labour included filtering the dataset by year, sorting by frequency, selecting a subset of names, and adjusting chart settings to improve clarity and presentation.

- This assignment is intentionally open-ended - you are free to create static or dynamic data visualizations, maps, or whatever form of data visualization you think best communicates your information to your audience of choice!
- Total word count should not exceed **(as a maximum) 1000 words**
Expand Down
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added 02_activities/assignments/image.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Loading