Skip to content

I completed assignment 3#3

Open
Fooroogh-RM wants to merge 3 commits intomainfrom
assignment-3
Open

I completed assignment 3#3
Fooroogh-RM wants to merge 3 commits intomainfrom
assignment-3

Conversation

@Fooroogh-RM
Copy link
Owner

What changes are you trying to make? (e.g. Adding or removing code, refactoring existing code, adding reports)

added the required codes.

What did you learn from the changes you have made?

I learnt more on graphs, clustering

Was there another approach you were thinking about making? If so, what approach(es) were you thinking of?

not really

Were there any challenges? If so, what issue(s) did you face? How did you overcome it?

yes. I had challenges with writing the bootstrapping code and asked my friend to help me out with understanding and writing this part.

How were these changes tested?

just tested them in VS

A reference to a related issue in your repository (if applicable)

Checklist

  • I can confirm that my changes are working as intended

Copy link

@anjali-deshpande-hub anjali-deshpande-hub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Good.
Please submit the requested changes -

Question 2: Could you please give an example of one pair of features from the plots that indicate visible clustering between wines and how the patterns in the plot help in distinguishing wines?

Question 5:

  1. This section has a code block with errors. (The syntax error has been committed)
    You can either remove this block of code, or complete the sections with the variables that you have calculated in the following code block.

  2. Uncomment this code and modify the variables to plot the scatter plots, colored by cluster labels

# Use the helper function to plot scatter plots, colored by cluster labels
#plot_feature_pairs(clustered_wine_data, feature_names, color_labels=clustered_wine_data['Cluster'], title_prefix='Clustered Data: ')
  1. Question 5 (sub question 2) - What is the purpose of calculating the confidence interval from the bootstrap samples? How does it help us interpret the variability of the estimate?
    The confidence interval from bootstrap samples shows how much the estimate (e.g., mean, median) could vary if we collected new data. It helps us understand the uncertainty of the estimate and how stable or reliable it is.

@anjali-deshpande-hub
Copy link

A friendly reminder to finish the requested review changes in order to complete this assignment. Thank you.

@Fooroogh-RM
Copy link
Owner Author

A friendly reminder to finish the requested review changes in order to complete this assignment. Thank you.

I completed the requested and pushed it under assignment_3.ipynb.
For question 5, part 2. This was already completed; it is only in the next block of code.

Copy link

@anjali-deshpande-hub anjali-deshpande-hub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the feedback.
From the review comments - Question 2 is fixed, Question 5 (review comment 3 related to part 2) is fixed now.
But review comment 1, 2 for Question 5 is pending.

The notebook file should not have any errors when run. Please see attached screenshot.
Screenshot 2025-12-22 144253

plot_feature_pairs(scaled_wine_df_clustered, feature_names, color_labels=scaled_wine_df_clustered['Cluster'], title_prefix='Clustered Data: ')

needs to be called to plot the K-means cluster plots for all features.

Copy link

@anjali-deshpande-hub anjali-deshpande-hub left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

All fixed now. Thank you,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants

Comments