Commit
_intelligent_sampling:correction
guillaume-vignal committed Jun 18, 2024
1 parent 399e339 commit 73f5416
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion contribution_plot_improvment.md
@@ -100,7 +100,7 @@ def _intelligent_sampling(self, data, max_points, col_value_count, random_seed):
The `_intelligent_sampling` function selects a subset of data based on the distribution of values in a specified column. Here’s how it operates:

1. **Data Type Handling**:
-   - It checks if the column (`col`) contains numerical (`float` or `int`) or categorical (`object` or `category`) data.
+   - It checks if the data contains numerical (`float` or `int`) or categorical (`object` or `category`) data.

2. **Condition Check**:
- If the number of unique values (`col_value_count`) is less than 5% of the total rows in the dataset (`len(data) / 20`) or if the column contains string data, it uses the original column values without clustering (`is_col_str`).
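The condition check described in the hunk above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual implementation: the helper name `uses_raw_values` and its `values` parameter are hypothetical, and only `col_value_count`, `is_col_str`, and the `len(data) / 20` threshold come from the documented text.

```python
import pandas as pd
from pandas.api.types import is_object_dtype, is_string_dtype


def uses_raw_values(values: pd.Series, col_value_count: int) -> bool:
    """Hypothetical helper: True when values would be used without clustering."""
    # Assumption: "string data" covers object, string, and category dtypes.
    is_col_str = (
        is_object_dtype(values)
        or is_string_dtype(values)
        or isinstance(values.dtype, pd.CategoricalDtype)
    )
    # Fewer unique values than 5% of the rows (len(data) / 20).
    few_unique = col_value_count < len(values) / 20
    return bool(is_col_str or few_unique)


numeric_few = pd.Series([1, 2, 3] * 33 + [1])   # 100 rows, 3 unique values
numeric_many = pd.Series(range(100))            # 100 rows, 100 unique values
strings = pd.Series(["a", "b"] * 50)            # 100 rows of string data

print(uses_raw_values(numeric_few, numeric_few.nunique()))
print(uses_raw_values(numeric_many, numeric_many.nunique()))
print(uses_raw_values(strings, strings.nunique()))
```

In this sketch the comparison is strict, so a column with exactly 5 unique values in 100 rows would not count as "few"; whether the real function behaves the same way at that boundary is not shown in this diff.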
