@mumichae mumichae commented Aug 18, 2025

Describe your changes

Checklist before requesting a review

  • I have performed a self-review of my code

  • Check the correct box. Does this PR contain:

    • Breaking changes
    • New functionality
    • Major changes
    • Minor changes
    • Bug fixes
  • Proposed changes are described in the CHANGELOG.md

  • CI Tests succeed and look good!

@mumichae mumichae marked this pull request as ready for review August 29, 2025 10:05
@mumichae mumichae changed the title from "Features/ksim" to "New metric: ksim" Aug 29, 2025
@seohyonkim
Contributor

@mumichae it's good to go! Please check out whenever you have time :)

@mumichae mumichae left a comment

Looking good! If you update the metric description, it should be ready to merge

label: kSIM
summary: "The kSIM acceptance rate measures whether cells of the same pre-annotated cell type are still close to each other in the local neighborhoods after batch correction."
description: |
The kSIM acceptance rate requires ground truth cell type information and measures whether the neighbors of a cell have the same cell type as it does. If a method overcorrects the batch effects, it will have a low kSIM acceptance rate. We use the HNSW algorithm to find k-NNs (including the cell itself) for each cell i and denote the number of neighbors that have the same cell type as i as . In addition, we require at least β fraction of neighbors of cell i to have the same cell type as i in order to say cell i has a consistent neighborhood.
Collaborator Author

Could you rephrase in your own words? Also, parameters don't make much sense when the formula isn't included. You could rephrase in such a way that the formula isn't needed to understand the metric.
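
For reference, the quoted description implies a formula along the following lines. This is a sketch, assuming the acceptance rate is the fraction of cells with a consistent neighborhood. The symbols $N$ (number of cells) and $n_i$ (number of the $k$ nearest neighbors of cell $i$, itself included, that share its cell type) are labels chosen here for illustration and are not taken from the PR.

```latex
\[
\text{kSIM acceptance rate}
  = \frac{1}{N} \sum_{i=1}^{N} \mathbb{1}\!\left[ \frac{n_i}{k} \ge \beta \right]
\]
```

Read this way, $k$ sets the neighborhood size and $\beta$ sets how pure a neighborhood must be before a cell counts as accepted.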

Collaborator Author

Could you please rephrase the use of "we", since it refers to the study authors, not the Openproblems team? Passive voice would be best here.
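
A minimal sketch of the computation the quoted description walks through, assuming the acceptance rate is the fraction of cells whose neighborhood is consistent. It uses exact brute-force neighbor search with NumPy in place of HNSW, and the function name `ksim_acceptance_rate` and its arguments are hypothetical, not the component's actual API.

```python
import numpy as np


def ksim_acceptance_rate(embedding, cell_types, *, k, beta):
    """Fraction of cells for which at least a `beta` fraction of the
    k nearest neighbors (the cell itself included) share its cell type.

    Illustrative only: exact O(N^2) search stands in for HNSW, and
    duplicate points are assumed not to occur.
    """
    embedding = np.asarray(embedding, dtype=float)
    cell_types = np.asarray(cell_types)

    # Pairwise squared Euclidean distances between all cells.
    diff = embedding[:, None, :] - embedding[None, :, :]
    dist = np.einsum("ijk,ijk->ij", diff, diff)

    # Indices of the k nearest cells per row; each cell is at distance 0
    # from itself, so it is part of its own neighborhood.
    neighbors = np.argsort(dist, axis=1, kind="stable")[:, :k]

    # Count neighbors that carry the same cell type as the query cell.
    n_same = (cell_types[neighbors] == cell_types[:, None]).sum(axis=1)

    # A cell is accepted when its same-type fraction reaches beta.
    accepted = n_same / k >= beta
    return float(accepted.mean())


# Two well-separated toy clusters with hypothetical labels.
toy_embedding = [[0, 0], [0, 1], [1, 0], [10, 10], [10, 11], [11, 10]]
toy_labels = ["T", "T", "T", "B", "B", "B"]
print(ksim_acceptance_rate(toy_embedding, toy_labels, k=3, beta=1.0))
```

With cleanly separated cell types every neighborhood is pure, so the rate is at its maximum. Mixing labels within a cluster, as overcorrection would, lowers it.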


* Added `metrics/kbet_pg` and `metrics/kbet_pg_label` components (PR #52).

* Added `metircs/ksim` component (PR #75).
Collaborator Author

Suggested change
- * Added `metircs/ksim` component (PR #75).
+ * Added `metrics/ksim` component (PR #75).

4 participants