Adding the ability to specify classes to filter user-skill calculation #796

AgentM-GEG · 2024-10-29T13:59:53Z

Context: The current version of the user-skill calculation is done on ALL detected classes present within a task (either the mean skill or that skill for all classes be above a certain skill_threshold). This creates a situation where, for a task with large number of classes OR imbalanced datasets, the user has to see at least N images per class before they get a chance to even be considered for leveling up.

Motivation: Research teams should be given the opportunity to provide specific classes using which they can judge the leveling up decision.

This PR:

The user_skill_reducer function now takes in focus_classes argument (default: None).
The solution involved a conditional statement saying if focus_classes are provided (e.g., ['square', 'triangle']), then compute the mean_skill, null_removed_classes, and null_removed_class_counts on these subset classes (instead of everything).
As such, the output still contains the entire confusion matrix (for all classes), but, the mean skill is computed on user-specified classes only.
A refactoring on lines 87-89 were done as this block of code is just repeated between if binary... else: ... statement, with the only difference being the null_class='False' in the binary case.

An example caeasar config looks as such: .../reducers/user_skill_reducer?mode='one-to-one'&count_threshold=5&focus_classes=['1', '2']&strategy='all'&skill_threshold=0.2

AgentM-GEG · 2024-10-29T14:08:38Z

tagging @ramanakumars as well for visibility and crosschecking.

lcjohnso · 2024-10-29T14:08:45Z

Hi @CKrawczyk -- Would you mind reviewing this PR?

CKrawczyk

All changes look reasonable to me. It might be worth adding a new test to https://github.com/zooniverse/aggregation-for-caesar/blob/master/panoptes_aggregation/tests/reducer_tests/test_user_skill_reducer.py but this is not a blocker.

AgentM-GEG · 2024-11-07T21:23:14Z

panoptes_aggregation/reducers/reducer_wrapper.py

lines 60-64 reflect a slightly different treatment of the focus_classes keyword argument input so that the tests can ingest this info.

AgentM-GEG · 2024-11-07T21:24:46Z

@CKrawczyk I added a test for the focus_classes behavior and pushed those changes. I also changed a little bit of the reducer_wrapper code where the focus_classes argument is being parsed appropriately. Let me know how these changes look.

CKrawczyk

Looks good. Let me know when you are happy to have it merged.

AgentM-GEG · 2024-11-08T12:51:18Z

@CKrawczyk @lcjohnso , thank you! I am happy for it to be merged whenever works for either/both of you.

adding ability to specify classes for Uskill calc

f86c56b

AgentM-GEG assigned lcjohnso and CKrawczyk Oct 29, 2024

lcjohnso requested a review from CKrawczyk October 29, 2024 14:08

CKrawczyk approved these changes Oct 30, 2024

View reviewed changes

adding tests for user skill reducer focus classes

217d903

AgentM-GEG commented Nov 7, 2024

View reviewed changes

lcjohnso requested a review from CKrawczyk November 7, 2024 22:15

CKrawczyk approved these changes Nov 8, 2024

View reviewed changes

CKrawczyk merged commit c7505ab into zooniverse:master Nov 11, 2024
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding the ability to specify classes to filter user-skill calculation #796

Adding the ability to specify classes to filter user-skill calculation #796

AgentM-GEG commented Oct 29, 2024

AgentM-GEG commented Oct 29, 2024

lcjohnso commented Oct 29, 2024

CKrawczyk left a comment

AgentM-GEG Nov 7, 2024

AgentM-GEG commented Nov 7, 2024

CKrawczyk left a comment

AgentM-GEG commented Nov 8, 2024

Adding the ability to specify classes to filter user-skill calculation #796

Adding the ability to specify classes to filter user-skill calculation #796

Conversation

AgentM-GEG commented Oct 29, 2024

AgentM-GEG commented Oct 29, 2024

lcjohnso commented Oct 29, 2024

CKrawczyk left a comment

Choose a reason for hiding this comment

AgentM-GEG Nov 7, 2024

Choose a reason for hiding this comment

AgentM-GEG commented Nov 7, 2024

CKrawczyk left a comment

Choose a reason for hiding this comment

AgentM-GEG commented Nov 8, 2024