Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Verify upstream labels #129

Open
Tracked by #54
bfhealy opened this issue Oct 17, 2022 · 3 comments
Open
Tracked by #54

Verify upstream labels #129

bfhealy opened this issue Oct 17, 2022 · 3 comments
Assignees

Comments

@bfhealy
Copy link
Collaborator

bfhealy commented Oct 17, 2022

On Fritz, the sitewide and scope phenomenological taxonomies have hierarchical connections that are not always maintained in the golden dataset. For example, 93 objects are labeled as periodic but not variable. While that is a small number compared to the ~104 variable sources, for some classes it may be a more substantial fraction. Verifying the upstream labels may improve training results and maintain consistency going forward.

@bfhealy
Copy link
Collaborator Author

bfhealy commented Oct 21, 2022

The most common missing upstream labels are as follows:

  • periodic (3374)
    • Most of the sources missing the periodic label have the long timescale (2936) or wrong period (402) labels.
  • irregular (450)
    • Mostly due to flaring (447) sources
  • variable (201)
    • 93 periodic sources, various others

There are also 579 non-variable labels missing, mostly from sources labeled as bogus (502). The non-variable label is less important than the ones above, since for 98.8% of sources it is simply 1 - P(variable) and does not train a binary classifier.

The periodic label has the most missing sources, but its classifier is already achieving ~95% precision and recall with the current training set. Going forward, we may want to label flaring sources as irregular variables, since it seems to be an appropriate umbrella term for both flaring and dipping stars.

@park2454
Copy link

Hi Brian, I will look into this issue.

@bfhealy
Copy link
Collaborator Author

bfhealy commented Oct 28, 2022

Thanks Sungmin!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants