
Investigate predictions on "swap" synthetic test dataset #80

Open
SteveOv opened this issue Aug 12, 2024 · 6 comments
Labels: question (Further information is requested)
SteveOv commented Aug 12, 2024

Investigate the phenomenon shown below, which occurs when we test against a synthetic test dataset in which instances are swapped if the original secondary eclipse is found to be deeper.

[Figures: predictions vs labels on synthetic-mist-tess-dataset-swap10; left: all instances, right: transiting instances]
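The swap criterion described above can be sketched as follows. The field names and ratio definitions here (`depth_pri`, `depth_sec`, `k` = R_B/R_A, `J` = S_B/S_A) are assumptions for illustration, not the project's actual schema:

```python
def swap_if_secondary_deeper(inst: dict) -> dict:
    """Swap the components when the original secondary eclipse is deeper,
    so the deeper eclipse is always treated as the primary.

    Field names are illustrative, not the project's actual schema.
    """
    inst = dict(inst)          # work on a copy, don't mutate the caller's
    inst["swapped"] = False
    if inst["depth_sec"] > inst["depth_pri"]:
        # the deeper (old secondary) eclipse becomes the new primary
        inst["depth_pri"], inst["depth_sec"] = inst["depth_sec"], inst["depth_pri"]
        # exchanging stars A and B inverts the ratio labels
        inst["k"] = 1.0 / inst["k"]    # assumed k = R_B / R_A
        inst["J"] = 1.0 / inst["J"]    # assumed J = S_B / S_A
        inst["swapped"] = True
    return inst
```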
SteveOv commented Aug 12, 2024

Added a swapped field to the saved CSVs in commit 81a257d.
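Writing the flag out might look something like the sketch below (the rows and column names are illustrative; the real CSVs in commit 81a257d have many more fields):

```python
import csv
import io

# Illustrative rows; the real CSVs carry the full set of labels per instance.
rows = [
    {"inst": "A", "depth_pri": 0.30, "depth_sec": 0.10},
    {"inst": "B", "depth_pri": 0.10, "depth_sec": 0.30},
]
# Flag instances whose original secondary eclipse was the deeper one
for row in rows:
    row["swapped"] = row["depth_sec"] > row["depth_pri"]

buf = io.StringIO()  # stands in for the saved CSV file
writer = csv.DictWriter(buf, fieldnames=["inst", "depth_pri", "depth_sec", "swapped"])
writer.writeheader()
writer.writerows(rows)
```

The flag then makes it cheap to slice the prediction plots by swapped/non-swapped, as in the next comment.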

SteveOv commented Aug 12, 2024

Can confirm that this is definitely down to the instances that have been swapped.

[Figures: predictions vs labels on synthetic-mist-tess-dataset-swap10; left: swapped instances, right: non-swapped instances]

SteveOv commented Aug 12, 2024

By combining the swapped and transiting criteria, we see that the majority of the poor predictions are both swapped and transiting.

[Figures: predictions vs labels; left: swapped & transiting instances, right: swapped & non-transiting instances]

SteveOv commented Aug 12, 2024

Doesn't completely resolve the issue, but I've found that I've been handling the change in bP and bS incorrectly when swapping the components. These need to be recalculated, as they must relate to the newly assigned star A.
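For the record, one way this recalculation could look. It assumes the common normalization b = (a cos i / R_A)(1 - e²)/(1 ± e sin ω), i.e. both impact parameters are scaled by the radius of star A; whether that matches the repo's convention is an assumption, as is taking k = R_B/R_A from before the swap:

```python
def recompute_impact_params(bP: float, bS: float, k: float) -> tuple:
    """Re-normalize the impact parameters after swapping components.

    Assumes bP and bS are defined relative to the radius of star A,
    e.g. b = (a cos i / R_A) (1 - e^2) / (1 +/- e sin omega).
    If the swap makes the old star B the new star A, the normalizing
    radius grows by the old ratio k = R_B / R_A, so both values scale
    by 1/k. The primary/secondary roles also exchange, because the old
    secondary eclipse becomes the new primary (omega shifts by pi).
    """
    new_bP = bS / k   # old secondary eclipse is now the primary
    new_bS = bP / k
    return new_bP, new_bS
```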

SteveOv commented Aug 16, 2024

Tried training models on datasets with swap enabled. Invariably, this improved the results on the synthetic test set with swap enabled at the expense of the set without it; however, the net result was significantly worse predictions overall.

The following shows the predictions for k against both the "swapped" and "non-swapped" synthetic-mist-tess-datasets, with a model trained on a 100k dataset containing swapped instances (without additional restrictions on k, J or qphot):

[Figures: k predictions; left: synthetic test dataset with swap, right: synthetic test dataset without swap]

For the "swap" model (trained on 100k train/val instances with swap):

| test dataset | all instances | transiting | non-transiting |
| --- | --- | --- | --- |
| synth test dataset with swap (k<=10) | 0.060 041 | 0.124 873 | 0.042 900 |
| synth test dataset without swap | 0.061 276 | 0.092 278 | 0.053 035 |
| formal test dataset (effectively with swap) | 0.064 847 | 0.107 058 | 0.0530122 |

For the control model (trained on 100k train/val instances without swap):

| test dataset | all instances | transiting | non-transiting |
| --- | --- | --- | --- |
| synth test dataset with swap (k<=10) | 0.071 529 | 0.179 780 | 0.042 909 |
| synth test dataset without swap | 0.040 515 | 0.063 774 | 0.034 333 |
| formal test dataset (effectively with swap) | 0.050 801 | 0.077 074 | 0.043 503 |
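A sketch of how per-subset figures like those in the tables above could be produced, assuming a simple mean-absolute-error metric over the k predictions split by the transiting flag (the project's actual metric and masking may differ):

```python
import numpy as np

def summarize_k_error(pred_k, label_k, transiting):
    """Mean absolute error of k predictions: overall and split by the
    transiting flag. A sketch only; the repo's real metric may differ.
    """
    pred_k = np.asarray(pred_k, dtype=float)
    label_k = np.asarray(label_k, dtype=float)
    transiting = np.asarray(transiting, dtype=bool)
    err = np.abs(pred_k - label_k)
    return {
        "all": err.mean(),
        "transiting": err[transiting].mean(),
        "non_transiting": err[~transiting].mean(),
    }
```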

SteveOv commented Oct 9, 2024

Returning to this with models trained with the mags feature centred on the midpoint between the eclipses, and with roll (augmentation) <= 512 bins.
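A minimal sketch of that roll augmentation, assuming it shifts the phase-folded mags vector by a random number of bins capped at 512 with wrap-around (the actual shift distribution and wrap handling in the repo are assumptions here):

```python
import numpy as np

rng = np.random.default_rng(42)

def roll_augment(mags: np.ndarray, max_shift: int = 512) -> np.ndarray:
    """Augment a phase-folded mags vector by rolling it a random number
    of bins in [-max_shift, max_shift], wrapping at the ends.
    A sketch; the repo's implementation details may differ.
    """
    shift = int(rng.integers(-max_shift, max_shift + 1))
    return np.roll(mags, shift)
```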
