Make data handling during evaluation more explicit #88

alexzwanenburg · 2024-12-06T08:11:12Z

During evaluation, models are assessed on three categories of data: external validation, internal validation and development. Within an ensemble of models, each model may have different development and internal validation data. This is currently handled correctly, but the mechanism is obscure: it depends on data not being loaded at prediction time, and on decisions then being made based on data_ids and run_ids. In other words, the behaviour is baked into the deepest code layers, yet is directed only indirectly from high-level code.

To make this behaviour more transparent and testable, it should be steerable from the code that generates and configures the evaluation task. This is enabled in v2.0.0 by the switch from a purely functional system to a task-based system.
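As a rough illustration of what "steerable from the task" could mean, the sketch below lets each evaluation task state its data category and the samples each model should see, instead of leaving lower layers to infer this from identifiers. It is written in Python purely for illustration and is not the project's code: `DataCategory`, `ModelDataAssignment`, `EvaluationTask` and `build_tasks` are hypothetical names, and the real task system may be organised quite differently.

```python
from dataclasses import dataclass
from enum import Enum


class DataCategory(Enum):
    """The three data categories named in the issue."""
    DEVELOPMENT = "development"
    INTERNAL_VALIDATION = "internal_validation"
    EXTERNAL_VALIDATION = "external_validation"


@dataclass(frozen=True)
class ModelDataAssignment:
    """Hypothetical record: the samples one model uses for one category."""
    model_id: str
    category: DataCategory
    sample_ids: tuple


@dataclass
class EvaluationTask:
    """Hypothetical task that carries its data selection explicitly."""
    category: DataCategory
    assignments: list

    def samples_for(self, model_id):
        # Look up the samples for a model, rather than inferring them
        # deep in the prediction code.
        for assignment in self.assignments:
            if assignment.model_id == model_id:
                return assignment.sample_ids
        raise KeyError(f"no {self.category.value} data for {model_id}")


def build_tasks(splits):
    """Create one task per data category that has any data.

    `splits` maps model_id -> {DataCategory: iterable of sample ids}.
    """
    tasks = []
    for category in DataCategory:
        assignments = [
            ModelDataAssignment(model_id, category, tuple(by_category[category]))
            for model_id, by_category in splits.items()
            if category in by_category
        ]
        if assignments:
            tasks.append(EvaluationTask(category, assignments))
    return tasks


if __name__ == "__main__":
    # Two ensemble members with different development and validation data.
    example_splits = {
        "model_1": {
            DataCategory.DEVELOPMENT: ["a", "b"],
            DataCategory.INTERNAL_VALIDATION: ["c"],
        },
        "model_2": {
            DataCategory.DEVELOPMENT: ["b", "c"],
            DataCategory.INTERNAL_VALIDATION: ["a"],
        },
    }
    for task in build_tasks(example_splits):
        print(task.category.value, task.samples_for("model_2"))
```

Because the selection lives on the task, a test can assert directly which samples a given model receives for a given category, without running any prediction code.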

alexzwanenburg · 2024-12-06T17:08:53Z

Added a delayedDataObject class to start distinguishing between dataObject, which can be processed directly, and delayedDataObject, which is intended to populate a dataObject using its associated attributes and the backend. This allows the methods and attributes related to loading data, which are currently associated with dataObject, to be separated out.
Rework methods for dataObject that deal with loading data and move these to delayedDataObject.
Use delayedDataObject to contextualise data processing within evaluation steps.
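The split between the two classes can be pictured with the minimal sketch below. It is an illustrative Python analogue only, not the project's implementation: the class names mirror dataObject and delayedDataObject, but the attributes, the `load` method, the `as_data_object` helper and the callable `backend` are all assumptions made for the example.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class DataObject:
    """Data that is already loaded and can be processed directly."""
    rows: list


@dataclass
class DelayedDataObject:
    """Placeholder that holds what is needed to load data later.

    `backend` is a hypothetical callable that maps sample ids to rows.
    """
    sample_ids: tuple
    backend: Callable

    def load(self) -> DataObject:
        # All loading logic lives here, not on DataObject.
        return DataObject(rows=list(self.backend(self.sample_ids)))


def as_data_object(data):
    """Return a processable DataObject, loading it first if delayed."""
    if isinstance(data, DelayedDataObject):
        return data.load()
    return data


if __name__ == "__main__":
    # A stand-in backend: an in-memory table keyed by sample id.
    table = {"a": [1.0, 2.0], "b": [3.0, 4.0], "c": [5.0, 6.0]}

    def fetch(ids):
        return [table[i] for i in ids]

    delayed = DelayedDataObject(sample_ids=("a", "c"), backend=fetch)
    print(as_data_object(delayed).rows)
```

Keeping the loading logic on the delayed class means that code which receives a plain DataObject never has to decide whether, or which, data should be fetched.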

alexzwanenburg added the enhancement New feature or request label Dec 6, 2024

alexzwanenburg self-assigned this Dec 6, 2024

alexzwanenburg added a commit that referenced this issue Dec 6, 2024

WIP on #88

c55cd72
