Add support for virtual env directory flag #611

LennartKloppenburg · 2023-10-18T13:35:06Z

Description

Added virtualenv_dir as an option to ExecutionConfig which is then propagated downstream to DbtVirtualenvBaseOperator.

The following now happens:

If the flag is set, the operator will attempt to locate the venv's python binary under the provided virtualenv_dir.
- If so, it will conclude that the venv exists and continues without creating a new one.
- If not, it will create a new one at virtualenv_dir
If the flag is not set, simply continue using the temporary directory solution that was already in place.

Impact

A very basic test using a local docker compose set-up as per the contribution guide and the example_virtualenv DAG saw the DAG's runtime go down from 2m31s to just 32s. I'd this improvement to be even more noticeable with more complex graphs and more python requirements.

Related Issue(s)

Implements #610

Breaking Change?

None, the flag is optional and is ignored (with a warning) when used outside of VirtualEnv execution mode.

Checklist

I have made corresponding changes to the documentation (if required)
I have added tests that prove my fix is effective or that my feature works

netlify · 2023-10-18T13:35:12Z

👷 Deploy Preview for amazing-pothos-a3bca0 processing.

Name	Link
🔨 Latest commit	`be0de1a`
🔍 Latest deploy log	https://app.netlify.com/sites/amazing-pothos-a3bca0/deploys/6582c61c78d17900084dc3d7

codecov · 2023-10-18T15:50:58Z

Codecov Report

Attention: Patch coverage is 83.67347% with 8 lines in your changes are missing coverage. Please review.

Project coverage is 93.06%. Comparing base (090116e) to head (87c5da0).
Report is 109 commits behind head on main.

❗ Current head 87c5da0 differs from pull request most recent head be0de1a

Please upload reports for the commit be0de1a to get more accurate results.

Files	Patch %	Lines
cosmos/operators/virtualenv.py	81.39%	8 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #611      +/-   ##
==========================================
- Coverage   93.28%   93.06%   -0.23%     
==========================================
  Files          55       54       -1     
  Lines        2502     2163     -339     
==========================================
- Hits         2334     2013     -321     
+ Misses        168      150      -18

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tatiana

Thank you very much for creating this PR so quickly, @LennartKloppenburg ! This is looking very good.

I added some comments in line, and I have a gut feeling we may need to add some additional tests to cover the possible behaviours in _get_or_create_venv_py_interpreter.

We can aim to release this change as part of 1.2.1 (if we consider it a bugfix) or 1.3 (if we consider it a new feature) 🎉

cosmos/config.py

cosmos/converter.py

tatiana · 2023-10-19T09:45:41Z

cosmos/operators/virtualenv.py

+            self.log.info(f"Checking for venv interpreter: {py_interpreter_path} : {py_interpreter_path.is_file()}")
+            if py_interpreter_path.is_file():
+                self.log.info(f"Found Python interpreter in cached virtualenv: `{str(py_interpreter_path)}`")
+                return str(py_interpreter_path)


In this case, does it still make sense to install any potential dependencies/update them - if there were requirement changes?

Yeah ideally we'd be able to clean up the virtual env after the DAG run, but for the reasons you mentioned before this can be tricky.
One way to "perhaps" invalidate the virtualenv is to check when it was created and, after say 24 hours or 48 or so, have this operator clean it up and recreate it?

The time-based approach could lead to some strange scenarios and be tricky to troubleshoot. How feasible would be for us to run a pip install in an existing virtualenv? It should be very quick if it was previously setup, and it would make the operator reliable.

Regarding the cleanup - I know - ideally we'd be able to set the venv only once during the DAG setup and delete during tear down. Unfortunately - to my knowledge - even the latest Airflow (2.7) does not allow us to have a setup/tear down per worker node during the lifecycle of a DAG. But this can be an improvement for the future - in a separate PR!

Sorry for the late response here!

so the underlying prepare_virtualenv that we are "avoiding" after determining it's already there is imported from Airflow core (airflow.utils.python_virtualenv). That little helper also takes into account the python requirements so if we bypass this helper, we can't inject requirements unless we repeat the logic over here:

... pip_cmd = None if requirements is not None and len(requirements) != 0: pip_cmd = _generate_pip_install_cmd_from_list(venv_directory, requirements, pip_install_options) if requirements_file_path is not None and requirements_file_path: pip_cmd = _generate_pip_install_cmd_from_file( venv_directory, requirements_file_path, pip_install_options ) if pip_cmd: execute_in_subprocess(pip_cmd) ...

What do you think?

In our case, we probably would only need to do part of the logic:

if requirements is not None and len(requirements) != 0: pip_cmd = _generate_pip_install_cmd_from_list(venv_directory, requirements, pip_install_options)

Since we don't support requirements_file_path.

If we don't want to add this call unnecessarily, we'd probably need a pip freeze call - to confirm if the desired dependencies are already installed, which may be more work.

We probably need one or both of these. Otherwise, we're at the risk of an Airflow worker having partial/outdated dependencies that are incompatible with the dependencies the user requested.

I'm in favour of us caching for performance reasons, but we still should aim to have the task being idempotent.

dev/dags/example_virtualenv.py

tatiana · 2023-10-19T09:52:34Z

cosmos/operators/virtualenv.py

        **kwargs: Any,
    ) -> None:
        self.py_requirements = py_requirements or []
        self.py_system_site_packages = py_system_site_packages
        super().__init__(**kwargs)
+        self._venv_dir = virtualenv_dir
        self._venv_tmp_dir: None | TemporaryDirectory[str] = None

    @cached_property


A general thought: do we still want to cache this property? Is there any risk that we could end up caching the incorrect path?

How is this property cached? If people are debugging or want to pass in more dynamically configured directories, I don't know how this decorator behaves :) Is it per task_id per dagrun_id or is it more persistent?

The property is cached while the Python process is alive.

LennartKloppenburg · 2023-10-23T13:33:43Z

@tatiana I've updated the PR with some changes you've requested :)

One lingering issue:
When there's no virtual env handy and multiple tasks are executed concurrently, they will all attempt the same checks (checking for the dir, for the virtual env, to install stuff etc.) which leads to all kinds of issues because they don't automatically "wait" for the venv to be created by one single operator. This occurs in environments that schedule multiple DBT tasks simultaneously -- which is obviously very common.

The issue can then be resolved by retrying these tasks with some retry_delay, which will achieve the same result as "waiting" for the virtual env to be provisioned would have done.
What do you think of this?

If execution_config was reused, Cosmos 1.2.2 would raise: ``` astronomer-cosmos/dags/basic_cosmos_task_group.py Traceback (most recent call last): File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/venv-38/lib/python3.8/site-packages/airflow/models/dagbag.py", line 343, in parse loader.exec_module(new_module) File "<frozen importlib._bootstrap_external>", line 848, in exec_module File "<frozen importlib._bootstrap>", line 219, in _call_with_frames_removed File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/dags/basic_cosmos_task_group.py", line 74, in <module> basic_cosmos_task_group() File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/venv-38/lib/python3.8/site-packages/airflow/models/dag.py", line 3817, in factory f(**f_kwargs) File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/dags/basic_cosmos_task_group.py", line 54, in basic_cosmos_task_group orders = DbtTaskGroup( File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/cosmos/airflow/task_group.py", line 26, in __init__ DbtToAirflowConverter.__init__(self, *args, **specific_kwargs(**kwargs)) File "/Users/tati/Code/cosmos-clean/astronomer-cosmos/cosmos/converter.py", line 113, in __init__ raise CosmosValueError( cosmos.exceptions.CosmosValueError: ProjectConfig.dbt_project_path is mutually exclusive with RenderConfig.dbt_project_path and ExecutionConfig.dbt_project_path.If using RenderConfig.dbt_project_path or ExecutionConfig.dbt_project_path, ProjectConfig.dbt_project_path should be None ``` This has been raised by an Astro customer and our field engineer, who tried to run: https://github.com/astronomer/cosmos-demo

LennartKloppenburg · 2023-12-17T18:13:59Z

@tatiana Just completed the rebase, saw some artifacts that trip up the tests, will look at those tomorrow :) !

LennartKloppenburg · 2023-12-20T11:38:22Z

@tatiana
I could use two more eyes on this (from the CI/CD tests):

FAILED tests/dbt/test_graph.py::test_load_via_dbt_ls_project_config_env_vars - cosmos.dbt.graph.CosmosLoadDbtException: Unable to find the dbt executable: dbt
FAILED tests/dbt/test_graph.py::test_load_via_dbt_ls_project_config_dbt_vars - cosmos.dbt.graph.CosmosLoadDbtException: Unable to find the dbt executable: dbt
FAILED tests/dbt/test_graph.py::test_load_via_dbt_ls_render_config_selector_arg_is_used - cosmos.dbt.graph.CosmosLoadDbtException: Unable to find the dbt executable: dbt

When I run the tests locally they pass, maybe I missed something while rebasing? I rebased so much that I no longer know where it was introduced :D
Thanks in advance!!

tatiana · 2024-05-10T15:14:54Z

Hi @LennartKloppenburg ! I'm sorry for the massive delay, I've been working on other projects and it has been hard to keep up with everything. I'm planning to get back to this PR next week, so we can try to release it as part of Cosmos 1.5

tatiana · 2024-07-05T09:11:20Z

Hi @LennartKloppenburg ! I'm very sorry for the very long delay.
I solved all the conflicts, and all the tests seem to be passing in #1079 - which is a copy of your PR, with the additional changes:

Rebase
Resolve conflicts
Fix tests

If you are happy with the proposed changes, please feel free to incorporate them into your PR.
I'd like us to merge this to have alpha versions and validate this change before the 1.6 release.

@LennartKloppenburg

## Description Added `virtualenv_dir` as an option to `ExecutionConfig` which is then propagated downstream to `DbtVirtualenvBaseOperator`. The following now happens: - If the flag is set, the operator will attempt to locate the `venv`'s `python` binary under the provided `virtualenv_dir`. - If so, it will conclude that the `venv` exists and continues without creating a new one. - If not, it will create a new one at `virtualenv_dir` - If the flag is not set, simply continue using the temporary directory solution that was already in place. ## Impact A very basic test using a local `docker compose` set-up as per the contribution guide and the [example_virtualenv](https://github.com/astronomer/astronomer-cosmos/blob/main/dev/dags/example_virtualenv.py) DAG saw the DAG's runtime go down from **2m31s** to just **32s**. I'd this improvement to be even more noticeable with more complex graphs and more python requirements. ## Related Issue(s) Closes: #610 Partially solves: #1042 Follow up ticket: #1157 ## Breaking Change? None, the flag is optional and is ignored (with a [warning](https://github.com/astronomer/astronomer-cosmos/compare/main...LennartKloppenburg:astronomer-cosmos:feature/cache-virtualenv?expand=1#diff-61b585fb903927b6868b9626c95e0ec47e3818eb477d795ebd13b0276d4fd76cR125)) when used outside of `VirtualEnv` execution mode. ## Important notice Most of the changes in this PR were originally implemented in PR #611 by @LennartKloppenburg. It became stale over the last few months due to limited maintainer availability. Our sincere apologies to the original author. What was accomplished since: 1. Rebased 2. Fixed conflicts 3. Fixed failing tests 4. Introduced new tests Co-authored-by: Lennart Kloppenburg <[email protected]>

tatiana · 2024-08-16T11:14:38Z

We took this to completion in #1079, giving the credits to @LennartKloppenburg and this original PR

LennartKloppenburg had a problem deploying to external October 18, 2023 13:35 — with GitHub Actions Error

pre-commit-ci bot had a problem deploying to external October 18, 2023 13:36 Error

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from 64c4b93 to 0364ea3 Compare October 18, 2023 13:42

LennartKloppenburg had a problem deploying to external October 18, 2023 13:42 — with GitHub Actions Error

pre-commit-ci bot had a problem deploying to external October 18, 2023 13:43 Error

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from ae7d1f5 to 9b94cec Compare October 18, 2023 13:58

LennartKloppenburg had a problem deploying to external October 18, 2023 13:58 — with GitHub Actions Error

pre-commit-ci bot had a problem deploying to external October 18, 2023 13:58 Error

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from 95cb840 to 9b0e6e3 Compare October 18, 2023 14:03

LennartKloppenburg had a problem deploying to external October 18, 2023 14:04 — with GitHub Actions Error

pre-commit-ci bot had a problem deploying to external October 18, 2023 14:06 Error

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from 06be330 to 26032c9 Compare October 18, 2023 14:09

LennartKloppenburg had a problem deploying to external October 18, 2023 14:09 — with GitHub Actions Error

pre-commit-ci bot temporarily deployed to external October 18, 2023 14:10 Inactive

LennartKloppenburg marked this pull request as ready for review October 18, 2023 14:13

LennartKloppenburg requested a review from a team as a code owner October 18, 2023 14:13

LennartKloppenburg requested a review from a team October 18, 2023 14:13

tatiana temporarily deployed to external October 18, 2023 20:33 — with GitHub Actions Inactive

tatiana reviewed Oct 19, 2023

View reviewed changes

tatiana added this to the 1.3.0 milestone Oct 19, 2023

tatiana mentioned this pull request Oct 19, 2023

Add support for GKEStartPodOperator #455

Closed

tatiana temporarily deployed to external October 20, 2023 11:18 — with GitHub Actions Inactive

LennartKloppenburg had a problem deploying to external October 23, 2023 13:24 — with GitHub Actions Error

pre-commit-ci bot had a problem deploying to external October 23, 2023 13:25 Error

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from 384c7b5 to 02dbd9a Compare October 23, 2023 13:26

LennartKloppenburg had a problem deploying to external October 23, 2023 13:26 — with GitHub Actions Error

pre-commit-ci bot temporarily deployed to external October 23, 2023 13:27 Inactive

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from 53c7595 to 3a84aa7 Compare December 17, 2023 17:56

dosubot bot added the size:L This PR changes 100-499 lines, ignoring generated files. label Dec 17, 2023

LennartKloppenburg had a problem deploying to external December 17, 2023 17:56 — with GitHub Actions Error

pre-commit-ci bot had a problem deploying to external December 17, 2023 17:57 Error

Iron out locking flow

547b0af

LennartKloppenburg force-pushed the feature/cache-virtualenv branch from 62c2a7c to 547b0af Compare December 20, 2023 10:46

LennartKloppenburg had a problem deploying to external December 20, 2023 10:46 — with GitHub Actions Error

🎨 [pre-commit.ci] Auto format from pre-commit.com hooks

be0de1a

pre-commit-ci bot temporarily deployed to external December 20, 2023 10:47 Inactive

tatiana added status:awaiting-reviewer The issue/PR is awaiting for a reviewer input and removed status:awaiting-author Issue/PR is under discussion and waiting for author's input labels Jan 9, 2024

tatiana modified the milestones: 1.4.0, 1.5.0 Apr 25, 2024

tatiana self-assigned this May 10, 2024

tatiana added triage-needed Items need to be reviewed / assigned to milestone epic-assigned and removed triage-needed Items need to be reviewed / assigned to milestone labels May 17, 2024

tatiana mentioned this pull request May 17, 2024

Community Support #981

Closed

tatiana modified the milestones: Cosmos 1.5.0, Cosmos 1.6.0 Jun 27, 2024

tatiana mentioned this pull request Jul 5, 2024

Support persisting the LoadMode.VIRTUALENV directory #1079

Merged

tatiana mentioned this pull request Jul 31, 2024

Release Cosmos 1.6.0 #1103

Closed

18 tasks

tatiana closed this Aug 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for virtual env directory flag #611

Add support for virtual env directory flag #611

LennartKloppenburg commented Oct 18, 2023

netlify bot commented Oct 18, 2023 •

edited

Loading

codecov bot commented Oct 18, 2023 •

edited

Loading

tatiana left a comment

tatiana Oct 19, 2023

LennartKloppenburg Oct 19, 2023

tatiana Oct 19, 2023

LennartKloppenburg Oct 23, 2023

tatiana Oct 25, 2023

tatiana Oct 19, 2023

LennartKloppenburg Oct 23, 2023

tatiana Oct 24, 2023

LennartKloppenburg commented Oct 23, 2023

LennartKloppenburg commented Dec 17, 2023

LennartKloppenburg commented Dec 20, 2023 •

edited

Loading

tatiana commented May 10, 2024

tatiana commented Jul 5, 2024

tatiana commented Aug 16, 2024

Add support for virtual env directory flag #611

Add support for virtual env directory flag #611

Conversation

LennartKloppenburg commented Oct 18, 2023

Description

Impact

Related Issue(s)

Breaking Change?

Checklist

netlify bot commented Oct 18, 2023 • edited Loading

👷 Deploy Preview for amazing-pothos-a3bca0 processing.

codecov bot commented Oct 18, 2023 • edited Loading

Codecov Report

tatiana left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

LennartKloppenburg commented Oct 23, 2023

LennartKloppenburg commented Dec 17, 2023

LennartKloppenburg commented Dec 20, 2023 • edited Loading

tatiana commented May 10, 2024

tatiana commented Jul 5, 2024

tatiana commented Aug 16, 2024

netlify bot commented Oct 18, 2023 •

edited

Loading

codecov bot commented Oct 18, 2023 •

edited

Loading

LennartKloppenburg commented Dec 20, 2023 •

edited

Loading