
MTV-1537 | Optimise the plan scheduler #1088

Merged · 1 commit merged into kubev2v:main on Oct 7, 2024

Conversation

@mnecas (Member) commented Oct 7, 2024

Issue:
When we start a warm migration of a VM that has many disks, we wait for the whole VM to be migrated and do not exclude the disks that have already been migrated. As a result, with 2 VMs of 10 disks each where each VM has one larger disk, the whole scheduler is halted until those disks finish, so even when the remaining 9 disks per VM are done, no new migration is started.

Fix:
Subtract the finished disks from the disk count (see the sketch below).

Fixes: https://issues.redhat.com/browse/MTV-1537
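
A minimal Go sketch of the capacity check described above follows. The names (pendingVM, inFlightCost, schedule, maxInFlightDisks) are illustrative and are not the actual forklift scheduler code; the sketch only shows why subtracting finished disks frees scheduler capacity for the next VM.

```go
// Minimal sketch of the capacity check described in the fix. The names
// (pendingVM, inFlightCost, schedule, maxInFlightDisks) are illustrative,
// not the actual forklift scheduler types.
package main

import "fmt"

// pendingVM models one VM in the scheduler: DiskCount is the total number
// of disks and FinishedDisks how many of them have already been migrated.
type pendingVM struct {
	Name          string
	DiskCount     int
	FinishedDisks int
}

// inFlightCost is the number of disks a VM still occupies in the scheduler;
// finished disks no longer consume capacity.
func inFlightCost(vm pendingVM) int {
	if cost := vm.DiskCount - vm.FinishedDisks; cost > 0 {
		return cost
	}
	return 0
}

// schedule starts as many queued VMs as fit under maxInFlightDisks,
// counting only the disks that are still being transferred.
func schedule(queue, running []pendingVM, maxInFlightDisks int) []pendingVM {
	used := 0
	for _, vm := range running {
		used += inFlightCost(vm)
	}
	var started []pendingVM
	for _, vm := range queue {
		if used+inFlightCost(vm) > maxInFlightDisks {
			continue
		}
		used += inFlightCost(vm)
		started = append(started, vm)
	}
	return started
}

func main() {
	// Scenario from the description: two running VMs with 10 disks each,
	// 9 of which are already done; only the large disk is still copying.
	running := []pendingVM{
		{Name: "vm-a", DiskCount: 10, FinishedDisks: 9},
		{Name: "vm-b", DiskCount: 10, FinishedDisks: 9},
	}
	queue := []pendingVM{{Name: "vm-c", DiskCount: 10}}
	// Before the fix, the running VMs counted as 20 in-flight disks and vm-c
	// never started; with finished disks subtracted they count as 2, so vm-c
	// fits under a 20-disk limit.
	fmt.Println(schedule(queue, running, 20))
}
```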

@mnecas mnecas requested a review from yaacov as a code owner October 7, 2024 14:21

codecov-commenter commented Oct 7, 2024

⚠️ Please install the Codecov GitHub app to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 0% with 9 lines in your changes missing coverage. Please review.

Project coverage is 16.30%. Comparing base (0a30857) to head (d7f93f4).

Files with missing lines Patch % Lines
pkg/controller/plan/scheduler/vsphere/scheduler.go 0.00% 9 Missing ⚠️

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files
@@            Coverage Diff             @@
##             main    #1088      +/-   ##
==========================================
+ Coverage   16.26%   16.30%   +0.03%     
==========================================
  Files         112      112              
  Lines       19794    19802       +8     
==========================================
+ Hits         3220     3229       +9     
+ Misses      16289    16286       -3     
- Partials      285      287       +2     
Flag Coverage Δ
unittests 16.30% <0.00%> (+0.03%) ⬆️

Flags with carried forward coverage won't be shown.

☔ View full report in Codecov by Sentry.

Issue: When we start a warm migration of a VM that has many disks, we
wait for the whole VM to be migrated and do not exclude the disks that
have already been migrated. As a result, with 2 VMs of 10 disks each
where each VM has one larger disk, the whole scheduler is halted until
those disks finish, so even when the remaining 9 disks per VM are done,
no new migration is started.

Fix: Subtract the finished disks from the disk count.

Fixes: https://issues.redhat.com/browse/MTV-1537

Signed-off-by: Martin Necas <[email protected]>

sonarqubecloud bot commented Oct 7, 2024

@yaacov yaacov merged commit 6e899ea into kubev2v:main Oct 7, 2024
22 of 32 checks passed
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 8, 2024
@mnecas mnecas mentioned this pull request Oct 8, 2024
mnecas added a commit that referenced this pull request Oct 8, 2024
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 16, 2024
Issues:
[1] Allow migration of "unknown" guests
Right now we cannot migrate guests whose operating system is unknown to,
or unsupported by, virt-v2v [3].

[2] Unifying the process and potential speedup
Right now we use two different methods for the disk transfer. This means
additional engineering effort to maintain two paths and makes it harder
to debug two different flows. virt-v2v transfers the disks sequentially,
whereas with CDI we can start multiple disk imports in parallel, which
can improve migration speed (a sketch of the difference follows this
commit message).

Fix:
MTV already uses the CNV CDI for warm and remote migrations. We just
need to adjust the code to remove the virt-v2v transfer and rely on the
CNV CDI to do it for us.

Drawbacks:
- CNV CDI *requires* the VDDK, which was until now only highly
  recommended.
- CNV CDI is not maintained inside MTV, and escalating and backporting
  patches might be problematic because CNV has a different release
  cycle.
- Because we will be migrating all disks in parallel, we need to
  optimise our migration scheduler so we don't take too many
  host/network resources. I have already done some optimisations in
  [4,5,6].

Notes:
This change removes the usage of virt-v2v; we will only use
virt-v2v-in-place.

Ref:
[1] https://issues.redhat.com/browse/MTV-1536
[2] https://issues.redhat.com/browse/MTV-1581
[3] https://access.redhat.com/articles/1351473
[4] kubev2v#1088
[5] kubev2v#1087
[6] kubev2v#1086

Signed-off-by: Martin Necas <[email protected]>
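
The speedup argument in issue [2] boils down to sequential versus parallel disk imports; the schematic Go sketch below illustrates the difference. createDiskImport is a hypothetical placeholder for submitting one disk import (for example, creating a CDI DataVolume) and is not a real forklift or CDI function.

```go
// Schematic comparison of the two transfer flows described above. The helper
// createDiskImport is a hypothetical stand-in for submitting one disk import
// (e.g. creating a CDI DataVolume); it is not a real forklift or CDI call.
package main

import (
	"fmt"
	"sync"
)

// createDiskImport pretends to submit an import for a single source disk.
func createDiskImport(disk string) {
	fmt.Println("import started for", disk)
}

// importSequentially mirrors the virt-v2v style flow: one disk after another.
func importSequentially(disks []string) {
	for _, d := range disks {
		createDiskImport(d)
	}
}

// importInParallel mirrors the CDI style flow: every disk import is started
// at once and we wait for all of them to finish.
func importInParallel(disks []string) {
	var wg sync.WaitGroup
	for _, d := range disks {
		wg.Add(1)
		go func(disk string) {
			defer wg.Done()
			createDiskImport(disk)
		}(d)
	}
	wg.Wait()
}

func main() {
	disks := []string{"disk-1", "disk-2", "disk-3"}
	importSequentially(disks)
	importInParallel(disks)
}
```
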
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 16, 2024
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 17, 2024
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 17, 2024
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 17, 2024
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 17, 2024
mnecas added a commit to mnecas/forklift that referenced this pull request Oct 18, 2024