Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add recipe for pytorch (C++ interface only) #8388

Merged
merged 38 commits into from
Dec 14, 2023

Conversation

iarspider
Copy link
Contributor

@iarspider iarspider commented Mar 15, 2023

No description provided.

@cmsbuild
Copy link
Contributor

cmsbuild commented Mar 15, 2023

A new Pull Request was created by @iarspider for branch IB/CMSSW_13_1_X/master.

@cmsbuild, @smuzaffar, @aandvalenzuela, @iarspider can you please review it and eventually sign? Thanks.
@perrotta, @dpiparo, @rappoccio you are the release manager for this.
cms-bot commands are listed here

@iarspider
Copy link
Contributor Author

please test

@cmsbuild
Copy link
Contributor

Pull request #8388 was updated.

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/31301/summary.html
COMMIT: 0181b0c
CMSSW: CMSSW_13_1_X_2023-03-15-1100/el8_amd64_gcc11
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8388/31301/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/31301/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/31301/git-merge-result

Comparison Summary

Summary:

  • You potentially added 11 lines to the logs
  • Reco comparison results: 17 differences found in the comparisons
  • DQMHistoTests: Total files compared: 49
  • DQMHistoTests: Total histograms compared: 3550756
  • DQMHistoTests: Total failures: 170
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3550564
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 309.217 KiB( 48 files compared)
  • DQMHistoSizes: changed ( 1001.0 ): 309.217 KiB AlCaReco/SiStripHitEfficiency
  • Checked 213 log files, 164 edm output root files, 49 DQM output files
  • TriggerResults: no differences found

@cmsbuild
Copy link
Contributor

Pull request #8388 was updated.

@iarspider
Copy link
Contributor Author

please test for el8_ppc64le_gcc11

@cmsbuild
Copy link
Contributor

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/31307/summary.html
COMMIT: 317acbd
CMSSW: CMSSW_13_1_X_2023-03-15-2300/el8_ppc64le_gcc11
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8388/31307/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found compilation error when building:

+ tar xfz '/scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/SOURCES/external/py3-torch/1.13.1-c52873272d6ed54dadebd1dc97e7c1eb/%{pkgsource}'
tar (child): /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/SOURCES/external/py3-torch/1.13.1-c52873272d6ed54dadebd1dc97e7c1eb/%{pkgsource}: Cannot open: No such file or directory
tar (child): Error is not recoverable: exiting now
tar: Child returned status 2
tar: Error is not recoverable: exiting now
error: Bad exit status from /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.9nIz8V (%build)


RPM build errors:
line 37: It's not recommended to have unversioned Obsoletes: Obsoletes: external+py3-torch+1.13.1-c52873272d6ed54dadebd1dc97e7c1eb
Bad exit status from /scratch/cmsbuild/jenkins_a/workspace/ib-run-pr-tests/testBuildDir/tmp/rpm-tmp.9nIz8V (%build)


@iarspider
Copy link
Contributor Author

please test

@smuzaffar
Copy link
Contributor

please test for el9_amd64_gcc13

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 4, 2023

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36292/summary.html
COMMIT: cbd56e4
CMSSW: CMSSW_14_0_X_2023-12-03-2300/el8_aarch64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8388/36292/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36292/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36292/git-merge-result

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 4, 2023

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36290/summary.html
COMMIT: cbd56e4
CMSSW: CMSSW_14_0_X_2023-12-03-2300/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8388/36290/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36290/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36290/git-merge-result

Comparison Summary

Summary:

  • You potentially added 75 lines to the logs
  • Reco comparison results: 16 differences found in the comparisons
  • DQMHistoTests: Total files compared: 50
  • DQMHistoTests: Total histograms compared: 3370032
  • DQMHistoTests: Total failures: 81
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3369929
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 49 files compared)
  • Checked 214 log files, 167 edm output root files, 50 DQM output files
  • TriggerResults: no differences found

GPU Comparison Summary

Summary:

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 4, 2023

-1

Failed Tests: Build HeaderConsistency
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36306/summary.html
COMMIT: cbd56e4
CMSSW: CMSSW_14_0_X_2023-12-01-2300/el9_amd64_gcc13
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/8388/36306/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36306/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36306/git-merge-result

Build

I found compilation error when building:

      |                                           ~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el9_amd64_gcc13/external/pytorch/2.1.1-815fe07924977ebb6c7bffd29d80d4f3/include/torch/csrc/api/include/torch/nn/modules/container/sequential.h:122:52: note: remove 'std::move' call
>> Building binary testTorchTimeSeries
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el9_amd64_gcc13/external/gcc/13.2.0-1b0a3367d04f48f01ad3ccf40e55475c/bin/../lib/gcc/x86_64-redhat-linux-gnu/13.2.0/../../../../x86_64-redhat-linux-gnu/bin/ld.bfd: cannot find -ltorch_cuda: No such file or directory
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/el9_amd64_gcc13/external/gcc/13.2.0-1b0a3367d04f48f01ad3ccf40e55475c/bin/../lib/gcc/x86_64-redhat-linux-gnu/13.2.0/../../../../x86_64-redhat-linux-gnu/bin/ld.bfd: cannot find -lc10_cuda: No such file or directory
collect2: error: ld returned 1 exit status
>> Deleted: tmp/el9_amd64_gcc13/src/PhysicsTools/PythonAnalysis/test/testTorchTimeSeries/testTorchTimeSeries
gmake: *** [tmp/el9_amd64_gcc13/src/PhysicsTools/PythonAnalysis/test/testTorchTimeSeries/testTorchTimeSeries] Error 1
>> Compiling  /data/cmsbld/jenkins/workspace/ib-run-pr-tests/CMSSW_14_0_X_2023-12-01-2300/src/PhysicsTools/PythonAnalysis/test/test_PyMVA.cpp
>> Building binary test_PyMVA
Copying tmp/el9_amd64_gcc13/src/PhysicsTools/PythonAnalysis/test/test_PyMVA/test_PyMVA to productstore area:


@iarspider
Copy link
Contributor Author

please test

@cmsbuild
Copy link
Contributor

cmsbuild commented Dec 7, 2023

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36368/summary.html
COMMIT: cbd56e4
CMSSW: CMSSW_14_0_X_2023-12-07-1100/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/8388/36368/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • You potentially added 431 lines to the logs
  • Reco comparison results: 1 differences found in the comparisons
  • DQMHistoTests: Total files compared: 50
  • DQMHistoTests: Total histograms compared: 3430794
  • DQMHistoTests: Total failures: 73
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3430699
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 49 files compared)
  • Checked 214 log files, 167 edm output root files, 50 DQM output files
  • TriggerResults: no differences found

GPU Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 32 differences found in the comparisons
  • DQMHistoTests: Total files compared: 3
  • DQMHistoTests: Total histograms compared: 39740
  • DQMHistoTests: Total failures: 746
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 38994
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 2 files compared)
  • Checked 8 log files, 10 edm output root files, 3 DQM output files
  • TriggerResults: no differences found

@tvami
Copy link

tvami commented Dec 13, 2023

hi @iarspider do you think it would be helpful to squash to commits? Is there anything more missing or should we merge after the squashing is done?

@smuzaffar
Copy link
Contributor

please test

lets refresh the tests
@tvami , sure I will squash merge this PR

@cmsbuild
Copy link
Contributor

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36490/summary.html
COMMIT: cbd56e4
CMSSW: CMSSW_14_0_X_2023-12-13-2300/el8_amd64_gcc12
Additional Tests: GPU
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/8388/36490/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36490/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-d4402c/36490/git-merge-result

Comparison Summary

There are some workflows for which there are errors in the baseline:
10024.3 step 3
10024.4 step 3
2017.13 step 1
2018.13 step 1
The results for the comparisons for these workflows could be incomplete
This means most likely that the IB is having errors in the relvals.The error does NOT come from this pull request

Summary:

  • You potentially added 83 lines to the logs
  • Reco comparison results: 7 differences found in the comparisons
  • DQMHistoTests: Total files compared: 50
  • DQMHistoTests: Total histograms compared: 3429858
  • DQMHistoTests: Total failures: 6
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 3429830
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 49 files compared)
  • Checked 214 log files, 167 edm output root files, 50 DQM output files
  • TriggerResults: no differences found

GPU Comparison Summary

Summary:

  • No significant changes to the logs found
  • Reco comparison results: 35 differences found in the comparisons
  • DQMHistoTests: Total files compared: 3
  • DQMHistoTests: Total histograms compared: 39740
  • DQMHistoTests: Total failures: 1156
  • DQMHistoTests: Total nulls: 0
  • DQMHistoTests: Total successes: 38584
  • DQMHistoTests: Total skipped: 0
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.0 KiB( 2 files compared)
  • Checked 8 log files, 10 edm output root files, 3 DQM output files
  • TriggerResults: no differences found

@tvami
Copy link

tvami commented Dec 14, 2023

+analysis

@smuzaffar
Copy link
Contributor

+externals

@cmsbuild
Copy link
Contributor

This pull request is fully signed and it will be integrated in one of the next IB/CMSSW_14_0_X/master IBs (tests are also fine). This pull request will now be reviewed by the release team before it's merged. @sextonkennedy, @antoniovilela, @rappoccio (and backports should be raised in the release meeting by the corresponding L2)

@smuzaffar smuzaffar merged commit 5354a4c into IB/CMSSW_14_0_X/master Dec 14, 2023
27 of 29 checks passed
@smuzaffar smuzaffar deleted the add-py3-torch branch December 18, 2023 18:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants