@smuzaffar
Contributor

@smuzaffar
Contributor Author

please test

@cmsbuild
Contributor

cmsbuild commented Apr 8, 2021

A new Pull Request was created by @smuzaffar (Malik Shahzad Muzaffar) for branch IB/CMSSW_11_3_X/gcc10.

@smuzaffar, @mrodozov can you please review it and eventually sign? Thanks.
cms-bot commands are listed here

@cmsbuild
Contributor

cmsbuild commented Apr 8, 2021

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/14111/summary.html
COMMIT: a7d3e28
CMSSW: CMSSW_11_3_X_2021-04-07-2300/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/6799/14111/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found compilation error when building:

for dep in getPackages(pkg.dependencies): installApt(dep, scheduler, cache)
File "./pkgtools/cmsBuild", line 3254, in installApt
raise RpmInstallFailed(pkg, output)
RpmInstallFailed: Failed to install package gcc. Reason:
Reading Package Lists...
error: unknown package: external+gcc+10.3.0-5b3b2b6744d296c11f567ddae2ca2f60

* The action "final-job" was not completed successfully because The following dependencies could not complete:
install-external+gcc-fixincludes+1.0
build-cms+cmssw-tool-conf+46.0-5e50b8e553368a4d06a21be1e4f488f5
install-cms+cmssw-tool-conf+46.0-5e50b8e553368a4d06a21be1e4f488f5


@smuzaffar
Contributor Author

please test

@cmsbuild
Contributor

cmsbuild commented Apr 9, 2021

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/14128/summary.html
COMMIT: a7d3e28
CMSSW: CMSSW_11_3_X_2021-04-08-2300/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week1/cms-sw/cmsdist/6799/14128/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found compilation error when building:

/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/slc7_amd64_gcc10/external/gcc/10.3.0-5b3b2b6744d296c11f567ddae2ca2f60/include/c++/10.3.0/chrono: In substitution of 'template<class _Rep, class _Period> template<class _Period2> using __is_harmonic = std::__bool_constant<(std::ratio<((_Period2::num / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::num, _Period::num)) * (_Period::den / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::den, _Period::den))), ((_Period2::den / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::den, _Period::den)) * (_Period::num / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::num, _Period::num)))>::den == 1)> [with _Period2 = _Period2; _Rep = _Rep; _Period = _Period]':
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/slc7_amd64_gcc10/external/gcc/10.3.0-5b3b2b6744d296c11f567ddae2ca2f60/include/c++/10.3.0/chrono:473:154:   required from here
/data/cmsbld/jenkins/workspace/ib-run-pr-tests/testBuildDir/slc7_amd64_gcc10/external/gcc/10.3.0-5b3b2b6744d296c11f567ddae2ca2f60/include/c++/10.3.0/chrono:428:27: internal compiler error: Segmentation fault
  428 |  _S_gcd(intmax_t __m, intmax_t __n) noexcept
      |                           ^~~~~~
Please submit a full bug report,
with preprocessed source if appropriate.
See  for instructions.


@smuzaffar
Contributor Author

Looks like CUDA and GCC 10.3 do not work well together.

[a]

/build/muz/ssl/w/slc7_amd64_gcc10/external/cuda/11.2.2-6245018a3e5d9469c9421d6e54f288a6/bin/nvcc -forward-unknown-to-host-compiler -DEIGEN_MPL2_ONLY -DEIGEN_USE_THREADS -DENABLE_ORT_FORMAT_LOAD -DNSYNC_ATOMIC_CPP11 -DONNX_ML=1 -DONNX_NAMESPACE=onnx -DPLATFORM_POSIX -DUSE_CUDA=1 -DUSE_EIGEN_FOR_BLAS -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/include/onnxruntime -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/include/onnxruntime/core/session -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/cmake/external/SafeInt -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/cmake/external/optional-lite/include -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/cmake/external/nsync/public -I. -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/cmake/external/onnx -Iexternal/onnx -I/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/cmake/external/flatbuffers/include -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/onnxruntime -I/build/muz/ssl/w/slc7_amd64_gcc10/external/cudnn/8.1.1.33-c83ead05344eb08780c652b63d3353d0/include -I/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/cmake/external/eigen -I/build/muz/ssl/w/slc7_amd64_gcc10/external/cuda/11.2.2-6245018a3e5d9469c9421d6e54f288a6/include -cudart shared -gencode=arch=compute_37,code=sm_37 -gencode=arch=compute_50,code=sm_50 -gencode=arch=compute_52,code=sm_52 -gencode=arch=compute_60,code=sm_60 -gencode=arch=compute_70,code=sm_70 -gencode=arch=compute_80,code=sm_80 --expt-relaxed-constexpr --default-stream legacy -Xcudafe "--diag_suppress=bad_friend_decl" -Xcudafe 
"--diag_suppress=unsigned_compare_with_zero" -Xcudafe "--diag_suppress=expr_has_no_effect" -O3 -DNDEBUG -Xcompiler=-fPIC   -Xcompiler -Wno-reorder -Xcompiler -Wno-error=sign-compare -std=c++14 -MD -MT CMakeFiles/onnxruntime_providers_cuda.dir/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/onnxruntime/core/providers/cuda/math/binary_elementwise_ops_impl.cu.o -MF CMakeFiles/onnxruntime_providers_cuda.dir/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/onnxruntime/core/providers/cuda/math/binary_elementwise_ops_impl.cu.o.d -x cu -c /build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/onnxruntime/core/providers/cuda/math/binary_elementwise_ops_impl.cu -o CMakeFiles/onnxruntime_providers_cuda.dir/build/muz/ssl/w/BUILD/slc7_amd64_gcc10/external/onnxruntime/1.6.0/onnxruntime-1.6.0/onnxruntime/core/providers/cuda/math/binary_elementwise_ops_impl.cu.o
nvcc warning : The 'compute_35', 'compute_37', 'compute_50', 'sm_35', 'sm_37' and 'sm_50' architectures are deprecated, and may be removed in a future release (Use -Wno-deprecated-gpu-targets to suppress warning).
/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arenastring.h(102): warning: integer conversion resulted in a change of sign
          detected during instantiation of "T *google::protobuf::internal::TaggedPtr<T>::Get() const [with T=std::string]"
(200): here

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arenastring.h(102): warning: integer conversion resulted in a change of sign
          detected during instantiation of "T *google::protobuf::internal::TaggedPtr<T>::Get() const [with T=std::string]"
(200): here

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arenastring.h(102): warning: integer conversion resulted in a change of sign
          detected during instantiation of "T *google::protobuf::internal::TaggedPtr<T>::Get() const [with T=std::string]"
(200): here

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arenastring.h(102): warning: integer conversion resulted in a change of sign
          detected during instantiation of "T *google::protobuf::internal::TaggedPtr<T>::Get() const [with T=std::string]"
(200): here

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arenastring.h(102): warning: integer conversion resulted in a change of sign
          detected during instantiation of "T *google::protobuf::internal::TaggedPtr<T>::Get() const [with T=std::string]"
(200): here

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arenastring.h(102): warning: integer conversion resulted in a change of sign
          detected during instantiation of "T *google::protobuf::internal::TaggedPtr<T>::Get() const [with T=std::string]"
(200): here

/build/muz/ssl/w/slc7_amd64_gcc10/external/protobuf/3.15.1-aa4b6413e1b721a708ffb2f88ecca382/include/google/protobuf/arena_impl.h(347): warning: integer conversion resulted in a change of sign

/build/muz/ssl/w/slc7_amd64_gcc10/external/gcc/10.3.0-5b3b2b6744d296c11f567ddae2ca2f60/include/c++/10.3.0/chrono: In substitution of 'template<class _Rep, class _Period> template<class _Period2> using __is_harmonic = std::__bool_constant<(std::ratio<((_Period2::num / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::num, _Period::num)) * (_Period::den / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::den, _Period::den))), ((_Period2::den / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::den, _Period::den)) * (_Period::num / std::chrono::duration<_Rep, _Period>::_S_gcd(_Period2::num, _Period::num)))>::den == 1)> [with _Period2 = _Period2; _Rep = _Rep; _Period = _Period]':
/build/muz/ssl/w/slc7_amd64_gcc10/external/gcc/10.3.0-5b3b2b6744d296c11f567ddae2ca2f60/include/c++/10.3.0/chrono:473:154:   required from here
/build/muz/ssl/w/slc7_amd64_gcc10/external/gcc/10.3.0-5b3b2b6744d296c11f567ddae2ca2f60/include/c++/10.3.0/chrono:428:27: internal compiler error: Segmentation fault
  428 |  _S_gcd(intmax_t __m, intmax_t __n) noexcept
      |                           ^~~~~~
Please submit a full bug report,
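
In a log this long, the single line that matters is the "internal compiler error" one. As a minimal sketch of how to pull it out, the filter below scans log text for GCC-style `file:line:col: internal compiler error: message` lines. The function name `find_ice_lines` and the shortened sample paths are illustrative assumptions; nothing here is part of cmsBuild or cms-bot.

```python
import re

# GCC reports an ICE as "<file>:<line>:<col>: internal compiler error: <message>".
ICE_PATTERN = re.compile(
    r"^(?P<file>[^:\s]+):(?P<line>\d+):(?P<col>\d+): "
    r"internal compiler error: (?P<msg>.+)$"
)


def find_ice_lines(log_text):
    """Return (file, line, message) for every ICE line found in log_text."""
    hits = []
    for raw in log_text.splitlines():
        match = ICE_PATTERN.match(raw.strip())
        if match:
            hits.append(
                (match.group("file"), int(match.group("line")), match.group("msg"))
            )
    return hits


# Shortened, hypothetical paths; the message formats mirror the log above.
sample = """\
/path/to/arena_impl.h(347): warning: integer conversion resulted in a change of sign
/path/to/include/c++/10.3.0/chrono:428:27: internal compiler error: Segmentation fault
"""

print(find_ice_lines(sample))
```

The nvcc front-end warnings use a `file(line): warning:` form, so the pattern skips them and only the host-compiler ICE is reported.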

@mrodozov mrodozov changed the base branch from IB/CMSSW_11_3_X/gcc10 to IB/CMSSW_12_0_X/gcc10 April 15, 2021 15:29
@fwyzard
Contributor

fwyzard commented Apr 22, 2021

@smuzaffar sorry, looks like I missed this until now.

I've reported the problem to NVIDIA (it happens also with CUDA 11.3).

.A

@fwyzard
Contributor

fwyzard commented May 4, 2021

The same issue is tracked by other projects:

@edrozenberg

The GCC project has committed a patch:

https://gcc.gnu.org/git/gitweb.cgi?p=gcc.git;h=5357ab75dedef403b0eebf9277d61d1cbeb5898f
(in response to the problem report https://gcc.gnu.org/bugzilla/show_bug.cgi?id=100102)

@cmsbuild
Contributor

cmsbuild commented Jun 7, 2021

Pull request #6799 was updated.

@mrodozov
Contributor

mrodozov commented Jun 7, 2021

please test

@cmsbuild
Contributor

cmsbuild commented Jun 8, 2021

-1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/15734/summary.html
COMMIT: 760b24a
CMSSW: CMSSW_12_0_X_2021-06-06-2300/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/6799/15734/install.sh to create a dev area with all the needed externals and cmssw changes.

External Build

I found compilation error when building:

for dep in getPackages(pkg.dependencies): installApt(dep, scheduler, cache)
File "./pkgtools/cmsBuild", line 3282, in installApt
raise RpmInstallFailed(pkg, output)
RpmInstallFailed: Failed to install package gcc. Reason:
Reading Package Lists...
error: unknown package: external+gcc+10.3.0-72ba78434861bc4c0da4f08894e9a451



@mrodozov
Contributor

mrodozov commented Jun 9, 2021

please test
retry once and then I'll upload the gcc and toolfile manually

@cmsbuild
Contributor

cmsbuild commented Jun 9, 2021

+1

Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/15815/summary.html
COMMIT: 760b24a
CMSSW: CMSSW_12_0_X_2021-06-08-2300/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/6799/15815/install.sh to create a dev area with all the needed externals and cmssw changes.

The following merge commits were also included on top of IB + this PR after doing git cms-merge-topic:

You can see more details here:
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/15815/git-recent-commits.json
https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/15815/git-merge-result

Comparison Summary

Summary:

  • No significant changes to the logs found
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 48528 differences found in the comparisons
  • DQMHistoTests: Total files compared: 38
  • DQMHistoTests: Total histograms compared: 2862520
  • DQMHistoTests: Total failures: 208466
  • DQMHistoTests: Total nulls: 13
  • DQMHistoTests: Total successes: 2654019
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.372 KiB( 37 files compared)
  • DQMHistoSizes: changed ( 10224.0 ): 0.190 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 250202.181 ): 0.182 KiB SiStrip/MechanicalView
  • Checked 160 log files, 37 edm output root files, 38 DQM output files
  • TriggerResults: found differences in 11 / 37 workflows
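
The DQMHistoTests counters in this summary are internally consistent: failures, nulls, successes and skipped add up to the total number of histograms compared. The check below is a small sketch of that arithmetic. It assumes these four categories partition the total, which the report does not state explicitly, and the helper name `partitions_total` is hypothetical.

```python
# Counters copied from the Jun 9, 2021 comparison summary above.
counts = {
    "failures": 208466,
    "nulls": 13,
    "successes": 2654019,
    "skipped": 22,
}
total_histograms = 2862520


def partitions_total(category_counts, total):
    """True if the per-category counts add up exactly to the reported total."""
    return sum(category_counts.values()) == total


print(partitions_total(counts, total_histograms))
```

A mismatch here would suggest the summary was truncated or that a category is missing from the list.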

@smuzaffar
Contributor Author

smuzaffar commented Jun 10, 2021

test parameters

@smuzaffar
Contributor Author

please test

@smuzaffar
Contributor Author

please test

@smuzaffar
Contributor Author

please test for slc7_amd64_gcc10

@cmsbuild
Contributor

-1

Failed Tests: HeaderConsistency
Summary: https://cmssdt.cern.ch/SDT/jenkins-artifacts/pull-request-integration/PR-dac18a/15862/summary.html
COMMIT: 760b24a
CMSSW: CMSSW_12_0_X_2021-06-09-2300/slc7_amd64_gcc10
User test area: For local testing, you can use /cvmfs/cms-ci.cern.ch/week0/cms-sw/cmsdist/6799/15862/install.sh to create a dev area with all the needed externals and cmssw changes.

Comparison Summary

Summary:

  • No significant changes to the logs found
  • ROOTFileChecks: Some differences in event products or their sizes found
  • Reco comparison results: 48796 differences found in the comparisons
  • DQMHistoTests: Total files compared: 38
  • DQMHistoTests: Total histograms compared: 2862520
  • DQMHistoTests: Total failures: 208466
  • DQMHistoTests: Total nulls: 14
  • DQMHistoTests: Total successes: 2654018
  • DQMHistoTests: Total skipped: 22
  • DQMHistoTests: Total Missing objects: 0
  • DQMHistoSizes: Histogram memory added: 0.376 KiB( 37 files compared)
  • DQMHistoSizes: changed ( 10224.0 ): 0.190 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 250202.181 ): 0.182 KiB SiStrip/MechanicalView
  • DQMHistoSizes: changed ( 312.0 ): 0.004 KiB MessageLogger/Warnings
  • Checked 160 log files, 37 edm output root files, 38 DQM output files
  • TriggerResults: found differences in 11 / 37 workflows

@smuzaffar
Contributor Author

Looks good, let's get it into the next IB to run full-scale tests.

@smuzaffar smuzaffar merged commit 2561e72 into IB/CMSSW_12_0_X/gcc10 Jun 10, 2021
@smuzaffar smuzaffar deleted the smuzaffar-patch-3 branch June 10, 2021 20:54