Skip to content
This repository was archived by the owner on Feb 23, 2023. It is now read-only.

Arc6#169

Merged
petricm merged 2 commits into
DIRACGrid:masterfrom
chaen:arc6
Apr 14, 2021
Merged

Arc6#169
petricm merged 2 commits into
DIRACGrid:masterfrom
chaen:arc6

Conversation

@chaen

@chaen chaen commented Mar 26, 2021

Copy link
Copy Markdown
Contributor

This was tested here https://gitlab.cern.ch/CLICdp/iLCDirac/diracos-test/-/pipelines/2435576 . The integration tests fail for reasons totally independent from ARC. Now we need someone to actually test that against ARC CEs @fstagni @andresailer @VladimirRomanovsky @rajanandakumar ?

BEGINRELEASENOTES

CHANGE: Move to ARC 6

ENDRELEASENOTES

@petricm

petricm commented Mar 26, 2021

Copy link
Copy Markdown

I have uploaded the build of this to the standard location https://diracos.web.cern.ch/diracos/releases/diracos-arc6.tar.gz

@andresailer

Copy link
Copy Markdown
Contributor

I can push this to the iLCDirac test server, maybe even today...

@andresailer andresailer self-assigned this Mar 26, 2021
@andresailer

Copy link
Copy Markdown
Contributor

I am getting

2021-03-26 10:55:11 UTC WorkloadManagement/SiteDirector/arc-ce02.gridpp.rl.ac.uk WARN: Failed to submit job because  SUBMITTER_PLUGIN_NOT_LOADED : ARC library installation problem?

Am I missing some setting somewhere? bashrc comes from puppet

@chaen

chaen commented Mar 26, 2021

Copy link
Copy Markdown
Contributor Author

I have absolutely no idea :-)
There are arc variables defined in diracosrc, maybe they need an update ?

@andresailer

Copy link
Copy Markdown
Contributor

@rajanandakumar

Copy link
Copy Markdown

@andresailer how do I reproduce this issue? Thanks!

@andresailer

Copy link
Copy Markdown
Contributor

I use the diracos-arc6 tarball and run the sitedirector for an ARC CE as normal.
What I don't see in that tarbal is something like the ftpjob lib. But I am not sure which plugin is used for the job submission at the moment? Arc rest? gridftpjob?

@rajanandakumar

Copy link
Copy Markdown

@chaen

chaen commented Mar 29, 2021

Copy link
Copy Markdown
Contributor Author

It would seem that the packaging changed and the split a lot of things.

So this is the complete list of packages we produced:

u'nordugrid-arc-devel': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-devel-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'python2-nordugrid-arc': u'/var/lib/mock/epel-6-x86_64/result/python2-nordugrid-arc-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-test-utils': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-test-utils-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-arcctl': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-arcctl-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-arcctl-service': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-arcctl-service-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-gridftpd': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-gridftpd-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-nordugridmap': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-nordugridmap-6.10.1-1.el6.py27.usc4.noarch.rpm'
u'nordugrid-arc-hed': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-hed-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-internal': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-internal-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-client': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-client-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-arcrest': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-arcrest-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-archery-manage': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-archery-manage-6.10.1-1.el6.py27.usc4.noarch.rpm'
u'nordugrid-arc-monitor': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-monitor-6.10.1-1.el6.py27.usc4.noarch.rpm'
u'nordugrid-arc-plugins-globus-common': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-globus-common-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-xrootd': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-xrootd-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-infosys-ldap': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-infosys-ldap-6.10.1-1.el6.py27.usc4.noarch.rpm'
u'nordugrid-arc-plugins-gridftp': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-gridftp-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-python': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-python-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-arex': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-arex-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-globus': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-globus-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-datadelivery-service': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-datadelivery-service-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-gfal': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-gfal-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-needed': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-needed-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-lcas-lcmaps': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-lcas-lcmaps-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-wn': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-wn-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-debuginfo': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-debuginfo-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-community-rtes': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-community-rtes-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-gridftpjob': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-gridftpjob-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'nordugrid-arc-plugins-s3': u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-s3-6.10.1-1.el6.py27.usc4.x86_64.rpm'}

Those that we shipped with DIRACOS are


u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-client-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-needed-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-globus-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'/var/lib/mock/epel-6-x86_64/result/python2-nordugrid-arc-6.10.1-1.el6.py27.usc4.x86_64.rpm'
u'/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-6.10.1-1.el6.py27.usc4.src.rpm'

Please let me know which one you want to add (the least should ideally be complete, to avoid further compilation :-D )

@rajanandakumar

Copy link
Copy Markdown

After staring at it for some time, I think that the following should keep us going (I am not able to see the pipeline in the original message above for the error)

nordugrid-arc
nordugrid-arc-client
nordugrid-arc-plugins-needed
nordugrid-arc-plugins-globus
nordugrid-arc-plugins-arcrest
nordugrid-arc-plugins-gridftpjob
nordugrid-arc-plugins-globus-common
nordugrid-arc-plugins-gridftp
nordugrid-arc-plugins-xrootd
nordugrid-arc-plugins-gfal
python2-nordugrid-arc

@chaen

chaen commented Mar 31, 2021

Copy link
Copy Markdown
Contributor Author

So much ?!?! How come ? :-)

I've recompiled by hand and kept all the RPMs.

@andresailer would you please be so kind as to take /afs/cern.ch/user/c/chaen/public/arc6/allArc.tar.gz and to extract the RPMs @rajanandakumar listed in the DIRACOS of your test machine ? you can do that by being in the diracos folder, and running rpm2cpio <rpmfile> | cpio -dvim

@andresailer

Copy link
Copy Markdown
Contributor

I have extracted the plugins as instructed, but I still don't get pilots.

  346  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-client-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  347  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-needed-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  348  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-globus-common-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  349  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-globus-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  350  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-arcrest-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  351  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-gridftp-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  352  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-gridftpjob-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  353  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-xrootd-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  354  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-gfal-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  355  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/python2-nordugrid-arc-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim
  359  rpm2cpio allarcs/var/lib/mock/epel-6-x86_64/result/nordugrid-arc-plugins-python-6.10.1-1.el6.py27.usc4.x86_64.rpm | cpio -dvim

Also nordugrid-arc-plugins-python, just in case.

I don't get the ARC_PLUGIN_NOT_LOADED warning any longer, but

2021-04-06 09:25:09 UTC WorkloadManagement/SiteDirector VERBOSE: Evaluating queue kek2-ce01.cc.kek.jp_nordugrid-lsf-gridilc_long
2021-04-06 09:25:09 UTC WorkloadManagement/SiteDirector INFO: kek2-ce01.cc.kek.jp_nordugrid-lsf-gridilc_long: Slots=100, TQ jobs(pilotsWeMayWantToSubmit)=3, Pilots: waiting 0, to submit=3
2021-04-06 09:25:09 UTC WorkloadManagement/SiteDirector VERBOSE: Getting pilot proxy for /DC=ch/DC=cern/OU=Organic Units/OU=Users/CN=sailer/CN=683529/CN=Andre Sailer/ilc_pilot 345600 long
2021-04-06 09:25:09 UTC WorkloadManagement/SiteDirector INFO: Going to submit pilots (a maximum of 3 pilots to kek2-ce01.cc.kek.jp_nordugrid-lsf-gridilc_long queue)
2021-04-06 09:25:09 UTC WorkloadManagement/SiteDirector VERBOSE: pilotOptions: -S ILC-Development -l ILCDIRAC -r v31r0-pre -C dips://voilcdiractest71.cern.ch:9135/Configuration/Server -e ILC -N kek2-ce01.cc.kek.jp -Q nordugrid-lsf-gridilc_long -n LCG.KEK.jp -o /Resources/Computing/CEDefaults/SubmitPool=gLite
2021-04-06 09:25:09 UTC WorkloadManagement/SiteDirector/kek2-ce01.cc.kek.jp VERBOSE: Executable file path: /opt/dirac/data/work/SiteDirector/DIRAC__eLX8L_pilotwrapper.py
2021-04-06 09:25:13 UTC WorkloadManagement/SiteDirector/kek2-ce01.cc.kek.jp WARN: Failed to submit job because  Job not submitted - incorrect job description? (missing field in XRSL string?)
2021-04-06 09:25:13 UTC WorkloadManagement/SiteDirector/kek2-ce01.cc.kek.jp WARN: Failed to submit job because  some error from the CE - possibly CE problems?
2021-04-06 09:25:13 UTC WorkloadManagement/SiteDirector/kek2-ce01.cc.kek.jp WARN: Failed to submit job because  ... maybe above messages will give a hint.
2021-04-06 09:25:13 UTC WorkloadManagement/SiteDirector ERROR: Failed submission to queue Queue kek2-ce01.cc.kek.jp_nordugrid-lsf-gridilc_long:
, No pilot references obtained from the ARC job submission

With debug enabled

2021-04-06 09:34:02 UTC WorkloadManagement/SiteDirector INFO: Going to submit pilots (a maximum of 1 pilots to arc-ce02.gridpp.rl.ac.uk_nordugrid-Condor-grid3000M queue)
2021-04-06 09:34:02 UTC WorkloadManagement/SiteDirector VERBOSE: pilotOptions: -S ILC-Development -l ILCDIRAC -r v31r0-pre -C dips://voilcdiractest71.cern.ch:9135/Configuration/Server -e ILC -N arc-ce02.gridpp.rl.ac.uk -Q nordugrid-Condor-grid3000M -n LCG.RAL-LCG2.uk -o /Resources/Computing/CEDefaults/SubmitPool=gLi
te
2021-04-06 09:34:02 UTC WorkloadManagement/SiteDirector/arc-ce02.gridpp.rl.ac.uk VERBOSE: Executable file path: /opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: String successfully parsed as nordugrid:xrsl.
VERBOSE: SendCommand: Response: 250 "jobs/jMPODm8Gxnync1XDjqYugZkqABFKDmABFKDmOoFLDmABFKDmIGF00m" is current directory
VERBOSE: Generating nordugrid:xrsl job description output
VERBOSE: SendCommand: Response: 229 Entering Extended Passive Mode (|||50486|)
VERBOSE: FTP Job Control: Data channel: 130.246.180.247:50486
VERBOSE: Disconnect: Failed aborting - ignoring: Handle not in the proper state
INFO: Transfer from file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py to gsiftp://arc-ce02.gridpp.rl.ac.uk:2811/jobs/jMPODm8Gxnync1XDjqYugZkqABFKDmABFKDmOoFLDmABFKDmIGF00m/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: DataMover: cycle
INFO: Real transfer from file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py to gsiftp://arc-ce02.gridpp.rl.ac.uk:2811/jobs/jMPODm8Gxnync1XDjqYugZkqABFKDmABFKDmOoFLDmABFKDmIGF00m/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: Creating buffer: 1048576 x 2
VERBOSE: DataMove::Transfer: no checksum calculation for file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: Failed to load plugin for URL (empty)
VERBOSE: Failed to load plugin for URL (empty)
INFO: Using buffered transfer method
INFO: write_thread: get and pass buffers
VERBOSE: Waiting for buffer
FATAL: Failed to execute command
VERBOSE: write_thread: out failed - aborting
VERBOSE: write_thread: exiting
VERBOSE: buffer: read EOF : yes
VERBOSE: buffer: write EOF: yes
VERBOSE: buffer: error    : yes, read: no, write: yes
VERBOSE: Closing read channel
VERBOSE: Closing write channel
VERBOSE: (Re)Trying next destination
VERBOSE: DataMover: cycle
INFO: Real transfer from file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py to gsiftp://arc-ce02.gridpp.rl.ac.uk:2811/jobs/jMPODm8Gxnync1XDjqYugZkqABFKDmABFKDmOoFLDmABFKDmIGF00m/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: Creating buffer: 1048576 x 2
VERBOSE: DataMove::Transfer: no checksum calculation for file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: Failed to load plugin for URL (empty)
VERBOSE: Failed to load plugin for URL (empty)
INFO: Using buffered transfer method
INFO: write_thread: get and pass buffers
VERBOSE: Waiting for buffer
FATAL: Failed to execute command
VERBOSE: write_thread: out failed - aborting
VERBOSE: write_thread: exiting
VERBOSE: buffer: read EOF : yes
VERBOSE: buffer: write EOF: yes
VERBOSE: buffer: error    : yes, read: no, write: yes
VERBOSE: Closing read channel
VERBOSE: Closing write channel
VERBOSE: (Re)Trying next destination
VERBOSE: DataMover: cycle
INFO: Real transfer from file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py to gsiftp://arc-ce02.gridpp.rl.ac.uk:2811/jobs/jMPODm8Gxnync1XDjqYugZkqABFKDmABFKDmOoFLDmABFKDmIGF00m/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: Creating buffer: 1048576 x 2
VERBOSE: DataMove::Transfer: no checksum calculation for file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py
VERBOSE: Failed to load plugin for URL (empty)
VERBOSE: Failed to load plugin for URL (empty)
INFO: Using buffered transfer method
INFO: write_thread: get and pass buffers
VERBOSE: Waiting for buffer
FATAL: Failed to execute command
VERBOSE: write_thread: out failed - aborting
VERBOSE: write_thread: exiting
VERBOSE: buffer: read EOF : yes
VERBOSE: buffer: write EOF: yes
VERBOSE: buffer: error    : yes, read: no, write: yes
VERBOSE: Closing read channel
VERBOSE: Closing write channel
VERBOSE: DataMover: cycle
VERBOSE: DataMover: destination out of tries - exit
ERROR: Failed uploading file file:/opt/dirac/data/work/SiteDirector/DIRAC_ehLllC_pilotwrapper.py to gsiftp://arc-ce02.gridpp.rl.ac.uk:2811;blocksize=1048576;checksum=no/jobs/jMPODm8Gxnync1XDjqYugZkqABFKDmABFKDmOoFLDmABFKDmIGF00m/DIRAC_ehLllC_pilotwrapper.py: Failed while writing to destination
INFO: Submit: Failed uploading local input files

@andresailer

Copy link
Copy Markdown
Contributor

Also export
export ARC_LOCATION=/opt/dirac/pro/diracos/usr
and for now the LD_LIBRARY_PATH to find

$ find . -name libarcglobusutils.so.3
./usr/lib64/libarcglobusutils.so.3

so that this command
/opt/dirac/pro/diracos/usr/lib64/arc/arc-dmcgridftp

works to upload the input files for the jobs.

@chaen

chaen commented Apr 6, 2021

Copy link
Copy Markdown
Contributor Author

@petricm

petricm commented Apr 6, 2021

Copy link
Copy Markdown

The build passed but there are missing links

./tests/integration/test_ldd.sh
Finding all the binary dependencies, and putting them in /tmp/diracos/missingDependencies.txt
New missing dependencies
libarccommon.so.3
libarccommunication.so.3
libarccompute.so.3
libarccredential.so.3
libarccredentialstore.so.3
libarccrypto.so.3
libarcdata.so.3
libarcdatastaging.so.3
libarcinfosys.so.0
libarcloader.so.3
libarcmessage.so.3
libarcotokens.so.3
libarcsecurity.so.3
libarcwsaddressing.so.0
libarcwssecurity.so.3
libarcxmlsec.so.3

I have dumped the tar file diracos-arc6-2.tar.gz to the usual location

@chaen

chaen commented Apr 7, 2021

Copy link
Copy Markdown
Contributor Author

my bad, I reshuffled the list of packages to ship, and I removed one by mistake.... I redo it

@chaen

chaen commented Apr 7, 2021

Copy link
Copy Markdown
Contributor Author

@chaen

chaen commented Apr 7, 2021

Copy link
Copy Markdown
Contributor Author

The build succeeds. @petricm can you please move it as diracos-arc6 with the md5 such that @andresailer can install it ?

@petricm

petricm commented Apr 7, 2021

Copy link
Copy Markdown

I have overwritten the old diracos-arc6.tar.gz with the new build

@andresailer

Copy link
Copy Markdown
Contributor

Pulled the new tarball. Pilots are still being submitted

@chaen

chaen commented Apr 7, 2021

Copy link
Copy Markdown
Contributor Author

so all good ? Want to do more tests or should we merge ?

@andresailer

Copy link
Copy Markdown
Contributor

Pilot status is updated, pilot output can be downloaded. So seems to be OK from my point of view.

@chaen

chaen commented Apr 8, 2021

Copy link
Copy Markdown
Contributor Author

OK then I squash all the commit and we are good to go

Comment thread diracos/scriptTemplates/diracosrc_tpl.sh Outdated
Co-authored-by: Marko Petric <marko.petric@cern.ch>
@petricm petricm merged commit 142fb90 into DIRACGrid:master Apr 14, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants