Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add additional validation samples #17

Closed
jpata opened this issue Jan 30, 2019 · 7 comments
Closed

Add additional validation samples #17

jpata opened this issue Jan 30, 2019 · 7 comments
Labels
particleflow CMS Particle Flow Validation

Comments

@jpata
Copy link
Owner

jpata commented Jan 30, 2019

So far we were testing with a FlatQCD sample. Now need to add a few additional ones (PU & no PU, Zmm, MinBias). One issue is making sure we can access these samples from CERN, even if they are stored at FNAL.

@jpata jpata added the particleflow CMS Particle Flow Validation label Jan 30, 2019
@jpata
Copy link
Owner Author

jpata commented Jan 31, 2019

I looked into it a little bit and at CERN we only have the 10_4_X samples, as that is the dev release.
Issue #21.

@jpata
Copy link
Owner Author

jpata commented Jan 31, 2019

I'm having issues with the dataset availability at CERN, I've made a hypernews thread here: https://hypernews.cern.ch/HyperNews/CMS/get/comp-ops/4388.html

@jpata jpata mentioned this issue Feb 5, 2019
4 tasks
@jpata
Copy link
Owner Author

jpata commented Feb 5, 2019

I was not able to get a response on transferring datasets to CERN. Not sure who is managing T2_CH_CERN and under what conditions can we transfer data there using Phedex.
In any case, datasets can be defined here: https://github.com/jpata/cmssw/blob/pfvalidation-10_4_X-correct/Validation/RecoParticleFlow/test/datasets.py#L129

@jpata
Copy link
Owner Author

jpata commented Feb 5, 2019

OK, so we can't ask to transfer those datasets to CERN (see hn thread). I'm currently seeing how reliable reading them via xrootd from Caltech batch jobs will be.
Added ZMM and MinBias samples here: d9a89a7

@jpata
Copy link
Owner Author

jpata commented Feb 6, 2019

I discussed with Jean-Roch Vlimant, and to us it seemed more logical to transfer 1-2TB of samples to CERN than rely on xrootd daily in hundreds of batch jobs. Indeed, 99% of my batch jobs on lxplus failed because of the xrootd connection from CERN to FNAL.

Here is the Phedex request for the samples: https://cmsweb.cern.ch/phedex/prod/Request::View?request=1620067

@juska
Copy link
Collaborator

juska commented Feb 7, 2019 via email

@jpata
Copy link
Owner Author

jpata commented Feb 12, 2019

To close this topic, we will likely have to ask for data to be moved to CERN if we want to run the code only at CERN. Likely we will need one set of datasets per major CMSSW release (few TB), not every RelVal (many TB per release?). LXbatch condor jobs opening files at FNAL will work but require resubmission occasionally.

@jpata jpata closed this as completed Feb 12, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
particleflow CMS Particle Flow Validation
Projects
None yet
Development

No branches or pull requests

2 participants