Difference between revisions of "DQM payloads"

 
(16 intermediate revisions by the same user not shown)
 
= Overview =
 
= Overview =
 +
Historical notes on the design and planning have been moved to the page "[[DQM notes]]".
 +
User-level documents will be updated and kept on GitHub, in the "documents"
 +
folder of the p3s repo: https://github.com/DUNE/p3s.
  
[[ DQM notes]]
+
The remainder of this page serves to keep record of current testing for the experts
 +
and can be expected to change periodically.
  
The original requirements for DQM were published as DocDB 1811.
+
DQM Mailing list: DUNE-PROTO-SP-DQM
Over time, there was more detail added to some of the DQM components
+
<pre>
and their expected functionality.
+
dladams@BNL.GOV        David Adams
 +
baller@FNAL.GOV        Bruce R Baller
 +
Nectarios.Benekos@CERN.CH Nektarios Benekos
 +
georgios.christodoulou@CERN.CH Georgios Christodoulou
 +
zalesak@FZU.CZ        Zalesak Jaroslav
 +
trj@FNAL.GOV          Thomas R Junk
 +
kettell@BNL.GOV        Steve Kettell
 +
kutter@PHYS.LSU.EDU    Thomas Kutter
 +
mrmooney@COLOSTATE.EDU Michael Mooney
 +
potekhin@BNL.GOV      Maxim Potekhin
 +
bv@BNL.GOV            Brett Viren
 +
mworcester@BNL.GOV    Matthew Worcester
 +
tjyang@FNAL.GOV        Tingjun Yang
 +
</pre>
  
As an additional piece of information, it is useful to consider "uBooNE lessons learned"
+
=Setting up=
information (courtesy M.Mooney)
+
For (mostly) official instructions see:
* Electron lifetime/purity every 2-8 hours.  Hardware and software. Track-based, requires O(1000) triggers.
 
* Noise filtering + event display is important (that's on the protoDUNE DQM list)
 
* Removal of coherent noise allows for uncovering of more subtle problems
 
* Channel "health": pedestal RMS, channel FFTs
 
* Slow Control is critical.  Mostly for HV monitoring
 
* Drill down from global view to subsystems
 
* Space Charge Effect probably not needed in DQM
 
  
= Evolution of the DQM plan =
+
https://wiki.dunescience.org/wiki/DUNE_Computing/Analysis_Module_From_Example_August2017
== "Original Plan" ==
 
See DocDB references for prompt processing, p3s etc (cf. DocDB 1861). The short list of DQM items includes
 
* ADC/FFT with variety of aggregation modes (e.g. APA, board, power supply etc)
 
* signal processing (filtering, de-noising, deconvolution etc)
 
* some (unspecified) fast reconstruction and event display
 
  
== Beam Instrumentation ==
+
Check versions at CERN:
Early 2017: additions by Flavio et al. Need to monitor BI itself and also match tracks to the TPC.
 
  
Comments and questions:
+
<pre>
* pLAPPD ToF will be joined with the “normal” DAQ data stream
+
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
* The fiber tracker data, other spill-relevant data, and ckov data will go into the BI DB
+
ups list -aK+ dunetpc
* When and how to capture BI data from the DB? (Current thinking from J.Paley - purely offline, from CERN DB without own cache - any news on that?)
+
</pre>
* When and how to merge it with the TPC and other data?
 
* Track reco and matching to the TPC - who will do it?
 
  
== Signal Processing and Basic Event Display ==
+
Recall that when you run LArSoft with the "-c" option, which specifies the name of the FCL file, the file is searched for along the path in $FHICL_FILE_PATH, which typically starts with "." (the current directory).
Mid-2017: add basic event-display-type (cf. channel vs time) of visual product before and after signal processing.
 
* demonstrates that the detector and software both work as expected
 
* similar to the original plans
 
* BNL team has plans for this item, leveraging signal processing experience in μBooNE, 35t etc
 
* Synergy with the OM group
 
** prototypes such as "purity calculation" have been tested under p3s at CERN - thousands of jobs run
 
** new prototype of "purity calculator" under development by B.Baller
 
** the boundary between OM and DQM can be quite fluid, most software is portable
 
*** Generally a feature branch configured at CERN and a FCL file are all that's needed
 
*** ...although a simple wrapper provides additional convenience for managing I/O file names
 
** ROOT and art/LArSoft are common denominators
 
** Photon Detector (need more info)
 
** CRT will need to be merged offline (need more info)
 
  
= People =
+
= Current Work =
==Summary==
+
References to JSON, FCL and .sh files correspond to what is in the current repo, available on GitHub.
* Photon Detector
+
The "lxvm" prefix in filenames was dropped for brevity in the following notes.
** Alex Himmel (have contact but need more info)
 
* Purity and other modules related to or derived from OM
 
** OM/DRA (Bruce Baller, Robert Sulej, Dorota Stefan, new DRA members)
 
* Signal Processing (ADC correction, filtering etc), first look at the data
 
** BNL (David Adams, Xin Qian, TBD)
 
* CRT (?)
 
* Cable map
 
** Flor (CERN)
 
* Channel map
 
** Karol, Giovanna (?)
 
* p3s deployment and operations
 
** Maxim Potekhin
 
* DQM Visualization
 
** David, Maxim
 
* Reuse of the OM visualization system is TBD (Marco?)
 
  
==Need attention==
+
==v06==
* BI (Jon Paley but he has a lot of other work); need reco part; Leigh Whitehead (?)
+
Works: purity_5.json
* PD (Need to ping Alex Himmel)
+
<pre>
* CRT (Need to ping the team)
+
[
* OM visualization tool reuse -TBD
+
    {
 +
"name": "Purity:Main",
 +
"timeout": "1000",
 +
        "jobtype": "purity",
 +
        "payload": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/purity/lxdqm_purity_3.sh",
 +
        "priority": "1",
 +
"state": "defined",
 +
"env": {
 +
    "P3S_VENV":"/afs/cern.ch/user/m/mxp/vp3s",
 +
    "P3S_LARSOFT_HOME":"/afs/cern.ch/user/d/dostefan/public/v06/",
 +
    "P3S_LARSOFT_RELEASE":"localProducts_larsoft_v06_45_01_e14_prof",
 +
    "P3S_NEVENTS":"10",
 +
    "P3S_FCL_DIR":"/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/purity",
 +
    "P3S_FCL":"lxdqm_purity_1.fcl",
 +
    "P3S_INPUT_DIR":"/eos/experiment/neutplatform/protodune/np04tier0/p3s/input",
 +
    "P3S_OUTPUT_DIR":"/eos/experiment/neutplatform/protodune/np04tier0/p3s/output/",
 +
    "P3S_OUTPUT_FILE":"PurityOutput.root"}
 +
    }
 +
]
 +
</pre>
  
=P3S=
 
See the [[p3s|p3s page]]
 
  
= Latency =
+
 
* The way the F-FTS operates naturally introduces a delay of a few minutes between the initial transfer of the data to Tier-0 and placement of the data into a directory where it is available for processing and further transfers
+
Works: evdisp_4.json
* This could be mitigated if DQM ran in EHN1 using a lightweight XRootD daemon with access to the online buffer, accounting for only ~1% of total bandwidth
+
<pre>
* There are no easy solutions outside of that, so protoDUNE needs to decide whether a lag of a few minutes is acceptable
+
[
 +
    {
 +
"name": "EvDisp:Main",
 +
"timeout": "1000",
 +
        "jobtype": "evdisp",
 +
        "payload": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/evdisp/lxdqm_evdisp_4.sh",
 +
        "priority": "1",
 +
"state": "defined",
 +
"env": {
 +
    "P3S_VENV":"/afs/cern.ch/user/m/mxp/vp3s",
 +
    "P3S_LARSOFT_HOME":"/afs/cern.ch/user/d/dostefan/public/v06/",
 +
    "P3S_LARSOFT_RELEASE":"localProducts_larsoft_v06_45_01_e14_prof",
 +
    "P3S_NEVENTS":"2",
 +
    "P3S_FCL_DIR":"/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/evdisp",
 +
    "P3S_FCL":"lxdqm_evdisp_4.fcl",
 +
    "P3S_INPUT_DIR":"/eos/experiment/neutplatform/protodune/np04tier0/p3s/input/",
 +
    "P3S_OUTPUT_DIR":"/eos/experiment/neutplatform/protodune/np04tier0/p3s/output/",
 +
    "P3S_EVDISP_DIR":"/eos/experiment/neutplatform/protodune/np04tier0/p3s/evdisp/",
 +
    "P3S_USED_DIR":"/eos/experiment/neutplatform/protodune/np04tier0/p3s/used/",
 +
    "P3S_OUTPUT_FILE":"evdisp.root"}
 +
    }
 +
]
 +
</pre>

Latest revision as of 12:28, 1 June 2018

Overview

Historical notes on the design and planning have been moved to the page "DQM notes". User-level documents will be updated and kept on GitHub, in the "documents" folder of the p3s repo: https://github.com/DUNE/p3s.

The remainder of this page serves as a record of current testing for experts and can be expected to change periodically.

DQM Mailing list: DUNE-PROTO-SP-DQM

dladams@BNL.GOV        David Adams
baller@FNAL.GOV        Bruce R Baller
Nectarios.Benekos@CERN.CH Nektarios Benekos
georgios.christodoulou@CERN.CH Georgios Christodoulou
zalesak@FZU.CZ         Zalesak Jaroslav
trj@FNAL.GOV           Thomas R Junk
kettell@BNL.GOV        Steve Kettell
kutter@PHYS.LSU.EDU    Thomas Kutter
mrmooney@COLOSTATE.EDU Michael Mooney
potekhin@BNL.GOV       Maxim Potekhin
bv@BNL.GOV             Brett Viren
mworcester@BNL.GOV     Matthew Worcester
tjyang@FNAL.GOV        Tingjun Yang

Setting up

For (mostly) official instructions see:

https://wiki.dunescience.org/wiki/DUNE_Computing/Analysis_Module_From_Example_August2017

Check versions at CERN:

source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh 
ups list -aK+ dunetpc

Recall that when you run LArSoft with the "-c" option, which specifies the name of the FCL file, the file is searched for along the path in $FHICL_FILE_PATH, which typically starts with "." (the current directory).
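
The lookup described above can be illustrated with a small Python sketch. It assumes that $FHICL_FILE_PATH is a colon-separated list of directories and that the first directory containing the file wins; the helper name find_fcl is hypothetical and is not part of LArSoft or p3s. It may be handy for checking which copy of a FCL file a job would pick up.

```python
import os
from pathlib import Path
from typing import Optional


def find_fcl(name: str, search_path: Optional[str] = None) -> Optional[Path]:
    """Return the first match for `name` along a colon-separated search path.

    Assumption: the path behaves like $FHICL_FILE_PATH, i.e. directories are
    tried in order and the first one that contains the file is used.
    """
    if search_path is None:
        search_path = os.environ.get("FHICL_FILE_PATH", ".")
    for entry in search_path.split(":"):
        if not entry:
            continue  # skip empty entries, e.g. from a trailing colon
        candidate = Path(entry) / name
        if candidate.is_file():
            return candidate
    return None


if __name__ == "__main__":
    # File name taken from the job descriptions on this page.
    print(find_fcl("lxdqm_evdisp_4.fcl"))
```

Because the path typically starts with ".", a FCL file in the working directory would shadow a file of the same name found later in the path.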

Current Work

References to JSON, FCL and .sh files correspond to what is in the current repo, available on GitHub. The "lxvm" prefix in filenames was dropped for brevity in the following notes.

v06

Works: purity_5.json

[
    {
        "name": "Purity:Main",
        "timeout": "1000",
        "jobtype": "purity",
        "payload": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/purity/lxdqm_purity_3.sh",
        "priority": "1",
        "state": "defined",
        "env": {
            "P3S_VENV": "/afs/cern.ch/user/m/mxp/vp3s",
            "P3S_LARSOFT_HOME": "/afs/cern.ch/user/d/dostefan/public/v06/",
            "P3S_LARSOFT_RELEASE": "localProducts_larsoft_v06_45_01_e14_prof",
            "P3S_NEVENTS": "10",
            "P3S_FCL_DIR": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/purity",
            "P3S_FCL": "lxdqm_purity_1.fcl",
            "P3S_INPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/input",
            "P3S_OUTPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/output/",
            "P3S_OUTPUT_FILE": "PurityOutput.root"
        }
    }
]
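
Before submitting a job description like the one above, it can be useful to check that it parses and has the expected shape. The following Python sketch does that under stated assumptions: the list of required keys is inferred from the examples on this page rather than taken from a p3s schema, and the function name check_jobs is hypothetical.

```python
import json

# Assumption: these keys are inferred from the examples on this page,
# not from an official p3s schema.
REQUIRED_KEYS = ("name", "timeout", "jobtype", "payload", "priority", "state", "env")


def check_jobs(text: str) -> list:
    """Parse a job description and return a list of problems (empty if none)."""
    jobs = json.loads(text)
    if not isinstance(jobs, list):
        return ["top level must be a list of job objects"]
    problems = []
    for i, job in enumerate(jobs):
        if not isinstance(job, dict):
            problems.append(f"job {i}: not a JSON object")
            continue
        for key in REQUIRED_KEYS:
            if key not in job:
                problems.append(f"job {i}: missing key '{key}'")
        env = job.get("env", {})
        if not isinstance(env, dict):
            problems.append(f"job {i}: 'env' is not a JSON object")
            continue
        # In the examples every setting, including numbers, is a string.
        for var, value in env.items():
            if not isinstance(value, str):
                problems.append(f"job {i}: env variable '{var}' is not a string")
    return problems


if __name__ == "__main__":
    import sys
    with open(sys.argv[1]) as f:
        for problem in check_jobs(f.read()):
            print(problem)
```

Run as a script with the path of a JSON file as its only argument; no output means no problems were found by these checks.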


Works: evdisp_4.json

[
    {
        "name": "EvDisp:Main",
        "timeout": "1000",
        "jobtype": "evdisp",
        "payload": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/evdisp/lxdqm_evdisp_4.sh",
        "priority": "1",
        "state": "defined",
        "env": {
            "P3S_VENV": "/afs/cern.ch/user/m/mxp/vp3s",
            "P3S_LARSOFT_HOME": "/afs/cern.ch/user/d/dostefan/public/v06/",
            "P3S_LARSOFT_RELEASE": "localProducts_larsoft_v06_45_01_e14_prof",
            "P3S_NEVENTS": "2",
            "P3S_FCL_DIR": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/evdisp",
            "P3S_FCL": "lxdqm_evdisp_4.fcl",
            "P3S_INPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/input/",
            "P3S_OUTPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/output/",
            "P3S_EVDISP_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/evdisp/",
            "P3S_USED_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/used/",
            "P3S_OUTPUT_FILE": "evdisp.root"
        }
    }
]


Works: crt_tpc_3.json

[
    {
        "name": "CRT:Main",
        "timeout": "1000",
        "jobtype": "crt2tpc",
        "payload": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/crt/lxdqm_crt_tpc_3.sh",
        "priority": "1",
        "state": "defined",
        "env": {
            "P3S_VENV": "/afs/cern.ch/user/m/mxp/vp3s",
            "P3S_LARSOFT_HOME": "/afs/cern.ch/user/d/dostefan/public/v06/",
            "P3S_LARSOFT_RELEASE": "localProducts_larsoft_v06_45_01_e14_prof",
            "P3S_NEVENTS": "50",
            "P3S_FCL": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/crt/lxdqm_crt_tpc_1.fcl",
            "P3S_CRT_TXT": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/crt/CRTs.txt",
            "P3S_OUTPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/output",
            "P3S_INPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/input",
            "P3S_DATA": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/"
        }
    }
]
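
Note that the CRT job gives the full path of the FCL file in P3S_FCL, while the purity and event display jobs split it into P3S_FCL_DIR and P3S_FCL. How the payload scripts actually combine these variables is not shown on this page; the Python sketch below is only one plausible way to handle both conventions, and the helper name fcl_path is hypothetical.

```python
import os


def fcl_path(env: dict) -> str:
    """Return the FCL file location implied by a job's "env" block.

    Assumption: an absolute P3S_FCL is used as is; otherwise it is taken
    relative to P3S_FCL_DIR when that variable is present.
    """
    fcl = env["P3S_FCL"]
    fcl_dir = env.get("P3S_FCL_DIR")
    if os.path.isabs(fcl) or not fcl_dir:
        return fcl
    return os.path.join(fcl_dir, fcl)


if __name__ == "__main__":
    # Values copied from the purity and CRT job descriptions on this page.
    purity_env = {
        "P3S_FCL_DIR": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/purity",
        "P3S_FCL": "lxdqm_purity_1.fcl",
    }
    crt_env = {
        "P3S_FCL": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/crt/lxdqm_crt_tpc_1.fcl",
    }
    print(fcl_path(purity_env))
    print(fcl_path(crt_env))
```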

v06_57

Works: evdisp_5.json

[
    {
        "name": "EvDisp:Main",
        "timeout": "1000",
        "jobtype": "evdisp",
        "payload": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/evdisp/lxdqm_evdisp_5.sh",
        "priority": "1",
        "state": "defined",
        "env": {
            "P3S_VENV": "/afs/cern.ch/user/m/mxp/vp3s",
            "P3S_LARSOFT_HOME": "/afs/cern.ch/user/d/dostefan/public/v06_57_00/",
            "P3S_LARSOFT_RELEASE": "localProducts_larsoft_v06_57_00_e14_prof",
            "P3S_NEVENTS": "2",
            "P3S_FCL_DIR": "/afs/cern.ch/user/m/mxp/projects/p3s/inputs/larsoft/evdisp",
            "P3S_FCL": "lxdqm_evdisp_4.fcl",
            "P3S_INPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/input/",
            "P3S_OUTPUT_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/output/",
            "P3S_EVDISP_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/evdisp/",
            "P3S_USED_DIR": "/eos/experiment/neutplatform/protodune/np04tier0/p3s/used/",
            "P3S_OUTPUT_FILE": "evdisp.root"
        }
    }
]
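
Compared with evdisp_4.json, the evdisp_5.json description differs only in the payload script and in the two LArSoft settings (P3S_LARSOFT_HOME and P3S_LARSOFT_RELEASE). A comparison like this can be automated; the Python sketch below is a hypothetical helper (diff_jobs is not part of p3s) that reports the settings that differ between two job objects.

```python
def diff_jobs(old: dict, new: dict) -> dict:
    """Return {setting: (old_value, new_value)} for every setting that differs.

    Entries of the nested "env" block are reported with an "env." prefix.
    """
    def flatten(job: dict) -> dict:
        flat = {k: v for k, v in job.items() if k != "env"}
        flat.update({f"env.{k}": v for k, v in job.get("env", {}).items()})
        return flat

    a, b = flatten(old), flatten(new)
    return {
        key: (a.get(key), b.get(key))
        for key in sorted(set(a) | set(b))
        if a.get(key) != b.get(key)
    }


if __name__ == "__main__":
    import json
    import sys
    # Usage: pass two job description files; the first job of each is compared.
    with open(sys.argv[1]) as f_old, open(sys.argv[2]) as f_new:
        old_job, new_job = json.load(f_old)[0], json.load(f_new)[0]
    for key, (before, after) in diff_jobs(old_job, new_job).items():
        print(f"{key}: {before} -> {after}")
```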