CERN Prototype

From DUNE
Jump to navigation Jump to search

Materials and Meetings

Infrastructure

Expected Data Volume and Rates

Estimates in this area were developed over a period of time. Both data rate and volume are determined primarily by the number of tracks due to cosmic ray muons, recorded within the readout window, which is commensurate with the electron collection time in the TPC (~2ms).

For a quick summary of the data rates, data volume and related requirements see:

A few numbers:

  • Planned trigger rate: 200Hz
  • Instantaneous data rate in DAQ: 1GB/s
  • Sustained average: 200MB/s

The measurement program is still being updated, the total volume of data to be taken will be ~O(1PB). Brief notes on the statistics can be found in Appendix II of the "Materials" page.

Software and Computing

Intro

As of March 2015, this is work in progress. In accordance with common requirements, we anticipate preserving three copies of "precious" data to be collected during the experiment. One primary copy would be stored on tape at CERN, another at FNAL and auxiliary copies will be shared between US sites e.g. BNL and NERSC. There are proposal to reuse software which was proven in IceCube and Daya Bay experiments, to move data between CERN and the US with appropriate degree of automation, error checking and correction, and monitoring.

The salient point of the Software and Computing plan is near-time processing and monitoring of data quality, including full tracking in express production streams. This can be done on a subset of the raw data. At the same time, a rough estimate indicates that for off-line processing, ~5000 cores will be sufficient to process data with about same speed as it is collected.

Handling the data

Storage at CERN

In early 2000s, the CASTOR system was deployed at CERN which provides front-end to mass storage, in the form of both tape and disk pools. In early 2010s, the disk pools were largely migrated to EOS, a newer and high-performance system which has better functionality for managing large disk pools. CASTOR is still used for custodial data on tape.

EOS is derived from xrootd and root files are accessible natively.


Links of interest

Note: some of these links may be restricted to users associated with respective LHC experiments. This will be resolved at a later date (i.e. relevant and public information extracted, reduced and systematized).