Advertisement

CERN is making the Large Hadron Collider's data more accessible

It's almost impossible for the organization to release raw datasets, however.

The European Organization for Nuclear Research (CERN) will open up access to more data from Large Hadron Collider (LHC) experiments. Under an updated policy, data will be released around five years after it's collected and CERN hopes to release the full dataset publicly "by the close of the experiment concerned." Core LHC collaborators ALICE, ATLAS, CMS and LHCb all endorsed the move.

CERN will make level 3 data available, which will allow anyone to conduct “high-quality analysis” on information obtained from Large Hadron Collider experiments. Level 3 relates to "calibrated reconstructed data with the level of detail useful for algorithmic, performance and physics studies," according to CERN.

The organization won't release raw data, however. The open data policy states that it's "not practically possible to make the full raw dataset from the LHC experiments usable in a meaningful way outside its collaborations." That's because of the complexity of the data, software and metadata and access issues to the vast troves of stored information, among other factors. LHC collaborators don't have general access to the raw data either. Instead, the assembly of level 3 data "is performed centrally."

Still, CERN suggests level 3 data can bolster particle physics research, for example. It noted the dataset could be used for scientific computing research too. Researchers may, for instance, tap into the data to "improve reconstruction or analysis methods based on machine learning techniques." The organization notes that approach needs rich datasets for training and validation.

Samples of level 1 and level 2 data were already available. Level 1 relates to supporting data for results that are published in scientific articles. Level 2, meanwhile, denotes dedicated datasets that are designed for outreach and education purposes.