CERN Accelerating science

End of year activities in SFT group

The end of year is traditionally the time when SFT makes new releases of our main software products. It is a very busy time with everyone working energetically to meet various deadlines. All last minute changes and fixes need to be integrated and run through extensive checks to ensure that quality of the performance of the code is not compromised in all its aspects. As for any new release, a lot of valuable feedback is encouraged and expected from the user community in the coming weeks.

In addition, the group participated strongly in the first “Data Science at the LHC” workshop and subsequently two group members gave a mini course in “Statistics and Machine Learning” in Brasil, as part of the EPlanet project. Both these events were held in November.

More details on these activities are given below.

Geant4 10.2

The new 2015 release of Geant4, release 10.2, implements further consolidation of the code, with improvements in CPU speed of the order of 5 to 8% in calorimeter setups and with a reduced memory footprint. Release 10.2 also introduces modern C++11 Standard features in the code, with the aim to extend its use further in the next releases. The main ingredients include the following:

  • the Unified Solids Library, an alternative to the original Geant4 geometrical primitives, which is now provided as an external package called VecGeom - this change will allow us to adiabatically evolve the implementation of the various geometry primitives towards modern vectorised algorithms, with overall improved CPU performance,
  • an evolved API for allowing efficient adoption of multi-threading in conjunction with MPI, together with a better integration of the visualization system in multi-threaded applications,
  • a new implementation of the Goudsmit-Saunderson model for multiple-scattering which is used for electrons and positrons below 100 MeV - the new model implements a more accurate path-length correction algorithm, allowing for boosted CPU performance, whilst retaining an accurate physics description,
  • improved simulation of low-energy evaporated nucleons by coupling the Fritiof hadronic model with pre-compound de-excitation - this affects the energy response in hadronic showers, where the visible energy in simplified calorimeters increases for absorber materials like iron or copper; a new model for nuclear gamma de-excitation has also been introduced, affecting the lateral shapes of hadronic showers especially for heavy absorber materials,
  • the model for high-precision neutron simulation has been merged into a new model, extending the treatment of low-energy neutrons (below 20 MeV) to other charged particles: proton, deuteron, triton, 3He and alpha, with energies up to 200 MeV,
  • substantial revision of the radioactive decay model such that atomic de-excitation is now used to handle fluorescence and Auger electrons.

ROOT 6.06 and towards ROOT 7

The ROOT team has released version 6.06/00, the first in the new production release series v6.06 that will be used by almost all the LHC experiments in 2016. We expect it to show up on a computer near you fairly soon! It features several new developments, for instance command line utilities (try "rootls myfile.root"!), a much more usable new default graphics palette, a lightweight multi-processing interface to accelerate your analyses and ROOT notebooks, a fantastic new way of tutoring and doing interactive analysis. It employs PyROOT features and our JavaScript efforts. You might have seen the latter when opening a ROOT file in CernBox! Many more things are moving also behind the scenes; the class documentation has been migrated (to a large extent) to Doxygen; ROOT is again safer to be used in multithreaded contexts; we are adding new platform support (ARM and PowerPC); and we expect to move the forum to a new platform in 2016!

The next production release of ROOT, v6.08, is scheduled for the first half of next year. It will use multiple threads to accelerate for instance the reading of ROOT files. And of course we again expect several new features and improvements, such as in TMVA!

The team is also starting to prepare for the longer term by announcing plans for a new major release, ROOT 7. This will be the first step to  bringing ROOT's interfaces into the modern C++ world with the aim of making ROOT simpler to use and more robust. Watch out for blog posts discussing some of the new interfaces, and make sure to get involved by contacting us early: this is your ROOT!

CernVM

The CernVM team is preparing release candidates of the next versions of the main products, CernVM-FS and CernVM Appliance. These are due to go in production in early 2016. The next version of the file system will provide consolidation of the main features, focusing in particular on special setups, such as those encountered in supercomputers. The next version of the appliance will provide users with the possibility to create software environments compatible with the CentOS 7 Linux distribution, the latest validated version currently being deployed by the LHC experiments. It will also provide full support for container technology, making it easier to adapt the image to the diverse use cases addressed.

In addition to being widely used by the LHC experiments, with about 400000 virtual machines started each month, the CernVM appliance and file system have been the enabling technologies behind the CERN Public Computing Challenge, which took place in the second half of November.

The CernVM team is currently engaged in the organization of the next CernVM Users Workshop, which will follow the successful event held at CERN in March 2015. This will take place in the beginning of June 2016, most likely at RAL, UK.

Workshop on Data science at the LHC

From Nov. 9 - Nov. 13, the first data science at the LHC workshop was held at CERN (see Photo 1). It brought together leading experts in the field of machine learning in particle physics and other areas. The number of registrants exceeded expectations and contributed to an exciting and productive atmosphere. The workshop featured talks by all LHC experiments about their advances in the field of machine learning, new ideas in data analysis, software and hardware, talks from Google DeepMind, Idsia and others. Currently hot machine-learning topics, such as deep learning, were discussed in the particle physics context. The workshop also featured afternoon software tutorials, including Rootbook (Root-integrated Jupiter notebooks) demos of new features in TMVA, machine-learning toolkit in Root. Participants were very excited about Rootbooks, which proved to be a great learning tool, running on dedicated servers generously provided by the CERN OpenStack project.

Photo 1: Data Science at the LHC Workshop, CERN, Nov 9-13

Mini Course in Statistics and Machine Learning in Brazil

From Nov. 15 – Nov. 30, a mini-course on Statistics and Data Analysis was held in UERJ in Rio de Janeiro, as part of the EPlanet program. The course featured lectures on Root, basics of statistical data analysis, and fundamentals of machine learning.   In addition to classroom lectures, taught by experts from CERN (S. Gleyzer and L. Moneta) there were practical tutorials based on Rootbooks. One of the many highlights of the course was the use of mobile phones to run Root and TMVA examples with Rootbooks. The course was very productive and well attended by students, engineers and professors from several Brazilian universities and research centres: UERJ, UFRJ, and UEA in Manaus (see Photo 2). Follow-up collaboration and involvement of scientists from Brazil in CERN Machine Learning activities is planned.

Photo 2 : Students from UERJ and UFRJ (Rio, Brasil) who attended a “Mini Course in Statistics and Machine Learning” as part of the EPlanet project