Technical Meeting on Standards and Software Tools for Atomic and Molecular Databases

Meeting Report

A Technical Meeting on Standards and Software Tools for Atomic and Molecular Databases, was held in room M0E-100 at IAEA Headquarters in Vienna from 27 – 29 November 2019. The Scientific Secretary was Christian HILL.

Overview

The goals of this meeting are:

  • To expand, update and clarify the classification of collisional processes in plasmas given in the 2003 report written by Humbert et al. [pdf] outlining the recommendations of the IAEA Data Centres Network. Of particular interest is the inclusion of a more complete set of plasma-surface interaction processes.

  • To codify a set of standards for unambiguously identifying:

    • Atoms and ions and their isotopes; small molecules and molecular ions and their isotopologues; photons and subatomic particles of relevance to plasma processes;
    • The rotational, vibrational, and electronic states of atoms, ions and (small) molecules.

Some progress has been made at the Atomic and Molecular Data Unit to implement a set of text-based conventions on its databases: a demonstration of the PyValem application for describing, validating and transforming reactions, species and states of relevance to atomic and molecular physics is available online.

  • To recommend relational database structures for storing and software for parsing and querying information relating to atomic and molecular data.

  • To recommend a nomenclature for interconversion between unit systems considered as equivalent in the fields of spectroscopy and collisional physics (e.g. frequency ≌ wavenumber ≌ wavelength ≌ photon energy).

Examples and Motivation

Collisional processes

Some of the processes missing from the existing recommended classification are:

  • Collisions involving a change of excitation, particularly by heavy-particle collisions, e.g. Ar* + O → Ar + O*

  • Various kinds of molecular dissociation and reaction, e.g. N3+ → N2 + N+, O + O3- → e- + O2 + O2;

  • Reactions occurring on surfaces with or without desorption of the product;

  • Distinctions between changes of excitation to a specific state vs. a generic or unknown state (e.g. Ar + e- → Ar* + e- vs. Ar + e- → Ar 4s[3/2]2);

  • Rotational and vibrational excitation of molecules;

  • Processes for which the products may be partially unknown or a mixture of stoichiometric possibilities.

Identification of particles, atoms and molecules and their states
  • Journals, in particular, have incompatible and inconsistently applied conventions for referring to these species: e.g. “W LXIV” vs. “Na-like W” vs. “W+63” vs. “W63+” vs. “W^{+63}”, “W<sup>+63</sup>” and many more combinations, making searching difficult and error-prone.

A recommendation for identifying such species in a database and tools for transforming the output according to different conventions would help here. We might think of hosting a database to resolve (as far as possible) different conventions into their canonical form;

  • The same applies (more so, in fact), to atomic and molecular states. Where MathML or images are used, there is rarely any semantic metadata attached to these entities to make them meaningful to parsers. For example, the title “Dipole-allowed transitions between the X3Σ−, A′3∆, A′′3Σ+, A3Π, B3Σ−, and C3Π states of the sulfur monoxide radical”, [JQSRT 234, 98 (2019)] refers to several electronic states of a diatomic molecule, but cannot be reliably transcribed into a bibliography and the state labels are not easily searched. One doesn't have to look far to find examples of awkward implementations of incompatible conventions here:
  • No agreed convention exists to specify the extent to which an atomic or molecular state is known: it would be helpful to have a distinct notation for an “unspecified” state (a species identified only by its stoichiometry and (in the case of molecules) connectivity), an “unknown” state, a partially-specified state (e.g. “all rotational states within a specified vibrational state”, “all rovibrational states belonging to a given electronic state”, etc.)

  • A great many standards already exist for the description of individual species (InChI[Key], SMILES, SYBYL, and so on), and the Virtual Atomic and Molecular Data Centre (VAMDC) consortium has developed the XSAMS standard for data exchange in atomic and molecular physics. The goal of this meeting is not to recommend yet another alternative standard, but rather to suggest a practical, unambiguous, text-based internal representation of relevant quantities for online databases, and to motivate the development of software tools for the interconversion between existing standards.

Unit conventions in spectroscopy and collisional physics

It would helpful to specify precisely what is meant by various unit conversions in the context of particular fields, especially those which are not dimensionally correct: for example, the transformation from a photon’s (vacuum) wavelength to its frequency and energy; conversion between the wavelength and wavenumber of a spectroscopic transition, etc. Of relevance to this is the long-standing and unresolved debate about dimensionless units in the SI sysem, such as the radian [1, 2]: the status quo of distinguishing the implied units of periodic and cyclic frequencies (related by a factor of 2π) through the symbol used to denote a physical quantity (typically ν or ω) is unsatisfactory.

[1] P. R. Bunker et al., "The Planck Constant and its Units", Journal of Quantitative Spectroscopy and Radiative Transfer 237, 106594 (2019).

[2] P. J. Mohr and W. D. Phillips, "Dimensionless units in the SI", Metrologia 52, 40 (2015).

Software Tools group photo

Agenda

Wednesday, 27 November 2019

09:30 – 10:00Christian HILL and Kalle HEINOLA: Welcome, introduction of the participants, adoption of the agenda
Session 1: Collisional Processes

Chair: Christian HILL

10:00 – 10:25Grzegorz KARWASZ (Nicolaus Copernicus University, Toruń, Poland)
Total and partial cross sections for electron scattering on atoms and molecules
[abstract (pdf)] [presentation (pdf: 4.6 MB)]
10:25 – 10:50Bratislav MARINKOVIĆ (Laboratory for Atomic Collision Processes, Institute of Physics Belgrade, Serbia)
Elastic electron scattering by atomic particles and cross section representation in the Belgrade Electron-Atom/Molecule DataBase -BEAMDB
[abstract (pdf)] [presentation (pdf: 2.5 MB)]
10:50 – 11:20Coffee Break
11:20 – 12:10Marie-Lise DUBERNET (Paris Observatory, France)
From Species to publications by going through data: the VAMDC species database and Query Store
[abstract (pdf)]
With Carlo Maria ZWÖLF and Nicolas MOREAU
12:10 – 12:35Sebastian MOHR (Quantemol LTD, United Kingdom)
Quantemol-DB: A species and reaction database for plasma applications
[abstract (pdf)]
12:35 – 14:00Lunch
Session 2: Atomic and Molecular Spectroscopy

Chair: Marie-Lise DUBERNET

14:00 – 14:30Christian ENDRES (Max Planck Institute for Extraterrestrial Physics, Germany)
The Cologne Database for Molecular Spectroscopy, CDMS: spectroscopic data for Astrochemists and Astrophysicists
[abstract (pdf)]
14:30 – 15:00Thomas MARQUART (Uppsala University, Sweden)
Some viewpoints from VALD and VAMDC
15:00 – 15:30Coffee Break
Session 3: Databases and Data Services at the IAEA

Chair: Kalle HEINOLA

15:30 – 16:30Christian HILL (IAEA)
Data activities of the Atomic and Molecular Data Unit
[presentation (pdf: 1021.9 KB)]
16:30 – 17:00Jean-Christophe SUBLET (IAEA)
presentation title to follow

Thursday, 28 November 2019

Session 4: Databases and Data Services at the IAEA

Chair: Kalle HEINOLA

09:00 – 09:25Ludmila MARIAN (IAEA)
Open Science Tools
[presentation (pdf: 17.4 MB)]
Session 5: Discussion: Atomic and Molecular Species and States
09:25 – 11:00Discussion (all): Recommendations for identifying, classifying and representing atoms, molecules, ions and their states
11:00 – 11:30Coffee Break
11:30 – 12:30Discussion (all): Surfaces and plasma-surface interactions: description and representation
12:30 – 14:00Lunch
Session 6: Discussion: Collisional Process Description and Classification
14:00 – 15:30Discussion (all): Recommendations for classifying and exemplifying collisional processes
15:30 – 16:00Coffee Break
16:00 – 17:30Drafting of recommendations report; summary of discussions
19:00 – 21:30Social Dinner: Café Ansari, Praterstraße 15, 1020 Wien

Friday, 29 November 2019

Session 7: Discussion: Database Implementation, Maintenance and Interoperability
09:00 – 11:00Discussion (all): Structures for storage and querying of atomic and molecular data; Database implementations and sustainability
11:00 – 11:30Coffee Break
11:30 – 13:00Discussion (all): Facilitating interoperability between databases; Software tools for comparison and interconversion between data formats; Units
13:00 – 14:00Lunch
Session 8: Discussion: Recommendations and Draft Report
14:00 – 16:00Drafting of recommendations report; Summary of meeting
16:00 – 16:30Concluding remarks; Close of meeting

Participants

12 participants from 6 countries.

Marie-Lise DUBERNET Paris Observatory, France
Christian ENDRES Max Planck Institute for Extraterrestrial Physics, Germany
Christian HILL IAEA
Grzegorz KARWASZ Nicolaus Copernicus University, Toruń, Poland
Ludmila MARIAN IAEA
Bratislav MARINKOVIĆ Laboratory for Atomic Collision Processes, Institute of Physics Belgrade, Serbia
Thomas MARQUART Uppsala University, Sweden
Sebastian MOHR Quantemol LTD, United Kingdom
Jean-Christophe SUBLET IAEA
Kalle HEINOLA IAEA
Carlo Maria ZWÖLF Paris Observatory, France
Nicolas MOREAU Paris Observatory, France

Presentations

Marie-Lise DUBERNET (Paris Observatory, France)
From Species to publications by going through data: the VAMDC species database and Query Store
[abstract (pdf)]
With Carlo Maria ZWÖLF and Nicolas MOREAU

Christian ENDRES (Max Planck Institute for Extraterrestrial Physics, Germany)
The Cologne Database for Molecular Spectroscopy, CDMS: spectroscopic data for Astrochemists and Astrophysicists
[abstract (pdf)]

Christian HILL (IAEA)
Data activities of the Atomic and Molecular Data Unit
[presentation (pdf: 1021.9 KB)]

Grzegorz KARWASZ (Nicolaus Copernicus University, Toruń, Poland)
Total and partial cross sections for electron scattering on atoms and molecules
[abstract (pdf)] [presentation (pdf: 4.6 MB)]

Ludmila MARIAN (IAEA)
Open Science Tools
[presentation (pdf: 17.4 MB)]

Bratislav MARINKOVIĆ (Laboratory for Atomic Collision Processes, Institute of Physics Belgrade, Serbia)
Elastic electron scattering by atomic particles and cross section representation in the Belgrade Electron-Atom/Molecule DataBase -BEAMDB
[abstract (pdf)] [presentation (pdf: 2.5 MB)]

Thomas MARQUART (Uppsala University, Sweden)
Some viewpoints from VALD and VAMDC

Sebastian MOHR (Quantemol LTD, United Kingdom)
Quantemol-DB: A species and reaction database for plasma applications
[abstract (pdf)]

Jean-Christophe SUBLET (IAEA)
presentation title to follow