tcutility.job#

Overview#

This module offers you the tools to efficiently and easily build computational workflows with various engines. The module defines usefull classes that do all the heavy lifting (input and runscript preparation, job submission, etc.) in the background, while ensuring correctness of the generated inputs.

Job classes#

Jobs are run using subclasses of the Job class. The base Job class handles setting up directories and running the calculations.

The Job subclasses are also context-managers, which results in cleaner and more error-proof code:

from tcutility.job import ADFJob

# job classes are also context-managers
# when exiting the context-manager the job will automatically be run
# this ensures you won't forget to start the job
with ADFJob() as job:
    job.molecule('example.xyz')

# you can also run without the use of context-managers
# in that case, don't forget to run the job
job = ADFJob()
job.molecule('example.xyz')
job.run()

You can control where a calculation is run by changing the job.name and job.rundir properties.

from tcutility.job import ADFJob

with ADFJob() as job:
    job.molecule('example.xyz')
    job.rundir = './calc_dir/molecule_1'
    job.name = 'ADF_calculation'

print(job.workdir)

This script will run a single point calculation using ADF in the working directory ./calc_dir/molecule_1/ADF_calculation. You can access the full path to the working directory using the job.workdir property.

Slurm support#

One usefull feature is that the Job class detects if slurm is able to be used on the platform the script is running on. If slurm is available, jobs will be submitted using sbatch instead of ran locally. It is possible to set any sbatch option you would like.

from tcutility.job import ADFJob

with ADFJob() as job:
    job.molecule('example.xyz')
    # we can set any sbatch settings using the job.sbatch() method
    # in this case, we set the partition to 'tc' and the number of cores to 32
    job.sbatch(p='tc', n=32)

Furthermore, the Job class detects which platform you are running on (e.g. Bazis or Snellius) and sets default sbatch and module loading settings accordingly.

Job dependencies#

It is possible to set up dependencies between jobs. This allows you to use the results of one calculation as input for a different calculation.

from tcutility.job import ADFJob, CRESTJob

# submit and run a CREST calculation
with CRESTJob() as crest_job:
    crest_job.molecule('input.xyz')
    crest_job.sbatch(p='tc', n=32)

    crest_job.rundir = './calculations/molecule_1'
    crest_job.name = 'CREST'

# get the 10 lowest conformers using the crest_job.get_conformer_xyz() method
for i, conformer_xyz in enumerate(crest_job.get_conformer_xyz(10)):
    # set up the ADF calculation
    with ADFJob() as opt_job:
        # make the ADFJob depend on the CRESTJob
        # slurm will wait for the CRESTJob to finish before starting the ADFJob
        opt_job.dependency(crest_job)
        # you can set a file to an xyz-file
        # that does not exist yet as the molecule
        opt_job.molecule(conformer_xyz)
        opt_job.sbatch(p='tc', n=16)

        opt_job.functional('OLYP-D3(BJ)')
        opt_job.basis_set('TZ2P')
        opt_job.quality('Good')
        opt_job.optimization()

        opt_job.rundir = './calculations/molecule_1'
        opt_job.name = f'conformer_{i}'

This script will first setup and submit a CRESTJob calculation to generate conformers for the structure in input.xyz. It will then submit geometry optimizations for the 10 lowest conformers using ADFJob at the OLYP-D3(BJ)/TZ2P level of theory. Slurm will first wait for the CRESTJob calculation to finish before starting the ADFJob calculations.

Rerun prevention#

Before submitting a calculation tcutility.job will check if the calculation has already been run or is currently being managed by slurm. This way you can be sure that you are not wasting time rerunning your calculation when you run a script you have run before.

For example, we can write a script that performs optimizations using ADFJob on structures stored in a directory:

from tcutility.job import ADFJob
import os


input_xyz_directory = 'molecules'

# get the xyz files we want to optimize
xyz_files = [os.path.join(input_xyz_directory, file) for file in os.listdir(input_xyz_directory) if file.endswith('.xyz')]

for xyz_file in xyz_files:
    with ADFJob() as job:
        job.molecule(xyz_file)
        job.sbatch(p='tc', n=16)

        job.functional('OLYP-D3(BJ)')
        job.basis_set('TZ2P')
        job.quality('Good')
        job.optimization()

        job.rundir = './calculations'
        job.name = os.path.split(file)[1].removesuffix('.xyz')

Everytime this script is run it will loop through the molecules stored in the molecules directory. If you add new molecules to this directory and then rerun it, the script will detect which molecules were previously optimized and skip those. This way you can easily reuse the script multiple times without manually checking/implementing rerun prevention.

Supported engines#

We currently support the following engines and job classes:

Amsterdam Density Functional (ADF)
- ADFJob, regular ADF calculations
- ADFFragmentJob, fragment based calculations
- NMRJob, Nuclear Magnetic Resonance (NMR) calculations using ADF
- BANDJob, coming soon …
- BANDFragmentJob, coming soon …
Density Functional with Tight Binding (DFTB)
- DFTBJob, regular DFTB calculations
ORCA
- ORCAJob, regular ORCA calculations
Conformer rotamer ensemble sampling tool (CREST) including Quantum Cluster Growth (QCG)
- CRESTJob, CREST conformational search
- QCGJob, QCG explicit solvation search
Extended tight binding (xTB)
- XTBJob, extended tight binding calculations

See the API Documentation for an overview of the Job classes offered by tcutility.job module.

Note

If you want support for new engines/classes, please open an issue on our GitHub page, or let one of the developers know!

Requirements#

To run calculations related to the Amsterdam Modelling Suite (AMS) you will require a license.

For ORCA calculations you will need to add the ORCA executable to your PATH environmental variable.

Examples#

A few typical use-cases are given below. Click here for a full overview of all examples. Of course, the scripts shown above are also valid example uses of tcutility.job!

Geometry optimization using ADF#

It is quite easy to set up calculations using the tcutility.job package. For example, if we want to run a simple geometry optimization using ADF we can use the ADFJob class.

In this case we are optimizing the water dimer at the BP86-D3(BJ)/TZ2P level. To handle the ADF settings you can refer to the GUI. For example, to use a specific functional simply enter the name of the functional as it appears in the ADF GUI. The same applies to pretty much all settings. The ADFJob class will handle everything in the background for you.

The job will be run in the ./calculations/GO_water_dimer directory. The tcutility.job package will handle running of the calculation as well. It will detect if your platform supports slurm and if it does, will use sbatch to run your calculations. Otherwise, it will simply run the calculation locally.

import pathlib as pl

from scm.plams import AMSJob, Molecule, Settings, config, finish, init
from tcutility.job import ADFJob

current_file_path = pl.Path(__file__).parent
mol_path = current_file_path / "water_dimer.xyz"


def try_plams_job(mol: Molecule) -> None:
    # Test case with plams for checking if plams works solely on Windows
    run_set = Settings()
    run_set.input.ams.Task = "GeometryOptimization"
    run_set.input.adf.Basis.Type = "DZP"
    run_set.input.adf.XC.GGA = "BP86"

    config.log.file = 7
    config.log.stdout = 7

    init(path=str(current_file_path), folder="GO_water_dimer", config_settings=config)
    AMSJob(molecule=mol, name="water_dimer", settings=run_set).run()
    finish()


def try_tcutility_job(mol: Molecule) -> None:
    # Test case with tcutility for checking if tcutility works solely on Windows
    with ADFJob(use_slurm=False) as job:
        job.molecule(mol)
        job.rundir = str(current_file_path / "calculations")
        job.name = "GO_water_dimer"
        job.functional("BP86-D3(BJ)")
        job.basis_set("TZ2P")
        job.quality("Good")
        job.optimization()


def main():
    current_file_path = pl.Path(__file__).parent
    mol_path = current_file_path / "water_dimer.xyz"

    mol = Molecule(str(mol_path))

    # Use these functions to test if a plams and tcutility job can be run on Windows, Mac, and Linux. Both do not use slurm.
    try_plams_job(mol)
    try_tcutility_job(mol)


if __name__ == "__main__":
    main()

6

O      -1.61075942       0.14972207       0.00000000
O       1.27324620      -0.14984188       0.00000000
H      -2.05173067      -0.71502154       0.00000000
H      -0.65160034      -0.06225163       0.00000000
H       1.52042212       0.38869649      -0.77034720
H       1.52042212       0.38869649       0.77034720

Fragment calculation using ADF#

Another common usage of ADF is running a fragment calculation. This calculation requires setting up three different ADF jobs. Using the tcutility.job package allows you to set up and run these kinds of calculations in as little as 8 lines of code.

In this case we make use of a special xyz file format (see tcutility.molecule.guess_fragments()) which specifies the fragments. This saves us some work in setting up the calculations.

from tcutility.job import ADFFragmentJob
from tcutility import molecule

# load a molecule
mol = molecule.load('NH3BH3.xyz')

# define a new job using the Job context-manager
with ADFFragmentJob() as job:
	# add the molecule
	job.molecule(mol)
	# add the fragments. The fragment atoms are defined in the input xyz file
	for fragment_name, fragment in molecule.guess_fragments(mol).items():
		job.add_fragment(fragment, fragment_name)

8

N       0.00000000       0.00000000      -0.81474153 frag=Donor
B      -0.00000000      -0.00000000       0.83567034 frag=Acceptor
H       0.47608351      -0.82460084      -1.14410295 frag=Donor
H       0.47608351       0.82460084      -1.14410295 frag=Donor
H      -0.95216703       0.00000000      -1.14410295 frag=Donor
H      -0.58149793       1.00718395       1.13712667 frag=Acceptor
H      -0.58149793      -1.00718395       1.13712667 frag=Acceptor
H       1.16299585      -0.00000000       1.13712667 frag=Acceptor

tcutility.job#

Overview#

Job classes#

Slurm support#

Job dependencies#

Rerun prevention#

Supported engines#

Requirements#

Examples#

Geometry optimization using ADF#

Fragment calculation using ADF#

This Page