Skip to content

Tutorials

Running ATHENA

To run ATHENA, you can pass the main file to Python (version 3.10 or later):

python athena.py

or, if you have made the file executable:

athena.py

When passing no arguments, ATHENA will output the command-line options available to it:

>> python athena.py
Welcome to ATHENA - Help. The following are the available command line arguments. Please see manual
for more detailed information on the parameters.

ATHENA command-line usage:
--codon-consumption {eager,lazy}
                        Options are eager and lazy. Specifies whether grammar will consume codons
                        when only one choice for a rule in the grammar. (default: eager)
--codon-size CODON_SIZE
                        Maximum value of a codon in the genome of an individualin the evolutionary
                        population. At a minimum it should be>= the largest number of choices for a
                        rule in the grammar (default: 250)
--color-map-file COLOR_MAP_FILE ```

Example files

The files used in this tutorial are distributed in the example directory. The tutorial assumes you are running in the parent directory of the examples directory.

Minimal options

At a minimum, ATHENA requires a grammar file, an outcome file and either a genotype data or continuous data input file. This example shows a minimal run where all other options us the default values.

athena.py --grammar-file example/genn.bnf --contin-file example/contin.txt --outcome-file example/outcome_casecon.txt

CV:  1

gen = 0 , Best fitness = (0.565,)
gen = 1 , Best fitness = (0.565,) , Number of invalids = 133
gen = 2 , Best fitness = (0.565,) , Number of invalids = 93
gen = 3 , Best fitness = (0.565,) , Number of invalids = 100
gen = 4 , Best fitness = (0.565,) , Number of invalids = 77

ATHENA will print its progress and list the cross validation number, the current generation, the best fitness value for a network in its population and the number of invalid networks in the population.

The output files will be written to the current working directory and have the basename of 'athena_results'. ATHENA writes a plot of the best network and log of each cross-validation interval. It also creates a summary file listing the networks and fitnesses of the best networks across all cross-validations.

athena_results.cv1.log      athena_results.cv3.log      athena_results.cv5.log
athena_results.cv1.png      athena_results.cv3.png      athena_results.cv5.png
athena_results.cv2.log      athena_results.cv4.log      athena_results_summary.txt
athena_results.cv2.png      athena_results.cv4.png

Specifying outputs name and location

The --out/OUT parameter can contain a path (either relative or full) along with the basename portion of the output files. For example, to have the files written to a results directory under your current directory and named experiment1:

athena.py --grammar-file example/genn.bnf --contin-file example/contin.txt --outcome-file example/outcome_casecon.txt --out results/example1

The directory must exist prior to the run. ATHENA will overwrite files with the same names.

Using genotype data

ATHENA will accept genotype data using the --geno-file/GENO_FILE parameter. The values in the genotype file should all be 0,1,2 or the value for missing (set by --missing/MISSING ). The values can be encoded as either additive or add_quad.

athena.py --grammar-file example/genn.bnf --contin-file example/contin.txt --outcome-file example/outcome_casecon.txt --out results/example1 --geno-file example/genos.txt --geno-encode add_quad

Using a parameter file

All options (except for --param-file itself) can be passed to ATHENA in a parameter file. The parameters file can serve as a record for the parameters used in a run. The options can be mixed between the command-line and the file with the value in the parameter file taking precendence when an option appears in both.

python athena.py --param-file example/rsquared.config

Limiting variables in the algorithm

The option --includedvars/INCLUDEDVARS filters the input variables to only the variables included in the list.

python athena.py --param-file example/included.config

Color coding plots

A color mapping file maps specific variables name to colors and can be included to color those nodes in the network plots.

python athena.py --param-file example/included.config --color-map-file example/color_map.txt

Parallel run

If MPI and the Python package mpi4py are installed, ATHENA can run in parallel mode.

mpirun -np 4 athena.py --grammar-file example/genn.bnf --contin-file example/contin.txt --outcome-file example/outcome_casecon.txt --out results/parallel