Supporting Information for Evaluation of a Permutation-Based Evolutionary Framework for Lyndon Factorizations - Data


Description

Evaluation of a Permutation-Based Evolutionary Framework for Lyndon Factorizations

We use data from NCBI RefSeq (see prokaryotes.csv) in amino acid format. The NCBI genome list gives the FTP URLs from which the files can be downloaded. The amino acid format files end in "_protein.faa.gz".
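As a minimal sketch of how one of these files might be located and read: the function names `protein_faa_url` and `iter_fasta` are hypothetical and are not part of this dataset's code. The sketch assumes that each FTP URL points at an assembly directory whose last path component is also the file-name prefix. No column name from prokaryotes.csv is assumed, and the directory in the comment is only an illustrative example.

```python
from posixpath import basename


def protein_faa_url(ftp_dir):
    """Build the URL of the amino acid file inside an assembly directory.

    Assumption: the directory name doubles as the file-name prefix, e.g.
    .../GCF_000005845.2_ASM584v2/GCF_000005845.2_ASM584v2_protein.faa.gz
    """
    ftp_dir = ftp_dir.rstrip("/")
    return f"{ftp_dir}/{basename(ftp_dir)}_protein.faa.gz"


def iter_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA text lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    # Emit the final record.
    if header is not None:
        yield header, "".join(chunks)
```

A downloaded file could then be read with `gzip.open(path, "rt")` from the standard library and passed directly to `iter_fasta`.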

The genomes we used to test both minimising the number of Lyndon factors and balancing the lengths of Lyndon factors are listed in testingGenomeList. The genome we used to select the mutation operator for the evolutionary algorithm (EA) is named in trainingGenome.

In our code, ModifiedDuvals was the working name of Flexi-Duval. Similarly, ModifiedDuvalsOperator refers to the LF-inspired operator.
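For readers unfamiliar with the baseline these names refer to, the standard Duval algorithm for Lyndon factorization can be sketched as follows. This is the textbook algorithm, not Flexi-Duval and not the code shipped with this dataset. The optional `order` argument is a hypothetical addition, included on the assumption that the permutation in the title is a reordering of the alphabet. It only illustrates how such a reordering can change the number of factors.

```python
def lyndon_factors(s, order=None):
    """Return the Lyndon factorization of s using standard Duval.

    order: optional string listing the alphabet from smallest to largest
    symbol (hypothetical parameter). Default is the usual character order.
    """
    if order is not None:
        rank = {c: i for i, c in enumerate(order)}
        key = rank.__getitem__
    else:
        key = lambda c: c

    factors = []
    i, n = 0, len(s)
    while i < n:
        # s[i:j] is a prefix of a power of a Lyndon word of length j - k.
        j, k = i + 1, i
        while j < n and key(s[k]) <= key(s[j]):
            if key(s[k]) < key(s[j]):
                k = i      # the prefix is itself a (longer) Lyndon word
            else:
                k += 1     # still inside a repetition of the same word
            j += 1
        # Emit every complete repetition of the Lyndon word found.
        while i <= k:
            factors.append(s[i:i + j - k])
            i += j - k
    return factors
```

Under the usual order, `lyndon_factors("banana")` gives four factors (`b`, `an`, `an`, `a`). With the illustrative ordering `order="nba"` it gives three (`ba`, `na`, `na`).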

NCBI have reduced the number of reference genomes (see https://ncbiinsights.ncbi.nlm.nih.gov/2020/02/14/assembly-changes/).

Date made available: 05 Sep 2020
Publisher: Prifysgol Aberystwyth | Aberystwyth University
Date of data production: 27 May 2020