Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

 

MLE_BiMarkers [-pseudo] [-sd seed] [-pl parallelThreads] [-mr maxReticulation] [-tm taxonMap] [-fixtheta theta] [-varytheta] [-esptheta] [-snet startingNetwork] [-ptheta startingTheta] [-pi0 PI0] [-diploid] [-dominant dominantMarker] [-op]

ML Settings
-mnr numRunsThe number of iterations of simulated annealing. The default value is 100.

optional

-mec maxExaminationsCount

The maximum allowed times of examining a state during one iteration. The default value is 50,000.optional

-mno numOptimums

The number of optimal networks to print. The default value is 10.

optional

-mf maxFailuresThe maximum allowed times of failures to accept a new state during one iteration. The default value is 50.optional
-pl parallelThreads The number of threads running in parallel. The default value is the number of threads in your machine.optional
Inference Settings
-pseudoUse pseudolikelihood.optional
-mr maxReticulationThe maximum number of reticulation nodes in the sampled phylogenetic networks. The default value is 4.optional
-taxa taxaListThe taxa in gene tree used for inference. For example, -taxa (a,b,c)required
-tm taxonMapGene tree / species tree taxa association. By default, it is assumed that only one individual is sampled per species in gene trees. This option allows multiple alleles to be sampled. For example, the gene tree is (((a1,a2),(b1,b2)),c); and the species tree is ((a,b),c);, the command is -tm <a:a1,a2; b:b1,b2;c:c>. Note that the taxa association should cover all species, e.g. -tm <a:a1,a2; b:b1,b2> is incorrect because c:c is dropped out. optional
-fixtheta thetaFix the population mutation rates associated with all branches of the phylogenetic network to this given value (theta). By default, we estimate a constant population size across all branches.optional
-espthetaEstimate the mean value of prior of population mutation rates.optional
Starting State Settings
-snetSpecify the starting network. The input network should be ultrametric with divergence times in units of expected number of mutations per site, inheritance probabilities and population sizes in units of population mutation rate (optional). See example below. The default starting network is the MDC trees given starting gene trees. optional
-ptheta startingThetaPriorSpecify the mean value of prior of population mutation rate (startingThetaPrior). The default value is 0.036. If -esptheta is used, startingThetaPrior will be treated as the starting value, otherwise startingThetaPrior will be treated as the fixed mean value of prior of population mutation rates.optional

Data related settings

-diploidSpecify whether sequence sampled from diploids.optional
-dominant dominantMarkerSpecify which marker is dominant if the data is dominant. Either be '0' or '1'.optional
-opSpecify whether or not to ignore all monomorphic sites. If this option is used, the data will be treated as containing only polymorphic sites.optional
-pi0 valueSpecify the stationary distribution of marker "0". Value should be between 0 and 1. If not specified, the stationary distribution will be calculated from input data.optional

 

Example

Download: run_0.nex

Please download the example instead of copying from this webpage and pasting into your local file!

 

#NEXUS
Begin data;
Dimensions ntax=5 nchar=100;
Format datatype=dna symbols="012" missing=? gap=-;
Matrix

 

A_0 1001011010101011001000010101010111001010011001100101111011000011111000001010001001100000110100001011
C_0 1001111011101011001001010101010111011010010001100001111111001000111000001010011001100100100110001011
L_0 1001011010100111001000010101010111001010011001100101111111001011110000001010001001100000110100001011
Q_0 1001011010101011001001010101010111001010011001100101111111001011110000001010001001100000110100001011
R_0 1001011010101011001101010001010111001110011001100101011111001011110000001010101001100000100100001001
;End;
BEGIN PHYLONET;
MLE_BiMarkers -pseudo -mnr 10 -mec 50000 -mno 20 -mf 100 -pi0 0.5 -dd -mr 1 -pl 8 -ptheta 0.006 -thetawindow 0.006 -sd 12345678 -taxa (A_0,C_0,L_0,R_0,Q_0) -tm <A:A_0; C:C_0;L:L_0;Q:Q_0;R:R_0> ;
END;

 Note that an empty line should be left after "Matrix".

This command will run maximum pseudolikelihood estimation of 10 iterations with 20 optimal networks printed. And after 100 times of failure to accept a new state, or after 50000 examinations of new states, it will start a new iteration. We will estimate population mutation rates for all branches, and they are the same across all branches. The number of reticulation nodes is limited to 1. The starting value of population mutation rate is given by 0.006. We use the random seed of 12345678. In the end, we indicate the mapping from taxa to species.