Notes on running Crux 1.2.0 =========================== `Crux is written by Jason Evans `_ redpoint -------- redpoint usage: +++++++++++++++ redpoint -h | --help redpoint -V | --version redpoint [] General options: ++++++++++++++++ -h, --help Print usage and exit. -V, --version Print version information and exit. -v, --verbose Enable verbose output (\*disabled). -q, --quiet Disable verbose output (\*enabled). -t, --threaded Enable thread parallelism (\*enabled). -u, --unthreaded Disable thread parallelism. -s , --seed= Set pseudo-random number generator seed (\*based on system time, microsecond resolution). -G , --gcmult= Factor by which to modify Python's garbage collection thresholds (\*1.0). Input/output options: +++++++++++++++++++++ -g , --stages= Comma-separated list composed of: - mc3: Run Mc3 analysis (\*enabled). - post: Compute statistics on posterior distribution (\*enabled). -i , --input-file= Input DNA alignment, in FASTA format. -p , --prefix= Input/output path prefix, used as the base name for Mc3 output files. Mc3 options (see Crux.Mc3.Mc3.optName for documentation): +++++++++++++++++++++++++++++++++++++++++++++++++++++++++ General: ........ :: --graphDelay= . --cvgSampStride= . --cvgAlpha= . --cvgEpsilon= . --minStep= . --maxStep= . --stride= . --nruns= . --ncoupled= . --heatDelta= . --swapStride= . Proposal parameters: .................... :: --ncat= . --catMedian= . --invar= . --weightLambda= . --freqLambda= . --rmultLambda= . --rateLambda= . --rateShapeInvLambda= . --invarLambda= . --brlenLambda= . --etbrPExt= . --etbrLambda= . Model parameter priors: ....................... :: --rateShapeInvPrior= . --invarPrior= . --brlenPrior= . --brlenPrior= . --rateJumpPrior= . --polytomyJumpPrior= . --rateShapeInvJumpPrior= . --invarJumpPrior= . --freqJumpPrior= . --mixtureJumpPrior= . Proposal probabilities: ....................... :: --weightProp= . --freqProp= . --rmultProp= . --rateProp= . --rateShapeInvProp= . --invarProp= . --brlenProp= . --etbrProp= . --rateJumpProp= . --polytomyJumpProp= . --rateShapeInvJumpProp= . --invarJumpProp= . --freqJumpProp= . --mixtureJumpProp= . Fixed parameter overrides (appropriate proposals are implicitly disabled): ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ --fixed-topology= Use the Newick tree in \ to fix the tree topology. --fixed-tree= Use the Newick tree in \ to fix the tree topology and branch lengths. --fixed-nmodels= Fix the number of models in the mixture. --fixed-rclass= Fix the comma-separated rclass. Semantics are strange unless \--fixed-nmodels is also specified. --fixed-rates= Fix the comma-separated rates associated with the fixed rclass. Semantics are strange unless \--fixed-nmodels is also specified. --fixed-shape= Fix the +G shape parameter. Semantics are strange unless --fixed-nmodels is also specified. --fixed-pinvar= Fix the +I proportion of invariable sites. Semantics are strange unless --fixed-nmodels is also specified. --fixed-freqs= Fix the comma-separated state frequencies. Semantics are strange unless --fixed-nmodels is also specified. Posterior distribution statistics options (\*all enabled by default): +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ --burnin= Set number of burn-in samples. Specify \--burnin=half for a burn-in equal to half the total samples (\*default). --sum= Write summary statistics to \.sum. --trprobs= Write tree topology frequencies with mean branch lengths to \.trprobs. --parts= Write partition frequencies and branch length statistics to \.parts. --con= Write consensus tree (first tree with mean branch lengths, second with bipartition support values) to \.con. Example command lines --------------------- # Default starting conditions: GTR, 1 run, no Metropolis-coupling, no proportion of invariants, and no gdsarv: ``redpoint -i ../concatg_set5.fa --prefix=run1`` # GTR+I+G, 2 runs, each 4 chains, pInvar, and gdsarv 4 categories, submitted with MPI on 8 CPUs (1 per chain): ``mpirun -np 8 redpoint -i ../concatg_set5.fa --nruns=2 --ncoupled=4 --prefix=run1 --ncat=4 --catMedian=True --invar=True`` Output files ------------ :: .con - 2 trees: a) phylogram, b) cladogram with PP's .l - Starting configuration .lnL.R - R script to print lnL plots .p - sampled parameter values .parts - partition table .s - lnL, proposal probs., swap rates .sum - summary .t - all sampled trees (Newick) .trprobs - tree posteriors Typical options to change ------------------------- Default values: +++++++++++++++ :: burnin=half Default burnin - and interger or "half" ? nruns=1 Number of independent runs ncoupled=1 Number of chains in Metropolis-coupling stride=100 Sampling frequency minStep=100000 Minimum chain length maxStep=~inf Maximum chain length (essential set to inf) swapStride=1 ??? How often swapping is proposed between hot and cold chains ??? heatDelta=0.05 M-coupled chain heating temp. ncat=1 Number of categories in Gdasrv catMedian=False Use Gdasrv category means invar=False Set proportion of invariant sites cvgSampStride=1 How often to run convergence diagnostics (cvgSampStride x stride) graphDelay=-1.0 If positive, output R program (.lnL.R) every seconds