This is a guest post from Anthony Underwood.
I’ve been exploring the use of IQ-TREE for rapid maximum likelihood (ML) tree calculation. I’ve had reservations about RAxML for a while: even with multi-threading a run can take hours, so I have often submitted jobs to the CIPRES supercomputing facilities at phylo.org instead, where the results are usually returned in under an hour. That also makes me uncomfortable, since it is an external service and not necessarily something we can rely on.
Some of the features of IQ-TREE are:
1) A novel fast and effective stochastic algorithm to estimate maximum likelihood trees
2) An ultrafast bootstrap approximation to assess branch support, which is 10 to 40 times faster than RAxML rapid bootstrap and obtains less biased support values.
3) An ultrafast and automatic model selection which is 10 to 100 times faster than jModelTest and ProtTest.
It has a FAQ which describes how it treats ambiguous characters (similar to how RAxML does) and how to interpret the bootstrap support values: the authors suggest only trusting clades with >= 95% support. Compilation requires a recent version of CMake (2.8+) and GCC (4.6+); apart from that it is fairly straightforward.
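As a rough illustration of that 95% rule, the sketch below lists the support values in a Newick tree file that fall under a threshold. It assumes supports are written as plain numeric labels directly after a closing parenthesis (for example `)87:0.02`), which I believe is how IQ-TREE writes them when only the ultrafast bootstrap is requested. `low_support` is a hypothetical helper name, not part of IQ-TREE.

```shell
# List support values below a threshold in a Newick tree file.
# Assumption: supports are numeric labels right after ")", e.g. ")87:0.02".
# Usage: low_support TREEFILE [THRESHOLD]   (threshold defaults to 95)
low_support() {
    treefile="$1"
    threshold="${2:-95}"
    # Pull out every ")<number>" label, drop the parenthesis,
    # then keep only the values under the threshold.
    grep -oE '\)[0-9]+(\.[0-9]+)?' "$treefile" | tr -d ')' |
        awk -v t="$threshold" '$1 + 0 < t + 0'
}
```

Once a tree with support values exists, something like `low_support alignment.fas.treefile 95` would print one value per clade that falls short; empty output means every labelled clade met the threshold. The file name here assumes IQ-TREE names its output after the alignment.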
The following command (once IQ-TREE is installed) will test which evolutionary model fits best, construct an ML tree using that model and perform 1000 ultrafast bootstrap replicates. In my experience this takes minutes for an alignment containing only SNP positions, and a couple of hours for an alignment of 100 full-length bacterial genome sequences of average size.
iqtree-omp -s alignment.fas -nt 4 -m TEST -bb 1000
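If you run this regularly, a small wrapper can save some typing and catch the obvious mistakes before a long job starts. This is only a sketch: `run_iqtree` is a hypothetical function name, the flags are exactly those in the command above, and the defaults (4 threads, 1000 replicates) are simply the values used in this post.

```shell
# Run IQ-TREE with model testing and ultrafast bootstrap.
# Usage: run_iqtree ALIGNMENT [THREADS] [REPLICATES]
run_iqtree() {
    alignment="$1"
    threads="${2:-4}"
    replicates="${3:-1000}"

    # Fail early if the alignment is missing or unreadable.
    if [ ! -r "$alignment" ]; then
        echo "cannot read alignment: $alignment" >&2
        return 1
    fi
    # Fail early if the multi-threaded binary is not on PATH.
    if ! command -v iqtree-omp >/dev/null 2>&1; then
        echo "iqtree-omp not found on PATH" >&2
        return 1
    fi

    iqtree-omp -s "$alignment" -nt "$threads" -m TEST -bb "$replicates"
}
```

Calling `run_iqtree alignment.fas` then reproduces the command above, while `run_iqtree alignment.fas 8` would use eight threads instead.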
The results look promising, with tree topology, branch lengths and support values showing broad correlation with those from RAxML (see below for data). The fact that this is an actively developed piece of software (https://github.com/Cibiv/IQ-TREE), with good documentation and some good peer-reviewed papers (http://dx.doi.org/10.1093/molbev/msu300 and http://dx.doi.org/10.1093/molbev/mst024), gives me confidence to try it for the next few phylogenetic analyses I need to run.
Neisseria tanglegram (RAxML on left, IQ-TREE on right)
Shigella sonnei tree (RAxML on left, IQ-TREE on right; thanks to Tim for the data)