visits since October 2005

Arlequin ver 3.5.2

(released on 12.04.2015)


Arlecchino
About What's new Implemented methods Downloads Screenshots

What's new

Version 3.5.2 (12.04.2015)

This version introduces a few changes as compared to ver 3.5.1.3:

Bug corrections :

Version 3.5.1.3 (17.09.2011)

Version 3.5.1.3 is a maintenance version correcting a few bugs and introducing small modifications compared to version 3.5.1.2.

WinArl on Mac OS X (11.10.2010)

Kent Holsinger has kindly developed a MacOSX binary version of Winarl35 under WineBottler, which needs to be installed on your Mac.
More information and a downloadable version of WinArl35.dmg can be found here.

Version 3.5.1.2 (22.04.2010)

Version 3.5.1.2 just include a bug correction compared to version 3.5.
The bug occured when trying to estimate allele or haplotype frequencies via the EM algorithm on recessive data. The program was stalling due to the presence of an infinite loop. All versions on all platforms were affected. This bug has now been corrected in ver 3.5.1.2, which is available on the download page.

Version 3.5

Compared to version 3.11, Arlequin 3.5 includes several bug corrections, addition of new computations, and several significant improvements. The main improvement is its interfacing with the R statistical package, allowing one to produce high qualitiy graphs of many results found in the result files. We also introduce new console versions of Arlequin for both Windows and Linux.

Additions:

  1. New procedure to detect loci under selection from hierarchical F-statistics, as implemented in Excoffier et al. (2009)
  2. Computation of allele frequencies at all loci for all populations, which are output in locus-specific files.
  3. Computation of the genetic distance (δμ)² for microsatellite data.
  4. Possibility to output results as an XML file with a dedicated style sheet.
  5. R-lequin:
  6. Console version of Arlequin, called arlecore, for both Windows and Linux, allowing the analysis of a large number of files with bash scripts. See readme file.
  7. Modified console version of Arlequin, called arlsumstat, for Windows and Linux, to compute specific summary statistics on data found in an arlequin project. This version is specifically intended to be used on a large number of simulated data files, for instance in an Approximate Bayesian Computation (ABC) framework. See readme file to see how to use it.

Modifications:

Bug corrections:

  1. In the summary statistics, the reported mean number of alleles was zero when there was a single monomorphic locus.
  2. LocusSeparator = None was not recognized (NONE was needed)
  3. Chakraborty's neutrality test: there was an overflow when the number of allele was larger than 265.
  4. In the molecular diversity summary table, the number of sites with transversion was incorrectly reported as the number of sites with transitions
  5. The total number of polymorphic sites reported in summary stat table was not really the total number of polymorphic sites that would be computed on the pooled populations. It was rather the total number of sites that were found polymorphic within populations.
  6. Errors when computing average summary statistics within-samples, if some loci were monomorphic in some populations.
  7. Wrong computations of standard deviations of some summary statistics (Garza-Williamson, modified Garza-Williamson, total range) and Theta(H) for microsatellite data.
  8. Option to use associated settings did not work anymore
  9. Error when computing statistics within groups and within samples when DNA sequences contained white spaces and LocusSeparator was set to Whitespace.
  10. No message was issued when a population contained only missing data at a given locus and one was attempting to perform a locus-by-locus analysis. The locus was just not listed in the locus-by-locus AMOVA.Now, a warning message is issued. 
  11. Bad handling of diploid individuals having partially missing data (on one chromosome only) when one attempts to compute locus by locus AMOVA with individual level (FIS and FIT).
  12. Setting file (arl_run.ars) was always saved in the arlequin directory instead of the directory chosen in the dialog box.
  13. It was impossible to compute the expected mismatch distribution under the demographic and the range expansion models at the same time
  14. Mantel test was not performed when a custom Ymatrix was provided.
  15. When there is a single polymorphic microsat locus, the reported average Garza-Williamson statistics was  the number of loci...


Version 3.11

Compared to version 3.1, Arlequin 3.11 is mainly an update of ver 3.1, and there is no new manual.

Bug corrections:

  1. Significance level of FSC and Var(b). The p-value associated to the variance component due to differences between populations within groups was erroneously computed when the number of samples in the genetic structure to test was identical to the total number of samples defined in the Samples section but the order of the samples in the Genetic Structure section was different from that in the Samples section. This bug has been around since the first release of Arlequin 2.0... Thanks to Romina Piccinali for finding it.
  2. The expected homozygosity reported in the Ewens-Watterson test in the samples summary section was that of the  last simulated sample. Correct value was reported only if no permutations were done.
  3. Total number of alleles reported in the statistics summary section also included the missing data allele.
  4. The population labels were incorrectly reported when computing population-specific FIS statistics. The reported order corresponded to that of the last permutation. The population labels were only correct when the significance of the global FIS statistic was not tested. Thanks to Jeff Lozier for finding this bad bug.

Modifications

Additions:


Version 3.1

Compared to version 3.01, Arlequin 3.1 includes cosmetic and speed improvements, several bug corrections and additional features:

Bug corrections:

  1. Locus-by-locus AMOVA failed for on DNA sequences when corrections  for multiple hits were selected.
  2. File conversion towards the Phylip format could not be done.
  3. It was impossible to change the default significance level of  0.05 for highlighting significant genetic distances in output file.
  4. Missing data identifier other than "?" was not accepted.
  5. If a a project file (or the path to it) contained the letters "arb", then it was erroneously considered as a Batch file.
  6. Reported confidence interval around FST were badly reported by the bootstrap procedure in case of a single group and no Individual     Level taken into account.
  7. Estimation of haplotype frequencies from distance matrix was not performed when "Conventional FST" option was selected.
  8. Locus-by-locus AMOVA reported incorrect results for Genotypic data when individuals had missing data for only one of their gene copy at a given locus.
  9. Reported number of indels differed according to the weight given to indels in the Option panel. This bug did not affect AMOVA computations.
  10. For sequence data, a mixture of N's and missing data led to problems in identifying distinct DNA sequences from distance matrix, leading to slightly incorrect FST computations.
  11. Exact test of population differentiation could not be performed when gametic phase was unknown. Now, this option has been restored, like  in ver. 2.
  12. Arlequin hanged when a given population was entered several times in the definition of group for the computation of genetic structure. Now, the error is simply flagged but the program does not hang.
  13. For frequency data, it was impossible to use a predefined distance matrix.
  14. Beta approximation of the significance of Tajima's D gave wrong results. This approximation has been suppressed and now we only report the significance level obtained from coalescent
  15. Bad computation of inbreeding coefficients under the locus-by-locus AMOVA approach for genotypic data when phenotype frequencies were larger than one. The bug caused an overestimation of the local (FIS) and total (FIT) inbreeding level. For samples where phenotype frequencies were all set to 1, the inbreeding coefficients were     correctly estimated.
  16. Expected heterozygosity reported under HWE exact test section was inaccurately computed. This inaccuracy however did not affect the results of the HWE exact test, which does not use information on  observed and expected heterozygosity.

Improvements

Additions:


Version 3.01

Compared to version 3.0, Arlequin 3.01 include some bug corrections and some additional features:

Bug corrections:

  1. Minimum Spanning Tree Checkbox was not available for Genotypic data with known Gametic Phase.
  2. Choice of how FST is computed was not available when computing pairwise distances. Now, it is synchronized with the choice of distance in the AMOVA panel.
  3. "Search for shared haplotypes" did not work for Genotypic Data with known Gametic Phase. This has been corrected and Arlequin now ouputs a list of haplotypes before the table of frequencies.
  4. [[Mantel]] section was not recognized if located after a [[Structure]] section.
  5. Improved conversion between GenePop and Arlequin formats.
  6. "Diploid Data" option is now present when converting from Genepop to Arlequin.
  7. Output of s.d. of the number of alleles (k) was sometimes zero in output of Fu's FS test. This is now corrected and annoying warning messages about " "No molecular diversity within a sample while performing Fu's test" have been suppressed in output file.

Additions:


Version 3.0

Compared to version 2, Arlequin version 3 now integrates the core computational routines and the interface in a single program written in C++. Therefore Arlequin does not rely on Java anymore. This has two consequences: the new graphical interface is nicer and faster, but it is less portable than before. At the moment we release a Windows version (2000, XP, and above) and we shall probably release later a Linux. Support for the Mac has been discontinued.

Other main changes include:

  1. Correction of many small bugs
  2. Incorporation of two new methods to estimate gametic phase and haplotype frequencies
    1. EM zipper algorithm: An extension of the EM algorithm allowing one to handle a larger number of polymorphic sites than the plain EM algorithm.
    2. ELB algorithm: a pseudo-Bayesian approach to specifically estimate gametic phase in recombining sequences.
  3. Incorporation of a least-square approach to estimate the parameters of an instantaneous spatial expansion from DNA sequence diversity within samples, and computations of bootstrap confidence intervals using coalescent simulations.
  4. Estimation of confidence intervals for F-statistics, using a bootstrap approach when genetic data on more than 8 loci are available.
  5. Update of the java-script routines in the output html files, making them fully compatible with Firefox 1.X.
  6. A completely rewritten and more robust input file parsing procedure, giving more precise information on the location of potential syntax and format mistakes.
  7. Use of the ELB algorithm described above to generate samples of phased multi-locus genotypes, which allows one to analyse unphased multi-locus genotype data as if the phase was known. The phased data sets are output in Arlequin projects that can be analysed in a batch mode to obtain the distribution of statistics taking phase uncertainty into account.
  8. No need to define a web browser for consulting the results. Arlequin will automatically present the results in your default web browser (we recommend the use of Firefox freely available on http://www.mozilla.org/products/firefox/central.html.

Last edit:  10.04.2015 by Laurent Excoffier