A Survey of Error Analysis Procedures Used by Existing XAFS Software Packages

Revision: June 25, 1998


Summary

A number of existing software packages were surveyed to determine currently used methods for error analysis. This survey is part of the effort by the Standards and Criteria (S&C) committee of the International XAFS Society (IXS) to develop guidelines for standardized error-reporting procedures and minimum reporting requirements. Such guidelines will ultimately be used by authors of present and future software packages to gradually standardize error reporting methods.


Purpose

The purpose of this survey it to determine currently used methods for error analysis. The decision to conduct the survey was taken at the meeting of the S&C committee of the IXS in July 1997. The goal is to eventually develop guidelines for standardized error-reporting procedures and minimum reporting requirements that are compatible with accepted practices for error analysis and, as much as possible, currently existing procedures. Such guidelines will ultimately be used by authors of present and future software packages to gradually standardize error reporting methods.


Scope

This survey is neither exhaustive nor complete. The majority of the software packages surveyed are listed on the ESRF page of XAFS software. In addition, several packages listed on the IXS and UWXAFS home pages were also included. The following authors/maintainers have provided information about their respective packages:

 Author/Maintainer  Package
 Daniel Aberdam  SEDEM
 Boyan Boyanov  MacXAFS
 Adriano Filipponi  GNXAS
 Alain Michalowicz  EXAFS pour le Mac
 Matt Newville  FeffIt
 Thorsten Ressler  WinXAS
 Paul Stephenson  EXCURVE
 Marius Vaarkamp  XDAP
 Markus Winterer  XAFS

Note that this does not include quite a few of the packages listed at the above locations, whose authors did not respond. Information on other packages, to be included in future revisions of this document, may be submitted here.


Questionnaire

Authors were asked to answer the following questions:

  1. Form of function(s) being minimized (, other), including normalization pre-factor(s), experimental uncertainties, etc.
  2. Provisions for estimating "experimental" errors, if any. How are these incorporated into the function being minimized? How are the estimated uncertainties transformed between k-space and r-space (if applicable). Possible candidates in this category include:
  3. Methods used to calculate confidence limits, e.g., square roots of the diagonal elements of the covariance matrix, brute force methods, whatever. Please address specifically how are inadequate estimates of "experimental" errors accommodated, e.g., renormalization of the at the minimum, doubling of , increasing by a fixed fraction, etc.
  4. Does your software calculate correlation matrices? If yes, what is the definition used? Are any guidelines provided for the interpretation of the correlation matrix?
  5. Figures of merit provided to the user---normalized residuals, R-factors, etc. Please include specific formulae, as well as ranges of acceptable values, etc.
  6. Whatever thoughts you care to include on the subject


Glossary

This section summarizes the notation used in this document. Some of the notation used by the authors has been changed to maintain consistency of style across the document. The following set of common symbols is used:

The packages are listed in alphabetical order. The name of the person providing the information is included.


EXAFS pour le Mac

 WWW Site:  http://lure.u-psud.fr/
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/pourlemac
 Contact:  Alain Michalowicz

EXAFS pour le Mac minimizes the statistical chi-square function or

Errors in the experimental data may be specified in one of the following ways:

All fits are performed in k-space. Confidence limits are calculated either from the square roots of the diagonal elements of the covariance matrix

where is a statistical confidence level, e.g., as defined on p. 536 (Chapter 14) of the 1986 edition of Numerical Recipes. It is also possible to estimate confidence limits via brute-force methods, i.e., without assumption of parabolicity of in the vicinity of the minimum. A correlation matrix is calculated and displayed, but interpretation is left to the user. The following figures of merit are provided:

Built-in facilities for statistical significance testing (F-test) are also available.


EXCURVE

 WWW Site:  http://www.dl.ac.uk/SRS/XRS/Computing/Programs/excurv97/ex97.html
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/excurve
 Contact:  Paul Stephenson

EXCURVE minimizes the quantity

The weightings w determine the relative significance of the EXAFS, distance, and angle contributions. The sum of the weightings must equal one. Often only the EXAFS contribution is used. The other two terms are restraints, used in special circumstances where the model includes well characterised groups of atoms. The EXAFS contribution is given by:

where

Details on the definitions of the other terms are available here. It is also possible to use experimental values as the weights in the refinement, where can be obtained by using a spline fit to the experiment. Confidence limits for the fit parameters [p] are estimated from the square roots of the diagonal elements of the covariance matrix [C] as

The following figures of merit are provided to the user:

A value of around 25% for R would normally be considered a reasonable fit, with values of 10% or less being difficult to obtain on unfiltered data.


FeffIt

 WWW Site:  http://krazy.phys.washington.edu/
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/feffit
 Contact:  Matt Newville

FeffIt minimizes the unnormalized "squared residue" function . Fits to multiple data sets are possible in r-space (with or without phase corrections), unfiltered, and back-transformed k-space. R-space fits implicitly include k-weighting and FT windows. Complex algebraic and transcendental relationships may be defined between the fit parameters with FORTRAN-like syntax, both within a single data set, and accross multiple data sets. Errors in the experimental data are estimated from the r.m.s. amplitude of the r-space FT between 15 and 25 A or are specified by the user, in both cases as a single number for all data points. This number is transformed as needed between k and r-space with the formula

where is the k-grid spacing. Note that the above formula assumes a specific normalization of the FTs. Confidence limits for the fit parameters [p] are estimated from the square roots of the diagonal elements of the covariance matrix [C] as

and correspond to a increase in from its "optimal" value by a factor of . Pair-correlation coefficients are calculated as

The program documentation discusses the correlation matrix to some extent. The final values of the statistical chi-square function, the reduced chi-square function, and an R-factor are reported:


GNXAS

 WWW Site:  http://camcnr.unicam.it/www/gnxas.html
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/gnxas
 Contact:  Adriano Filipponi

In the GNXAS package the function minimized is

where w is a suitable, not necessarily integer, k-weighting of the residuals. is equivalent to a scaled residual function of the type defined above, provided that mimics the energy dependence of the inverse squared mean noise in the data . The sum is extended over M experimental points. In practice is evaluated in different regions of the spectrum by the residual differences between the data points and a high degree polynomial approximation (for instance 6 degree every 50 points). This algorithm requires finely sampled spectra possibly with . These values are fitted to the function , where is a constant which contributes to the scaling factor between and . The minimization of is achieved with the MINUIT subroutine of the CERN library. The confidence intervals for the fit parameters in the p-dimensional space are defined by the equation

where C is the critical value of the distribution with p degrees of freedom (p is the number of free parameters) and the 95% confidence level. The confidence intervals are roughly ellipsoidal regions in the p-dimensional parameter space. The shape of the region is usually given by the covariance matrix which can be printed. Intersection of this region with specific 2-dimensional parameter sub-spaces can be visualized using the contour MINUIT command. Details on the interpretation of the correlation matrices and minimization procedures can be found in A. Filipponi, J. Phys. Condensed Matter 7, 9343 (1995) and A. Filipponi and A. Di Cicco, Phys. Rev. B 52, 15135 (1995).


MacXAFS

 WWW Site:  ftp://ixs.csrri.iit.edu/programs/uwlib.mac/macxafs.hqx
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/macxafs
 Contact:  Boyan Boyanov

MacXAFS minimizes either the statistical chi-square function or

Up to two data sets may be fit simultaneously in either k-space or r-space, the latter with or without phase corrections. Simple algebraic relationships may be defined between the fit parameters, both within and accross data sets. When applicable, errors in the experimental data may be specified in one of four ways:

In the former two cases the error estimate is transformed between k-space and r-space as needed with a formula identical to that used by FeffIt. Confidence limits for the fit parameters [p] are estimated from the square roots of the diagonal elements of the covariance matrix [C] as

and correspond to one of the following:

Pair-correlation coefficients are calculated as

No interpretation guidelines are provided. Brute force pair-correlation contours may be calculated at the user's discretion. An R-factor is calculated as an indicator of overall fit quality:


SEDEM

 WWW Site:  http://www.esrf.fr/computing/expg/subgroups/theory/xafs/SEDEM/SEDEM.html
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/sedem
 Contact:  Daniel Aberdam

SEDEM minimizes the unnormalized "squared residue" function in unfiltered or back-transformed k-space . Errors in the experimental data are estimated with one of two methods:

Either a k-dependent table or the r.m.s. value may be used in both cases. Systematic errors in the background subtraction are accounted for implicitly when the second method described above is used to determine the experimental errors. Confidence limits for the fit parameters [p] are calculated in two steps. In the first stem, the confidence limits are estimated from the square roots of the diagonal elements of the covariance matrix [C] as

where is a statistical confidence level, e.g., as defined on p. 536 (Chapter 14) of the 1986 edition of Numerical Recipes. The user is advised that these estimates represent a lower limit of the parameter uncertainties, and should not be given actual statistical interpretation. In the second step the pair correlations between the parameter and all other parameters are calculated, and the largest of the uncertainty domains found is selected. The uncertainty domain is is defined as the width of the constant- contour corresponding to the chosen confidence level, measured along the parameter axis corresponding to. Specific formulae may be found in the SEDEM documentation. Pair-correlation coefficients are calculated as

A global correlation coefficient is also calculated for each parameter

where

and is the curvature matrix. Guidelines for the interpretation of the correlation coefficients will be provided in future releases. A fit quality factor is also calculated

where Q is the incomplete gamma function.


WinXAS

 WWW Site:  http://ourworld.compuserve.com/homepages/t_ressler
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/winxas
 Contact:  Thorsten Ressler

WinXAS minimizes the unnormalized "squared residue" function . An estimate for the experimental error is calculated from the noise amplitude of the experimental absorption spectrum, and may be adjusted by the user. Confidence limits are estimated from the square roots of the diagonal elements of the fit covariance matrix

Pair-correlation coefficients are calculated as

Guidelines for interpretation are provided in the user manual. Several figures of merit are calculated:


XAFS

 WWW Site:  http://www.esrf.fr/computing/expg/subgroups/theory/xafs/xafs_software.html#MarkusWinterer
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/winterer
 Contact:  Markus Winterer

XAFS minimizes the reduced chi-square function . Statistical error of the data are used as described in Koningsberger and Prins and an average is used for propagation in the analysis procedure, e.g.,

for transmission data. The user is given the option of switching to other error estimates. Systematic errors in the background subtraction are checked by derivatives of the background with different k-weights and by the Fourier transforms (FTs) of the derivative of the background. Confidence limits for the fit parameters [p] are estimated from the square roots of the diagonal elements of the covariance matrix [C]

where is a user-specified confidence level. Pair-correlation coefficients are calculated as

and no interpretation guidelines are provided. A goodness-of-fit estimate G, as described in Chapter 14 of Numerical Recipes is calculated

where Q is the incomplete gamma function. An R-factor is planned for future versions.


XDAP

 WWW Site:  http://www.xs4all.nl/~xsi
 Abstract:  http://ixs.csrri.iit.edu/catalog/XAFS_Programs/xdap
 Contact:  Marius Vaarkamp

XDAP minimizes either the unnormalized "squared residue" function or

The experimental errors can be calculated after background subtraction and normalization or after Fourier filtering. Confidence limits are estimated from the square roots of the diagonal elements of the fit covariance matrix

Pair-correlation coefficients are calculated as

Interpretation is left to the user, although examples of good results are supplied. The covariance matrix is calculated using statistical errors in the data and analytical partial derivatives of the model function to account for correlation between parameters. The following figures of merit are supplied:

The significance of additional contributions to the model function may be tested with a standard F-test (a.k.a. Joyner test).


Last modified on June 25, 1998 by Boyan Boyanov


| Top of Page | Purpose | Scope | Questionnaire | Glossary | IXS home |