Tools for processing and analyzing chemometric data
Traditionally, the practitioner of a chemometrics protocol is faced with the simultaneous analysis of tens to hundreds of observations. In the case of NMR metabolomics, these datasets are commonly 1D 1H NMR spectra of some cell metabolite extract or biofluid. It is not uncommon to find ad hoc toolchains or protocols involving multiple different software packages and manual data curation steps between data acquisition and data analysis. While it is still somewhat ad hoc in a broader sense, the MVAPACK toolbox provides a flexible platform in which both new and established data handling methods may be tested and automated, at least in the context of NMR datasets.
The MVAPACK toolbox provides NMR data processing functions, statistical scaling and normalization functions, model-building routines and validation functions to facilitate rapid data analysis in NMR chemometrics. The many functions in MVAPACK are documented in the MVAPACK manual, available for download here:
The MVAPACK toolbox is written in (mostly MATLAB-compatible) GNU Octave language, and is completely open-source under the GNU GPLv3. MVAPACK runs and has been tested on the GNU/Linux and Mac OS X operating systems, and requires NMRPipe to load NMR data. Feel free to use or modify the code to your purposes, and also feel free to submit bug reports and patches to our our MVAPACK address.
If you find MVAPACK useful in your research, we ask that you cite the following work:
- B. Worley and R. Powers* (2014) "MVAPACK: A Complete Data Handling Package for NMR Metabolomics" , ACS Chem. Biol., 9(5):1138-1144 PMC4033658
The coffees dataset, published to illustrate the use of MVAPACK on a real dataset for real inference, is available in two forms. The first form is a considerably smaller file and includes the NMR data and the MVAPACK script used to analyze it. The second form is larger, and contains the resulting GNU Octave session data of running the script on the data. Both are available below.
Industrial users may contact Robert Powers for licensing details.
MVAPACK is free to use for academic users. However, you must register for access.
An email address from a valid academic institution is required for registration.
References
Publications related to MVAPACK
- B. Worley and R. Powers* (2014) "MVAPACK: A Complete Data Handling Package for NMR Metabolomics" , ACS Chem. Biol., 9(5):1138-1144 PMC4033658
- D. D. Marshall, S. Lei, B. Worley, Y. Huang, A. Garcia-Garcia, R. Franco, E. D. Dodds*, and R. Powers* (2015) "Combining DI-ESI-MS and NMR Datasets for Metabolic Profiling" Metabolomics, 11(2):391-402 PMC4354777.
- B. Worley and R. Powers* (2014). "Simultaneous Phase and Scatter Correction for NMR Datasets" Chemometr. Intell. Lab. Syst., 131(2014):1-6 PMC3907089.
- B. Worley, S. Halouska and R. Powers* (2013) "Utilities for Quantifying Separation in PCA/PLS-DA Scores" Anal. Biochem., 433(2):102-104 PMC3534867.