In Data-Dependent Analysis on some high resolution mass spectrometry instruments, the precursor ion m/z value doesn't perfectly correspond to the ion from MS1 spectra retained for the fragmentation.
As shown in this example:
The MS1 scan shows the presence of 332.2201, 337.1059 and 843.3971. This ions were retained for the further fragmentation to generate MS2 spectra.
When we look the values of precursor ions in mzXML file for the next scans here is what we have:
The value for 332.2201 is 332.2430, for 337.1059 is 337.1240 and for 843.3971 is 843.5640, either 69, 54 and 198 ppm error. For some scans the error can achieve 500 ppm.
The instruments where this problem is observed are:
Bruker Impact II - 50-70 ppm
This script will fix the precursor values in mzXML files. Works with mzXML 2.1 and 3.2 and with mzML
The steps to fix the problem are:
- Put the executable, that you can download at https://github.com/elnurgar/mzxml-precursor-corrector/releases/, in the same folder with mzXML files to be fixed.
-
Run it
-
The Waters Sypant G2Si injects every 80 scans a lock mass, reference mix. Therefore, the MS1 scans of lock mass will be ignored, as the don't come from the sample. If your machine has MS1 scans only with lock mass, you can list them in this step. Therefore, the MS1 scans with this ions as the most abundant will not be retained. If your machine doesn't have MS1 scans with lock mass just press Enter. For Waters with lock mass type 554.2615 for negative mode or 556.2771 for positive mode data.
-
The script will list mzXML and mzML files to fix. If you are ok type Y.
-
The script will generate new _modified mzXML or _modified mzML with with fixed precursor m/z values.
-
At the same time for each mzXML or mzML file the script will generate a csv file with MS1 scan number, MS2 scan number, MS2 precursor old m/z value, MS2 precursor new m/z value, Score and Rank
Score is calculated by division of log2 of intensity value on the error in ppm The rank means that this ion is the nth ion the most intense of MS1 spectra.
-
For all files, fixed in the same batch one file log.csv is generated with the file name, and correspoding scan number with the highest error and highest rank.
pyteomics= 4.5.6 tqdm>=4.64.1 numpy>=1.23.4
If you use this script please cite : Breaud, C.; Lallemand, L.; Mares, G.; Mabrouki, F.; Bertolotti, M.; Simmler, C.; Greff, S.; Mauduit, M.; Herbette, G.; Garayev, E.; Lavergne, C.; Cesari, M.; Bun-Llopet, S.-S.; Baghdikian, B.; Garayev, E. LC-MS Based Phytochemical Profiling towards the Identification of Antioxidant Markers in Some Endemic Aloe Species from Mascarene Islands. Antioxidants 2023, 12, 50. https://doi.org/10.3390/antiox12010050
All the contributions are welcome. Feel free to contact: [email protected]
MIT License