Blog

Articles tagged: PSI

Return to blog archive

Posted by Ville Koskinen (October 21, 2015)

PSI file formats, part 4: mzTab

mzTab is a relatively new file format for reporting protein and peptide search results. Its specification (1.0) was published in June 2014. Like mzIdentML, it is developed by the Proteomics Standards Initiative (PSI) and shares some of the same controlled vocabulary (CV). However, that’s where the similarities end. The biggest differences are that mzTab is table-based text, not XML, and [...]

Full article and comments form

Posted by Ville Koskinen (August 15, 2015)

PSI file formats, part 3: repositories

We’ve talked about mzIdentML validity only in terms of file structure. Proteomics repositories, such as PRIDE or ProteoRed, of course require files to be valid in that sense, but they impose additional requirements. If you need to upload your search results to a repository, it is worth looking at this more extended idea of validity. For simplicity, I’ll only consider [...]

Full article and comments form

Posted by Ville Koskinen (June 17, 2015)

PSI file formats, part 2: validation

The first part listed a number of ways for generating mzIdentML files and named a few pieces of software capable of reading and processing them. This part of the series discusses a rather technical issue with mzIdentML files, namely validity, and how it can affect you as a user. Validation is somewhat tangled with submitting mzIdentML files to proteomics repositories, [...]

Full article and comments form

Posted by Ville Koskinen (May 14, 2015)

PSI file formats, part 1: mzIdentML

Mascot search results are saved in a .dat format, which contains all protein and peptide identifications output by Mascot, as well as auxiliary information (search parameters, mass definitions for modifications, etc.). The .dat file is almost never the final step in data analysis; for the next step, you need to import the search results in some other piece of software. [...]

Full article and comments form