| Title | Attribute-Weighted Aggregation of Tandem Mass Reporter Ion Intensity for Protein Quantification Using Isobaric Labeling |
| Publication Type | Journal Article |
| Year of Publication | 2025 |
| Authors | Tan, J, Negri, GL, Morin, GB, Chen, DDY |
| Journal | Journal of Proteome Research |
| Volume | 24 |
| Pagination | 4875-4887 |
| Abstract | Isobaric labeling is a commonly used technique in proteomics. In bottom-up proteomics, protein abundance estimation requires combining reporter ion intensities from the corresponding peptide-spectrum matches (PSMs), a process referred to as aggregation. It is usually assumed that PSMs in this step represent protein abundance equally, but the differences in ionizability and propensity for isolation interference result in different levels of quantitative accuracy for PSMs. This work developed an attribute-weighted aggregation (AWA) method that considers PSM attributes with reporter ion intensities to provide a more accurate estimate of protein abundance. A random forest model was trained on the characteristics of PSMs using three spike-in data sets and used to predict the quantitative inaccuracy of PSMs to be aggregated based on their attributes. These PSMs were aggregated to the protein level based on the predicted inaccuracy. AWA was evaluated using the three spike-in data sets and applied to two large cancer cohorts. The results showed that applying AWA to different data sets can lead to better recall in differential expression analyses while maintaining high precision. To facilitate the application of AWA, an R package AWAggregator was developed, which also offers functions to retrain the random forest model for additional or alternative spike-in data sets. |
| URL | https://doi.org/10.1021/acs.jproteome.4c00992 |
| DOI | 10.1021/acs.jproteome.4c00992 |