Transforming peptide hormone prediction: The role of AI in modern proteomics
Omics Studies in CKD: Diagnostic Opportunities and Therapeutic Potential
Omics technologies have significantly advanced the prediction and therapeutic approaches for chronic kidney disease (CKD) by providing comprehensive molecular insights. This is a review of the current state and future prospects of integrating biomarkers into the clinical practice for CKD, aiming to improve patient outcomes by targeted therapeutic interventions. In fact, the integration of genomic, transcriptomic, proteomic, and metabolomic data has enhanced our understanding of CKD pathogenesis and identified novel biomarkers for an early diagnosis and targeted treatment. Advanced computational methods and artificial intelligence (AI) have further refined multi-omics data analysis, leading to more accurate prediction models for disease progression and therapeutic responses. These developments highlight the potential to improve CKD patient care with a precise and individualized treatment plan .
Recent Advances in Labeling-Based Quantitative Glycomics: From High-Throughput Quantification to Structural Elucidation
Glycosylation, a crucial posttranslational modification (PTM), plays important roles in numerous biological processes and is linked to various diseases. Despite its significance, the structural complexity and diversity of glycans present significant challenges for mass spectrometry (MS)-based quantitative analysis. This review aims to provide an in-depth overview of recent advancements in labeling strategies for N-glycomics and O-glycomics, with a specific focus on enhancing the sensitivity, specificity, and throughput of MS analyses. We categorize these advancements into three major areas: (1) the development of isotopic/isobaric labeling techniques that significantly improve multiplexing capacity and throughput for glycan quantification; (2) novel methods that aid in the structural elucidation of complex glycans, particularly sialylated and fucosylated glycans; and (3) labeling techniques that enhance detection ionization efficiency, separation, and sensitivity for matrix-assisted laser desorption/ionization (MALDI)-MS and capillary electrophoresis (CE)-based glycan analysis. In addition, we highlight emerging trends in single-cell glycomics and bioinformatics tools that have the potential to revolutionize glycan quantification. These developments not only expand our understanding of glycan structures and functions but also open new avenues for biomarker discovery and therapeutic applications. Through detailed discussions of methodological advancements, this review underscores the critical role of derivatization methods in advancing glycan identification and quantification.
In-Depth Proteome Profiling of the Hippocampus of LDLR Knockout Mice Reveals Alternation in Synaptic Signaling Pathway
The low-density lipoprotein receptor (LDLR) is a major apolipoprotein receptor that regulates cholesterol homeostasis. LDLR deficiency is associated with cognitive impairment by the induction of synaptopathy in the hippocampus. Despite the close relationship between LDLR and neurodegenerative disorders, proteomics research for protein profiling in the LDLR knockout (KO) model remains insufficient. Therefore, understanding LDLR KO-mediated differential protein expression within the hippocampus is crucial for elucidating a role of LDLR in neurodegenerative disorders. In this study, we conducted first-time proteomic profiling of hippocampus tissue from LDLR KO mice using tandem mass tag (TMT)-based MS analysis. LDLR deficiency induces changes in proteins associated with the transport of diverse molecules, and activity of kinase and catalyst within the hippocampus. Additionally, significant alterations in the expression of components in the major synaptic pathways were found. Furthermore, these synaptic effects were verified using a data-independent acquisition (DIA)-based proteomic method. Our data will serve as a valuable resource for further studies to discover the molecular function of LDLR in neurodegenerative disorders.
Review and Practical Guide for Getting Started With Single-Cell Proteomics
Single-cell proteomics (SCP) has advanced significantly in recent years, with new tools specifically designed for the preparation and analysis of single cells now commercially available to researchers. The field is sufficiently mature to be broadly accessible to any lab capable of isolating single cells and performing bulk-scale proteomic analyses. In this review, we highlight recent work in the SCP field that has significantly lowered the barrier to entry, thus providing a practical guide for those who are newly entering the SCP field. We outline the fundamental principles and report multiple paths to accomplish the key steps of a successful SCP experiment including sample preparation, separation, and mass spectrometry data acquisition and analysis. We recommend that researchers start with a label-free SCP workflow, as achieving high-quality and quantitatively accurate results is more straightforward than label-based multiplexed strategies. By leveraging these accessible means, researchers can confidently perform SCP experiments and make meaningful discoveries at the single-cell level.
Special Issue on "Metaproteomics and meta-omics perspectives to decrypt Microbiome Functionality"
Parallel Analyses by Mass Spectrometry (MS) and Reverse Phase Protein Array (RPPA) Reveal Complementary Proteomic Profiles in Triple-Negative Breast Cancer (TNBC) Patient Tissues and Cell Cultures
High-plex proteomic technologies have made substantial contributions to mechanism studies and biomarker discovery in complex diseases, particularly cancer. Despite technological advancements, inherent limitations in individual proteomic approaches persist, impeding the achievement of comprehensive quantitative insights into the proteome. In this study, we employed two widely used proteomic technologies, mass spectrometry (MS) and reverse phase protein array (RPPA) to analyze identical samples, aiming to systematically assess the outcomes and performance of the different technologies. Additionally, we sought to establish an integrated workflow by combining these two proteomic approaches to augment the coverage of protein targets for discovery purposes. We used 14 fresh frozen tissue samples from triple-negative breast cancer (TNBC: seven tumors versus seven adjacent non-cancerous tissues) and cell line samples to evaluate both technologies and implement this dual-proteomic strategy. Using a single-step protein denaturation and extraction protocol, protein samples were subjected to reverse-phase liquid chromatography (LC) followed by electrospray ionization (ESI)-mediated MS/MS for proteomic profiling. Concurrently, identical sample aliquots were analyzed by RPPA for profiling of over 300 proteins and phosphoproteins that are in key signaling pathways or druggable targets in cancer. Both proteomic methods demonstrated the expected ability to differentiate samples by groups, revealing distinct proteomic patterns under various experimental conditions, albeit with minimal overlap in identified targets. Mechanism-based analysis uncovered divergent biological processes identified with the two proteomic technologies, capitalizing on their complementary exploratory potential.
Prediction of Plant Resistance Proteins Using Alignment-Based and Alignment-Free Approaches
Plant disease resistance (PDR) proteins are critical in identifying plant pathogens. Predicting PDR protein is essential for understanding plant-pathogen interactions and developing strategies for crop protection. This study proposes a hybrid model for predicting and designing PDR proteins against plant-invading pathogens. Initially, we tried alignment-based approaches, such as Basic Local Alignment Search Tool (BLAST) for similarity search and MERCI for motif search. These alignment-based approaches exhibit very poor coverage or sensitivity. To overcome these limitations, we developed alignment-free or machine learning (ML)-based methods using compositional features of proteins. Our ML-based model, developed using compositional features of proteins, achieved a maximum performance area under the receiver operating characteristic curve (AUROC) of 0.91. The performance of our model improved significantly from AUROC of 0.91-0.95 when we used evolutionary information instead of protein sequence. Finally, we developed a hybrid or ensemble model that combined our best ML model with BLAST and obtained the highest AUROC of 0.98 on the validation dataset. We trained and tested our models on a training dataset and evaluated them on a validation dataset. None of the proteins in our validation dataset are more than 40% similar to proteins in the training dataset. One of the objectives of this study is to facilitate the scientific community working in plant biology. Thus, we developed an online platform for predicting and designing plant resistance proteins, "PlantDRPpred" (https://webs.iiitd.edu.in/raghava/plantdrppred).
Urinary Proteomics and Systems Biology Link Eight Proteins to the Higher Risk of Hypertension and Related Complications in Blacks Versus Whites
Blacks are more prone to salt-sensitive hypertension than Whites. This cross-sectional analysis of a multi-ethnic cohort aimed to search for proteins potentially involved in the susceptibility to salt sensitivity, hypertension, and hypertension-related complications. The study included individuals enrolled in African Prospective Study on the Early Detection and Identification of Cardiovascular Disease and Hypertension (African-PREDICT), Flemish Study of the Environment, Genes and Health Outcomes (FLEMENGHO), Prospective Cohort Study in Patients with Type 2 Diabetes Mellitus for Validation of Biomarkers (PROVALID)-Austria, and Urinary Proteomics Combined with Home Blood Pressure Telemonitoring for Health Care Reform Trial (UPRIGHT-HTM). Sequenced urinary peptides detectable in 70% of participants allowed the identification of parental proteins and were compared between Blacks and Whites. Of 513 urinary peptides, 300 had significantly different levels among healthy Black (n = 476) and White (n = 483) South Africans sharing the same environment. Analyses contrasting 582 Blacks versus 1731 Whites, and Sub-Saharan Blacks versus European Whites replicated the findings. COL4A1, COL4A2, FGA, PROC, MGP, MYOCD, FYXD2, and UMOD were identified as the most likely candidates underlying the racially different susceptibility to salt sensitivity, hypertension, and related complications. Enriched pathways included hemostasis, platelet activity, collagens, biology of the extracellular matrix, and protein digestion and absorption. Our study suggests that MGP and MYOCD being involved in cardiovascular function, FGA and PROC in coagulation, FYXD2 and UMOD in salt homeostasis, and COL4A1 and COL4A2 as major components of the glomerular basement membrane are among the many proteins potentially incriminated in the higher susceptibility of Blacks compared to Whites to salt sensitivity, hypertension, and its complication. Nevertheless, these eight proteins and their associated pathways deserve further exploration in molecular and human studies as potential targets for intervention to reduce the excess risk of hypertension and cardiovascular complications in Blacks versus Whites.
Integrative Proteomic and Phosphoproteomic Profiling Reveals the Salt-Responsive Mechanisms in Two Rice Varieties (Oryza Sativa subsp. Japonica and Indica)
Salinity stress induces ionic and osmotic imbalances in rice plants that in turn negatively affect the photosynthesis rate, resulting in growth retardation and yield penalty. Efforts have, therefore, been carried out to understand the mechanism of salt tolerance, however, the complexity of biological processes at proteome levels remains a major challenge. Here, we performed a comparative proteome and phosphoproteome profiling of microsome enriched fractions of salt-tolerant (cv. IR73; indica) and salt-susceptible (cv. Dongjin/DJ; japonica) rice varieties. This approach led to the identification of 5856 proteins, of which 473 and 484 proteins showed differential modulation between DJ and IR73 sample sets, respectively. The phosphoproteome analysis led to the identification of a total of 10,873 phosphopeptides of which 2929 and 3049 phosphopeptides showed significant differences in DJ and IR73 sample sets, respectively. The integration of proteome and phosphoproteome data showed activation of ABA and Ca signaling components exclusively in the salt-tolerant variety IR73 in response to salinity stress. Taken together, our results highlight the changes at proteome and phosphoproteome levels and provide a mechanistic understanding of salinity stress tolerance in rice.
Proteome integral solubility alteration via label-free DIA approach (PISA-DIA), game changer in drug target deconvolution
Drug protein-target identification in past decades required screening compound libraries against known proteins to determine drugs binding to specific protein. Protein targets used in drug-target screening were selected predominantly used laborious genetic manipulation assays. In 2013, a team led by Pär Nordlund from Karolinska Institutet (Stockholm, Sweden) developed Cellular Thermal Shift Assay (CETSA), a method which, for the first time, enabled the possibility of drug protein-target identification in the complex cellular proteome. High throughput, quantitative mass spectrometry (MS) proteomics appeared as a compatible analytical method of choice to complement CETSA, aka Thermal Protein Profiling assay (TPP). Since the seminal CETSA-MS/ TPP-MS publications, different protein-target deconvolution strategies emerged including Proteome Integral Solubility Alteration (PISA). The work of Emery-Corbin et al. (Proteomics 2024, 2300644), titled Proteome Integral Solubility Alteration via label-free DIA approach (PISA-DIA), introduces Data-Independent Acquisition (DIA) as a quantification method, opening new avenues in drug target-deconvolution field. Application of DIA for target deconvolution offers attractive alternative to widely used data dependent methodology.
(Prote)omics for Superior Management of Kidney and Cardiovascular Disease-A Thought-Provoking Impulse From Nephrology
Chronic kidney disease (CKD) and cardiovascular disease (CVD) are complex conditions often managed by nephrologists. This viewpoint paper advocates for a multi-omics approach, integrating clinical symptom patterns, non-invasive biomarkers, imaging and invasive diagnostics to enhance diagnosis and treatment. Early detection of molecular changes, particularly in collagen turnover, is crucial for preventing disease progression. For instance, urinary proteomics can detect early molecular changes in diabetic kidney disease (DKD), heart failure (HF) and coronary artery disease (CAD), enabling proactive interventions and reducing the need for invasive procedures like renal biopsies. For example, urinary proteomic patterns can differentiate between glomerular and extraglomerular pathologies, aiding in the diagnosis of specific kidney diseases. Additionally, urinary peptides can predict CKD progression and HF development, offering a non-invasive alternative to traditional biomarkers like eGFR and NT-proBNP. The integration of multi-omics data with artificial intelligence (AI) holds promise for personalised treatment strategies, optimizing patient outcomes. This approach can also reduce healthcare costs by minimizing unnecessary invasive procedures and hospitalizations. In conclusion, the adoption of multi-omics and non-invasive biomarkers in nephrology and cardiology can revolutionize disease management, enabling early detection, personalised treatment and improved patient outcomes.
Astronaut proteomics: Japan leads the way for transformative studies in space
CoNglyPred: Accurate Prediction of N-Linked Glycosylation Sites Using ESM-2 and Structural Features With Graph Network and Co-Attention
N-Linked glycosylation is crucial for various biological processes such as protein folding, immune response, and cellular transport. Traditional experimental methods for determining N-linked glycosylation sites entail substantial time and labor investment, which has led to the development of computational approaches as a more efficient alternative. However, due to the limited availability of 3D structural data, existing prediction methods often struggle to fully utilize structural information and fall short in integrating sequence and structural information effectively. Motivated by the progress of protein pretrained language models (pLMs) and the breakthrough in protein structure prediction, we introduced a high-accuracy model called CoNglyPred. Having compared various pLMs, we opt for the large-scale pLM ESM-2 to extract sequence embeddings, thus mitigating certain limitations associated with manual feature extraction. Meanwhile, our approach employs a graph transformer network to process the 3D protein structures predicted by AlphaFold2. The final graph output and ESM-2 embedding are intricately integrated through a co-attention mechanism. Among a series of comprehensive experiments on the independent test dataset, CoNglyPred outperforms state-of-the-art models and demonstrates exceptional performance in case study. In addition, we are the first to report the uncertainty of N-linked glycosylation predictors using expected calibration error and expected uncertainty calibration error.
SWATH-MS Based Secretome Proteomic Analysis of Pseudomonas aeruginosa Against MRSA
The study uses Sequential Window Acquisition of All Theoretical Fragment Ion Mass Spectra (SWATH)-MS in conjunction with secretome proteomics to identify key proteins that Pseudomonas aeruginosa secretes against methicillin-resistant Staphylococcus aureus (MRSA). Variations in the inhibition zones indicated differences in strain resistance. Multivariate statistical methods were applied to filter the proteomic results, revealing five potential protein biomarkers, including Peptidase M23. Gene ontology (GO) analysis and sequence alignment supported their antibacterial activity. Thus, SWATH-MS provides a comprehensive understanding of the secretome of P. aeruginosa in its action against MRSA, guiding future antibacterial research.
Characterization of Trivalently Crosslinked C-Terminal Telopeptide of Type I Collagen (CTX) Species in Human Plasma and Serum Using High-Resolution Mass Spectrometry
With an aging population, the increased interest in the monitoring of skeletal diseases such as osteoporosis led to significant progress in the discovery and measurement of bone turnover biomarkers since the 2000s. Multiple markers derived from type I collagen, such as CTX, NTX, PINP, and ICTP, have been developed. Extensive efforts have been devoted to characterizing these molecules; however, their complex crosslinked structures have posed significant analytical challenges, and to date, these biomarkers remain poorly characterized. Previous attempts at characterization involved gel-based separation methods and MALDI-TOF analysis on collagen peptides directly extracted from bone. However, using bone powder, which is rich in collagen, does not represent the true structure of the peptides in the biofluids as it was cleaved. In this study, our goal was to characterize plasma and serum CTX for subsequent LC-MS/MS method development. We extracted and characterized type I collagen peptides directly from human plasma and serum using a proteomics workflow that integrates preparative LC, affinity chromatography, and HR-MS. Subsequently, we successfully identified numerous CTX species, providing valuable insights into the characterization of these crucial biomarkers.
Proteomics analysis of round and wrinkled pea (Pisum sativum L.) seeds during different development periods
Seed development is complex, influenced by genetic and environmental factors. Understanding proteome profiles at different seed developmental stages is key to improving seed composition and quality. We used label-free quantitative proteomics to analyze round and wrinkled pea seeds at five growth stages: 4, 7, 12, 15, and days after anthesis (DAA), and at maturity. Wrinkled peas had lower starch content (30%) compared to round peas (47%-55%). Proteomic analysis identified 3659 protein groups, with 21%-24% shared across growth stages. More proteins were identified during early seed development than at maturity. Statistical analysis found 735 significantly different proteins between wrinkled and round seeds, regardless of the growth stage. The detected proteins were categorized into 31 functional classes, including metabolic enzymes, proteins involved in protein biosynthesis and homeostasis, carbohydrate metabolism, and cell division. Cell division-related proteins were more abundant in early stages, while storage proteins were more abundant later in seed development. Wrinkled seeds had lower levels of the starch-branching enzyme (SBEI), which is essential for amylopectin biosynthesis. Seed storage proteins like legumin and albumin (PA2) were more abundant in round peas, whereas vicilin was more prevalent in wrinkled peas. This study enhances our understanding of seed development in round and wrinkled peas. The study highlighted the seed growth patterns and protein profiles in round and wrinkled peas during seed development. It showed how protein accumulation changed, particularly focusing on proteins implicated in cell division, seed reserve metabolism, as well as storage proteins and protease inhibitors. These findings underscore the crucial role of these proteins in seed development. By linking the proteins identified to Cameor-based pea reference genome, our research can open avenues for deeper investigations into individual proteins, facilitate their practical application in crop improvement, and advance our knowledge of seed development.