Abstract:Objective To screen potential biomarkers of Helicobacter pylori-associated atrophic gastritis (HPAG) using weighted gene co-expression network analysis (WGCNA), and machine learning algorithms.Methods To download the transcriptomic data of gastric tissues containing HPAG and non-Helicobacter pylori (nonHP) infection was from gene expression databases for differential analysis, and perform gene set enrichment analysis (GSEA) on differentially expressed genes (DEGs). WGCNA results and DEGs were integrated to screen HPAG-related genes. Machine learning methods such as least absolute shrinkage and selection operator (LASSO), support vector machine recursive feature elimination (SVM-RFE) and random forest (RF) were utilized to screen potential biomarkers for HPAG, and biomarker expressions were extracted for intergroup comparison.Results A total of 213 DEGs were obtained, which were mainly enriched in signaling pathways such as cholesterol metabolism, digestion and absorption of fat. A machine learning algorithm screened the potential biomarker of AF, S100 calcium-binding protein G (S100G). The expression level of S100G was higher in HPAG samples than in nonHP samples.Conclusion HPAG pathogenesis involves cholesterol metabolism, digestion and absorption of fat, and other signaling pathways. S100G expression was significantly increased in HPAG gastric tissues, which may become a new target for HPAG treatment.