|
Vol 59(2025) N 5 p. 827-835; DOI 10.1134/S0026893325601302 ![]() G.J. Osmak1,2*, M.V. Pisklova1,2 GeneLens: A Python Package Implementing Monte Carlo Machine Learning and Network Analysis Methods for Biomarker Discovery and Gene Functional Annotation 1Chazov National Medical Research Center of Cardiology, Ministry of Health of the Russian Federation, Moscow, 121552 Russia2Pirogov Russian National Research Medical University, Ministry of Health of the Russian Federation, Moscow, 117997 Russia *german.osmak@gmail.com Received - 2025-04-09; Revised - 2025-04-09; Accepted - 2025-04-24 We present GeneLens, a Python package for comprehensive analysis of differentially expressed genes and biomarker discovery. The package consists of two core modules, FSelector for biomarker identification by utilizing Monte Carlo simulations of L1-regularized models and NetAnalyzer for functional prediction of selected gene sets based on the topology of their protein-protein interaction networks. FSelector includes: (1) automated gene selection through iterative bootstrap sampling, (2) calculation of gene significance weights by taking account of ROC-AUC models and their number in simulations, and (3) adaptive thresholding for feature space reduction. NetAnalyzer performs a pathway enrichment analysis while integrating the significance weights from FSelector. Implemented as a PIP module, GeneLens provides standardized algorithms for applying machine learning and network analysis methods in differential gene expression studies, along with automated model hyperparameter tuning and visualization tools. transcriptomics, machine learning, Monte Carlo, biomarkers, differentially expressed genes, network analysis |