📑

학술

arXiv 등 학술 논문. CC-BY 라이선스로 자유 재사용 가능 — 출처표시 시 상업 사용 OK.

총 423건

Reinforcement learning for policymaking in epidemic control: A scoping review

by Oleksandr Bolshov, Dmytro Chumachenko Background Managing an epidemic demands policies that respond at the pace of the outbreak. Conventional rule‑based interventions struggle to keep up, prompting interest in reinforcement learning (RL) for designing non‑pharmaceutical interventions (NPIs). However, current evidence is fragmented across diverse models and reporting styles. Objectives To systematically map how RL is applied for epidemic NPI design, describe modeling choices, algorithm architectures, evaluation practices, and identify trends and research gaps. Methods Peer-reviewed studies (2014–2025, English) that applied deep RL to select NPIs were retrieved from IEEE Xplore, ACM Digital Library, ScienceDirect, and Scopus, searched on December 23, 2025. Reference list scanning supplemented database results. Predefined data items (bibliographic details, epidemic and RL model characteristics, experiments, validation methods, outcomes) were charted and summarized descriptively. Results Of 512 retrieved records, 10 met the inclusion criteria, and three additional studies were identified via reference-list scanning, yielding 13. Five employed value‑based methods, four policy‑gradient, and four hybrid; one study additionally incorporated model-based planning. Six simulations relied on compartmental models, six on agent‑based models, and one on a hybrid model. Action spaces were predominantly discrete restriction levels. Five studies incorporated sequence-modeling techniques to include temporal context into a state space. Eleven studies designed reward functions as a trade-off between pandemic severity and socio-economic cost. According to the reviewed studies, RL policies across various settings outperform heuristic, rule-based, and historical baselines in reducing infections, deaths, or lockdown duration while limiting economic loss. Conclusions RL shows promise for adaptive epidemic control. Comparison is hampered by simplified economic costs, inconsistent calibration rigor, varied evaluation metrics, and limited uncertainty or policy robustness analysis. Future work should establish common benchmark environments and reporting standards, incorporate empirically grounded economic and behavioral models, adopt uncertainty-aware and probabilistic RL, develop more sophisticated control spaces, investigate more advanced algorithms, and validate learned policies prospectively to enable real-world deployment.

학술

Reinforcement learning for policymaking in epidemic control: A scoping review

Retinal microvascular alterations in children with amblyopia

Perspective of Turkish society toward autistic individuals: Personal experiences, knowledge, and interaction comfort

Expression of melanoma differentiation–associated gene 5 in the epidermis and cutaneous deposition of complement C3 and immunoglobulins in patients with dermatomyositis

Correction: Salidroside protects against high-altitude hypoxia-induced kidney injury via regulation of renal dopamine D1-like receptors

Correction: The additive effect of IgE-mediated and pseudoallergic hypersensitivity in RBL-2H3 cells and guinea pigs

Association between hypnotic medication use and in-hospital falls among older adults: A multicenter landmark analysis

Spurious effects in random-intercept cross-lagged panel models: Results from simulations and reanalyses of data on self-esteem and problematic eating behaviors used by Beckers et al. (2023)

Correction: Resilience of the gelatinous zooplankton species <i>Oikopleura dioica</i> to ocean alkalinity enhancement

Advanced glycation end product accumulation was associated with renal function impairment in males in large health examination population

Topological data analysis for predicting disease outbreaks in humanitarian settings: A machine learning approach

J-shaped relationship between stress hyperglycemia ratio and delirium risk in critically ill patients: A population-based study

RNA metagenomic profiling of mosquito viromes associated with Vector-Borne diseases in Quebec, Canada

Focusing on legal cases: Automatic classification of legal documents with sentence embeddings and deep learning models

Single-cell profiling of kinase substrate phosphorylation by single-molecule imaging

Navigating the digital era: The impact of digitalization and work-life harmony on well-being among solo self-employed individuals

Comparative impact of insect growth regulators on mortality and development of <i>Amrasca biguttula</i> (Hemiptera: Cicadellidae)

The application of large language models in bariatric surgery: A scoping review

Clinical performance of the BioFire Blood Culture Identification 2 panel for microorganism species identification and resistance gene detection in blood culture-positive specimens

Identifying key factors in building fires: A novel approach fusing K-shell entropy gravity

학술

Reinforcement learning for policymaking in epidemic control: A scoping review

Retinal microvascular alterations in children with amblyopia

Perspective of Turkish society toward autistic individuals: Personal experiences, knowledge, and interaction comfort

Expression of melanoma differentiation–associated gene 5 in the epidermis and cutaneous deposition of complement C3 and immunoglobulins in patients with dermatomyositis

Correction: Salidroside protects against high-altitude hypoxia-induced kidney injury via regulation of renal dopamine D1-like receptors

Correction: The additive effect of IgE-mediated and pseudoallergic hypersensitivity in RBL-2H3 cells and guinea pigs

Association between hypnotic medication use and in-hospital falls among older adults: A multicenter landmark analysis

Spurious effects in random-intercept cross-lagged panel models: Results from simulations and reanalyses of data on self-esteem and problematic eating behaviors used by Beckers et al. (2023)

Correction: Resilience of the gelatinous zooplankton species <i>Oikopleura dioica</i> to ocean alkalinity enhancement

Advanced glycation end product accumulation was associated with renal function impairment in males in large health examination population

Topological data analysis for predicting disease outbreaks in humanitarian settings: A machine learning approach

J-shaped relationship between stress hyperglycemia ratio and delirium risk in critically ill patients: A population-based study

RNA metagenomic profiling of mosquito viromes associated with Vector-Borne diseases in Quebec, Canada

Focusing on legal cases: Automatic classification of legal documents with sentence embeddings and deep learning models

Single-cell profiling of kinase substrate phosphorylation by single-molecule imaging

Navigating the digital era: The impact of digitalization and work-life harmony on well-being among solo self-employed individuals

Comparative impact of insect growth regulators on mortality and development of <i>Amrasca biguttula</i> (Hemiptera: Cicadellidae)

The application of large language models in bariatric surgery: A scoping review

Clinical performance of the BioFire Blood Culture Identification 2 panel for microorganism species identification and resistance gene detection in blood culture-positive specimens

Identifying key factors in building fires: A novel approach fusing K-shell entropy gravity