Translation Readiness Index: Measuring Patent-Paper Proximity from Scientific Publication Text
이 뉴스, 어떠셨어요?
한 번의 탭으로 반응을 남겨요 · 로그인 불필요
Abstract
Universities, funders, investors, and policy agencies often need to identify research with translational relevance before patents, licenses, startups, or industry collaborations are visible.
This study introduces the Translation Readiness Index (TRI), a text-based measure evaluating a publication's semantic similarity to papers that appear in high-confidence patent-paper pairs.
Using 20,610 publications from OpenAlex, including 9,431 publications from the Reliance on Science patent-paper pairs data and 11,179 matched comparison publications, we created paper-level 768-dimensional semantic embeddings from titles and abstracts with SPECTER2.
After evaluating four machine learning classifiers, XGBoost achieved the highest ROC-AUC (0.77).
We define TRI as the model-estimated probability that a publication belongs to the patent-paper-paired class.
Linguistic analysis revealed that patent-paired publications more often use an invention-oriented framing, distinct from the observational language of the comparison group.
External validation across University of Western Australia (UWA) publications and leading global universities demonstrated positive associations between high TRI scores and independent translational indicators.
TRI provides a text-based method for identifying translation-ready research, though it should be interpreted as a measure of semantic proximity to patented science rather than a direct measure of realized commercialization.