Big Data from Pharmaceutical Patents: A Computational Analysis of Medicinal Chemists Bread and Butter
Schneider, Nadine, Lowe, Daniel, Sayle, Roger, Tarselli, Mike and Landrum, Gregory (2016) Big Data from Pharmaceutical Patents: A Computational Analysis of Medicinal Chemists Bread and Butter. Journal of Medicinal Chemistry, 59 (9). pp. 4385-4402. ISSN 1520-4804
Abstract
Multiple recent studies have focused on unraveling the content of the medicinal chemists toolbox. Here, we present an investigation of chemical reactions and molecules retrieved from U.S. patents over the past 40 years (1976-2015). We used a sophisticated text-mining pipeline to extract 1.15 million unique whole reaction schemes, including reaction roles and yields, from pharmaceutical patents. The reactions were assigned to well-known reaction types such as Wittig olefination or Buchwald-Hartwig amination using an expert system. Analyzing the evolution of reaction types over time, we observe the previously reported bias toward reaction classes like amide bond formations or Suzuki couplings. Our study also shows a steady increase in the number of different reaction types used in pharmaceutical patents but a trend toward lower median yield for some of the reaction classes. Finally, we found that todays typical product molecule is larger, more hydrophobic, and more rigid than 40 years ago.
Item Type: | Article |
---|---|
Date Deposited: | 26 Jan 2018 00:45 |
Last Modified: | 25 Jan 2019 00:45 |
URI: | https://oak.novartis.com/id/eprint/28285 |