A Strength-Weaknesses-Opportunities-Threats (SWOT) Analysis of Cheminformatics in Natural Product Research

1 Introduction

Small molecule natural products are biosynthesized by biological systems to enable communication and interaction between cells, individuals, and species, serving as repellents, poisons, attractants, and signaling molecules. Owing to their enzymatic biosynthetic origin and specific biological purposes, their chemical structures were designed by evolution to interact with macromolecules such as proteins, lipids, and nucleic acids [1–4]. This is in accordance with the finding of increased hit rates of natural product collections compared to synthetic and combinatorial collections in high-throughput screening campaigns [5, 6]. Analysis of such natural product collections revealed an exceptionally high diversity of molecular structures and properties, such as considerable complexity in molecular shape, stereogenic centers, and ring systems. They cover a broad chemical space, especially biologically relevant space [7–13], as outlined in detail in the chapter “Cheminformatics Explorations of Natural Products” by Medina-Franco et al. in this volume (p. 1).

This makes natural products ideal candidates for drug discovery. Indeed, plants, fungi, and animals were almost the only source for pharmaceutical preparations for a long period of human history. Even with the advent of modern single-molecule medicines, natural products continued to play an important role [14]. A comprehensive analysis by Newman and Cragg points out that still 32% of all small-molecule approved drugs launched between 1981 and 2014 are unaltered natural products or natural product derivatives. Another 32% were inspired by natural products or their pharmacophores [15]. It is therefore tempting to speculate that natural product structures are privileged, possessing particular geometries; for instance, they exhibit a variety of novel, non-flat ring systems suitable for specific side chain substitutions, which are then prone to interact with an array of target proteins [16].

In contrast to the well-recognized high potential of natural products in drug discovery, the research engagement in this field has been dramatically scaled down in major pharmaceutical companies, mainly because it is stigmatized as an expensive endeavor [17]. The process of choosing a suitable biological source, its often limited or restricted access, the successful isolation of single active constituents from complex matrices, and deciphering their molecular structures seem too cumbersome compared to an increasingly automated and straightforward drug discovery process. New technologies like combinatorial chemistry, high-throughput screening using miniaturized and automatized assay batteries, and big data evaluation have triggered a great transformation in drug discovery [18]. The identification of ligands against specific targets as starting points for lead development is of utmost importance in this scientific field [19–21].

A major challenge in drug discovery from natural sources is the identification of single bioactive constituents in order to establish unambiguous cause-effect relationships for later lead development. The classical approach for this task has been the bioassay-guided fractionation of crude or partly purified extracts [22]. Thus, a multicomponent mixture (extract) is separated step by step with subsequent assessment of the biological activities of the fractions obtained, followed by iterative rounds of separation and assaying [23–25]. Ideally, the goal is to end up with a single or a few purified active constituents. This goal is often not achieved because of shortcomings such as insufficient robustness of the bioassays used, potential solute adsorption to the solid phase during chromatographic fractionation, the re-isolation of previously known bioactive compounds, the failure to detect synergistic activity between the components present, and/or the decomposition of the constituents [14, 26].

The reverse path of testing pure natural products after isolation brings up several questions: (1) How to choose the natural starting organism? (2) Which components should be isolated? (3) How to choose a promising target for testing? Some of the most interesting natural products are difficult to isolate and are contained only in small quantities in their natural source. For example, the yield of paclitaxel isolated from the bark of its source plant, Taxus brevifolia, was in the range of 0.01% [27]. Moreover, only 10% of all known natural products can be obtained from commercial suppliers [28], and these sometimes command very high prices. This is one reason their macromolecular targets remain largely unknown [29]. Natural products can be considered too precious for dissecting their potential bioactivities by trial and error, and a rationale to streamline their biological evaluation is needed.

In this context, the application of in silico tools, in particular, virtual screening, has developed as an important strategy in natural product research for the prediction of ligand-target interactions and for rationalizing their bioactivity or even efficacy on a molecular level. Computational models can be created based upon already available information for the system under investigation and used to make predictions on new events. Without question, cheminformatics-based techniques are nowadays increasingly vital and substantial parts of modern-day drug discovery in medicinal chemistry, in both industry and academia. Their impact in natural product research is also increasing and has been reviewed elsewhere [30–32].

Here, we provide a comprehensive analysis of the strengths, weaknesses, opportunities, and threats (SWOT) of cheminformatics tools in natural product research. The analysis will provide a guide to facilitate their concatenation on the basis of past research projects, and aims to indicate gaps and caveats that exist. Therefore, the outcome of this analysis should give insight into strategic steps for further advances toward the combined use of cheminformatics and natural products drug discovery, to cope expediently with the challenges and opportunities in these two promising and prolific research areas.

2 S: Strengths of Cheminformatics in Natural Product Research

Cheminformatics is the use of computational and informational tools to understand and solve problems in the field of chemistry, particularly drug lead identification and optimization. The intended goal is to make better decisions faster [33]. In particular, virtual screening, which is the use of computational algorithms and models for the identification of bioactivities, has huge potential for more extensive application in natural product research [34, 35].

The implementation of cheminformatic tools can circumvent some of the costly and time-consuming bottlenecks that hamper drug discovery from natural sources. From a pharmacognostic perspective, the prediction of molecular properties and of possible targets, but also antitargets, of secondary metabolites may be extremely useful to streamline experimental efforts, and hence to accelerate research and development projects. The scarce availability of isolated test materials demands in silico predictions to unravel the molecular modes of action of natural products and to deploy a rationale in lead development [32, 36–38]. From a cheminformatics perspective, virtual screening of collections consisting of fewer, but more sophisticated chemical entities, which are designed by evolution to interact specifically with macromolecular targets, rather than large synthetic molecule collections, can be a straightforward and prolific approach for the identification of novel lead compounds. The exploitation of natural product chemistry to implement Nature’s privileged structures and chemical traits into synthetic compound repositories is another important topic [39–41].

A retrospective analysis of research from the past two decades shows that the concatenation of cheminformatics and natural product research rests on certain prerequisites. These have gained substantial input and development in recent years and can therefore be categorized as strengths. They refer to:

1. The availability of, and access to, data on the system under investigation, together with the ability to obtain reliable data
2. Natural product collections, including their annotation with metadata, their curation, and a well-analyzed content
3. The availability and applicability of cheminformatic tools for the handling of natural products, and of specialized software and methods for event prediction

The following sections will provide more insight into the tools and most important databases available or literature dealing with this topic, without intending to provide a complete account.

2.1 Availability and Access to Data

A computational model’s predictive power can be correlated roughly to the state of knowledge for the system it describes. Access to resources such as chemical databases, bioactivity collections, and biological data, together with a viable linkage and curation of these data, is required to perform successful projects [42–44]. Many of these resources are deposited and freely accessible. Chemical molecular databases with close to a billion virtual molecular entities have been established [44]. In 2017, four big chemical databases, PubChem, ChemSpider, SciFinder, and UniChem, compiled 95, 63, 134, and 154 million chemical structure records, respectively [45].

Biological and biomedical data stored in publicly available bioactivity databases provide a huge amount of detailed information on chemical entities in combination with target proteins, quantitative binding, and bioactivity values. The ChEMBL database [46–48] connects 1.8 million 2D drug-like small molecule structure records with 12,000 molecular targets and 15.2 million bioactivities in an easily accessible interface. The data are derived mainly from seven medicinal chemistry journals (Bioorganic and Medicinal Chemistry Letters, Journal of Medicinal Chemistry, Bioorganic and Medicinal Chemistry, Journal of Natural Products, European Journal of Medicinal Chemistry, ACS Medicinal Chemistry Letters, MedChemComm) and selected articles from 200 journals and certain patents [48]. PubChem has compiled 239.6 million bioactivities for 3.4 million molecules, mainly from high-throughput screening experiments [49, 50]. Chemical patents represent another rich resource of chemical and biomedical information. The SureChEMBL database aims to make the chemistry annotations of US, EP, WO, and JP patents available in a searchable interface. However, the connected biomedical data are not annotated [51, 52]. A smaller but highly curated database is DrugBank, with 12,000 chemical entries focusing on drugs and related molecules like nutraceuticals. Drug targets, pathways, indications, and other pharmacological information are provided [53–55]. A large and comprehensive biomedical database of natural products does not yet exist. The Protein Data Bank (PDB) is a valuable resource for 3D information on biological macromolecules. It archives 144,000 experimentally determined structures and their complexes with metals, co-factors, crystal water, and small-molecule ligands [56, 57].

Table 1 summarizes the most important freely accessible databases of biomedical and biological information useful in cheminformatics. A more detailed list has been compiled in [58]. It should be noted that the quality of information in the databases differs due to diverse data sources, data acquisition procedures, and curation efforts.

Table 1

Biomedical databases

Database	Content	Size	References
BindingDB	Experimental protein-small molecule binding affinities	1.2 million binding data for 55,000 proteins and 520,000 drug-like molecules	[59]
ChEMBL	Data compiled from literature, PubChem, and SureChEMBL	1.8 million drug-like small molecules; 15.2 million bioactivities	[46–48]
DrugBank	Highly curated drug data combined with drug target, pathway, indication, and other pharmacological information	12,000 nutraceuticals, approved and experimental drugs	[53–55]
DUD.E	Active compounds and target affinities; includes widely used decoys for virtual screening	22,886 actives; 102 targets; 50 decoys for each active	[60, 61]
GLASS	Manually curated repository for experimentally validated GPCR-ligand interactions	342.5 million ligands; 3 million GPCR targets	[62]
GOSTAR	Manually curated SAR database	6.6 million inhibitors; 22 million quantitative SAR points	[63]
OCHEM	ADME data	2.8 million property records	[64, 65]
PDBbind	Binding affinities of PDB entries	11,000 binding affinities	[60]
PubChem	Chemical database with bioactivity data from HTS assays	63 million molecules; 239.6 million bioactivities compiled for 3.4 million molecules	[49, 50]
Binding MOAD	High-quality PDB subset of ligand-protein complexes	33,000 structures	[67, 68]
PDB	Databank of experimentally determined structures of proteins, nucleic acids, and complex assemblies	144,000 experimentally determined macromolecule structures	[56, 57]
SMPDB	Interactive and visual small molecule pathway database	30,000 human pathways	[69, 70]
TTD	Database of therapeutic targets	3000 targets	[71]

DUD.E database of useful decoys, GLASS GPCR-ligand association database, GOSTAR global online structure-activity relationship database, GPCR G-protein-coupled receptor, MOAD mother of all databases, OCHEM online chemical database, PDB protein databank, SAR structure-activity relationship, SMPDB small molecule pathway database, TTD therapeutic target database

Chemical, biomedical, and other life science data can be estimated to grow further in the future as the integration of chemical information from multiple sources and analytical techniques, extracting and mining information from journal articles and patents is still improving. Collaborative efforts and the commitment to make generated data available in the public domain will stimulate this development.

2.2 Natural Product Collections

A prerequisite of conducting cheminformatics in natural product research is the existence of stereochemically well-defined molecules. Appropriate commercial and also free natural product databases are available. These important resources have been reviewed several times [28, 72–77].

The most comprehensive database is the Dictionary of Natural Products (DNP) with currently 260,000 natural products. Trivial names, physicochemical properties, and toxicity data are supplied. For pharmaceutical biologists the information on biological sources and experimental properties such as UV spectra and dissociation constants can be very useful. Caution is advised when the database is used for 3D applications, because the stereochemistry is not annotated in the 2D connection tables. The database was built manually by a team of academics and freelancers, who reconcile errors and ensure high-quality data [28, 78]. Although this database is comprehensive and well curated and covers a large chemical diversity, its availability only on a commercial basis hampers its broader use by the interested scientific community.

Alternatives are free virtual natural product collections, such as the Universal Natural Product Database (UNPD), the TCM database@Taiwan, NPCARE, and the NuBBE database; all of these have been made available free of charge [79–83]. Chen et al. recently analyzed the content of natural product collections and observed a large overlap (108,000 molecules) of free virtual natural product collections with the DNP [28]. A thorough survey on natural product resources and their characteristics is provided in the chapter “Resources for Chemical, Biological, and Structural Data on Natural Products” by Kirchmair et al. in this volume (p. 37).
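The kind of overlap analysis mentioned above can be illustrated with a minimal sketch. It assumes that every molecule in each collection has already been reduced to a canonical structure identifier (for instance an InChIKey); the function name, the record layout, and the identifiers below are hypothetical and are not taken from the cited study.

```python
def collection_overlap(collection_a, collection_b):
    """Compare two collections given as iterables of canonical structure identifiers.

    Returns the shared identifiers and the fraction of each collection they represent.
    """
    a, b = set(collection_a), set(collection_b)
    shared = a & b
    return {
        "shared": shared,
        "fraction_of_a": len(shared) / len(a) if a else 0.0,
        "fraction_of_b": len(shared) / len(b) if b else 0.0,
    }


# Hypothetical placeholder identifiers standing in for structure keys
free_collection = ["KEY-001", "KEY-002", "KEY-003", "KEY-004"]
commercial_collection = ["KEY-003", "KEY-004", "KEY-005"]

result = collection_overlap(free_collection, commercial_collection)
print(sorted(result["shared"]), result["fraction_of_a"])
```

Comparing identifiers rather than names sidesteps the problem of synonymous trivial names, but the outcome still depends on how stereochemistry and tautomers were encoded when the identifiers were generated.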

The use of cheminformatics tools to select natural products and natural product-like compounds from large chemical (e.g., PubChem [50]), biomedical (e.g., ChEMBL [47], PDB [57]), or commercial vendor databases (e.g., ZINC [84], Aldrich Market Select [85]) would be a worthwhile strategy. Several tools able to identify natural products in large molecule sets have been developed. They are based on different methods such as rule-based approaches, similarity measurements of structural space, or connectivity fingerprints [86–91]. Recently, a random forest classifier with high accuracy was made available in a free online tool [92].

The diligent exploitation of natural product resources from widely unexplored organisms from different niches of our globe and the closer examination of already investigated marine and terrestrial organisms by advanced technical means will continue to extend the diversity and coverage of natural product collections. The exchange of virtual physically available collections between cooperation partners has been suggested to increase the access to natural products [93]. Efforts to compile, annotate, analyze, and finally enable their availability to a broad community will lead to an increasingly valuable resource for future drug discovery.

2.3 Applicability of Cheminformatics Tools

As summarized earlier, scientists have to learn from the vast amount of biomedical data generated and made available via data-sharing platforms. However, it is indisputable that the amount of data is far beyond traditional analysis and learning [94]. To create predictive cheminformatic models from big data, various approaches have been established ranging from comprehensive similarity measurements (e.g., pharmacophore, shape-based approaches, physicochemical property comparison) to complex molecular docking and sophisticated machine learning approaches (e.g., self-organizing maps). The basic concepts underlying these methods have been reviewed elsewhere [30–32, 95].
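As an illustration of the simplest family of methods named above, similarity measurements, the following sketch ranks a small library against a query using the Tanimoto coefficient. The Tanimoto coefficient is chosen here as an assumption, since the text does not name a specific metric. Fingerprints are represented as sets of "on" bit positions, and all compound names and bit values are hypothetical.

```python
def tanimoto(fp_a, fp_b):
    """Tanimoto coefficient of two fingerprints given as sets of 'on' bit positions."""
    a, b = set(fp_a), set(fp_b)
    union = a | b
    if not union:
        return 0.0  # two empty fingerprints: define similarity as 0
    return len(a & b) / len(union)


def rank_by_similarity(query_fp, library):
    """Return (name, similarity) pairs sorted from most to least similar."""
    scored = [(name, tanimoto(query_fp, fp)) for name, fp in library.items()]
    return sorted(scored, key=lambda item: item[1], reverse=True)


# Hypothetical fingerprints; real ones would come from a cheminformatics toolkit
query = {1, 4, 7, 9}
library = {
    "cmpd_A": {1, 4, 7, 9},
    "cmpd_B": {1, 4, 8},
    "cmpd_C": {2, 3},
}

for name, score in rank_by_similarity(query, library):
    print(f"{name}: {score:.2f}")
```

In practice, the ranked list would be cut at a similarity threshold or a fixed number of top-scoring compounds before any experimental follow-up.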

Notably, most virtual screening, binding pose prediction, and target fishing approaches have been shown to be applicable to natural products as well. As the examples presented in Tables 2, 3, and 4 show, previous studies have frequently been carried out with user-friendly, easily comprehensible in silico tools. Three-dimensional pharmacophore alignments, e.g., with Catalyst or LigandScout, and molecular docking, e.g., with Autodock Vina or Glide, offer an intuitive interface and allow easy use also by scientists not specialized in cheminformatics. These methods are already well-established and have demonstrated solid performance as shown by many successful projects [98, 99, 101, 123, 126, 134, 137, 138]. Most studies have combined different methods such as molecular docking and molecular dynamic simulations for the prediction of binding modes [100] or shape and molecular docking for virtual screening [136]. Further cheminformatic approaches, such as artificial neural networks, have increasingly gained importance, especially for target and activity prediction [29, 36, 107], and also in qualitative virtual screening experiments [129, 142, 151] (see the chapters “The Pharmacophore Concept and Its Applications in Computer-Aided Drug Design” and “Cheminformatic Analysis for Natural Product Fragments” by Langer and Reker, this volume (p. 97 and 141)).

Table 2

Approaches for the prediction of natural product binding modes

Technique	Software	Target^a	Target class^b	Examples
Molecular docking	Ligandfit	HR	V	[96]
	GOLD	NA	V	[97]
		5-LOX	E	[98]
		COX-2	E	[98]
		11β-HSD1	E	[99]
	Glide	MD-2	PPI	[100]
	Glide	5-HT_2C	GPCR	[101]
	MOE	PPARγ	TF	[102]
	Autodock	AChE	E	[103]
	Autodock Vina	NF-κB	TF	[98]
	CDOCKER	PPARγ	TF	[104]
Molecular dynamic simulation	AMBER	NA	V	[97]
		MD-2	AG	[100]
		DNA	DNA	[105]
		AChE	E	[103]
	NAMD	NF-κB	TF	[98]

^aTarget abbreviations: 11β-HSD1 11β-hydroxysteroid dehydrogenase type 1, 5-LOX 5-lipoxygenase, 5-HT_2C 5-hydroxytryptamine_2C receptor, AChE acetylcholinesterase, COX-2 cyclooxygenase-2, DNA deoxyribonucleic acid, HR human rhinovirus coat protein, MD-2 lymphocyte antigen 96, NA neuraminidase, NF-κB nuclear factor kappa-light-chain-enhancer of activated B cells, PPARγ peroxisome proliferator-activated receptor gamma

^bTarget class abbreviations: AG antigen, DNA deoxyribonucleic acid, E enzyme, GPCR G-protein coupled receptor, PPI protein-protein interaction, TF transcription factor, V viral protein

Table 3

Different approaches for the prediction of natural product molecular targets

Technique	Strategy/software	Examples
Artificial neural networks	Self-organizing maps, e.g. [106]	[29, 36, 107]
Hierarchical clustering	Based on in silico retrobiosynthesis [108]	[109]
Virtual parallel screening	Ligandprofiler, PipelinePilot, Ligandscout, Catalyst	[95, 110, 111]
Reverse docking	Autodock Vina	[112, 113]

Table 4

Different/complementary virtual screening approaches applied to natural products

Approach	Strategy/software	Target^a	Target class^b	Examples
Pharmacophore-based virtual screening	Catalyst	AChE	E	[114]
		COX-1, COX-2	E	[115, 116]
		HR	V	[96]
		hERG	IC	[117, 118]
		FXR	TF	[119, 120]
		cPLA₂α	E	[121]
		mPGES-1	E	[122]
		IKK-β	E	[98]
		mGlu	GPCR	[123]
		PrPC	V	[124]
	Ligandscout	AChE	E	[114]
		hERG	IC	[117, 118]
		GPBAR1	GPCR	[125]
		11β-HSD1	E	[99, 126]
		PPARγ	TF	[127]
		CETP	LTP	[128]
	PharmaGIST	AMA1-RON2	PPI	[129]
	MOE	PPARγ	TF	[102]
	MOE	TbGAPDH	E	[130]
2D similarity search	chemGPS [7]	Antichlamydial	–	[131]
2D similarity search	Connectivity fingerprints	FXR	TF	[132]
3D similarity search	ROCS	GPBAR1	GPCR	[125]
		NA	V	[133]
		IKK-β	E	[134]
	SQUIRREL	mPGES-1	E	[135]
	Phase	HIV-1 RT	V	[136]
Molecular docking	Autodock	Complex III	E	[137, 138]
		NA	V	[139]
		AMPK	E	[140]
	Autodock Vina	ROCK1	E	[141]
	Autodock Vina	Complex III	E	[137, 138]
	GOLD	AChE	E	[142]
	GOLD	CK2	E	[143]
	Glide	HIV-1 RT	V	[136]
		CK2	E	[143]
		FXR	TF	[132]
		PPARγ	TF	[144]
		Sirt1	E	[145]
		ACE	E	[146]
	LigandFit	mGlu	GPCR	[123]
	CDOCKER	PrPC	V	[124]
	MOE	CK2	E	[143]
	MOE	TbGAPDH	E	[130]
	Molsoft	TNF-α	PPI	[147]
	Molsoft	DNA	DNA	[148]
	DOCK	AMPK	E	[140]
	DOCK	ROCK1	E	[141]
QSAR	GP regression^c	IRF-7	TF	[149]
QSAR	Multiple linear regression	Antitrypanosomal	–	[150]
Machine learning	Self-organizing maps	AChE	E	[142]
	Random forest classifier	AMA1-RON2	PPI	[129]
	GP regression^c	PPARγ	TF	[151]

^aTarget abbreviations: 11β-HSD1 11β-hydroxysteroid dehydrogenase type 1, 5-LOX 5-lipoxygenase, ACE angiotensin-converting enzyme, AChE acetylcholinesterase, AMA1 apical membrane antigen 1, AMPK 5′ AMP-activated protein kinase, CETP cholesteryl ester transfer protein, CK2 casein kinase 2, Complex III coenzyme Q-cytochrome c-oxidoreductase, COX-1 cyclooxygenase-1, COX-2 cyclooxygenase-2, cPLA₂α cytosolic phospholipase A2α, DNA deoxyribonucleic acid, FXR farnesoid X receptor, GPBAR1 G protein-coupled bile acid receptor, hERG human ether-à-go-go-related gene potassium ion channel, HIV-1 RT human immunodeficiency virus type 1 reverse transcriptase, HR human rhinovirus coat protein, IKK-β inhibitor of nuclear factor kappa-B kinase subunit beta, IRF-7 interferon regulatory factor 7, mPGES-1 microsomal prostaglandin E synthase-1, MD-2 lymphocyte antigen 96, NA neuraminidase, NF-κB nuclear factor kappa-light-chain-enhancer of activated B cells, mGlu metabotropic glutamate receptor, PPARγ peroxisome proliferator-activated receptor gamma, PrPC cellular prion protein, ROCK1 Rho-associated protein kinase, RON2 rhoptry neck protein 2, Sirt1 NAD-dependent deacetylase sirtuin-1, TbGAPDH Trypanosoma brucei glyceraldehyde-3-phosphate dehydrogenase, TNF-α tumor necrosis factor ligand superfamily member 2

^bTarget class abbreviations: DNA deoxyribonucleic acid, E enzyme, GPCR G protein-coupled receptor, IC ion channel, LTP lipid transfer protein, PPI protein-protein interaction, TF transcription factor, V viral protein

^cGaussian process regression

Perhaps the most important cheminformatics application for natural product researchers is the prediction of molecular targets as thoroughly reviewed in the chapter “A Toolbox for the Identification of Modes of Action of Natural Products” provided by Rodrigues et al. (this volume, p. 73). Besides virtual target fishing of new isolates, it can help to fast forward the rationalization of traditionally used herbal remedies, the prediction of side effects, and the profiling of polypharmacologic actions [29, 30, 110, 112, 152]. The experimental validation of the target-predicting approaches is usually demonstrated on single molecules or only on few examples rather than on a large set of natural products [36, 108, 113], mainly owing to the major effort necessary for experimental testing and the limited physical availability of compounds.

The benefit of experimental testing based on virtual predictions compared to serendipitous experimental screening was demonstrated convincingly by Doman et al. [153]. Their random in vitro screening for protein tyrosine phosphatase inhibitors revealed a hit rate of 0.02%, while assaying the virtually predicted hits yielded a hit rate of 34.8%. In general, the first evaluation of virtual hits does not require any physically available material but requires a critical check on various parameters before compounds are selected as candidates for experimental testing, e.g., availability; isolation efforts; physicochemical parameters referring to PAINS or inappropriate absorption, distribution, metabolism, excretion, and toxicity (ADMET); reported toxicity; and reliability of predictions [30, 72]. Rare biological material and precious isolates can be saved, and fewer bioassays are needed for the identification of active hits.
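The two hit rates quoted above can be put in relation with a short calculation. The sketch below computes the enrichment of the virtually guided screen over the random screen; only the two percentages come from the text, while the function name is an illustrative choice.

```python
def enrichment_factor(hit_rate_selected, hit_rate_random):
    """Ratio of the hit rate in a preselected set to the hit rate of random screening."""
    if hit_rate_random <= 0:
        raise ValueError("random hit rate must be positive")
    return hit_rate_selected / hit_rate_random


# Hit rates as reported in the text, converted from percent to fractions
random_screening_rate = 0.02 / 100
virtual_screening_rate = 34.8 / 100

ef = enrichment_factor(virtual_screening_rate, random_screening_rate)
print(f"Enrichment over random screening: about {ef:.0f}-fold")
```

With these values the enrichment is roughly 1740-fold, which is why far fewer compounds need to be assayed to find the same number of actives.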

Computer-aided techniques have been shown to be applicable to many natural product scaffolds such as polyketides [109], alkaloids [37, 118], coumarins [111, 125], flavonoids [133], and sesqui- and triterpenes [99, 126, 150], and they have been used to make predictions on many biological drug target classes and phenotypic effects.

The concatenation of cheminformatics tools in combination with pharmacognostic expertise and complementary empirical knowledge, such as information from traditional medicine, in vivo studies, epidemiological or clinical investigations, bioassay-guided fractionation, and high-resolution mass spectrometry-based dereplication is able to dramatically enhance the true positive hit rates as discussed in Sect. 4.2 [116, 117].

The ever-increasing computing power and availability of augmented data analysis algorithms have led to a broad use of computational tools in drug discovery. Even big data quantities can be processed with increasingly clever algorithms. Moreover, some predictive methods have shown similar performance levels to a group of experienced medicinal chemists in predicting biological activities, and outperform the brains of experts in the ability to process large databases [154].

3 W: Weaknesses of Cheminformatics in Natural Product Research

The many successful projects documented in the literature should not lead to wrong perceptions. The processing of natural products with cheminformatics bears some caveats, risks, and limitations, which are present in both domains (cheminformatics and natural product research) as well as at their interface (Fig. 1). To overcome these weaknesses, the limitations should be recognized so that they can be considered and avoided as far as possible.

Fig. 1
Weaknesses and challenges in cheminformatics, in natural product research, and at the interface of these two fields

The limited availability of natural starting material [155] and of readily available natural products by commercial vendors [28], the absence of elucidated molecular structures for the vast majority of natural products that exist, in addition to assay interference [156], are examples of drawbacks with respect to natural products. The complexity of multicomponent mixtures with difficult-to-predict additive effects and separation problems in isolation efforts are further caveats.

In the field of cheminformatics, several reviews discuss the pitfalls of virtual screening [157], also in combination with natural product research [30, 72]. The most serious weakness of cheminformatics approaches is their inherent inability to find truly novel compounds or novel molecular mechanisms of action. They can only extend knowledge on existing topics; the more knowledge is already available for the system under investigation, the better the predictive power. An investigator has to navigate between innovation, usually combined with interesting but ambitious topics with few relevant data available, on the one hand, and, on the other hand, probably trite, less risky targets with good prospects of success due to a vast amount of information already available. A number of molecular mechanisms have been explored by means of natural products, and some biological targets have even been named after their natural product ligands, as exemplified by the muscarinic acetylcholine receptor and cannabinoid receptors. Therefore, in silico tools should be used as part of an interconnected network combined with empirical knowledge and phenotype-directed and target-directed screening platforms [38, 158].

3.1 Structural Complexity of Natural Products

A main weakness in the handling of natural products with computational algorithms is the difference between natural products and synthetic small molecules [35], which was previously analyzed by several groups [11, 12, 76, 159–161]. Most algorithms were trained on synthetic molecules and might perform less well when they are confronted with unfamiliar molecules [35].

Natural products differ from other compound sets in several molecular properties. They are more hydrophobic and contain more oxygen atoms and fewer nitrogen atoms compared to synthetic drugs. The structural complexity, especially the differences in ring architecture, with unsaturated ring systems and more three-dimensional molecular shapes but less aromaticity, is closely correlated to the concept of privileged structures on the one hand, but may cause a difference in performance on the other [161].

Natural products are more flexible due to high numbers of sp³ hybridized atoms, which makes computations with three-dimensional tools (e.g., molecular docking) and conformational sampling for 3D similarity searches or pharmacophore-based virtual screening slower and more error-prone. A large number of rotatable bonds can also lead to promiscuous results, where ligands are fitted to molecular shapes, pharmacophores, and binding sites in implausible ways. Rotatable bond filters for shape matching experiments, such as the suggested Veber rule (rotatable bonds ≤10) [162], can be applied.
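A rotatable bond filter of the kind described above can be sketched as follows. The sketch assumes that the number of rotatable bonds has already been computed for each compound with a cheminformatics toolkit; the record layout, the compound names, and the cutoff used in the example are hypothetical, and the cutoff is deliberately left as a parameter to be set per project.

```python
def filter_by_rotatable_bonds(records, max_rotatable_bonds):
    """Split compound records into those passing and failing a flexibility cutoff.

    Each record is a dict with at least the keys 'name' and 'rotatable_bonds'.
    """
    kept, rejected = [], []
    for record in records:
        if record["rotatable_bonds"] <= max_rotatable_bonds:
            kept.append(record)
        else:
            rejected.append(record)
    return kept, rejected


# Hypothetical compounds with precomputed rotatable bond counts
compounds = [
    {"name": "rigid_terpenoid", "rotatable_bonds": 2},
    {"name": "flexible_polyketide", "rotatable_bonds": 17},
    {"name": "borderline_glycoside", "rotatable_bonds": 10},
]

# Example cutoff only; choose the value appropriate for the screening method
kept, rejected = filter_by_rotatable_bonds(compounds, max_rotatable_bonds=10)
print([c["name"] for c in kept])
print([c["name"] for c in rejected])
```

Keeping the rejected compounds in a separate list, rather than discarding them, makes it possible to revisit them later with methods that tolerate higher flexibility.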

A characteristic of natural products is the frequent occurrence of one or even more chiral centers [11, 76, 160], which are not always annotated in natural product databases or catalogs of chemical vendors [78]. Moreover, the exact configuration is not always reported in the primary literature. The generation of all stereochemical configurations is time-intensive and error-prone.
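The cost of enumerating undefined stereocenters grows exponentially, which the following sketch makes explicit. It treats each stereocenter as an independent R/S label, which is a simplifying assumption: meso forms, ring constraints, and double-bond geometry are ignored, so the count is an upper bound. The atom labels and the function name are hypothetical.

```python
from itertools import product


def enumerate_configurations(center_labels):
    """Enumerate all R/S assignments for stereocenters whose label is None.

    center_labels maps an atom label to 'R', 'S', or None (configuration unknown).
    Centers with a known configuration are left unchanged.
    """
    undefined = [atom for atom, label in center_labels.items() if label is None]
    configurations = []
    for assignment in product("RS", repeat=len(undefined)):
        config = dict(center_labels)
        config.update(zip(undefined, assignment))
        configurations.append(config)
    return configurations


# Hypothetical molecule: one annotated and two unannotated stereocenters
centers = {"C2": "R", "C3": None, "C5": None}
configs = enumerate_configurations(centers)
print(len(configs))  # 2 undefined centers give 2**2 candidate structures

# Growth of the upper bound with the number of undefined centers
for n in (2, 5, 10):
    print(n, 2 ** n)
```

Every additional unannotated center doubles the number of candidate structures, and each candidate then needs its own conformational sampling, which explains why incomplete stereochemical annotation is costly for 3D methods.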

Projects are more likely to be successful if the input information is related to the test subjects. Screening of natural product collections with a synthetic molecule query may be problematic concerning the reliability of the prediction. Similarly, the screening of synthetic molecule collections with a natural product-like query may lead to disappointing results. It is obvious that different ligands can occupy different regions on the same protein, even in the same binding site, making 3D alignments like pharmacophore- and shape-based screening prone to high rates of false-negative results [157].

3.2 Handling of Glycosides

Glycosides play an important role in living organisms and are abundant among natural products, where they fulfill different biological roles. Glycosides like amygdalin are used by different plants as storage and transport forms of their aglycone molecules. Upon disruption of compartments (e.g., by grazing herbivores), enzymatic hydrolysis cleaves the glycosides and releases the toxic aglycone. Other glycosides are natural prodrugs, enabling improved drug-likeness of the transformed metabolites [163, 164].

At first glance and from a medicinal chemistry perspective, sugars and sugar-like moieties are not in the focus of drug discovery. They are easily cleaved in the gut by microbes or by first-pass metabolism, increase the molecular weight, and lead to steric hindrance. Further, the polar glycoside moiety hinders the lipophilic effect between protein and ligand. Therefore, algorithms were created to cleave sugars from their aglycone counterparts for creation of virtual screening databases [28, 165].

The force fields used in molecular docking were adjusted to the binding of comparably rigid and nonpolar molecules and therefore perform well on such molecules; however, their performance with carbohydrates and carbohydrate-containing molecules is questionable. The frequently used molecular docking tool AutoDock Vina was able to produce acceptable structures within the top five ranked poses in only 55% of experimental crystallographic carbohydrate-protein complexes [166].

Notably, glycosides have been important drugs for a long time. In herbal medicines, it is acknowledged that glycosides decrease capillary fragility and exert secretolytic, diuretic, and antiexudative effects [167–169]. Carbohydrates play important biological roles, for example, in cell signaling, infection, and protein function [170–172]. These effects are generally mediated by nonclassical modes of action, such as membrane activity and interaction with protein surfaces, which are as yet difficult to describe with algorithms [173–175].

There are also examples of classic ligand-target interactions with natural product glycosides. Thus, phlorizin, a dihydrochalcone derivative, was the blueprint for sodium-dependent glucose transporter 2 inhibitors. The sugar moiety of phlorizin represents a vital part of the necessary pharmacophore to block the transporter [176]. From perspectives such as this, it may be a fallacy to exclude glycosides from virtual screening databases.

The handling of glycosides may be dependent on the individual target and project but definitely needs consideration. Further improvement of virtual screening tools toward a better applicability for glycosides is certainly needed.

3.3 Tiny Databases

A comparison of commercially available natural product collections with synthetic collections reveals a large difference in their size (Fig. 2). When compared to large databases of commercially available synthetic and mixed collections like Aldrich Market Select [85] with 8 million unique available compounds, and ZINC [84] comprising 120 million available compounds (7.3 million in stock), 11,000 natural products available from natural product-only catalogues and 25,000 natural products in total (including natural products in mixed catalogues) are fairly small [28]. In total, an estimated 250,000–300,000 natural products are known up to now [28, 83].

Fig. 2
Amounts of purchasable compounds in virtual collections on a logarithmic scale; white, natural product; black, primarily synthetic molecules; CA-NP, commercially available natural products; AMS, Aldrich Market Select

Model rigidity has to be balanced according to the size of the databases screened. A restrictive model with a hit rate of 0.2% leads to an estimated 50 virtual hits from commercially available natural product databases and 16,000 virtual hits from commercially available synthetic molecules.
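These estimates are consistent with the collection sizes quoted above, i.e., about 25,000 commercially available natural products and about 8 million compounds in Aldrich Market Select; the mapping of the 16,000 hits to the latter collection is an assumption based on the arithmetic. A minimal Python sketch of the calculation:

```python
def expected_virtual_hits(library_size: int, hit_rate: float) -> int:
    """Expected number of virtual hits for a model with the given hit rate."""
    return round(library_size * hit_rate)


HIT_RATE = 0.002  # 0.2%, as assumed in the text

# Library sizes taken from the comparison above (rounded figures).
natural_products = expected_virtual_hits(25_000, HIT_RATE)
synthetic = expected_virtual_hits(8_000_000, HIT_RATE)
print(natural_products, synthetic)
```

A model tuned to return a manageable hit list from a synthetic collection may therefore return almost nothing from a natural product collection, and vice versa.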

Natural product chemical diversity, however, is insufficiently explored and is biased toward molecules from extensively exploited sources making a final statement on their extent speculative. This is underlined, for example, by the discovery of naturally occurring organohalogens, which were considered until quite recently as rare and exotic isolates and often suspected to be artifacts. With the exploitation of unexplored sources such as marine organisms, algae, and lichens, thousands of these have been described [177]. Also, improved isolation and analytical methods, which enable the characterization of natural products contained at even lower traces, constantly change our perception of natural products chemistry.

Two main issues in the future will be to maintain the present rate of natural product discovery and to properly exploit what is found [178].

4 O: Opportunities of Cheminformatics in Natural Product Research

The growing popularity in the usage of computer-aided techniques in natural product research resulted in numerous successful application examples. Depending on the scientific issues at hand and the available information, in addition to that missing, different in silico tools and strategies have to be carefully selected. Figure 3 provides a schematic overview on opportunities to approach scientific questions by cheminformatic means. Besides the individual application examples named in Tables 2, 3, and 4, some successful projects are outlined in this chapter.

Fig. 3
Opportunities and areas of applications of cheminformatics in natural product research

4.1 Virtual Screening of Natural Product Databases

When considering the innate character of natural product collections (prolific, but low number of entities, difficult availability, high cost to obtain, etc.), as discussed in the previous chapters, it is highly recommended to first validate the predictive power of the model used by experimental testing of a set of virtual hits from easily accessible and inexpensive, physically available (synthetic) databases. Also, a proper preparation of the database subjected to virtual screening, e.g., by preceding filtering experiments, may help to (1) focus on the most interesting candidates and (2) economize computational power.

For example, Su et al. prepared a virtual screening collection with fingerprint clustering and drug-likeness filters. Natural products unsuitable for the molecular docking algorithms due to their size and polarity could be removed in advance. The virtual screening of only 24,000 molecules with a stepwise workflow employing molecular docking led to the identification of baicalein and phloretin as new natural Rho kinase inhibitors [141].

Considerable database preparation was also performed by Costa et al. for the identification of HIV-1 reverse transcriptase inhibitors. They generated a natural product database from 11 vendors and natural product databases publicly available in the ZINC repository. They narrowed down the database by removing molecules that violated the Lipinski Rule of Five [179] or had predicted poor solubility and permeability. A parallel molecular docking protocol as well as a 3D similarity search led to the selection and experimental testing of several virtual hits. β-Carboline derivatives were identified as HIV-1 reverse transcriptase inhibitors and their binding mode was examined using the molecular docking predictions as well as molecular dynamics simulations [136].
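A Rule of Five pre-filter of the kind described can be sketched as follows. This is a minimal Python illustration and not the protocol of Costa et al.: it assumes that the four descriptors have been precomputed with a cheminformatics toolkit, the entries and their values are hypothetical placeholders, and the default of tolerating no violation is an assumption (many workflows accept one).

```python
from typing import Dict, List, Union

Descriptors = Dict[str, Union[str, float, int]]


def lipinski_violations(d: Descriptors) -> int:
    """Count Rule of Five violations (MW > 500, logP > 5, HBD > 5, HBA > 10)."""
    return sum(
        [
            d["mol_weight"] > 500,
            d["logp"] > 5,
            d["h_bond_donors"] > 5,
            d["h_bond_acceptors"] > 10,
        ]
    )


def passes_rule_of_five(d: Descriptors, max_violations: int = 0) -> bool:
    """True if the compound has at most max_violations violations."""
    return lipinski_violations(d) <= max_violations


# Hypothetical descriptor values for illustration only.
library: List[Descriptors] = [
    {"name": "small_alkaloid", "mol_weight": 212.2, "logp": 2.1,
     "h_bond_donors": 1, "h_bond_acceptors": 3},
    {"name": "large_glycoside", "mol_weight": 780.9, "logp": -1.2,
     "h_bond_donors": 9, "h_bond_acceptors": 15},
]
kept = [d["name"] for d in library if passes_rule_of_five(d)]
print(kept)
```

As discussed in Sect. 3.2, such a filter systematically removes glycosides, which may or may not be desirable for a given target.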

Insufficient capacities to obtain large sets of natural products for experimental testing may be circumvented by the application of a set of ligand-based pharmacophore models previously validated mainly on synthetic molecules for the most prevalent antitarget in drug discovery and development, i.e., the hERG channel [38, 117, 118]. For a detailed insight into the performance of different hERG prediction tools toward a fast and efficient cardiotoxic risk assessment, the reader is referred to the chapter “Open Access Activity Prediction Tools for Natural Products. Case Study: hERG Blockers” by Schuster (this volume, p. 175). Kratz et al. used the previously generated, best performing pharmacophore model for the subsequent virtual hERG screening of natural product databases. They validated their predictions in a patch clamp assay by testing small-scale lead-like enhanced extracts from 12 plant materials known to contain the virtual hits. At 100 μg/cm³, 4 out of the 12 extracts exerted a hERG tail current inhibition of more than 30%, among them Ipecacuanhae Radix. Use of an appropriate phytochemical workflow resulted in the isolation and identification of five out of the six virtually predicted alkaloids, among them the major constituents emetine and cephaeline with IC₅₀ values of 21.4 and 5.3 μM, respectively [118]. Similarly, Vuorinen et al. [126] used previously validated pharmacophore models [180, 181] for the identification of hydroxysteroid dehydrogenase inhibitors from Nature.

Virtual screening can also predict phenotypic efficacy, as shown by the work of Karhu et al. [131]. They performed a principal component analysis [7] of the physicochemical properties of a natural product database and an antichlamydial reference set and compared the Euclidean distances in the chemical space. Out of 26 virtual hits, 6 molecules were confirmed as active and 1 high-potency lead was identified.
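The distance comparison can be sketched in a few lines of Python. The example assumes that principal component scores are already available for every compound; the coordinates are hypothetical, and ranking each candidate by its distance to the nearest reference compound is one plausible reading of the approach rather than the published protocol.

```python
import math
from typing import Dict, List, Sequence, Tuple


def rank_by_nearest_reference(
    candidates: Dict[str, Sequence[float]],
    references: List[Sequence[float]],
) -> List[Tuple[str, float]]:
    """Rank candidates by Euclidean distance to their closest reference compound."""
    scored = [
        (name, min(math.dist(coords, ref) for ref in references))
        for name, coords in candidates.items()
    ]
    return sorted(scored, key=lambda item: item[1])


# Hypothetical principal component scores (PC1, PC2, PC3).
candidates = {
    "np_1": (0.0, 0.0, 0.0),
    "np_2": (3.0, 4.0, 0.0),
    "np_3": (1.0, 0.0, 0.0),
}
references = [(0.0, 0.0, 0.0), (6.0, 8.0, 0.0)]

ranking = rank_by_nearest_reference(candidates, references)
print(ranking)
```

Candidates at the top of the ranking occupy the same region of property space as the active reference compounds and would be prioritized for testing.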

4.2 Exploitation of Pharmacognostic Knowledge

The implementation of information from traditional medicine and the knowledge from structural ligand-target interaction can significantly increase the yield of true active hits (Fig. 4a, b). Applying pharmacophore models for cyclooxygenase (COX) inhibitors, which were completely derived with input from synthesis chemistry, Rollinger et al. were able to demonstrate statistically their effectiveness in the field of natural products. A comparison of virtual hits obtained by screening of the mainly synthetic molecular 3D collections of the Derwent World Drug Index (WDI) and the Database of the National Cancer Institute (NCI) revealed hit rates in the range of 6.6% to 13.7% (depending on the search queries used). Using the in-house-generated natural product database NPD consisting of molecular structures from 80,000 natural products, even a slight increase of molecules that virtually fit into the required features of the pharmacophore models could be achieved. A striking result of this study, however, was the average increase of hit rates (77% to 133%) when an ethnopharmacologically biased database labeled as DIOS was screened compared to the WDI and the NCI. The DIOS database contains structural information of 28,000 secondary metabolites reported from those medicinal plants that Pedanius Dioscorides (first century AD) described in his “De Materia Medica” as useful for the treatment of different sorts of inflammation. In this way, the distinct statistical benefit of a combination of an ethnopharmacological approach and an in silico screening could be demonstrated [115, 116]. In a follow-up study, one of the most promising herbal drugs, the root bark of Morus alba, was selected based on the predictions from the DIOS database. The plant material was phytochemically investigated to evaluate the applicability of the computer-aided approach. Several virtually predicted constituents from the group of the isolated Diels-Alder adducts could be confirmed successfully as COX inhibitors [116].

Fig. 4
Examples of strategies for the implementation of cheminformatics in pharmacognostic workflows: (a) starting from validated in silico model/s; (b) starting from bioactive natural material

Kirchweger et al. [125] performed a virtual screening of several small natural product databases and a larger synthetic small molecule collection (SPECS) for the identification of activators for the G protein-coupled bile acid receptor 1 (GPBAR1) using a ligand-based pharmacophore virtual screening approach. The virtual hits were ranked according to a shape-focused similarity score and the molecules were clustered according to their physicochemical properties. This approach enabled the selection of chemically diverse compounds endowed with the putative structural requirements to act as ligands of the envisaged target for experimental validation using a reporter-gene based assay. Both synthetic and natural product-derived virtual hits were subjected to experimental testing. Accordingly, the yield of active synthetic compounds (>15% receptor activation at 20 μM) was 10.5% (2 out of 19); natural products resulted in a roughly fivefold higher hit rate (57%; 8 out of 14). The latter group also included two novel GPBAR1 activating scaffolds, namely, the sesquiterpene coumarins farnesiferol B and microlobidene, which at 20 μM increased the receptor activation to 61% and 84%, respectively, thus showing an activity comparable to that of the endogenous ligand, lithocholic acid.
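The reported hit rates and their ratio follow directly from the counts given above, as this short Python calculation shows (the function name is illustrative):

```python
def hit_rate(actives: int, tested: int) -> float:
    """Fraction of experimentally tested virtual hits that were active."""
    return actives / tested


synthetic_rate = hit_rate(2, 19)   # synthetic virtual hits
natural_rate = hit_rate(8, 14)     # natural product virtual hits
fold = natural_rate / synthetic_rate

print(f"{synthetic_rate:.1%} {natural_rate:.1%} {fold:.1f}-fold")
```

With such small sample sizes, the ratio should be read as an indication rather than a precise enrichment factor.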

Cheminformatics can also be used in a straightforward manner to identify the active principles of traditionally used medicines and to unravel their molecular modes of action. Schuster et al. [120] generated a set of validated pharmacophore models for the transcription factor FXR, a drug target for inflammatory liver diseases [182]. Grienke et al. [119] used these models for virtual screening of the Chinese herbal medicine database, and, from this work, lanostane-type triterpenes from the fruit body of Ganoderma lucidum were predicted as virtual hits. As this mushroom is traditionally used against hepatitis, liver disease, and arthritis, a full mycochemical investigation and isolation was performed. Five isolated lanostane triterpenes were confirmed experimentally to induce FXR activation with EC₅₀ values in the low micromolar range.

4.3 Virtual Target Fishing

It is a frequent observation that a herbal drug shows a well-documented biological or clinical effect, but the constituents responsible as well as their underlying mechanisms of action remain elusive [95, 108]. Binding mode prediction and virtual target fishing can help to fast forward the rationalization of research and identify possible drug leads. Similar to already described nutritional and medicinal effects in humans, an observed phenotypic effect such as cytotoxicity, antimicrobial, or hypoglycemic activity can be followed up with focused isolation and experimental efforts.

In 2014, Reker et al. [29] presented a novel method for target fishing, which is independent of the target structure. The approach uses topological pharmacophore features of query compound fragments to compare them to pre-calculated drug compound clusters. The constituent can then be assigned to the cluster with the smallest Euclidean distance. Target information for the cluster was derived from confirmed interaction partners of reference drugs within the cluster. As a prospective application example, the macrolide archazolid A (ArcA) was investigated. This compound exerts potent cancer-related effects by inhibiting the ion pump vacuolar-type H⁺-ATPase at the nanomolar level. However, it was suggested that additional targets might be responsible for the pronounced antitumor effect. The analysis predicted several targets involved in arachidonic acid-associated signaling cascades as potential interaction partners, and subsequent biological testing confirmed a concentration-dependent effect of ArcA on half of these targets. In addition, weak effects on two further targets were observed. The experimental results validated the applicability of the natural product-derived fragment-based approach for the identification of novel macromolecular targets. Remarkably, all newly identified interaction partners of ArcA have also been linked to putative anticancer effects [29].
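The cluster assignment step can be illustrated with a minimal nearest-centroid sketch in Python. This is not the published implementation: representing each cluster by a single centroid, the two-dimensional descriptor vectors, and all cluster and target names are hypothetical assumptions made for illustration.

```python
import math
from typing import Dict, List, Sequence


def nearest_cluster(
    query: Sequence[float], centroids: Dict[str, Sequence[float]]
) -> str:
    """Return the ID of the cluster whose centroid is closest to the query."""
    return min(centroids, key=lambda cid: math.dist(query, centroids[cid]))


def predict_targets(
    query: Sequence[float],
    centroids: Dict[str, Sequence[float]],
    cluster_targets: Dict[str, List[str]],
) -> List[str]:
    """Transfer the known targets of the nearest drug cluster to the query."""
    return cluster_targets[nearest_cluster(query, centroids)]


# Hypothetical cluster centroids in a pharmacophore descriptor space.
centroids = {"cluster_1": (0.0, 1.0), "cluster_2": (4.0, 4.0)}
# Hypothetical targets of the reference drugs contained in each cluster.
cluster_targets = {
    "cluster_1": ["target_X"],
    "cluster_2": ["target_Y", "target_Z"],
}

query_fragment = (3.0, 3.5)  # descriptor vector of a query fragment
predicted = predict_targets(query_fragment, centroids, cluster_targets)
print(predicted)
```

Because the prediction relies only on ligand descriptors, no three-dimensional structure of the target is required.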

Mastic gum has been used traditionally against metabolic disorders [183] and has also been shown to exert hypoglycemic activity in vivo [184]. Its bioactive constituents and the molecular targets responsible were largely unknown. The virtual screening of a natural compound database against 11β-HSD1 pharmacophore models retrieved triterpenoids from Pistacia lentiscus as virtual hits. Together with empirical and preclinical data, the prediction seemed plausible. Therefore, mastic gum and its acidic fraction, which is known to contain the predicted hits, were subjected to experimental testing. Both samples inhibited 11β-HSD1 in a concentration-dependent manner; the two virtually predicted main triterpenes showed IC₅₀ values in the low micromolar range [126].

Gong et al. [112] used a similar approach based on reverse docking against 211 cancer-related targets to explain an observed cytotoxic effect against two cancer cell lines of two novel sponge metabolites. The precious isolates were only tested against the two most promising targets according to the docking scores. The experimental testing explained the phenotypic effects as attributed to the inhibition of histone acetyltransferase h(p300).
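Prioritizing a small number of targets from a reverse docking run amounts to ranking the target panel by docking score. The Python sketch below assumes that lower (more negative) scores indicate better predicted binding, which holds for many but not all scoring functions; the target names and scores are hypothetical.

```python
from typing import Dict, List


def top_targets(docking_scores: Dict[str, float], n: int = 2) -> List[str]:
    """Return the n targets with the best (lowest) docking scores."""
    return sorted(docking_scores, key=docking_scores.get)[:n]


# Hypothetical docking scores of one isolate against a target panel.
scores = {
    "target_A": -7.1,
    "target_B": -9.4,
    "target_C": -5.2,
    "target_D": -8.8,
}
selected = top_targets(scores)
print(selected)
```

Restricting the experiments to the best-ranked targets conserves scarce material, at the price of missing targets that the scoring function ranks poorly.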

Several target prediction tools have been made accessible online such as the self-organizing map-based prediction of drug equivalence relationships (SPIDER) [106] and the Antibiotic'ome [108].

4.4 Binding Pose and Activity Predictions

If a broad set of structurally very similar molecules and their biological activity in a certain assay is well described, quantitative structure-activity relationship (QSAR) models can be calculated. Schmidt et al. used the information on 69 sesquiterpene lactone structures and their antitrypanosomal activities to generate a predictive model. The query was able to predict correctly furanoheliangolides with highly potent antitrypanosomal in vitro activity out of a virtual sesquiterpene database [150].

Molecular docking in combination with molecular dynamics simulations, but also pharmacophore alignments, have been demonstrated to accurately predict the binding mode of natural products to their respective targets, offering valuable support for the understanding of bioactivities on a molecular level. Rollinger et al. used a combination of molecular docking and pharmacophore-based virtual screening to identify novel human rhinovirus (HRV) capsid binders, which were confirmed experimentally as inhibitors, and to give insights into the interaction of natural product-derived inhibitors in the binding pocket. They proposed an eight-feature pharmacophore necessary for the identified ligands interacting in the binding site in addition to their fitting and binding mechanism into the highly lipophilic pocket [96].

The structure and function of membrane-bound GPCRs are still not well understood due to their difficult crystallization. Binding mechanisms of their ligands are nevertheless crucial, since approximately one third of all drugs target these proteins. After identifying several alkaloids as 5-HT₂C receptor ligands with a combined virtual and experimental screening, Peng et al. used a homology model to predict the interaction pattern of the ligands. Molecular docking and molecular dynamics suggested key interactions such as a conserved salt bridge and π stacking [101].

5 T: Threats of Cheminformatics in Natural Product Research

The broad use of cheminformatics in natural product research should not lead to overestimated expectations. As outlined in Sect. 3, weaknesses are pervasive, and experiments are mandatory for confirmation of results. Commonly, however, this confirmation is missing for binding mode predictions, which are frequently reported without any proof of correctness.

Molecular target prediction tools are similarly hard to evaluate experimentally and natural product researchers should scrutinize retrieved predictions with healthy skepticism. The biomedical data for natural products is comparatively small when compared to other molecule classes. Therefore, it must be assumed that they are generally underrepresented in generation and validation of computational models. This might not only be the case for target prediction but also for the estimation of lipophilicity, conformer generation, assay interference prediction, molecular docking force-field adjustment, and other tasks.

In silico models must follow scripted instructions and generate only predictions. Flexibility, dynamics, and entropic issues, along with many more aspects, can only be approached with extensive computational efforts. Virtual screening experiments still produce many false-positive virtual hits and incorrect or distorted results. Accordingly, predictions without any solid and unbiased experimental validation are not able to stand any test of scientific meaningfulness and therefore have to be regarded as “preliminary.” On the other hand, information on hypotheses and models that were experimentally proven wrong is very unlikely to be accessible. This not only refers to models that failed a proof of concept but also to test data of compounds showing no activity on a specific target. With special regard to the correct feeding and training of prediction tools with structural data covering a broad range of activity, ideally from inactive compounds to highly potent ones, learning from previous mistakes and non-working hypotheses would be extremely valuable. The fact that so many successful projects have been reported disguises the fact that other projects failed.

Obtaining natural products in sufficient purity from commercial suppliers or by isolation from a suitable natural source can be very costly or time-intensive. The natural starting material should be accessible and legally available for collection/acquisition considering issues on bioprospecting, intellectual property rights, and transfer of natural material outside its country of origin (Nagoya protocol, [155]). Also, reliable reports on the natural product isolation procedure as well as compound structure elucidation parameters and the description of relevant physicochemical properties should be accessible for a target-oriented re-isolation and identification using mass spectrometry-based dereplication.

Special attention should be devoted to broadly distributed PAINS motifs in natural products such as catechols, hydroquinones, epoxides, peroxide bridges, and phenolic Mannich bases. Other concerns are solubility problems and compound aggregation. However, it might be inadvisable to generate a naive black-box application of PAINS and general drug-likeness filters [156] without looking beyond these parameters.

6 Conclusion

The process of small-molecule drug discovery can be described as being deterministic and nonlinear (e.g., activity cliffs), resembling a chaotic system. This is particularly true for drug discovery from natural products, where researchers are confronted not only with nonlinear behavior, serendipitous events, errors, and incompleteness but also with biological variance, complex multicomponent mixture interactions, and frequent assay interferences. The current challenge of medicinal chemists is to choose which of the possible 10⁶⁰ drug-like molecules should be synthesized and tested [18]. Considering the historical impact of natural products on the pharmaceutical arsenal and their infinite (however, incompletely known) diversity, secondary metabolites have already been synthesized by the most trained chemist on earth and thus are hidden gems designed to have key functions. In natural product research, the application of cheminformatics-based strategies is limited to already structurally disclosed molecules; accordingly, their potentially very large impact relies on properly performed and trustworthy chemical studies on natural resources and their constituents, and on the documentation and dissemination of these studies.

The technological advances and experimental exploration of the last centuries, in particular, have afforded the opportunity of accessing enormous amounts of data. Selecting the appropriate computational tools for handling these data and for addressing the research question is a key step but still requires a healthy skepticism and an unbiased attitude.

The Nobel Laureate Rolf Zinkernagel once made a piercing summary of different research strategies and their chance for success [185]: Having no rationale and performing no experiments is cheap but will not lead to results. To start from a rationale, but renounce experimental work is another relatively cheap method, but similarly does not lead to results. Lots of experiments without any rationale may produce interesting and serendipitous results, but with a disproportionate effort and waste of capacities. To perform experimental studies with a rationale is without surprise the gold method with a good yield of results and appropriate expenses. The generation of this rationale assisted by the use of already available data and with modern computational techniques based on the combined expertise from natural product researchers and computational chemists harbors the key to successful drug discovery processes in the field of remedies from Mother Nature.

References

1.
Morimoto M, Komai K (2000) Plant secondary metabolites as plant defense systems. Recent Res Dev Phytochem 4:99
2.
Hadacek F (2002) Secondary metabolites as plant traits: current assessment and future perspectives. Crit Rev Plant Sci 21:273
3.
Moghe GD, Last RL (2015) Something old, something new: conserved enzymes and the evolution of novelty in plant specialized metabolism. Plant Physiol 169:1512
4.
Wöll S, Kim SH, Greten HJ, Efferth T (2013) Animal plant warfare and secondary metabolite evolution. Nat Prod Bioprospect 3:1
5.
van Hattum H, Waldmann H (2014) Biology-oriented synthesis: harnessing the power of evolution. J Am Chem Soc 136:11853
6.
Sukuru SC, Jenkins JL, Beckwith RE, Scheiber J, Bender A, Mikhailov D, Davies JW, Glick M (2009) Plate-based diversity selection based on empirical HTS data to enhance the number of hits and their chemical diversity. J Biomol Screen 14:690
7.
Larsson J, Gottfries J, Muresan S, Backlund A (2007) ChemGPS-NP: tuned for navigation in biologically relevant chemical space. J Nat Prod 70:789
8.
Ertl P, Schuffenhauer A (2008) Cheminformatics analysis of natural products: lessons from Nature inspiring the design of new drugs. In: Petersen F, Amstutz R (eds) Natural compounds as drugs, vol II. Birkhäuser, Basel, p 217
9.
Koch MA, Schuffenhauer A, Scheck M, Wetzel S, Casaulta M, Odermatt A, Ertl P, Waldmann H (2005) Charting biologically relevant chemical space: a structural classification of natural products (SCONP). Proc Natl Acad Sci U S A 102:17272
10.
Camp D, Garavelas A, Campitelli M (2015) Analysis of physicochemical properties for drugs of natural origin. J Nat Prod 78:1370
11.
Stratton CF, Newman DJ, Tan DS (2015) Cheminformatic comparison of approved drugs from natural product versus synthetic origins. Bioorg Med Chem Lett 25:4802
12.
Wetzel S, Schuffenhauer A, Roggo S, Ertl P, Waldmann H (2007) Cheminformatic analysis of natural products and their chemical space. Chimia 61:355
13.
Lopez-Vallejo F, Giulianotti MA, Houghten RA, Medina-Franco JL (2012) Expanding the medicinally relevant chemical space with compound libraries. Drug Discov Today 17:718
14.
Rollinger JM, Langer T, Stuppner H (2006) Strategies for efficient lead structure discovery from natural products. Curr Med Chem 13:1491
15.
Newman DJ, Cragg GM (2016) Natural products as sources of new drugs from 1981 to 2014. J Nat Prod 79:629
16.
Rodrigues T, Reker D, Schneider P, Schneider G (2016) Counting on natural products for drug design. Nat Chem 8:531
17.
Strohl WR (2000) The role of natural products in a modern drug discovery program. Drug Discov Today 5:39
18.
Schneider G (2017) Automating drug discovery. Nat Rev Drug Discov 17:97
19.
Keller TH, Shi P-Y, Wang Q-Y (2011) Anti-infectives: can cellular screening deliver? Curr Opin Chem Biol 15:529
20.
Swinney DC, Anthony J (2011) How were new medicines discovered? Nat Rev Drug Discov 10:507
21.
Harrison S, Lahue B, Peng Z, Donofrio A, Chang C, Glick M (2017) Extending “predict first” to the design-make-test cycle in small-molecule drug discovery. Future Med Chem 9:533
22.
Weller MG (2012) A unifying review of bioassay-guided fractionation, effect-directed analysis and related techniques. Sensors 12:9181
23.
Kaur K, Michael H, Arora S, Harkonen P, Kumar S (2005) In vitro bioactivity-guided fractionation and characterization of polyphenolic inhibitory fractions from Acacia nilotica (L.) Willd. ex Del. J Ethnopharmacol 99:353
24.
Yang X, Summerhurst DK, Koval SF, Ficker C, Smith ML, Bernards MA (2001) Isolation of an antimicrobial compound from Impatiens balsamina L. using bioassay-guided fractionation. Phytother Res 15:676
25.
Cardellina JH, Munro MHG, Fuller RW, Manfredi KP, McKee TC, Tischler M, Bokesch HR, Gustafson KR, Beutler JA, Boyd MR (1993) A chemical screening strategy for the dereplication and prioritization of HIV-inhibitory aqueous natural products extracts. J Nat Prod 56:1123
26.
Bindseil KU, Jakupovic J, Wolf D, Lavayre J, Leboul J, van der Pyl D (2001) Pure compound libraries; a new perspective for natural product based drug discovery. Drug Discov Today 6:840
27.
Rao KV (1993) Taxol and related taxanes. I. Taxanes of Taxus brevifolia bark. Pharm Res 10:521
28.
Chen Y, de Bruyn Kops C, Kirchmair J (2017) Data resources for the computer-guided discovery of bioactive natural products. J Chem Inf Model 57:2099
29.
Reker D, Perna AM, Rodrigues T, Schneider P, Reutlinger M, Monch B, Koeberle A, Lamers C, Gabler M, Steinmetz H, Muller R, Schubert-Zsilavecz M, Werz O, Schneider G (2014) Revealing the macromolecular targets of complex natural products. Nat Chem 6:1072
30.
Kirchweger B, Rollinger JM (2018) Virtual screening for the discovery of active principles from natural products. In: Cechinel-Filho V (ed) Natural products as source of molecules with therapeutic potential: research & development, challenges and perspectives. Springer, Cham, pp 333–364
31.
Rollinger JM, Langer T, Stuppner H (2006) Integrated in silico tools for exploiting the natural products’ bioactivity. Planta Med 72:671
32.
Rollinger JM, Quinn RJ (2015) In silico driven pharmacognosy: forth, back and reverse. Planta Med 81:427
33.
Gasteiger J, Engel T (eds) (2006) Chemoinformatics: a textbook. Wiley-VCH, Weinheim
34.
Rester U (2008) From virtuality to reality – virtual screening in lead discovery and lead optimization: a medicinal chemistry perspective. Curr Opin Drug Discov Devel 11:559
35.
Rollinger JM, Wolber G (2011) Computational approaches for the discovery of natural lead structures. In: Tringali C (ed) Bioactive compounds from natural sources, 2nd edn. CRC Press, Boca Raton, FL, p 167
36.
Schneider P, Schneider G (2017) De-orphaning the marine natural product (±)-marinopyrrole A by computational target prediction and biochemical validation. Chem Commun 53:2272
37.
Rodrigues T, Sieglitz F, Somovilla VJ, Cal PM, Galione A, Corzana F, Bernardes GJ (2016) Unveiling (–)-englerin A as a modulator of L-type calcium channels. Angew Chem Int Ed Eng 55:11077
38.
Kratz JM, Grienke U, Scheel O, Mann SA, Rollinger JM (2017) Natural products modulating the hERG channel: heartaches and hope. Nat Prod Rep 34:957
39.
Lee ML, Schneider G (2001) Scaffold architecture and pharmacophoric properties of natural products and trade drugs: application in the design of natural product-based combinatorial libraries. J Comb Chem 3:284
40.
Friedrich L, Rodrigues T, Neuhaus CS, Schneider P, Schneider G (2016) From complex natural products to simple synthetic mimetics by computational de novo design. Angew Chem Int Ed Eng 55:6789
41.
Grisoni F, Merk D, Consonni V, Hiss JA, Tagliabue SG, Todeschini R, Schneider G (2018) Scaffold hopping from natural products to synthetic mimetics by holistic molecular similarity. Commun Chem 1:44
42.
Sichao W, Youyong L, Lei X, Dan L, Tingjun H (2013) Recent developments in computational prediction of hERG blockage. Curr Top Med Chem 13:1317
43.
Scior T, Bernard P, Medina-Franco JL, Maggiora GM (2007) Large compound databases for structure-activity relationships studies in drug discovery. Mini-Rev Med Chem 7:851
44.
Walters WP (2019) Virtual chemical libraries. J Med Chem 62:1116
45.
Southan C (2018) Caveat usor: assessing differences between major chemistry databases. ChemMedChem 13:470
46.
Gaulton A, Bellis LJ, Bento AP, Chambers J, Davies M, Hersey A, Light Y, McGlinchey S, Michalovich D, Al-Lazikani B, Overington JP (2012) ChEMBL: a large-scale bioactivity database for drug discovery. Nucleic Acids Res 40:D1100
47.
Bento AP, Gaulton A, Hersey A, Bellis LJ, Chambers J, Davies M, Krüger FA, Light Y, Mak L, McGlinchey S, Nowotka M, Papadatos G, Santos R, Overington JP (2014) The ChEMBL bioactivity database: an update. Nucleic Acids Res 42(D1):D1083
48.
EMBL-EBI (2019) Homepage of the ChEMBL database, 2019. https://www.ebi.ac.uk/chembl/
49.
NCBI (2019) PubChem, 2019. https://pubchem.ncbi.nlm.nih.gov/
50.
Kim S, Thiessen PA, Bolton EE, Chen J, Fu G, Gindulyte A, Han L, He J, He S, Shoemaker BA, Wang J, Yu B, Zhang J, Bryant SH (2016) PubChem substance and compound databases. Nucleic Acids Res 44(D1):D1202
51.
Papadatos G, Davies M, Dedman N, Chambers J, Gaulton A, Siddle J, Koks R, Irvine SA, Pettersson J, Goncharoff N, Hersey A, Overington JP (2016) SureChEMBL: a large-scale, chemically annotated patent document database. Nucleic Acids Res 44(D1):D1220
52.
EMBL-EBI (2019) SureChEMBL, 2019. https://www.surechembl.org/search/
53.
Wishart DS, Feunang YD, Guo AC, Lo EJ, Marcu A, Grant JR, Sajed T, Johnson D, Li C, Sayeeda Z, Assempour N, Iynkkaran I, Liu Y, Maciejewski A, Gale N, Wilson A, Chin L, Cummings R, Le D, Pon A, Knox C, Wilson M (2018) DrugBank 5.0: a major update to the DrugBank database for 2018. Nucleic Acids Res 46(D1):D1074
54.
Wishart DS, Knox C, Guo AC, Shrivastava S, Hassanali M, Stothard P, Chang Z, Woolsey J (2006) DrugBank: a comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res 34(Database issue):D668
55.
DrugBank (2019) The DrugBank database, 2019. https://www.drugbank.ca/
56.
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucleic Acids Res 28:235
57.
wwPDB consortium (2018) Protein data bank: the single global archive for 3D macromolecular structure data. Nucleic Acids Res 47(D1):D520
58.
Gilson MK, Liu T, Baitaluk M, Nicola G, Hwang L, Chong J (2016) BindingDB in 2015: a public database for medicinal chemistry, computational chemistry and systems pharmacology. Nucleic Acids Res 44(D1):D1045
59.
Irwin JJ (2008) Community benchmarks for virtual screening. J Comput Aided Mol Des 22:193
60.
Mysinger MM, Carchia M, Irwin JJ, Shoichet BK (2012) Directory of Useful Decoys, Enhanced (DUD-E): better ligands and decoys for better benchmarking. J Med Chem 55:6582
61.
Chan WK, Zhang H, Yang J, Brender JR, Hur J, Özgür A, Zhang Y (2015) GLASS: a comprehensive database for experimentally validated GPCR-ligand associations. Bioinformatics 31:3035
62.
Excelra Knowledge Solutions (2019) GOSTAR database, 2019. https://www.gostardb.com/gostar/index.jsp
63.
Sushko I, Novotarskyi S, Körner R, Pandey AK, Rupp M, Teetz W, Brandmaier S, Abdelaziz A, Prokopenko VV, Tanchuk VY, Todeschini R, Varnek A, Marcou G, Ertl P, Potemkin V, Grishina M, Gasteiger J, Schwab C, Baskin II, Palyulin VA, Radchenko EV, Welsh WJ, Kholodovych V, Chekmarev D, Cherkasov A, Aires-de-Sousa J, Zhang Q-Y, Bender A, Nigsch F, Patiny L, Williams A, Tkachenko V, Tetko IV (2011) Online chemical modeling environment (OCHEM): web platform for data storage, model development and publishing of chemical information. J Comput Aided Mol Des 25:533
64.
OCHEM (2019) Online chemical database, 2019. https://ochem.eu/home/show.do
65.
Li J, Liu J, Han L, Wang R, Nie W, Li Y, Liu Y, Liu Z, Zhao Z (2014) PDB-wide collection of binding data: current status of the PDBbind database. Bioinformatics 31:405
66.
Ahmed A, Dunbar JB Jr, Clark JJ, Smith RD, Carlson HA (2014) Recent improvements to Binding MOAD: a resource for protein–ligand binding affinities and structures. Nucleic Acids Res 43(D1):D465
67.
Hu L, Benson ML, Smith RD, Lerner MG, Carlson HA (2005) Binding MOAD (mother of all databases). Proteins Struct Funct Bioinf 60:333
68.
Frolkis A, Knox C, Lim E, Jewison T, Law V, Hau DD, Liu P, Gautam B, Ly S, Guo AC, Xia J, Liang Y, Shrivastava S, Wishart DS (2010) SMPDB: the small molecule pathway database. Nucleic Acids Res 38:D480
69.
Jewison T, Su Y, Disfany FM, Liang Y, Knox C, Maciejewski A, Poelzer J, Huynh J, Zhou Y, Arndt D, Djoumbou Y, Liu Y, Deng L, Guo AC, Han B, Pon A, Wilson M, Rafatnia S, Liu P, Wishart DS (2014) SMPDB 2.0: big improvements to the small molecule pathway database. Nucleic Acids Res 42:D478
70.
Li YH, Yu CY, Li XX, Zhang P, Tang J, Yang Q, Fu T, Zhang X, Cui X, Tu G, Zhang Y, Li S, Yang F, Sun Q, Qin C, Zeng X, Chen Z, Chen YZ, Zhu F (2017) Therapeutic target database update 2018: enriched resource for facilitating bench-to-clinic research of targeted therapeutics. Nucleic Acids Res 46:D1121
71.
Swiss Institute of Bioinformatics (2019) Directory of computer-aided drug design tools, 2019. https://www.click2drug.org/
72.
Kaserer T, Schuster D, Rollinger JM (2018) Chemoinformatics in natural product research. In: Engel T, Gasteiger J (eds) Applied chemoinformatics: achievements and future opportunities. Wiley-VCH, Weinheim, p 207
73.
Harvey AL, Edrada-Ebel R, Quinn RJ (2015) The re-emergence of natural products for drug discovery in the genomics era. Nat Rev Drug Discov 14:111
74.
Blunt J, Munro M, Upjohn M (2012) The role of databases in marine natural products research. In: Fattorusso E, Gerwick WH, Taglialatela-Scafati O (eds) Natural compounds as drugs, 2nd edn. Springer, Dordrecht, p 389
75.
Mohamed A, Nguyen CH, Mamitsuka H (2016) Current status and prospects of computational resources for natural product dereplication: a review. Brief Bioinform 17:309
76.
Ma DL, Chan DSH, Leung CH (2011) Molecular docking for virtual screening of natural product databases. Chem Sci 2:1656
77.
Blunt JW, Munro MHG (2014) Is there an ideal database for natural products research? In: Osbourn A, Goss RJ, Carter GT (eds) Natural products. Wiley-VCH, Weinheim, p 413
78.
CRC Press, Taylor & Francis Group (2019) Dictionary of natural products 27.2, 2019. http://dnp.chemnetbase.com
79.
Gu J, Gui Y, Chen L, Yuan G, Lu H-Z, Xu X (2013) Use of natural products as chemical library for drug discovery and network pharmacology. PLoS One 8:e62839
80.
Chen CY-C (2011) TCM Database@Taiwan: the world’s largest traditional Chinese medicine database for drug screening in silico. PLoS One 6:e15939
81.
Choi H, Cho SY, Pak HJ, Kim Y, Choi J-Y, Lee YJ, Gong BH, Kang YS, Han T, Choi G, Cho Y, Lee S, Ryoo D, Park H (2017) NPCARE: database of natural products and fractional extracts for cancer regulation. J Cheminf 9:2
82.
Pilon AC, Valli M, Dametto AC, Pinto MEF, Freire RT, Castro-Gamboa I, Andricopulo AD, Bolzani VS (2017) NuBBEDB: an updated database to uncover chemical and biological information from Brazilian biodiversity. Sci Rep 7:7215
83.
Banerjee P, Erehman J, Gohlke B-O, Wilhelm T, Preissner R, Dunkel M (2015) Super Natural II—a database of natural products. Nucleic Acids Res 43:D935
84.
Irwin JJ, Sterling T, Mysinger MM, Bolstad ES, Coleman RG (2012) ZINC: a free tool to discover chemistry for biology. J Chem Inf Model 52:1757
85.
Sigma-Aldrich (2019) Aldrich market select, 2019. https://www.sigmaaldrich.com/chemistry/chemistry-services/aldrich-market-select.html
86.
Ertl P, Roggo S, Schuffenhauer A (2008) Natural product-likeness score and its application for prioritization of compound libraries. J Chem Inf Model 48:68
87.
Jayaseelan KV, Moreno P, Truszkowski A, Ertl P, Steinbeck C (2012) Natural product-likeness score revisited: an open-source, open-data implementation. BMC Bioinf 13:106
88.
RDKit: Open-source cheminformatics software (2019) RDKit version 2017.09.3, 2019. http://www.rdkit.org/
89.
Yu MJ (2011) Natural product-like virtual libraries: recursive atom-based enumeration. J Chem Inf Model 51:541
90.
Jayaseelan KV, Steinbeck C (2014) Building blocks for automated elucidation of metabolites: natural product-likeness for candidate ranking. BMC Bioinf 15:234
91.
Zaid H, Raiyn J, Nasser A, Saad B, Rayan A (2010) Physicochemical properties of natural based products versus synthetic chemicals. Open Nutraceuticals J 3:194
92.
Chen Y, Stork C, Hirte S, Kirchmair J (2019) NP-Scout: machine learning approach for the quantification and visualization of the natural product-likeness of small molecules. Biomolecules 9:43
93.
Harvey AL (2008) Natural products in drug discovery. Drug Discov Today 13:894
94.
Tetko IV, Engkvist O, Koch U, Reymond J-L, Chen H (2016) BIGCHEM: challenges and opportunities for big data analysis in chemistry. Mol Inf 35:615
95.
Rollinger JM (2009) Accessing target information by virtual parallel screening – The impact on natural product research. Phytochem Lett 2:53
96.
Rollinger JM, Steindl TM, Schuster D, Kirchmair J, Anrain K, Ellmerer EP, Langer T, Stuppner H, Wutzler P, Schmidtke M (2008) Structure-based virtual screening for the discovery of natural inhibitors for human rhinovirus coat protein. J Med Chem 51:842
97.
Grienke U, Schmidtke M, Kirchmair J, Pfarr K, Wutzler P, Dürrwald R, Wolber G, Liedl KR, Stuppner H, Rollinger JM (2010) Antiviral potential and molecular insight into neuraminidase inhibiting diarylheptanoids from Alpinia katsumadai. J Med Chem 53:778
98.
Leláková V, Šmejkal K, Jakubczyk K, Veselý O, Landa P, Václavík J, Bobáľ P, Pížová H, Temml V, Steinacher T, Schuster D, Granica S, Hanáková Z, Hošek J (2019) Parallel in vitro and in silico investigations into anti-inflammatory effects of non-prenylated stilbenoids. Food Chem 285:431
99.
Rollinger JM, Kratschmar DV, Schuster D, Pfisterer PH, Gumy C, Aubry EM, Brandstötter S, Stuppner H, Wolber G, Odermatt A (2010) 11β-Hydroxysteroid dehydrogenase 1 inhibiting constituents from Eriobotrya japonica revealed by bioactivity-guided isolation and computational approaches. Bioorg Med Chem 18:1507
100.
Fu W, Chen L, Wang Z, Zhao C, Chen G, Liu X, Dai Y, Cai Y, Li C, Zhou J, Liang G (2016) Determination of the binding mode for anti-inflammatory natural product xanthohumol with myeloid differentiation protein 2. Drug Des Dev Ther 10:455
101.
Peng Y, Zhao S, Wu Y, Cao H, Xu Y, Liu X, Shui W, Cheng J, Zhao S, Shen L, Ma J, Quinn RJ, Stevens RC, Zhong G, Liu Z-J (2018) Identification of natural products as novel ligands for the human 5-HT2C receptor. Biophys Rep 4:50
102.
Petersen RK, Christensen KB, Assimopoulou AN, Fretté X, Papageorgiou VP, Kristiansen K, Kouskoumvekaki I (2011) Pharmacophore-driven identification of PPARγ agonists from natural sources. J Comput Aided Mol Des 25:107
103.
Zaheer-ul-Haq ZU, Wellenzohn B, Liedl KR, Rode BM (2003) Molecular docking studies of natural cholinesterase-inhibiting steroidal alkaloids from Sarcococca saligna. J Med Chem 46:5087
104.
Atanasov AG, Wang JN, Gu SP, Bu J, Kramer MP, Baumgartner L, Fakhrudin N, Ladurner A, Malainer C, Vuorinen A, Noha SM, Schwaiger S, Rollinger JM, Schuster D, Stuppner H, Dirsch VM, Heiss EH (2013) Honokiol: a non-adipogenic PPARγ agonist from Nature. Biochim Biophys Acta 1830:4813
105.
Mulholland K, Wu C (2016) Binding of telomestatin to a telomeric G-quadruplex DNA probed by all-atom molecular dynamics simulations with explicit solvent. J Chem Inf Model 56:2093
106.
ETH Zürich (2019) SPiDER Target Prediction Software, 2019. http://modlabcadd.ethz.ch/software/spider/
107.
Schneider P, Schneider G (2017) A computational method for unveiling the target promiscuity of pharmacologically active compounds. Angew Chem Int Ed 56:11520
108.
Nathan Magarvey Lab (2019) Antibiotic’ome, 2019. https://magarveylab.ca/antibioticome/#!/search
109.
Johnston CW, Skinnider MA, Dejong CA, Rees PN, Chen GM, Walker CG, French S, Brown ED, Bérdy J, Liu DY, Magarvey NA (2016) Assembly and clustering of natural antibiotics guides target identification. Nat Chem Biol 12:233
110.
Grienke U, Kaserer T, Pfluger F, Mair CE, Langer T, Schuster D, Rollinger JM (2015) Accessing biological actions of Ganoderma secondary metabolites by in silico profiling. Phytochemistry 114:114
111.
Rollinger JM, Schuster D, Danzl B, Schwaiger S, Markt P, Schmidtke M, Gertsch J, Raduner S, Wolber G, Langer T, Stuppner H (2009) In silico target fishing for rationalized ligand discovery exemplified on constituents of Ruta graveolens. Planta Med 75:195
112.
Gong J, Sun P, Jiang N, Riccio R, Lauro G, Bifulco G, Li T-J, Gerwick WH, Zhang W (2014) New steroids with a rearranged skeleton as (h)P300 inhibitors from the sponge Theonella swinhoei. Org Lett 16:2224
113.
Di Micco S, Pulvirenti L, Bruno I, Terracciano S, Russo A, Vaccaro MC, Ruggiero D, Muccilli V, Cardullo N, Tringali C, Riccio R, Bifulco G (2018) Identification by inverse virtual screening of magnolol-based scaffold as new tankyrase-2 inhibitors. Bioorg Med Chem 26:3953
114.
Rollinger JM, Hornick A, Langer T, Stuppner H, Prast H (2004) Acetylcholinesterase inhibitory activity of scopolin and scopoletin discovered by virtual screening of natural products. J Med Chem 47:6248
115.
Rollinger JM, Haupt S, Stuppner H, Langer T (2004) Combining ethnopharmacology and virtual screening for lead structure discovery: COX-inhibitors as application example. J Chem Inf Comput Sci 44:480
116.
Rollinger JM, Bodensieck A, Seger C, Ellmerer EP, Bauer R, Langer T, Stuppner H (2005) Discovering COX-inhibiting constituents of Morus root bark: activity-guided versus computer-aided methods. Planta Med 71:399
117.
Kratz JM, Mair CE, Oettl SK, Saxena P, Scheel O, Schuster D, Hering S, Rollinger JM (2016) hERG channel blocking ipecac alkaloids identified by combined in silico – in vitro screening. Planta Med 82:1009
118.
Kratz JM, Schuster D, Edtbauer M, Saxena P, Mair CE, Kirchebner J, Matuszczak B, Baburin I, Hering S, Rollinger JM (2014) Experimentally validated hERG pharmacophore models as cardiotoxicity prediction tools. J Chem Inf Model 54:2887
119.
Grienke U, Mihaly-Bison J, Schuster D, Afonyushkin T, Binder M, Guan SH, Cheng CR, Wolber G, Stuppner H, Guo DA, Bochkov VN, Rollinger JM (2011) Pharmacophore-based discovery of FXR-agonists. Part II: identification of bioactive triterpenes from Ganoderma lucidum. Bioorg Med Chem 19:6779
120.
Schuster D, Markt P, Grienke U, Mihaly-Bison J, Binder M, Noha SM, Rollinger JM, Stuppner H, Bochkov VN, Wolber G (2011) Pharmacophore-based discovery of FXR agonists. Part I: model development and experimental validation. Bioorg Med Chem 19:7168
121.
Noha SM, Jazzar B, Kuehnl S, Rollinger JM, Stuppner H, Schaible AM, Werz O, Wolber G, Schuster D (2012) Pharmacophore-based discovery of a novel cytosolic phospholipase A(2)α inhibitor. Bioorg Med Chem Lett 22:1202
122.
Waltenberger B, Wiechmann K, Bauer J, Markt P, Noha SM, Wolber G, Rollinger JM, Werz O, Schuster D, Stuppner H (2011) Pharmacophore modeling and virtual screening for novel acidic inhibitors of microsomal prostaglandin E₂ synthase-1 (mPGES-1). J Med Chem 54:3163
123.
Triballeau N, Acher F, Brabet I, Pin JP, Bertrand HO (2005) Virtual screening workflow development guided by the “receiver operating characteristic” curve approach. Application to high-throughput docking on metabotropic glutamate receptor subtype 4. J Med Chem 48:2534
124.
Choi J, Kim H-J, Jin X, Lim H, Kim S, Roh I-S, Kang H-E, No KT, Sohn H-J (2018) Application of the fragment molecular orbital method to discover novel natural products for prion disease. Sci Rep 8:13063
125.
Kirchweger B, Kratz JM, Ladurner A, Grienke U, Langer T, Dirsch VM, Rollinger JM (2018) In silico workflow for the identification of natural products targeting GPBAR1. Front Chem 6:242
126.
Vuorinen A, Seibert J, Papageorgiou VP, Rollinger JM, Odermatt A, Schuster D, Assimopoulou AN (2015) Pistacia lentiscus oleoresin: virtual screening and identification of masticadienonic and isomasticadienonic acids as inhibitors of 11β-hydroxysteroid dehydrogenase 1. Planta Med 81:525
127.
Fakhrudin N, Ladurner A, Atanasov AG, Heiss EH, Baumgartner L, Markt P, Schuster D, Ellmerer EP, Wolber G, Rollinger JM, Stuppner H, Dirsch VM (2010) Computer-aided discovery, validation, and mechanistic characterization of novel neolignan activators of peroxisome proliferator-activated receptor gamma. Mol Pharmacol 77:559
128.
Duwensee K, Schwaiger S, Tancevski I, Eller K, van Eck M, Markt P, Linder T, Stanzl U, Ritsch A, Patsch JR, Schuster D, Stuppner H, Bernhard D, Eller P (2011) Leoligin, the major lignan from Edelweiss, activates cholesteryl ester transfer protein. Atherosclerosis 219:109
129.
Maindola P, Jamal S, Grover A (2015) Cheminformatics based machine learning models for AMA1-RON2 abrogators for inhibiting Plasmodium falciparum erythrocyte invasion. Mol Inf 34:655
130.
Herrmann FC, Lenz M, Jose J, Kaiser M, Brun R, Schmidt TJ (2015) In silico identification and in vitro activity of novel natural inhibitors of Trypanosoma brucei glyceraldehyde-3-phosphate-dehydrogenase. Molecules 20:16154
131.
Karhu E, Isojärvi J, Vuorela P, Hanski L, Fallarero A (2017) Identification of privileged antichlamydial natural products by a ligand-based strategy. J Nat Prod 80:2602
132.
Diao Y, Jiang J, Zhang S, Li S, Shan L, Huang J, Zhang W, Li H (2018) Discovery of natural products as novel and potent FXR antagonists by virtual screening. Front Chem 6:140
133.
Grienke U, Braun H, Seidel N, Kirchmair J, Richter M, Krumbholz A, von Grafenstein S, Liedl KR, Schmidtke M, Rollinger JM (2014) Computer-guided approach to access the anti-influenza activity of licorice constituents. J Nat Prod 77:563
134.
Noha SM, Atanasov AG, Schuster D, Markt P, Fakhrudin N, Heiss EH, Schrammel O, Rollinger JM, Stuppner H, Dirsch VM, Wolber G (2011) Discovery of a novel IKK-β inhibitor by ligand-based virtual screening techniques. Bioorg Med Chem Lett 21:577
135.
Bauer J, Waltenberger B, Noha SM, Schuster D, Rollinger JM, Boustie J, Chollet M, Stuppner H, Werz O (2012) Discovery of depsides and depsidones from lichen as potent inhibitors of microsomal prostaglandin E₂ synthase-1 using pharmacophore models. ChemMedChem 7:2077
136.
Costa G, Rocca R, Corona A, Grandi N, Moraca F, Romeo I, Talarico C, Gagliardi MG, Ambrosio FA, Ortuso F, Alcaro S, Distinto S, Maccioni E, Tramontano E, Artese A (2019) Novel natural non-nucleoside inhibitors of HIV-1 reverse transcriptase identified by shape- and structure-based virtual screening techniques. Eur J Med Chem 161:1
137.
Carrasco MP, Gut J, Rodrigues T, Ribeiro MHL, Lopes F, Rosenthal PJ, Moreira R, dos Santos DJ (2013) Exploring the molecular basis of Qo bc1 complex inhibitors activity to find novel antimalarials hits. Mol Inf 32:659
138.
Rodrigues T, Ressurreição AS, da Cruz FP, Albuquerque IS, Gut J, Carrasco MP, Gonçalves D, Guedes RC, dos Santos DJ, Mota MM, Rosenthal PJ, Moreira R, Prudêncio M, Lopes F (2013) Flavones as isosteres of 4(1H)-quinolones: discovery of ligand efficient and dual stage antimalarial lead compounds. Eur J Med Chem 69:872
139.
Ikram NKK, Durrant JD, Muchtaridi M, Zalaludin AS, Purwitasari N, Mohamed N, Rahim ASA, Lam CK, Normi YM, Rahman NA, Amaro RE, Wahab HA (2015) A virtual screening approach for identifying plants with anti H5N1 neuraminidase activity. J Chem Inf Model 55:308
140.
Ou T, Hou X, Guan S, Dai J, Han W, Li R, Wang W, Qu X, Zhang M (2016) Targeting AMPK signalling pathway with natural medicines for atherosclerosis therapy: an integration of in silico screening and in vitro assay. Nat Prod Res 30:1240
141.
Su H, Yan J, Xu J, Fan XZ, Sun XL, Chen KY (2015) Stepwise high-throughput virtual screening of Rho kinase inhibitors from natural product library and potential therapeutics for pulmonary hypertension. Pharm Biol 53:1201
142.
Schuster D, Kern L, Hristozov DP, Terfloth L, Bienfait B, Laggner C, Kirchmair J, Grienke U, Wolber G, Langer T (2010) Applications of integrated data mining methods to exploring natural product space for acetylcholinesterase inhibitors. Comb Chem High Throughput Screen 13:54
143.
Cozza G, Bonvini P, Zorzi E, Poletto G, Pagano MA, Sarno S, Donella-Deana A, Zagotto G, Rosolen A, Pinna LA, Meggio F, Moro S (2006) Identification of ellagic acid as potent inhibitor of protein kinase CK2: a successful example of a virtual screening application. J Med Chem 49:2363
144.
Salam NK, Huang TH, Kota BP, Kim MS, Li Y, Hibbs DE (2008) Novel PPAR-gamma agonists identified from a natural product library: a virtual screening, induced-fit docking and biological assay study. Chem Biol Drug Des 71:57
145.
Karaman B, Alhalabi Z, Swyter S, Mihigo SO, Andrae-Marobela K, Jung M, Sippl W, Ntie-Kang F (2018) Identification of bichalcones as sirtuin inhibitors by virtual screening and in vitro testing. Molecules 23:416
146.
Ke Z, Su Z, Zhang X, Cao Z, Ding Y, Cao L, Ding G, Wang Z, Liu H, Xiao W (2017) Discovery of a potent angiotensin converting enzyme inhibitor via virtual screening. Bioorg Med Chem Lett 27:3688
147.
Chan DS-H, Lee H-M, Yang F, Che C-M, Wong CCL, Abagyan R, Leung C-H, Ma DL (2010) Structure-based discovery of natural-product-like TNF-α inhibitors. Angew Chem Int Ed 49:2860
148.
Ma DL, Chan DS, Fu WC, He HZ, Yang H, Yan SC, Leung CH (2012) Discovery of a natural product-like c-myc G-quadruplex DNA groove-binder by molecular docking. PLoS One 7:e43278
149.
Liu Y, Huang L, Ye H, Lv X (2016) Combined QSAR-based virtual screening and fluorescence binding assay to identify natural product mediators of interferon regulatory factor 7 (IRF-7) in pulmonary infection. SAR QSAR Environ Res 27:967
150.
Schmidt TJ, Da Costa FB, Lopes NP, Kaiser M, Brun R (2014) In silico prediction and experimental evaluation of furanoheliangolide sesquiterpene lactones as potent agents against Trypanosoma brucei rhodesiense. Antimicrob Agents Chemother 58:325
151.
Rupp M, Schroeter T, Steri R, Zettl H, Proschak E, Hansen K, Rau O, Schwarz O, Müller-Kuhrt L, Schubert-Zsilavecz M, Müller KR, Schneider G (2010) From machine learning to natural product derivatives that selectively activate transcription factor PPARgamma. ChemMedChem 5:191
152.
Schuster D, Wolber G (2010) Identification of bioactive natural products by pharmacophore-based virtual screening. Curr Pharm Des 16:1666
153.
Doman TN, McGovern SL, Witherbee BJ, Kasten TP, Kurumbail R, Stallings WC, Connolly DT, Shoichet BK (2002) Molecular docking and high-throughput screening for novel inhibitors of protein tyrosine phosphatase-1B. J Med Chem 45:2213
154.
Lyu J, Wang S, Balius TE, Singh I, Levit A, Moroz YS, O'Meara MJ, Che T, Algaa E, Tolmachova K, Tolmachev AA, Shoichet BK, Roth BL, Irwin JJ (2019) Ultra-large library docking for discovering new chemotypes. Nature 566:224
155.
Buck M, Hamilton C (2011) The Nagoya protocol on access to genetic resources and the fair and equitable sharing of benefits arising from their utilization to the Convention on Biological Diversity. Rev Eur Commun Int Environ Law 20:47
156.
Baell JB (2016) Feeling Nature’s PAINS: natural products, natural product drugs, and pan assay interference compounds (PAINS). J Nat Prod 79:616
157.
Scior T, Bender A, Tresadern G, Medina-Franco JL, Martínez-Mayorga K, Langer T, Cuanalo-Contreras K, Agrafiotis DK (2012) Recognizing pitfalls in virtual screening: a critical review. J Chem Inf Model 52:867
158.
Grienke U, Mair CE, Kirchmair J, Schmidtke M, Rollinger JM (2018) Discovery of bioactive natural products for the treatment of acute respiratory infections – an integrated approach. Planta Med 84:684
159.
Chen Y, Garcia de Lomana M, Friedrich N-O, Kirchmair J (2018) Characterization of the chemical space of known and readily obtainable natural products. J Chem Inf Model 58:1518
160.
Feher M, Schmidt JM (2003) Property distributions: differences between drugs, natural products, and molecules from combinatorial chemistry. J Chem Inf Comput Sci 43:218
161.
Clemons PA, Bodycombe NE, Carrinski HA, Wilson JA, Shamji AF, Wagner BK, Koehler AN, Schreiber SL (2010) Small molecules of different origins have distinct distributions of structural complexity that correlate with protein-binding profiles. Proc Natl Acad Sci U S A 107:18787
162.
Veber DF, Johnson SR, Cheng HY, Smith BR, Ward KW, Kopple KD (2002) Molecular properties that influence the oral bioavailability of drug candidates. J Med Chem 45:2615
163.
Mora CA, Halter JG, Adler C, Hund A, Anders H, Yu K, Stark WJ (2016) Application of the Prunus spp. cyanide seed defense system onto wheat: reduced insect feeding and field growth tests. J Agric Food Chem 64:3501
164.
Zhou M, Zhang R-H, Wang M, Xu G-B, Liao S-G (2017) Prodrugs of triterpenoids and their derivatives. Eur J Med Chem 131:222
165.
CDK Knime Nodepit (2019) Sugar Remover, 2019. https://nodepit.com/node/org.openscience.cdk.knime.nodes.sugarremover.SugarRemoverNodeFactory
166.
Nivedha AK, Thieker DF, Makeneni S, Hu H, Woods RJ (2016) Vina-Carb: improving glycosidic angles during carbohydrate docking. J Chem Theory Comput 12:892
167.
Clostre F (1999) Ginkgo biloba extract (EGb 761). State of knowledge in the dawn of the year 2000. Ann Pharm Fr 57(Suppl 1):1S8
168.
Xu R, Zhao W, Xu J, Shao B, Qin G (1996) Studies on bioactive saponins from Chinese medicinal plants. Adv Exp Med Biol 404:371
169.
Cesarone MR, Ricci A, Di Renzo A, Belcaro G, Dugall M (2004) Efficacy of topical treatment with aescin + essential phospholipids gel on capillary fragility. Angiology 55(Suppl 1):S23
170.
Brandley BK, Schnaar RL (1986) Cell-surface carbohydrates in cell recognition and response. J Leukoc Biol 40:97
171.
McBride A, Ghilagaber S, Nikolaev A, Hardie DG (2009) The glycogen-binding domain on the AMPK beta subunit allows the kinase to act as a glycogen sensor. Cell Metab 9:23
172.
Kato K, Ishiwa A (2015) The role of carbohydrates in infection strategies of enteric pathogens. Trop Med Health 43:41
173.
Lorent JH, Quetin-Leclercq J, Mingeot-Leclercq MP (2014) The amphiphilic nature of saponins and their effects on artificial and biological membranes and potential consequences for red blood and cancer cells. Org Biomol Chem 12:8803
174.
Barbič M, Willer EA, Rothenhöfer M, Heilmann J, Fürst R, Jürgenliemk G (2013) Spirostanol saponins and esculin from Rusci Rhizoma reduce the thrombin-induced hyperpermeability of endothelial cells. Phytochemistry 90:106
175.
Sotriffer C (2018) Docking of covalent ligands: challenges and approaches. Mol Inf 37:1800062
176.
Hardman TC, Dubrey SW (2011) Development and potential role of type-2 sodium-glucose transporter inhibitors for management of type 2 diabetes. Diabetes Ther 2:133
177.
Gribble GW (1998) Naturally occurring organohalogen compounds. Acc Chem Res 31:141
178.
Pye CR, Bertin MJ, Lokey RS, Gerwick WH, Linington RG (2017) Retrospective analysis of natural products provides insights for future discovery trends. Proc Natl Acad Sci U S A 114:5601
179.
Lipinski CA, Lombardo F, Dominy BW, Feeney PJ (2001) Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv Drug Deliv Rev 46:3
180.
Vuorinen A, Engeli R, Meyer A, Bachmann F, Griesser UJ, Schuster D, Odermatt A (2014) Ligand-based pharmacophore modeling and virtual screening for the discovery of novel 17β-hydroxysteroid dehydrogenase 2 inhibitors. J Med Chem 57:5995
181.
Vuorinen A, Nashev LG, Odermatt A, Rollinger JM, Schuster D (2014) Pharmacophore model refinement for 11β-hydroxysteroid dehydrogenase inhibitors: search for modulators of intracellular glucocorticoid concentrations. Mol Inf 33:15
182.
Trivedi PJ, Hirschfield GM, Gershwin ME (2016) Obeticholic acid for the treatment of primary biliary cirrhosis. Expert Rev Clin Pharmacol 9:13
183.
Bozorgi M, Memariani Z, Mobli M, Salehi Surmaghi MH, Shams-Ardekani MR, Rahimi R (2013) Five Pistacia species (P. vera, P. atlantica, P. terebinthus, P. khinjuk, and P. lentiscus): a review of their traditional uses, phytochemistry, and pharmacology. Sci World J 2013:219815
184.
Georgiadis I, Karatzas T, Korou LM, Agrogiannis G, Vlachos IS, Pantopoulou A, Tzanetakou IP, Katsilambros N, Perrea DN (2014) Evaluation of Chios mastic gum on lipid and glucose metabolism in diabetic mice. J Med Food 17:393
185.
Zinkernagel RM (1997) Lecture “Immunität gegen Viren” presented at the Symposium “Pharmazie. Die Wissenschaft vom Arzneimittel” of the German and Swiss Pharmaceutical Societies, Zürich, Switzerland, October 02–05