abcd
play

ABCD Daylight User Group Meeting Cambridge, 4.-5.11.04 Application - PowerPoint PPT Presentation

ABCD Daylight User Group Meeting Cambridge, 4.-5.11.04 Application of Daylight Fingerprints to Virtual Screening Uta Lessel Boehringer Ingelheim Pharma GmbH & Co. KG Department of Lead Discovery ABCD Ligand Based Virtual Screening


  1. ABCD Daylight User Group Meeting Cambridge, 4.-5.11.04 Application of Daylight Fingerprints to Virtual Screening Uta Lessel Boehringer Ingelheim Pharma GmbH & Co. KG Department of Lead Discovery

  2. ABCD Ligand Based Virtual Screening Goal: • Selection of subsets with increased hit rates from a data set Procedure: • Looking for compounds similar to known actives • Ranking of data sets with actives and inactives according to decreasing similarities Evaluation: • E.g. determination of enrichment curves

  3. ABCD Study Aim: Comparison of different methods for the search for similar compounds Methods analyzed: • Tanimoto coefficients on the basis of Daylight Fingerprints • Euklidean distances in a 5-dimensional BCUT property space ( R.S. Pearlman, K.M. Smith, Perspectives in Drug Discovery and Design, 9/10/11, 339-353, 1998 ) • Feature Trees ( M. Rarey, J.S. Dixon, J. of Computer-Aided Molecular Design, 12, 471-490, 1998 )

  4. ABCD Data Set 75 5HT 1A agonists 75 H2 antagonists 75 MAO A inhibitors 75 Thrombin inhibitors + ~ 15.000 compounds chosen randomly from MDDR data base Examples shown for the 5HT 1A agonists

  5. ABCD First Step each in turn Query 75 Known Actives Similarity Search 3 Data set Ranked Data Set Enrichment Curve

  6. ABCD Results from First Step 1. Shapes of individual enrichment curves depend on the query, shown for Daylight Fingerprints

  7. ABCD Individual Enrichment Curves - Daylight Fingerprints 5HT - Daylight Fingerprints 120 100 5HT_21 80 % Hits found 5HT_53 RANDOM 60 5HT_01 5HT_31 40 5HT_64 20 0 0 20 40 60 80 100 120 % of data set screened

  8. ABCD Results from First Step 1. Shapes of individual enrichment curves depend on the query Valid for all three methods 2. Shapes of individual enrichment curves depend on the method used for similarity searches, shown for 5HT_57

  9. ABCD Corresponding Results Achieved with Daylight Fingerprints, BCUTs, and FTs 5HT_57 120 100 80 Daylight BCUTs 60 FTs Random 40 20 0 0 20 40 60 80 100 120

  10. ABCD Results from First Step 1. Shapes of individual enrichment curves depend on the query Valid for all three methods 2. Shapes of individual enrichment curves depend on the method used for similarity searches, shown for 5HT_59 3. Ranking of the 3 methods depends on the queries Complementarity?

  11. ABCD Consequences from First Step Global conclusions on this basis questionable! ⇒ Try to reduce variance and / or dependence on the queries ⇒ Analyze complementarity of the methods

  12. ABCD Strategy to Reduce Variance Combination of Queries: 75 x random selection of 3 actives for each combination: • determine distances to all 3 actives for the whole data set • for each compound: select the distance to the nearest of the 3 actives • rank all compounds according to those distances perform this procedure for all 3 methods

  13. ABCD Results for Combinations of 3 Queries Single queries combinations # method 1. Average Average Average comp. SD SD number of # hits # hits hits found Daylight 5.5 2.2 11.1 3.0 increased 75 BCUTs 4.2 3.3 7.4 2.9 2. Relative SD decreased FTs 6.4 3.0 12.1 3.5 3. Trends Daylight 22.2 8.3 30.9 7.0 instead of global 1500 BCUTs 29.1 12.1 35.2 6.6 conclusions FTs 26.4 9.3 34.7 8.2

  14. ABCD Average Enrichment Curves for 75 Combinations of 3 Queries 5HT-1A 120 100 80 % hits found Random Daylight 60 BCUTs FTs 40 20 0 0 20 40 60 80 100 120 % data set screened

  15. ABCD Average Enrichment Curves for 75 Combinations of 3 Queries - Detail 5HT-1A 60 50 % hits found 40 Random Daylight 30 BCUTs FTs 20 10 0 0 2 4 6 8 10 12 % data set screened

  16. ABCD Average Number of Hits Found # comp. screened Heat Map 75 150 300 500 1000 2000 3000 4000 5000 6000 7000 8000 9000 10000 11000 12000 13000 14000 15271 Random Daylight BCUTs FTs

  17. ABCD Nearest Neighbors (Actives) to 5HT_59 O N H 141 N O 38 N O N Daylight NH H N O N 67 H O O 15 O NH O 1 2 O O O N N 99 N 3 N 5 NH NH O Cl O Feature Trees BCUTs

  18. ABCD Overlap Daylight – Feature Trees Average # hits detected by screening x% of the data set x = 0. 5 x = 5 x = 10 15.1 hits found: 33.6 hits found: 43.4 hits found: 7.4 6.6 2.6 4 12.5 10.3 8.5 16.7 23.5 only Feature Trees only Daylight both

  19. ABCD Overlap Daylight - BCUTs Average # hits detected by screening x% of the data set x = 0. 5 x = 5 x = 10 14 hits found: 35.1 hits found: 47 hits found: 2.9 11.8 10.9 11.8 16.1 6.6 4.5 12.4 19.1 only BCUTs only Daylight both Combination of methods - but how?

  20. ABCD Characteristics of Methods BCUTs: • Allow scaffold hopping • Higher percentages of the data set have to be screened to make full use of the method‘s potential Daylight Fingerprints: • Especially useful for the detection of actives from the same structural class • Extremely high enrichments among the very nearest neighbors • High hit rates among nearest neighbors within a Tanimoto threshold

  21. ABCD Similarity Search with Daylight Fingerprints Using a Tanimoto Threshold - Procedure Combined 1. Number of combined queries Act1 query: with any nearest neighbors Act2 within Tanimoto threshold Act3 2. Average hit rate of subsets A 0.95 from queries with any nearest Rank data B 0.83 neighbors within Tanimoto set using C 0.79 threshold Daylight D 0.72 Fingerprints E 0.69 3. Sum of hits and sum of F 0.68 non-hits within all subsets … from all queries

  22. ABCD Similarity Search with Daylight Fingerprints Using a Tanimoto Threshold - Results Tanimoto # Queries Average hit # non- # hits Threshold with NNs rate hits 0.8 73 94.1 % 233 8 0.7 75 88.0 % 387 60 0.6 75 55.6 % 549 602

  23. ABCD Procedure Daylight NN > 0.7 Combined 1 2 Act1 query: 3 4 5 Act2 Act3 6 A 7 8 9 10 Similarity search B 11 12 using BCUTs C 13 … D Ranked data set: E F …

  24. ABCD Average Number of Hits Found 1. Combination better # comp. Daylight Daylight BCUTs Random than BCUTs for screened + BCUTs screening 75 compounds 75 11.1 7.4 9.9 0.4 2. Combination better than both methods for all other cases 3. Single methods as 500 19.9 19.0 21.9 2.4 well as combination clearly superior to 1500 30.9 35.2 39.6 7.1 random selection

  25. ABCD Conclusions • Reasonable enrichments of actives can be achieved using each of the three methods to measure similarity • Results of the three methods are complementary to each other • Daylight Fingerprints show • extremely high enrichments among the very nearest neighbors (actives from the same structural class) • High hit rates among nearest neighbors within a Tanimoto threshold (e.g. 0.8 / 0.7) • BCUT distances allow scaffold hopping, but higher percentages of the data set have to be screened to make full use of the method‘s potential • Feature Trees allow scaffold hopping, but they are also useful for the detection of actives from the same structural class • Improvement of results by combining methods

  26. ABCD Acknowledgements Michael Bieler Bernd Wellenzohn Herbert Köppen

  27. ABCD BACKUP

  28. ABCD Descriptors Generally any kind of descriptors can be used! Diverse Solutions provides BCUT values : diagonal elements contain atomic properties: atom no. : 1 2 3 4 • Gasteiger charges 1 • H-donor and H-acceptor abilities 2 • polarizabilities 3 4 off-diagonal elements reflect connectivity information: 2D, 3D, topological BCUTs for each matrix different BCUT values: • highest and lowest eigen values • set of scaling factors

  29. Clustering of Compounds from Different Activity Classes GPCR ligands Kinase inhibitors Protease inhibitors BCUT values useful for similarity searches / virtual screening?

  30. ABCD Feature Trees Instead of a linear representation of a molecule, the molecule is described by a tree structure representing its major chemical building blocks and the way they are connected. Characteristics: • conformation independent (2.5 D) • fragment based • can handle local similarity

Recommend


More recommend