| PROTEIN-BASED VIRTUAL SCREENING OF A 3D CHEMICAL DATABASE |
|---|
Aim : Virtual screening of a chemical database to identify potential hits
Target : Estrogen receptor (ERa)
A number of X-ray structures of the ligand binding domain of ER-a are available
(at least two containing antagonists: raloxifene, and 4-hydroxy-tamoxifen).
Database : 32 small molecular weight molecules
Rarey, M.; Kramer, B.; Lengauer, T.; Klebe, G. J. Mol. Biol. 1996, 261, 470-89.
Rarey, M.; Kramer, B.; Lengauer, T. Proteins 1999, 34, 17-28.
Briefly, FlexX is a flexible docking tool using an incremental construction algorithm that first
places a base fragment in the active site and then extends it to peripheral fragments according to the
most favorable torsion angles (intramolecular energy) and protein-ligand interactions
(intermolecular energy)
Analysis :
- Docking accuracy (root-mean square deviations of the FlexX pose from the X-ray solution)
- Ranking accuracy, according to seven different scoring functions (FlexX, Gold, Pmf, Dock,
Chemscore, Score)
- Possible use of consensus lists
- Hit rate in the top 5 ligands according to single or consensus scoring
Fig.1 Known ERa antagonists in the database

If you are working from a Windows PC, first read how to setup a X-windows Session with our main Unix machine cmbi6.cmbi.kun.nl.
From the UNIX shell, change directory to data/bioinf4/docking by typing
> cd data/bioinf4/docking
then type
> sybyl The SYBYL menu appears
| Default directory from which SYBYL has been started |
|
The Atom Expression window appears on the screen (Fig.3) 1. Move from atoms to Monomers in the upper left menu 2.In the Substructures menu, select OHT600 after browsing the proposed list and clicking ok. 3.Click the Sets menu, activate the Sphere option, give 6.5 as radius, and confirm by clicking ok 4.click ok |
|
Select in the Display Area menu (left menu on the screen) and undisplay the full protein by disabling the On/Off option (D1 row) of the Mol Display submenu. Quit this window by clicking the Q button. |
|
1. Unselect m1 (by clicking the row), select m2 as the molecule area to work with. 2. Browse the Substructures menu and select OHT600, Confirm the selection by clicking ok 3. click ok to close the window |
|
1. Unselect m1 (by clicking the row), select m2 as the molecule area to work with. 2. Choose the PDB format for saving coordinates 3. 3. give a name to the file to save : active_site.pdb 4. click on save |

|
1. PDB Filename: click the ... button, select 3ert.pdb in the « Files » window. Confirm the selection by clicking ok 2. Active-Site File: click the ... button, select active_site.pdb in the « Files » window. Confirm the selection by clicking ok 3. click OK for exiting the window |
|
1. Ligands from:Select the database file type (mol2). 2. click the ... button, select database.mol2 in the « Files » window. Confirm the selection by clicking ok. 3. Assign formal charges 4. Activate the « FlexX Details » window and modify the default Maximum Number of Poses per Ligand (from 30 to 1). This means that only the top solution will be saved for each ligand. Confirm by clicking ok. 5. activate the Netbatch mode 6. Step 6 should submit the job. For saving time (docking 32 ligands requires ca. 1h cpu time), we assume that the job has been submitted and we will directly analyze the results. |
click the ... button, select dbflexx as jobfile in the « Sub-Directories » window. Confirm the
selection by clicking ok. The window is updated (Fig. 11) and the list of docked ligands with their
binding energy score (FlexX score) is given
![]() |
|
| Fig.10 FlexX answer browser window | Fig.11 Updated FlexX answer browser window |
| Rank | Ligand | FlexX Score |
| kJ/mol | ||
| 1 | NSC__147505 | -81.9 |
| 2 | RALOXIFENE | -72.0 |
| 3 | 4_hydroxy_tamoxifen | -66.4 |
| 4 | EM_343 | -66.2 |
| 5 | ICI_164384 | -63.3 |
| 6 | dihydroxy_tamoxifen | -59.9 |
| 7 | LY_326315 | -57.1 |
| 8 | NSC__506431 | -49.9 |
| 9 | NAFOXIDENE | -48.5 |
| 10 | NSC__152522 | -46.0 |
| 11 | NSC__131754 | -44.5 |
| 12 | NSC__88579 | -41.3 |
| 13 | NSC__74751 | -39.0 |
| 14 | NSC__46215 | -38.4 |
| 15 | NSC__2 | -38.3 |
| 16 | NSC__240424 | -38.1 |
| 17 | NSC__102240 | -36.9 |
| 18 | NSC__618129 | -36.9 |
| 19 | NSC__658337 | -34.9 |
| 20 | NSC__679529 | -34.5 |
| 21 | NSC__208922 | -34.3 |
| 22 | NSC__346517 | -34.1 |
| 23 | NSC__176927 | -32.6 |
| 24 | NSC__163127 | -28.7 |
| 25 | NSC__60047 | -28.5 |
| 26 | NSC__118161 | -28.1 |
| 27 | NSC__636713 | -25.5 |
| 28 | NSC__34379 | -23.9 |
| 29 | NSC__276435 | -23.5 |
| 30 | NSC__382147 | -18.4 |
| 31 | NSC__703010 | -16.3 |
For two ligands (raloxifene, 4-oh-tamoxifen), a protein-ligand X-ray structure is available.
We can then compare the FlexX pose with the X-ray solution.
Load the protein active site:
File >> Read >> active_site.pdb >> No (Do not center the molecule at screen)
Load the ERa-bound X-ray structure of 4-oh-tamoxifen:
File >> Read >> 4_oh_tamoxifen_xray_pdb >> No
Load the FlexX solution for 4-oh- tamoxifen:
File >> Read >>
Select dbflexx, then 4_hydroxy_tamoxifen.mdb in the upper « Sub-directories » menu and then 4-hydroxy-tamoxifen_001.mol2 in the right « Files » menu.
You can color the ligands by:
View >> Color >> Atoms.. >> Select Molecular Area >> All >> OK >> Choose a Color
| Look at analogies and differences between the two poses. |
The 31 hits docked by FlexX will be rescored by 4 other scoring functions:
- Rescoring with the CScoreTM module of Sybyl
The CScoreTM option allows to compute FlexX (F_score), Dock (D-score), Gold (G score), PMF
(P_score) and Chemscore (C_score) scores from a table where the FlexX docked conformations
had previously been saved.
Delete all molecules from the screen:
Build/Edit >> Zap (Delete) Molecule
Load the table:
File >> Molecular Spreadsheet >> New >> Database >>
Select hits.mdb in the right window where all available databases are listed.
Confirm by clicking the Open button.
A new spreadsheet entitled HITS is displayed on the screen (Fig. 11)

Fig.11 Sybyl Molecular Spreadsheet
The 31 molecules docked by FlexX are here listed by alphabetical order. To run CScore :
type at the SYBYL shell:
| Sybyl>> cscore[ENTER] | !! Warning |
| Associated receptor mol2 file: protein.mol2 | Commands to execute in the |
| Row expression : *[ENTER} | bottom shell |


File (Spreadsheet menu) >> Put Rows into Molecule Areas
| The docked conformation of NSC_152522 is displayed. Load the protein and 4-oh- |
| tamoxifen_xray as references (recall earlier). The two phenolic groups are H-bonded to Glu353 and |
| Thr347 side chains and fit very well the same two moieties of 4-oh-tamoxifen. The 2 aromatic |
| rings are also well superimposed. Thus, this new compound might be a putative true hit. You can |
| label the protein residues by View >> Label >> Substructure >> All |

| Look at the performance of the five scoring functions (ranking of known ligands ) |

Built/Edit >> Zap (Delete) Molecule
View >> Delete All Backgrounds
File (Spreadsheet menu) >> Close >> No
Exit sybyl:
Sybyl > exit [ENTER]
!! Warning: Command to execute in the bottom blue shell