G-LoSA Toolkit
Standalone Java Applications
| Feature | Name | Description |
| Basic classes (non-executable classes) |
Atom | defines an atom (atom name, residue name, residue sequence, coordinates, etc) |
| Molecule | defines a molecule (atoms, residues, selected residues, etc) | |
| StructureHandler | provides basic methods for structure reading, writing, manipulating, and analysis | |
| NWAlign |
provides methods for sequence alignment by Needleman-Wunsch algorithm (http://zhanglab.ccmb.med.umich.edu/NW-align) |
|
| PDB structure library preparation | BuildLigandBSStructureLibrary | builds structure library for ligand, ligand binding site (BS), and protein chain for a given set of PDB files |
| RemoveRedundantLigandBS | removes structurally redundant ligand BS based on sequence identity between protein chains and structure similarity between ligands. Homologous protein chains are identified using a Blast clustering result file (bc-X.out) available in the PDB (http://www.rcsb.org/pdb/statistics/clusterStatistics.do) and ligand similarity is calculated using a path-based FP2 fingerprint in Open Babel | |
| BuildProteinStructureLibrary | builds structure library for protein chains for a given set of PDB files | |
| RemoveRedundantProteins | removes structurally redundant protein chains based on sequence identity. Homologous protein chains are identified using a Blast clustering result file (bc-X.out) available in the PDB (http://www.rcsb.org/pdb/statistics/clusterStatistics.do) | |
| GenerateCFFiles | generates chemical feature (CF) files for a given set of ligand BS structures | |
| Protein surface structure generation | GetSurfaceResiduesUsingSurfaceRacer | extracts surface residues from a protein structure using SurfaceRacer program (http://pharmacy.mc.uky.edu/faculty/tsodikov/software.php) |
| GetSurfaceResiduesUsingVisGrid | extracts surface residues from a protein structure using VisGrid program (http://kiharalab.org/VisGrid/) | |
| G-LoSA search | GLoSASearch | performs G-LoSA against ligand BS library for a given query structure |
| RunPBSJobs | submits multiple G-LoSA search jobs | |
| CheckGLoSASearchResults | checks if multiple G-LoSA search jobs are successfully completed | |
| MergeGLoSASearchResults | merges multiple G-LoSA search results into a single file (score.txt) | |
| GetTopTemplates | sorts G-LoSA search result by GA-score and filter the results by a cutoff GA-score or the number of top templates | |
| FilterTemplatesBySeqIdentity | filters G-LoSA search result by sequence identity to query protein | |
| GetAlignedTemplates | generates ligand and ligand BS structures aligned onto query protein | |
| FilterTemplatesByPositionalOverlapTanimoto | filters G-LoSA search results by overlap ratio, which is a Tanimoto coefficient representing the positional overlap between template ligands | |
| Miscellaneous | AddPropertyResolution | adds X-ray resolution to G-LoSA search result |
| SplitList | splits a list into a set of sub-lists | |
| GeneratePyMolScript | generates a PyMol script file (view_struct.pml) to display the structures of query and templates | |
| GetAnnotations | generates a html file (annotation.html) to retrieve protein function information for templates |
| Click here to download G-LoSA Toolkit |
| >javac *.java (for compiling all java classes) |
| >java Application_Name [options] (for executing an application) |
| >java Application_Name or Application_Name --help (for help) |
Databases
| Feature | Name | Description |
| Non-redundant small molecule ligand/binding site structure library (as of July. 2018) |
nr-ligand-bs-70 | Filtered by 70% protein sequence identity. If two binding sites are homologous and their ligands are not identical, both binding sites were kept in the structure library. |
