G-LoSA Toolkit

Standalone Java Applications

Feature Name Description
Basic classes
(non-executable classes)
Atom defines an atom (atom name, residue name, residue sequence, coordinates, etc)
Molecule defines a molecule (atoms, residues, selected residues, etc)
StructureHandler provides basic methods for structure reading, writing, manipulating, and analysis
NWAlign provides methods for sequence alignment by Needleman-Wunsch algorithm
(http://zhanglab.ccmb.med.umich.edu/NW-align)
PDB structure library preparation BuildLigandBSStructureLibrary builds structure library for ligand, ligand binding site (BS), and protein chain for a given set of PDB files
RemoveRedundantLigandBS removes structurally redundant ligand BS based on sequence identity between protein chains and structure similarity between ligands. Homologous protein chains are identified using a Blast clustering result file (bc-X.out) available in the PDB (http://www.rcsb.org/pdb/statistics/clusterStatistics.do) and ligand similarity is calculated using a path-based FP2 fingerprint in Open Babel
BuildProteinStructureLibrary builds structure library for protein chains for a given set of PDB files
RemoveRedundantProteins removes structurally redundant protein chains based on sequence identity. Homologous protein chains are identified using a Blast clustering result file (bc-X.out) available in the PDB (http://www.rcsb.org/pdb/statistics/clusterStatistics.do)
GenerateCFFiles generates chemical feature (CF) files for a given set of ligand BS structures
Protein surface structure generation GetSurfaceResiduesUsingSurfaceRacer extracts surface residues from a protein structure using SurfaceRacer program (http://pharmacy.mc.uky.edu/faculty/tsodikov/software.php)
GetSurfaceResiduesUsingVisGrid extracts surface residues from a protein structure using VisGrid program (http://kiharalab.org/VisGrid/)
G-LoSA search GLoSASearch performs G-LoSA against ligand BS library for a given query structure
RunPBSJobs submits multiple G-LoSA search jobs
CheckGLoSASearchResults checks if multiple G-LoSA search jobs are successfully completed
MergeGLoSASearchResults merges multiple G-LoSA search results into a single file (score.txt)
GetTopTemplates sorts G-LoSA search result by GA-score and filter the results by a cutoff GA-score or the number of top templates
FilterTemplatesBySeqIdentity filters G-LoSA search result by sequence identity to query protein
GetAlignedTemplates generates ligand and ligand BS structures aligned onto query protein
FilterTemplatesByPositionalOverlapTanimoto filters G-LoSA search results by overlap ratio, which is a Tanimoto coefficient representing the positional overlap between template ligands
Miscellaneous AddPropertyResolution adds X-ray resolution to G-LoSA search result
SplitList splits a list into a set of sub-lists
GeneratePyMolScript generates a PyMol script file (view_struct.pml) to display the structures of query and templates
GetAnnotations generates a html file (annotation.html) to retrieve protein function information for templates
Click here to download G-LoSA Toolkit
>javac *.java (for compiling all java classes)
>java Application_Name [options] (for executing an application)
>java Application_Name or Application_Name --help (for help)

 

Databases

Feature Name Description
Non-redundant small molecule ligand/binding site structure library
(as of July. 2018)
nr-ligand-bs-70 Filtered by 70% protein sequence identity. If two binding sites are homologous and their ligands are not identical, both binding sites were kept in the structure library.