Cheminformatics toolkits
Cheminformatics toolkits are software development kits that allow cheminformaticians to develop custom computer applications for use in virtual screening, chemical database mining, and structure-activity studies. Toolkits are often used for experimentation with new methodologies. Their most important functions deal with the manipulation of chemical structures and comparisons between structures. Programmatic access is provided to properties of individual bonds and atoms.
Functionality
Toolkits provide the following functionality:- Read and save structures in various chemistry file formats.
- Determine if one structure is a substructure of another.
- Determine if two structures are equal.
- Identification of substructures common to structures in a set.
- Disassemble molecules, splitting into fragments.
- Assemble molecules from elements or submolecules.
- Apply reactions on input reactant structures, resulting in output of reaction product structures.
- Generate molecular fingerprints. Fingerprints are bit-vectors where individual bits correspond to the presence or absence of structural features. The most important use of fingerprints is in indexing of chemistry databases.
List of notable cheminformatics toolkits
Name | License | APIs | Home Page | Notes |
CDK | Open source | Java | http://sourceforge.net/projects/cdk/ | |
ChemmineR | Open source | R, C++ | http://manuals.bioinformatics.ucr.edu/home/chemminer | |
Enalos KNIME nodes | Open source | KNIME | http://tech.knime.org/community/enalos-nodes | |
Enalos+ KNIME nodes | Proprietary | KNIME | http://enalosplus.novamechanics.com/ | |
Indigo | Open source | C, C#, Java, Python | http://lifescience.opensource.epam.com/indigo | |
MolEngine | Proprietary | .NET | http://www.scilligence.com | |
Molecular Operating Environment | Proprietary | Scientific Vector Language | https://web.archive.org/web/20160909172415/http://www.chemcomp.com/MOE-Cheminformatics_and_QSAR.htm | |
OpenBabel | Open source | C, Python, Ruby | http://openbabel.org/ | , |
Helium | Open source | C++ | https://web.archive.org/web/20140407082845/http://www.moldb.net/helium.html | |
RDKit | Open source | Python, C++, Java, Knime | http://www.rdkit.org/ | |
Rcpi | Open source | R | https://bioconductor.org/packages/Rcpi | |
frowns | Open source | Python | http://frowns.sourceforge.net/ | |
OUCH | Open source | Haskell | http://www.pharmash.com/posts/2010-08-02-ouch.html | |
chemf | Open source | Scala | https://github.com/stefan-hoeck/chemf | |
3D-e-Chem | Open source | Python, Java, Knime | https://3d-e-chem.github.io/ | |
SMSD | Creative Commons Attribution | Java | http://www.ebi.ac.uk/thornton-srv/software/SMSD/ | |
Accord SDK | Proprietary | VBA,.NET, PL/SQL | http://accelrys.com/products/datasheets/accord-software-development-kit.pdf | |
CACTVS | Proprietary, free for academia, personal use, public web services | Tcl, C, C++, Python, Knime | http://www.xemistry.com/academic | |
Daylight | Proprietary | C, C++, Java, Fortran | http://www.daylight.com/products/toolkit.html | |
OEChem | Proprietary, free for academia | C++, Python, C#, Java | http://eyesopen.com/ | |
Marvin, JChem | Proprietary, free for academia | Java,.NET, Javascript | http://www.chemaxon.com | |
ChemDoodle API | Proprietary | Java, Javascript | http://www.ichemlabs.com | |
PerlMol | Open source | Perl | https://web.archive.org/web/20120315121757/http://www.perlmol.org/ | |
ADMET Predictor, MedChem Studio, MedChem Designer | Proprietary, free for qualifying academics | C++, KNIME, Pipeline Pilot | http://www.simulations-plus.com | |
CDD Vault | Proprietary, free for CDD Public read-only data | CDD Vault | https://www.collaborativedrug.com/cdd-vault | |
MolecularGraph.jl | MIT License | Julia | https://github.com/mojaie/MolecularGraph.jl |