|Description||A chemical structure database providing fast access to over 30 million structures, properties and associated information.|
|Research center||Raleigh, North Carolina, United States|
|License||Creative Commons Attribution Share-alike|
The database contains information on more than 30 million unique molecules from over 450 data sources including:
- EPA DSSTox
- U.S. Food and Drug Administration (FDA)
- Human Metabolome Database
- Journal of Heterocyclic Chemistry
- Nature Chemical Biology
- National Institutes of Health (NIH)
- NINDS Approved Drug Screening Program
- NIST Chemistry WebBook
- Prous Science Drugs of the Future
- R&D Chemicals
- San Diego Center for Chemical Genomics
- SGCOxCompounds, SGCStoCompounds
- Structural Genomics Consortium
- Thomson Pharma
- Total TOSLab Building-Blocks
- Web of Science
The ChemSpider database can be updated with user contributions including chemical structure deposition, spectra deposition and user curation. This is a crowdsourcing approach to develop an online chemistry database. Crowdsourced based curation of the data has produced a dictionary of chemical names associated with chemical structures that has been used in text-mining applications of the biomedical and chemical literature.
However, database rights are not waived and a data dump is not available; in fact, the FAQ even states that only limited downloads are allowed: therefore the right to fork is not guaranteed and the project can't be considered free/open.
A number of available search modules are provided:
- The standard search allows querying for systematic names, trade names and synonyms and registry numbers
- The advanced search allows interactive searching by chemical structure, chemical substructure, using also molecular formula and molecular weight range, CAS numbers, suppliers, etc. The search can be used to widen or restrict already found results.
- Structure searching on mobile devices can be done using free apps for iOS (iPhone/iPod/iPad) and for the Android (operating system).
Chemistry document mark-up
The ChemSpider database has been used in combination with text mining as the basis of chemistry document markup. ChemMantis, the Chemistry Markup And Nomenclature Transformation Integrated System uses algorithms to identify and extract chemical names from documents and web pages and converts the chemical names to chemical structures using name-to-structure conversion algorithms and dictionary look-ups in the ChemSpider database. The result is an integrated system between chemistry documents and information look-up via ChemSpider into over 150 data sources.
ChemSpider was acquired by the Royal Society of Chemistry (RSC) in May, 2009. Prior to the acquisition by RSC, ChemSpider was controlled by a private corporation, ChemZoo Inc. The system was first launched in March 2007 in a beta release form and transitioned to release in March 2008. ChemSpider has expanded the generic support of a chemistry database to include support of the WorldHeritage chemical structure collection via their WiChempedia implementation.
A number of services are made available online. These include the conversion of chemical names to hash table resolver for InChIKeys, shorter hashed forms of InChIs.
SyntheticPages is a free interactive database of University of Bristol), Stephen Caddick (University College London), Peter Scott (University of Warwick) and Dr Max Hammond. In February 2010 a merger was announced with the Royal Society of Chemistry's chemical structure search engine ChemSpider and the formation of ChemSpider|SyntheticPages (CS|SP).
ChemSpider is serving as the chemical compound repository as part of the Open PHACTS project, an Innovative Medicines Initiative. Open PHACTS will deploy an open standards, open access, semantic web approach to address bottlenecks in small molecule drug discovery - disparate information sources, lack of standards and information overload.
- Van Noorden, R. (2012). "Chemistry's web of data expands". Nature 483 (7391): 524.
- Chemical & Engineering News 85 (24). June 11, 2007.
- Ekins, S; Iyer, M; Krasowski, M. D.; Kharasch, E. D. (2008). "Molecular characterization of CYP2B6 substrates". Current drug metabolism 9 (5): 363–73.
- Brumfiel, G. (2008). "Chemists spin a web of data". Nature 453 (7192): 139.
- E. Curry, A. Freitas, and S. O’Riáin, "The Role of Community-Driven Data Curation for Enterprises," in Linking Enterprise Data, D. Wood, Ed. Boston, MA: Springer US, 2010, pp. 25–47.
- Links to Multiple Presentations about ChemSpider
- Identification of Nontargeted Species Using Accurate Mass/Mass Spectrometry Data and ChemSpider
- Williams, A. J. (2011). "Collaborative Computational Technologies for Biomedical Research". p. 363.
- Pence, H. E.; Williams, A. (2010). "ChemSpider: An Online Chemical Information Resource". Journal of Chemical Education 87 (11): 1123.
- Little, J. L.; Williams, A. J.; Pshenichnov, A.; Tkachenko, V. (2011). "Identification of "Known Unknowns" Utilizing Accurate Mass Data and Chem Spider". Journal of the American Society for Mass Spectrometry 23 (1): 179–85.
- Hettne, K. M.; Williams, A. J.; Van Mulligen, E. M.; Kleinjans, J.; Tkachenko, V.; Kors, J. A. (2010). "Automatic vs. Manual curation of a multi-source chemical dictionary: The impact on text mining". Journal of Cheminformatics 2 (1): 3.
- Welcome ChemMantis to ChemZoo and a Call for Contributions from the Community,2008-10-23, A. Williams,blog post
- "RSC acquires ChemSpider". Royal Society of Chemistry. 11 May 2009. Retrieved 2009-05-11.
- "ChemSpider SyntheticPages". ChemSpider SyntheticPages. Royal Society of Chemistry. Retrieved 26 June 2012.
- "ChemSpider and SyntheticPages support synthetic chemistry". RSC Publishing. Royal Society of Chemistry. 2010-02-05. Retrieved 2012-06-26.