I  U  P  A  C






News & Notices

Organizations & People

Standing Committees

Divisions

Projects

Reports

Publications

Symposia

AMP

Links of Interest

Search the Site

Home Page

 

 

 The IUPAC International Chemical Identifier
(InChITM)

InChI 1.02beta Software Release,
introducing InChIKey

A beta-release of InChI version 1.02 software was issued in September 2007. The complete package, downloadable from www.iupac.org/inchi/download/ contains the following

  • source code and Application Program Interface (API)
  • stand-alone executable (cInChI)
  • description of new features, with examples of using new functionality
  • copy of GNU LGPL licence

Full InChI documentation, Windows executable (wInChI.exe), and validation set will be included in the final version 1.02 release.

The principal new features of this release are:

(1) A fixed-length (25-character) condensed digital representation of the Identifier to be known as InChIKey. In particular, this will

  • facilitate web searching, previously complicated by unpredictable breaking of InChI character strings by search engines
  • allow development of a web-based InChI lookup service
  • permit an InChI representation to be stored in fixed length fields
  • make chemical structure database indexing easier
  • allow verification of InChI strings after network transmission.

An example of InChI with its InChKey equivalent is shown below. There is a finite, but very small probability of finding two structures with the same InChIKey. For duplication of only the first block of 14 characters this is 1.3% in 109, equivalent to a single collision in one of 75 databases of 109 compounds each.


Caffeine:
InChI=1/C8H10N4O2/c1-10-4-9-6-5(10)7(13)12(3)8(14)11(6)2/h4H,1-3H3
InChIKey=RYYVLZVUVIJVGH-UHFFFAOYAW
First block (14 letters), encodes molecular skeleton (connectivity): RYYVLZVUVIJVGH
Second block (8 letters), encodes proton positions (tautomers), stereochemistry, isotopes, reconnected layer: UHFFFAOY
Flag character, indicates InChI version, presence/absence of fixed H layer, isotopes, and stereochemistry: A
Check character: W



(2) Restructured InChI generating software that separates key steps in its creation from an input chemical structure file. Among other uses, this allows checking of intermediate results to enable easier testing and development of InChI-based applications.

(3) Bug fixes designed to withstand malicious attempts to attack a Web server by providing a specially designed InChI string input to InChI binaries.

Users are encouraged to report their experiences and any problems via the SourceForge website (http://sourceforge.net/projects/inchi).

Steve Heller
Alan McNaught
Igor Pletnev
Steve Stein
Dmitrii Tchekhovskoi

September 2007

> back to inchi

 

<announcement published in Chem. Int. Nov 2007>

> link to project: www.iupac.org/projects/2004/2004-039-1-800.html

> for a review of what is InChI and what it can be used for, see 'Tools of the Trade' published in Chem Int Nov '06

 


Page last modified 28 November 2007.
Copyright ©1997- 2007 International Union of Pure and Applied Chemistry.
Questions or comments about IUPAC, please contact, the Secretariat.
Questions regarding the website, please contact Web Help.