Discussion:
[Rdkit-discuss] MolFromXYZ?
Naeem Attari
2015-01-17 14:57:17 UTC
Permalink
Hi,

Is it possible to generate rdkit mol object only from xyz coordinates?
It would be useful to connect rdkit with other tools.

Was asked earlier: http://sourceforge.net/p/rdkit/mailman/message/31601105/

Thanks
Kind Regards
Shaikh Naeem
Dmitri Maziuk
2015-01-17 17:07:51 UTC
Permalink
Post by Naeem Attari
Hi,
Is it possible to generate rdkit mol object only from xyz coordinates?
It would be useful to connect rdkit with other tools.
If you can't transform your xyz coordinates into pdb ATOM line, then
probably not.

Dima
Greg Landrum
2015-01-19 08:48:04 UTC
Permalink
Hi,

Without information about whether or not atoms are bonded, and the orders
of the bonds between them, there's no sensible way to get an XYZ file into
the RDKit. If you can combine the XYZ file with a SMILES (or some other
format that has bond information), then it should be no problem

-greg
Post by Naeem Attari
Hi,
Is it possible to generate rdkit mol object only from xyz coordinates?
It would be useful to connect rdkit with other tools.
http://sourceforge.net/p/rdkit/mailman/message/31601105/
Thanks
Kind Regards
Shaikh Naeem
------------------------------------------------------------------------------
New Year. New Location. New Benefits. New Data Center in Ashburn, VA.
GigeNET is offering a free month of service with a new server in Ashburn.
Choose from 2 high performing configs, both with 100TB of bandwidth.
Higher redundancy.Lower latency.Increased capacity.Completely compliant.
http://p.sf.net/sfu/gigenet
_______________________________________________
Rdkit-discuss mailing list
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
Greg Landrum
2015-01-19 09:14:31 UTC
Permalink
[adding the list back to the recipient list]
Can't we derive the bond order and connectivity information from distance
matrix (which can be derived from coordinates)?
Assuming all hydrogens and charges are present in the file, the geometry is
a good one, and the structure is made up of octet-complete atoms from the
"organic subset", getting >90% of the bonds correct is almost certainly
doable without too much effort. The last bit will be problematic. If you
remove any of the above constraints, you're left with a research problem.
Here's a recent publication on the topic:
http://pubs.acs.org/doi/abs/10.1021/ci300358c

This isn't an attempt to discourage you from doing an implementation for
your own purposes, just an explanation of why something that is seemingly
simple is not present in the RDKit.

-greg

Loading...