Structure Seer – a machine learning model for chemical structure elucidation from node labelling of a molecular graph

Publication information

Publication date 2023-12-20
DOI 10.1039/D3DD00178D
Impact factor 0
Authors

Joseph C. Bear



Abstract

The identification of a compound's chemical structure remains one of the most crucial everyday tasks in chemistry. Among the vast range of existing analytical techniques, NMR spectroscopy remains one of the most powerful tools. As a step towards structure prediction from experimental NMR spectra, this article introduces a novel machine learning (ML) model, Structure Seer, designed to provide a quantitative probabilistic prediction of the connectivity of the atoms. The prediction is based on the elemental composition of the molecule and a list of atom-attributed isotropic shielding constants obtained via quantum chemical methods based on a Hartree–Fock calculation. Using shielding constants instead of NMR chemical shifts helps overcome challenges linked to the relatively limited size of datasets comprising reliably measured spectra. Additionally, the approach holds significant potential for scalability, as it can harness vast amounts of information on known chemical structures for the model's learning process. A comprehensive evaluation was conducted of the model trained on the QM9 dataset and a custom dataset derived from the PubChem database. The trained model was shown to predict up to 100% of the bonds accurately for selected compounds from the QM9 dataset, with an average accuracy of 37.5% for predicted bonds in the test fold. The application of the model to the tasks of NMR peak attribution, structure prediction and identification is discussed, along with prospective strategies for interpreting predictions, such as similarity searches and ranking of isomeric structures.
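
To make the idea of scoring a probabilistic connectivity prediction concrete, the following is a minimal sketch and not the authors' implementation. It assumes the prediction can be represented as a symmetric matrix of pairwise bond probabilities and that a bond counts as predicted when its probability reaches a threshold. The function names, the 0.5 threshold, and the definition of accuracy as the fraction of true bonds recovered are all illustrative assumptions; the abstract does not specify how its per-bond accuracy is computed.

```python
from typing import Sequence, Set, Tuple

Matrix = Sequence[Sequence[float]]
Bond = Tuple[int, int]


def predicted_bonds(prob: Matrix, threshold: float = 0.5) -> Set[Bond]:
    """Atom pairs (i, j), i < j, whose predicted bond probability meets the threshold.

    The threshold value is a hypothetical choice for illustration.
    """
    n = len(prob)
    return {(i, j) for i in range(n) for j in range(i + 1, n)
            if prob[i][j] >= threshold}


def true_bonds(adj: Matrix) -> Set[Bond]:
    """Atom pairs (i, j), i < j, that are bonded in the reference adjacency matrix."""
    n = len(adj)
    return {(i, j) for i in range(n) for j in range(i + 1, n) if adj[i][j] > 0}


def bond_accuracy(prob: Matrix, adj: Matrix, threshold: float = 0.5) -> float:
    """Fraction of reference bonds recovered by the thresholded prediction.

    This definition is an assumption made for the sketch.
    """
    truth = true_bonds(adj)
    if not truth:
        return 0.0
    return len(truth & predicted_bonds(prob, threshold)) / len(truth)


# Toy example: a three-atom chain (0-1-2), such as the heavy atoms of ethanol.
reference = [
    [0, 1, 0],
    [1, 0, 1],
    [0, 1, 0],
]
# Hypothetical model output: confident about bond 0-1, unsure about bond 1-2.
probabilities = [
    [0.0, 0.9, 0.2],
    [0.9, 0.0, 0.4],
    [0.2, 0.4, 0.0],
]

print(bond_accuracy(probabilities, reference))        # one of two bonds recovered
print(bond_accuracy(probabilities, reference, 0.3))   # both bonds recovered
```

Lowering the threshold recovers more true bonds but also admits more spurious ones, which is one reason a ranking or similarity search over candidate structures, as mentioned in the abstract, may be preferable to a single hard cut-off.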
