Multipolar Atom Types from Theory and Statistical Clustering (MATTS) Data Bank: Impact of Surrounding Atoms on Electron Density from Cluster Analysis
The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM p...
Saved in:
Published in | Journal of chemical information and modeling Vol. 62; no. 16; pp. 3766 - 3783 |
---|---|
Main Authors | , , , |
Format | Journal Article |
Language | English |
Published |
Washington
American Chemical Society
22.08.2022
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Abstract | The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM parameters specific for atom types in proteins, nucleic acids, and organic molecules. However, it was not fully understood how the electron density of particular atoms responds to their surroundings and which factors describe the electron density in molecules within the MM. In this work, by applying clustering using descriptors available in the MATTS data bank, that is, topology and multipole parameters, we found the topology features with the biggest impact on the multipole parameters: the element of the central atom, the number of first neighbors, and planarity of the group. The similarities in the spatial distribution of electron density between and within atom type classes revealed distinct and unique atom types. The quality of existing types can be improved by adding better parametrization, definitions, and local coordinate systems. Future development of the MATTS data bank should lead to a wider range of atom types necessary to construct the electron density of any molecule. |
---|---|
AbstractList | The multipole model (MM) uses an aspherical approach
to describe
electron density and can be used to interpret data from X-ray diffraction
in a more accurate manner than using the spherical approximation.
The MATTS (multipolar atom types from theory and statistical clustering)
data bank gathers MM parameters specific for atom types in proteins,
nucleic acids, and organic molecules. However, it was not fully understood
how the electron density of particular atoms responds to their surroundings
and which factors describe the electron density in molecules within
the MM. In this work, by applying clustering using descriptors available
in the MATTS data bank, that is, topology and multipole parameters,
we found the topology features with the biggest impact on the multipole
parameters: the element of the central atom, the number of first neighbors,
and planarity of the group. The similarities in the spatial distribution
of electron density between and within atom type classes revealed
distinct and unique atom types. The quality of existing types can
be improved by adding better parametrization, definitions, and local
coordinate systems. Future development of the MATTS data bank should
lead to a wider range of atom types necessary to construct the electron
density of any molecule. The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM parameters specific for atom types in proteins, nucleic acids, and organic molecules. However, it was not fully understood how the electron density of particular atoms responds to their surroundings and which factors describe the electron density in molecules within the MM. In this work, by applying clustering using descriptors available in the MATTS data bank, that is, topology and multipole parameters, we found the topology features with the biggest impact on the multipole parameters: the element of the central atom, the number of first neighbors, and planarity of the group. The similarities in the spatial distribution of electron density between and within atom type classes revealed distinct and unique atom types. The quality of existing types can be improved by adding better parametrization, definitions, and local coordinate systems. Future development of the MATTS data bank should lead to a wider range of atom types necessary to construct the electron density of any molecule. |
Author | Dominiak, Paulina Maria Rybicka, Paulina Maria Chodkiewicz, Michał Leszek Kulik, Marta |
AuthorAffiliation | Biological and Chemical Research Centre, Department of Chemistry |
AuthorAffiliation_xml | – name: Biological and Chemical Research Centre, Department of Chemistry |
Author_xml | – sequence: 1 givenname: Paulina Maria orcidid: 0000-0003-0087-6208 surname: Rybicka fullname: Rybicka, Paulina Maria – sequence: 2 givenname: Marta orcidid: 0000-0003-2381-7665 surname: Kulik fullname: Kulik, Marta email: m.kulik@uw.edu.pl – sequence: 3 givenname: Michał Leszek orcidid: 0000-0003-4448-445X surname: Chodkiewicz fullname: Chodkiewicz, Michał Leszek – sequence: 4 givenname: Paulina Maria orcidid: 0000-0002-1466-1243 surname: Dominiak fullname: Dominiak, Paulina Maria email: pdomin@chem.uw.edu.pl |
BookMark | eNp1kU9v1DAQxS1URP_AnaMlLkViFzuOs2sOSMu2QKVWHHaRuFmOM2m9OHawHaR8kH7fOt0tEkic_Kz5zbNn3ik6ct4BQq8pmVNS0PdKx_lOm25eaEJoyZ-hE8pLMRMV-XH0pLmojtFpjDtCGBNV8QIdMy5KtmDiBN3fDDaZ3lsV8Cr5Dm_HHiJuwyTvwIcRK9fgTVLJxGS0snhth5ggGHeLz29W2-3mLb5QSeFPyv38gK-6XumEfYs3Qwh-cM0ETtYRe4cvLegUsrgAF00a9y8dLPHKKTtGE1-i562yEV4dzjP0_fPldv11dv3ty9V6dT1TJVumWQH1kgNdAqUUFG91nrVu21rUNS9AEc41E6SkVJEm35oFryhbLgUQWuiC1OwMfdz79kPdQaPBpaCs7IPpVBilV0b-XXHmTt7631KUed-kygbnB4Pgfw0Qk-xM1GCtcuCHKItFXjrlgk_om3_QnR9CHviRqsqKLQqaKbKndPAxBmj_fIYSOWUuc-ZyylweMs8t7_Ytj5Unz__iD6Gfsh4 |
CitedBy_id | crossref_primary_10_1016_j_csbj_2022_10_018 crossref_primary_10_1107_S2053229624005357 crossref_primary_10_3390_cryst14010077 crossref_primary_10_1021_acs_jcim_2c00144 |
Cites_doi | 10.1021/ct6001994 10.1107/S0567739478001886 10.1107/S0021889808033384 10.1107/S0108767312008197 10.1107/S1600576716011274 10.1107/S2053273319015304 10.1021/cg200971f 10.1021/j100016a071 10.1002/ijch.197700020 10.1107/S0108767310049731 10.1109/BigDataService.2018.00042 10.1080/0889311X.2020.1853712 10.1107/S0021889803017825 10.1145/3068335 10.1107/s0108767311042176 10.1021/acs.jctc.8b00640 10.1107/S1399004714002351 10.1002/jcc.20084 10.1107/S0907444909009433 10.1107/S2053273319000482 10.1107/s0907444906013072 10.1021/cg300129z 10.1021/acs.jcim.1c00227 10.1107/S0108767302010991 10.1016/j.bpj.2014.12.020 10.1021/ar970352j 10.1107/S0021889810042482 10.1107/S160057672100580X 10.1002/anie.200353596 10.1107/S2059798322005836 10.1002/jcc.22996 10.1107/S0108768191003932 10.1093/oso/9780195098235.001.0001 10.1107/S2052520616003954 10.1107/S2052519213002285 10.1145/2064676.2064695 10.1107/S1600576719014092 10.1145/276698.276876 10.1007/978-3-642-37431-9_44 10.1021/jp0379796 10.1021/ct4011129 10.1002/jcc.20035 10.1021/acs.jcim.2c00144 |
ContentType | Journal Article |
Copyright | 2022 The Authors. Published by American Chemical Society Copyright American Chemical Society Aug 22, 2022 2022 The Authors. Published by American Chemical Society 2022 The Authors |
Copyright_xml | – notice: 2022 The Authors. Published by American Chemical Society – notice: Copyright American Chemical Society Aug 22, 2022 – notice: 2022 The Authors. Published by American Chemical Society 2022 The Authors |
DBID | AAYXX CITATION 7SC 7SR 7U5 8BQ 8FD JG9 JQ2 L7M L~C L~D 7X8 5PM |
DOI | 10.1021/acs.jcim.2c00145 |
DatabaseName | CrossRef Computer and Information Systems Abstracts Engineered Materials Abstracts Solid State and Superconductivity Abstracts METADEX Technology Research Database Materials Research Database ProQuest Computer Science Collection Advanced Technologies Database with Aerospace Computer and Information Systems Abstracts Academic Computer and Information Systems Abstracts Professional MEDLINE - Academic PubMed Central (Full Participant titles) |
DatabaseTitle | CrossRef Materials Research Database Engineered Materials Abstracts Technology Research Database Computer and Information Systems Abstracts – Academic ProQuest Computer Science Collection Computer and Information Systems Abstracts Solid State and Superconductivity Abstracts Advanced Technologies Database with Aerospace METADEX Computer and Information Systems Abstracts Professional MEDLINE - Academic |
DatabaseTitleList | Materials Research Database |
DeliveryMethod | fulltext_linktorsrc |
Discipline | Chemistry |
EISSN | 1549-960X |
EndPage | 3783 |
ExternalDocumentID | 10_1021_acs_jcim_2c00145 c473759231 |
GrantInformation_xml | – fundername: ; grantid: UMO-2017/27/B/ST4/02721 |
GroupedDBID | - 4.4 55A 5GY 5VS 7~N AABXI ABFLS ABFRP ABMVS ABUCX ACGFS ACIWK ACNCT ACS ADHLV AEESW AENEX AFEFF AHGAQ ALMA_UNASSIGNED_HOLDINGS AQSVZ D0L DU5 EBS ED F5P GGK GNL IH9 JG P2P PQEST PQQKQ RNS ROL UI2 VF5 VG9 W1F X --- -~X AAYXX ABJNI ABQRX CITATION CUPRZ ED~ JG~ 7SC 7SR 7U5 8BQ 8FD JG9 JQ2 L7M L~C L~D 7X8 5PM |
ID | FETCH-LOGICAL-a438t-2eb85e18e111ea5fc959bffb9bb52ea055c390411a0da05d75613889e012c20b3 |
IEDL.DBID | ACS |
ISSN | 1549-9596 |
IngestDate | Tue Sep 17 21:17:36 EDT 2024 Sat Aug 17 02:21:43 EDT 2024 Thu Oct 10 16:40:26 EDT 2024 Fri Aug 23 01:40:32 EDT 2024 Wed Aug 24 06:44:50 EDT 2022 |
IsDoiOpenAccess | true |
IsOpenAccess | true |
IsPeerReviewed | true |
IsScholarly | true |
Issue | 16 |
Language | English |
License | Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
LinkModel | DirectLink |
MergedId | FETCHMERGED-LOGICAL-a438t-2eb85e18e111ea5fc959bffb9bb52ea055c390411a0da05d75613889e012c20b3 |
Notes | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23 |
ORCID | 0000-0003-0087-6208 0000-0003-2381-7665 0000-0002-1466-1243 0000-0003-4448-445X |
OpenAccessLink | https://pubmed.ncbi.nlm.nih.gov/PMC9400106 |
PMID | 35943739 |
PQID | 2706463721 |
PQPubID | 28739 |
PageCount | 18 |
ParticipantIDs | pubmedcentral_primary_oai_pubmedcentral_nih_gov_9400106 proquest_miscellaneous_2700315956 proquest_journals_2706463721 crossref_primary_10_1021_acs_jcim_2c00145 acs_journals_10_1021_acs_jcim_2c00145 |
PublicationCentury | 2000 |
PublicationDate | 2022-08-22 |
PublicationDateYYYYMMDD | 2022-08-22 |
PublicationDate_xml | – month: 08 year: 2022 text: 2022-08-22 day: 22 |
PublicationDecade | 2020 |
PublicationPlace | Washington |
PublicationPlace_xml | – name: Washington |
PublicationTitle | Journal of chemical information and modeling |
PublicationTitleAlternate | J. Chem. Inf. Model |
PublicationYear | 2022 |
Publisher | American Chemical Society |
Publisher_xml | – name: American Chemical Society |
References | ref9/cit9 ref45/cit45 ref6/cit6 ref36/cit36 ref3/cit3 ref27/cit27 Coppens P. (ref1/cit1) 1997 ref11/cit11 ref25/cit25 ref16/cit16 ref29/cit29 ref32/cit32 ref23/cit23 ref39/cit39 ref14/cit14 ref8/cit8 ref5/cit5 ref31/cit31 ref2/cit2 ref43/cit43 ref34/cit34 ref37/cit37 ref28/cit28 ref40/cit40 ref20/cit20 ref17/cit17 ref10/cit10 ref26/cit26 ref35/cit35 ref19/cit19 ref21/cit21 ref12/cit12 ref15/cit15 ref42/cit42 ref46/cit46 ref41/cit41 ref22/cit22 ref13/cit13 ref33/cit33 ref4/cit4 ref30/cit30 ref47/cit47 ref24/cit24 ref38/cit38 ref44/cit44 ref7/cit7 |
References_xml | – ident: ref9/cit9 doi: 10.1021/ct6001994 – ident: ref27/cit27 doi: 10.1107/S0567739478001886 – ident: ref33/cit33 doi: 10.1107/S0021889808033384 – ident: ref6/cit6 doi: 10.1107/S0108767312008197 – ident: ref22/cit22 doi: 10.1107/S1600576716011274 – ident: ref20/cit20 doi: 10.1107/S2053273319015304 – ident: ref26/cit26 doi: 10.1021/cg200971f – ident: ref5/cit5 doi: 10.1021/j100016a071 – ident: ref36/cit36 doi: 10.1002/ijch.197700020 – ident: ref12/cit12 doi: 10.1107/S0108767310049731 – ident: ref40/cit40 doi: 10.1109/BigDataService.2018.00042 – ident: ref35/cit35 doi: 10.1080/0889311X.2020.1853712 – ident: ref32/cit32 doi: 10.1107/S0021889803017825 – ident: ref39/cit39 doi: 10.1145/3068335 – ident: ref10/cit10 doi: 10.1107/s0108767311042176 – ident: ref45/cit45 – ident: ref25/cit25 doi: 10.1021/acs.jctc.8b00640 – ident: ref19/cit19 doi: 10.1107/S1399004714002351 – ident: ref41/cit41 doi: 10.1002/jcc.20084 – ident: ref17/cit17 doi: 10.1107/S0907444909009433 – ident: ref31/cit31 doi: 10.1107/S2053273319000482 – ident: ref15/cit15 doi: 10.1107/s0907444906013072 – ident: ref13/cit13 doi: 10.1021/cg300129z – ident: ref46/cit46 doi: 10.1021/acs.jcim.1c00227 – ident: ref7/cit7 doi: 10.1107/S0108767302010991 – ident: ref16/cit16 doi: 10.1016/j.bpj.2014.12.020 – ident: ref14/cit14 doi: 10.1021/ar970352j – ident: ref43/cit43 doi: 10.1107/S0021889810042482 – ident: ref21/cit21 doi: 10.1107/S160057672100580X – ident: ref3/cit3 doi: 10.1002/anie.200353596 – ident: ref47/cit47 doi: 10.1107/S2059798322005836 – ident: ref34/cit34 doi: 10.1002/jcc.22996 – ident: ref2/cit2 doi: 10.1107/S0108768191003932 – volume-title: X-Ray Charge Densities and Chemical Bonding year: 1997 ident: ref1/cit1 doi: 10.1093/oso/9780195098235.001.0001 contributor: fullname: Coppens P. – ident: ref30/cit30 doi: 10.1107/S2052520616003954 – ident: ref4/cit4 doi: 10.1107/S2052519213002285 – ident: ref28/cit28 doi: 10.1145/2064676.2064695 – ident: ref44/cit44 doi: 10.1107/S1600576719014092 – ident: ref29/cit29 doi: 10.1145/276698.276876 – ident: ref37/cit37 doi: 10.1007/978-3-642-37431-9_44 – ident: ref8/cit8 doi: 10.1021/jp0379796 – ident: ref24/cit24 doi: 10.1021/ct4011129 – ident: ref42/cit42 – ident: ref23/cit23 doi: 10.1002/jcc.20035 – ident: ref11/cit11 doi: 10.1021/acs.jcim.2c00144 – ident: ref38/cit38 |
SSID | ssj0033962 |
Score | 2.457027 |
Snippet | The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate... The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate... |
SourceID | pubmedcentral proquest crossref acs |
SourceType | Open Access Repository Aggregation Database Publisher |
StartPage | 3766 |
SubjectTerms | Cluster analysis Clustering Computational Chemistry Coordinates Data banks Electron density Electrons Multipoles Nucleic acids Organic chemistry Parameterization Parameters Spatial distribution Topology |
Title | Multipolar Atom Types from Theory and Statistical Clustering (MATTS) Data Bank: Impact of Surrounding Atoms on Electron Density from Cluster Analysis |
URI | http://dx.doi.org/10.1021/acs.jcim.2c00145 https://www.proquest.com/docview/2706463721 https://search.proquest.com/docview/2700315956 https://pubmed.ncbi.nlm.nih.gov/PMC9400106 |
Volume | 62 |
hasFullText | 1 |
inHoldings | 1 |
isFullTextHit | |
isPrint | |
link | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Nb9QwELWgHOBCy5dYWtAggUQPWRI7TmJu222rglQuu5V6i2zHEaU0qTbJAf4H_5cZJ-mygFCPia2xbI898zSeN4y9oWCVUNIGxiUIUCKdBibVJiiEKBJVSmNLAoqnn5OTs_jTuTxf0-T8GcHn0Xttm-lXe3E15Zb8eXmX3eMpng1yg-aL8dYVQvniocQ4FiipxpDkvySQIbLNpiFae5ebbyN_MzbH233VosZzFNIbk8tp15qp_fE3g-Mt5rHDHg4-J8x6JXnE7rjqMbs_H0u9PWE_fRruNaFcmLX1FRA6bYBST6BP3gddFUCOqed1RmHzbx1RLKDhg3ens-VysQ-HutVwoKvLD_DR515CXcKiW62ochN1JNEN1BUcDbV34JCez7ff-5EGkTDypDxlZ8dHy_lJMNRrCHQssjbgzmTSRZnD-9NpWVrcB1OWRhkjudOhlFaoMI4iHRb4VaQEXrJMOTSSlodGPGNbVV255wws-oWiUEZohxAulYYLrVwRmoTuIBFP2Ftcz3w4b03uQ-k8yv1PXOR8WOQJ2x83Ob_u6Tv-03dv1IK1YFS1JE4E4uQJe33TjNtDcRVdubrzfahWBiLNCUs3tOdmTCLx3mypLr54Mm8qTI-w_MUtJ7TLHnBKwAjxfuN7bKtdde4lukWteeXPwy920ArJ |
link.rule.ids | 230,315,783,787,888,2772,27088,27936,27937,57066,57116 |
linkProvider | American Chemical Society |
linkToHtml | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwEB6VciiX8hYLBYwEEj1kSeI4ibkt21Zb6PbAbqXeIttxRGmbVJvkAP-D_8uMN9llEUJwjGON3575NJ5vAF6Ts4pLYTxtYwQogUo8nSjt5ZznsSyENgUBxelpPDmLPp6L8y0I-lgY7ESNkmrnxF-zCwTvqOyrubgehobMenELbosE9SVZQ-NZf_lyLl0OUSIe86SQvWfyTxJIH5l6Ux-tjczNJ5K_6Jyju_B51Vv31ORy2DZ6aL7_RuT4X8O5B7udBcpGyy1zH7Zs-QB2xn3it4fwwwXl3hDmZaOmumaEVWtGgShsGcrPVJkzMlMdyzMKG1-1RLiAapC9nY7m89k-O1CNYh9UefmeHbtITFYVbNYuFpTHiSqS6JpVJTvsMvGwA3pM33xbttSJZD1ryiM4Ozqcjydel73BUxFPGy-0OhU2SC3eplaJwuBy6KLQUmsRWuULYbj0oyBQfo5feUJQJk2lRZVpQl_zx7BdVqV9AsyglchzqbmyCOgSoUOupM19HdONxKMBvMH5zLrTV2fOsR4GmSvESc66SR7Afr_W2c2SzOMvdff6zbAWHOKOi2KOqHkAr1a_cXnIy6JKW7WuDmXOQNw5gGRjE63aJErvzT_lxRdH7U1p6hGkP_3HAb2Encl8epKdHJ9-egZ3QgrN8PHmC_dgu1m09jkaTI1-4Y7IT4C6Ey4 |
linkToPdf | http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3db9MwED-NIcFexrdWNsBIILGHlCSuk5i30q7agE1I7dDeIttxtA-WVE3yAP8H_y93btJRhBA8xrHO37776Xy_A3hFziouhfG0jRCgBCr2dKy0l3GeRTIX2uQEFI9PosPTwYczcbYBoouFwU5UKKlyTnw61fMsbxkGgrdUfmkurvuhIdNe3ILbIg6cd3Y4mnYXMOfS5REl8jFPCtl5J_8kgXSSqdZ10o2huf5M8he9M7kHX1Y9ds9NrvpNrfvm-29kjv89pPuw3VqibLjcOg9gwxYP4e6oSwD3CH644Nw5YV82rMtrRpi1YhSQwpYh_UwVGSNz1bE9o7DR14aIF1AdsjfHw9lsus_GqlbsvSqu3rEjF5HJypxNm8WC8jlRRRJdsbJgB21GHjamR_X1t2VLrUjWsac8htPJwWx06LVZHDw14EnthVYnwgaJxVvVKpEbXBKd51pqLUKrfCEMl_4gCJSf4VcWE6RJEmlRdZrQ1_wJbBZlYXeAGbQWeSY1VxaBXSx0yJW0ma8jupn4oAevcT7T9hRWqXOwh0HqCnGS03aSe7DfrXc6X5J6_KXuXrchbgSHMVpyEUf03IOXq9-4PORtUYUtG1eHMmgg_uxBvLaRVm0Stff6n-Li3FF8U7p6BOtP_3FAL-DO5_Ek_XR08nEXtkKK0PDxAgz3YLNeNPYZ2k21fu5OyU8rnBWo |
openUrl | ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Multipolar+Atom+Types+from+Theory+and+Statistical+Clustering+%28MATTS%29+Data+Bank%3A+Impact+of+Surrounding+Atoms+on+Electron+Density+from+Cluster+Analysis&rft.jtitle=Journal+of+chemical+information+and+modeling&rft.au=Rybicka%2C+Paulina+Maria&rft.au=Kulik%2C+Marta&rft.au=Chodkiewicz%2C+Micha%C5%82+Leszek&rft.au=Dominiak%2C+Paulina+Maria&rft.date=2022-08-22&rft.eissn=1549-960X&rft.volume=62&rft.issue=16&rft.spage=3766&rft.epage=3783&rft_id=info:doi/10.1021%2Facs.jcim.2c00145&rft.externalDBID=NO_FULL_TEXT |
thumbnail_l | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1549-9596&client=summon |
thumbnail_m | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1549-9596&client=summon |
thumbnail_s | http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1549-9596&client=summon |