Multipolar Atom Types from Theory and Statistical Clustering (MATTS) Data Bank: Impact of Surrounding Atoms on Electron Density from Cluster Analysis

The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM p...

Full description

Saved in:
Bibliographic Details
Published inJournal of chemical information and modeling Vol. 62; no. 16; pp. 3766 - 3783
Main Authors Rybicka, Paulina Maria, Kulik, Marta, Chodkiewicz, Michał Leszek, Dominiak, Paulina Maria
Format Journal Article
LanguageEnglish
Published Washington American Chemical Society 22.08.2022
Subjects
Online AccessGet full text

Cover

Loading…
Abstract The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM parameters specific for atom types in proteins, nucleic acids, and organic molecules. However, it was not fully understood how the electron density of particular atoms responds to their surroundings and which factors describe the electron density in molecules within the MM. In this work, by applying clustering using descriptors available in the MATTS data bank, that is, topology and multipole parameters, we found the topology features with the biggest impact on the multipole parameters: the element of the central atom, the number of first neighbors, and planarity of the group. The similarities in the spatial distribution of electron density between and within atom type classes revealed distinct and unique atom types. The quality of existing types can be improved by adding better parametrization, definitions, and local coordinate systems. Future development of the MATTS data bank should lead to a wider range of atom types necessary to construct the electron density of any molecule.
AbstractList The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM parameters specific for atom types in proteins, nucleic acids, and organic molecules. However, it was not fully understood how the electron density of particular atoms responds to their surroundings and which factors describe the electron density in molecules within the MM. In this work, by applying clustering using descriptors available in the MATTS data bank, that is, topology and multipole parameters, we found the topology features with the biggest impact on the multipole parameters: the element of the central atom, the number of first neighbors, and planarity of the group. The similarities in the spatial distribution of electron density between and within atom type classes revealed distinct and unique atom types. The quality of existing types can be improved by adding better parametrization, definitions, and local coordinate systems. Future development of the MATTS data bank should lead to a wider range of atom types necessary to construct the electron density of any molecule.
The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate manner than using the spherical approximation. The MATTS (multipolar atom types from theory and statistical clustering) data bank gathers MM parameters specific for atom types in proteins, nucleic acids, and organic molecules. However, it was not fully understood how the electron density of particular atoms responds to their surroundings and which factors describe the electron density in molecules within the MM. In this work, by applying clustering using descriptors available in the MATTS data bank, that is, topology and multipole parameters, we found the topology features with the biggest impact on the multipole parameters: the element of the central atom, the number of first neighbors, and planarity of the group. The similarities in the spatial distribution of electron density between and within atom type classes revealed distinct and unique atom types. The quality of existing types can be improved by adding better parametrization, definitions, and local coordinate systems. Future development of the MATTS data bank should lead to a wider range of atom types necessary to construct the electron density of any molecule.
Author Dominiak, Paulina Maria
Rybicka, Paulina Maria
Chodkiewicz, Michał Leszek
Kulik, Marta
AuthorAffiliation Biological and Chemical Research Centre, Department of Chemistry
AuthorAffiliation_xml – name: Biological and Chemical Research Centre, Department of Chemistry
Author_xml – sequence: 1
  givenname: Paulina Maria
  orcidid: 0000-0003-0087-6208
  surname: Rybicka
  fullname: Rybicka, Paulina Maria
– sequence: 2
  givenname: Marta
  orcidid: 0000-0003-2381-7665
  surname: Kulik
  fullname: Kulik, Marta
  email: m.kulik@uw.edu.pl
– sequence: 3
  givenname: Michał Leszek
  orcidid: 0000-0003-4448-445X
  surname: Chodkiewicz
  fullname: Chodkiewicz, Michał Leszek
– sequence: 4
  givenname: Paulina Maria
  orcidid: 0000-0002-1466-1243
  surname: Dominiak
  fullname: Dominiak, Paulina Maria
  email: pdomin@chem.uw.edu.pl
BookMark eNp1kU9v1DAQxS1URP_AnaMlLkViFzuOs2sOSMu2QKVWHHaRuFmOM2m9OHawHaR8kH7fOt0tEkic_Kz5zbNn3ik6ct4BQq8pmVNS0PdKx_lOm25eaEJoyZ-hE8pLMRMV-XH0pLmojtFpjDtCGBNV8QIdMy5KtmDiBN3fDDaZ3lsV8Cr5Dm_HHiJuwyTvwIcRK9fgTVLJxGS0snhth5ggGHeLz29W2-3mLb5QSeFPyv38gK-6XumEfYs3Qwh-cM0ETtYRe4cvLegUsrgAF00a9y8dLPHKKTtGE1-i562yEV4dzjP0_fPldv11dv3ty9V6dT1TJVumWQH1kgNdAqUUFG91nrVu21rUNS9AEc41E6SkVJEm35oFryhbLgUQWuiC1OwMfdz79kPdQaPBpaCs7IPpVBilV0b-XXHmTt7631KUed-kygbnB4Pgfw0Qk-xM1GCtcuCHKItFXjrlgk_om3_QnR9CHviRqsqKLQqaKbKndPAxBmj_fIYSOWUuc-ZyylweMs8t7_Ytj5Unz__iD6Gfsh4
CitedBy_id crossref_primary_10_1016_j_csbj_2022_10_018
crossref_primary_10_1107_S2053229624005357
crossref_primary_10_3390_cryst14010077
crossref_primary_10_1021_acs_jcim_2c00144
Cites_doi 10.1021/ct6001994
10.1107/S0567739478001886
10.1107/S0021889808033384
10.1107/S0108767312008197
10.1107/S1600576716011274
10.1107/S2053273319015304
10.1021/cg200971f
10.1021/j100016a071
10.1002/ijch.197700020
10.1107/S0108767310049731
10.1109/BigDataService.2018.00042
10.1080/0889311X.2020.1853712
10.1107/S0021889803017825
10.1145/3068335
10.1107/s0108767311042176
10.1021/acs.jctc.8b00640
10.1107/S1399004714002351
10.1002/jcc.20084
10.1107/S0907444909009433
10.1107/S2053273319000482
10.1107/s0907444906013072
10.1021/cg300129z
10.1021/acs.jcim.1c00227
10.1107/S0108767302010991
10.1016/j.bpj.2014.12.020
10.1021/ar970352j
10.1107/S0021889810042482
10.1107/S160057672100580X
10.1002/anie.200353596
10.1107/S2059798322005836
10.1002/jcc.22996
10.1107/S0108768191003932
10.1093/oso/9780195098235.001.0001
10.1107/S2052520616003954
10.1107/S2052519213002285
10.1145/2064676.2064695
10.1107/S1600576719014092
10.1145/276698.276876
10.1007/978-3-642-37431-9_44
10.1021/jp0379796
10.1021/ct4011129
10.1002/jcc.20035
10.1021/acs.jcim.2c00144
ContentType Journal Article
Copyright 2022 The Authors. Published by American Chemical Society
Copyright American Chemical Society Aug 22, 2022
2022 The Authors. Published by American Chemical Society 2022 The Authors
Copyright_xml – notice: 2022 The Authors. Published by American Chemical Society
– notice: Copyright American Chemical Society Aug 22, 2022
– notice: 2022 The Authors. Published by American Chemical Society 2022 The Authors
DBID AAYXX
CITATION
7SC
7SR
7U5
8BQ
8FD
JG9
JQ2
L7M
L~C
L~D
7X8
5PM
DOI 10.1021/acs.jcim.2c00145
DatabaseName CrossRef
Computer and Information Systems Abstracts
Engineered Materials Abstracts
Solid State and Superconductivity Abstracts
METADEX
Technology Research Database
Materials Research Database
ProQuest Computer Science Collection
Advanced Technologies Database with Aerospace
Computer and Information Systems Abstracts – Academic
Computer and Information Systems Abstracts Professional
MEDLINE - Academic
PubMed Central (Full Participant titles)
DatabaseTitle CrossRef
Materials Research Database
Engineered Materials Abstracts
Technology Research Database
Computer and Information Systems Abstracts – Academic
ProQuest Computer Science Collection
Computer and Information Systems Abstracts
Solid State and Superconductivity Abstracts
Advanced Technologies Database with Aerospace
METADEX
Computer and Information Systems Abstracts Professional
MEDLINE - Academic
DatabaseTitleList
Materials Research Database

DeliveryMethod fulltext_linktorsrc
Discipline Chemistry
EISSN 1549-960X
EndPage 3783
ExternalDocumentID 10_1021_acs_jcim_2c00145
c473759231
GrantInformation_xml – fundername: ;
  grantid: UMO-2017/27/B/ST4/02721
GroupedDBID -
4.4
55A
5GY
5VS
7~N
AABXI
ABFLS
ABFRP
ABMVS
ABUCX
ACGFS
ACIWK
ACNCT
ACS
ADHLV
AEESW
AENEX
AFEFF
AHGAQ
ALMA_UNASSIGNED_HOLDINGS
AQSVZ
D0L
DU5
EBS
ED
F5P
GGK
GNL
IH9
JG
P2P
PQEST
PQQKQ
RNS
ROL
UI2
VF5
VG9
W1F
X
---
-~X
AAYXX
ABJNI
ABQRX
CITATION
CUPRZ
ED~
JG~
7SC
7SR
7U5
8BQ
8FD
JG9
JQ2
L7M
L~C
L~D
7X8
5PM
ID FETCH-LOGICAL-a438t-2eb85e18e111ea5fc959bffb9bb52ea055c390411a0da05d75613889e012c20b3
IEDL.DBID ACS
ISSN 1549-9596
IngestDate Tue Sep 17 21:17:36 EDT 2024
Sat Aug 17 02:21:43 EDT 2024
Thu Oct 10 16:40:26 EDT 2024
Fri Aug 23 01:40:32 EDT 2024
Wed Aug 24 06:44:50 EDT 2022
IsDoiOpenAccess true
IsOpenAccess true
IsPeerReviewed true
IsScholarly true
Issue 16
Language English
License Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-a438t-2eb85e18e111ea5fc959bffb9bb52ea055c390411a0da05d75613889e012c20b3
Notes ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 23
ORCID 0000-0003-0087-6208
0000-0003-2381-7665
0000-0002-1466-1243
0000-0003-4448-445X
OpenAccessLink https://pubmed.ncbi.nlm.nih.gov/PMC9400106
PMID 35943739
PQID 2706463721
PQPubID 28739
PageCount 18
ParticipantIDs pubmedcentral_primary_oai_pubmedcentral_nih_gov_9400106
proquest_miscellaneous_2700315956
proquest_journals_2706463721
crossref_primary_10_1021_acs_jcim_2c00145
acs_journals_10_1021_acs_jcim_2c00145
PublicationCentury 2000
PublicationDate 2022-08-22
PublicationDateYYYYMMDD 2022-08-22
PublicationDate_xml – month: 08
  year: 2022
  text: 2022-08-22
  day: 22
PublicationDecade 2020
PublicationPlace Washington
PublicationPlace_xml – name: Washington
PublicationTitle Journal of chemical information and modeling
PublicationTitleAlternate J. Chem. Inf. Model
PublicationYear 2022
Publisher American Chemical Society
Publisher_xml – name: American Chemical Society
References ref9/cit9
ref45/cit45
ref6/cit6
ref36/cit36
ref3/cit3
ref27/cit27
Coppens P. (ref1/cit1) 1997
ref11/cit11
ref25/cit25
ref16/cit16
ref29/cit29
ref32/cit32
ref23/cit23
ref39/cit39
ref14/cit14
ref8/cit8
ref5/cit5
ref31/cit31
ref2/cit2
ref43/cit43
ref34/cit34
ref37/cit37
ref28/cit28
ref40/cit40
ref20/cit20
ref17/cit17
ref10/cit10
ref26/cit26
ref35/cit35
ref19/cit19
ref21/cit21
ref12/cit12
ref15/cit15
ref42/cit42
ref46/cit46
ref41/cit41
ref22/cit22
ref13/cit13
ref33/cit33
ref4/cit4
ref30/cit30
ref47/cit47
ref24/cit24
ref38/cit38
ref44/cit44
ref7/cit7
References_xml – ident: ref9/cit9
  doi: 10.1021/ct6001994
– ident: ref27/cit27
  doi: 10.1107/S0567739478001886
– ident: ref33/cit33
  doi: 10.1107/S0021889808033384
– ident: ref6/cit6
  doi: 10.1107/S0108767312008197
– ident: ref22/cit22
  doi: 10.1107/S1600576716011274
– ident: ref20/cit20
  doi: 10.1107/S2053273319015304
– ident: ref26/cit26
  doi: 10.1021/cg200971f
– ident: ref5/cit5
  doi: 10.1021/j100016a071
– ident: ref36/cit36
  doi: 10.1002/ijch.197700020
– ident: ref12/cit12
  doi: 10.1107/S0108767310049731
– ident: ref40/cit40
  doi: 10.1109/BigDataService.2018.00042
– ident: ref35/cit35
  doi: 10.1080/0889311X.2020.1853712
– ident: ref32/cit32
  doi: 10.1107/S0021889803017825
– ident: ref39/cit39
  doi: 10.1145/3068335
– ident: ref10/cit10
  doi: 10.1107/s0108767311042176
– ident: ref45/cit45
– ident: ref25/cit25
  doi: 10.1021/acs.jctc.8b00640
– ident: ref19/cit19
  doi: 10.1107/S1399004714002351
– ident: ref41/cit41
  doi: 10.1002/jcc.20084
– ident: ref17/cit17
  doi: 10.1107/S0907444909009433
– ident: ref31/cit31
  doi: 10.1107/S2053273319000482
– ident: ref15/cit15
  doi: 10.1107/s0907444906013072
– ident: ref13/cit13
  doi: 10.1021/cg300129z
– ident: ref46/cit46
  doi: 10.1021/acs.jcim.1c00227
– ident: ref7/cit7
  doi: 10.1107/S0108767302010991
– ident: ref16/cit16
  doi: 10.1016/j.bpj.2014.12.020
– ident: ref14/cit14
  doi: 10.1021/ar970352j
– ident: ref43/cit43
  doi: 10.1107/S0021889810042482
– ident: ref21/cit21
  doi: 10.1107/S160057672100580X
– ident: ref3/cit3
  doi: 10.1002/anie.200353596
– ident: ref47/cit47
  doi: 10.1107/S2059798322005836
– ident: ref34/cit34
  doi: 10.1002/jcc.22996
– ident: ref2/cit2
  doi: 10.1107/S0108768191003932
– volume-title: X-Ray Charge Densities and Chemical Bonding
  year: 1997
  ident: ref1/cit1
  doi: 10.1093/oso/9780195098235.001.0001
  contributor:
    fullname: Coppens P.
– ident: ref30/cit30
  doi: 10.1107/S2052520616003954
– ident: ref4/cit4
  doi: 10.1107/S2052519213002285
– ident: ref28/cit28
  doi: 10.1145/2064676.2064695
– ident: ref44/cit44
  doi: 10.1107/S1600576719014092
– ident: ref29/cit29
  doi: 10.1145/276698.276876
– ident: ref37/cit37
  doi: 10.1007/978-3-642-37431-9_44
– ident: ref8/cit8
  doi: 10.1021/jp0379796
– ident: ref24/cit24
  doi: 10.1021/ct4011129
– ident: ref42/cit42
– ident: ref23/cit23
  doi: 10.1002/jcc.20035
– ident: ref11/cit11
  doi: 10.1021/acs.jcim.2c00144
– ident: ref38/cit38
SSID ssj0033962
Score 2.457027
Snippet The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate...
The multipole model (MM) uses an aspherical approach to describe electron density and can be used to interpret data from X-ray diffraction in a more accurate...
SourceID pubmedcentral
proquest
crossref
acs
SourceType Open Access Repository
Aggregation Database
Publisher
StartPage 3766
SubjectTerms Cluster analysis
Clustering
Computational Chemistry
Coordinates
Data banks
Electron density
Electrons
Multipoles
Nucleic acids
Organic chemistry
Parameterization
Parameters
Spatial distribution
Topology
Title Multipolar Atom Types from Theory and Statistical Clustering (MATTS) Data Bank: Impact of Surrounding Atoms on Electron Density from Cluster Analysis
URI http://dx.doi.org/10.1021/acs.jcim.2c00145
https://www.proquest.com/docview/2706463721
https://search.proquest.com/docview/2700315956
https://pubmed.ncbi.nlm.nih.gov/PMC9400106
Volume 62
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
link http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwjV1Nb9QwELWgHOBCy5dYWtAggUQPWRI7TmJu222rglQuu5V6i2zHEaU0qTbJAf4H_5cZJ-mygFCPia2xbI898zSeN4y9oWCVUNIGxiUIUCKdBibVJiiEKBJVSmNLAoqnn5OTs_jTuTxf0-T8GcHn0Xttm-lXe3E15Zb8eXmX3eMpng1yg-aL8dYVQvniocQ4FiipxpDkvySQIbLNpiFae5ebbyN_MzbH233VosZzFNIbk8tp15qp_fE3g-Mt5rHDHg4-J8x6JXnE7rjqMbs_H0u9PWE_fRruNaFcmLX1FRA6bYBST6BP3gddFUCOqed1RmHzbx1RLKDhg3ens-VysQ-HutVwoKvLD_DR515CXcKiW62ochN1JNEN1BUcDbV34JCez7ff-5EGkTDypDxlZ8dHy_lJMNRrCHQssjbgzmTSRZnD-9NpWVrcB1OWRhkjudOhlFaoMI4iHRb4VaQEXrJMOTSSlodGPGNbVV255wws-oWiUEZohxAulYYLrVwRmoTuIBFP2Ftcz3w4b03uQ-k8yv1PXOR8WOQJ2x83Ob_u6Tv-03dv1IK1YFS1JE4E4uQJe33TjNtDcRVdubrzfahWBiLNCUs3tOdmTCLx3mypLr54Mm8qTI-w_MUtJ7TLHnBKwAjxfuN7bKtdde4lukWteeXPwy920ArJ
link.rule.ids 230,315,783,787,888,2772,27088,27936,27937,57066,57116
linkProvider American Chemical Society
linkToHtml http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1Lb9QwEB6VciiX8hYLBYwEEj1kSeI4ibkt21Zb6PbAbqXeIttxRGmbVJvkAP-D_8uMN9llEUJwjGON3575NJ5vAF6Ts4pLYTxtYwQogUo8nSjt5ZznsSyENgUBxelpPDmLPp6L8y0I-lgY7ESNkmrnxF-zCwTvqOyrubgehobMenELbosE9SVZQ-NZf_lyLl0OUSIe86SQvWfyTxJIH5l6Ux-tjczNJ5K_6Jyju_B51Vv31ORy2DZ6aL7_RuT4X8O5B7udBcpGyy1zH7Zs-QB2xn3it4fwwwXl3hDmZaOmumaEVWtGgShsGcrPVJkzMlMdyzMKG1-1RLiAapC9nY7m89k-O1CNYh9UefmeHbtITFYVbNYuFpTHiSqS6JpVJTvsMvGwA3pM33xbttSJZD1ryiM4Ozqcjydel73BUxFPGy-0OhU2SC3eplaJwuBy6KLQUmsRWuULYbj0oyBQfo5feUJQJk2lRZVpQl_zx7BdVqV9AsyglchzqbmyCOgSoUOupM19HdONxKMBvMH5zLrTV2fOsR4GmSvESc66SR7Afr_W2c2SzOMvdff6zbAWHOKOi2KOqHkAr1a_cXnIy6JKW7WuDmXOQNw5gGRjE63aJErvzT_lxRdH7U1p6hGkP_3HAb2Encl8epKdHJ9-egZ3QgrN8PHmC_dgu1m09jkaTI1-4Y7IT4C6Ey4
linkToPdf http://utb.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3db9MwED-NIcFexrdWNsBIILGHlCSuk5i30q7agE1I7dDeIttxtA-WVE3yAP8H_y93btJRhBA8xrHO37776Xy_A3hFziouhfG0jRCgBCr2dKy0l3GeRTIX2uQEFI9PosPTwYczcbYBoouFwU5UKKlyTnw61fMsbxkGgrdUfmkurvuhIdNe3ILbIg6cd3Y4mnYXMOfS5REl8jFPCtl5J_8kgXSSqdZ10o2huf5M8he9M7kHX1Y9ds9NrvpNrfvm-29kjv89pPuw3VqibLjcOg9gwxYP4e6oSwD3CH644Nw5YV82rMtrRpi1YhSQwpYh_UwVGSNz1bE9o7DR14aIF1AdsjfHw9lsus_GqlbsvSqu3rEjF5HJypxNm8WC8jlRRRJdsbJgB21GHjamR_X1t2VLrUjWsac8htPJwWx06LVZHDw14EnthVYnwgaJxVvVKpEbXBKd51pqLUKrfCEMl_4gCJSf4VcWE6RJEmlRdZrQ1_wJbBZlYXeAGbQWeSY1VxaBXSx0yJW0ma8jupn4oAevcT7T9hRWqXOwh0HqCnGS03aSe7DfrXc6X5J6_KXuXrchbgSHMVpyEUf03IOXq9-4PORtUYUtG1eHMmgg_uxBvLaRVm0Stff6n-Li3FF8U7p6BOtP_3FAL-DO5_Ek_XR08nEXtkKK0PDxAgz3YLNeNPYZ2k21fu5OyU8rnBWo
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Multipolar+Atom+Types+from+Theory+and+Statistical+Clustering+%28MATTS%29+Data+Bank%3A+Impact+of+Surrounding+Atoms+on+Electron+Density+from+Cluster+Analysis&rft.jtitle=Journal+of+chemical+information+and+modeling&rft.au=Rybicka%2C+Paulina+Maria&rft.au=Kulik%2C+Marta&rft.au=Chodkiewicz%2C+Micha%C5%82+Leszek&rft.au=Dominiak%2C+Paulina+Maria&rft.date=2022-08-22&rft.eissn=1549-960X&rft.volume=62&rft.issue=16&rft.spage=3766&rft.epage=3783&rft_id=info:doi/10.1021%2Facs.jcim.2c00145&rft.externalDBID=NO_FULL_TEXT
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1549-9596&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1549-9596&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1549-9596&client=summon