
Metadata

Name
DESS66 and DESS66x8
Repository
ZENODO
Identifier
doi:10.5281/zenodo.5676284
Description
DESRES Data Sets (DESS66 and DESS66x8)
=========================================
Please see the original paper at https://doi.org/10.1038/s41597-021-00833-x for
more information about this dataset.

This package contains two datasets described by Donchev et al. [1], DESS66 and
DESS66x8. They are presented as CSVs (.../DESS66.csv and .../DESS66x8.csv) and
.mol files (.../geometries/<system_id>/DESS66[x8]_<geom_id>.mol). Also included
for each dataset is a metadata file, DESS66[x8]_meta.csv, which contains a set
of long-form column descriptions replicating those in [1], as well as data
types and units (when applicable) for each column.
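
The geometry path pattern above can be turned into a small helper. This is a
minimal sketch, not part of the package: the function name `mol_path` and the
`root` argument (standing in for the elided `.../` prefix) are hypothetical,
and it assumes `system_id` and `geom_id` are substituted verbatim into the
pattern.

```python
from pathlib import Path


def mol_path(root, dataset, system_id, geom_id):
    """Build the path to one .mol geometry file.

    Follows the pattern geometries/<system_id>/DESS66[x8]_<geom_id>.mol.
    `root` is a hypothetical directory standing in for the elided prefix.
    """
    if dataset not in ("DESS66", "DESS66x8"):
        raise ValueError(f"unknown dataset: {dataset!r}")
    return Path(root) / "geometries" / str(system_id) / f"{dataset}_{geom_id}.mol"
```

For example, `mol_path("data", "DESS66x8", 12, 345)` points at
`data/geometries/12/DESS66x8_345.mol`; the identifiers here are made up for
illustration.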

Manifest
--------
- DESS66.csv       : S66 geometries computed in the same style as DES370K,
                     containing interaction energies calculated using CCSD(T),
                     MP2, HF, and SAPT0, as well as dimer geometries, and
                     silver-standard [3] reference values.

- DESS66_meta.csv  : Long-form descriptions of the columns in DESS66, as well
                     as datatypes and units (when applicable) for each column.

- DESS66x8.csv       : S66x8 geometries computed in the same style as DES370K,
                       containing interaction energies calculated using
                       CCSD(T), MP2, HF, and SAPT0, as well as dimer
                       geometries, and bronze-standard [3] reference values.

- DESS66x8_meta.csv  : Long-form descriptions of the columns in DESS66x8, as
                       well as datatypes and units (when applicable) for each
                       column.

- LICENSE.txt      : License for using and redistributing the datasets
                     provided.

- README.md        : This file.
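
The metadata files listed above can be inspected without knowing their exact
layout in advance. The sketch below uses only Python's standard-library `csv`
module and assumes that each `*_meta.csv` file has a header row and one row per
described column; the function name `describe_columns` is hypothetical, and no
particular header names are assumed.

```python
import csv


def describe_columns(meta_path):
    """Return one 'key=value; key=value' line per row of a *_meta.csv file.

    Assumption: the first row of the file is a header row.
    """
    with open(meta_path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        return ["; ".join(f"{k}={v}" for k, v in row.items()) for row in reader]
```

Printing each returned line gives a quick overview of the description, data
type, and unit recorded for every column.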

Loading the Dataset
-------------------
The datasets are presented as CSVs as a compromise between human-readability,
format uniformity, and parsing speed. While an almost uncountable number of
packages exist to read CSV files, we recommend using the Python data analysis
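
As a dependency-free illustration of reading one of the CSVs, here is a minimal
sketch built on Python's standard-library `csv` module. It is not the loading
code from the original package: the helper names `load_rows` and `to_number`
are hypothetical, and a dataframe library (for example `pandas.read_csv`) would
be the more convenient choice for analysis.

```python
import csv


def load_rows(csv_path):
    """Read a CSV into a list of dicts, one per row, keyed by header name."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def to_number(text):
    """Return an int or float when `text` parses as one, else `text` unchanged."""
    for cast in (int, float):
        try:
            return cast(text)
        except (ValueError, TypeError):
            continue
    return text
```

A call such as `load_rows("DESS66.csv")` returns every value as a string;
`to_number` can then be applied per field, with the `_meta.csv` file serving as
the reference for which columns are numeric.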

References
----------
[1]  A. G. Donchev, A. G. Taube, E. Decolvenaere, C. Hargus, R. T. McGibbon,
     K.-H. Law, B. A. Gregersen, J.-L. Li, K. Palmo, K. Siva, M. Bergdorf,
     J. L. Klepeis, and D. E. Shaw. "Quantum chemical benchmark database of
     dimer interaction energies at a “gold standard” level of accuracy."

[2]  R. T. McGibbon, A. G. Taube, A. G. Donchev, K. Siva, F. Fernandez, C. Hargus,
     K.-H. Law, J. L. Klepeis, and D. E. Shaw. "Improving the accuracy of
     Møller-Plesset perturbation theory with neural networks."

[3]  M. K. Kesharwani, A. Karton, N. Sylvetsky, and J. M. L. Martin. "The S66
     non-covalent interactions benchmark reconsidered using explicitly
     correlated methods near the basis set limit."

License
-------
```
                DESRES DATA SETS LICENSE AGREEMENT

Copyright 2020, D. E. Shaw Research. All rights reserved.

Redistribution and use of electronic structure data released in the DESRES
Data Sets (DES370K, DES15K, DES5M, DESS66, and DESS66x8) with or without
modification, is permitted provided that the following conditions are met:

    * Redistributions of the data must retain the above copyright notice,
    this list of conditions, and the following disclaimer.

    * Redistributions in binary form must reproduce the above copyright
    notice, this list of conditions, and the following disclaimer in the
    documentation and/or other materials provided with the distribution.

Neither the name of D. E. Shaw Research nor the names of its contributors may
be used to endorse or promote products derived from this software without
specific prior written permission.

THIS SOFTWARE AND DATA ARE PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
&quot;AS IS&quot; AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT OWNER OR CONTRIBUTORS BE LIABLE
FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR
SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER
CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY,
OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
OF THIS SOFTWARE AND/OR DATA, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
```
Data or Study Types
multiple
Source Organization
Unknown
Access Conditions
available
Year
2021
Access Hyperlink
https://doi.org/10.5281/zenodo.5676284

Distributions

  • Encoding Format: HTML ; URL: https://doi.org/10.5281/zenodo.5676284