
Thorsten Gressling - Data Science in Chemistry (de Gruyter Textbook)


  • Book:
    Data Science in Chemistry (de Gruyter Textbook)
  • Author:
    Thorsten Gressling
  • Publisher:
    De Gruyter
  • Genre:
    Computer
  • Year:
    2020


The ever-growing wealth of information has led to the emergence of a fourth paradigm of science. This new field of activity - data science - includes computer science, mathematics and a given specialist domain. This book focuses on chemistry, explaining how to use data science for deep insights and take chemical research and engineering to the next level. It covers modern aspects like Big Data, Artificial Intelligence and Quantum computing.


De Gruyter Textbook

EBSCOhost - printed on 2/14/2022 11:29 AM via . All use subject to https://www.ebsco.com/terms-of-use


ISBN 9783110629392

e-ISBN (PDF) 9783110629453

e-ISBN (EPUB) 9783110630534

Bibliographic information published by the Deutsche Nationalbibliothek

The Deutsche Nationalbibliothek lists this publication in the Deutsche Nationalbibliografie; detailed bibliographic data are available on the Internet at http://dnb.dnb.de.

© 2021 Walter de Gruyter GmbH, Berlin/Boston

Data science: introduction

Data science [a] is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge and insights from structured and unstructured data. Data science is related to data mining and big data.

The keyword in Data Science is not Data, it is Science [a]

Figure 1.1: The major tasks and mathematical setup of a supervised machine learning workflow [3] (Hachmann).

The first problem is that most great data scientists don't sufficiently understand business and most great business leaders don't sufficiently understand data science. [a]

Relation of science and digital research

Since the beginning of the 1980s, the term data science has appeared in various contexts, but was never well defined in the scientific community. [b]

Definitions

Data science is performed to analyze, understand, and extract actual phenomena in data. The challenge is to identify unique patterns and variables.

The goal is to:

  • understand

  • extract insights

To do this, data science is a multidisciplinary field that brings together concepts from:

  • computer science

  • statistics/machine learning

  • data analysis

  • domain knowledge

Bringing together these fields of expertise in data science is also a concept of unification.

Discussion

Leek [b] discusses the relation of structure and the desired results:

  • It is easy to discover structure or networks in a data set. There will always be correlations for a thousand reasons if you collect enough data.

  • Understanding whether these correlations matter for specific, interesting questions is much harder.

  • Often the structure you found on the first pass is due to phenomena (measurement error, artifacts, and data processing) that do not answer an interesting question.
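Leek's first point can be illustrated with a small simulation. The sketch below is not from the book: it draws independent random variables (so no real relationship exists between them) and counts how many pairs still show a strong Pearson correlation by chance. The variable count, sample size, threshold, and seed are arbitrary assumptions chosen for illustration.

```python
import itertools
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def count_strong_pairs(n_variables=40, n_samples=10, threshold=0.6, seed=0):
    """Count pairs of independent random variables with |r| above threshold."""
    rng = random.Random(seed)
    # Every column is pure noise, generated independently of all others.
    columns = [
        [rng.gauss(0.0, 1.0) for _ in range(n_samples)]
        for _ in range(n_variables)
    ]
    pairs = list(itertools.combinations(range(n_variables), 2))
    strong = [
        (i, j)
        for i, j in pairs
        if abs(pearson(columns[i], columns[j])) > threshold
    ]
    return len(strong), len(pairs)


strong, total = count_strong_pairs()
print(f"{strong} of {total} unrelated pairs exceed |r| > 0.6")
```

With few samples and many variables, some pairs clear the threshold even though the data contain no structure at all, which is why a correlation found on the first pass needs to be checked against the actual question.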

The two paradigms of data research
  • Hypothesis driven: Given a problem, what kind of data do we need to help solve it?

  • Data driven: Given some data, what interesting problems can be solved with it?

The heart of data science is to always ask questions:

  1. What can we learn from this data?

  2. What actions can we take, once we find whatever it is we are looking for?

Main types of problems

Two types of problems arise repeatedly in data science; they are discussed in detail in Chapters 33 to 37. As a rule of thumb, these are:

  • Classification: Assigning something to a discrete set of possibilities

  • Regression: Predicting a numerical value
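The difference between the two problem types can be sketched in a few lines. The example below is a minimal illustration, not the book's method: it uses a 1-nearest-neighbor rule for classification and an ordinary least-squares line for regression. The function names and the toy numbers are hypothetical and carry no chemical meaning.

```python
def nearest_neighbor_label(train, x):
    """Classification: return the label of the training point closest to x."""
    return min(train, key=lambda item: abs(item[0] - x))[1]


def fit_line(xs, ys):
    """Regression: least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var_x
    return slope, mean_y - slope * mean_x


# Toy data (hypothetical): one numeric feature, one discrete label.
labeled = [(1.0, "low"), (1.5, "low"), (8.0, "high"), (9.0, "high")]
predicted_label = nearest_neighbor_label(labeled, 2.0)  # a discrete class

# Toy data (hypothetical): one numeric feature, one numeric target.
slope, intercept = fit_line([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
predicted_value = slope * 4.0 + intercept  # a number

print(predicted_label, predicted_value)
```

The classifier can only answer with one of the labels it has seen, while the regression model returns a value on a continuous scale.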

References

Data science (Wikipedia)

Leek, J. Simply Statistics

Haghighatlari, M.; Hachmann, J. Advances of Machine Learning in Molecular Modeling and Simulation https://www.researchgate.net/publication/330845218_Advances_of_Machine_Learning_in_Molecular_Modeling_and_Simulation.

Boyle, D. Data Science vs. the C Suite

Che-Workshop. Framing the Role of Big Data and Modern Data Science in Chemistry

Data science: the fourth paradigm of science
Statistics + computer science + domain knowledge = data science

Data science interrogates (scientific) data at scale. Additional success can be achieved when data science paradigms integrate tools with domain-specific knowledge and expertise. [a]

  • Transform chemical sciences and engineering

  • Knowledge discovery at scale

  • Interdisciplinary: statistics, computer science, applied math, AI, and domain tools

The fourth paradigm: knowledge discovery from data (KDD)

The Fourth Paradigm: Data-Intensive Scientific Discovery [a] is a 2009 anthology of essays on data science based on data-intensive computing. It names four paradigms of science:

  • Theory

  • Experiment

  • Simulation

  • Data science

The increasing use of data is bringing a paradigm shift to the nature of science.


[a] (Draxl, Scheffler).

New technologies and approaches are generating large, diverse data sets. Data science offers methods and tools that are needed to integrate, analyze, and manage these data sets. However, data science applications in the chemical sciences and engineering communities have been relatively limited and many opportunities for advancing the fields have gone unexplored. [a]

Data science life cycle

In general, the life cycle has phases of exploration and production, as shown in Figure 2.2.


Figure 2.2: Data science life cycle (Gressling).

Challenges

Although data science is a rapidly growing field, some of the building blocks still experience difficulties:

Big data
  • Data sets are so large that standard approaches and tools for storage, analysis, and sharing fail.

  • Data may be too large to fit in a 10 TB memory.
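One common response when data do not fit in memory is to process them as a stream and keep only small running summaries. The sketch below is an illustration under that assumption, not a technique taken from the book: it uses Welford's online algorithm to compute mean and sample variance in a single pass. The generator `synthetic_stream` is a hypothetical stand-in for reading records from disk or a network source.

```python
def running_mean_variance(values):
    """Single-pass mean and sample variance (Welford's online algorithm).

    Only three numbers are kept in memory, however long the stream is.
    """
    count = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations from the current mean
    for x in values:
        count += 1
        delta = x - mean
        mean += delta / count
        m2 += delta * (x - mean)
    variance = m2 / (count - 1) if count > 1 else 0.0
    return count, mean, variance


def synthetic_stream(n):
    """Hypothetical data source: yields values one at a time, never a full list."""
    for i in range(n):
        yield float(i % 10)


count, mean, variance = running_mean_variance(synthetic_stream(1_000_000))
print(count, mean, variance)
```

Because the generator yields one value at a time, the full data set is never materialized; the same pattern applies to files read line by line or in chunks.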

Data mining
  • Extracting and understanding human-relevant insight and predictions

  • "Knowledge is a process of piling up facts; wisdom lies in their simplification." (M. H. Fischer, physician)

Machine learning/statistical learning
  • Algorithms that learn without explicit human instruction

  • Unlearning

  • Transfer learning

Artificial intelligence
  • Machine behavior that mimics human cognition

  • Turing test: intelligent behavior equivalent to, or indistinguishable from, that of a human (Alan Turing)

So, data science brings statistical methods to new scales and prioritizes approximation and uncertainty. It brings new challenges to IT with demand for computing power, memory, and hardware.
