LitArk » Books » Computer

Robert Slane - Big Data Essentials

Here you can read online Robert Slane - Big Data Essentials full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 2018, genre: Computer. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Book:
Big Data Essentials
Author:
Robert Slane
Genre:
Books / Computer
Year:
2018
Rating:
4 / 5
Favourites:
Add to favourites
Your mark:
- 80
- 1
- 2
- 3
- 4
- 5

Description
Author's other books
Similar books

Big Data Essentials: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Big Data Essentials" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

This books fills up up the need for an easy and natural details about essential Big Details technological advancement. Written in a clear and simple language free from language and concept, the following information provides an intuition for Big Details from organization as well as technological opinions. This details are designed to provide people with the intuition behind this modifying area, along with a strong toolset of the major big computer technological advancement such as Hadoop, MapReduce, Spark Running, and NoSql databases. A complete study of making a web log analyzer is involved. The details also contains two primers on Thinking managing and Details Discovery. It also contains two books on establishing Hadoop and Spark. The details contains caselets from real-world encounters learners across a variety of academic careers such as organization, technology, analysis, technological advancement, and others interested in the idea of utilizing Big Details for new concepts and concepts from information, can use this as a book.

Robert Slane: author's other books

Who wrote Big Data Essentials? Find out the surname, the name of the author of the book and a list of all author's works by series.

Big Data Essentials — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Big Data Essentials" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Contents Preface a Introduction a a a a a a s s s - photo 1

Contents

Preface

Introduction

s Introduction u s s a s s Sensors s s s t Conclusion - photo 3

Introduction

Sensors

Conclusion

e Introduction e n Netflix Ebay VMWare y TicketMaster - photo 4

Introduction

Netflix

Ebay

VMWare

TicketMaster

Paypal

CERN

Conclusion

p Introduction k m y S S s YARN Conclusion s e Introduction - photo 5

p Introduction

YARN

Conclusion

e Introduction

Conclusion

s Introduction

HBase

Cassandra

Conclusion

k Introduction

)

g MLlib

SparkR

SparkSQL

p Conclusion

Wholeness

Producers

Consumers

Broker

Topic

Distribution

Guarantees

a Conclusion

References

Introduction

Conclusion

y Introduction

Requirements

HDFS

MongoDB

Conclusion

)

Chapter 1 Wholeness of Big Data

Introduction

Big Data is an all-inclusive term that refers to extremely large, very fast, diverse, and complex data that cannot be managed with traditional data management tools. Ideally, Big Data would harness all kinds of data, and deliver the right information, to the right person, in the right quantity, at the right time, to help make the right decision. Big Data can be managed by developing infinitely scalable, totally flexible, and evolutionary data architectures, coupled with the use of extremely cost-effective computing components. The infinite potential knowledge embedded within this cosmic computer would help connect everything to the Unified Field of all the laws of nature.

This book will provide a complete overview of Big Data for the executive and the data specialist. This chapter will cover the key challenges and benefits of Big Data, and the essential tools and technologies now available for organizing and manipulating Big Data.

Understanding Big Data

Big Data can be examined on two levels. On a fundamental level, it is data that can be analyzed and utilized for the benefit of the business. On another level, it is a special kind of data that poses unique challenges. This is the level that this book will focus on.

Figure 1- 1 Big Data Context At the level of business data generated by - photo 6

Figure 1- 1: Big Data Context

At the level of business, data generated by business operations, can be analyzed to generate insights that can help the business make better decisions. This makes the business grow bigger, and generate even more data, and the cycle continues. This is represented by the blue cycle on the top-right of Figure 1.1. This aspect is discussed in Chapter 10, a primer on Data Analytics.

On another level, Big Data is different from traditional data in every way: space, time, and function. The quantity of Big Data is 1,000 times more than that of traditional data. The speed of data generation and transmission is 1,000 times faster. The forms and functions of Big Data are much more diverse: from numbers to text, pictures, audio, videos, activity logs, machine data, and more. There are also many more sources of data, from individuals to organizations to governments, using a range of devices from mobile phones to computers to industrial machines. Not all data will be of equal quality and value. This is represented by the red cycle on the bottom left of Figure 1.1. This aspect of Big Data, and its new technologies, is the main focus of this book.

Big Data is mostly unstructured data. Every type of data is structured differently, and will have to be dealt with differently. There are huge opportunities for technology providers to innovate and manage the entire life cycle of Big Data to generate, gather, store, organize, analyze, and visualize this data.

CASELET: IBM Watson: A Big Data system

IBM created the Watson system as a way of pushing the boundaries of Artificial Intelligence and natural language understanding technologies. Watson beat the world champion human players of Jeopardy (quiz style TV show) in Feb 2011. Watson reads up on data about everything on the web including the entire Wikipedia. It digests and absorbs the data based on simple generic rules such as: books have authors; stories have heroes; and drugs treat ailments. A jeopardy clue, received in the form of a cryptic phrase, is broken down into many possible potential sub-clues of the correct answer. Each sub-clue is examined to see the likeliness of its answer being the correct answer for the main problem. Watson calculates the confidence level of each possible answer. If the confidence level reaches more than a threshold level, it decides to offer the answer to the clue. It manages to do all this in a mere 3 seconds.

Watson is now being applied to diagnosing diseases, especially cancer. Watson can read all the new research published in the medical journals to update its knowledge base. It is being used to diagnose the probability of various diseases, by applying factors such as patients current symptoms, health history, genetic history, medication records, and other factors to recommend a particular diagnosis. (Source: Smartest machines on Earth: youtube.com/watch?v=TCOhyaw5bwg)

Figure 12 IBM Watson playing Jeopardy Q1 What kinds of Big Data knowledge - photo 7

Figure 1.2: IBM Watson playing Jeopardy

Q1: What kinds of Big Data knowledge, technologies and skills are required to build a system like Watson? What kind of resources are needed?

Q2: Will doctors be able to compete with Watson in diagnosing diseases

and prescribing medications? Who else could benefit from a system like Watson?

Capturing Big Data

If data were simply growing too large, OR only moving too fast, OR only becoming too diverse, it would be relatively easy. However, when the four Vs (Volume, Velocity, Variety, and Veracity) arrive together in an interactive manner, it creates a perfect storm. While the Volume and Velocity of data drive the major technological concerns and the

costs of managing Big Data, these two Vs are themselves being driven by the 3 rd V, the Variety of forms and functions and sources of data.

Volume of Data

The quantity of data has been relentlessly doubling every 12-18 months. Traditional data is measured in Gigabytes (GB) and Terabytes (TB), but Big Data is measured in Petabytes (PB) and Exabytes (1 Exabyte = 1 Million TB).

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Similar books «Big Data Essentials»

Look at similar books to Big Data Essentials. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.

Raj Pethuru(Editor)

A deep dive into NoSQL databases the use cases and applications

Roos

Hadoop for dummies: [Understand the value of big data and how Hadoop can help manage it ; navigate the Hadoop 2 ecosystem and create clusters ; use applications for data mining, problem-solving,

Achari

Hadoop essentials: delve into the key concepts of Hadoop and get a thorough understanding of the Hadoop ecosystem

Lillian Pierson

Data Science for Dummies, 2nd Edition

Adam Fowler

NoSQL For Dummies

Grover Mark

Hadoop Application Architectures: Designing Real-World Big Data Applications

Warshow

Rembrandt: Details

LazyProgrammer

Big Data, MapReduce, Hadoop, and Spark with Python

Ted Dunning

Real-World Hadoop

Mark Grover

Hadoop Application Architectures

Shiva Achari

Hadoop Essentials

Dirk deRoos

Hadoop For Dummies

Reviews about «Big Data Essentials»

Discussion, reviews of the book Big Data Essentials and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.