LitArk » Books » Children

Wolfgang Karl HГ¤rdle - Applied Multivariate Statistical Analysis

Here you can read online Wolfgang Karl HГ¤rdle - Applied Multivariate Statistical Analysis full text of the book (entire story) in english for free. Download pdf and epub, get meaning, cover and reviews about this ebook. year: 0, publisher: Springer Berlin Heidelberg, Berlin, Heidelberg, genre: Children. Description of the work, (preface) as well as reviews are available. Best literature library LitArk.com created for fans of good reading and offers a wide selection of genres:

Romance novel Science fiction Adventure Detective Science History Home and family Prose Art Politics Computer Non-fiction Religion Business Children Humor

Choose a favorite category and find really read worthwhile books. Enjoy immersion in the world of imagination, feel the emotions of the characters or learn something new for yourself, make an fascinating discovery.

Book:
Applied Multivariate Statistical Analysis
Author:
Wolfgang Karl Hrdle / Lopold Simar
Publisher:
Springer Berlin Heidelberg, Berlin, Heidelberg
Genre:
Books / Children
Year:
0
Rating:
5 / 5
Favourites:
Add to favourites
Your mark:
- 100
- 1
- 2
- 3
- 4
- 5

Description
Author's other books
Similar books

Applied Multivariate Statistical Analysis: summary, description and annotation

We offer to read an annotation, description, summary or preface (depends on what the author of the book "Applied Multivariate Statistical Analysis" wrote himself). If you haven't found the necessary information about the book — write in the comments, we will try to find it.

Wolfgang Karl HГ¤rdle: author's other books

Who wrote Applied Multivariate Statistical Analysis? Find out the surname, the name of the author of the book and a list of all author's works by series.

Applied Multivariate Statistical Analysis — read online for free the complete book (whole text) full work

Below is the text of the book, divided by pages. System saving the place of the last page read, allows you to conveniently read the book "Applied Multivariate Statistical Analysis" online for free, without having to search again every time where you left off. Put a bookmark, and you can go to the page where you finished reading at any time.

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Part I
Descriptive Techniques

Springer-Verlag Berlin Heidelberg 2015

Wolfgang Karl Hrdle and Lopold Simar Applied Multivariate Statistical Analysis 10.1007/978-3-662-45171-7_1

1. Comparison of Batches

Wolfgang Karl Hrdle 1 and Lopold Simar 2

(1)

C.A.S.E. Centre f. Appl. Stat. & Econ. School of Business and Economics, Humboldt-Universitt zu Berlin, Berlin, Germany

(2)

Center of Operations Research & Econometrics (CORE), Katholieke Univeristeit Leuven Inst. Statistics, Leuven, Belgium

Multivariate statistical analysis is concerned with analysing and understanding data in high dimensions. We suppose that we are given a set Applied Multivariate Statistical Analysis - image 1

Applied Multivariate Statistical Analysis - image 1

of n observations of a variable vector X in Applied Multivariate Statistical Analysis - image 2

. That is, we suppose that each observation x i has p dimensions:

Applied Multivariate Statistical Analysis - image 3

and that it is an observed value of a variable vector Applied Multivariate Statistical Analysis - image 4

. Therefore, X is composed of p random variables:

Applied Multivariate Statistical Analysis - image 5

where X j , for Applied Multivariate Statistical Analysis - image 6

, is a one-dimensional random variable. How do we begin to analyse this kind of data? Before we investigate questions on what inferences we can reach from the data, we should think about how to look at the data. This involves descriptive techniques. Questions that we could answer by descriptive techniques are:

Are there components of X that are more spread out than others?
Are there some elements of X that indicate sub-groups of the data?
Are there outliers in the components of X ?
How normal is the distribution of the data?
Are there low-dimensional linear combinations of X that show non-normal behaviour?

One difficulty of descriptive methods for high-dimensional data is the human perceptional system. Point clouds in two dimensions are easy to understand and to interpret. With modern interactive computing techniques we have the possibility to see real time 3D rotations and thus to perceive also three-dimensional data. A sliding technique as described in Hrdle and Scott () may give insight into four-dimensional structures by presenting dynamic 3D density contours as the fourth variable is changed over its range.

A qualitative jump in presentation difficulties occurs for dimensions greater than or equal to 5, unless the high-dimensional structure can be mapped into lower-dimensional components (Klinke & Polzehl, ). Features like clustered sub-groups or outliers, however, can be detected using a purely graphical analysis.

In this chapter, we investigate the basic descriptive and graphical techniques allowing simple exploratory data analysis. We begin the exploration of a data set using boxplots. A boxplot is a simple univariate device that detects outliers component by component and that can compare distributions of the data among different groups. Next, several multivariate techniques are introduced (Flury faces, Andrews curves and parallel coordinates plots (PCPs)) which provide graphical displays addressing the questions formulated above. The advantages and the disadvantages of each of these techniques are stressed.

Two basic techniques for estimating densities are also presented: histograms and kernel densities. A density estimate gives a quick insight into the shape of the distribution of the data. We show that kernel density estimates (KDEs) overcome some of the drawbacks of the histograms.

Finally, scatterplots are shown to be very useful for plotting bivariate or trivariate variables against each other: they help to understand the nature of the relationship among variables in a data set and allow for the detection of groups or clusters of points. Draftman plots or matrix plots are the visualisation of several bivariate scatterplots on the same display. They help detect structures in conditional dependencies by brushing across the plots. Outliers and observations that need special attention may be discovered with Andrews curves and PCPs. This chapter ends with an explanatory analysis of the Boston Housing data.

1.1 Boxplots

Example 1.1

The Swiss bank data (see Chap. ) consists of 200 measurements on Swiss bank notes. The first half of these measurements are from genuine bank notes, the other half are from counterfeit bank notes.

Fig. 1.1

An old Swiss 1000-franc bank note

The authorities measured, as indicated in Fig.,

These data are taken from Flury and Riedwyl The aim is to study how these - photo 8

These data are taken from Flury and Riedwyl (). The aim is to study how these measurements may be used in determining whether a bill is genuine or counterfeit.

The boxplot is a graphical technique that displays the distribution of variables. It helps us see the location, skewness, spread, tail length and outlying points.

It is particularly useful in comparing different batches. The boxplot is a graphical representation of the Five Number Summary . To introduce the Five Number Summary, let us consider for a moment a smaller, one-dimensional data set: the population of the 15 largest world cities in 2006 (Table ).

Table 1.1

The 15 largest world cities in 2006

City	Country	Pop. (10,000)	Order statistics
Tokyo	Japan	3,420	x (15)
Mexico city	Mexico	2,280	x (14)
Seoul	South Korea	2,230	x (13)
New York	USA	2,190	x (12)
Sao Paulo	Brazil	2,020	x (11)
Bombay	India	1,985	x (10)
Delhi	India	1,970	x (9)
Shanghai	China	1,815	x (8)
Los Angeles	USA	1,800	x (7)
Osaka	Japan	1,680	x (6)
Jakarta	Indonesia	1,655	x (5)
Calcutta	India	1,565	x (4)

Light

Font size:

↓

↑

Reset

Interval:

↓

↑

Bookmark:

Make

Similar books «Applied Multivariate Statistical Analysis»

Look at similar books to Applied Multivariate Statistical Analysis. We have selected literature similar in name and meaning in the hope of providing readers with more options to find new, interesting, not yet read works.

Konstantinos N. Zafeiris (editor)

Data Analysis and Related Applications, Volume 2: Multivariate, Health and Demographic Data Analysis

Simona Balzano

Statistical Learning and Modeling in Data Analysis: Methods and Applications

Szymon Borak Wolfgang Karl Härdle

Statistics of Financial Markets: Exercises and Solutions

Zelterman

Applied Multivariate Statistics with R

Jürgen Franke Wolfgang Karl Härdle

Statistics of Financial Markets

Wolfgang Karl HГ¤rdle Cathy Yi-Hsuan Chen

Applied Quantitative Finance

John D. Fox

An R Companion to Applied Regression

Bill Shipley

Cause and Correlation in Biology: A User’s Guide to Path Analysis, Structural Equations and Causal Inference with R

Roger D. Peng

Exploratory data analysis with R

Keenan A. Pituch

Applied Multivariate Statistics for the Social Sciences: Analyses with SAS and IBMs SPSS

Wolfgang Yourgrau

Treatise on Irreversible and Statistical Thermodynamics: An Introduction to Nonclassical Thermodynamics

Jussi Klemelä

Multivariate Nonparametric Regression and Visualization: With R and Applications to Finance

Reviews about «Applied Multivariate Statistical Analysis»

Discussion, reviews of the book Applied Multivariate Statistical Analysis and just readers' own opinions. Leave your comments, write what you think about the work, its meaning or the main characters. Specify what exactly you liked and what you didn't like, and why you think so.