Date: Fri, 10 Apr 2020 06:38:47 +0000 (GMT) Message-ID: <2054530037.3454.1586500727650@c4e8295e3740> Subject: Exported From Confluence MIME-Version: 1.0 Content-Type: multipart/related; boundary="----=_Part_3453_818808231.1586500727649" ------=_Part_3453_818808231.1586500727649 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Content-Location: file:///C:/exported.html Box Plot (5.1)

# Context

#### Data | Charts # New: Box Plot

A Box Plot (also known as a box-and-whisker plot) is a classical tool used in = Descriptive Statistics to analyze the distributions of numerical varia= bles.

It shows:

• the First Quartile Q1
• the Median Qthe Last Quartile Q3&nb= sp;
• the Minimum, i.e. the lowest datum still within 1.5 of the Int= erquartile Range (IQR =3D Q3 =E2=88=92 =  Q1) of the first quartile
• the Maximum, i.e. the highest datum still within 1.5 IQR of the last qu= artile
• the first Notch , where N is the numbe= r of observations
• the last Notch . Notches are use= ful in offering a rough guide to significance of difference of medians.
• the Mean
• Lower Suspect values that are within the Minimum and&= nbsp; • Upper Suspect values that are within the Maximum=  and • Lower Extreme values that are inferior to • Upper Extreme values that are superior to <= /li>
Example
=20
Wi= thout Selector Variable

You select the variable to be analyzed in the Param= eter window. Upon clicking on Display, the box plots are generated. Hovering over eac= h element returns its description and numerical value in the upper part of = the window. You can also zoom in vertically by selecting the y-range as illustrated = below. Double-clicking on the graph returns the default view. =  =

Clicking on a Suspect Point or Extreme Point (as indicated in the header) brings up a table with the corres= ponding data record. <= /p>

Wi= th a Selector Variable

This option is useful for comparing the conditional distributions of a v= ariable given the states of a Selector Variable. The graph below allows to compare the distributions of two variables, Smoothness (Mean) and Smoothness (Worst), given the two sta= tes of the variable Diagnosis (B and M). =20