New: All the best for your Final Examination!!!!
Numerical methods for presenting data are known as descriptive statistics. They include the following four measures:
The tendency of the variate values to get clustered around a certain representative value of the series is known as central tendency.
The value of the variate which is thoroughly representative of the series is referred to as the central tendency .
Various methods of determining the actual value at which the variate values of the data tend to cluster are called the measures of central tendency.
Arithmetic mean may be defined as the quotient obtained by dividing the total of all the values of a variable with the total number of observations.
If X1, X2, X3, ..........., Xn are the individual observations pertaining to variable X, then the arithmetic mean is obtained as:
Σ Xi
`X = -------
n
Where,
`X = Arithmetic mean
Xi = ith value of variable X
Σ = sum of values of X from 1st to nth
n = Total number of observations or items.
Note: The unit of measurement should be mentioned with the value of the mean.
Direct method
Where,
ΣfX = Sum of the product of the values and their corresponding frequencies.
Σfm = Sum of the product of mid-values of classes and their frequencies.
n = Σf = Total number of observations.
Where,
A = Assumed mean
d = (X – A) or (m – A) = Deviation of individual items or mid-values from assumed mean
Σd = Sum of deviations of individual observations (or mid values of the classes) from the
assumed mean
Σfd = Sum of the products of the deviations and their corresponding frequencies
Where,
C = Common factor (or constant) by which each deviation is divided, and
Where,
X = Last value of the variable arranged in ascending order for individual or discrete series.
m = Mid-value of the last class arranged in ascending order for continuous series.
i = Interval of equal magnitude between X or m-values depending upon the series used.
When data on increase in the ratio or percentage, the geometric mean is suitable Measure of Central Tendency.
Suppose we have an investment which returns 10% the first year, 50% the second year, and 30% the third year. What is its average rate of return? It is not the arithmetic mean, because what these numbers mean is that on the first year our investment was multiplied (not added to) by 1.10, on the second year it was multiplied by 1.50, and during the third year it was multiplied by 1.30.
Geometric mean can be defined as the nth root of the product of values of the given set of data.
1. Root Method
The given formula is appropriate when the values of the items are small, simple and capable of being factorized easily. If the values are big or complicated, then the geometric mean is more easily calculated by the logarithmic operation method
2. Log method
Direct Method
Short cut method
Where,
Log A = Log of the assumed mean,
Σd (log A) = Sum of deviation of log X and log A = (log X – log A)
GMn=X1.X2......Xn
5. The geometric mean of different series with same n and same product is always equal,
e.g. geometric mean of (1, 36), (2,18) and (4, 9) = √36 = 6.
6. The product of a series of numbers remains the same even when each number of the series is replaced by its geometric mean.
The Harmonic Mean is defined as the reciprocal of the arithmetic mean of the reciprocals of the values of a variable.
Direct Method
Short cut method
Where, Rec. A = Reciprocal of the assumed mean.
For any series AM ≥ GM ≥ HM. But if all the observations are same, then these three means are equal.
For two observations (X and Y):
If X1, X2, .........., Xn denote n values of a variable X and W1, W2, ......., Wn are their weights respectively, then their weighted mean is given by,
The median can be defined as the value of the middle item of a series arranged in ascending or descending order of magnitude.
1. Tabular Method
The value of the median is located from table as follows:
(i)When n = Odd:-
Median = (n+1)/2 th item in an array
When n = even :-
Median = Average of (n/2)th and (n+2)/2th item
When all the classes are of equal length
Where, L0 = Lower limit of median class, L1 = Upper limit of median class
f = Frequency of median class, Cf = Cumulative frequency up to the class preceding the median class, and i = Class interval
When the classes are not of equal length
2. Graphic method
Method-I
Method-2
We can obtain the median by drawing both less than and more than ogive in the same curve simultaneously. The point of intersection of both the cumulative frequency curves with respect to variate value on the horizontal axis will be the median of given continuous frequency distribution.
Example: Determine the median from the following frequency distribution by graphic method and compare the same with that obtained from the formula.
Method I
Method II
Calculation from the formula
Median number = (n/2) = (56/2) = 28
(n/2) - cf
Median = L0 + ------------- X i
f
28 – 20
Median =40 + ----------- X 10 = 40 + 5.33 = 45.33
15
The mode is defined as the value of a series that occurs maximum number of times, i.e. the item or value having maximum frequency.
1. Method of inspection
Mode or the modal class is determined by simple inspection of the distribution.
The value occurring maximum number of times or the class value against which the maximum frequency stands is taken as the modal value or the modal class.
2. Method of Grouping
(a) Preparation of grouping table
(b) Preparation of an analysis table
After determining the modal class, the exact value of the mode is obtained by applying any one of the following formula of interpolation:
Where,
L0 = Lower limit of modal class
L1 = Upper limit of modal class
f0 = Frequency of modal class
f1 = Frequency of the class preceding the modal class
f2 = Frequency of the class succeeding the modal class
3. Graphic Method
Suitable in case of frequency distribution of a continuous nature.
Procedure:
4. Method of Empirical Relation
This method, proposed by Karl Pearson, is based on the pertinent relationship among the mean, median and mode in case of moderately asymmetric or skewed distribution.
Mode = 3 Median - 2 Mean