Stat-I Que Chapterwise with ans

STAT-I que with ans

Chapter:1

State with suitable examples the role played by computer technology in applied statistics and the role of statistics in information technology.

Define primary data and secondary data and explain the difference between them.


Primary data is the data that is collected for the first time through personal experiences or evidence, particularly for research. It is also described as raw data or first-hand information. The mode of assembling the information is costly, as the analysis is done by an agency or an external organisation, and needs human resources and investment. The investigator supervises and controls the data collection process directly.

Secondary data is a second-hand data that is already collected and recorded by some researchers for their purpose, and not for the current research problem. It is accessible in the form of data collected from different sources such as government publications, censuses, internal records of the organisation, books, journal articles, websites and reports, etc.

Difference Between Primary Data and Seconday Data:

BasisPrimary DataSecondary Data
DefinitionPrimary data are those that are collected for the first time.Secondary data refer to those data that have already been collected by some other person.
OriginalityThese are original because these are collected by the investigator for the first time.These are not original because someone else has collected these for his own purpose.
Nature of DataThese are in the form of raw materials.These are in the finished form.
Reliability and SuitabilityThese are more reliable and suitable for the enquiry because these are collected for a particular purpose.These are less reliable and less suitable as someone else has collected the data which may not perfectly match our purpose.
Time and MoneyCollecting primary data is quite expensive both in the terms of time and money.Secondary data requires less time and money; hence it is economical.
Precaution and EditingNo particular precaution or editing is required while using the primary data as these were collected with a definite purpose.Both precaution and editing are essential as secondary data were collected by someone else for his own purpose.

What do your mean by measurement scale? Describe the different types of measurement scales used in statistics.

The measurement consisting of counting the number of units or parts of units displayed by objects and phenomena is called measurement scale. Following are the different types of measurement scale.

a. Nominal Scale:

It is a smallest and lowest level of measurement scale. It is simply a system of assigning number or the symbol to objects or events to distinguish one from anotehr in order to label them. The symbols or numbers have no numerical meaning. The arthematic operations cannot be used for these numerals.

b) Ordinal Scale:

The second and the lowest level of ordered scale is  the ordinal scale. It is the quantification of items by ranking. In this scale, the numerals are arranged in some order but the gaps between the positions of the numerals are not made equal. It represents quantitative values in ascending or descending order.

c) Interval Scale:

In addition to ordering the data, this scale was equidistant units to measure the difference between the scores. It assumes data has equal intervals. This scale doesn’t have absolute zero but only arbitary zero. Interval scale is the developed from of ordinary scale.

d) Ratio Scale:

Ratio scale is the ideal scale and on extended form of interval scale. It is most powerful scale of measurement. It possesses the characteristics of nominal, ordinal and interval scale. Ratio scale has as absolute zero or ture zero or natural zero of measurement.

Chapter:2

What do you understand by measures of central tendency and dispersion in descriptive statistics? What are the different measures of dispersion? The grouped frequency distribution in the table represents masses of a sample of 50 of the people from the data file, “Brain size”.

Mass(lbs)1-22-33-44-55-66-7
Frequency81015962

Compute mean, standard deviation, variance and coefficient of variation and interpret them.

Write notes on any two:

  1. Nominal and ordinal scale
  2. Kurtosis
  3. Five number summary

The lifetime of a certain electronic component is a normal random variate with the expectation of 5000 hours and a standard deviation of 100 hours. Compute the probabilities under the following conditions

  1. Lifetime of components between 3000 to 6500 hours
  2. Lifetime of components between 3000 t0 6500 hours
  3. Lifetime of components more than 6000 hours

What are different methods of measuring dispersion. Sample of polythene bags from two manufactures, A, B, are tested by a prospective buyer for bursting pressure and the results are as follows.

Bursting Pressure5-1010-1515-2020-2525-3030-35
Number of bags manufactured byA292954115
B91118322713

Which set of bags has more uniform pressure? If price are the same, Which manufacture’s bags would be preferred by buyer? Use appropriate statistical tool

Range, interquartile range, standard deviation. and variance are the three commonly used measures of dispersion.

Solution:

Let X be the mid value so that 𝑑=17.455. Let f and f be the number of bags of manufactures A and B respectively.

Classx𝑑=𝑥17.455fffdfd‘2fdfd‘2
5-9.97.45-229-48-1836
10-14.912.45-2911-99-1111
15-19.917.45029180000
20-24.922.451543254543232
25-29.927.4521127222254108
30-34.932.453513151539117
Σd = 3N=10N = 110Σfd=78Σfd‘2=160Σfd = 96Σfd‘2=304

For manufacture A:

𝑀𝑒𝑎𝑛(𝑋¯𝐴)=𝐴+𝑓𝑑𝑁×

17.45+78100×5

= 21

Standard Deviation(σA) = 𝑓𝑑2𝑁(𝑓𝑑𝑁)2

5×160110(78110)2

= 5 x 0.975

= 4.875

Covariance(A) = σ𝐴𝑋¯𝐴×100

4.87521×100

= 23.2%

For manufacture B:

𝑀𝑒𝑎𝑛(𝑋¯𝐵)=𝐴+𝑓𝑑𝑁×

17.45+96100×5

= 21.8

Standard Deviation(σA) = 𝑓𝑑2𝑁(𝑓𝑑𝑁)2

5×204110(96110)2

= 5 x 1.41

= 7.05

Covariance(A) = σ𝐴𝑋¯𝐴×100

7.0521.8×100

= 32.34%

Since, the cofficient of variation for A is less than that for B, the set of bags manufactured by A has more uniform pressure.Hence, the buyer would perfer to buy bags manufactured by A.

Another Method:

Brusting PressureMABAMBM
5-107.5291567.5
10-1512.5911112.5137.5
15-2017.52918507.5315
20-2522.554321215720
ΣA = 94ΣB = 70ΣAM = 1850ΣBM = 1240

Brusting pressure of Polythene A = 𝐴𝑀𝐴

185094

= 19.68

Brusting pressure of Polythene B = 𝐵𝑀𝐵

124070

= 17.68

Since, the brusting pressure of polythene A is greater than of polythene B. So, Buyers perfers the polythene A.


Compute percentile coefficient of kurtosis from the following data and interpret the result.

Hourly wages (Rs)23-2728-3233-3738-4243-4748-52
Number of workers22169431

Solution:

xfcf
23-272222
28-321638
33-37947
38-42451
43-47354
48-52155
N = 55

Now,

Calculating P75:

P75 = 75(𝑁100)𝑡𝑖𝑡𝑒𝑚 = 75(55100)𝑡 = 41.25th

In cf, the value just greater than 41.25 is 4. so, class  = 33 – 37

Now,

P75 = 𝐿+75𝑁100𝑐𝑓𝑓×𝑖

33+41.25389×4

= 34.45

Calculating P25:

P25 = 25(𝑁100)𝑡𝑖𝑡𝑒𝑚 = 25(55100)𝑡 = 13.75th

In cf, the value just greater than 13.755 is 22. so, class  = 23 – 27

Now,

P25 = 𝐿+25𝑁100𝑐𝑓𝑓×𝑖

23+13.75022×4

= 23.625

Calculating P90:

P90 = 90(𝑁100)𝑡𝑖𝑡𝑒𝑚 = 90(55100)𝑡 = 49.5th

In cf, the value just greater than 49.5 is 51. so, class  = 38 – 42

Now,

P90 = 𝐿+90𝑁100𝑐𝑓𝑓×𝑖

23+49.5474×4

= 40.5

Calculating P10:

P10 = 10(𝑁100)𝑡𝑖𝑡𝑒𝑚 = 10(55100)𝑡 = 5.5th

In cf, the value just greater than 5.5 is 21. so, class  = 23-27

Now,

P10 = 𝐿+10𝑁100𝑐𝑓𝑓×𝑖

23+5.5022×4

= 24

Now,

Percentile cofficient of Kurtosis

𝑃75𝑃252(𝑃90𝑃10)

34.4523.6252(40.524)

10.82533

= 0.33


Distinguish between absolute and relative measure of dispersion. Two computer manufacturers A and B compete for profitable and prestigious contract. In their rivalry, each claim that their computer a consistent. For this it was decided to start execution of the same program simultaneously on 50 computers of each company and recorded the time as given below.

Time (in seconds)0-22-44-66-88-1010-12
Nature of computers manufactured byA51613754
B27121991

Which company’s computer is more consistent?

Measure of dispersion is descriptive statistical measure used to measure the variation or spread or scatteriess in the data set.

There are two types of measures of dispersion i.e absolute and relative measure of dispersion. The difference between them are:

Absolute MeasureRelative Measure
A measure of dispersion is saied to be an absolute if it is expressed interms of original units of data.A measure of dispersion is saied to be relative if it is independent of original units of data and it is simply a cofficient.
Some absolute measures:

  1. Range
  2. Quartile deviation
  3. Mean deviation
  4. Standard deviation
Some relative measures are:

  1. Cofficient of range
  2. Cofficient of Quartile deviation
  3. Cofficient of Mean deviation
  4. Cofficient of variation
Absolute measures are in fixed valueRelative measures may be in ratio or percentage

Solution:

For Company A

Time(x)No of Comp.(f)mfmfm2
0-25155
2-416348144
4-613565325
6-87749343
8-105945405
10-1241144484
N = 50Σfm = 256Σfm2 = 1706

Now,

𝑋¯=𝑓𝑚𝑁=25650=5.12

Also, Standard deviation(σ) is

σ=𝑓𝑚2𝑁(𝑓𝑚𝑁)2

=170650(5.12)2

= 2.812

Also, Covariance (C.V) = σ𝑋¯×100

=2.8125.12×100

= 54.92%

For Company B

Time(x)No of Comp.(f)mfmfm2
0-22122
2-4732163
4-612560300
6-8197133931
8-109981729
10-1211111121
N = 50Σfm = 308Σfm2 = 2146

Now,

𝑋¯=𝑓𝑚𝑁=30850=6.16

Also, Standard deviation(σ) is

σ=𝑓𝑚2𝑁(𝑓𝑚𝑁)2

=214650(6.16)2

= 1.725

Also, Covariance (C.V) = σ𝑋¯×100

=1.7256.16×100

= 28%

Since, the standard deviation and covarience of company B is less than that of A. So, Company B is considered more consistent.

What are the roles of measure of dispersion in descriptive statistics? Following table gives the frequency distribution of thickness of computer chips (in nanometer) manufactured by two companies.

Thickness of computer chips51015202530
Number of chipsCompany A101524201813
company B12182022244

Measure of dispression is a descriotive statistical measure used to measure the variation or spreed or scatter less in the data set.

It gives additional information that enables to judge the reliability of measure of central tendency.

The measure of dispersion is important because it determines the margin of error. You’ll have when measuring the central tendency.

Solution Part:

For Company A:

xffxfx2
51050250
10151501500
15243605400
202040010000
251845011250
301339011700
N = 100Σfx = 1800Σfx2 = 40100

Now,

𝑥¯=𝑓𝑥𝑁 = 1800100 = 18

Standard deviation(σ) = 𝑓𝑥2𝑁(𝑓𝑥𝑁)2

40100100(1800100)2

= 8.77

And, Co-varience (c.v) = σ𝑥¯×100

8.7718×100

= 48.72%

For Company b:

xffxfx2
51260300
10181801800
15203004500
20224408800
252460015000
3041203600
N = 100Σfx = 1700Σfx2 = 34000

Now,

𝑥¯=𝑓𝑥𝑁 = 1700100 = 17

Standard deviation(σ) = 𝑓𝑥2𝑁(𝑓𝑥𝑁)2

34000100(1700100)2

= 7.14

And, Co-varience (c.v) = σ𝑥¯×100

7.1417×100

= 42%

Since, the standard deviation and covarience of company B is less than that of company A. so, company B is considered more consistent in terms of thickness of computer chip.


Chapter-3

Define conditional probability. A problem of statistics is given to three students. A, B, and C whose chances of solving the problem are in a ratio of 2:3:5. Find the probability that (i) non of them solve the problem (ii) the problem will be solved.

Define a random variable. For the following bi-variants probability distribution of X and Y , find

  1. marginal probability mass function of X and Y ,
  2. P(x≤1, Y=2),
  3. P(X≤1)
X/Y123456
0001/322/322/323/32
11/161/161/81/81/81/8
21/321/321/641/641/641/64

A variable whose value id determined by the outcome of a random experiment is called a random variable.

Given,

xy123456P(X=x)
0001/322/322/323/328/32
11/161/161/81/81/81/85/8
21/321/321/641/641/641/641/8
P(Y=y)3/323/321/6413/6413/6413/64sum = 1

Solution i):

Marginal pmf of x = p(X = xi) = 𝑗=1𝑛𝑝(𝑥𝑖𝑦𝑖)

∴P(X = 0) = P(X=0, Y=1) + P(X = 0, Y = 2) + P(X = 0, Y = 3) + P(X = 0, Y = 4) + P(X = 0, Y = 5) + P(X = 0, Y = 6)

= 8/32

∴P(X = 1) = P(X=1, Y=1) + P(X = 1, Y = 2) + P(X = 1, Y = 3) + P(X = 1, Y = 4) + P(X = 1, Y = 5) + P(X = 1, Y = 6)

= 5/8

∴P(X = 2) = P(X=2, Y=1) + P(X = 2, Y = 2) + P(X = 2, Y = 3) + P(X = 2, Y = 4) + P(X = 2, Y = 5) + P(X = 2, Y = 6)

= 1/8

Also, Marginal pmf of Y = P(Y=yi) = 𝑖=1𝑛𝑝(𝑥𝑖𝑦𝑖)

∴P(Y = 1) = P(X=0, Y=1) + P(X = 1, Y = 1) + P(X = 1, Y = 2)

= 3/32

Similarly,

P(Y = 2) = 3/32

P(Y = 3) = 11/64

P(Y = 4) = 13/64

P(Y = 5) = 13/64

P(Y = 6) = 15/64

Solution ii):

P(X ≤ 1, Y = 2)

= P(X = 0, Y= 2) + P(X = 1, Y = 2)

= 0 + 1/16

= 1/16

Solution ii)

P(X ≤ 1)

= P(X = 0) + P(X = 1)

= 8/32 + 5/8

= 7/8


Comments

Popular posts from this blog

LAB WORK OF PHYSICS (CSIT 1st Sem)

LAB WORK OF DIGITAL LOGICS: (CSIT 1st SEM)

CONQUER IIT(1st SEM) EXAM targeted !!