Development of Estimation Procedure of Population Mean in Two-Phase Stratified Sampling

Partha Parichha; Kajla Basu; Arnab Bandyopadhyay

doi:10.5772/intechopen.82850

Abstract

This article describes the problem of estimation of finite population mean in two-phase stratified random sampling. Using information on two auxiliary variables, a class of product to regression chain type estimators has been proposed and its characteristic is discussed. The unbiased version of the proposed class of estimators has been constructed and the optimality condition for the proposed class of estimators is derived. The efficacy of the proposed methodology has been justified through empirical investigations carried over the data set of natural population as well as the data set of artificially generated population. The survey statistician may be suggested to use it.

Keywords

stratified random sampling
double sampling
auxiliary variables
chain type estimators
bias
mean square error
efficiency
AMS 2000 Mathematics Subject Classification: 62D05

Author Information

Show +

Partha Parichha
- Department of Mathematics, Asansol Engineering College, India
Kajla Basu
- Department of Mathematics, National Institute of Technology, India
Arnab Bandyopadhyay*
- Department of Mathematics, Asansol Engineering College, India

*Address all correspondence to: arnabbandyopadhyay4@gmail.com

1. Introduction

In this present paper we have made use of Auxiliary information extracted from the variables having correlation with study variable. Auxiliary information may be utilized at planning, design and estimation stages to develop improved estimation procedures in sample surveys. Sometimes, information on auxiliary variable may be readily available for all the units of population; for example, tonnage (or seat capacity) of each vehicle or ship is known in survey sampling of transportation and number of beds available in different hospitals may be known well in advance in health care surveys. If such information lacks, it is sometimes, relatively cheap to take a large preliminary sample where auxiliary variable alone is measured, such practice is applicable in two-phase (or double) sampling. Two-phase stratified sampling happens to be a powerful and cost effective (economical) technique for obtaining the reliable estimate in first-phase (preliminary) sample for the unknown parameters of the auxiliary variables. For example, Sukhatme [1] mentioned that in a survey to estimate the production of lime crop based on orchards as sampling units, a comparatively larger sample is drawn to determine the acreage under the crop while the yield rate is determined from a sub sample of the orchards selected for determining acreage.

In order to construct an efficient estimator of the population mean of the auxiliary variable in first-phase (preliminary) sample, Chand [2] introduced a technique of chaining another auxiliary variable with the first auxiliary variable by using the ratio estimator in the first phase sample. The estimator is known as chain-type ratio estimator. This work was further extended by Kiregyera [3, 4], Tracy et al. [5], Singh and Espejo [6], Gupta and Shabbir [7], Shukla et al. [8], Choudhury and Singh [9], Parichha et al. [10] and among others, where they proposed various chain-type ratio and regression estimators.

In practice, the population may often consist of heterogeneous units. For example, in socio-economic surveys, people may live in rural areas, urban localities, ordinary domestic houses, hostels, hospitals and jail, etc. In such a situation one should carefully study the population according to the characteristics of regions and then apply sampling scheme strata wise independently. This procedure is known as stratified random sampling. It may be noted that most of the developments in two-phase sampling scheme are based on simple random sampling only while limited number of attempts are taken to address the problems of two-phase sampling scheme in the platform of stratified random sampling. It may be also noticeable that the most of the research work on two-phase sampling are producing biased estimates. However, biased becomes a serious drawback in sample surveys. A sampling method is called biased if it systematically favors some outcomes over others. It results in a biased sample of a population (or non-human factors) in which all individuals, or instances, were not equally likely to have been selected. If this is not accounted for, results can be erroneously attributed to the phenomenon under study rather than to the method of sampling. For example, telephone sampling is common in marketing surveys. A simple random sample may be chosen from the sampling frame consisting of a list of telephone numbers of people in the area being surveyed. This method does involve taking a simple random sample, but it is not a simple random sample of the target population (consumers in the area being surveyed). It will miss people who do not have a phone. It may also miss people who only have a cell phone that has an area code not in the region being surveyed. It will also miss people who do not wish to be surveyed, including those who monitor calls on an answering machine and don’t answer those from telephone surveyors. Thus the method systematically excludes certain types of consumers in the area. It is obvious that the inferences from a biased sample are not as trustworthy as conclusions from a truly random sample.

Encouraged with the above work, we have proposed a class of product to regression chain type estimators in stratified sampling using two auxiliary variables under double sampling. The unbiased version of the proposed class of estimators has been obtained which make the estimation strategy more practicable. The dominance of the proposed estimation strategy over the conventional ones has been established through empirical investigations carried over the data set of natural as well as artificially generated population.

2. Sampling structures and notations

Consider a finite population U = {1, 2,…, N} of N identifiable units divided into L homogeneous strata with the hth stratum (h = 1, 2,…, L) having N h . Let y and (x, z) be the study variable and two auxiliary variables respectively taking values y ih and ( x ih , z ih ), respectively, for the unit i = 1,2,… N h of the hth stratum. Y ¯ = ∑ h = 1 L Y ¯ h W h , X ¯ = ∑ h = 1 L X ¯ h W h , Z ¯ = ∑ h = 1 L Z ¯ h W h be population means of the study and the auxiliary variables, and Y ¯ h = ∑ i = 1 N h y hi N h , X ¯ h = ∑ i = 1 N h x hi N h , Z ¯ h = ∑ i = 1 N h z hi N h be the corresponding stratum means. Here W h = N h N is the known stratum weight.

Let C yh = S yh Y ¯ h , C xh = S xh X ¯ h and C zh = S zh Z ¯ h be the coefficients of variation where S y h = ∑ i = 1 N h y hi − Y ¯ h 2 N h − 1 , S x h = ∑ i = 1 N h x hi − X ¯ h 2 N h − 1 , S z h = ∑ i = 1 N h z hi − Z ¯ h 2 N h − 1 are the population standard deviations in the hth stratum.

Let ρ yx h , ρ yz h and ρ xz h be the correlation coefficients between (y, x), (y, z), and (x, z) respectively in the hth stratum. Chand [2] and Kiregyera [3, 4] discussed a situation in simple random sampling when information on x is unknown but another auxiliary variable z is easily available. It is assumed that population mean of one auxiliary variable z is known in advance and the population mean of the other auxiliary variable x is unknown. We seek to estimate through a two-phase stratified sampling design. Using a simple random sample without replacement (SRSWOR) sampling scheme at each phase, we adopt the double sampling scheme as follows.

In the first phase, a preliminary large sample of size n h ′ is drawn from the hth stratum of size N h (h = 1, 2,…, L) and information on the auxiliary variables x and z is observed.
In the second phase, a sub-sample of size n h is drawn from first phase sample n h ′ units from the h th stratum of size N h and information on both the study variable y and the auxiliary variables x and z is taken.

y ¯ h = 1 n h ∑ i = 1 n h y hi , x ¯ h = 1 n h ∑ i = 1 n h x hi , z ¯ h = 1 n h ∑ i = 1 n h z hi , x ¯ h ′ = 1 n h ′ ∑ i = 1 n h ′ x hi , and z ¯ h ′ = 1 n h ′ ∑ i = 1 n h ′ z hi be the corresponding sample means in the hth stratum.

3. Discussion on existing estimation strategies

The usual stratified mean estimator ( y ¯ st ) for population mean ( Y ¯ ), is given by

y ¯ st = ∑ h = 1 L w h y ¯ h E1

The mean square error (MSE) of y ¯ st , is given by

MES y ¯ st = ∑ h = 1 ` L w h 2 1 n h − 1 N h s yh 2 E2

Motivated with the technique adopted by Chand [2], one may frame the chain ratio-product type estimator in stratified sampling structure as

y ¯ RP h = ∑ h = 1 L w h y ¯ h X ¯ h ′ x ¯ h Z ¯ h z ¯ h ′ E3

The bias and MSE respectively of y ¯ h RP , to first order of approximation, are obtained as

Bias y ¯ RP h ≅ ∑ h = 1 L w h y ¯ h 1 n h − 1 n h ′ A 1 h + 1 n h ′ − 1 N h A 2 h E4

MSE y ¯ RP h = ∑ h = 1 L w h 2 s yh 2 1 n h − 1 n h ′ A 3 h + 1 n h ′ − 1 N h A 4 h + 1 n h − 1 N h E5

where

A 1 h = C xh 2 − ρ yxh C yh C xh and A 2 h = C zh 2 − ρ yzh C yh C zh

A 3 h = C xh 2 C yh 2 − 2 ρ yxh C xh C yh and A 4 h = C zh 2 C yh 2 − 2 ρ yzh C zh C yh

Similarly, inspired with the technique adopted by Choudhary and Sing [9], one may frame the two-phase stratified random sampling estimator in stratified sampling as

y ¯ cs h = ∑ h = 1 L w h y ¯ h k h x ¯ h ′ x ¯ h Z ¯ h z ¯ h ′ + 1 − k h x ¯ h x ¯ h ′ z ¯ h ′ Z ¯ h E6

where k h is constant.

Bias y ¯ cs h ≅ ∑ h = 1 L w h y ¯ h A 5 h

A 5 h = 1 − 2 k h C yh 1 n h − 1 n h ′ ρ yxh C xh + 1 n h ′ − 1 N h ρ yzh C zh + k h 1 n h − 1 N h C xh 2 + 1 n h ′ − 1 N h C zh 2 E7

And MSE y ¯ cs h min = ∑ h = 1 L w h 2 s y h 2 × 1 n h − 1 N h − 1 n h − 1 n h ′ ρ yx h C x h − 1 n h ′ − 1 N h ρ yz h C z h 2 1 n h − 1 N h C x h 2 + 1 n h ′ − 1 N h C z h 2 E8

4. Formulation of proposed estimation strategy

Motivated with the earlier work, discussed above, we have constructed a class of product to regression chain type estimators as

t p = ∑ h = 1 L w h y ¯ h k h x ¯ h ′ x ¯ h + 1 − k h x ¯ id h ′ x ¯ h E9

where k h h = 1 2 … L is a real constant which can be suitably determined by minimizing the M. S. E. of the class of estimator t p and x ¯ d h ′ = x ¯ ′ + b xz h n h ′ Z ¯ h − z ¯ h ′ ; where b xz h n h ′ is the regression coefficient between the variables x and z at the hth stratum.

5. Bias and mean square errors of the proposed class of estimator t_p

It can be easily noted that the proposed class of estimators t p defined in Eqs. (8) is chain product and regression type estimator. Therefore, it is biased estimator for population mean Y ¯ . So, we obtain biases and mean square errors under large sample approximations using the following transformations:

y ¯ h = Y ¯ h 1 + e 1 , x ¯ h = X ¯ h 1 + e 2 , x ¯ h ′ = X ¯ h 1 + e 3 , z ¯ h ′ = Z ¯ h 1 + e 4 , s xz h ′ = S xz h 1 + e 5 , s z h ′ 2 = S z h 2 1 + e 6

and E e i = 0 for (i = 1, 2,…, 6), e_i for (i = 1, 2,…, 6) are relative error term.

Under above transformations the class of estimator t_p may be represented as

t p = ∑ h = 1 L w h Y ¯ 1 + e 1 1 − k h 1 + e 3 1 + e 2 − 1 + k h 1 + e 3 − Z ¯ h X ¯ h β xz h e 4 + e 4 e 5 − e 4 e 6 1 + e 2 − 1 E10

We have the following expectations of the sample statistics of two-phase stratified sampling as

E e 1 2 = f 1 C y h 2 , E e 2 2 = f 1 C x h 2 , E e 3 2 = f 2 C x h 2 , E e 4 2 = f 2 C z h 2 E e 1 e 2 = f 1 ρ yx h C y h C x h , E e 1 e 3 = f 2 ρ yx h C y h C x h , E e 2 e 3 = f 2 C x h 2 , E e 2 e 4 = E e 3 e 4 = f 2 ρ xz h C x h C z h , E e 4 e 5 = f 2 μ 102 Z ¯ h S xz h , E e 4 e 6 = f 2 μ 003 Z ¯ h S z h 2 , E e 2 e 5 = f 2 μ 201 X ¯ h S xz h , E e 2 e 6 = f 2 μ 102 X ¯ h S z h 2 , E e 1 e 4 = f 2 ρ yz h C y h C z h . E11

where

f 1 = 1 n h − 1 N h , f 3 = 1 n h − 1 n h ′ , f 2 = 1 n h ′ − 1 N h ,

μ pqr = 1 N h ∑ i = 1 N h x i − X ¯ h p y i − Y ¯ h q z i − Z ¯ h r ; p q r ≥ 0

Expanding binomially, using results from Eq. (1) and retaining the terms up to first order of sample size, we have derived the expressions of bias B(.) and mean square error M(.) of the class of estimators t_p as

B t p = E t p − Y h ¯ = ∑ h = 1 L w h Y ¯ 1 − k h b xz h Z h ¯ X h ¯ f 2 S xz h X ¯ h Z h ¯ − f 1 S yz h Y ¯ h Z h ¯ − f 2 μ 102 S xz h Z h ¯ − μ 003 S Z h 2 Z h ¯ + f 3 S x h 2 X ¯ h 2 − S yx h Y ¯ h X ¯ h E12

M t p = E t p − Y ¯ h 2 = ∑ h = 1 L w h Y ¯ 2 h f 1 C y h 2 + k h 2 a + 2 k h b + c E13

where a = f 2 ρ xz h 2 C x h 2 and b = f 2 ρ yz h ρ xz h C y h C x h − f 2 ρ xz h 2 C x h 2 c = f 3 C x h 2 − 2 f 3 ρ yx h C y h C x h + f 2 ρ xz h 2 C x h 2 − 2 f 2 ρ yz h ρ xz h C y h C x h .

6. Bias reduction for the proposed class of estimators

In recent time serious drawback is bias of an estimator. Therefore, unbiased versions of the proposed classes of estimators are more desirable. Motivated with this argument and influenced by the bias correction techniques of Tracy et al. [5] and Bandyopadhyay and Singh [11] we proceed to derive the unbiased version of our proposed class of estimator t_p.

From Eq. (12), we observe that the expression of bias of the estimator t_p contains the population parameters such as μ 003 , μ 102 , S yx h , S yz h , S x h 2 , S y h 2 , Y ¯ h , X ¯ h , S yz h and S z h 2 . Since S z h 2 is known while μ 003 , μ 102 , S yx h , S yz h , S x h 2 , S y h 2 , Y ¯ h , X ¯ h and S yz h are unknown, replacing μ 003 , μ 102 , S yx h , S yz h , S x h 2 , S y h 2 , Y ¯ h , X ¯ h , by their respective sample estimator (based on the second phase sample of size m) m 003 , m 102 , s yz h , s x h 2 , s y h 2 , y ¯ h , x ¯ h and s yz h , we get an estimator of B(t_p) and

b t p = ∑ h = 1 L w h y ¯ h 1 − k h b xz h z ¯ h x ¯ h f 2 s xz h x ¯ h z ¯ h − f 1 s yz h y ¯ h z ¯ h − f 2 m 102 s xz h z ¯ h − m 003 s Z h 2 z ¯ h + f 3 s x h 2 x ¯ 2 h − s yx h y ¯ h x ¯ h . E14

where m pqr = 1 m ∑ i = 1 m x hi − x ¯ h p y hi − y ¯ h q z hi − z ¯ h r .

Motivating with the bias reduction techniques of Tracy et al. [5] and Bandyopadhyay and Singh [11], we have derived the unbiased version of the proposed class of estimators t_p to the first order of approximations two-phase stratified sampling.

t p ′ = t p − b t p

which becomes

t p ′ = ∑ h = 1 L w h y ¯ h k h x ¯ h ′ x ¯ h + 1 − k h x ¯ id h ′ x ¯ h − y h ¯ 1 − k h b xz h z h ¯ x h ¯ f 2 s xz h x h ¯ z h ¯ − f 1 s yz h y h ¯ z h ¯ − f 2 m 102 s xz h z h ¯ − m 003 s Z h 2 z h ¯ + f 3 s x h 2 x h ¯ 2 − s yx h y h x h ¯ E15

Thus, the variance of t p ′ to the first order of approximation are obtained as

V t p ′ = M t p = ∑ h = 1 L Y ¯ h 2 f 1 C y h 2 + k h 2 a + 2 k h b + c E16

From Eqs. (10) and (15) it is to be noted that the class of estimators t p ′ is preferable over the class of estimators t p of two –phase sampling set up as t p ′ is unbiased (up to first order of sample size) class of estimator of Y ¯ h while the class of estimator t_p is biased.

7. Minimum variance of proposed class of estimators

It is obvious from the Eq. (16) that the variances of the proposed class of estimator t p ′ depend on the value of the constant k_h. Therefore, we desire to minimize their variances and discussed them below. The optimality condition under which proposed class of estimators t p ′ have minimum variance is obtained as

k h = − b a E17

Substituting the optimum value of the constant k_h in Eq. (19), we have the minimum variance of the class of estimators t p ′ as

Min . V t p ′ = ∑ h = 1 L W 2 h Y ¯ h 2 f 1 C y h 2 − b 2 a + C E18

8. Efficiency comparison of the proposed strategy

It is important to investigate the performance of the proposed class of estimators with respect to the existing ones. We use the two natural population and one artificially generated population data set to justify the supremacy of the proposed strategy.

8.1 Empirical investigations through natural populations

The data set of two natural populations has been presented below.

Population I (Source: Murthy [12], p. 228)
y: Factory output in thousand rupees, x: Number of workers in the factory, and z:Fixed capital of factory in thousand rupees.

The data consist of 80 observations which are divided into four strata according to the auxiliary variable z as: (i) z ≤ 500, (ii) 500 < z ≤ 1000, (iii) 1000 < z ≤ 2000, and z > 2000 respectively for allocation of sample size to different strata, Proportional allocation is used.

Stratum 1 z ≤ 500

N 1 = 19 , n 1 ′ = 11 , n 1 = 5 , Y ¯ 1 = 2669.247 , X ¯ 1 = 65.15789 Z ¯ 1 = 349.6842 , C y 1 = 0.28363 , C x 1 = 0.17153 , C z 1 = 0.31299 ρ yx 1 = 0.81381 , ρ yz 1 = 0.9364 , ρ xz 1 = 0.9044

Stratum 2 500 < z ≤ 1000

N 2 = 32 , n 2 ′ = 17 , n 2 = 8 , Y ¯ 2 = 4657.625 , X ¯ 2 = 139.9668 Z ¯ 2 = 706.5938 , C y 2 = 0.14366 , C x 2 = 0.3169 , C z 2 = 0.15457 ρ yx 2 = 0.8883 , ρ yz 2 = 0.9259 , ρ xz 2 = 0.8456

Stratum 3 1000 < z ≤ 2000

N 3 = 14 , n 3 ′ = 8 , n 3 = 3 , Y ¯ 3 = 6537.214 , X ¯ 3 = 403.2143 Z ¯ 3 = 1539.571 , C y 3 = 0.06365 , C x 3 = 0.20117 , C z 3 = 0.18004 ρ yx 3 = 0.9295 , ρ yz 3 = 0.9835 , ρ xz 3 = 0.9366

Stratum 4 z > 2000

N 4 = 15 , n 4 ′ = 9 , n 4 = 4 , Y ¯ 4 = 7843.667 , X ¯ 4 = 763.2 Z ¯ 4 = 2620.533 , C y 4 = 0.08232 , C x 4 = 0.22464 , C z 4 = 0.14156 ρ yx 4 = 0.9787 , ρ yz 4 = 0.9692 , ρ xz 4 = 0.9454

Population II (Source: Koyuncu and Kadilar [13]).

y: Number of teachers, x: Number of students both primary and secondary schools, and z: Number of classes both primary and secondary schools. There are 923 districts in 6 regions (as: (i) Marmara, (ii) Agean, (iii) Mediterranean, (iv) Central Anatolia, (v) Black Sea, (vi): East and Southeast Anatolia) in Turkey in 2007 (source: The Turkish Republic Ministry of Education).

Marmara region

N 1 = 127 , n 1 ′ = 60 , n 1 = 31 , Y ¯ 1 = 703.74 , X ¯ 1 = 20804.59 Z ¯ 1 = 498.28 , C y 1 = 1.25591 , C x 1 = 1.46538 , C z 1 = 1.115 ρ yx 1 = 0.936 , ρ yz 1 = 0.97891 , ρ xz 1 = 0.93958

Agean region

N 2 = 117 , n 2 ′ = 40 , n 2 = 21 , Y ¯ 2 = 413 , X ¯ 2 = 9211.79 Z ¯ 2 = 318.83 , C y 2 = 1.56155 , C x 2 = 1.64797 , C z 2 = 1.14804 ρ yx 2 = 0.996 , ρ yz 2 = 0.97624 , ρ xz 2 = 0.96958

Mediterranean

N 3 = 103 , n 3 ′ = 50 , n 3 = 29 , Y ¯ 3 = 573.17 , X ¯ 3 = 14309.3 Z ¯ 3 = 431.36 , C y 3 = 1.80307 , C x 3 = 1.9253 , C z 3 = 1.42097 ρ yx 3 = 0.994 , ρ yz 3 = 0.98351 , ρ xz 3 = 0.97655

Central Anatolia region

N 4 = 170 , n 4 ′ = 75 , n 4 = 38 , Y ¯ 4 = 424.66 , X ¯ 4 = 9478.85 Z ¯ 4 = 311.32 , C y 4 = 1.90878 , C x 4 = 1.92206 , C z 4 = 1.47124 ρ yx 4 = 0.983 , ρ yz 4 = 0.98296 , ρ xz 4 = 0.96362

Black sea region

N 5 = 205 , n 5 ′ = 40 , n 5 = 25 , Y ¯ 5 = 267.03 , X ¯ 5 = 5569.95 Z ¯ 5 = 227.20 , C y 5 = 1.51162 , C x 5 = 1.52564 , C z 5 = 1.14811 ρ yx 5 = 0.989 , ρ yz 5 = 0.96434 , ρ xz 5 = 0.96725 .

The percentage relative efficiencies (PRE) the proposed class of estimators t p ′ with respect to different estimators under their respective optimum conditions are shown below.

8.2 Empirical investigations through artificially generated population

An important aspect of simulation is that one builds a simulation model to replicate the actual system. Simulation allows comparison of analytical techniques and helps in concluding whether a newly developed technique is better than the existing ones. Motivated by Singh and Deo [14], Singh et al. [15] and Maji et al. [16] who have been adopted the artificial population generation techniques, we have generated five sets of independent random numbers of size N (N = 100) namely x 1 k ′ , y 1 k ′ , x 2 k ′ , y 2 k ′ and z k ′ k = 1 2 3 … N from a standard normal distribution with the help of R-software. By varying the correlation coefficients ρ yx and ρ xz , we have generated the following transformed variables of the population U with the values of σ y 2 = 50 , μ y = 10 , σ x 2 = 100 , μ x = 50 , σ z 2 = 50 and μ z = 20 as

y 1 k = μ y + σ y ρ xy x 1 k ′ + 1 − ρ yx 2 y 1 k ′ x 1 k = μ x + σ x x 1 k ′ z k = μ z + σ z ρ xz x 1 k ′ + 1 − ρ xz 2 z k ′ y 2 k = y 1 k and x 2 k = x 1 k .

We have split total population of size N = 100 into 5 strata each of size 20 i . e . N h = 20 h = 1 2 … 5 taking them sequentially and consider n h ′ = 12 and n h = 8 ; h = 1 2 … 5 for the efficiency comparison of the proposed strategy.

The percentage relative efficiencies the proposed class of estimators t p ′ with respect to different estimators (under their respective optimum conditions) are derived through the data set of the artificially generated population are obtained as:

9. Conclusion

From the construction of estimation strategy and efficiency comparison of the proposed methodology, following matters are noted.

Form Table 1, it is clear that the proposed class of estimators is at least 1% better than the existing one in estimating the population mean.
Similarly from Table 2 it is found that the new estimator is at least 28% better than the existing one.
It may also be noted from Tables 1 and 2 that the artificially generated population is homogeneous (the mean and variance of the respective variables are almost same for different strata) where the natural populations are heterogeneous (the mean and variance of the respective variables are different for different strata) in nature. Our suggested estimators performs with equal efficiency for both the types.
The unbiased version of the proposed technique has been obtained which make the proposed class of estimators much more practicable.

Estimator	PRE
Estimator	Population I	Population II
y ¯ st	173.3608	192.951
y ¯ h RP	101.1429	131.5654
y ¯ cs h	118.3215	172.226

Table 1.

PRE of the proposed estimator t p ′ with respect to different estimators through data set of natural population.

We use following expression to obtain the percent relative efficiency (PRE) of the proposed estimator t p ′ with respect to different estimators as PRE = V y ¯ Min . V t p ′ × 100 .

Estimator	PRE
Estimator	Artificially generated population
y ¯ st	179.623
y ¯ h RP	128.256
y ¯ cs h	154.879

Table 2.

PRE of the proposed estimator t p ′ with respect to different estimators through data set of artificially generated population.

We use following expression to obtain the percent relative efficiency (PRE) of the proposed estimator t p ′ with respect to different estimators as PRE = V y ¯ Min . V t p ′ × 100 .

Thus, it is found that the proposed estimation technique has addressed the problems of estimation through two-phase stratified sampling which may truthful for real life application where population is especially heterogeneous in nature and stratification is essential. Due to the benefits achieved by the new estimator, the survey statistician may be suggested to use it.

References

1. Sukhatme B. Some ratio type estimators in two-phase sampling. Journal of the American Statistics Associations. 1962;57:628-632
2. Chand L. Some ratio type estimators based on two or more auxiliary variables [unpublished PhD thesis]. Ames, Iowa (USA): Iowa State University; 1975
3. Kiregyera B. A chain ratio type estimators in finite population double sampling using two auxiliary variables. Metrika. 1980;17:217-223
4. Kiregyera B. Regression type estimators using two auxiliary variables and the model of double sampling from finite populations. Metrika. 1984;31:215-226
5. Tracy DS, Singh HP, Singh R. An alternative to the ratio-cum-product estimator in sample surveys. Journal of Statistical Planning and Inference. 1996;53:375-387
6. Singh HP, Espejo MR. Double sampling ratio-product estimator of a finite population mean in sampling surveys. Journal of Applied Statistics. 2007;34(1):71-85
7. Gupta S, Shabbir J. on the use of transformed auxiliary variables in estimating population mean by using two auxiliary variables. Journal of Statistical Planning and Inference. 2007;137:1606-1611
8. Shukla D, Pathak S, Thakur NS. Estimation of population mean using two auxiliary sources in sample surveys. Statistics in Transition. 2012;13(1):21-36
9. Choudhury S, Singh BK. A class of chain ratio–product type estimators with two auxiliary variables under double sampling scheme. Journal of the Korean Statistical Society. 2012;41:247-256
10. Parichha P, Basu K, Bandyopadhyay A, Mukhopadhyay P. Development of efficient estimation technique for population mean in two phase sampling using fuzzy tools. Journal of Applied Mathematics, Statistics and Informatics. 2017;13(2):5-28. DOI: 10.1515/jamsi-2017-0006
11. Bandyopadhyay A, Singh GN. Predictive estimation of population mean in two-phase sampling. Communications in Statistics: Theory and Methods. 2016;45(14):4249-4267. DOI: 10.1080/03610926.2014.919396
12. Murthy MN. Sampling Theory and Methods. Calcutta: Statistical Publishing Society; 1967
13. Koyuncu N, Kadilar C. Family of estimators of population mean using two auxiliary variables in stratified sampling. Communications in Statistics: Theory and Methods. 2009;38:2398-2417
14. Singh S, Deo B. Imputation by power transformation. Statistical Papers. 2003;4:555-579
15. Singh S, Joarder AH, Tracy DS. Median estimation using double sampling. Australian & New Zealand Journal of Statistics. 2001;43(1):33-46
16. Maji R, Singh GN, Bandyopadhyay A. Estimation of population mean in presence of random non-response in two-stage cluster sampling. Communications in Statistics: Theory and Methods, ISSN: 0361-0926. 2018. DOI: 10.1080/03610926.2018.1478101

[1] 1. Sukhatme B. Some ratio type estimators in two-phase sampling. Journal of the American Statistics Associations. 1962;57:628-632

[2] 2. Chand L. Some ratio type estimators based on two or more auxiliary variables [unpublished PhD thesis]. Ames, Iowa (USA): Iowa State University; 1975

[3] 3. Kiregyera B. A chain ratio type estimators in finite population double sampling using two auxiliary variables. Metrika. 1980;17:217-223

[4] 4. Kiregyera B. Regression type estimators using two auxiliary variables and the model of double sampling from finite populations. Metrika. 1984;31:215-226

[5] 5. Tracy DS, Singh HP, Singh R. An alternative to the ratio-cum-product estimator in sample surveys. Journal of Statistical Planning and Inference. 1996;53:375-387

[6] 6. Singh HP, Espejo MR. Double sampling ratio-product estimator of a finite population mean in sampling surveys. Journal of Applied Statistics. 2007;34(1):71-85

[7] 7. Gupta S, Shabbir J. on the use of transformed auxiliary variables in estimating population mean by using two auxiliary variables. Journal of Statistical Planning and Inference. 2007;137:1606-1611

[8] 8. Shukla D, Pathak S, Thakur NS. Estimation of population mean using two auxiliary sources in sample surveys. Statistics in Transition. 2012;13(1):21-36

[9] 9. Choudhury S, Singh BK. A class of chain ratio–product type estimators with two auxiliary variables under double sampling scheme. Journal of the Korean Statistical Society. 2012;41:247-256

[10] 10. Parichha P, Basu K, Bandyopadhyay A, Mukhopadhyay P. Development of efficient estimation technique for population mean in two phase sampling using fuzzy tools. Journal of Applied Mathematics, Statistics and Informatics. 2017;13(2):5-28. DOI: 10.1515/jamsi-2017-0006

[11] 11. Bandyopadhyay A, Singh GN. Predictive estimation of population mean in two-phase sampling. Communications in Statistics: Theory and Methods. 2016;45(14):4249-4267. DOI: 10.1080/03610926.2014.919396

[12] 12. Murthy MN. Sampling Theory and Methods. Calcutta: Statistical Publishing Society; 1967

[13] 13. Koyuncu N, Kadilar C. Family of estimators of population mean using two auxiliary variables in stratified sampling. Communications in Statistics: Theory and Methods. 2009;38:2398-2417

[14] 14. Singh S, Deo B. Imputation by power transformation. Statistical Papers. 2003;4:555-579

[15] 15. Singh S, Joarder AH, Tracy DS. Median estimation using double sampling. Australian & New Zealand Journal of Statistics. 2001;43(1):33-46

[16] 16. Maji R, Singh GN, Bandyopadhyay A. Estimation of population mean in presence of random non-response in two-stage cluster sampling. Communications in Statistics: Theory and Methods, ISSN: 0361-0926. 2018. DOI: 10.1080/03610926.2018.1478101

Development of Estimation Procedure of Population Mean in Two-Phase Stratified Sampling

Statistical Methodologies

Abstract

Keywords

Author Information

Partha Parichha

Kajla Basu

Arnab Bandyopadhyay*

1. Introduction

2. Sampling structures and notations

3. Discussion on existing estimation strategies

4. Formulation of proposed estimation strategy

5. Bias and mean square errors of the proposed class of estimator t_p

6. Bias reduction for the proposed class of estimators

7. Minimum variance of proposed class of estimators

8. Efficiency comparison of the proposed strategy

8.1 Empirical investigations through natural populations

8.2 Empirical investigations through artificially generated population

9. Conclusion

Table 1.

Table 2.

References

Methods of Russian Patent Analysis

Development of Estimation Procedure of Population Mean in Two-Phase Stratified Sampling

Statistical Methodologies

Abstract

Keywords

Author Information

Partha Parichha

Kajla Basu

Arnab Bandyopadhyay*

1. Introduction

2. Sampling structures and notations

3. Discussion on existing estimation strategies

4. Formulation of proposed estimation strategy

5. Bias and mean square errors of the proposed class of estimator tp

6. Bias reduction for the proposed class of estimators

7. Minimum variance of proposed class of estimators

8. Efficiency comparison of the proposed strategy

8.1 Empirical investigations through natural populations

8.2 Empirical investigations through artificially generated population

9. Conclusion

Table 1.

Table 2.

References

Continue reading from the same book

Statistical Methodologies

5. Bias and mean square errors of the proposed class of estimator t_p