Application of a randomized response technique for the sensitive question survey in female sex workers in Xichang, China
Original Article

Application of a randomized response technique for the sensitive question survey in female sex workers in Xichang, China

Kejin Chen1,2, Zhilei Mao3, Zongda Jin2,4, Ge Gao2, Chaorong Bian5

1Group Health Division, Changzhou Maternal and Child Health Care Hospital, Changzhou, China;2Department of Epidemiology and Biostatistics, School of Public Health, Medical College of Soochow University, Suzhou, China;3Science and Education Department, Changzhou Maternal and Child Health Care Hospital, Changzhou, China;4Medical Service Division, The Third Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China;5Medical Service Division, Changzhou Children’s Hospital Affiliated to Nantong University, Changzhou, China

Contributions: (I) Conception and design: G Gao, K Chen; (II) Administrative support: G Gao, C Bian; (III) Provision of study materials or patients: G Gao, Z Jin; (IV) Collection and assembly of data: Z Jin, Z Mao; (V) Data analysis and interpretation: K Chen, C Bian, Z Jin; (VI) Manuscript writing: All authors; (VII) Final approval of manuscript: All authors.

Correspondence to: Dr. Chaorong Bian. Medical Service Division, Changzhou Children’s Hospital Affiliated to Nantong University, 468 Yanling Road, Changzhou 213000, China. Email: 895215152@qq.com.

Background: The technique of sampling is a particularly important aspect of sampling surveys, and the method of determining sample size is the key link in sampling technique. However, a sample size determination method for complex sampling surveys of sensitive issues using a randomized response model is not yet available.

Methods: In this work, we drew from the theory of probability and mathematical statistics, and developed a series of formulas for sample size determination that are required to estimate the overall proportion and mean on dichotomous sensitive questions in 3-stage sampling using Simmons models.

Results: Shortly after generating the formulas, we successfully applied them to a pre-survey regarding sensitive issues among female sex workers (FSWs) in Xichang, and found their practical application to achieve satisfactory effects. By using a large number of Monte Carlo simulation sampling tests, the reliability of the survey methods and formulas for sensitive question surveys were found to be high, but the randomized response technique (RRT) cannot completely eliminate the possibility of social acceptability bias.

Conclusions: RRT is an acceptable method for sensitive question surveys within sensitive populations, and this new sampling method is liable to be used for sensitive question surveys of large populations. We believe that the results obtained in the field of public health can provide some data support for clinical practice.

Keywords: Randomized response technique (RRT); sensitive question survey; female sex worker (FSW)


Submitted Oct 28, 2020. Accepted for publication Jan 02, 2021.

doi: 10.21037/apm-20-2262


Introduction

Surveillance data from 2004 to 2013 in China showed that the incidence of human immunodeficiency virus (HIV) and syphilis infection had the most rapid increase, with an annual increase in prevalence of 16.3% [95% confidence interval (CI), 11.5–21.2], and 16.3% (95% CI, 13.8–18.8) respectively (1). The increased incidence of sexually transmitted diseases (STDs) was attributed to a variety of reasons, including internal labor migration, unsafe sexual practices, and lack of consulting healthcare services (2-4). Female sex workers (FSWs) play a very important role in the spread of STDs, and have been viewed as key links of such diseases between groups of STD infections and the general population (5). There has been much investigation into FSWs, but the majority of them have used the traditional questionnaire inquiry method only (6,7). As commercial sexual activities are illegal in China, and Chinese women are conservative about sex, this demographic is highly private and has a strong sense of self-protection. It is difficult to obtain data from the respondents encompassing sensitive topics such as sex services. Effective policies and measures to resolve these increasing problems can only be created using the results of an authentic survey of the STD epidemic in China.

The basic thinking of the randomized response technique (RRT), as used in this study, is still based on the idea of the Warner model (8), but we incorporated a non-sensitive and unrelated issue, as opposed to a control issue as, to directly oppose the sensitive issue as in the Warner model, with the aim of further alleviating the privacy concerns of the respondents. In our model, the respondents believed that their answers would not expose their profession, so they could more freely cooperate with the investigators. Due to the absence of the investigators during the inquiry, the conductors did not know which question the respondents answered, only the respondents themselves knew this detail (9). This enabled the respondents have a sense of security and therefore the answers are more authentic.

In this study, we applied the above method to sensitive issues and complex sampling, as mentioned in another research article (10). Beyond this, through a preliminary survey on the features of sensitive issues for FSWs in Xichang, Sichuan Province, China, we obtained the required relevant statistical values in the calculated formulae to estimate the sample size. This would be helpful in informing policy makers when developing prevention strategies and for guiding further research.

We present the following article in accordance with the SURGE reporting checklist (available at http://dx.doi.org/10.21037/apm-20-2262).


Methods

Study population

We surveyed FSWs from May to July 2018 in Xichang, Sichuan Province, China. According to the sixth national census performed in 2010, there was a population of 278,023 in downtown Xichang. The male to female ratio there was 1.0562:1, and the female population was 135,212 at the time. According to the HIV and AIDS (acquired immune deficiency syndrome) Data Hub for Asia-Pacific (https://www.aidsdatahub.org/), FSWs account for 3.4–3.6% of urban women in the southwest cities of China such as Guiyang and Xingyi. Xichang is located in the hinterland of Liangshan, in the southwest of the Sichuan Province. The Liangshan Prefecture has been regarded as one of the hardest-hit areas for HIV in China. Based on that proportion, we can deduce that the number of FSWs in Xichang, the target population of this research, is 4,876.

Ethics statement

All participants provided their written informed consent to participate in this study anonymously. The data were also collected and analyzed anonymously. This research, including the consent procedure, was approved by the Ethics Committee of Soochow University. All procedures performed in this study involving human participants were in accordance with the Declaration of Helsinki (as revised in 2013). We confirm that all methods were performed in accordance with the relevant guidelines and regulations.

Public and patient involvement

The survey in this study was an application of the new sampling method, and was a cross-sectional study. We recruited the FSWs at active locations in 3-stage sampling. The places of activity of providing sexual services could be divided into high, middle, and low grade, and the places we selected included all these grades. Our investigators explained RRT to each respondent, ensuring that everyone completed the questionnaires independently after they had signed the informed consent form. The authors of this article trained the investigators in Xichang, and did not directly participate in the investigation themselves.

Survey methods

In general, sensitive issues are related to the privacy or interests of some institutions, organizations, individuals, or what most people are reluctant to state publicly, such as attitudes toward ethics, economic income, and life conduct (11). In accordance with the general characteristics of sensitive questions, they can be divided into sensitive questions of quantitative characteristics and sensitive questions of attribute characteristics (12), which can be further divided into those of 2 attribute characteristics and those of multiple attribute characteristics, according to the number of attribute characteristics. The 2 answers to sensitive questions of 2 attribute characteristics must be mutually independent and exclusive. For example, when the question is “Did you use a condom the last time when you had sex?”, the answers must be either “Yes” or “No”.

A randomized device was conducted as follows. Red and white balls, which were indistinguishable in size, weight, and texture, were placed into a bag. The proportion of red balls in the bag is referred to as P. Every participant was instructed to draw a ball randomly from the bag. If the participant drew a red ball, they would be asked to answer “Yes” or “No” to the sensitive question A: “Do you have the characteristics A?” If the participant drew a white ball, then they would be asked to answer “Yes” or “No” to the non-sensitive question B: “Do you have the characteristics B?” The respondents participated in this exercise in isolation and put the ball back to the bag between 2 respondents to preserve the ratio P.

Suppose that totality is composed of N1 primary units, and the ith primary unit is composed of Ni2 second-stage units (i=1, 2…N1). Then the jth second-stage unit of the ith primary unit is comprised of Nij3 third-stage units (i=1, 2…N1; j=1, 2…Ni2). The population consists of N tertiary units. In other words, each primary unit is made up of N1 second-stage units, and each second-stage unit is made up of N3 tertiary units.

In the first phase, n1 primary units were randomly selected. In the second phase, nij3 second-stage units were randomly drawn from the ith selected primary unit (i=1, 2…n1). In the third phase, nij3 third-stage units were randomly chosen from the jth drawn second-stage unit of the ith selected primary unit (i=1, 2…N1; j=1, 2…Ni2). That is to say, n2 secondary units were drawn from each selected primary unit, and n3 tertiary units were chosen from each drawn secondary unit of each selected primary unit. Every selected tertiary unit was investigated with the Simmons model for dichotomous sensitive questions.

In this study, we combined the Simmons Model for dichotomous sensitive questions with a 3-stage sampling survey method to estimate sample size in each stage. Our previous study (13) had derived the formulas for the optimal sample size of 3-stage sampling; on this basis, we used RRT to minimize sampling error under the condition that the cost of the investigation was limited, and to minimize the cost under the condition that the sampling error was limited. After that, we used SAS® (Statistical Analysis System, SAS Institute Inc., Cary, NC, USA) software to evaluate the survey methods and formulas by Monte Carlo simulation, and found that they had both high reliability and validity.

Formulae applied in the survey

The formulas (13) (See Appendix 1) applied in the survey were listed as follows:

p=N1Nn1i=1n1Ni2ni2j=1ni2Nij3pij[1]

V(p)=σ12n1(1n1N1)+σ22n1n¯2(1n¯2N¯2)+σ32n1n¯2n¯3(1n¯3N¯3)[2]

σ12=1N11i=1N1(πiπ)2[3]

σ22=1N1i=1N11Ni21j=1Ni2(πijπi)2[4]

σ32=1N1i=1N11Ni2j=1Ni2πij(1πij)[5]

s12=1n11i=1n1(pip)2[6]

s22=1n1i=1n11ni21j=1ni2(pijpi)2[7]

s32=1n1i=1n11ni2j=1ni2{pij(1πij)nij3+pijg(Y,A)+(1πij)g(N,A¯)nij3[g(Y,A)1][g(N,A¯)1]}g(Y,A)=P+(1P)Rij(1P)Rijg(N,A¯)=1(1P)Rij(1P)(1Rij)[8]

pij=mij/nij3(1P)RijP[9]

C=C0+C1n1+C2n1n¯2+C3n1n¯2n¯3[10]

V(p)=σ12N1+1n1(σ12σ22/N¯2)+1n1n¯2(σ22σ32/N¯3)+σ32n1n¯2n¯3[11]

n¯2=σ22σ32/N¯3σ12σ22/N¯2·C1C2[12]

n¯3=σ32σ22σ32/N¯3·C2C3[13]

n1=σ12σ22/N¯2+(σ22σ32/N¯3)/n¯2+σ32/n¯2n¯3V+σ12/N1[14]

n1=CC0C1+C2n¯2+C3n¯2n¯3[15]

ni2=Ni2·n¯2N¯2[16]

nij3=Nij3·n¯3N¯3[17]

Investigation target

In our survey, we asked the FSWs to answer some sensitive questions with 2 attribute characteristics, and the answers had to be mutually independent and exclusive. In order to shorten the experiments and improve the enthusiasm of the respondents simultaneously, we found the following 3 sensitive issues to be relatively useful for our goals: whether you possess a regular sex partner (RSP) or not? (Issue C01), whether you stop sexual services after suffering from venereal diseases or not? (Issue C02), whether you support the legalization of prostitution or not? (Issue C03) (See Appendix 2).

RRT model

The Simmons model for dichotomous sensitive questions was applied in the survey. The randomized device was a semi-closed bag, including 6 red balls (P=0.6) and 4 white balls (P=0.4). The balls were identical in size, weight, and texture. The respondents who retrieved the red ball answered the sensitive question. Otherwise, they answered the non-sensitive question. In the survey about the proportion of FSWs possessing an RSP, the sensitive question was: “Apart from your chargeable clients, do you have a spouse or other regular sexual partner?” The non-sensitive question was: “Is your date of birth an odd number?” Depending on the ball they picked out, the participants should answer “Yes” or “No” to the corresponding question in accordance with their actual situation. There were 3 independent sensitive issues in our survey, and we designed 3 sets of randomized devices accordingly, which were identical except that the sensitive issues were different.

We took some measures in order to better play the role of RRT. Our investigators had a deep understanding of the principles and practices of RRT. They explained RRT to each respondent, ensuring everyone realized that RRT was a scientific survey method and could cleverly protect their privacy. If necessary, we could allow the respondents to check the randomized device to dispel their misgivings completely. In our survey, we were cautious to choose the non-sensitive issues since they may involve general characteristics of individual. Our investigators were strangers to the FSWs, otherwise the unrelated question would not be able to achieve its proper protection. Moreover, the unrelated question we chose was very easy to answer for the respondents, and it did not require contemplation. Our survey was of an adequate sample size to ensure the authenticity of findings.

Three-stage sampling

Considering that the population of each administrative region in Xichang may not be the same, we divided Xichang into 10 districts of similar population, here referred to as first-stage units (N1=10). Places known to provide paid sexual services, such as nightclubs, saunas, karaoke bars, and bars, were taken as survey spots, here referred to as second-stage units (N2=12). A total of 4876 FSWs were taken as third-stage units (N3=41). According to the formulas for optimum sample size of 3-stage sampling derived by Jianfeng Wang, we could minimize the sampling error for a specified cost. In our 3-stage sampling, we randomly chose 5 districts from those 10 first stage districts (n1=5), and chose 6 second stage survey spots in each district randomly and equally (n2=6). At each survey spot, we randomly selected an average of 27 people in the third stage unit (n3=27). Finally, a total of 800 FSWs were recruited. So, relative to the total number, the sampling fractions for the first, second, and third stages were 50%, 25%, 16.4%, respectively.

We recruited the FSWs at sites of activity. The staff of the State Key Laboratory of Infectious Disease Prevention and Control in China also used this approach to recruit FSWs and conducted 3 cross-sectional surveys of HIV infection and behavior from 2010 to 2015 (14). The places activity for providing sexual services could be divided into high, middle, and low grade, and the places we selected included all the grade divisions. Therefore, the population we selected was a very good representative sample; however, we cannot rule out that very few FSWs provide commercial services in the name of an individual.

Monte Carlo simulation

In line with the answers to the dichotomous sensitive issues among FSWs in the actual survey in Xichang, the sample proportions and their standard deviations were taken as simulative summary measures, and we then set up an analog population through SAS® (Statistical Analysis System, SAS Institute Inc., Cary, NC, USA) (See Appendix 3).

In this work, for the investigation method on dichotomous sensitive questions in 3-stage sampling under Simmons models, we used SAS® to simulate 100 pre-survey samples, each of which equally contained 2,000 simulated respondents. According to the calculation formulas of optimal sample size deduced in this paper, we could calculate the sample sizes at each stage required for the 100 sampling surveys respectively (See https://cdn.amegroups.cn/static/application/239522e014618b88688e70af39d65930/apm-20-2262-1.pdf). After that, we conducted 100 formal sampling surveys on the simulated population, and obtained 100-point estimates and their 95% CIs of the population proportion (See https://cdn.amegroups.cn/static/application/db099841f298c0defc3b617c569c01fd/apm-20-2262-2.pdf). If almost all of the 100 95% CIs contained the overall proportion of the simulated population and almost all of the 100 sample proportions were close to the same value, the survey method would be considered to have good reliability and validity.

Statistical analysis

All participants provided written informed consent. The questionnaires were completed by the respondents independently, and all questionnaires were checked carefully. The response rate was 100%, and all returned questionnaires qualified for inclusion. The collected data were checked twice and entered into a database with EpiData (version 3.1, EpiData Association, Odense, Denmark). The software SAS® (version 9.3, SAS Institute Inc., Cary, NC, USA) was used to analyze the data.


Results

Survey and quantitative results

Taking Issue C01 as an example, in investigation site 1 in district A, 23 FSWs were randomly selected (nij3=n113=23), of which 12 answered “yes” (mij=m11=12) (See https://cdn.amegroups.cn/static/application/3144ad73360446f10a7ad469ad08e447/apm-20-2262-3.pdf). Based on λ^ij=mij/nij3, the ratio of answering “yes” was λ^11=m11/n113=12/23. The true proportion of respondents having a date of birth being an odd number was 0.5 (Rij=R11=0.5), and the proportion of respondents being asked to answer the sensitive question was 0.6 (P=0.6).

According to Eq. [9], p11 (i=1; j=1), the sample proportion of FSWs possessing an RSP in site 1 in district A was:

p11=λ^11(1P)R11P=m11/n113(1P)R11P     =12/23(10.6)×0.50.6=0.5362[18]

The proportion of FSWs possessing RSP in other investigation sites in district A or in other districts could be determined in the same way (Table 1).

Table 1
Table 1 The proportion of FSWs possessing an RSP in each site of each district in Xichang using Simmons model for qualitative characteristic questions under three-stage sampling
Full table

Analytical results of sensitive issues

According to Eq. [1], the estimate of the overall rate of FSWs possessing an RSP in Xichang, here referred to as p, could be determined as follows:

p=104876×5i=15Ni2ni2j=1ni2Nij3pij0.5594[19]

Concurrently, based on Eqs. [6], [7], and [8], the estimate based on sample of σ21, σ22, and σ23, here referred to as S21, S22, and S23, respectively, could be determined:

s12=151i=15(pip)20.0098[20]

s22=15i=151ni21j=1ni2(pijpi)20.0251[21]

s32=1n1i=1n11ni2j=1ni2{pij(1πij)nij3+pijg(Y,A)+(1πij)g(N,A¯)nij3[g(Y,A)1][g(N,A¯)1]}   0.0251[22]

Then, through Eq. [2], we could obtain the sample variance of p:

v(p)=s12n1(1n1N1)+s22n1n¯2(1n¯2N¯2)+s32n1n¯2n¯3(1n¯3N¯3)0.0014[23]

From this, we could obtain the 95% CI of the proportion of FSWs possessing an RSP in Xichang:

p±1.96×v(p)0.48580.6330[24]

By the same token (See https://cdn.amegroups.cn/static/application/341debe83b0f0575fc0b83c88a70a335/apm-20-2262-4.pdf, https://cdn.amegroups.cn/static/application/cc61a978cda5cefbaf21f1b175ebcd7c/apm-20-2262-5.pdf), the analytical results and their 95% CIs of 3 sensitive issues, could be determined as shown in Table 2.

Table 2
Table 2 Analytical results of dichotomous sensitive questions
Full table

The estimation of sample size

As mentioned above, we calculated the variances of the sample proportion of FSWs possessing an RSP in Xichang: S21=0.0098, S22=0.0251, S23=0.0251. According to Eq. [12], we could then calculate the quantity of survey spots which should be selected randomly from each chosen district (σi2si2):

n¯2=s22s32/N¯3s12s22/N¯2·C1C26[25]

According to Eq. [13], we could compute the number of FSWs to be recruited equally and randomly from every selected spot (σi2si2):

n¯3=s32s22s32/N¯3·C2C321[26]

In our survey, under the circumstance of minimizing the cost for the fixed sampling error of V=0.0015, with the help of Eq. [14], the required minimum number of districts in Xichang to be investigated could be determined (σi2si2):

n1=s12s22/N¯2+(s22s32/N¯3)/n¯2+s32/n¯2n¯3V+s12/N16[27]

According to the budget, the basic outlay of our entire survey was C0=100,000 yuan, including cooperation fees for institutions of the project, travel expenses, training expenses of staff, and so on. The average cost of investigating each district was C1=100,000 yuan, including cooperation costs for agencies of the project, car fares, communication expenses, and so on. The average cost of looking into each place of activity was C2=10,000 yuan, including cooperation costs for places providing paid sexual services, rewards to female volunteers for intervention in HIV, and so on. The direct cost of interviewing each FSW on average was C3=25 yuan, which was paid as a reward to the FSW for undergoing interview.

In our survey, under the circumstance of minimizing sampling error for the fixed cost of C=100000 yuan, with the help of Eq. [15], the required minimum number of districts in Xichang to be investigated could be determined (σi2si2):

n1=CC0C1+C2n¯2+C3n¯2n¯36[28]

Then, we could estimate the required sample size for each dichotomous sensitive question as listed below (Table 3).

Table 3
Table 3 Estimation of required sample size for each dichotomous sensitive question
Full table

After deducing n2, according to Eq. [16], the number of survey spots randomly chosen from the ith drawn district could be determined. For example, if there were a total of 12 survey spots in a certain district, the quantity of spots randomly selected from this district should have been:

ni2=Ni2·n¯2N¯26[29]

After working out n3, according to Eq. [17], the number of FSWs randomly selected from jth drawn site of the ith chosen district could be calculated. For instance, if there were 36 FSWs in a certain site of a particular district, the number of FSWs randomly selected from there should have been:

nij3=Nij3·n¯3N¯319[30]

Determination of the final sample size

When minimizing the sampling error for a fixed survey cost, we needed to randomly select 24 FSWs from each of the 7 sites in each of the 6 districts in each stage.

Evaluation of reliability and validity

Regarding the dichotomous sensitive issue C01, “Whether there is a spouse or any regular sexual partner other than those paying for sexual services?”, the overall proportion calculated from the simulated population was 0.5594. The Simmons model was used to carry out analogue investigations on the dichotomous sensitive issue for all the respondents of 100 simulated samples respectively, and the results are shown in Table 4.

Table 4
Table 4 The result of computer simulation regarding the issue C01
Full table

According to Table 4, we can see 96 (except the sample no’s 30, 37, 55, 76) of the 100 95% CIs contain the overall proportion of the simulated population, that is 0.5594. Meanwhile, 96 of the 100 sample proportions are close to 0.5594. These 2 results indicate that the survey method and the calculation formulas both had good reliability and good validity.


Discussion

Culturally sensitive issues appear to be particularly prominent in the field of public health, and have a large negative impact on economic development and social stability. It is difficult for investigators to draw a sample directly due to the wide variance and enormous numbers of the sample, and the dispersed distribution of the hidden population. Multi-stage sampling is a scientific and wise choice to achieve the goals of a survey and improve sampling efficiency and representativeness. Multi-stage sampling refers to a sampling process that is carried out in stages, and the sampling method used in each stage is usually different. It has many characteristics of organizing the participants easily and sampling flexibility. By contrast, proportional sampling does not take the sample variability into account, and only uses a uniform proportion for sampling. It is absolutely simple and convenient, but could lead to a low representative level of the sample. The extraction of the basic unit to be investigated is not a step in place, furthermore, sampling frames for the second or lower order unit at the time of sampling are not required.

Over the past 40 years, RRT has been widely applied to different domains, such as tax evasion, drug abuse, academic deception and so on. Bias caused by social expectations could be reduced by RRT. Furthermore, when compared with traditional investigation methods, such as self-administered questionnaires (SAQ), face-to-face interviews, telephone surveys, and computer-assisted interviews, RRT has significant advantages in terms of accuracy, reliability, and parameter estimation (15). The existence of social acceptability bias can’t be ruled out when using RRT, and we cannot guarantee that all FSWs were inclined to admit behaviors that may have a negative impact on them. A slight increase in variance may be due to RRT; however, if other investigation methods were used, this would have resulted in an increase in the errors that could not be estimated and a decrease in the credibility of the results. Therefore, with the aims of obtaining relatively reliable data and better understanding behavioral characteristics, we used RRT to desensitize FSWs to the sensitive questions.

The proportion asked the sensitive question or the level of confidentiality could affect the efficiency of the Simmons model. In this paper, we have already mentioned the proportion of red balls (P=0.6) in the randomized devices, which has been proven to maximize the efficiency of the Simmons model (16). We cannot provide the accurate proportion of those who were asked the sensitive questions during the practical investigation, on account of the investigators having been absent at that time, but the formulas we had derived took into account the variance produced by using RRT, which made the result more reliable.

Previous studies on sampling techniques of culturally sensitive issues have been confined to improving the model in simple random sampling. This application is only useful in special populations or at small scales, although studies on sensitive issues have been conducted extensively in China and abroad. Few studies have been conducted regarding RRT under complex sampling methods, such as multi-stage sampling and stratified multi-stage sampling (10). In addition, the estimation of sample size under complex sampling is rarely considered. Therefore, it is important to estimate the sample size for RRT model under multi-stage sampling and to evaluate its reliability and validity.

In this paper, we presented an investigative method for culturally sensitive issues using the Simmons model under 3-stage sampling. And have provided formulae concerning the optimal sample size in 2 conditions of satisfying minimum sampling error and lowest investigation costs. The application in this study was the pre-survey of the Chinese National Natural Science Fund project conducted by our team in 2019–2020. Based on the formulas for sample size derived from this study, we can scientifically estimate the sample size required for each sampling stage. The prevalence of STDs among FSWs in Xichang was obtained preliminarily, which provided evidence for the study of behavioral characteristics of groups at high risk of STDs in Xichang. Effective methods and scientific statistical formulas are vital for successful investigation of sensitive issues, but sample size must be estimated before a sampling survey begins.

Reliability and validity are 2 important indicators for evaluating a method measurement. A high-quality measuring method should have both high reliability and validity (17). Few evaluations concerning the reliability and validity of investigation methods for culturally sensitive issues have been conducted. In the last decade, our team has carried out much research on the methods for investigating various sensitive issues, which has combined various random response models with different sampling methods, such as cluster sampling, stratified cluster sampling, stratified random sampling, 2-stage sampling, stratified 2-stage sampling, and stratified 3-stage sampling methods (18-23). Formulas for the overall proportion, the population mean, and its variance were deduced. The performances of the sampling methods and formulas in the survey of premarital sex and cheating on exams at Soochow University, as well as sensitive issues among men who have sex with men (MSM), were also provided. The reliability of the survey methods and formulas for sensitive questions was found to be high.

This survey showed that 55.94% of the FSWs have a regular sexual partner apart from clients, and 75.85% stop providing sexual services after contracting venereal diseases. Their working situations easily lead to their infection with STDs. We also feel that the local FSWs lacked traditional workplace skills and relied on the provision of sexual services as their primary source of income. The local government needs to commit to assisting the re-employment of the FSWs, including the establishment and improvement of the training of laid-off female workers and attracting investment to increase local employment opportunities. Of the FSWs interviewed, 56.77% support the legalization of prostitution. Legalization might protect the rights and interests of FSWs and reduce crime, which has already become the case in some developed countries. In China, it is difficult to implement legalization of prostitution, but the local relevant departments can strengthen visibility to enhance the awareness and self-protection capabilities of FSWs.


Conclusions

In this paper, we introduced a new sampling method to draw a random sample that is representative for such a hidden population as FSWs who are illegal and stigmatized in China. We derived the formulae to estimate the optimal sample size in 2 conditions of satisfying minimum sampling error and lowest investigation costs. Moreover, the sample sizes required for each stage in the survey were ascertained, which provided critical data for the technique of sampling. We applied the method to the pre-survey of FSWs in Xichang and obtained important data concerning characteristics of sexual behaviour, which could be helpful in informing policy makers in developing prevention strategies and guiding further research. In this study, a large number of Monte Carlo simulation sampling tests were used to validate reliability and validity, which were found to be high. This new sampling method is liable to be used for sensitive question surveys of large populations.


Acknowledgments

We are most grateful to the female sex workers in Xichang who made it possible for us to perform this study. We extend our thanks to the Chinese Center for Disease Control and Prevention and Xichang Dermatology and Venereal Disease Prevention and Control Station for working with us. Finally, Kejin Chen wants to thank, in particular, the invaluable support received by Qihua Chen and Yang Cao over the years.

Funding: This work was supported by the National Natural Science Foundation of China (81273188), China Association for the Prevention and Control of STDs and AIDS· AbbVie Fund for Maternal and Child Care and Prevention of Mother-to-Child Transmission (Project No.PMTCT201904), Changzhou Health Young Cultivation Project (CZQM2020101).


Footnote

Reporting Checklist: The authors have completed the SURGE reporting checklist. Available at http://dx.doi.org/10.21037/apm-20-2262

Data Sharing Statement: Available at http://dx.doi.org/10.21037/apm-20-2262

Conflicts of Interest: All authors have completed the ICMJE uniform disclosure form (available at http://dx.doi.org/10.21037/apm-20-2262). The authors have no conflicts of interest to declare.

Ethical Statement: The authors are accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All participants provided their written informed consent to participate in this study anonymously. The data were also collected and analyzed anonymously. This research, including the consent procedure, was approved by the Ethics Committee of Soochow University (No. 2017084). All procedures performed in this study involving human participants were in accordance with the Declaration of Helsinki (as revised in 2013).

Open Access Statement: This is an Open Access article distributed in accordance with the Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License (CC BY-NC-ND 4.0), which permits the non-commercial replication and distribution of the article with the strict proviso that no changes or edits are made and the original work is properly cited (including links to both the formal publication through the relevant DOI and the license). See: https://creativecommons.org/licenses/by-nc-nd/4.0/.


References

  1. Yang S, Wu J, Ding C, et al. Epidemiological features of and changes in incidence of infectious diseases in China in the first decade after the SARS outbreak: an observational trend study. Lancet Infect Dis 2017;17:716-25. [Crossref] [PubMed]
  2. Lin CC, Gao X, Chen XS, et al. China's syphilis epidemic: a systematic review of seroprevalence studies. Sex Transm Dis 2006;33:726-36. [Crossref] [PubMed]
  3. Xu JJ, Yu YQ, Hu QH, et al. Treatment-seeking behaviour and barriers to service access for sexually transmitted diseases among men who have sex with men in China: a multicentre cross-sectional survey. Infect Dis Poverty 2017;6:15. [Crossref] [PubMed]
  4. Zou X, Chow EP, Zhao P, et al. Rural-to-urban migrants are at high risk of sexually transmitted and viral hepatitis infections in China: a systematic review and meta-analysis. BMC Infect Dis 2014;14:490. [Crossref] [PubMed]
  5. Shah NS, Shiraishi RW, Subhachaturas W, et al. Bridging populations-sexual risk behaviors and HIV prevalence in clients and partners of female sex workers, Bangkok, Thailand 2007. J Urban Health 2011;88:533-44. [Crossref] [PubMed]
  6. Jeal N, Patel R, Redmond NM, et al. Drug use in street sex workers (DUSSK) study protocol: a feasibility and acceptability study of a complex intervention to reduce illicit drug use in drug-dependent female street sex workers. BMJ Open 2018;8:e022728. [Crossref] [PubMed]
  7. Prangnell A, Shannon K, Nosova E, et al. Workplace violence among female sex workers who use drugs in Vancouver, Canada: does client-targeted policing increase safety? J Public Health Policy 2018;39:86-99. [Crossref] [PubMed]
  8. Warner HR, Cox A. A mathematical model of heart rate control by sympathetic and vagus efferent information. J Appl Physiol 1962;17:349-55. [Crossref] [PubMed]
  9. Arnab R, Singh S. Randomized response techniques: An application to the Botswana AIDS impact survey. Journal of Statistical Planning and Inference 2010;140:941-53. [Crossref]
  10. Pu X, Gao G, Fan Y, et al. Parameter Estimation in Stratified Cluster Sampling under Randomized Response Models for Sensitive Question Survey. PLoS One 2016;11:e0148267. [Crossref] [PubMed]
  11. Moshagen M, Musch J, Ostapczuk M, et al. Reducing socially desirable responses in epidemiologic surveys: an extension of the randomized-response technique. Epidemiology 2010;21:379-82. [Crossref] [PubMed]
  12. Chaudhuri A, Dihidar K. Estimating Means of Stigmatizing Qualitative and Quantitative Variables from Discretionary Responses Randomized or Direct. The Indian Journal of Statistics 2009;71:123-36.
  13. Wang J, Ge G, Fan Y, et al. The estimation of sample size in multi-stage sampling and its application in medical survey. Applied Mathematics and Computation 2006;178:239-49. [Crossref]
  14. Xiang Z, Xiaoyi Y, Wanli Z. Investigation of HIV infection and related behaviors among female sex workers in Xichang, 2010-2015. China Tropical Medicine 2017;17:569-72.
  15. Lensvelt-Mulders GJLM, Hox JJ, Heijden PGMVD, et al. Meta-analysis of randomized response research thirty-five years of validation. Sociological Methods & Research 2004;33:319-48. [Crossref]
  16. Giordano S, Perri PF. Efficiency comparison of unrelated question models based on same privacy protection degree. Statistical Papers 2011;53:987-99. [Crossref]
  17. Dare OO, Cleland JG. Reliability and validity of survey data on sexual behaviour. Health Transit Rev 1994;4 Suppl:93-110. [PubMed]
  18. Bo Y, Ge G, Zhilong H. The Survey among MSM Population in Beijing Using Two-Stage Sampling Method on RRT Model. Chinese Journal of Health Statistics 2012;29:351-3.
  19. Chen X, Du Q, Jin Z, et al. The Randomized Response Technique Application in the Survey of Homosexual Commercial Sex among Men in Beijing. Iran J Public Health 2014;43:416-22. [PubMed]
  20. Lei W, Ge G, Mingrun Y. Stratified Two-stage Cluster Sampling and Its Application on the Greenberg Model for Sensitive Question Survey. Chinese Journal of Health Statistics 2011;28:37-8, 46.
  21. Mian W, Ge G. Cluster Sampling and its Application on Quantitative Sensitive Questions. Chinese Journal of Health Statistics 2008;25:586-9.
  22. Yunhua Z, Ge G, Xiangke P. The Method to Estimate the Sample Size for Quantitative Sensitive Questions in Stratified Two-Stage Sampling and its Application. Chinese Journal of Health Statistics 2014;31:45-8.
  23. Zhilong H, Ge G, Mian W. Statistical Methods of Two-stage Sampling on the Randomized Response Technique for Sensitive Question Survey with Multiple Choices and its Application. Chinese Journal of Health Statistics 2009;26:580-2.

(English Language Editor: J. Jones)

Cite this article as: Chen K, Mao Z, Jin Z, Gao G, Bian C. Application of a randomized response technique for the sensitive question survey in female sex workers in Xichang, China. Ann Palliat Med 2021;10(1):392-403. doi: 10.21037/apm-20-2262

Download Citation