"

1. Plan to Conduct Self-Administered Surveys

1.4 Sampling methods

If you decide to survey a sample and use inferential statistics to generalize findings to the entire population, you need to select an appropriate sampling method. Researchers have developed several sampling strategies to ensure that the sample accurately represents the population (Fraenkel & Wallen, 1990[1]). For example, let’s return to the earlier scenario: You want to select a sample of 250 employees from a population of 3,000 employees. The method you choose for selecting those 250 employees will influence the quality and generalizability of your survey results.

1.4.1     Simple random sampling

With simple random sampling, you randomly select individuals from the entire population. This method increases the change that the sample characteristics are similar to those of its population. For example, a group of 250 employees selected from a population of 3,000 employees would likely reflect the overall demographics, experiences, or opinions of the entire employee(Figure 5).

Figure 5 Illustrating Simple Random Sampling

A diagram showing the process of drawing a sample from a population, represented by a large cluster of silhouette individuals (population) and an arrow pointing to a much smaller grouping of silhouette individuals (sample). Individuals in the population are randomly selected to form a sample.

To carry out simple random sampling using Microsoft Excel, assign a unique ID number from 1 to 3,000 to all 3,000 employees. Next, use the =RANDBETWEEN(1,3000) function to generate 250 random numbers. Then, select 250 employees whose ID numbers match the randomly generated numbers.

Alternatively, you may use Excel’s Sampling function from the Data Analysis Toolpak. First, you may need to activate the Data Analysis menu in Excel. In Excel,

  1. Click the File tab, and click Options.
  2. Click Add-Ins, and then in the Manage box, select Excel Add-ins.
  3. Click Go.
  4. In the Add-Ins available box, select the Analysis ToolPak checkbox, and click OK.
  5. After successfully loading the Analysis ToolPak, the Data Analysis tool appears in the Analysis group on the Data tab (Figure 6).

Figure 6 Data Analysis Under the Data Tab

A screenshot of Excel menubar with a red arrow pointing at the Data Analysis tool

 

To randomly select 250 employee IDs from 3,000 IDs,

  1. Open an Excel file that contains 3,000 Employee IDs in a column. You may use Ch1.4-SamplingData.xlsx for practice (Figure 7).

Figure 7 An Excel File Containing Population Members’ IDs

A screenshot of Excel file containing population members' IDs, listed in Column A and rows 1 - 13. A population of N = 3,000 is stated at the bottom of the screenshot.

  1. Select Data > Data Analysis from the menu.
  2. Select Sampling and click OK (Figure 8).

Figure 8 Selecting Sampling in the Data Analysis Window

Excel's Data Analysis window with menu options including Descriptive Statistics, Exponential Smoothing, F-Test Two-Sample for Variances, Fourier Analysis, Histogram, Moving Average, Random Number Generation, Rank and Percentile, Regression, and Sampling. Sampling is selected.

  1. In the Input Range field, enter $A$1:$A$3000 (or select from A1 to A3000). Click Random, enter 250 in the Number of Samples field, and click OK (Figure 9).

Figure 9 Entering Input in the Sampling Window

Excel's Sampling window where users enter their input. Options are organized by Input (users can provide an Input Range and a checkbox allows them to select labels), Sampling Method (users can choose between Periodic and provide a period, or Random and provide the number of samples), and Output Options (users can choose between an output range and provide this information, a New Worksheet Ply and provide this information, or a New Workbook. Selections made in this screenshot include an Input Range of SAS1: SA$3000 in Sampling, Random sampling with 250 number of samples, and New Worksheet Ply.

  1. In a new worksheet, a randomly selected 250 IDs will be presented. You may sort the numbers in the column from smallest to largest.

1.4.2     Stratified random sampling

Use stratified random sampling when you want your sample to reflect the same proportions of specific subgroups (e.g., gender, department, branch) as in the overall population. For example, suppose your population of 3,000 employees includes 40% of employees who identify as men, 56% as women, and 4% as nonbinary or another gender identity. To maintain that same proportion in a sample of 250, you would randomly select 100 employees (40% of 250) who identify as men, 140 employees (56% of 250) who identify as women, and 10 employees (4% of 250) who identify as nonbinary or another gender identity (Figure 10).

Figure 10 Illustrating Stratified Random Sampling

A diagram illustrating stratified random sampling. A large pie chart with 40% man, 56% woman, and 4% binary or others represents a population, and a smaller pie chart with the same proportion represents a sample.

1.4.3     Cluster random sampling

Let’s say, your organization has 3,000 employees distributed across 60 branches, with each branch considered a cluster. On average, each branch (cluster) has approximately 50 employees. Instead of randomly selecting individuals from the entire 3,000 employee pool, you can randomly select five branches (clusters) as illustrated in Figure 11 (i.e., 5 random branches x approx. 50 employees at each branch = approx. 250 employees). However, this method may not work well for selecting a sample of 250 employees if the number of employees per branch varies significantly. In that case, you can first perform cluster random sampling to select branches, and then apply simple random sampling or stratified random sampling within each selected cluster to ensure a more accurate and representative sample.

Figure 11 Illustrating Cluster Random Sampling

Cluster sampling visually represented by a large population illustrated as a collection of clusters with an arrow pointing from this collection to a small number of clusters, randomly selected as a sample.

1.4.4     Systematic sampling

With systematic sampling, you select every nth individual from a list—in this case, every 12th employee from a coded list of 3,000 employees. To do this in Excel:

  1. Assign a unique ID to each employee in Column A, from 1 to 3,000.
  2. In cell C1, enter the following formula: =OFFSET($A$1,(ROW(A1)-1)*12,0)
  3. Copy this formula down for 250 rows in Column C.

This will return the ID of every 12th employee from the list, giving you a systematic sample of 250 employees (Figure 12).

Figure 12 Selecting Every 12th ID in Excel

Excel spreadsheet demonstrating a systematic sampling method by selecting every 12th ID in the data. Across the top of the screenshot, cell C1 is selected and a formula =OFFSET(SAS1,(ROW(A1)-1)*12,0) identified. Columns A-G and rows 1-9 are visible, with Columns A and C listing values. Column A values include 1, 2, 3, 4, 5, 6, 7, 8, 9. Column C values include 1, 13, 25, 37, 49, 61, 73, 85, and 97. Cell C1 is highlighted.

1.4.5     Convenience sampling

With convenience sampling, you select participants who are easy to access or willing to participate. For example, you might survey 250 employees from a couple of departments where you have strong working relationships and support for your survey study. This is the least rigorous sampling method because the sample may be biased and not representative of the large population. As a result, findings based on convenience samples should be interpreted with caution, especially when generalizing to the full population.

To determine a sample size based on the population size, confidence level, and margin of error, you may use a sample size calculator freely available on the web such as: SurveyMonkey’s sample size calculator.


  1. Fraenkel, J. R., & Wallen, N. E. (1990). How to design and evaluate research in education. McGraw-Hill.

License

Icon for the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License

Survey Design and Data Analysis Copyright © 2025 by Seung Youn (Yonnie) Chyung, Ed.D. is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, except where otherwise noted.