BioStart Logo

Combinatorics: Learning Objectives 1, 2, 3, 4

Question 1: Give the formula for each of the following, using variables $n$ and $r$:
  1. The number of ways to select $r$ items from $n$ total, where order does not matter and items are not replaced after being drawn.
  2. The number of ways to select $r$ items from $n$ total, where order does matter and items are not replaced after being drawn.
  3. The number of ways to select $r$ items from $n$ total, where order does matter and items are replaced after being drawn.

Part (a) describes a combination. Part (b) describes a permutation. For part (c), consider how many options there are for each of the r choices.

  1. $ \binom{n}{r} = \frac{n!}{r!(n-r)!} $
  2. $ P(n,r) = \frac{n!}{(n-r)!} $
  3. $ n^r $

  1. This scenario describes a combination, as the order of selection does not matter. The formula is "n choose r".
  2. This scenario describes a permutation, as the order of selection is important.
  3. In this scenario, for each of the $r$ choices, there are $n$ possibilities because the items are replaced. This is an application of the multiplication principle.

Question 2: For each of the following scenarios, determine whether a permutation or a combination would be used to calculate the total number of possible outcomes.
  1. A committee of 10 members must select a president and a vice president.
  2. Three fruits are randomly chosen from a basket containing five different fruits.
  3. The set of answers for a multiple choice-test consisting of 15 questions, each with five possible answer choices.
  4. There are 40 applicants competing for 3 available summer internship positions in biostatistics.

The key question is: "Does the order of selection matter?" If selecting person A then B is a different outcome from selecting B then A, it's a permutation. If it's the same outcome, it's a combination.

  1. Permutation
  2. Combination
  3. Permutation (with replacement)
  4. Combination

  1. Permutation. The roles of president and vice president are distinct. Choosing Member A as president and Member B as vice president is a different outcome from choosing B as president and A as vice president. Order matters.
  2. Combination. The group of three fruits is the same regardless of the order in which they were picked. Choosing an apple, banana, and cherry is the same outcome as choosing a cherry, banana, and apple. Order does not matter.
  3. Permutation (with replacement). The sequence of answers creates a unique submission. Answering A for question 1 and B for question 2 is different from answering B for question 1 and A for question 2. Order matters.
  4. Combination. The three internship positions are identical. One group of three selected interns (Alice, Bob, Carol) is identical to (Carol, Alice, Bob). Order does not matter.

Question 3: Determine the number of distinct permutations of the letters for each of the following words.
  1. "great"
  2. "greet"
  3. "statistics"

Start with the factorial of the total number of letters. If there are repeated letters, you must divide by the factorial of the count of each repeated letter to eliminate overcounting.

  1. $5! = 120$
  2. $\frac{5!}{2!} = 60$
  3. $\frac{10!}{3!3!2!} = 50,400$

  1. "great": There are 5 distinct letters. The number of permutations is $5! = 5 \times 4 \times 3 \times 2 \times 1 = 120$.
  2. "greet": There are 5 letters in total, but the letter 'e' is repeated twice. To find the number of distinct permutations, we calculate $\frac{5!}{2!} = \frac{120}{2} = 60$.
  3. "statistics": There are 10 letters in total. The letter 's' is repeated 3 times, 't' is repeated 3 times, and 'i' is repeated 2 times. The number of distinct permutations is $\frac{10!}{3!3!2!} = \frac{3,628,800}{(6)(6)(2)} = \frac{3,628,800}{72} = 50,400$.

Question 4: How many distinct permutations of the word "statistics" begin and end with the letter "s"?

Fix an 's' at the beginning and an 's' at the end. This leaves 8 letters in the middle to be arranged. Count the repetitions among these 8 middle letters and use the formula for permutations with repeated items.

$\frac{8!}{3!2!} = 3,360$

The word is "statistics". We fix two of the 's' letters at the beginning and end: s _ _ _ _ _ _ _ _ s.
This leaves 8 letters to arrange in the middle. The remaining letters are: t, a, t, i, s, t, i, c.
In this middle section of 8 letters, we have repetitions:
- 't' appears 3 times
- 'i' appears 2 times
- 's', 'a', 'c' appear 1 time each.
The number of distinct permutations for these 8 middle letters is calculated as:
$\frac{8!}{3!2!} = \frac{40,320}{(6)(2)} = \frac{40,320}{12} = 3,360$.

Question 5: A public health department must appoint two members to an advisory board on infectious disease policy. Among the seven candidates are three epidemiologists and four biostatisticians.
  1. In how many ways can any two candidates be selected?
  2. In how many ways can two biostatisticians be selected?
  3. In how many ways can one epidemiologist and one biostatistician be selected?

The positions on the advisory board are not distinct, so order does not matter. This means you should use combinations. For part (c), you will need to calculate the number of ways to choose from each group separately and then multiply the results together.

  1. 21
  2. 6
  3. 12

  1. We are choosing a group of 2 people from a total of 7 candidates. Since the order of selection doesn't matter, we use a combination: $\binom{7}{2} = \frac{7!}{2!(7-2)!} = \frac{7!}{2!5!} = \frac{5040}{(2)(120)} = \frac{5040}{240} = 21$.
  2. We are choosing 2 people from the 4 available biostatisticians. Again, order doesn't matter: $\binom{4}{2} = \frac{4!}{2!(4-2)!} = \frac{4!}{2!2!} = \frac{24}{(2)(2)} = 6$.
  3. We must choose 1 epidemiologist from 3 AND 1 biostatistician from 4. We calculate the combinations for each and multiply them: $\binom{3}{1} \times \binom{4}{1} = \frac{3!}{1!2!} \times \frac{4!}{1!3!} = 3 \times 4 = 12$.

Question 6: There are 420 ways that one A, three B's, two C's, and one F can be distributed among seven students. Show how this number can be obtained using:
  1. Permutations
  2. Combinations

For Permutations: Think of this as arranging the 7 grades (A, B, B, B, C, C, F) in a line. This is a permutation with repeated items.
For Combinations: Think of this as a sequence of choices. First, choose which of the 7 students gets the A. Then, from the remaining students, choose which ones get the B's, and so on.

  1. Permutation with repetitions: $\frac{7!}{1!3!2!1!} = 420$
  2. Sequential combinations: $\binom{7}{1}\binom{6}{3}\binom{3}{2}\binom{1}{1} = 420$

  1. Using Permutations: We are arranging 7 items (the grades). However, some grades are identical. This is a permutation with repetitions. The formula is $\frac{n!}{n_1!n_2!...n_k!}$, where $n$ is the total number of items, and $n_1, n_2, ...$ are the counts of each repeated item.
    Here, $n=7$, $n_A=1$, $n_B=3$, $n_C=2$, $n_F=1$.
    $\frac{7!}{1!3!2!1!} = \frac{5040}{1 \cdot 6 \cdot 2 \cdot 1} = \frac{5040}{12} = 420$.
  2. Using Combinations: We can think of this as a sequence of tasks.
    1. Choose 1 student out of 7 to receive the A: $\binom{7}{1} = 7$ ways.
    2. Choose 3 students out of the remaining 6 to receive B's: $\binom{6}{3} = \frac{6!}{3!3!} = 20$ ways.
    3. Choose 2 students out of the remaining 3 to receive C's: $\binom{3}{2} = 3$ ways.
    4. Choose 1 student out of the remaining 1 to receive the F: $\binom{1}{1} = 1$ way.
    By the multiplication principle, the total number of ways is $7 \times 20 \times 3 \times 1 = 420$.

Question 7: A public health department is monitoring the spread of a contagious disease in a small community of 20 people. In truth, 3 of the 20 community members have the disease. Suppose the public health department randomly selects 5 individuals from the community for a study sample. Let $S$ be the sample space consisting of all possible study samples of 5 individuals that can be selected from the 20-person community.
  1. How many different ways can the public health department select a sample of 5 individuals from the 20-person community? That is, how many elements are in the sample space $S$?
  2. Explain why a combination is used to compute this quantity and not a permutation.

For part (a), you are choosing a group of 5 from 20. Does the order in which they are chosen matter for the final sample? This will tell you whether to use a combination or permutation.

  1. $\binom{20}{5} = 15,504$
  2. A combination is used because the sample is a group of individuals, and the order in which they are selected does not change the composition of the sample.

  1. We are selecting a group of 5 people from a total of 20. Since the order of selection does not matter for the final sample, we use a combination:
    $\binom{20}{5} = \frac{20!}{5!(20-5)!} = \frac{20!}{5!15!} = \frac{20 \times 19 \times 18 \times 17 \times 16}{5 \times 4 \times 3 \times 2 \times 1} = 15,504$.
    There are 15,504 possible samples.
  2. A combination is used because a "study sample" refers to the final group of individuals, not the sequence in which they were picked. A sample containing persons {A, B, C, D, E} is identical to a sample containing {E, D, C, B, A}. Since the order does not create a different outcome, combinations are the appropriate method.

Question 8: An event $A$ is a subset of elements in $S$ that satisfy a specific condition. The probability of the event $A$, denoted $P(A)$, when all outcomes in S are equally likely, is defined as: $$P(A)=\frac{\text{Number of outcomes in A}}{\text{Total number of outcomes in S}}$$ We will use this definition to obtain the probability of events for the 5-person study sample described in the previous question.
  1. How many ways can the 5-person sample contain one diseased individual?
  2. Let $A$ be the event that the random sample contains exactly one diseased individual. Using the number of elements in the sample space $S$ and the number of events that satisfy event $A$, compute $P(A)$.
  3. Let $B$ be the event that the random sample does not contain a diseased individual. Compute $P(B)$.
  4. Suppose the event of interest is the random sample containing zero or one diseased individual. Express this event using set notation and compute its probability.
  5. Let $C$ be the event that the random sample contains at least one diseased individual. Compute $P(C)$.

For part (a), you need to choose 1 person from the 3 who are diseased AND 4 people from the 17 who are not. For part (e), remember that the probability of "at least one" is equal to 1 minus the probability of "none".

  1. 7,140
  2. $P(A) = \frac{7140}{15504} \approx 0.4605$
  3. $P(B) = \frac{6840}{15504} \approx 0.4412$
  4. $P(A \cup B) = P(A) + P(B) \approx 0.9017$
  5. $P(C) = 1 - P(B) \approx 0.5588$

There are 20 people total: 3 diseased and 17 not diseased. The total number of possible 5-person samples is $\binom{20}{5} = 15,504$.
  1. To get a sample with exactly one diseased person, we must choose 1 of the 3 diseased people AND 4 of the 17 non-diseased people.
    Ways = $\binom{3}{1} \times \binom{17}{4} = 3 \times \frac{17!}{4!13!} = 3 \times 2380 = 7,140$.
  2. $P(A) = \frac{\text{Ways to get 1 diseased}}{\text{Total possible samples}} = \frac{7140}{15504} \approx 0.4605$.
  3. Event B is a sample with zero diseased individuals. This means we must choose 0 of the 3 diseased people AND 5 of the 17 non-diseased people.
    Ways to get 0 diseased = $\binom{3}{0} \times \binom{17}{5} = 1 \times \frac{17!}{5!12!} = 1 \times 6188 = 6,840$.
    $P(B) = \frac{6840}{15504} \approx 0.4412$.
  4. This event is the union of event A (one diseased) and event B (zero diseased). In set notation, this is $A \cup B$. Since A and B are mutually exclusive (a sample can't have exactly one and exactly zero diseased people at the same time), the probability is $P(A \cup B) = P(A) + P(B) = \frac{7140}{15504} + \frac{6840}{15504} = \frac{13980}{15504} \approx 0.9017$.
  5. Event C, "at least one diseased", is the complement of event B, "no diseased individuals". The easiest way to compute this is $P(C) = 1 - P(B)$.
    $P(C) = 1 - 0.4412 = 0.5588$. (Alternatively, one could calculate the probabilities of getting 1, 2, or 3 diseased individuals and add them up).

©