# How do I compare categorical data with multiple uneven populations?

Is there a difference in lineage occurrence/proportion between study 1 compared to the other 4 studies?

There are 5 populations. We need to compare population 1 to each of the other 4 populations (4 previous studies).

We are comparing the occurrence of Lineages (4 Levels- A, B, C, D) in population 1 compared to their occurrence in each of the other 4 populations.

The Lineages are categorical (A,B,C, D) every sample will fit within one of these lineages. The sample sizes are uneven between the different studies, therefore I thought it best to work with proportions.

I thought that I could just use a proportion test or chi-square test but I'm a bit confused since I have the 4 lineages and the 5 studies. Do I just look at one lineage at a time and since it's a proportion, it factors in the other lineages?

i.e.: LineageB <- prop.test(x = c(198,140,31,35,205), n = c(213,140,31,39,250)); LineageB

p-value = 3.502e-08 (This isn't specific enough, tells me there is likely a different between all studies)

Since I'm only concerned with population 1 compared to each of the others should I only include my study and one other at a time?

Lineages | Pop 1 |
Pop 2 |
Pop 3 |
Pop 4 |
Pop 5 |

A | 4 | 0 | 0 | 0 | 0 |

B | 198 | 140 | 31 | 35 | 205 |

C | 6 | 0 | 0 | 4 | 18 |

D | 5 | 0 | 0 | 0 | 27 |

Total |
213 | 140 | 31 | 39 | 250 |

## Answer

**Answers can be viewed only if**

- The questioner was satisfied and accepted the answer, or
- The answer was disputed, but the judge evaluated it as 100% correct.

- answered
- 366 views
- $6.00

### Related Questions

- Five married couples are seated around a table at random. Let $X$ be the number of wives who sit next to their husbands. What is the expectation of $X$ $(E[X])$?
- real analysis
- Limit of an Integral of a $C^\infty$-Smooth Function with Compact Support
- [Real Analysis] Let $a>1$ and $K>0$. Show that there exists $n_0∈N$ such that $a^{n_0}>K$.
- Uniform convergence of functions
- Probabilities
- How safe is this driver?
- What is the normal probability distribution function?