In statistics, one of the first distinctions to understand is between a population and a sample. When you're studying a group, a **population** involves the entire group of interest. In our exercise, the data set lists ages at death for all deceased U.S. presidents. Since it contains information on the entire set of individuals we want to understand, it qualifies as a population.
In many cases, though, researchers deal with a sample instead. A **sample** is a smaller group selected from the population, used to make inferences about the whole group. Samples are generally used when it is impractical or impossible to collect data from everyone in a population.
Choosing between using a population or a sample depends on the resources available and the scope of the research. For comprehensive studies like a census, populations are ideal. For everyday scientific studies, samples often make more practical sense.
- **Population**: Entire group of interest
- **Sample**: Subset of the population