When is frequency distribution used




















Frequency distribution tables can be used for both categorical and numeric variables. Continuous variables should only be used with class intervals, which will be explained shortly. A survey was taken on Maple Avenue. In each of 20 homes, people were asked how many cars were registered to their households. The results were recorded as follows:. By looking at this frequency distribution table quickly, we can see that out of 20 households surveyed, 4 households had no cars, 6 households had 1 car, etc.

A cumulative frequency distribution table is a more detailed table. It looks almost the same as a frequency distribution table but it has added columns that give the cumulative frequency and the cumulative percentage of the results, as well.

At a recent chess tournament, all 10 of the participants had to fill out a form that gave their names, address and age. The ages of the participants were recorded as follows:. If a variable takes a large number of values, then it is easier to present and handle the data by grouping the values into class intervals. Continuous variables are more likely to be presented in class intervals, while discrete variables can be grouped into class intervals or not. To illustrate, suppose we set out age ranges for a study of young people, while allowing for the possibility that some older people may also fall into the scope of our study.

The frequency of a class interval is the number of observations that occur in a particular predefined interval. So, for example, if 20 people aged 5 to 9 appear in our study's data, the frequency for the 5—9 interval is The endpoints of a class interval are the lowest and highest values that a variable can take.

So, the intervals in our study are 0 to 4 years, 5 to 9 years, 10 to 14 years, 15 to 19 years, 20 to 24 years, and 25 years and over. The endpoints of the first interval are 0 and 4 if the variable is discrete, and 0 and 4. The endpoints of the other class intervals would be determined in the same way. Class interval width is the difference between the lower endpoint of an interval and the lower endpoint of the next interval. Thus, if our study's continuous intervals are 0 to 4, 5 to 9, etc.

The intervals could also be written as 0 to less than 5, 5 to less than 10, 10 to less than 15, 15 to less than 20, 20 to less than 25, and 25 and over. In summary, follow these basic rules when constructing a frequency distribution table for a data set that contains a large number of observations:.

In deciding on the width of the class intervals, you will have to find a compromise between having intervals short enough so that not all of the observations fall in the same interval, but long enough so that you do not end up with only one observation per interval. It is also important to make sure that the class intervals are mutually exclusive and collectively exhaustive. Thirty AA batteries were tested to determine how long they would last.

The results, to the nearest minute, were recorded as follows:. Use the steps in Example 1 and the above rules to help you construct a frequency distribution table. Using the given data and a class interval of 10, the interval for the first class is to and includes the lowest value.

Remember, there should always be enough class intervals so that the highest value is included. An analyst studying the data from example 3 might want to know not only how long batteries last, but also what proportion of the batteries falls into each class interval of battery life.

The percentage frequency is found by multiplying each relative frequency value by Use the data from Example 3 to make a table giving the relative frequency and percentage frequency of each interval of battery life. As previously shown for example 2, cumulative frequency is used to determine the number of observations that lie below a particular value in a data set.

The cumulative frequency is calculated by adding each frequency from a frequency distribution table to the sum of its predecessors. The last value will always be equal to the total for all observations, since all frequencies will already have been added to the previous total. The daily number of rock climbers in Lake Louise, Alberta was recorded over a day period. The results are as follows:. The number of rock climbers ranges from 4 to In order to create a frequency table, the data are best grouped in class intervals of Rather than displaying the frequencies from each class, a cumulative frequency distribution displays a running total of all the preceding frequencies.

Constructing a cumulative frequency distribution is not that much different than constructing a regular frequency distribution. The third column should be labeled Cumulative Frequency. Then, count the number of data points that falls in each class and write that number in column two. The first entry will be the same as the first entry in the Frequency column.

The second entry will be the sum of the first two entries in the Frequency column, the third entry will be the sum of the first three entries in the Frequency column, etc. The last entry in the Cumulative Frequency column should equal the number of total data points, if the math has been done correctly.

There are a number of ways in which cumulative frequency distributions can be displayed graphically. Histograms are common, as are frequency polygons. Frequency polygons are a graphical device for understanding the shapes of distributions. They serve the same purpose as histograms, but are especially helpful in comparing sets of data.

Frequency Polygon : This graph shows an example of a cumulative frequency polygon. Frequency Histograms : This image shows the difference between an ordinary histogram and a cumulative frequency histogram. A plot is a graphical technique for representing a data set, usually as a graph showing the relationship between two or more variables.

Graphs of functions are used in mathematics, sciences, engineering, technology, finance, and other areas where a visual representation of the relationship between variables would be useful. Graphs can also be used to read off the value of an unknown variable plotted as a function of a known one.

Graphical procedures are also used to gain insight into a data set in terms of:. Plots play an important role in statistics and data analysis. The procedures here can broadly be split into two parts: quantitative and graphical. Quantitative techniques are the set of statistical procedures that yield numeric or tabular output. Some examples of quantitative techniques include:. There are also many statistical tools generally referred to as graphical techniques which include:.

Scatter plot: This is a type of mathematical diagram using Cartesian coordinates to display values for two variables for a set of data. The data is displayed as a collection of points, each having the value of one variable determining the position on the horizontal axis and the value of the other variable determining the position on the vertical axis. This kind of plot is also called a scatter chart, scattergram, scatter diagram, or scatter graph.

Histogram: In statistics, a histogram is a graphical representation of the distribution of data. It is an estimate of the probability distribution of a continuous variable or can be used to plot the frequency of an event number of times an event occurs in an experiment or study. Box plot: In descriptive statistics, a boxplot, also known as a box-and-whisker diagram, is a convenient way of graphically depicting groups of numerical data through their five-number summaries the smallest observation, lower quartile Q1 , median Q2 , upper quartile Q3 , and largest observation.

A boxplot may also indicate which observations, if any, might be considered outliers. Scatter Plot : This is an example of a scatter plot, depicting the waiting time between eruptions and the duration of the eruption for the Old Faithful geyser in Yellowstone National Park, Wyoming, USA.

In statistics, distributions can take on a variety of shapes. Considerations of the shape of a distribution arise in statistical data analysis, where simple quantitative descriptive statistics and plotting techniques, such as histograms, can lead to the selection of a particular family of distributions for modelling purposes.

In a symmetrical distribution, the two sides of the distribution are mirror images of each other. A normal distribution is an example of a truly symmetric distribution of data item values. When a histogram is constructed on values that are normally distributed, the shape of the columns form a symmetrical bell shape.

Also, there is only one mode, and most of the data are clustered around the center. The more extreme values on either side of the center become more rare as distance from the center increases. This is known as the empirical rule or the 3-sigma rule. Normal Distribution : This image shows a normal distribution.

In an asymmetrical distribution, the two sides will not be mirror images of each other. Skewness is the tendency for the values to be more frequent around the high or low ends of the x-axis. When a histogram is constructed for skewed data, it is possible to identify skewness by looking at the shape of the distribution. A distribution is said to be positively skewed or skewed to the right when the tail on the right side of the histogram is longer than the left side.

Most of the values tend to cluster toward the left side of the x-axis i. In this case, the median is less than the mean. Positively Skewed Distribution : This distribution is said to be positively skewed or skewed to the right because the tail on the right side of the histogram is longer than the left side.

A distribution is said to be negatively skewed or skewed to the left when the tail on the left side of the histogram is longer than the right side. Most of the values tend to cluster toward the right side of the x-axis i. In this case, the median is greater than the mean. Negatively Skewed Distribution : This distribution is said to be negatively skewed or skewed to the left because the tail on the left side of the histogram is longer than the right side. When data are skewed, the median is usually a more appropriate measure of central tendency than the mean.

This means there is one mode a value that occurs more frequently than any other for the data. A bi-modal distribution occurs when there are two modes. Multi-modal distributions with more than two modes are also possible.

A raw score is an original datum, or observation, that has not been transformed. This may include, for example, the original result obtained by a student on a test i. It requires knowing the population parameters, not the statistics of a sample drawn from the population of interest. However, in cases where it is impossible to measure every member of a population, the standard deviation may be estimated using a random sample.

Normal Distribution and Scales : Shown here is a chart comparing the various grading methods in a normal distribution. Privacy Policy. Frequency distributions can be displayed in a table, histogram, line graph, dot plot, or a pie chart, to just name a few. Frequency tables are useful for analyzing categorical data and for screening data for data entry errors. Note that we will refer to two types of categorical variables: Categorical and Grouping or Break.

Grouping variables are used to split a database into subgroups. Frequency tables, pie charts, and bar charts are the most appropriate graphical displays for categorical variables. Below are a frequency table, a pie chart, and a bar graph for data concerning Mental Health Admission numbers. Categorical variables represent types of data which may be divided into groups. Examples of categorical variables are race, sex, age group, and educational level.

Quantitative Information — Involves a measurable quantity—numbers are used. Some examples are length, mass, temperature, and time. Quantitative information is often called data, but can also be things other than numbers. Qualitative Information — Involves a descriptive judgment using concept words instead of numbers.

Begin typing your search term above and press enter to search. Press ESC to cancel. Skip to content Home Essay What are the uses of frequency distribution?

Ben Davis May 1, What are the uses of frequency distribution? What is the importance of frequency distribution table in our lives? What is frequency distribution explain with an example? What are the advantages and disadvantages of using frequency distributions? What is the use of relative frequency? Why is it bad to have too few or too many categories in a grouped frequency distribution? What is frequency distribution of qualitative data and why is it useful? What are the components seen on a basic frequency distribution?



0コメント

  • 1000 / 1000