Center Home -> Content Areas Home -> Math Home -> Project Activities -> Fathom Activities  ->

The Correlation Between Shoe Size and Reading Level

Activity Description Activity GuideResources


Part 1: Plotting and Curve Fitting Shoe Size and Reading Level Data

Scientists have found a strong correlation between shoe size and reading level of school children. One way to measure reading level is to use a scale from 1.0 to 12.9 (5.7 is the level of reading which one would have in the 5th grade at the end of the 7th month). In Table 1 is sample data for 8 people.

Name 

Shoe Size 

Reading Score 

Agida

10

8.6

Carey

8.5

6.6

Vince

11

9.5

Paula

6

7.1

Tom

3

2.5

Jim

1

1.0

Kathy

4

3.9

Scott

6

4.7

Table 1

1.  Look at the data in Table 1. Describe any relationships you see between the variables reading score and shoe size. Can the data in Table 1 be reorganized to help someone to see relationships? Explain your answer and reasons.

2.  Enter this data into a Fathom document and reorganize the data to illustrate your answer to the question above.

How to open Fathom document:       

If you do not have a blank document after you started Fathom, create one by clicking File, then New

 

How to enter data: 

  1. You may choose “a” or “b” to get an empty case table.

    1. Click Insert, then click Case Table

    2. Click, hold, and grab the Case Table icon (below the Display on the menu bar), and drag it to the workspace.

  2. To enter an attribute click on <new> , type in the first attribute (e.g., Name), and hit Enter . Note: Attribute names must be one word (e.g., ShoeSize).

  3. Repeat this process to enter all attributes. Note: You may drag the edges of the table or the attribute boxes to make them wider.

  4. To title this table, double click on the label Collection 1, type in the title, and hit OK

  5. Enter data in the empty cells. Use Tab to move to the next cell.

  6. Save your file by clicking File then Save

Note: You may always undo or redo changes by clicking Edit, then either Undo, or Redo

 

How to sort the data:

1. Select the column heading of the attribute you want to sort.

2.Click Data, and then choose either Sort Ascending, or Sort Descending

3.  Describe patterns you observe in the reorganized data.

4.  Graph the data and relate features of the graph to the patterns you described above.

How to create a graph: 

  1. Click Insert, choose Graph. You may also make a graph by dragging the graph icon (under Insert in the menu bar) to an empty area on the workspace.
  2. Drag the column header for shoesize to the horizontal axis of the graph over the spot labeled Drop an attribute here. (As you move the mouse over the x-axis, a black border appears, showing that you can release the attribute there.)
  3. Similarly, drag the column header for readingscore and drop it on the vertical axis of the graph.

Note: Fathom autoscales the axis. To rescale an axis, grab and drag it. To set a y-intercept at zero, choose Lock Intercept at Zero under the Graph menu.

Note: When you move the mouse on the graph window, the coordinates of the tip of the arrow will show up in the status bar at the bottom left of the Fathom window.

5.  What type of function would best fit this data? Manually fit an approximate line (Movable Line) to this data.

How to manually fit a line: (estimating, not calculating, a the line of best fit)

  1. Click on the graph window.
  2. Click Graph, choose Movable Line.
  3. To move this line parallel to itself, grab the middle of the line and drag it.
  4. To change the slope, grab and drag the line at either “end.”

6.  Click on graph window to activate it. Choose Graph, then Show Squares. Discuss what the length of each square represents. 

 

Note: This length represents the difference between the original y-value (data point) and predicted y-value (line of best fit) and is referred to as a residual.

How to show residuals:

  1. Click on graph window to activate it.
  2. Choose Graph menu, choose Show Squares. This graph is now shows squares constructed from each point to the line. The length of the side of each square shows residuals.

7.  Compare the magnitude of the squares for Scott and Paula. According to your data, whose reading score is closer to that of a typical person who wears a size six shoe? Explain your reasoning.

8.  Manipulate the Movable Line. How do the residuals change when you move the line?

9.  Graph a residual plot and observe the difference between the original y-value of each point and its predicted y-value. Explain the relationship between the original graph and the residual plot.

 

How to create a residual plot:

  1. Click on Graph menu.

  2. Choose Make Residual Plot.

10.  Using the residual graph as an aid, manipulate the Movable Line to best fit this data. Explain why you think this line fits the data best.

11.  Adjust the line so that the sum of the areas of the squares is approximately at a minimum. Note the equation of this line. The line that satisfies this criterion is called the least-squares regression line. Compute the least-squares regression line using Fathom.

How to compute the least-squares regression line:

  1. Click on the graph window

  2. Select Graph  from the menu bar and choose Least-Squares Line 

12. Compare the equations of the Movable Line and the least-squares regression line. Turn off the      Movable Line.

Part 2: Interpreting Outliers. 

1.  Look at the scatterplot and decide which data point least fits the general trend of the data. Make a note of this point. Interpret the meaning of this data point in terms of the variables.

2.  Come up with a statement describing Paula’s coordinates. Write it down. We refer to that point as an outlier. How would you define an outlier? Compare your definition to one in a dictionary. Discuss how the formal definition could be interpreted.

3.  Write down the coordinates of the outlier. Predict the effect on the least-squares regression line when the outlier is moved closer to or farther from the line. Explain your prediction. Drag the point of the outlier to fit the line. See the change of the residual. Discuss the influence of an outlier on a regression line.

 

How to drag a data point: 

1.  Move the mouse so that the tip of the arrow is on top of the outlier.
2.  When the arrow becomes black, click and drag the point.

Note: The coordinates change in the table as well as in the status bar.

4.  Place your outlier back to its original position by typing the initial coordinates in the table (Those were Paula’s shoe size and reading score).

Part 3: Prediction and Causality. 

1.  Explain why this mathematical model makes sense with this population. Is this model applicable to the general population?

2.  Tammy’s shoe size is 5.5. Predict her reading score. How did you arrive at this prediction?  Discuss the appropriateness of this prediction.

3.  Would your prediction change if you know that Tammy is a college student?

4.  Which of the following statements are appropriate according to our model:

·        People with big feet read better than people with small feet;

·        There is a strong correlation between reading level of school children and their shoe size; and 

·        School children with larger shoe size read better than school children with smaller shoe sizes.



Back to Project Activities | Back to Math Homepage

Send questions or comments here.
Last modified on November 12, 2001.