Map Medicare spending

Maps are important decision-making tools. They help determine problem areas and indicate where resources can be better spent. But maps don't always present a single truth. Sometimes you can find different views of the truth in the same set of data. In this case, you'll consider the per capita (per person) cost of the Medicare program in 2022. These costs vary appreciably from place to place. When you make a map, you need to make decisions about how to group these varying cost values. Which range of costs is high or low? Your decisions help create spatial patterns, and these patterns lead map users to draw conclusions. This raises concerns as to the best way to visualize data and find reliable patterns.

First, you'll compare some common techniques for classifying (grouping) data and see how your choices affect spatial patterns on the map. You'll work with Medicare cost data aggregated by county. Medicare is a United States government health insurance program covering about 50 million people who are over age 65 or who meet certain medical conditions. Information about the Medicare program is available from the Centers for Medicare & Medicaid Services.

Open the map

In this section, you'll open a map, familiarize yourself with its features and attributes, and save your own version of the map for further work.

  1. Open the Medicare Spending by County map.

    The map appears in Map Viewer showing all the counties in the United States.

    The Medicare Spending by County map open in Map Viewer

    The map contains a layer with data on 2022 Medicare spending in each county. You'll use this data to style the map to show where there are high and low levels of spending.

  2. If necessary, click Sign In. Sign in to your ArcGIS organizational account.
    Note:

    If you don't have an organizational account, see options for software access.

  3. On the Contents (dark) toolbar, confirm that Layers is selected.

    Layers in the Contents toolbar and the Layers pane

    The map contains two layers: State Boundaries (which is turned off) and Medicare Spending by County.

  4. In the Layers pane, for the State Boundaries layer, click the Visibility button.

    Visibility button for the State Boundaries layer

    You can now see state boundaries. Next, you'll explore the pop-ups for the Medicare Spending by County layer.

  5. On the map, zoom in until you can see individual counties. Click any county.

    A pop-up appears, showing the name of the county, state, and the amount of Medicare spending in 2022 per capita.

    Pop-up showing 2022 Medicare spending for Ness County, Kansas

    The per capita cost data you'll work with reflects the standardized risk-adjusted cost. It differs from the actual cost in two ways. First, it's standardized to even out differences in wages and the cost of goods and services from one part of the country to another. Second, it's risk adjusted to account for differences in age, sex, existing health conditions, and other relevant demographic factors. The standardized risk-adjusted value is the best estimate of what the actual costs would be if socioeconomic, demographic, and health conditions were uniform across the country.

    Note:

    For a detailed explanation of how standardization and risk adjustment are calculated, see Medicare Data for the Fee-for-Service Geographic Variation Public Use File: A Methodological Overview (May 2024 Update).

  6. Click a few other counties to review their pop-ups. When finished, close the pop-up.

    Pop-ups tell you about individual features, but they don't help you see spatial patterns. To see patterns, you must symbolize the data. You'll create your own copy of the map so you can save the changes you make.

  7. On the Contents toolbar, click Save and open and choose Save as.

    Save as in the Save and open menu on the Contents toolbar

  8. In the Save map window, set the following parameters:
    • For Title, type Medicare Costs per Capita in 2022.
    • For Categories, remove the category Tutorial data / Tutorial Start Data.
    • For Summary, type Map showing hot spots of Medicare costs by county based on standardized per capita expenditure data from 2022.

    Parameters in the Save map window

  9. Click Save.

Style by natural breaks

You've saved a copy of the map. Now, you can style the map to answer your research question: Where is there significantly high Medicare spending in the country?

A typical way to present spatial patterns on a map is to associate ranges of data values with a color ramp. There are a few common methods for specifying value ranges. You'll use the Natural breaks method.

  1. In the Layers pane, click the Medicare Spending by County layer to select it.

    Medicare Spending by County layer in the Layers pane

    The blue line next to the layer name indicates that the layer is selected.

  2. On the Settings (light) toolbar, click Styles.

    Styles on the Settings toolbar

  3. In the Styles pane, for Choose attributes, click Field.

    Field button in the Styles pane

  4. In the Select fields window, check the box for Standard Payment per capita.

    Standard Payment per capita field in the Select fields window

  5. Click Add.

    Once you add the attribute, the available drawing styles are presented. A suggested style, Counts and Amounts (color), is automatically applied and is indicated by a check mark in the Styles pane.

    On the map, the counties are now drawn in shades of blue. Darker shades represent regions where Medicare expenditures were higher in 2022.

    Layer styled to show Medicare expenditures in 2022 by county

    To better understand the layer style, you'll explore the layer's style options.

  6. In the Styles pane, for the Counts and Amounts (color) style, click Style options.

    Style options button for the counts and Amounts (color) style

    The Style options pane appears with options for styling the layer. It includes a histogram, which shows you the range of values in the field you are using to style the layer with and the corresponding colors to symbolize those values.

    Histogram of Medicare spending data unclassed in the Style options pane

    The histogram provides helpful information about how your data and layer is being styled. The top and bottom of the histogram show you the lowest and highest values in your dataset. On the right side of the histogram, the middle value is the mean value of the data. The values above and below the mean are the standard deviation values.

    The style is currently using a continuous, unclassed method, meaning the symbol colors change gradually from the minimum value to the maximum value.

    You'll experiment with using a classified method. By classifying the data (dividing it into classes or groups) you change the ranges and breaks for each of the classes. Different classification methods will create different-looking maps.

  7. Toward the bottom of the Style options pane, turn on Classify data. For Number of classes, type 5.

    Classify data turned on and Number of classes set to 5

    Choosing five classes provides more variation in the map without adding too many classes, which could make it difficult to see differences between each class.

    In the map legend, the range of cost values is grouped into five classes by the default Natural breaks classification method.

    Legend showing the five classes of Medicare Spending per capita

    The Natural breaks method uses clusters and gaps in the value range to define classes.

    One characteristic of this method is that value ranges may be different from class to class. Here, the value range of the lowest class ($4,244 to $9,263) is $5,019, while the range of the next class ($9,263 to $10,848) is just $1,585. Another characteristic is that classes may have different numbers of members. For example, the highest class includes 118 counties while the lowest includes 536 counties.

    It is also important to consider if you have any null values.

  8. In the Style options pane, under the histogram, turn on Show features with out of range or no values.

    Show features with out of range or no values option

    The counties in the state of Connecticut now are shown in gray.

    Counties in Connecticut styled in gray because they have no value in the layer dataset

    In 2022, Connecticut made changes to their county boundaries. The Medicare spending data for 2022 used different boundaries than the current boundaries, resulting in null values. For the purpose of this tutorial, you'll exclude this data from your analysis.

    Note:

    To learn more about the change in Connecticut's counties, see Change to County-Equivalents in the State of Connecticut for 2022 ACS.

    Next, you'll adjust layer style by changing the color ramp.

  9. In the Style options pane, click the symbol under Symbol style.

    Symbol style for the Medicare Spending by County layer in the Style options pane

  10. In the Symbol style window, click the color ramp for Colors. In the Ramp window, choose Purple 2.
    Tip:

    To see the name of a color ramp, point to the color ramp.

    The Purple 2 color ramp in the Ramp window

  11. Click Done.

    The layer style updates.

    Layer styled updated with the Purple 2 color ramp

    The map shows distinct patterns. High rates of expenditure are shown throughout the South, especially in Texas, Louisiana, Mississippi, and Florida. High levels of spending were also prevalent through the Great Plains region, notably in Oklahoma and Kansas. There are also isolated high spending in other areas of the country.

  12. In the Style options pane, click Done twice.
  13. On the Contents toolbar, click Save and open and choose Save.

Explore spatial patterns

You'll explore the spatial patterns on the map by looking at the legend of the Medicare Spending by County layer and then zooming to different geographic areas and opening pop-ups.

  1. On the Contents toolbar, click Legend.

    Legend on the Contents toolbar

    The legend shows the value range associated with each color. In any classification scheme, class breaks are important because they lead map users to form judgments: in one place costs seem to be high, while in another they seem to be very high. In fact, however, the difference between a given pair of values in different classes may be small.

  2. On the Contents toolbar, click Bookmarks. In the Bookmarks pane, click Southwest.

    Southwest bookmark in the Bookmarks pane

    The map navigates to the southwestern United States.

  3. Click one of the counties styled for the middle of the five classes.

    San Bernardino county pop-up

    San Bernardino County, shown in the example image, reported $11,986 per capita Medicare Expenditure in 2022. The middle class ranges from $10,848 to $12,452 per capita.

  4. Close the pop-up. Click the county in the highest class in Nevada, where Las Vegas is located.

    Clark County in Nevada styled in the highest class

    In Clark County, the 2022 expenditure was $12,660. The difference between the two regions is only $674, but it's enough to put them in different classes—at least with the Natural Breaks classification method.

  5. Close the pop-up.
  6. Navigate to the Midwest and Northeast bookmarks and compare other counties' Medicare expenditure.

    Some counties had very high amounts of Medicare spending compared to their neighboring counties, such as Monroe County in the southern part of Iowa. In some cases, the amount spent in one county styled in the highest class does appear strikingly high compared to a neighboring county styled in the lowest class. But there may be some examples where the difference does not seem very large despite the class style difference. There are other methods to style the data that might better communicate the differences between counties.

  7. In the Bookmarks pane, click USA to navigate back to the continental United States.
  8. On Contents toolbar, click Layers. In the Layers pane, point to the State Boundaries layer and click the Visibility button to turn the layer off.
  9. Save the map.

Classify the data by other methods

The Natural breaks method isn't the only available classification method. You'll see to what extent the spatial patterns change when you use the Equal interval and Quantile methods.

  1. In the Layers pane, confirm that Medicare Spending by County is selected. On the Settings toolbar, click Styles.
  2. In the Styles pane, under Counts and Amounts (color), click Style options.
  3. Under Classify data, for Method, choose Equal interval.

    Equal interval set for Method in the Styles pane

    These class breaks are different. The defining characteristic of the Equal interval method is that value ranges are the same among all classes. In this case, the range is about $4,307. A class can have any number of counties, or even no counties.

    Layer styled using the Equal interval classification method

    Although a fairly similar pattern of high and low values is evident, a different impression is created. Fewer regions fall into the lowest and highest classes, making them stand out, and the map has an overall homogeneous appearance.

  4. In the Style options pane, for Method, choose Quantile.

    The classes change again. The defining characteristic of the Quantile method is that all classes have the same number of members (in this case, either 626 or 627 counties). The value ranges among classes may be very different. Here, the value range of the lowest class is $5,266, while the range of the middle class is $714.

    Legend showing the five classes with the Quantile classification method

    In contrast to the previous map, the Quantile method tends to emphasize highs and lows and may exaggerate their importance.

    Map styled to show Medicare expenditure by county using the Quantile classification method

    None of the classification methods you've looked at is right or wrong. The Quantile and Equal interval methods give accurate results when data is continuously and evenly distributed throughout the value range. This is often not the case, however. When there are gaps and clusters in the data, the Natural breaks method is recommended.

    In this situation, the data has a fairly normal, or bell-shaped, distribution. With this linear distribution, the Quantile method is recommended. Because features are grouped in equal numbers in each class, if you have an unevenly distributed dataset, the resulting map can often be misleading.

    Histogram for Medicare expenditures using the Quantile classification method

    The Quantile method is also a useful style for determining resource allocation. For example, if you need to develop a health policy that targets supporting areas with the most need, you could use the Quantile method with five classes, the highest class represents the top 20 percent of counties that should receive this funding first.

    Note:

    To learn more about classification methods, see Use style options (Map Viewer) - Classification methods and the video Configure a choropleth map.

  5. In the Styles pane, click Done.
  6. Save the map.

Decisions about how to classify data are at least partly subjective. You might like the way a map looks or you might want to convey a certain message. No classification method is wrong, and each may help emphasize an aspect of the data that's not apparent from the others. But you may wonder if it's possible to get a better sense of which spatial patterns are stable and reliable, to know which places definitely stand apart from the others.

The answer is yes: there are analysis techniques that help you group and visualize data in less subjective ways. Next, you'll explore hot spot analysis and see how statistical evaluation can find spatial clusters of significantly high and low values in your data.


Analyze Medicare spending hot spots

Previously, you observed how the spatial patterns on a map change depending on the data classification method. Next, you'll run a hot spot analysis on the data to draw more definite conclusions about its patterns. Hot spot analysis applies statistical tests to find areas where values are significantly different from the norm.

Find hot spots

The maps you styled showed a variation in the amount of Medicare spending around the country. In 2022, the amount of spending was relatively higher in states in the South and the Great Plains. Spending was low in New England, parts of the Midwest, the Northwest, and the Rocky Mountain range region. But styling the map by different classification methods does not tell you if there are statistically significant differences. The Find Hot Spots tool identifies statistically significant spatial clustering of high values (hot spots) and low values (cold spots) or data counts using the Getis-Ord Gi* statistic.

  1. In the Layers pane, confirm that the Medicare Spending by County layer is selected.
  2. In the Settings toolbar, click Analysis.

    Analysis button

  3. In the Analysis pane, click Tools. In the Tools pane, in the search bar, type hot spot and press Enter.

    Find Hot Spots tool in the search results in the Tools pane

    The Find Hot Spots tool appears in the list of results. This tool employs a spatial statistical technique to identify spatial patterns, providing a confidence level for the presence of high or low-value clusters.

  4. In the list of results, click the Find Hot Spots tool.

    The Find Hot Spots pane appears.

    Tip:

    At the top of the pane, next to the tool name, the information button will open a window with a description of the tool. In this window, the Learn More link will take you to a web page with more information about the tool.

    The first parameter is Input layer. This parameter determines the layer that contains the point or polygon features on which hot spot analysis will be performed.

  5. In the Find Hot Spots pane, for Input layer, choose Medicare Spending by County.

    Input layer parameter

    In the Hot spot settings section, Analysis field is the field that will be analyzed for clusters of high values (hot spots) and low values (cold spots). You want to analyze hot and cold spots of Medicare spending.

  6. For the Analysis field, choose Standard Payment per capita.

    The optional Divide by parameter allows you to divide the Analysis field values by the values in another field. An example of why you would use this parameter is if your analysis field data is influenced by population, such as the total amount of Medicare spending per county. Counties with more people would likely have more total spending, so dividing the analysis field by population ensures that analysis is being done per capita (per person). In your case, the Standard Payment per capita field is already calculated per capita, so you'll leave this parameter unchanged.

    Finally, you'll provide the name of the layer that will be created when the tool is run.

  7. For Output name, type Medicare Spending Hot Spots and add your name or initials to ensure the name is unique within your organization.

    Output name parameter

  8. Click Estimate credits.

    Estimate credits button

    Credits are the currency used across ArcGIS Online. They are consumed during specific transactions, such as performing analytics, storing features, and geocoding.

    Running this tool will require 3.143 credits.

    Note:

    To learn more about credits, see Understand credits. You can learn how many remaining credits are in your ArcGIS Online account if your organization administrator has enabled you to view that information. If it is enabled, at the top of the page, click your user name and choose My settings. On the My settings page, click Credits to see how many remaining credits are in your account. If it is not enabled, contact your organizational account administrator.

  9. Click Run.

    As the tool runs, you can view its progress by clicking the History tab in the Analysis pane.

    After a few minutes, the Medicare Spending Hot Spots layer is added to the map.

    Hot spots analysis results layer adds to the map

    On the map, red and blue areas represent statistically significant clusters of high and low costs, respectively. In regions symbolized in white, the amount of spending did not stand out as significantly high or low.

    The confidence levels of significance reveal the likelihood of high or low values in the study area being clustered. Hot and cold spots with over 90 percent confidence imply that this spatial clustering is likely not due to random chance, but rather the result of some spatial process. A higher confidence level increases our certainty that the observed patterns are occurring for a specific reason.

    The results clarify that there is in fact statistically significant high spending in several states in the South and the great plains. One high spending hot spot that was not as obvious earlier is counties in New Jersey and around New York City.

  10. On the Contents toolbar, click Legend.

    The labels on the layer legend explain the symbols. For example, a hot spot with 99 percent confidence means there is only a 1 percent chance that a cluster of high costs occurred randomly.

    Legend on the Contents toolbar and the Legend pane showing the Hot Spot layer categories

    The legend heading is generated by a field name alias in the layer table. You'll change this heading to something meaningful later in the tutorial.

  11. Save the map.

Change layer symbology

Your map's users may find it more helpful to see the hot and cold spots displayed with state boundaries than the county boundaries.

  1. On the Contents toolbar, click Layers. Confirm that Medicare Spending Hot Spots is selected.
  2. On the Settings toolbar, click the Styles button.
  3. In the Styles pane, for Counts and Amounts (color), click Style options.
  4. In the Style options pane, click the Symbol style button.

    Symbol style on the Style options pane

  5. In the Symbol style window, for Outline width, type 0.

    Outline width set to 0

  6. In the Style options pane, click Done twice.
  7. In the Layers pane, for the State Boundaries layer, click the Visibility button.
  8. In the Layers pane, click the dots next to the State Boundaries layer and drag the layer to the top of the Layers pane.

    The State Boundaries layer dragged to the top of the list of layers in the Layers pane

    The map now only shows the state boundaries.

    State boundaries layer visible on the map above the hot spot results layer

    The hot spots of Medicare spending are in the states in the Gulf Coast region, Oklahoma, Kansas, and New Jersey. The major cold spot regions are in the Northwest, Rockies, portions of the Midwest, New England, and Virginia.

  9. Save the map.

Understand the results

You'll further explore the resulting hot spot layer to better understand what the tool analyzed. First, you'll explore the hot spot layer fields that were generated by the Find Hot Spots tool.

  1. In the Layers pane, for the Medicare Spending Hot Spots layer, click the Options button and choose Show table.

    Show table in the Options menu in the Medicare Spending Hot Spots layer in the Layers pane

    The table appears.

    Table for the Medicare Spending Hot Spots layer

    Tip:

    To better view the field names in the table, you can close panes and collapse toolbars. You can also point to a field name to see the full field name.

    You'll configure the table to show the key fields you want to explore and compare the results for different counties to better understand the analysis results.

  2. At the top of the table, click the Field visibility button.

    Field visibility button at the top of the attribute table

  3. In the Field visibility window, uncheck Source_ID and Standard Payments per capita. Click Done.

    Only five fields are visible in the table.

  4. On the map, click a red county.

    A selected feature highlights in bright cyan.

  5. In the table, click the Show selected button.

    Show selected button in the attribute table and a red county selected in the map

    The table filters to only show the selected record.

    Table filtered for the selected county

  6. On the map, click a blue cold spot county and a no significance white county.

    The counties you click are added to the filtered table. Now you have three different hot spot result records to compare to one another as you learn more about the resulting fields. In the table, the GiPValue and GiZScore fields show important information.

    GiPValue and GiZScore field names in the attribute table

    The Gi part of the field name refers to the Getis-Ord Gi* (pronounced G-i-star) statistic, which is used calculate z-scores and p-values. Gi* correlates each feature with its neighbors and then compares the local average to the average of all the features in the study area to calculate the probabilities of this value cluster being significantly higher or lower in the overall study area. This tool works by looking at each feature within the context of neighboring features.

    Note:

    To learn more about the Getis-Ord Gi* statistic, see How Hot Spot Analysis (Getis-Ord Gi*) works.

    The number at the end of the field alias, 174529, is distance band used to decide the neighborhood size.

    The GiPValue field is the p-value, in which a value less than 0.01 indicates statistical significance with 99 percent confidence. The word fixed in the field alias name indicates that the neighborhood method used is fixed distance band.

    The GiZScore field is the resulting z-score, which is the measure of standard deviation. For example, a z-score of 2 means the amount of Medicare spending in that county was 2 standard deviations higher than all the other counties.

    Note:

    Learn more about z-scores and p-values.

    In the example image, the red hot spot record has a z-score of 7.29 and a p-value of 0.00. This means the amount of Medicare spending in this county was more than seven standard deviations higher than all the other counties in the country. The low p-value means there is 99 percent confidence that this result is not random.

  7. If necessary, in the table, scroll to the Gi_Bin field.

    GI_Bin in the attribute table

    The Gi_Bin field identifies statistically significant hot and cold spots.

    • Features in the +/-3 bins reflect statistical significance with a 99 percent confidence level.
    • Features in the +/-2 bins reflect a 95 percent confidence level.
    • Features in the +/-1 bins reflect a 90 percent confidence level.
    • The clustering for features in bin 0 is not statistically significant.

    FDR stands for False Discovery Rate, and the FDR correction is applied to Find Hot Spots in Map Viewer by default. The FDR correction reduces the significance threshold (p-value) to account for the common multiple testing problem in statistical testing and also the spatial dependency due to the repetitive testing across all features in one dataset.

    Note:

    Learn more about FDR corrections.

    For the red hot spot county record, the Gi_Bin field is 3, meaning this county is determined to have statistically significantly higher Medicare spending with the FDR correction.

    The Statistical Significance field is a text field that can serve as a label for the overall hot spot analysis results.

  8. Using what you have learned, answer the following questions about the other two records:
    • For the record that has a Gi_Bin value of 0, which field explains why it is not statistically significant?
    • How many standard deviations below the mean is the cold spot county?
    • Which field tells you the cold spot record has a 99 percent confidence level?
    Tip:

    Consider selecting more counties on the map that are in Gi_Bin +/- 1 or 2 to compare in the table.

    Next, you'll observe the last field generated by the tool.

  9. In the table, scroll to the NNeighbors field.

    The NNeighbors field name also includes the scale of analysis value like the other fields. But where does this number come from? To better understand these values, you'll view the analysis history results.

  10. On the Settings toolbar, click Analysis. In the Analysis pane, click the History tab.

    History tab in the Analysis pane

  11. For the Find Hot Spots tool history item, click the options button and choose View details.

    View details option

    The Results tab appears for the Find Hot Spots window.

    Results tab for the Find Hot Spots tool that was run

    The Results tab provides important details, such as how many outliers were determined and that they were not included in calculating the scale of analysis value.

    The tool by default calculates the optimal fixed distance band by averaging the distance to the nearest 30 neighbors. The fixed distance band used in this analysis is 174,529 meters, which is the value you saw at the end of the field names in the attribute table. For each feature, the features inside the 174,529 meter buffer zone will be considered neighbors of the feature. This field tells you how many neighbors are used within the 174,529 meter buffer for each feature.

    Under Hot Spot Analysis, you also see that the FDR correction determined statistical significance for 2,127 of the 3,134 features.

  12. Save the map.

In this tutorial, you explored classification methods to show where Medicare expenditures were highest across the country. Using the Find Hot Spots tool, you took the analysis further and determined statistically significant areas of high and low Medicare spending. You also explored the hot spot analysis results to better understand the resulting fields and calculations that went into the analysis tool.

While these maps do not provide a causal explanation for why spending was high or low in certain regions of the country, it provides insight that there are geographic differences in health-care spending. Creating maps often invites further exploration and inspires new questions. For example, the maps created in this tutorial raise questions such as Why is the Medicare spending especially high in the Gulf Coast region states? Why is New Jersey a lone hot spot in the Northeast?

There are additional spatial statistical methods to help answer these questions, such as a regression analysis. Regression analysis is a statistical technique that helps you understand relationships among variables in your data. Consider taking the analysis further by applying more spatial statistical methods to better understand why health care costs more in some areas than others.

Note:

To learn more about regression analysis, explore the tutorial Determine how location impacts interest rates.

You can find more tutorials in the tutorial gallery.