Add disaggregated data to a composite index

With a composite index, you can take a complex issue and break it down into smaller, more manageable pieces and build them into a single indicator. The index tool is helpful for identifying where potential inequities exist, but you want to build on this idea and determine who is experiencing inequity.

Unfortunately, an index is often built either by grouping all people of color or leaving out race and ethnicity. When done strategically and thoughtfully, disaggregated data can help identify where interventions are most needed so that everyone has equal access and opportunities regardless of their social identity or socioeconomic background.

Explore the indicators

To begin, you will open an ArcGIS Pro package containing a composite index for senior food access. You will explore the indicators that make up the index.

  1. Download and open the SeniorIndexArcGIS Pro package.

    The ArcGIS Pro project appears with a map of Hennepin County and the Index Score layer visible, styled by the mean index score field.

    The ArcGIS Pro package opens showing map of mean Index Score values by tract in Hennepin County, Minnesota.

    First, you will explore the index indicators using data engineering.

  2. In the Contents pane, right-click Index Score and click Data Engineering.

    Data Engineering in the menu for Index Score

    The Data Engineering view for the Index Score layer appears.

    Data Engineering view pane

  3. In the Data Engineering view, in the fields list, drag the Percent 65+ Lives Alone field to the statistics panel.

    Percent 65+ Lives Alone indicator dragged into the statistics panel on the Data Engineering view

  4. Drag the following fields into the statistics panel:
    • Percent 60+ below poverty level in past year
    • Percent 65+ renters spend at least 30 percent housing
    • Percent 65+ No Health Insurance Coverage
    • Miles to nearest grocery
    • Percent of adults 18 year and over in limited English households
    • Percent households with no vehicles available
    Note:

    The data for the indicators are derived from the ACS Context for Senior Well-Being - Boundaries, ACS English Ability and Linguistic Isolation Variables - Boundaries, and ACS Vehicle Availability Variables - Boundaries layers from ArcGIS Living Atlas of the World. The grocery store data was derived from data by Data Axle using ArcGIS Business Analyst.

    You have added the seven indicators used to create the index into the statistics panel of the Data Engineering view.

  5. At the top of the statistics panel, click Calculate.

    Calculate button at the top of the Data Engineering view

    Several types of statistics are calculated for each of the indicators. Before you explore the results, you will freeze the Alias column.

  6. In the statistics panel, right-click Alias and click Freeze/Unfreeze.

    The Alias column is now locked to the side of the table.

    Note:

    If necessary, you can expand the width of the Alias column so you can see more of the Alias text.

    In the statistics panel, for Chart Preview, you can see a histogram distribution of all the census tracts in Hennepin county for each of the indicators. This can be helpful in understanding if there are outliers to consider.

    Chart Preview column showing histogram of each indicator

    The distribution for Percent 65+ Lives Alone appears like a normal distribution with some skewness. You will explore this further.

  7. Scroll through the statistics panel until the Third Quartile column is visible. Right-click the cell for Percent 65+ Lives Alone under the Third Quartile column, point to Select, and click Above Quartile.

    Above Quartile option in Select menu for the Percent 65+ Lives Alone indicator in the Data Engineering view

    On the map, all the census tracts with Percent 65+ Lives Alone in the top quartile are highlighted. The quartile breaks data down into quarters or four sections, each representing 25 percent of the data points. Therefore, the highlighted areas are the tracts in the top 25 percentile, indicating a higher proportion of older adults living alone.

    Map highlighted with tracts in the highest quartile for seniors who live alone

  8. On the ribbon, on the Data Engineering tab, in the Selection group, click Clear.

    Clear in the Selection group on the Data Engineering tab

    You also noticed that for the Percent 65+ renters spend at least 30 percent housing field, the histogram showed a large number of census tracts with a low percent and a large number of census tracts with a high number of high percent. You want to understand where is the area with high percentage of renters who are housing burdened.

  9. In the Data Engineering view, on the field list, point to Percent 65+ renters spend at least 30 percent housing and click the Create Chart button.

    Create Chart for the Percent 65+ renters spend at least 30 percent housing indicator

    A chart of the distribution for this field appears.

  10. Click the bar with the highest percentages of renters who are housing burdened.

    Bar with highest percentage of renters who are housing burdened selected on the chart

    The tracts corresponding to these values are selected on the map. On the maps, you can see the distribution of senior renters that are housing-burdened.

    Tracts with the highest percentage of seniors renters who are housing burdened are selected on the map.

  11. On the ribbon, click the Data Engineering tab, and in the Selection group, click Clear.
  12. Continue to explore other indicators.

In this section, you used data engineering to better understand the statistics and distribution of each indicator that made up the senior food access index. Data engineering is a powerful tool to better understand complex datasets and to ask important questions about how the data was used to build and contribute to an index's final score. Next, you will continue to explore the index results using charts.

Analyze index results using charts

This example index was created using the Calculate Composite Index tool. In addition to several maps, the tool produces charts that can be used to help interpret and answer various questions about the index.

Note:

To learn more, see the Interpret results section of the How Calculate Composite Index works help topic.

  1. In the Contents pane, under the Index Score layer, double-click the Distribution of Index chart.

    Distribution of Index chart in the Contents pane

    The chart appears.

    Distribution of Index chart

    The primary index layer in the group layer output contains a histogram of the index distribution. Along with the map, this can help you understand the distribution of the results.

    Like the charts you created in the Data Engineering view, you can click or draw a box around several bins on the chart to select the corresponding tracts, which will highlight on the map.

  2. In the Contents pane, double-click the Relationships of Scaled Variables and Index chart.

    The chart appears.

    Relationships of Scaled Variables and Index chart

    To interpret the matrix chart, view the bottom row, which shows the correlation of the index score to each of the indicators. Indicators that are correlated are shown in green and indicators that are inversely correlated are in pink. Because the matrix chart does not show any two variables are highly correlated, there is no need to remove any redundant variables.

    Note:

    To learn more about additional considerations when creating and evaluating an index, consult the Creating Composite Indices Using ArcGIS: Best Practices (PDF). The document includes recommendations for using cluster and regression analysis tools to further explore index results on page 42.

Obtain disaggregated census data

Data disaggregated by race and ethnicity is a crucial step in any analysis for ensuring equity in our society. By breaking down data into more specific categories, you can begin to uncover patterns of inequity and better understand which groups of people are being most affected by certain policies or practices.

In this section, you will obtain demographic data for specific Asian groups in Hennepin County from the U.S. Census Bureau Data website.

  1. Go to data.census.gov.
  2. Click Advanced Search.

    Advanced Search on the Explore Census Data page

  3. On the side panel, under Geography, click Census Tract.

    Census Tract in the menu for Geography

    A list of states appears.

  4. In the list of states, click Minnesota and click Hennepin County, Minnesota. Check All Census Tracts within Hennepin County, Minnesota.

    All Census Tracts within Hennepin County, Minnesota checked

  5. In the side panel, under Topics, click Race and Ethnicity.

    A list of categories for race and ethnicity appears.

  6. Click Asian and check the box for Asian.
  7. Click the Tables tab. Choose B02015 Asian Alone by Selected Groups.

    The B02015 table in the Tables tab

  8. On the ribbon above the table, click Margin of Error to deactivate it and click the CSV button to download a .csv file of the data.

    Margin of Error deactivated and the CSV button

    Tip:

    If necessary, you may need to collapse or minimize the Tables pane to see the CSV button.

    The .csv file downloads to your computer.

    Note:

    If you were unable to download the .csv, you can download a copy of it here.

  9. Open the .csv in Microsoft Excel.
    Note:

    This tutorial uses a non-coding workflow in a spreadsheet editing software to prepare the data for use in ArcGIS Pro. There are other methods to prepare data for a GIS using macros or Python to prepare data. See Creating an economic distress index layer using Census data with Python and Performing principal component analysis (PCA) to determine weights for index indicators.

    Next, you will transpose the columns and rows so that the census tracts are each of the records.

  10. Press Ctrl+A to highlight all the cells and press Ctrl+C to copy.
  11. At the bottom of the spreadsheet, click the New sheet button.

    New sheet button

  12. Right-click cell A1 of the new sheet. Under Paste options, click Transpose.

    Transpose paste option for the A1 cell

    The table is transposed.

    Table transposed on the new sheet

  13. Expand the width of Column A.

    Column A expanded

    You want to separate the text that includes the words Census Tract and the numbers that follow so you have an identifying field when you run the Join Field tool in ArcGIS Pro.

  14. Right-click Column A and click Insert.

    A new column is added.

    Next, you will use the Left function in Microsoft Excel to return all the text in column B that is to the left of the first comma it finds.

  15. In cell A2, type =LEFT(B2,(FIND(",",B2,1)-1)) and press Enter.

    The function successfully returns the text left of the common from Column B.

    Left function result

  16. Select cell A2 and point to the bottom right corner of the cell. When you see the pointer symbol change to a plus sign, click and drag downward to fill the rest of the column with the same function.

    Drag function for the remaining cells in Column A

    The function has been completed for all the rows.

    Function filled for Column A

  17. Scroll back to the top of the spreadsheet. For A1, type Tract Name. For C1, type Total Asian.

    Updated field names

  18. Click File and choose Save As. Save the file to a location you will be able to easily access like your Desktop folder. Be sure to save the file as a .csv file.

You have obtained demographic data for specific Asian groups by tracts in Hennepin County, Minnesota. You used Excel to prepare the data to join to your index data in ArcGIS Pro.

Note:

ArcGIS Living Atlas of the World contains many datasets for race and ethnicity categories, including one for specific Asian groups. While it is a reliable and growing library of data, there may be circumstances where ArcGIS Living Atlas does not yet have the specific data your community or scenario requires. Therefore, this tutorial aimed to demonstrate how you might go about obtaining data directly from the Census. To see other examples of using race and ethnicity data from ArcGIS Living Atlas, see the ArcGIS tutorials Create a social equity index and Map and analyze food access.

Join census data

You now have a good baseline understanding of the indicators used in your senior food access index and you have obtained and prepared ethnicity data for specific Asian groups in your county. In this section, you will join the census data to the index and prepare the index for visualizing disaggregated data.

  1. In ArcGIS Pro, on the ribbon, on the Map tab, in the Layer section, click Add Data.

    Add Data button in the Layer group on the Map tab

  2. Browse to the folder where you saved the .csv file. Click the file and click OK.

    Folder containing the .csv file downloaded from the census website

    The .csv appears in the Contents pane under Standalone Tables.

    Next, you will use the Join Fields tool.

  3. On the ribbon, click the Analysis tab. In the Geoprocessing group, click Tools.

    Tools on the Geoprocessing group in the Analysis tab

    The Geoprocessing pane appears.

  4. In the Geoprocessing pane, in the search bar, type join field. In the list of results, click the Join Field tool.

    Join field in the Geoprocessing pane

    The Join Field tool appears.

  5. In the Join Field tool pane, choose the following parameters:
    • For Input Table, choose Index Score.
    • For Input Join Field, choose Tract Name.
    • For Join Table, choose your .csv table.
    • For Join Table Field, choose Tract Name.

    Parameters entered in the Join Field

    Next, you will choose the fields from your .csv that you want to join to the Index Score layer.

  6. For Transfer Fields, click the Add Many button. Check all the fields from Total Asian to Vietnamese and click Add.

    Add Many menu with all the specific Asian groups checked and the Add button

    The fields add to the Join Field tool pane.

    Fields added to the Join Field tool pane

  7. Click Run.
  8. In the Contents pane, right-click Index Score and click Attribute Table.

    The table for Index Score appears.

  9. In the Index Score table, scroll until you see the fields for specific Asian groups you just joined.

    Specific Asian Groups joined to the Index Score feature layer in the attribute tables

    You have successfully joined the census demographic data to the Index Score layer.

  10. Close the table.
  11. On the Quick Access Toolbar, click Save to save the project.

    Save on the Quick Access Toolbar

In this section, you explored the indicators of a composite index using data engineering and charts. You accessed and joined disaggregated census race and ethnicity data. Next, you will visualize the index disaggregated by the major Asian specific groups to better target outreach efforts to improve senior food access.


Visualize disaggregated data and identify priority areas

In many examples of equity workflows, using quintiles or deciles is a useful way to measure access and opportunity distributions across a project area.

In this tutorial, you want to understand the distribution of insufficient food access indicators among older adults. You will divide the distribution of the composite index score for these indicators into five equal parts. This means the first quintile contains the lowest fifth of the population for inadequate food access (for example, the 20 percent of the population with the lowest food insecurity). The fifth quintile represents the top 20 percent of the population with the highest food insecurity, meaning this population would require the most support and funding for an equitable approach to improving food access.

Reclassify scores by quintiles

You will use the Reclassify tool to assign a quintile category for each of the census tracts based on their Index Score values.

  1. If necessary, on the ribbon, on the Analysis tab, in the Geoprocessing group, click Tools to open the Geoprocessing pane.
  2. In the Geoprocessing pane, search for and open the Reclassify Field tool.
  3. In the Reclassify Field tool pane, enter the following:
    • For Input Table, choose Index Score.
    • For Field to Reclassify, choose Index Score - Mean (Percentile).
    • For Reclassification Method, choose Quantile.
    • For Number of Classes, type 5.

    Reclassify Field parameters entered

  4. Click Run.
  5. In the Contents pane, right-click Index Score and choose Symbology.
  6. In the Symbology pane, for Primary Symbology, choose Graduated colors. For Field, choose INDEX_PCTRL_QUANTILE_CLASS.
  7. For Color scheme, choose Blue Purple (5-Classes).
    Tip:

    To see the name of a color ramp, point to the color ramp.

    Color ramp set for Graduated Colors style

    The map updates to show the census tracts in Hennepin County by Index Score values in quintiles.

    Map styled by index score quintiles

  8. Save the project.

Create chart to identify largest groups

In this section, you will create charts to visualize disaggregated data by the largest specific Asian groups in the county.

  1. In the Contents pane, right-click Index Score, point to Create Chart, and click Pie Chart.

    Pie Chart in the Create Chart menu for the Index Score layer

    A blank chart view and the Chart Properties pane appear.

  2. In the Chart Properties pane, under Variables, for Numeric fields(s), click the Select button.

    Select button under Numeric field(s) in the Chart Properties pane

  3. Check the boxes for all the specific Asian groups and click Apply.

    The specific Asian groups checked and the Apply button in the Numeric fields menu

    The fields add to the Chart Properties pane and the chart view shows a pie chart.

  4. In the Chart Properties pane, click the General tab. For Chart title, replace the existing text with Specific Asian Groups.

    Chart title entered on the General tab on the Chart Properties pane

    The chart title updates in the chart view.

    Pie chart showing the Specific Asian Groups in Hennepin County

    The five largest specific Asian groups in the county are Hmong; Asian Indian; Chinese, except Taiwanese; Vietnamese; and Korean. Next, you will create disaggregated bar charts for each of these specific groups and a comparison bar chart for Total Asians.

  5. In the Contents pane, right-click Index Score, point to Create Chart, and choose Bar Chart.
  6. In the Chart Properties pane, under Variables, for Category or Date, choose INDEX_PCTL_QUANTILE_CLASS.
  7. For Numeric fields, click the Select button and check the box for Total Asian and click Apply.

    Total Asian adds to the Chart Properties pane.

    Total Asian added under Numeric fields in the Chart Properties pane

  8. For Split by (optional), choose INDEX_PCTL_QUANTILE_CLASS.

    Next, you will rename the chart title and adjust the bar colors so that the chart better matches the map symbology.

  9. In the Chart Properties pane, click the General tab. For Chart title, delete the existing text and type Total Asian.
  10. Click the Series tab.

    Series tab on the Chart Properties

  11. Click the symbol for 1 and choose Sodalite Blue.
    Tip:

    To see the name of a color, point to the color.

    Sodalite Blue on the color palette for the 1 Value symbol

  12. Choose the following colors for the remaining Values:
    • For 2, choose Sugalite Sky.
    • For 3, choose Violet Dust.
    • For 4, choose Aster Purple.
    • For 5, choose Blackberry.

    Colors on the color palette

    The chart for Total Asian is complete.

    Chart for Total Asian split by index scores quintiles

    You now have a chart that shows the number of the Total Asian population disaggregated by the index score quintile values. For all the census tracts belonging to each quintile, the population in the Total Asian field is summed up, and the result is shown in the chart.

    The Total Asian population living in the highest quintile group is just under 12,000. People living in census tracts in the fifth quintile are more likely to experience inadequate access to food based on the index. Overall, most of the Total Asian population is in the lowest quintile group for inadequate food access and, therefore, thriving. This group is shown in the first quintile of the chart.

    Next, you will duplicate this chart to show disaggregated data for the Hmong specific Asian group

  13. Close any open charts.
  14. Save the project.
    Tip:

    You can also save the project by pressing Ctrl+S.

Create disaggregated charts for specific groups

You will duplicate the Total Asian chart and configure five additional charts for the five largest specific Asian groups. The disaggregated data charts will enable you to better understand which Asian seniors may be experiencing inadequate access to food.

  1. In the Contents pane, right-click the Total Asian chart and click Duplicate.

    Duplicate for the Total Asian chart

  2. Double-click the duplicated chart to open it.
  3. In the Chart Properties pane, under Numeric fields(s), click the Select button.
  4. Uncheck Total Asian and check Hmong. Click Apply.

    Uncheck Total Asian, check Hmong, and click the Apply button in the Numeric fields menu.

    The chart updates with data for Hmong. Next, you will rename the chart title.

  5. Click the General tab. For Chart title, replace the existing text with Hmong.

    You now have a disaggregated chart for the Hmong group.

    Hmong chart split by index score quintiles

  6. Use what you have learned to create charts for Asian Indian; Chinese, except Taiwanese; Vietnamese; and Korean specific groups.
    Tip:

    • Duplicate the Total Asian chart.
    • Open the duplicated chart.
    • In the Chart Properties pane, for Numeric fields, click Select. Uncheck Total Asian and check the specific Asian group.
    • Click the General tab and update the Chart title for the specific Asian group.

    You have created six charts showing disaggregated index score data for specific Asian groups and the Total Asian populations.

    Charts completed in the Contents pane

    Next, you will arrange the charts in a way that you can view all six at the same time to compare the data.

  7. If necessary, open all six of the bar charts.
  8. Drag the tab for the Total Asian bar chart so it is a separate window.

    Drag Total Asian chart into a separate window.

  9. If necessary, resize the window for the Total Asian chart.

    Total Asian chart in a separate window

  10. Drag the Hmong chart tab and drop it in the right dock of the Total Asian chart window.

    Drag the Hmong chart to dock to the right of the Total Asian chart

    Now the Hmong chart is side-by-side with the Total Asian chart in a separate window.

    Charts visible side by side in a separate window

  11. Continue dragging each of the charts into this separate window so that there are two rows of three charts. Rearrange and adjust the size of the windows so they are all relatively even.

    You can now view all six charts at the same time and compare the disaggregated data.

    The six disaggregated charts visible side by side for comparison

You have reviewed the quintile map and understood that the fifth quintile represents the highest need for access to food for older adults. Therefore, the dark purple areas show a higher need. Now you can answer the question: which groups may be experiencing food insecurity?

In the charts disaggregated by specific Asian groups, you can quickly see high-level patterns within each group and the proportion of folks within each quintile. Asian Indian and Korean communities have a higher proportion of people living in higher-need tracts.

This will inform what kinds of programs, policies, and communication strategies to increase food access for older adults and language needs to be in place to reach this audience.

Identify priority areas

Next, you will use data engineering and the Select by Attribute tool to narrow in on the areas in the county to create priority areas. You will identify these areas based on high index scores, specific Asian groups, and inadequate food access for older adults.

You will start with the Korean group. First, you will select the Census tracts with a higher presence of the Korean population.

  1. In the Contents pane, right-click Index score and choose Data Engineering.
  2. Drag the Korean field into the statistics panel. Click Calculate.
  3. Right-click the Third Quartile value for Korean, point to Select and choose Above Quartile.

    Above Quartile in the Select menu for the Third Quartile column for the Korean field in the Data Engineering view

    The tracts selected on the map.

    Tracts with the highest percentages of Korean population highlighted

    Next, from the tracts already selected, you will keep the ones where older adults have inadequate access to food.

  4. On the ribbon, on the Map tab, in the Selection group, click Select By Attributes.

    Select By Attributes in the Selection group on the Map tab

  5. In the Select By Attributes window, for Selection Type, choose Select subset from the current selection.
  6. For Expression, build the expression Where INDEX_PCTL_QUANTILE_CLASS is equal to 5.

    Expression built to select the highest quintile for index score among the currently selected tracts

  7. Click Apply.

    The tracts that remain selected on the map have both the highest quartile of Korean population and represent areas with the highest need for food access support for older adults.

    Map highlighting tracts with the highest percentages of the Korean population and highest index scores

With this information, you can now plan for a more targeted outreach program with consideration for language needs and location focus and prioritize funding allocation.

In this tutorial, you started with an index that did not include disaggregated data, but you were able to indirectly estimate some of the inequity by joining Census data for specific Asian groups to the index data. Obtaining and combining the data measured by race and ethnicity is the best way to ensure you can accurately apply your equity lens.

It is important to note that this workflow only covers a component of the racial equity and social justice workflow. It only focuses on mapping, analyzing, and operationalizing the data. The racial equity and social justice workflow also includes managing performance and engaging communities. To engage communities, consider sharing the results of the index and the disaggregated charts with community groups and community-based organizations and gathering feedback. There may be experiences and nuances in the local community that will confirm the findings or identify gaps in the way the data was used. The process of developing an equity plan is iterative. It requires gathering data, analyzing, sharing, and repeating as new understanding and data are uncovered through community engagement.

Note:

This tutorial was created based on Applying an equity lens to your index.

GIS professionals have a unique opportunity to tackle complex social issues by combining their expertise in spatial data analysis with a commitment to equitable, inclusive outcomes. In this tutorial, you used one method to disaggregate the index by race and ethnicity. It is one of many approaches to applying an equity lens to GIS workflows. Continue exploring and learning more methods and strategies to create more meaningful and accurate ways to use a composite index for real-world applications.

You can find more tutorials in the tutorial gallery.