Geographical representation of the expenditures by state
Research Question:
Design choices:
The plots below complement the bar plots published in the previous post and reflect the same spending patterns. However, while the bar plots focus on relative ranking, the choropleths put emphasis on the variation patterns, revealing the areas that are more homogeneous in their expenditure amounts, and accentuating any extreme outliers. The mid-point for the scale is set at the average national value, thus, all states mapped in cold colors (blue) spend below the national average, and the states in mapped in warm colors (brown) spend above the national average level.

The overall range of values has been restricted to exclude District of Columbia: it has the highest total spending per student across all states, significantly exceeding the runner-up state of New York. Including DC in the range would result in assigning it the darkest color which would not be visible on the map due to the small size of the District of Columbia. At the expense of DC, the visualization illustrates the overall variation more distinctly.


The states mapped in gray did not have their spending data reported.

The states mapped in gray did not have their spending data reported.
total_map %>%
ggplot(aes(geometry = geometry,
fill = spend_per_stu_k,
color = "transparent"),
alpha = 0.9) +
geom_sf(color = "#4f4a4a", size = 0.06) +
guides(fill = guide_colorsteps(barwidth = 15,
barheight = 1,
title.position = "top",
title.hjust = .5,
title = "Total Sending\n[1000 US$ / Year]")) +
scale_fill_scico( palette = 'vik',
midpoint = 13,
breaks = c(8,11,14, 17, 20, 23),
limits = c(7, 23))+
labs(title = "State total spendings on education per student",
caption = "Source: National Center for Education Statistics") +
theme_void(20) +
theme(legend.position = "bottom",
plot.title.position = "plot",
plot.title = element_text(hjust = .08),
plot.subtitle = element_text(hjust = .17))
The regional patterns revealed by the bar plots are easily observable on the maps. With regards to the total spending, the Northeast region is colored in warm brown colors to reflect above the average total spending of all the states in this area. At the same time, West and South are mapped in blue that corresponds to the lower than average spending of the states in those areas. The map is effective in showing that the majority of Midwestern states are very close to the national average level. Compared to the bar plot, the map makes it easier to draw attention to the outliers due to their highly saturated colors, especially when they don’t conform to the general pattern of the region. For example, Wyoming and Alaska that are much higher spenders compared to the rest of the Western states really stand out here.
The patterns on the map presenting instructional spending closely align with the patterns of the total spending (highest expenditure in the Northeast, lowest in the West). When looked at side by side, the two maps look almost identical - something that is not easily identifiable on the bar plots due to the emphasis on scaling in the latter.
The maps illustrating spending on special education and textbooks are not ideal, because the data on those spending categories was missing from a number of states. Nevertheless, the general patterns are easily noticeable. The highest spenders (Pennsylvania for textbooks and New Hampshire for special education) and the lowest spenders (South Dakota and Utah for special education; Kentucky, Alabama and Hawaii for textbooks) stand out and are easily distinguishable. The map for textbooks reveals the lack of any regional patterns.
Prior Versions
The previous iterations of the map visualization of the expenditure allocation mostly feature utilization of different palettes, since it was my biggest design choice in this part. I am providing the prior versions of the total expenditure visualization to illustrate the design evolution. The prior plots for instructional, special education, and textbook spending featured the same changes in format and design choices.

The first version of the map visualization features the viridis inferno palette. I opted not to use this color scheme because it is overly saturated and does not align with the aesthetic in other posts.

In the second attempt, I used a palette with a more natural and muted color scheme, bilbao. It lets the states on the higher end of spending scale stand out and be easily distinguishable. However, I did not like how the lighter browns do not fully reflect the variation in spending in the states that are on the lower spending side.

The third version used the same continuous scale, bilbao, but with the reversed scale. I tried this version in an atttempt to address the lack of variation created by the original bilbao. I believe, it helps make the variation in the states with lower total spending easily observable (e.g, compared to version 2, it is easier to see that Idaho, Utah, Oklahoma, Tennessee yield the lowest spending amount). At the same time, the Northeast in lighter colors, together with Alaska and Wyoming, stand apart. However, I was not completely satisfied with this version, because mapping higher spend with the lighter color seemed counter-intuitive to me.
This version also incorporates some minor refinements, including the position and size of the title, the height of the legend guide, different color of the state outline.


Next, I tried to apply the divergent palette, Earth, instead of a continuous scale. The first attempt at using it was not successful, but setting up the mean value to the mid-point fixed it. It appears to me, that it is counterintuitive to map highest spending amounts on the colder color, that is why I tried the reverse order of the palette. I was satisfied with the result, upon the whole, however, the representation of the variance was still lacking for me with the current colors. The temperature of the colors was there to reflect the difference, but I wanted more saturation as well.

Vik was the second divergent scale that O tried using. I was very pleased with the division along the average line: cold blue colors for low spending and warm brown colors for high ones. The saturation is not there yet in the current attempt, but I was able to achieve higher saturation range in the final version with restricting the limit and the breaks of the axis. I think the divergent scale is effective in not only depicting the variation and outliers, but also allows a better possibility for relative comparison. The final point in favor of the current palette was the fact hat Vik turned out to be more color-blind friendly.