Does increased spending on education result in improved students’ performance?
Research Question:
Design choices:
The first bivariate map below is showing the combination of the total spending on education per student and the average demonstrated proficiency in Reading and Langguage Arts, while the second one - the relationship between the total spending on education per student and the average demonstrated proficiency in Math.


We can see that the increased total spending on a student resulted in increased proficiency test scores, in both math and RLA, in such states Illinois, New York, Maryland, Connecticut, and District of Columbia. In New Hampshire and Ohio, increased total spending correlate with the increased performance on RLA, but with average performance in math, while in North Dakota, above average spending is associated with increased math and average RLA performances. Aligning with this trend are the states such as Arizona, Oklahoma, Missouri, Tennessee, Mississippi, where the total spend on Education was the lowest relative to other states, and the students demonstrated the lowest scores in both math and RLA. In Florida, lowest relative spending is associated with below average RLA scores and average math performance.
However, this trend is not categorical across all the states. States like Texas, Idaho, Colorado, and Georgia demonstrated relatively high RLA and math scores while they spend less per student. Similarly, Utah reports one of the lowest total spending, but demonstrates above average performance in RLA, and average scores in math, and South Dakota and North Carolina are among the lowest total spenders, but its students perform above average in math and average on RLA. At the same time, Maine exhibits the opposite pattern: the high total spending on Education there negatively correlates with the students’ both RLA and math proficiency. The situation is similar in Rhode Island, where the education spendings are high, but students demonstrate poor math and average RLA scores.
The first bivariate map below is showing the combination of the total spending on instruction per student and the average demonstrated proficiency in Reading and Language Arts, while the second one - the correlation between the total spending on instruction and the average math scores.


The states mapped in gray did not have their spending data reported.
We can observe the general tendency of positive correlation between Expenditure on Instruction and math and RLA proficiency. In New York, Maryland, District of Columbia, and Connecticut increased spending on instruction results in both increased math and RLA scores. Similarly, increased spending in Nebraska and Pennsylvania correlates with increased math proficiency and average RLA scores, and in New Hampshire - with average math and increased RLA performance. Arizona, Oklahoma, New Mexico, Tennessee, Mississippi, report the lowest total spend on Instruction, which correlates with the relatively low demonstrated proficiency in both math and RLA by students of those states.
Interestingly, Texas, Colorado and Idaho that demonstrated the high RLA and math scores despite small total spending, exhibit the same pattern here: high math scores and high RLA scores despite low spending on Instruction. The experience of those states should be looked into in more detail to try to figure out what exactly is done differently in those states to have such amazing results. On the flip side, Maine again exhibits the opposite pattern: the high instructional spending negatively correlates with the students’ math and RLA proficiency. Here (i.e., when accounting for instructional spending only), the same trend is observed in Minnesota and Hawaii with regards to both math and RLA scores.
The first bivariate map below is showing the combination of the total spending on special education per student and the average demonstrated proficiency of students with disabilities in Reading and Language Arts, while the second one - the correlation between the total spending on special education and the average math scores of students with disabilities.


The states mapped in gray did not have their spending data reported.
The two plots compare the spending and the performance of only students with disabilities, so there it no need to compare with previous plots. The most interesting observation, in my opinion, is that there are no states that report higher than average spending on students with disabilities but where students demonstrate poor performance in math, and there is only one state, Vermont, that is among the higher spenders on special education but that also reports below average performance in RLA. It gives us ground to say, that increased spending on special education is never wasted and very often results in improved performance.
The second interesting observation, is that there are a lot more states that show correlation between increased spending on Special Ed and above average performance in math (e.g., New York, Pennsylvania, Virginia, North Carolina, Michigan. etc.), compared to the results of students with disabilities in RLA, and to results of analysis of other categories. This suggests that increased spending on special education boosts math performance of students with disabilities in particular.
The first bivariate map below is showing the combination of the total spending on textbooks per student and the average demonstrated proficiency in Reading and Language Arts, while the second one - the correlation between the total spending on textbooks and the average math scores.


The states mapped in gray did not have their spending data reported.
The most interesting observation, in my opinion, is that there are states (e.g., Texas, Georgia, North Carolina, etc.)who were among “low spenders but high performers” in the previous analysis, but the increased spending on textbooks actually directly correlates with improved performance in both RLA and Math. It might suggests, that when everything else is controlled for, increased spending on textbooks might be that deciding factor that determines students performance. If that is indeed the case, it is very empowering, given how small the expenditure on textbook is compared to instruction, for example. no states that report higher than average spending on students with disabilities but where students demonstrate poor performance in math, and there is only one state, Vermont, that is among the higher spenders on special education but that also reports below average performance in RLA. It gives us ground to say, that increased spending on special education is never wasted and very often results in improved performance.
The second interesting observation, is that there are a lot more states that show correlation between increased spending on textbooks and above average performance in math (e.g., New York, Pennsylvania, Virginia, North Carolina, Michigan. etc.), compared to RLA scores. This matches the similar trend in Special education funding, where increase in spending affects performance in math more significantly compared to RLA. This may suggest that math performance might be more susceptible to changes in expenditure and depend on it to a certain extent, while performance on RLA is primarily determined by other factors, unrelated to expenditure.
#Plotting bivariate map to express relations between RLA score and total spending
biscale2 <-
scores_instruction %>%
group_by(STNAME) %>%
summarize(
rla = mean(mean_rla_all, na.rm=TRUE),
instr = mean(instr_per_stu_k, na.rm=TRUE)
) %>%
bi_class(x = rla, y = instr, style = "quantile", dim = 3)
names(biscale2)[1]<-"full"
map_join2<- biscale2%>%left_join(statepop, by= "full")
p3<-plot_usmap(data = map_join2, values = "bi_class", labels = TRUE, label_color = "black") +
bi_scale_fill(pal = "DkCyan", dim = 3, guide = F) +
scale_x_continuous(breaks = NULL) +
scale_y_continuous(breaks = NULL) +
theme(
plot.title = element_text(margin = margin(b = 8),
color = "#ffffff",face = "bold",size = 9,
hjust = 0.5,
family = "Arial"),
plot.subtitle = element_text(margin = margin(t=10,b = 25),
color = "#ffffff", size = 6, family = "Arial",
hjust = 0.5),
plot.caption = element_text(margin = margin(t = 20),
color = "#ffffff", size = 5, family = "Arial",
hjust = 0.95),
axis.title.x = element_blank(),
axis.title.y = element_blank(),
legend.title = element_blank(),
axis.text.x = element_blank(),
axis.text.y = element_blank(),
panel.background = element_blank(),
panel.grid.major = element_blank(),
panel.grid.major.y = element_blank(),
panel.grid.minor = element_blank(),
plot.background = element_rect(fill = "#f3f3f3", color = NA),
panel.border = element_blank(),
plot.margin = unit(c(1, 1, 1, 1), "cm"),
axis.ticks = element_blank()
)
p3$layers[[2]]$aes_params$size <- 2.5
legend_US3<-
bi_legend(pal = "DkCyan",
dim = 3,
xlab = "Proficiency in RLA",
ylab = "Instructional Spend",
size = 30) +
theme(rect = element_rect(fill = "grey10"),
panel.border = element_blank(),
axis.text = element_blank(),
plot.background = element_rect(fill = "#f3f3f3", color = NA),
axis.title.x = element_text(size = 12,
color = "grey20",
face = "bold"),
axis.title.y = element_text(size = 12,
color = "grey20",
face = "bold"),
legend.text = element_text(size = 5),
legend.text.align = 0)
ggdraw() +
draw_plot(p3, 0, 0, 1, 1) +
draw_plot(legend_US3, 0, 0.1, 0.2, 0.3) +
draw_label("Source:National Center for Education Statistics)",
color = "#a1a1a1", size = 7.5, angle = 0, x = 0.9, y = 0.05) +
draw_label("In which states the total spending on Instruction per student positively affects performance in RLA?",
color = "#000000", size = 15, angle = 0, x =0.5, y = 0.97, fontface = "bold") +
theme(plot.background = element_rect(fill = "#f3f3f3", color = NA))
Prior Versions

The first version of the bivariate map visualization features lables of states in default sizing, that I ended up changing, specifically to have the Northeast region look neater. The labels were also in white color, while I opted for black. This plot also shows the original versions of the legend, title and subtitle, which I cleaned up in the final version.

This plot utilizes 2 dimensions to represent two variables instead of 3. I opted to have 3 dimensions to show more nuanced details, which should be of interest to policy makers (my intended audience). However, since we are interested in the extremes anyway (groups like ‘highest spending-poor performance’, ‘highest spending-best performance’), they are visible in the 2-dimensional map anyway. And even though with the expense of certain nuances, the 2-dimensional version is easier to interpret. Thus, even though I chose to use a 3-dimensional scale, I would recommend to use a 2-dimensional one if presented to the general public.

In an attempt to find the optimal palette that would make the interpretation of a 3-dimensional bivariate plot easier, I tried several other palettes. Here is the version with DkBlue palette, which ended being not color-blind friendly.

This version is rendered with the use of DkViolet palette. Unlike DkBlue it was suitable for all types of color-blindness, but I made a decision to not use because it is too close to those palettes used for political outcomes in the US for my liking.