This is a study of gerrymandering in Alabama. We will test three methods of shape-based compactness scores, assess representativeness of districts based on prior presidential elections and race. We will then extend prior studies by calculating representativeness of the convex hull of district polygons.
Key words
: Alabama, gerrymandering, compactness, convex
hull, political representationSubject
: Social and Behavioral Sciences: Geography:
Geographic Information SciencesDate created
: 2025-02-17Date modified
: 2025-02-17Spatial Coverage
: Alabama OSM:161950Spatial Resolution
: Census Block GroupsSpatial Reference System
: EPSG:4269 NAD 1983 Geographic
Coordinate SystemTemporal Coverage
: 2020-2024 population and voting
dataTemporal Resolution
: Decennial censusThis is an original study based on literature on gerrymandering metrics.
It is an exploratory study to evaluate usefulness of a new gerrymandering metric based on the convex hull of a congressional district and the representativeness inside the convex hull compared to the congressional district.
I plan on using package … for …
WE plan on using data sources …. , … ….
Several data layers are compiled and provided in the
districts
geopackage.
districts_file <- here("data", "raw", "public", "districts.gpkg")
st_layers(districts_file)
## Driver: GPKG
## Available layers:
## layer_name geometry_type features fields crs_name
## 1 districts21 Multi Polygon 7 4 WGS 84
## 2 districts23 Multi Polygon 7 4 NAD83
## 3 precincts20 Multi Polygon 1972 8 NAD83
The congressional districts enacted in 2021 were used in the 2022 mid-term elections, and then deemed unconstitutional gerrymanders by the Supreme Court.
Load the districts.
districts21 <- st_read(districts_file, layer="districts21")
## Reading layer `districts21' from data source
## `C:\GitHub\josephholler\OR-Gerrymander-Alabama\data\raw\public\districts.gpkg'
## using driver `GPKG'
## Simple feature collection with 7 features and 4 fields
## Geometry type: MULTIPOLYGON
## Dimension: XY
## Bounding box: xmin: -88.47323 ymin: 30.14443 xmax: -84.88825 ymax: 35.00803
## Geodetic CRS: WGS 84
Map the districts
tmap_mode(mode = "plot")
## ℹ tmap mode set to "plot".
districts21map <- districts21 |>
tm_shape() +
tm_polygons(fill_alpha = 0,
col = "red") +
tm_labels(text = "DISTRICT",
col="red",
bgcol = "white",
bgcol_alpha = 0.5,
on_surface = TRUE,
just = c("center", "center")
)
##
## ── tmap v3 code detected ───────────────────────────────────────────────────────
## [v3->v4] `tm_text()`: migrate the layer options 'just' to 'options =
## opt_tm_text(<HERE>)'[tm_text()] Argument `on_surface` unknown.
districts21map
includeMarkdown(here("data", "metadata", "precincts20.md"))
Title
: Voting Precincts 2020Abstract
: Alabama voting data for 2020 elections by precinct.Spatial Coverage
: AlabamaSpatial Resolution
: Voting precinctsSpatial Reference System
: EPSG 4269 NAD 1983 geographic coordinate systemTemporal Coverage
: precincts used for tabulating the 2020 censusTemporal Resolution
: annual electionLineage
: Saved a sgeopackage format. Processing prior to download is explained in al_vest_20_validation_report.pdf and readme_al_vest_20.txtDistribution
: Data available at Redistricting Data Hub with free loginConstraints
: Permitted for noncommercial and nonpartisan use only. Copright and use constrains explained in redistrictingdatahub_legal.txtData Quality
: State any planned quality assessmentVariables
: For each variable, enter the following information. If you have two or more variables per data source, you may want to present this information in table form (shown below)
Label
: variable name as used in the data or codeAlias
: intuitive natural language nameDefinition
: Short description or definition of the variable. Include measurement units in description.Type
: data type, e.g. character string, integer, realAccuracy
: e.g. uncertainty of measurementsDomain
: Expected range of Maximum and Minimum of numerical data, or codes or categories of nominal data, or reference to a standard codebookMissing Data Value(s)
: Values used to represent missing data and frequency of missing data observationsMissing Data Frequency
: Frequency of missing data observations: not yet known for data to be collectedLabel | Alias | Definition | Type | Accuracy | Domain | Missing Data Value(s) | Missing Data Frequency |
---|---|---|---|---|---|---|---|
VTDST20 | … | Voting district ID | … | … | … | … | … |
GEOID20 | … | Unique geographic ID | … | … | … | … | … |
G20PRETRU | … | total votes for Trump in 2020 | … | … | … | … | … |
G20PREBID | … | total votes for Biden in 2020 | … | … | … | … | … |
We acquire decennial census data in block groups using the
tidycensus
package. First, query metadata for the
pl
public law data series.
census_metadata_file <- here("data", "metadata", "census2020pl_vars.csv")
if(file.exists(census_metadata_file)){
census2020pl_vars <- read.csv(census_metadata_file)
} else {
census2020pl_vars <- load_variables(2020, "pl")
write.csv(census2020pl_vars, here("data", "metadata", "census2020pl_vars.csv"))
}
The issue in the 2023 court cases on Alabama’s gerrymandering was a
racial gerrymander discriminating against people identifying as Black or
African American. Therefore, we will analyze people of voting age (18 or
older) identifying as Black and or African as one race in any
combination with other races. This data is found in table
P3
.
Query the public law data series table P3 on “race for the population 18 years and over”.
blockgroup_file <- here("data", "raw", "public", "block_groups.gpkg")
# if the data is already downloaded, just load it
# otherwise, query from the census and save
if(file.exists(blockgroup_file)){
blockgroups <- st_read(blockgroup_file)
} else {
blockgroups <- get_decennial(geography = "block group",
sumfile = "pl",
table = "P3",
year = 2020,
state = "Alabama",
output = "wide",
geometry = TRUE,
keep_geo_vars = TRUE)
st_write(blockgroups, blockgroup_file)
}
## Reading layer `block_groups' from data source
## `C:\GitHub\josephholler\OR-Gerrymander-Alabama\data\raw\public\block_groups.gpkg'
## using driver `GPKG'
## Simple feature collection with 3925 features and 83 fields (with 1 geometry empty)
## Geometry type: MULTIPOLYGON
## Dimension: XY
## Bounding box: xmin: -88.47323 ymin: 30.22333 xmax: -84.88908 ymax: 35.00803
## Geodetic CRS: NAD83
We have previously investigated the compactness scores of Alabama’s congressional districts as well as the percentage of Biden voters from the 2020 elections and the percentage of the population 18 years or older that is not Hispanic and is Black or African American.
We have never calculated the minimum bounding circle or convex hulls of Alabama’s congressional districts.
This study is explicitly an investigation to the modifiable areal unit problem. Aspects of the study are extremely sensitive to the combination of edge effects and scale, whereby complex borders formed by natural features, e.g. coastlines or rivers, vary greatly in perimeter depending on the scale of analysis. We hope that in part, this study establishes a method that is more robust (less sensitive) to the threats to validity caused by scale and edge effects in studies of gerrymandering and district shapes.
Transform Districts into NAD 1983 coordinate system, and calculate the percentage of population identifying as Black.
districts21 <- districts21 |> st_transform(4269) |>
mutate(pctBlack = round(BLACK / POPULATION * 100, 1))
Census data needs to be transformed from the WGS 1984 geographic coordinate system to the NAD 1983 geographic coordinate system.
blockgroups <- st_transform(blockgroups, 4269)
Find the total of people identifying as Black or African American as one race or any combination of multiple races. First, make a list of all the variables inclusive of people identifying as Black or African American.
black_vars <- census2020pl_vars |>
dplyr::filter(str_detect(name, "P3"),
str_detect(label, "Black")) |>
select(-concept)
black_vars |> kable()
X | name | label |
---|---|---|
151 | P3_004N | !!Total:!!Population of one race:!!Black or African American alone |
158 | P3_011N | !!Total:!!Population of two or more races:!!Population of two races:!!White; Black or African American |
163 | P3_016N | !!Total:!!Population of two or more races:!!Population of two races:!!Black or African American; American Indian and Alaska Native |
164 | P3_017N | !!Total:!!Population of two or more races:!!Population of two races:!!Black or African American; Asian |
165 | P3_018N | !!Total:!!Population of two or more races:!!Population of two races:!!Black or African American; Native Hawaiian and Other Pacific Islander |
166 | P3_019N | !!Total:!!Population of two or more races:!!Population of two races:!!Black or African American; Some Other Race |
174 | P3_027N | !!Total:!!Population of two or more races:!!Population of three races:!!White; Black or African American; American Indian and Alaska Native |
175 | P3_028N | !!Total:!!Population of two or more races:!!Population of three races:!!White; Black or African American; Asian |
176 | P3_029N | !!Total:!!Population of two or more races:!!Population of three races:!!White; Black or African American; Native Hawaiian and Other Pacific Islander |
177 | P3_030N | !!Total:!!Population of two or more races:!!Population of three races:!!White; Black or African American; Some Other Race |
184 | P3_037N | !!Total:!!Population of two or more races:!!Population of three races:!!Black or African American; American Indian and Alaska Native; Asian |
185 | P3_038N | !!Total:!!Population of two or more races:!!Population of three races:!!Black or African American; American Indian and Alaska Native; Native Hawaiian and Other Pacific Islander |
186 | P3_039N | !!Total:!!Population of two or more races:!!Population of three races:!!Black or African American; American Indian and Alaska Native; Some Other Race |
187 | P3_040N | !!Total:!!Population of two or more races:!!Population of three races:!!Black or African American; Asian; Native Hawaiian and Other Pacific Islander |
188 | P3_041N | !!Total:!!Population of two or more races:!!Population of three races:!!Black or African American; Asian; Some Other Race |
189 | P3_042N | !!Total:!!Population of two or more races:!!Population of three races:!!Black or African American; Native Hawaiian and Other Pacific Islander; Some Other Race |
195 | P3_048N | !!Total:!!Population of two or more races:!!Population of four races:!!White; Black or African American; American Indian and Alaska Native; Asian |
196 | P3_049N | !!Total:!!Population of two or more races:!!Population of four races:!!White; Black or African American; American Indian and Alaska Native; Native Hawaiian and Other Pacific Islander |
197 | P3_050N | !!Total:!!Population of two or more races:!!Population of four races:!!White; Black or African American; American Indian and Alaska Native; Some Other Race |
198 | P3_051N | !!Total:!!Population of two or more races:!!Population of four races:!!White; Black or African American; Asian; Native Hawaiian and Other Pacific Islander |
199 | P3_052N | !!Total:!!Population of two or more races:!!Population of four races:!!White; Black or African American; Asian; Some Other Race |
200 | P3_053N | !!Total:!!Population of two or more races:!!Population of four races:!!White; Black or African American; Native Hawaiian and Other Pacific Islander; Some Other Race |
205 | P3_058N | !!Total:!!Population of two or more races:!!Population of four races:!!Black or African American; American Indian and Alaska Native; Asian; Native Hawaiian and Other Pacific Islander |
206 | P3_059N | !!Total:!!Population of two or more races:!!Population of four races:!!Black or African American; American Indian and Alaska Native; Asian; Some Other Race |
207 | P3_060N | !!Total:!!Population of two or more races:!!Population of four races:!!Black or African American; American Indian and Alaska Native; Native Hawaiian and Other Pacific Islander; Some Other Race |
208 | P3_061N | !!Total:!!Population of two or more races:!!Population of four races:!!Black or African American; Asian; Native Hawaiian and Other Pacific Islander; Some Other Race |
211 | P3_064N | !!Total:!!Population of two or more races:!!Population of five races:!!White; Black or African American; American Indian and Alaska Native; Asian; Native Hawaiian and Other Pacific Islander |
212 | P3_065N | !!Total:!!Population of two or more races:!!Population of five races:!!White; Black or African American; American Indian and Alaska Native; Asian; Some Other Race |
213 | P3_066N | !!Total:!!Population of two or more races:!!Population of five races:!!White; Black or African American; American Indian and Alaska Native; Native Hawaiian and Other Pacific Islander; Some Other Race |
214 | P3_067N | !!Total:!!Population of two or more races:!!Population of five races:!!White; Black or African American; Asian; Native Hawaiian and Other Pacific Islander; Some Other Race |
216 | P3_069N | !!Total:!!Population of two or more races:!!Population of five races:!!Black or African American; American Indian and Alaska Native; Asian; Native Hawaiian and Other Pacific Islander; Some Other Race |
218 | P3_071N | !!Total:!!Population of two or more races:!!Population of six races:!!White; Black or African American; American Indian and Alaska Native; Asian; Native Hawaiian and Other Pacific Islander; Some Other Race |
Next, calculate new columns. Black
is a sum of all 32
columns shown above, in which any of the racial categories by which
someone identifies is Black or African American.
Total
is a copy of the population 18 years or over,
variable P3_001N
.
PctBlack
is calculated as
Black / Total * 100
CheckPct
is calculated as the percentage of the population
18 years or older that is either white of one race only
(P3_003N
) or Black or African American as
calculated above. In Alabama, we can expect that this will be close to
100% for most block groups, and should never exceed 100%.
blockgroups_calc <- blockgroups |>
rowwise() |>
mutate(Black = sum(c_across(all_of(black_vars$name)))) |>
ungroup() |>
mutate(bgarea = st_area(geom),
Total = P3_001N,
PctBlack = Black / Total * 100,
CheckPct = (Black + P3_003N) / Total * 100
) |>
select(GEOID, bgarea, Black, Total, PctBlack, CheckPct)
Save the results as blockgroups_calc.gpkg
st_write(blockgroups_calc,
here("data", "derived", "public", "blockgroups_calc.gpkg"),
append=FALSE)
Map the percentage of the population 18 or over that is Black or African American.
tmap_mode(mode = "plot")
blkgrp_black_map <- tm_shape(blockgroups_calc) +
tm_polygons(
fill = "PctBlack",
col_alpha = 0.2,
lwd = 0.1,
col = "grey90"
)
blkgrp_black_map
Make an interactive map of the 2021 districts over the black
population
tmap_mode(mode = "view")
## ℹ tmap mode set to "view".
blkgrp_black_map +
districts21map
## Registered S3 method overwritten by 'jsonify':
## method from
## print.json jsonlite
## Variable bgcol and bgcol_alpha not supported by view mode