Proposal

STA/ISS 313L - Spring 2026 - Project 1

Author

Team name

library(tidyverse)

Data

A brief description of your dataset, including its provenance (e.g., “The data were originally collected as part of Gorman, Williams, and Fraser (2014) and distributed via Data Science Learning Community (2024) and Horst, Hill, and Gorman (2020).”) and its dimensions, as well as your reason for choosing it.

Load the data and report some of this information (for example, the number of observations) with inline code rather than typing the values by hand, as in the example below.

penguins <- read_csv("data/penguins.csv")
Rows: 344 Columns: 8
── Column specification ────────────────────────────────────────────────────────
Delimiter: ","
chr (3): species, island, sex
dbl (5): bill_length_mm, bill_depth_mm, flipper_length_mm, body_mass_g, year

ℹ Use `spec()` to retrieve the full column specification for this data.
ℹ Specify the column types or set `show_col_types = FALSE` to quiet this message.
glimpse(penguins)
Rows: 344
Columns: 8
$ species           <chr> "Adelie", "Adelie", "Adelie", "Adelie", "Adelie", "A…
$ island            <chr> "Torgersen", "Torgersen", "Torgersen", "Torgersen", …
$ bill_length_mm    <dbl> 39.1, 39.5, 40.3, NA, 36.7, 39.3, 38.9, 39.2, 34.1, …
$ bill_depth_mm     <dbl> 18.7, 17.4, 18.0, NA, 19.3, 20.6, 17.8, 19.6, 18.1, …
$ flipper_length_mm <dbl> 181, 186, 195, NA, 193, 190, 181, 195, 193, 190, 186…
$ body_mass_g       <dbl> 3750, 3800, 3250, NA, 3450, 3650, 3625, 4675, 3475, …
$ sex               <chr> "male", "female", "female", NA, "female", "male", "f…
$ year              <dbl> 2007, 2007, 2007, 2007, 2007, 2007, 2007, 2007, 2007…

There are 344 observations in the data.
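The sentence above is the kind of statement inline code is meant to produce. A minimal sketch of how it might be set up follows, assuming a small stand-in data frame (`penguins_small`, a hypothetical name holding the first five rows of two columns from the `glimpse()` output) in place of the full file:

```r
# Toy stand-in for the penguins data: the first five rows of two columns
# from the glimpse() output above. In the real proposal you would use the
# data frame returned by read_csv() instead.
penguins_small <- data.frame(
  species        = rep("Adelie", 5),
  bill_length_mm = c(39.1, 39.5, 40.3, NA, 36.7)
)

# Compute the dimensions once so the prose never hard-codes them.
n_obs  <- nrow(penguins_small)
n_vars <- ncol(penguins_small)

# In the .qmd source, the sentence would then use inline code, e.g.:
#   There are `r n_obs` observations and `r n_vars` variables in the data.
```

Because the values are computed when the document renders, the text stays correct if the dataset is later filtered or updated.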

Questions

The two questions you want to answer.

Analysis plan

A plan for answering each question, including the variables involved, any variables to be created, and any external data to be merged in. Also include a feasibility assessment: explain how the data you have can be used to answer the questions you stated.
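One way to support the feasibility assessment is to check how much data is missing in the variables you plan to use and to prototype any variable you intend to create. The sketch below is illustrative only: it assumes a five-row stand-in (`penguins_small`) built from values shown in the `glimpse()` output, and `bill_ratio` is a hypothetical derived variable, not something the project requires.

```r
library(dplyr)

# Toy stand-in: the first five rows of two numeric columns from the
# glimpse() output in the Data section (assumption: your real analysis
# would use the full data frame).
penguins_small <- tibble(
  bill_length_mm = c(39.1, 39.5, 40.3, NA, 36.7),
  bill_depth_mm  = c(18.7, 17.4, 18.0, NA, 19.3)
)

# Feasibility check: count missing values in each column.
missing_counts <- penguins_small |>
  summarize(across(everything(), ~ sum(is.na(.x))))

# Variable to be created (hypothetical): ratio of bill length to bill depth.
# Rows with a missing input get NA for the ratio.
penguins_small <- penguins_small |>
  mutate(bill_ratio = bill_length_mm / bill_depth_mm)
```

Reporting counts like these in the plan makes it clear whether enough complete observations remain to answer each question.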

References

Data Science Learning Community. 2024. “Tidy Tuesday: A Weekly Social Data Project.” https://tidytues.day.
Gorman, Kristen B, Tony D Williams, and William R Fraser. 2014. “Ecological Sexual Dimorphism and Environmental Variability Within a Community of Antarctic Penguins (Genus Pygoscelis).” PLoS ONE 9 (3): e90081.
Horst, Allison Marie, Alison Presmanes Hill, and Kristen B Gorman. 2020. Palmerpenguins: Palmer Archipelago (Antarctica) Penguin Data. https://doi.org/10.5281/zenodo.3960218.

Resources

AI

State whether you’ve used AI resources in developing your project. If so, cite them as you usually do in your homework assignments. If not, say so explicitly.

Other

State whether you’ve used non-AI resources, other than those provided as part of course materials, in developing your project. If so, cite them as you usually do in your homework assignments. If not, say so explicitly.