Plants in Danger
Data manipulation, Web scraping (rvest package) and SelectorGadget
Notable topics: Data manipulation, Web scraping (rvest package) and SelectorGadget
Recorded on: 2020-08-17
Timestamps by: Eric Fletcher
Screencast
Timestamps
Using count, fct_lump, and fct_reorder to get an overview of categorical data
Using fct_relevel to reorder the "Before 1900" level to the first location leaving the other levels in their existing order
Using n and sum in fct_reorder to reorder factor levels when there are multiple categories in count
Using reorder_within and scale_y_reordered such that the values are ordered within each facet
Using `axis.text.x" to rotate overlapping labels
Using filter and fct_lump to lump all levels except for the 8 most frequest facet panels
Using separate to separate the character column binomial_name into multiple columns (genus and species)
Using fct_lump within count to lump all levels except for the 8 most frequent genus
Using rvest and SelectorGadget to web scrape list of species
Using str_trim to remove whitespace from character string
Using separate to separate character string into genus, species, and rest/citation columns and using extra = "merge" to merge extra pieces into the rest/citation column
Using rvest and SelectorGadget to web scrape image links
Summary of screencast