Appendix C — Updates

Published

April 27, 2025

C.1 April 2025

  • Completely rewrote section 7.7 to address websites that are now JavaScript-driven instead of static HTML. Introduced {rplaywright}.
  • Turned off evaluation in 7.9: the websites have changed, so not all of the code works, but it remains illustrative of the approach. Added a warning.
  • Updated a few expired links due to changes in publisher sites.
  • Updated the list of browsers for read_html_live() in 7.5.7.

C.2 October 2024

  • Updated the web scraping section to use {rvest} read_html_live().

C.3 August 2024

  • Extensively updated section 7 on web scraping to account for changes in the IMDB website.
    • Added a new example using Selector Gadget with UN Aid organizations.
    • Changed all the IMDB sections to use Developer Tools with new CSS selectors.
    • Added a section on using the {chromote} package to scroll the IMDB pages.
    • Adjusted the Taylor Swift example to match new column headers on Wikipedia.

C.4 July 2024

  • Updated appendix A on installing software.
  • Updated appendix B to eliminate PATs for GitHub and include two-factor authentication and GCM for both Mac and Windows.

C.5 May 2024

  • Section 7.8: Updated from the {tabulizer} package for scraping PDFs to its replacement, the {tabulapdf} package.