Starred repositories
Community-developed Quarto extension to embed webR in HTML documents, RevealJS, websites, blogs, and books.
Friends don't let friends make certain types of data visualization - What are they and why are they bad.
✂️ Extract Tables from Microsoft Word Documents with R
Code and data for "Skeptic priors and climate consensus" (McDermott, 2021)
Software for humanities scholars using quantitative or computational methods.
Dataset and baseline models of entity annotations in historical Dutch colonial archives
In-class notebooks for the Spring 2023 seminar on quantitative literary analysis
Practical Approaches to Data Science with Text
gpttools extends gptstudio for package development to help you document code, write tests, or even explain code
Natural language processing pipeline for book-length documents (archival Java version; for current Python version, see: https://github.com/booknlp/booknlp)
A curated list of awesome ggplot2 tutorials, packages etc.
Digital Pedagogy in the Humanities: Concepts, Models, and Experiments
Materials for a workshop on image search for heritage data
Download and import OpenStreetMap data from Geofabrik and other providers
Poetry identification code from my dissertation; runs on zip files containing DjVu XML from the Internet Archive.
Jekyll-based static site for The Programming Historian
A standalone React/Redux web application for presenting unique printed books and manuscripts in digital facsimile.
Tools for Generating, Visualising, and Analysing Link Communities in Networks
A small script which creates a historical list of forenames and genders
Generic project template for a data analysis project aiming for publication.
A post-processing tool for scanned sheets of paper.
Jupyter book showing how to build an ML powered book genre classifier
Network-related modules and pipelines for kiara.