Stars
Code for "Learning End-to-End Patient Representations through Self-Supervised Covariate Balancing for Causal Treatment Effect Estimation"
Georeferenced Rasters and Statistics of Nightlights from NASA Black Marble
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Stata package to plot distributions after absorbing variance from fixed effects and linear controls
Data and code to accompany the paper: Halterman, Keith, Sarwar, and O'Connor. "Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence." Findings of AC…
The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia tex…
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Comments on the Nature of Being a Graduate Student
This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences.
collection of Indian open government data related scripts.
This project contains the website code for the Urbanization Project
Source code and assets of pascalmichaillat.org
repo for "Natural Language Processing for Law and Social Science" @ ETH Zurich, Spring 2022
Materials for PhD course on text data in economics
Jupyter notebooks for the Natural Language Processing with Transformers book
Extract features from Proquest newspaper search results and stores in csv for more than the 100 Proquest limit.
Code and notebooks for my Medium blog posts
Causal Inference II Mixtape Session taught by Scott Cunningham
Doing Applied Economics Research Mixtape Track taught by Mark Anderson and Daniel Rees
Causal Inference 1 Mixtape Session taught by Scott Cunningham
Advanced Differnce-in-Differences Mixtape Track taught by Jonathan Roth
A user-created catalog of great economics writing