Stars
Finetune Llama 3.2, Mistral, Phi, Qwen & Gemma LLMs 2-5x faster with 80% less memory
Synthetic Difference in Differences for Stata
A repository of academic resources to help make use of the Facebook Social Connectedness Index data.
A model(ing framework) for sample efficient OCR
Analysis of GPS mobility datasets for disaster risk management and urban planning.
Repo for the open source GW SATP autocoding pipeline
unexploredtest / neural-networks-and-deep-learning
Forked from mnielsen/neural-networks-and-deep-learningCode samples for my book "Neural Networks and Deep Learning"
Download data from the Opportunity Insights Economic Tracker — https://tracktherecovery.org/
A scalable machine learning library on Apache Spark
Code for "Learning End-to-End Patient Representations through Self-Supervised Covariate Balancing for Causal Treatment Effect Estimation"
Georeferenced Rasters and Statistics of Nightlights from NASA Black Marble
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
Stata package to plot distributions after absorbing variance from fixed effects and linear controls
Data and code to accompany the paper: Halterman, Keith, Sarwar, and O'Connor. "Corpus-Level Evaluation for Event QA: The IndiaPoliceEvents Corpus Covering the 2002 Gujarat Violence." Findings of AC…
The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the dataset includes a large collection of native script Wikipedia tex…
A convenient way to link, deduplicate, aggregate and cluster data(frames) in Python using deep learning
Comments on the Nature of Being a Graduate Student
This offers a Jupyter Notebook introduction on how to use Large Language Models for text analysis within the social sciences.
collection of Indian open government data related scripts.