Skip to content
View fer-aguirre's full-sized avatar
🚀
🚀

Organizations

@DataCritica

Block or report fer-aguirre

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
fer-aguirre/README.md

Hi there! 👋

As a data journalist, I focus on data-driven investigations that expose abuses of power. My work includes scraping and cleaning data, creating data memos, conducting research and fostering understanding of data work within the team.

About me


Contents


NLP

Repository Description
discursos-milei Scraper y análisis de discursos de Javier Milei
ai4foia Proof-of-concept to recommend recipients for FOIA requests
hackathon-somos-nlp-2023 Fine-tuning LLMs for detecting hate speech categories in Spanish
customized-headlines Proof-of-concept to create customized headlines from news content based on demographic data
explained-recommendations API for a system recommendation explained using generative AI
opportunities-db Scraper to extract data from opportunity-related websites (e.g. funds, scholarships, etc.) and convert them into structured data
ner-spanish A repository for extracting Named Entity Recognition (NER) in Spanish data
pmdm Fine-tuned pre-trained language model that detects hate speech against women in Spanish and Portuguese
attackdetector Research for hate speech on Twitter against journalists and environmental activists in Mexico and Brazil
topicos-discursos-amlo Analysis with topic modeling to AMLO's speeches
bad-bunny Analysis of Bad Bunny's songs

Data Analysis

Repository Description
travesticidios-argentina Data analysis on court decisions on transvesticides in Argentina from 2018 to 2023
elecciones-argentina-2023 Data analysis of attacks against journalists in Twitter during the elections in Argentina in 2023
recomendaciones-escritoras Recommendation system for Latin American women writers
cancilleria-colombia Data analysis of public servants of Foreign Affairs in Colombia
gptzero-ai-articles Data analysis of articles talking about ChatGPT that were created with generative AI models
capir-transfronteriza2-2023 Data analysis and topic modeling of anti-rights groups from Brazil, Ecuador and Colombia
migrantes-desaparecidos-eeu Data analysis on missing migrants en route to the U.S.
covid19-venezuela Data analysis on covid-19 deaths in Venezuela
violencia-obstetrica-cuba Data analysis of obstetric violence in Cuba

Data Visualization

Repository Description
ping-pong-caba Mapa con ubicaciones de mesas de ping pong en lugares públicos de CABA
comision-revision-bolivia Map showing the rate of femicides in Bolivia per 100,000 women from 2013 to 2020
escritoras-latinas Web scraping of Wikipedia entries for Latin American women writers and network graph visualization
wifi-gratuito-cdmx Map showing locations of public free internet service in Mexico City [ARCHIVED]
mapa-huertos Map with locations of urban orchards in Mexico City [ARCHIVED]
maps-examples Maps examples using folium and prettymaps modules in Python [ARCHIVED]
directorix-disidente Digital directory of professions to build networks among the queer community of Mexico City [ARCHIVED]

Web Scraping

Repository Description
cij-argentina Scraper to convert PDF files from the CIJ website in Argentina into structured data
pdf-2-ner Web application to convert scanned PDF files to text-based data and apply Named Entity Recognition (NER) to extract entities in Spanish

Tools

Repository Description
pubmed-scraper A python command-line tool which scrapes PubMed based on keywords search and URL extraction
oportunidades-perioidstas-latam Sitio web para difundir oportunidades para periodistas en Latinoamérica
meta-threat-disruptions Track updates on Meta’s threat disruptions website
numerical-expressions A python command-line tool which describes the change between two numerical values
data-annotator Web application for text-based data labeling [ARCHIVED]

Project Templates

Repository Description
cookiecutter-data-analysis-extensive A cookiecutter template for data analysis projects using Python
cookiecutter-data-analysis-lite A starter template for data analysis projects that offers a simplified and beginner-friendly structure
cookiecutter-data-journalism A cookiecutter template for data journalism projects using Python

Learning Resources

Repository Description
csvconf-nlp Sesión de introducción a NLP en la csv,conf,v8 de Puebla, México en 2024
taller-cookiecutter Taller sobre cómo crear plantillas de proyectos para análisis de datos
taller-python Jupyter notebooks for learning the basics of Python
learn-python Collection of Python scripts organized by topics
learn-react-d3 Examples for data visualization with React and D3.js
learn-scrollama Examples for scrollytelling with scrollama
twitter-python Examples for Twitter data collection with Tweepy in Python [ARCHIVED]

Pinned Loading

  1. pmdm pmdm Public

    Political Misogynistic Discourse Monitor team from the 2021 JournalismAI Collab Challenges

    Jupyter Notebook 20 6

  2. numerical-expressions numerical-expressions Public

    A python command-line tool which describes the change between two numerical values 🧮

    Python

  3. cookiecutter-data-analysis-lite cookiecutter-data-analysis-lite Public template

    A starter template for data analysis projects that offers a simplified and beginner-friendly structure.

    Python

  4. cookiecutter-data-analysis-extensive cookiecutter-data-analysis-extensive Public template

    A cookiecutter template for data analysis projects using Python.

    Python

  5. DataCritica/cookiecutter-data-journalism DataCritica/cookiecutter-data-journalism Public template

    A cookiecutter template for data journalism projects using Python

    Python 2