A Python script for researching the use of Git notes in popular GitHub repositories. It selects popular repositories by star count, checks whether they use Git notes, and collects information about those notes.
Git notes are a powerful but often underutilized feature of Git. This script gathers data on how popular GitHub repositories use them, including the notes themselves, their content, and when they were last updated, with the aim of understanding whether Git notes are still in use.
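As an illustration of the "checks whether they use Git notes" step, here is a minimal sketch, not the code in modules/git.py. It assumes notes live under the conventional `refs/notes/` namespace and that `git` is on the PATH; the function names `parse_notes_refs` and `list_notes_refs` are hypothetical. It relies on `git ls-remote <url> <pattern>`, which lists matching refs on a remote without cloning it.

```python
import subprocess


def parse_notes_refs(ls_remote_output: str) -> dict[str, str]:
    """Map each refs/notes/* ref name to its object ID.

    Expects the text printed by `git ls-remote`, one "<object-id>\t<ref>" per line.
    """
    refs: dict[str, str] = {}
    for line in ls_remote_output.splitlines():
        parts = line.split()
        # Keep only well-formed lines whose ref is in the notes namespace.
        if len(parts) == 2 and parts[1].startswith("refs/notes/"):
            refs[parts[1]] = parts[0]
    return refs


def list_notes_refs(repo_url: str, timeout: float = 60.0) -> dict[str, str]:
    """Ask the remote for its notes refs without cloning (hypothetical helper)."""
    result = subprocess.run(
        ["git", "ls-remote", repo_url, "refs/notes/*"],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if git exits with an error
        timeout=timeout,
    )
    return parse_notes_refs(result.stdout)
```

An empty dictionary from `list_notes_refs` would suggest the repository publishes no notes refs; a non-empty one gives the refs worth fetching for closer inspection.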
Before using this script, ensure you have the following prerequisites installed:
- Python 3.11
- duckdb
- py_ulid
You can install the required Python packages using pip:
pip install -r requirements.txt
To run the script, use the following command:
python git_notes_research.py
This will initiate the data collection process and populate the DuckDB database with the relevant information.
The project is organized as follows:
git-notes-research/
│
├── git_notes_research.py # Main script
├── modules/ # Custom modules
│ ├── db.py # Database interactions
│ ├── git.py # Git interactions
│ └── setup.py # Setup steps
├── requirements.txt # Required Python packages
├── README.md # This README file
└── .gitignore # Gitignore file
This script uses DuckDB to store and query the collected data. The database structure is defined in the db.py module.
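To give a feel for what storing and querying the data might look like, here is a minimal sketch. The table name `repos`, its columns, and the helper names are assumptions made for illustration; the actual schema is whatever db.py defines. The helpers only call `execute(...)` and `fetchall()` on the connection they receive, which a connection from `duckdb.connect(...)` provides.

```python
def create_schema(con) -> None:
    """Create a hypothetical table with one row per inspected repository."""
    con.execute(
        "CREATE TABLE IF NOT EXISTS repos ("
        "full_name VARCHAR PRIMARY KEY, "
        "stars INTEGER, "
        "has_notes BOOLEAN)"
    )


def record_repo(con, full_name: str, stars: int, has_notes: bool) -> None:
    """Insert one repository; values are bound through `?` placeholders."""
    con.execute(
        "INSERT INTO repos VALUES (?, ?, ?)",
        [full_name, stars, has_notes],
    )


def count_repos_with_notes(con) -> int:
    """Return how many recorded repositories were found to use Git notes."""
    rows = con.execute("SELECT COUNT(*) FROM repos WHERE has_notes").fetchall()
    return rows[0][0]


# Possible usage with DuckDB (the file name is an assumption):
#   import duckdb
#   con = duckdb.connect("git_notes.duckdb")
#   create_schema(con)
#   record_repo(con, "example/project", 1234, True)
#   print(count_repos_with_notes(con))
```

Because the helpers take the connection as an argument, they can be exercised against an in-memory database in tests rather than the file used for a real run.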
Contributions are welcome! If you would like to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Make your changes and commit them.
- Push your changes to your fork.
- Submit a pull request to the main repository.
This project is licensed under the MIT License - see the LICENSE file for details.