This project is still being built, however it is deployed on Render so that changes can be viewed on a live site in realtime. Please feel free to have a browse here, but note that it is still a work in progress! Also note that it is currently deployed on the free tier and as such scales down with inactivity so the web services may take a moment to fire up when accessed.
This project is an information hub for research, news and social posts about Long Covid. It utilises several APIs, web scrapers and a neural model for summarisation built using Python to obtain information and display it to the user.
The project uses a Python/Flask backend and a TypeScript/React frontend, with MongoDB used to store scraped data. Currently the hub has 3 sections, News, Data and Reddit Feed. News articles and Reddit posts are obtained using API calls to the respective services.
The Research section visualizes data on long COVID articles from PubMed. The data is scraped from the PubMed website using BeautifulSoup which grabs the Title, Author, Date, and Abstract from the articles. The abstract is summarised using an AI model and stored in MongoDB, which is read by the Flask backend using Pandas. The backend serves the data as JSON, which is then fetched by the React frontend and visualized using Plotly.
To install the project, follow these steps:
-
Clone the repository:
-
Navigate into the project directory and install dependencies:
cd backend && pip install -r requirements.txt
cd ../frontend && npm install
-
Install MongoDB
brew tap mongodb/brew brew install [email protected]
-
Start MongoDB
brew services start [email protected]
- To start the backend, run:
cd backend && flask run
- To start the frontend, run:
cd frontend && npm start
- open https://localhost:3000/ in your web browser.
To run the backend tests, run:
cd backend
pytest
To run the frontend tests, run:
cd frontend
npm run test