NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding

Ming Hu, Lin Wang, Siyuan Yan, Don Ma, Qingli Ren, Peng Xia, Wei Feng, Peibo Duan, Lie Ju, Zongyuan Ge.

✨ Introduction

NurViD is a large video dataset with expert-level annotations for nursing procedure activity understanding. It consists of over 1.5k videos totaling 144 hours and covers 51 distinct nursing procedures and 177 action steps.

demo

🥳 News

  • [2023.09.22] NurViD was accepted at NeurIPS 2023 Track Datasets and Benchmarks!

🤠 Installation

This package has the following requirements:

  • GCC >= 4.9
  • python >= 3.8
  • PyTorch >= 1.8
  • Denseflow
  • MMAction2
  • PySlowFast

1. Create a virtual environment

conda create --name nurvid python=3.9 -y
conda activate nurvid
pip install -r requirements.txt

2. MMAction2, PySlowFast, and Denseflow

Please refer to the official documentation of MMAction2, PySlowFast, and Denseflow (optional: a GPU-accelerated library for efficient optical-flow extraction) for detailed installation instructions.

🤭 Directory Structure

Within the project, the folder structure looks like this:

NurViD-benchmark
├── annotations
│   ├── task1&3
│   │   ├── train.csv
│   │   ├── val.csv
│   │   ├── test.csv
│   ├── task2
│   │   ├── procedure_train.csv
│   │   ├── procedure_val.csv
│   │   ├── procedure_test.csv
│   │   ├── action_train.csv
│   │   ├── action_val.csv
│   │   ├── action_test.csv
│   ├── NurViD_annotations.json
│   ├── Procedure&Action_ID.xlsx
├── feature_extraction
│   ├── feature
│   │   ├── --Ly-qjodoI.npz
│   │   ├── -0z1P7sw2qs.npz
│   │   ├── ..
│   ├── build_rawframes.py
│   ├── extract_features.py
│   ├── ..
├── tools
│   ├── downloader.py
│   ├── preprocess_videos.py
│   ├── clip.py
├── model (Baseline models.)
│   ├── SlowFast
│   ├── C3D
│   ├── I3D
├── dataset
│   ├── Original_videos
│   │   ├── --Ly-qjodoI.mp4
│   │   ├── -0z1P7sw2qs.mp4
│   │   ├── ..
│   ├── Preprocessed_videos
│   │   ├── --Ly-qjodoI.mp4
│   │   ├── -0z1P7sw2qs.mp4
│   │   ├── ..
│   ├── Segments
│   │   ├── --Ly-qjodoI_1.mp4
│   │   ├── --Ly-qjodoI_2.mp4
│   │   ├── --Ly-qjodoI_3.mp4
│   │   ├── ..
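
As a quick orientation, the sketch below shows how the annotation files in this layout can be loaded. It assumes NurViD_annotations.json is keyed by video ID and that the task-2 CSV files have a header row, which may differ from the released files.

import json
import csv

# Full temporal annotations (assumed: one entry per video ID).
with open("annotations/NurViD_annotations.json", "r", encoding="utf-8") as f:
    annotations = json.load(f)

# One classification split for task 2 (assumed: CSV with a header row).
with open("annotations/task2/procedure_train.csv", newline="", encoding="utf-8") as f:
    train_rows = list(csv.DictReader(f))

print(f"{len(annotations)} annotated videos, {len(train_rows)} training rows")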

😎 Dataset Preparation

1. Download Videos

Download the videos automatically from YouTube by running the script below:

python /tools/downloader.py
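
For reference, here is a minimal sketch of what the download step amounts to, assuming yt-dlp is installed and that the video IDs are the keys of NurViD_annotations.json; the bundled tools/downloader.py may use a different library or options.

import json
from yt_dlp import YoutubeDL

with open("annotations/NurViD_annotations.json", "r", encoding="utf-8") as f:
    video_ids = list(json.load(f).keys())  # assumed: JSON keyed by YouTube video ID

opts = {
    "format": "mp4",                                      # best available MP4 stream
    "outtmpl": "dataset/Original_videos/%(id)s.%(ext)s",  # save as <video_id>.mp4
    "ignoreerrors": True,                                  # skip removed/private videos
}
with YoutubeDL(opts) as ydl:
    ydl.download([f"https://www.youtube.com/watch?v={vid}" for vid in video_ids])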

2. Preprocess Videos

Running the script below resizes each video so that its short edge is 256 pixels and resamples it to a frame rate of 25 FPS:

python /tools/preprocess_videos.py
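
The sketch below illustrates the same resizing and resampling with ffmpeg called from Python. It assumes ffmpeg is on the PATH and that the output goes to dataset/Preprocessed_videos; tools/preprocess_videos.py may differ in detail.

import subprocess
from pathlib import Path

SRC = Path("dataset/Original_videos")
DST = Path("dataset/Preprocessed_videos")
DST.mkdir(parents=True, exist_ok=True)

# Scale whichever side is shorter to 256 px (the other side keeps the aspect
# ratio, rounded to an even number), then resample to 25 FPS.
VF = "scale=w='if(lt(iw,ih),256,-2)':h='if(lt(iw,ih),-2,256)',fps=25"

for video in SRC.glob("*.mp4"):
    subprocess.run(
        ["ffmpeg", "-y", "-i", str(video), "-vf", VF, str(DST / video.name)],
        check=True,
    )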

3. Create Trimmed Segments

We clip each video into segments according to the order specified in the JSON annotation file and add a sequential number to each segment as its label:

python /tools/clip.py
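
The sketch below shows one way the trimming can be done, assuming each JSON entry lists its segments with "start" and "end" times in seconds; the actual field names in NurViD_annotations.json, and the logic in tools/clip.py, may differ.

import json
import subprocess
from pathlib import Path

SRC = Path("dataset/Preprocessed_videos")
DST = Path("dataset/Segments")
DST.mkdir(parents=True, exist_ok=True)

with open("annotations/NurViD_annotations.json", "r", encoding="utf-8") as f:
    annotations = json.load(f)

for video_id, segments in annotations.items():
    for idx, seg in enumerate(segments, start=1):  # sequential number used as the label suffix
        out = DST / f"{video_id}_{idx}.mp4"
        subprocess.run(
            ["ffmpeg", "-y", "-i", str(SRC / f"{video_id}.mp4"),
             "-ss", str(seg["start"]), "-to", str(seg["end"]),
             "-c", "copy", str(out)],
            check=True,
        )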

4. Extract RGB and Flow Features

We start by extracting RGB frames from each video at 25 frames per second and computing optical flow with the TV-L1 algorithm:

python /feature_extraction/build_rawframes.py /video_path /rgb&flow_frames_save_path --level 1 --flow-type tvl1 --ext mp4 --task both
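
For intuition, the snippet below computes TV-L1 flow for a single pair of frames with OpenCV's contrib implementation (opencv-contrib-python); the actual pipeline uses Denseflow / build_rawframes.py for GPU-accelerated extraction over whole videos.

import cv2
import numpy as np

cap = cv2.VideoCapture("dataset/Preprocessed_videos/-0z1P7sw2qs.mp4")
ok1, prev = cap.read()
ok2, curr = cap.read()
assert ok1 and ok2, "could not read two frames"

prev_gray = cv2.cvtColor(prev, cv2.COLOR_BGR2GRAY)
curr_gray = cv2.cvtColor(curr, cv2.COLOR_BGR2GRAY)

tvl1 = cv2.optflow.createOptFlow_DualTVL1()
flow = tvl1.calc(prev_gray, curr_gray, None)  # H x W x 2 array of (dx, dy) displacements

# Clip and rescale to [0, 255] so each flow channel can be stored as an image,
# which is how flow frames are typically saved before I3D feature extraction.
flow_img = ((np.clip(flow, -20, 20) + 20) / 40 * 255).astype(np.uint8)
print(flow_img.shape)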

Next, we use an I3D model pre-trained on ImageNet to generate features for each RGB and optical-flow frame:

python /feature_extraction/extract_features.py --mode rgb --load_model models/rgb_imagenet.pt --input_dir /rgb&flow_frames_save_path --output_dir /rgb_feature_save_path --batch_size 100 --sample_mode resize --no-usezip
python /feature_extraction/extract_features.py --mode flow --load_model models/flow_imagenet.pt --input_dir /rgb&flow_frames_save_path --output_dir /flow_feature_save_path --batch_size 100 --sample_mode resize --no-usezip

To handle varying video durations, we uniformly interpolate each video's features to a fixed length of 100. Finally, we concatenate the RGB and optical-flow features into a 2048-dimensional embedding that serves as the model input.
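
A minimal sketch of this resampling and fusion step is shown below. It assumes each extracted feature sequence is a (T, 1024) array and that the file paths match the feature directories above, which may not match the exact output format of extract_features.py.

import numpy as np

def resample_uniform(features: np.ndarray, num_steps: int = 100) -> np.ndarray:
    """Linearly interpolate a (T, D) feature sequence to (num_steps, D)."""
    t_old = np.linspace(0.0, 1.0, num=features.shape[0])
    t_new = np.linspace(0.0, 1.0, num=num_steps)
    return np.stack(
        [np.interp(t_new, t_old, features[:, d]) for d in range(features.shape[1])],
        axis=1,
    )

rgb = np.load("rgb_feature_save_path/--Ly-qjodoI.npy")    # assumed shape (T_rgb, 1024)
flow = np.load("flow_feature_save_path/--Ly-qjodoI.npy")  # assumed shape (T_flow, 1024)

# 100 interpolated steps of RGB and flow, concatenated to a (100, 2048) embedding.
fused = np.concatenate([resample_uniform(rgb), resample_uniform(flow)], axis=1)
print(fused.shape)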

5. Our Source

We also provide a method to directly access our data, but it requires you to sign the data agreement form. Once you have completed the form, you will receive an email from our team with Google Drive and Baidu Netdisk download links.

🧸 Acknowledgement

Part of our code is borrowed from the following repositories:

🥳 Citation

If you find this work useful, please cite:

@inproceedings{ming2023nurvid,
  title={NurViD: A Large Expert-Level Video Database for Nursing Procedure Activity Understanding},
  author={Hu, Ming and Wang, Lin and Yan, Siyuan and Ma, Don and Ren, Qingli and Xia, Peng and Feng, Wei and Duan, Peibo and Ju, Lie and Ge, Zongyuan},
  booktitle={Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
  year={2023}
}

✅ License and Disclaimer

The project is released under the CC BY 4.0 license; the license and disclaimer statement can be found in the following files:

  • CC BY 4.0
  • Disclaimer.txt

🏥 Contributors

This research was supported by a team from Shanxi Medical University. We are grateful for their dedication to the data annotation process.

Qingli Ren, Peizhe Zhang, Hao Guo, Yidi Liu, Yaokai Xing, Jiaqi Li, Rujie Gao, Zhen Lv, Jun Wang, Jiayu Tian, Guangyan Niu, Ruixin Wang, Huikang Huang, Yuxin Zhao, Jing Li, Yijin Wang, Yajing Hao, Wenxua Wu, Ziyi Wang, Xu Guo, Yuhua Cai, Xinrong Guo, Xueying Ma, Yingjuan Zhang, Yuqi Zhang, Liru Ma, Sinan Li
