Skip to content

DCUFormer: Revisiting Transformer Decoder with DualCross/UpSampling Attention for Robust Crack Segmentation

License

Notifications You must be signed in to change notification settings

SHAN-JH/DCUFormer

 
 

Repository files navigation

DCUFormer

DCUFormer: Enhancing Pavement Crack Segmentation in Complex Environments with DualCross/UpSampling Attention

Network Architecture

architecture

Installation

We use MMSegmentation v1.2.0 as the codebase.

For install and data preparation, please find the guidelines in MMSegmentation v1.2.0 for the installation and data preparation.

Datasets

1. Crack500 dataset

The Crack500 dataset contains 500 images of size around 2000 × 1500 pixels taken by cell phones on main campus of Temple University. And each image was cropped into 16 non-overlapped image regions and only the region containing more than 1000 pixels of crack is kept. Through this way, the training data consists of 1,896 images, validation data contains 348 images, test data contains 1124 images. Download the Crack500 dataset from this link.

|-- Crack500
    |-- train
        |-- images
        |   |-- 119.jpg
            ......
        |-- masks
        |   |-- 119.png
            ......
    |-- test
        |-- images
        |   |-- 124.jpg
            ......
        |-- masks
        |   |-- 124.png
            ......
    |-- val
        |-- images
        |   |-- 131.jpg
            ......
        |-- masks
        |   |-- 131.png
            ......

2. CrackSC dataset

The CrackSC dataset consists of 197 road damage images (320 × 480 pixels) captured by an iPhone 8 around Enoree Ave, Columbia, SC. This dataset emphasizes complex pavement distress scenes with interference factors like shadows, leaves, and moss, which pose significant challenges to crack detection. Without a predefined dataset division by the authors, we divided it into 99 training images, 19 validation images, and 79 testing images, adhering to a 5:1:4 distribution ratio. Download the CrackSC dataset from this link.

|-- CrackSC
    |-- train
        |-- images
        |   |-- 20160222_081011_1_361.jpg
            ......
        |-- masks
        |   |-- 20160222_081011_1_361.png
            ......
    |-- test
        |-- images
        |   |-- 20160222_080933_361_1.jpg
            ......
        |-- masks
        |   |-- 20160222_080933_361_1.png
            ......
    |-- val
        |-- images
        |   |-- 20160222_080850_1_361.jpg
            ......
        |-- masks
        |   |-- 20160222_080850_1_361.png
            ......

3. UAV-Crack500 dataset

The UAV-Crack500 dataset contains 500 images with a size of 672 × 378 pixels, captured using a drone flying at an altitude of 50 meters with a 4× zoom camera at a speed of 2.5 meters per second. The original images had a resolution of 2688 × 1512 pixels, corresponding to a ground coverage area of 16 meters by 9 meters, with a pixel size of 6 millimeters by 6 millimeters. The images were cropped to 672 × 378 pixels to focus on the minor proportions of cracks. The dataset includes various interfering factors such as different lighting conditions, shadows from trees, and manhole covers. The images were randomly divided into a training set (250 images), a validation set (50 images), and a test set (200 images).

|-- UAV-Crack500
    |-- train
        |-- images
        |   |-- DJI_20221011133543_0038_Z_20.jpg
            ......
        |-- masks
        |   |-- DJI_20221011133543_0038_Z_20.png
            ......
    |-- test
        |-- images
        |   |-- DJI_20221011133713_0083_Z_22.jpg
            ......
        |-- masks
        |   |-- DJI_20221011133713_0083_Z_22.png
            ......
    |-- val
        |-- images
        |   |-- DJI_20221011133747_0100_Z_20.jpg
            ......
        |-- masks
        |   |-- DJI_20221011133747_0100_Z_20.png
            ......

License

This project is released under the Apache 2.0 license.

About

DCUFormer: Revisiting Transformer Decoder with DualCross/UpSampling Attention for Robust Crack Segmentation

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 99.2%
  • Other 0.8%