
Continuously increasing RAM with Pre-training #77

Open
abhisheksgumadi opened this issue Jul 5, 2022 · 12 comments

@abhisheksgumadi

Dear Team,

I am using the pre-training script to pre-train BLIP on a custom dataset (containing around 1M image/text pairs).

I see that the machine's RAM utilization continuously increases until it reaches 100%. The machine has 120 GB of RAM!

Any idea where the problem could be?

[Screenshot: RAM utilization]

@woctezuma

Do you have custom code which could have a memory leak?

@abhisheksgumadi
Author

We have a custom dataloader that loads images and text from a Parquet file.
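
A minimal sketch of what such a loader might look like (the class name and column names below are assumptions for illustration, not the actual code):

```python
# Hypothetical sketch of a Parquet-backed dataset; the column names
# ("image_path", "caption") are assumptions, not the actual schema.
import pandas as pd
from PIL import Image
from torch.utils.data import Dataset

class ParquetCaptionDataset(Dataset):
    def __init__(self, parquet_file, transform=None):
        # The whole table is read into memory here and stays resident
        # for the lifetime of the dataset (and of every DataLoader worker).
        self.df = pd.read_parquet(parquet_file)
        self.transform = transform

    def __len__(self):
        return len(self.df)

    def __getitem__(self, idx):
        row = self.df.iloc[idx]
        image = Image.open(row["image_path"]).convert("RGB")
        if self.transform is not None:
            image = self.transform(image)
        return image, row["caption"]
```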

@abhisheksgumadi
Author

We have 1 million images stored on disk, and we have prepared the JSON file as described in the GitHub README. Our Dataloader loads the JSON file into memory in the __init__ method, and in the __getitem__ method it loads the image from the corresponding path in the JSON file and also returns the text.

Not sure why the RAM utilization is so high. Any ideas, please? Thanks.
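
For reference, a minimal sketch of the setup described above (a hypothetical stand-in, assuming the annotations are a list of dicts with "image" and "caption" keys):

```python
# Hypothetical sketch of the dataset described above, not the actual code.
import json
from PIL import Image
from torch.utils.data import Dataset

class ImageTextJsonDataset(Dataset):
    def __init__(self, ann_file, transform=None):
        # ~1M annotation dicts are loaded into a Python list here and
        # stay resident in RAM, once per DataLoader worker process.
        with open(ann_file) as f:
            self.annotations = json.load(f)
        self.transform = transform

    def __len__(self):
        return len(self.annotations)

    def __getitem__(self, idx):
        ann = self.annotations[idx]
        image = Image.open(ann["image"]).convert("RGB")
        if self.transform is not None:
            image = self.transform(image)
        return image, ann["caption"]
```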

@LiJunnan1992
Contributor

Hi, it could be related to the dataloader.
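
One way to check this (a hedged suggestion, not an official debugging script) is to iterate over the dataloader by itself, without the model, and watch the process's resident memory, for example:

```python
# Hypothetical check: drive the dataloader alone and log resident memory.
# `train_loader` is assumed to be the DataLoader built for pre-training.
import os
import psutil

process = psutil.Process(os.getpid())
for step, batch in enumerate(train_loader):
    if step % 100 == 0:
        rss_gb = process.memory_info().rss / 1024**3
        # Note: this reports the parent process only; sum over
        # process.children(recursive=True) to include worker processes.
        print(f"step {step}: RSS = {rss_gb:.2f} GB")
```

If the resident size keeps growing here as well, the leak is in the data pipeline rather than in the model or optimizer.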

@abhisheksgumadi
Author

We ended up using the pretrain_dataset.py file and formatted the data as a JSON file exactly as described in the README. Even then, we see the RAM utilization go to 100%. So now we have just formatted the dataset as required, with no changes to the code; we don't even have any custom code of our own.
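
For anyone reproducing this, the annotation layout we followed is (to my understanding of the README) a JSON list of entries with "image" and "caption" keys; an illustrative example, with made-up paths and captions:

```python
# Illustrative only: assumed annotation layout for pretrain_dataset.py
# (a JSON list of dicts with "image" and "caption" keys).
import json

annotations = [
    {"image": "images/0000001.jpg", "caption": "a dog running on the beach"},
    {"image": "images/0000002.jpg", "caption": "two people riding bicycles"},
]

with open("pretrain_ann.json", "w") as f:  # placeholder file name
    json.dump(annotations, f)
```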

@abhisheksgumadi
Author

We are happy to follow any other debugging steps to make this a success. Thanks!
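
One concrete step we could try (a sketch under the assumption that the annotation file is the culprit) is to measure how much resident memory the annotation list alone occupies, independent of training:

```python
# Hypothetical check: how much RSS does loading the annotation JSON cost?
# "pretrain_ann.json" is a placeholder path for the ~1M-entry file.
import json
import os
import psutil

process = psutil.Process(os.getpid())
before = process.memory_info().rss

with open("pretrain_ann.json") as f:
    annotations = json.load(f)

after = process.memory_info().rss
print(f"{len(annotations)} annotations -> {(after - before) / 1024**3:.2f} GB of RSS")
```

If that number is large, multiplying it by the number of DataLoader workers gives a rough idea of how much RAM the annotations alone can consume.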

@asgsaeid

Was wondering if there has been any update on this. We ran pretrain.py and saw the same issue: RAM usage increases while the JSON files are being read, and at some point RAM explodes. For pre-training, which Python version did you use, and how much RAM did the machine have?

@LiJunnan1992
Contributor

@abhisheksgumadi @asgsaeid
You may want to try out our new library which supports BLIP and see if the issue still remains:
https://github.com/salesforce/LAVIS

@abhisheksgumadi
Author

Thanks, will take a look

@dyashuni

@aries-young

> Was wondering if there has been any update on this. We ran pretrain.py and saw the same issue: RAM usage increases while the JSON files are being read, and at some point RAM explodes. For pre-training, which Python version did you use, and how much RAM did the machine have?

Have you solved this problem? Could you kindly provide some suggestions?

@aries-young

> Thanks, will take a look

Have you solved this problem? Could you kindly provide some suggestions?
