Add variable seqlen and sparsity parameters to jagged_sum benchmark #2324

jananisriram · 2024-06-20T23:42:45Z

Summary:
Modify existing jagged_sum operator benchmark to optionally accept any of the following parameters: B (dimension 0 of nested tensor), M (dimension 2 of nested tensor), seqlen (maximum sequence length on ragged dimension), or sparsity (average sparsity on ragged dimension). This diff fixes the provided command line parameters and varies all other parameters above, enabling testing of all combinations of multiple parameters in parallel.

The following errors persist with sufficiently large inputs:

RuntimeError: numel needs to be smaller than int32_t max; otherwise, please use packed_accessor64 (when running command buck2 run @mode/{opt,inplace} //pytorch/benchmark:triton -- --op jagged_sum --B 1024 --M 1024 --sparsity 0.3)
torch.OutOfMemoryError: CUDA out of memory.

Reviewed By: davidberard98

Differential Revision: D58772201

facebook-github-bot · 2024-06-20T23:43:03Z

This pull request was exported from Phabricator. Differential Revision: D58772201

Summary: Modify existing `jagged_sum` operator benchmark to optionally accept any of the following parameters: `B` (dimension 0 of nested tensor), `M` (dimension 2 of nested tensor), `seqlen` (maximum sequence length on ragged dimension), or `sparsity` (average sparsity on ragged dimension). This diff fixes the provided command line parameters and varies all other parameters above, enabling testing of all combinations of multiple parameters in parallel. The following errors persist with sufficiently large inputs: - `RuntimeError: numel needs to be smaller than int32_t max; otherwise, please use packed_accessor64` (when running command `buck2 run mode/{opt,inplace} //pytorch/benchmark:triton -- --op jagged_sum --B 1024 --M 1024 --sparsity 0.3`) - `torch.OutOfMemoryError: CUDA out of memory.` Reviewed By: davidberard98 Differential Revision: D58772201

facebook-github-bot · 2024-06-21T07:16:04Z

This pull request was exported from Phabricator. Differential Revision: D58772201

facebook-github-bot · 2024-06-21T08:23:24Z

This pull request has been merged in 1425f68.

facebook-github-bot added the cla signed label Jun 20, 2024

jananisriram had a problem deploying to docker-s3-upload June 20, 2024 23:42 — with GitHub Actions Failure

jananisriram had a problem deploying to docker-s3-upload June 20, 2024 23:43 — with GitHub Actions Failure

facebook-github-bot added the fb-exported label Jun 20, 2024

jananisriram force-pushed the export-D58772201 branch from c0c23ac to 728540d Compare June 21, 2024 07:16

jananisriram had a problem deploying to docker-s3-upload June 21, 2024 07:16 — with GitHub Actions Failure

facebook-github-bot closed this in 1425f68 Jun 21, 2024

facebook-github-bot added the Merged label Jun 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add variable seqlen and sparsity parameters to jagged_sum benchmark #2324

Add variable seqlen and sparsity parameters to jagged_sum benchmark #2324

jananisriram commented Jun 20, 2024 •

edited

Loading

facebook-github-bot commented Jun 20, 2024

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 21, 2024

Add variable seqlen and sparsity parameters to jagged_sum benchmark #2324

Add variable seqlen and sparsity parameters to jagged_sum benchmark #2324

Conversation

jananisriram commented Jun 20, 2024 • edited Loading

facebook-github-bot commented Jun 20, 2024

facebook-github-bot commented Jun 21, 2024

facebook-github-bot commented Jun 21, 2024

jananisriram commented Jun 20, 2024 •

edited

Loading