webgpu: Use (16, 1, 1) as the work group size for binary and unary ops #2013

qjia7 · 2019-09-11T07:35:35Z

With this change, the add benchmark has close to 100% speedup.
And PoseNet+ResNet demo improves from 3 fps to 6 fps.

We should try to avoid using (1, 1, 1) as the default work group size.

PERF

To see the logs from the Cloud Build CI, please join either our discussion or announcement mailing list.

This change is

With this change, the add benchmark has close to 100% speedup. And PoseNet+ResNet demo improves from 3 fps to 6 fps. We should try to avoid using (1, 1, 1) as the default work group size. PERF

qjia7 · 2019-09-11T07:39:37Z

@annxingyuan @kainino0x Please take a look. Thanks.

annxingyuan

LGTM - thank you this is so awesome! I had mistakenly believed that there would be no performance benefit from simply bundling threads into thread groups when shared memory is not being used.

webgpu: Use (16, 1, 1) as the work group size for binary and unary ops

0a92279

With this change, the add benchmark has close to 100% speedup. And PoseNet+ResNet demo improves from 3 fps to 6 fps. We should try to avoid using (1, 1, 1) as the default work group size. PERF

googlebot added the cla: yes label Sep 11, 2019

annxingyuan approved these changes Sep 11, 2019

View reviewed changes

annxingyuan merged commit eef8a32 into tensorflow:master Sep 11, 2019

qjia7 deleted the workGroupSize branch August 13, 2020 07:01

Z3r0S3v3n mentioned this pull request Nov 28, 2023

[Snyk] Security upgrade googleapis from 39.2.0 to 49.0.0 Z3r0S3v3n/tfjs#50

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

webgpu: Use (16, 1, 1) as the work group size for binary and unary ops #2013

webgpu: Use (16, 1, 1) as the work group size for binary and unary ops #2013

Uh oh!

qjia7 commented Sep 11, 2019 •

edited by nsthorat

Loading

Uh oh!

qjia7 commented Sep 11, 2019

Uh oh!

annxingyuan left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

webgpu: Use (16, 1, 1) as the work group size for binary and unary ops #2013

webgpu: Use (16, 1, 1) as the work group size for binary and unary ops #2013

Uh oh!

Conversation

qjia7 commented Sep 11, 2019 • edited by nsthorat Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

qjia7 commented Sep 11, 2019

Uh oh!

annxingyuan left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

qjia7 commented Sep 11, 2019 •

edited by nsthorat

Loading