Multibroadcast find_mul_conv #1384

CharlieL7 · 2022-09-14T16:06:57Z

Change find_mul_conv to work with multibroadcast also. Checks the strides instead of the broadcast axis.

… into multib_mul_conv

migraphx-bot · 2022-09-14T16:59:16Z

Test	Rate new 31de42	Rate old 1b00c6	Diff	Compare
torchvision-resnet50	2,222.99	2,224.65	-0.07%	🔴
torchvision-resnet50_fp16	4,813.98	4,813.66	0.01%	✅
torchvision-alexnet	4,964.78	4,970.95	-0.12%	🔴
torchvision-alexnet_fp16	26,176.94	26,187.97	-0.04%	✅
torchvision-densenet121	1,804.55	1,807.40	-0.16%	🔴
torchvision-densenet121_fp16	3,263.81	3,203.02	1.90%	🔆
torchvision-inceptionv3	1,104.74	1,094.97	0.89%	🔆
torchvision-inceptionv3_fp16	2,037.23	2,040.60	-0.16%	✅
torchvision-vgg16	894.34	894.65	-0.03%	🔴
torchvision-vgg16_fp16	1,724.31	1,724.83	-0.03%	✅
cadene-inceptionv4	535.01	534.01	0.19%	🔆
cadene-resnext64x4	575.66	574.74	0.16%	🔆
slim-mobilenet	6,384.04	6,255.07	2.06%	🔆
slim-nasnetalarge	207.90	206.97	0.45%	🔆
slim-resnet50v2	nan	nan	nan%	❌
bert-mrpc-onnx	893.07	889.39	0.41%	✅
bert-mrpc-tf	313.21	308.01	1.69%	🔆
pytorch-examples-wlang-gru	437.84	424.34	3.18%	🔆
pytorch-examples-wlang-lstm	332.69	331.48	0.37%	✅
torchvision-resnet50_1	517.61	514.73	0.56%	🔆
torchvision-inceptionv3_1	305.44	302.88	0.85%	🔆
torchvision-vgg16_1	462.93	460.86	0.45%	🔆
cadene-dpn92_1	330.69	324.36	1.95%	🔆
cadene-resnext101_1	234.90	210.63	11.52%	🔆
slim-vgg16_1	64.00	63.90	0.16%	🔆
slim-mobilenet_1	1,986.74	1,998.77	-0.60%	✅
slim-inceptionv4_1	193.16	198.23	-2.56%	🔴
onnx-taau-downsample	259.09	257.81	0.50%	🔆

This build is not recommended to merge 🔴

src/simplify_algebra.cpp

test/simplify_algebra_test.cpp

src/simplify_algebra.cpp

test/simplify_algebra_test.cpp

src/simplify_algebra.cpp

pfultz2 · 2022-09-17T17:48:32Z

src/simplify_algebra.cpp

+        {
+            if(invalid_sl(i))
+                invalid_case = true;
+        }


It would be simpler to write this as:

auto is_broadcasted_axis = [](auto len, auto stride) { return len == 1 or stride == 0; }; if (not is_broadcasted_axis(a_lens.front(), a_strides.front())) return; if (not std::equal(a_lens.begin()+2, a_lens.end(), a_strides.begin()+2, a_strides.end(), is_broadcasted_axis)) return;

pfultz2 · 2022-09-17T17:48:52Z

src/simplify_algebra.cpp

+
+        // check broadcasted along channels
+        auto a_lens    = a_ins->get_shape().lens();
+        auto a_strides = a_ins->get_shape().strides();


These should be const auto&.

pfultz2 · 2022-09-17T17:52:03Z

src/simplify_algebra.cpp

+                invalid_case = true;
+        }
+
+        if(invalid_sl(0) or a_strides.at(1) != 1 or invalid_case)


You should check a_strides.at(1) != 1 before checking the broadcasted axis. Also, we should probably check the rank as well(ie a_lens.size() > 2) before we start checking everything else.

Shouldn't rank >= 4 to be a valid convolution output?

CharlieL7 added 4 commits September 13, 2022 19:06

Initial

5347fe3

Tidy fix

7ff674b

Merge branch 'develop' of github.com:ROCmSoftwarePlatform/AMDMIGraphX…

ad260bb

… into multib_mul_conv

Simplify

17a799f

CharlieL7 requested review from pfultz2 and umangyadav September 14, 2022 16:06

CharlieL7 mentioned this pull request Sep 14, 2022

Rewrite ONNX parse batch norm #1362

Merged

umangyadav reviewed Sep 14, 2022

View reviewed changes

src/simplify_algebra.cpp Show resolved Hide resolved

test/simplify_algebra_test.cpp Show resolved Hide resolved

src/simplify_algebra.cpp Show resolved Hide resolved

Handle edge cases

9c3697b

CharlieL7 commented Sep 14, 2022

View reviewed changes

src/simplify_algebra.cpp Outdated Show resolved Hide resolved