Select indirect BGEMM kernels - Benchmarking grouped binary convolutions (compute-engine, closed)

simonmaurer commented on July 3, 2024

Select indirect BGEMM kernels - Benchmarking grouped binary convolutions

Comments (3)

Tombana commented on July 3, 2024 1

We currently don't have a CLI flag in lce_benchmark_model to choose between these. For internal benchmarks we simply replaced the registration on the following line:

compute-engine/larq_compute_engine/tflite/kernels/lce_ops_register.h

Lines 31 to 32 in a2611f8

    
           resolver->AddCustom("LceBconv2d", 
        
                               compute_engine::tflite::Register_BCONV_2D());

with Register_BCONV_2D_OPT_INDIRECT_BGEMM.

I'd welcome a PR to make this into a command-line flag. My suggestion would be:

1. Add a bool use_indirect_bgemm (default false) argument to RegisterLCECustomOps, with another if-branch next to use_reference_bconv in lce_ops_register.h.
2. To expose it as a command-line flag, I'd say the simplest approach (without modifying the TFLite benchmark BenchmarkTfLiteModel code) is to parse the command-line flags in lce_benchmark_main.cc and store the result as a global bool in that file, which can then be passed to RegisterLCECustomOps on line 26.

Note that use_reference_bconv uses core/bconv2d/reference.h which supports 'everything' such as zero-padding, one-padding and groups. The optimized implementations, however, don't support all of those.
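
A minimal sketch of how the extra branch could look. This is not the actual lce_ops_register.h: FakeResolver and the string-returning Register_* functions are stand-ins invented so the snippet compiles on its own, the name Register_BCONV_2D_REF is an assumption, and letting use_reference_bconv win when both flags are set is an arbitrary choice.

```cpp
#include <string>
#include <utility>
#include <vector>

// Stand-in for the TFLite op resolver (hypothetical): it only records
// which registration was chosen for each custom op name.
struct FakeResolver {
  std::vector<std::pair<std::string, std::string>> ops;
  void AddCustom(const char* name, const char* registration) {
    ops.emplace_back(name, registration);
  }
};

// Placeholders for the real registration functions. They return labels
// instead of TfLiteRegistration pointers; the _REF name is an assumption.
inline const char* Register_BCONV_2D() { return "BCONV_2D"; }
inline const char* Register_BCONV_2D_REF() { return "BCONV_2D_REF"; }
inline const char* Register_BCONV_2D_OPT_INDIRECT_BGEMM() {
  return "BCONV_2D_OPT_INDIRECT_BGEMM";
}

// Sketch of the suggested signature: a second bool next to
// use_reference_bconv, selecting the indirect BGEMM kernel.
inline void RegisterLCECustomOps(FakeResolver* resolver,
                                 bool use_reference_bconv = false,
                                 bool use_indirect_bgemm = false) {
  if (use_reference_bconv) {
    // Assumed precedence: the reference kernel wins if both flags are set.
    resolver->AddCustom("LceBconv2d", Register_BCONV_2D_REF());
  } else if (use_indirect_bgemm) {
    resolver->AddCustom("LceBconv2d", Register_BCONV_2D_OPT_INDIRECT_BGEMM());
  } else {
    resolver->AddCustom("LceBconv2d", Register_BCONV_2D());
  }
}
```

Calling RegisterLCECustomOps(&resolver, false, true) in this sketch registers the indirect BGEMM variant under "LceBconv2d"; the defaults keep the existing behaviour.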

simonmaurer commented on July 3, 2024 1

@Tombana thanks a lot for pointing me in the right direction.
I can do a PR that also filters the arguments: parse the flag (as you suggested) and remove it from argv before passing argv on to BenchmarkTfLiteModel. I assume (though I still need to verify) that it would otherwise throw an unrecognized-argument error.
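
Roughly what I have in mind for the filtering, as a sketch only: ExtractBoolFlag is a hypothetical helper (not part of TFLite or LCE), and the flag spelling --use_indirect_bgemm is an assumption. It reads the flag and compacts argv in place so the remaining arguments can be handed on unchanged.

```cpp
#include <string>

// Hypothetical helper: looks for `--<name>` or `--<name>=<value>` in argv,
// removes every occurrence, updates *argc, and returns the parsed value
// (or default_value if the flag is absent). argv[0] is never touched.
inline bool ExtractBoolFlag(int* argc, char** argv, const std::string& name,
                            bool default_value) {
  if (*argc < 1) return default_value;
  const std::string bare = "--" + name;
  const std::string prefix = bare + "=";
  bool value = default_value;
  int kept = 1;  // index of the next slot to keep; slot 0 is the program name
  for (int i = 1; i < *argc; ++i) {
    const std::string arg = argv[i];
    if (arg == bare) {
      value = true;  // a bare flag counts as "true"
    } else if (arg.compare(0, prefix.size(), prefix) == 0) {
      const std::string v = arg.substr(prefix.size());
      value = (v == "true" || v == "1");
    } else {
      argv[kept++] = argv[i];  // not our flag: keep it for the benchmark
    }
  }
  *argc = kept;
  return value;
}
```

In lce_benchmark_main.cc this would run on argc/argv first, store the result in the global bool, and then pass the filtered arguments to the benchmark as before.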

simonmaurer commented on July 3, 2024

closing issue as it has been solved by #717
