Add ARM32 kernel implementation #432

Merged
merged 24 commits into from
Aug 4, 2020
Commits
7494af9
Add arm32 4x4 kernel impl
Jul 22, 2020
d21344a
Add arm32 4x4 kernel impl
honglh Jul 22, 2020
d94b1e6
Add arm32 4x4 kernel impl
honglh Jul 22, 2020
3263994
Update to clang-format-5.0 format
Jul 30, 2020
f132e64
Remove redundant #if 1
Jul 30, 2020
62fe3f4
Undef constatns after use
Jul 30, 2020
695a4c7
Corrected incorrect ifdef comment
honglh Jul 30, 2020
cba31a5
Add bgemm_kernes_arm32.h as depdendent
honglh Jul 30, 2020
89aac02
Use vpaddl.u8 and u16 better because popcnt will not be negative
Jul 30, 2020
229e94a
use proper kernel for compiled arch
Aug 2, 2020
9716065
use proper kernel for compiled arch
Aug 2, 2020
7cee40f
use proper kernel for compiled arch
Aug 2, 2020
762b55b
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
73600a1
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
6fa0901
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
0a375d4
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
f7b65cc
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
0b141a7
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
9c32d6e
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
4332f55
Update larq_compute_engine/core/bgemm_kernels_arm.h
honglh Aug 3, 2020
32c954c
Update larq_compute_engine/core/bgemm_kernels_arm32.h
honglh Aug 3, 2020
bc4b7f5
Use kNeon for 32-bit input only
honglh Aug 4, 2020
d64ae7c
Use kNeon for 32-bit input only
honglh Aug 4, 2020
4daf896
fix clang-format incompliance
honglh Aug 4, 2020
Use kNeon for 32-bit input only
Co-authored-by: Adam Hillier <[email protected]>
honglh and AdamHillier committed Aug 4, 2020
commit d64ae7cfa1581fde020e7ac31816a551dd0143f2
4 changes: 0 additions & 4 deletions larq_compute_engine/tflite/tests/bconv2d_test.cc
@@ -276,13 +276,9 @@ struct TestParam {
};

const auto kKernelMap = new std::map<string, register_function>({
#if RUY_PLATFORM_ARM_32
{"BConv2D32OPT", compute_engine::tflite::Register_BCONV_2D32_OPT},
#elif RUY_PLATFORM_ARM_64
{"BConv2D64OPT", compute_engine::tflite::Register_BCONV_2D64_OPT},
#else
{"BConv2D32REF", compute_engine::tflite::Register_BCONV_2D32_REF},
#endif
});

class BConv2DOpTest : public ::testing::TestWithParam<TestParamTuple> {
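
For readers unfamiliar with the pattern in the hunk above, here is a minimal, self-contained sketch of a name-to-registration-function map whose single entry is chosen at compile time. It is an illustration only: the `Register*` functions and the `const char*` return type are hypothetical stand-ins, and the compiler-predefined macros `__arm__` and `__aarch64__` are assumed in place of the `RUY_PLATFORM_ARM_32` / `RUY_PLATFORM_ARM_64` macros used by the test file.

```cpp
#include <map>
#include <string>

// Hypothetical stand-in for the real registration function type; the
// actual test registers TFLite ops, which this sketch does not model.
using register_function = const char* (*)();

// Hypothetical registration functions, one per kernel flavour.
const char* RegisterRef32() { return "ref32"; }
const char* RegisterOpt32() { return "opt32"; }
const char* RegisterOpt64() { return "opt64"; }

// Build the kernel map with exactly one entry for the compiled target.
// Assumption: __arm__ / __aarch64__ (GCC and Clang predefined macros)
// replace the ruy platform macros from the original code.
std::map<std::string, register_function> MakeKernelMap() {
  return {
#if defined(__arm__)
      {"BConv2D32OPT", RegisterOpt32},
#elif defined(__aarch64__)
      {"BConv2D64OPT", RegisterOpt64},
#else
      {"BConv2D32REF", RegisterRef32},
#endif
  };
}
```

Because the selection happens in the preprocessor, only the kernel that matches the target architecture is compiled into the map, so a parameterized test iterating over the map exercises just that kernel.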