Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[pull] master from alibaba:master #6

Open
wants to merge 257 commits into
base: master
Choose a base branch
from
Open

[pull] master from alibaba:master #6

wants to merge 257 commits into from

Conversation

pull[bot]
Copy link

@pull pull bot commented Feb 15, 2023

See Commits and Changes for more details.


Created by pull[bot]

Can you help keep this open source service alive? 💖 Please sponsor : )

@pull pull bot added the ⤵️ pull label Feb 15, 2023
wangzhaode and others added 29 commits February 20, 2023 11:23
add create new jni function: nativeFromBuffer
[MNN:Sync] Sync Internal Gitlab (2.4)
fixes typo of Transformer architecture
fix(opencl): fix mLWS dimension bigger than gws
[MNN:Sync] Sync Internal 2.4.1
build(hiai): fix for update hiai to 100.520.020.010
Currently the yield is occuring every time a completion loop
iterates and this is quite an expensive kernel system call. It is
not really required if we break out of the loop, so move the
yield to the end of the do-while loop to reduce the yielding overhead

Perf metrics show that the current code eats up ~2.4% CPU yielding
whereas this change reduces this down to ~0.6% of the total CPU run
time.

Signed-off-by: Colin Ian King <[email protected]>
Only yield at the end if the completion loop needs another pass
[MNN:Sync] Sync Internal Gitlab 2.4.3
jxt1234 and others added 30 commits October 9, 2024 17:49
hiai ddk sunset ddk v2 interface
[update] support repository_dispatch for pymnn_release.
MNN:Bugfix: Fix bug for dynammic quant nan for same value input
MNN:Sync: Sync Internal 2.9.6
Put KleidiAI files in folder source/backend/cpu/arm/kleidiAI/kai,
download from arm gitlab and remain unchanged. Maybe will remove
these files and download them when build.

MNNKleidiAI.cpp is interface between MNN and KleidiAI.

Rewrite function in class DenseConvInt8TiledExecutor
, in ConvInt8TiledExecutor.cpp, to call KleidiAI functions.
Maybe implement a new execution later.

Changes to GeometryConvUtils.cpp and ShapeTensorConvert.cpp are for
the input and output of DenseConvInt8TiledExecutor is NCHW,
rather than NC4HW4, to avoid redundant pack/unpack and get better
performance.
…ithub/workflows/actions/download-artifact-4.1.7

Build(deps): Bump actions/download-artifact from 3 to 4.1.7 in /.github/workflows
Integrate kleidiAI release v0.3.0 into MNN 2.9.6
[MNN:CI] update actions/upload-artifact@v4.
[MNN:CI] update upload/download-artifact@v4.
[MNN:CI] add merge-multiple in download-artifact
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.