Skip to content

[v7] add fp16 conv3x3s2_depthwise for armv7#8140

Merged
chenjiaoAngel merged 2 commits intoPaddlePaddle:developfrom
mjp9527:v7_dw3x3s2
Dec 31, 2021
Merged

[v7] add fp16 conv3x3s2_depthwise for armv7#8140
chenjiaoAngel merged 2 commits intoPaddlePaddle:developfrom
mjp9527:v7_dw3x3s2

Conversation

@mjp9527
Copy link
Collaborator

@mjp9527 mjp9527 commented Dec 29, 2021

No description provided.

@paddle-bot-old
Copy link

Thanks for your contribution!

@mjp9527
Copy link
Collaborator Author

mjp9527 commented Dec 29, 2021

image

*(doutr1 + j) = tmp1[j]; \
}

inline std::pair<uint16_t, uint16_t> right_mask_3x3s2p01_fp16(int w_in,
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

为什么删除inline

: "cc", "memory", "q0", "q1", "q2", "q3", "q4", "q5", "q6", "q7",\
"q8", "q9", "q10", "q11", "q12", "q13", "q14", "q15");

asm volatile(
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

为什么分开写呢?

Copy link
Collaborator Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

这个我在注释里标了,写在一起通用寄存器不够用

Copy link
Collaborator

@chenjiaoAngel chenjiaoAngel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@chenjiaoAngel chenjiaoAngel merged commit 17e52c0 into PaddlePaddle:develop Dec 31, 2021
WeiLi233 pushed a commit to WeiLi233/Paddle-Lite that referenced this pull request Mar 29, 2022
@mjp9527 mjp9527 deleted the v7_dw3x3s2 branch November 28, 2022 12:05
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants