Weight Preshuffle Block Scale gemm support #2785
base: develop
LGTM overall. I added a few comments; please sync with the develop branch to resolve the merge conflicts.
{
using BlockWarps = typename Problem::BlockGemmShape::BlockWarps;
using WarpTile = typename Problem::BlockGemmShape::WarpTile;
// static_assert(std::is_same_v<typename Problem::ComputeDataType, ck_tile::pk_int4_t>,
Extra comments; please remove the commented-out static_assert.
};

template <typename Problem, typename PipelinePolicy = GemmWPQuantPipelineAgBgCrPolicy>
struct WPQuantBPipelineAgBgCrV1 : public BaseWPQuantBPipelineAgBgCrV1<Problem>
We are now upgrading to V2, so let's use the V2 pipeline.
Proposed changes
Added support for a preshuffled B (weight) tensor with block-scale GEMM.
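For readers unfamiliar with the term, here is a minimal CPU reference sketch of what a block-scale GEMM computes. It is not the ck_tile implementation from this PR. The function name block_scale_gemm_ref, the int8 weight type, the row-major layouts, and the group_k parameter are all assumptions made for illustration. The preshuffled weight layout is not modeled, since it only changes how B is stored, not the result.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical reference: C[m][n] = sum_g scale[g][n] * sum_{k in group g} A[m][k] * Bq[k][n]
// Assumed layouts (not taken from the PR):
//   a     : M x K, row-major, float activations
//   bq    : K x N, row-major, int8 quantized weights
//   scale : (K / group_k) x N, row-major, one float scale per K-group and column
std::vector<float> block_scale_gemm_ref(const std::vector<float>& a,
                                        const std::vector<std::int8_t>& bq,
                                        const std::vector<float>& scale,
                                        std::size_t M,
                                        std::size_t N,
                                        std::size_t K,
                                        std::size_t group_k)
{
    assert(group_k > 0 && K % group_k == 0);
    assert(a.size() == M * K && bq.size() == K * N);
    assert(scale.size() == (K / group_k) * N);

    std::vector<float> c(M * N, 0.0f);
    for(std::size_t m = 0; m < M; ++m)
    {
        for(std::size_t n = 0; n < N; ++n)
        {
            float acc = 0.0f;
            for(std::size_t g = 0; g < K / group_k; ++g)
            {
                // Accumulate the unscaled partial product for this K-group,
                // then apply the group's scale once.
                float partial = 0.0f;
                for(std::size_t kk = 0; kk < group_k; ++kk)
                {
                    const std::size_t k = g * group_k + kk;
                    partial += a[m * K + k] * static_cast<float>(bq[k * N + n]);
                }
                acc += partial * scale[g * N + n];
            }
            c[m * N + n] = acc;
        }
    }
    return c;
}
```

A reference like this is typically useful only for validating a GPU kernel's output on small shapes; applying the scale once per group rather than per element mirrors the usual motivation for block scaling.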
Checklist
Please put an x into the boxes that apply. You can also fill these out after creating the PR. If you're not sure, please don't hesitate to ask.
clang-format on all changed files
Discussion
If this is a relatively large or complex change, feel free to start a discussion by explaining why you chose the solution you did and what alternatives you considered.