-
Notifications
You must be signed in to change notification settings - Fork 34
Add benchmark for grouping by GroupID for maven packages #747
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
@wbxyz should we do this in the SQL query instead? Although the tables don't seem to have individual group and artifact columns. |
c4f13aa
to
def2985
Compare
def2985
to
c28d47d
Compare
I added another benchmark |
} | ||
close(pkgs) | ||
}() | ||
seenGroups := make(map[string]struct{}) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If this were a map[string]bool then you can assign true as the value and just do something like if seenGroups[g] { ...
if len(psp.Versions) >= 5 { | ||
continue | ||
} | ||
nameParts := strings.SplitN(p.Package, ":", 2) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can this be combined with the group split above?
log.Fatal(err.Error()) | ||
} | ||
query := client.Query(` | ||
SELECT |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this SQL needs to be updated following some of the other PRs
Achieved by #772. |
During inference, the errors are often duplicated in the same group ID. This will ensure that benchmark has unique group IDs and which roughly implies unique source repositories.