-
Notifications
You must be signed in to change notification settings - Fork 29k
SPARK-1628: Add missing hashCode methods in Partitioner subclasses #549
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
Can one of the admins verify this patch? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If using numPartitions = partitions, there is a chance that p1 == p2 && p1.numPartitions != p2.numPartitions is true. For example, if rdd.sample is empty, p1 = new RangePartitioner[...](10, rdd, true), and p2 = new RangePartitioner[...](1, rdd, true).
That's confusing. So I changed partitions to rangeBounds.length + 1.
|
Is there any further suggestion about this one? |
…pache#549. remove actorToWorker in master.scala, which is actually not used actorToWorker is actually not used in the code....just remove it Author: CodingCat <[email protected]> == Merge branch commits == commit 52656c2d4bbf9abcd8bef65d454badb9cb14a32c Author: CodingCat <[email protected]> Date: Thu Feb 6 00:28:26 2014 -0500 remove actorToWorker in master.scala, which is actually not used
|
Jenkins, test this please. |
|
This looks good to me pending test passes. |
|
Merged build triggered. |
|
Merged build started. |
|
Merged build finished. All automated tests passed. |
|
All automated tests passed. |
|
Thanks. I'm merging this in master. |
Adding a paragraph clarifying a weird behavior in RangePartitioner. See also #549. Author: Reynold Xin <[email protected]> Closes #1012 from rxin/partitioner-doc and squashes the following commits: 6f0109e [Reynold Xin] SPARK-1628 follow up: Improve RangePartitioner's documentation.
JIRA: https://issues.apache.org/jira/browse/SPARK-1628 Added `hashCode` in HashPartitioner, RangePartitioner, PythonPartitioner and PageRankUtils.CustomPartitioner. Author: zsxwing <[email protected]> Closes apache#549 from zsxwing/SPARK-1628 and squashes the following commits: 2620936 [zsxwing] SPARK-1628: Add missing hashCode methods in Partitioner subclasses
Adding a paragraph clarifying a weird behavior in RangePartitioner. See also apache#549. Author: Reynold Xin <[email protected]> Closes apache#1012 from rxin/partitioner-doc and squashes the following commits: 6f0109e [Reynold Xin] SPARK-1628 follow up: Improve RangePartitioner's documentation.
JIRA: https://issues.apache.org/jira/browse/SPARK-1628 Added `hashCode` in HashPartitioner, RangePartitioner, PythonPartitioner and PageRankUtils.CustomPartitioner. Author: zsxwing <[email protected]> Closes apache#549 from zsxwing/SPARK-1628 and squashes the following commits: 2620936 [zsxwing] SPARK-1628: Add missing hashCode methods in Partitioner subclasses
Adding a paragraph clarifying a weird behavior in RangePartitioner. See also apache#549. Author: Reynold Xin <[email protected]> Closes apache#1012 from rxin/partitioner-doc and squashes the following commits: 6f0109e [Reynold Xin] SPARK-1628 follow up: Improve RangePartitioner's documentation.
should use -a in tee
### What changes were proposed in this pull request? In ProjectingInternalRow, accessing colOrdinals causes poor performace. Replace colOrdinals with the IndexedSeq type. ### Why are the changes needed? Replace colOrdinals with the IndexedSeq type. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? No need to add UT ### Was this patch authored or co-authored using generative AI tooling? No Closes apache#47890 from wzx140/project-row-fix. Lead-authored-by: wzx <[email protected]> (cherry picked from commit 37f2fa9) Signed-off-by: Kent Yao <[email protected]> Co-authored-by: wzx <[email protected]> Co-authored-by: Kent Yao <[email protected]>
JIRA: https://issues.apache.org/jira/browse/SPARK-1628
Added
hashCodein HashPartitioner, RangePartitioner, PythonPartitioner and PageRankUtils.CustomPartitioner.