@markhamstra
No description provided.

rxin and others added 12 commits September 23, 2015 16:43
… in Python DataFrame.

Python DataFrame.head/take currently require scanning all the partitions. This pull request changes them to delegate to the Scala DataFrame implementation (by calling DataFrame.take).

This is more of a hack to fix the issue in 1.5.1. A proper fix would change executeCollect and executeTake to return InternalRow rather than Row, eliminating the extra round-trip conversion.
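To illustrate why scanning every partition matters for head/take, here is a minimal toy sketch in plain Python. It is not Spark code: partitions are modeled as plain lists, and the function names `take_scanning_all` and `take_incremental` are hypothetical, chosen only for this illustration. It contrasts a take that reads every partition with one that stops as soon as it has enough rows.

```python
from typing import List, Sequence, Tuple


def take_scanning_all(partitions: Sequence[List[int]], n: int) -> Tuple[List[int], int]:
    """Toy model: read every partition, then keep the first n rows."""
    scanned = 0
    rows: List[int] = []
    for part in partitions:
        scanned += 1
        rows.extend(part)
    return rows[:n], scanned


def take_incremental(partitions: Sequence[List[int]], n: int) -> Tuple[List[int], int]:
    """Toy model: stop reading partitions once n rows have been gathered."""
    scanned = 0
    rows: List[int] = []
    for part in partitions:
        if len(rows) >= n:
            break
        scanned += 1
        rows.extend(part)
    return rows[:n], scanned


# Four hypothetical partitions of three rows each.
partitions = [[1, 2, 3], [4, 5, 6], [7, 8, 9], [10, 11, 12]]

rows_all, scanned_all = take_scanning_all(partitions, 2)
rows_inc, scanned_inc = take_incremental(partitions, 2)
print(rows_all, scanned_all)
print(rows_inc, scanned_inc)
```

Both functions return the same rows; only the number of partitions touched differs, which is the cost the commit message describes.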

Author: Reynold Xin <[email protected]>

Closes apache#8876 from rxin/SPARK-10731.

(cherry picked from commit 9952217)
Signed-off-by: Reynold Xin <[email protected]>
…array (round 2)

This patch reverts most of the changes in a previous fix apache#8827.

The real cause of the issue is that in `TungstenAggregate`'s prepare method we only reserve 1 page, but later when we switch to sort-based aggregation we try to acquire 1 page AND a pointer array. The longer-term fix should be to also reserve the pointer array, but for now ***we will simply not track the pointer array***. (Note that elsewhere we already don't track the pointer array, e.g. [here](https://github.com/apache/spark/blob/a18208047f06a4244703c17023bb20cbe1f59d73/sql/core/src/main/java/org/apache/spark/sql/execution/UnsafeKVExternalSorter.java#L88))
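The under-reservation described above can be sketched with a toy memory manager in plain Python. Everything here is an assumption made for illustration: `ToyMemoryManager`, the page and pointer-array sizes, and the release-then-reacquire step are hypothetical simplifications, not Spark's actual memory manager or `TungstenAggregate` code.

```python
PAGE = 4096           # hypothetical page size in bytes
POINTER_ARRAY = 1024  # hypothetical pointer-array size in bytes


class ToyMemoryManager:
    """Toy all-or-nothing allocator; not Spark's memory manager."""

    def __init__(self, total_bytes: int) -> None:
        self.free = total_bytes

    def acquire(self, nbytes: int) -> int:
        """Grant nbytes if available; return the bytes granted (0 on failure)."""
        if nbytes <= self.free:
            self.free -= nbytes
            return nbytes
        return 0

    def release(self, nbytes: int) -> None:
        self.free += nbytes


def switch_to_sort(manager: ToyMemoryManager, reserved: int, track_pointer_array: bool) -> bool:
    """Hand the reserved page to the sorter, which may also request a pointer array."""
    manager.release(reserved)
    needed = PAGE + (POINTER_ARRAY if track_pointer_array else 0)
    return manager.acquire(needed) == needed


def run(track_pointer_array: bool) -> bool:
    manager = ToyMemoryManager(total_bytes=2 * PAGE)
    reserved = manager.acquire(PAGE)  # the prepare step reserves only one page
    manager.acquire(manager.free)     # other consumers take all remaining memory
    return switch_to_sort(manager, reserved, track_pointer_array)


print(run(track_pointer_array=True))
print(run(track_pointer_array=False))
```

With the pointer array counted against the budget, the request exceeds the single reserved page and fails; leaving it untracked lets the switch succeed, at the cost of the accounting no longer covering that allocation.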

Note: This patch reuses the unit test added in apache#8827 so it doesn't show up in the diff.

Author: Andrew Or <[email protected]>

Closes apache#8888 from andrewor14/dont-track-pointer-array.

(cherry picked from commit 83f6f54)
Signed-off-by: Andrew Or <[email protected]>
Author: Reynold Xin <[email protected]>

Closes apache#8890 from rxin/release-1.5.1.
…aming UI to clear failed batches

Slightly modified version of apache#8818; all credit goes to zsxwing.

Author: zsxwing <[email protected]>
Author: Tathagata Das <[email protected]>

Closes apache#8892 from tdas/SPARK-10692.

(cherry picked from commit 758c9d2)
Signed-off-by: Reynold Xin <[email protected]>
markhamstra added a commit that referenced this pull request Sep 25, 2015
@markhamstra markhamstra merged commit fdba912 into alteryx:csd-1.5 Sep 25, 2015

4 participants