@HyukjinKwon
Member

What changes were proposed in this pull request?

This PR proposes to fix the Binder integration by using a Dockerfile directly.

Why are the changes needed?

Binder integration is broken now (https://mybinder.org/v2/gh/apache/spark/bb7846dd487?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb):

Screenshot 2024-08-27 at 2 04 35 PM

This seems to be related to the size of the repository (jupyterhub/mybinder.org-deploy#3074).

I tried every alternative I could find, but the only approach that worked was using a Dockerfile.

Does this PR introduce any user-facing change?

Yes. This should recover the Binder integration.

How was this patch tested?

Manually tested within my fork:

https://mybinder.org/v2/gh/HyukjinKwon/spark/binder-test1?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb
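
For reference, the Binder links in this description follow the pattern `https://mybinder.org/v2/gh/<owner>/<repo>/<ref>?filepath=<percent-encoded path>`. Below is a minimal sketch of building such a link for a fork when re-testing. The helper name `binder_url` is hypothetical and exists only for illustration; it is not part of Spark or Binder.

```python
from urllib.parse import quote


def binder_url(owner: str, repo: str, ref: str, notebook_path: str) -> str:
    """Build a mybinder.org launch URL for a notebook in a GitHub repository.

    Hypothetical helper for illustration only.
    """
    # safe="" makes quote() percent-encode "/" as "%2F", as in the links here.
    encoded_path = quote(notebook_path, safe="")
    return f"https://mybinder.org/v2/gh/{owner}/{repo}/{ref}?filepath={encoded_path}"


url = binder_url(
    "HyukjinKwon",
    "spark",
    "binder-test1",
    "python/docs/source/getting_started/quickstart_df.ipynb",
)
print(url)
```

Swapping in another owner, branch, or notebook path should give the corresponding launch link for that fork.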

Was this patch authored or co-authored using generative AI tooling?

No.

Member

@dongjoon-hyun dongjoon-hyun left a comment

+1, LGTM.

@HyukjinKwon
Member Author

Thanks all!

Merged to master, branch-3.5 and branch-3.4.

HyukjinKwon added a commit that referenced this pull request Aug 27, 2024
This PR proposes to fix Binder integration by using `Dockerfile` directly.

Binder integration is broken now (https://mybinder.org/v2/gh/apache/spark/bb7846dd487?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb):

![Screenshot 2024-08-27 at 2 04 35 PM](https://github.com/user-attachments/assets/29222fc2-7cc6-43fa-8e04-a65c8384c4d5)

This seems to be related to the size of the repository (jupyterhub/mybinder.org-deploy#3074).

I tried every alternative I could find, but the only approach that worked was using a `Dockerfile`.

Yes. This should recover the Binder integration.

Manually tested within my fork:

https://mybinder.org/v2/gh/HyukjinKwon/spark/binder-test1?filepath=python%2Fdocs%2Fsource%2Fgetting_started%2Fquickstart_df.ipynb

No.

Closes #47883 from HyukjinKwon/binder-test1.

Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit 9fc1e05)
Signed-off-by: Hyukjin Kwon <[email protected]>
HyukjinKwon added a commit that referenced this pull request Aug 28, 2024
…nnect notebook

### What changes were proposed in this pull request?

This is a followup of #47883 that adds manual `source ~/.profile`.

### Why are the changes needed?

Ever since we switched to `Dockerfile`, none of `~/.profile`, `~/.bashrc`, `~/.bash_profile`, etc. seems to be working. There are a couple of related issues in Jupyter, but I could not figure out the cause.

This is the only cell that needs the environment variable, so I decided to simply work around it.
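
As an illustration of this kind of workaround (not the exact change made in the notebook), here is a minimal Python sketch that sources a profile file in a child `bash` process and copies the resulting variables into the current process. It assumes a Linux image where `bash` and an `env` that supports `-0` are available; the function name `load_profile_env` is hypothetical.

```python
import os
import subprocess


def load_profile_env(profile: str = "~/.profile") -> dict:
    """Return the environment that results from sourcing `profile` in bash.

    Hypothetical helper; assumes `bash` and `env -0` exist on the image.
    """
    path = os.path.expanduser(profile)
    # Source the file in a child shell, then dump its environment with
    # NUL separators so values containing newlines are parsed correctly.
    result = subprocess.run(
        ["bash", "-c", 'source "$1" >/dev/null 2>&1; env -0', "bash", path],
        check=True,
        capture_output=True,
    )
    env = {}
    for entry in result.stdout.split(b"\0"):
        if b"=" in entry:
            key, _, value = entry.partition(b"=")
            env[os.fsdecode(key)] = os.fsdecode(value)
    return env


if __name__ == "__main__":
    # Make the profile's variables visible to later cells in this process.
    os.environ.update(load_profile_env())
```

A child shell cannot modify its parent's environment, which is why the sketch reads the variables back and applies them with `os.environ.update` rather than only running `source`.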

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Manually tested.

### Was this patch authored or co-authored using generative AI tooling?

No.

Closes #47902 from HyukjinKwon/SPARK-49402-followup.

Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
HyukjinKwon added a commit that referenced this pull request Aug 28, 2024
…nnect notebook

This is a followup of #47883 that adds manual `source ~/.profile`.

Ever since we switched to `Dockerfile`, none of `~/.profile`, `~/.bashrc`, `~/.bash_profile`, etc. seems to be working. There are a couple of related issues in Jupyter, but I could not figure out the cause.

This is the only cell that needs the environment variable, so I decided to simply work around it.

No.

Manually tested.

No.

Closes #47902 from HyukjinKwon/SPARK-49402-followup.

Authored-by: Hyukjin Kwon <[email protected]>
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit 1c9cde5)
Signed-off-by: Hyukjin Kwon <[email protected]>
(cherry picked from commit df07aa7)
Signed-off-by: Hyukjin Kwon <[email protected]>