Skip to content

Conversation

@felixcheung
Copy link
Member

What changes were proposed in this pull request?

Update doc to remove external for createTable, add refreshByPath in python

How was this patch tested?

manual

@SparkQA
Copy link

SparkQA commented Apr 2, 2017

Test build #75466 has finished for PR 17512 at commit 573b265.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@felixcheung
Copy link
Member Author

@gatorsmile

@felixcheung
Copy link
Member Author

will update after #17518 + changes to R doc too

@felixcheung
Copy link
Member Author

updated @gatorsmile

@felixcheung felixcheung changed the title [SPARK-20196][PYTHON][SQL] update doc for catalog functions, pyspark refreshByPath API [SPARK-20196][PYTHON][SQL] update doc for catalog functions for all languages, add pyspark refreshByPath API Apr 4, 2017
@SparkQA
Copy link

SparkQA commented Apr 4, 2017

Test build #75515 has finished for PR 17512 at commit 2900afe.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

}

#' Create a SparkDataFrame from a SparkSQL Table
#' Create a SparkDataFrame from a SparkSQL table or temporary view
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

table or view

Here, actually, it includes both temporary views or persistent views.

#'
#' Returns the specified Table as a SparkDataFrame. The Table must have already been registered
#' in the SparkSession.
#' Returns the specified table or temporary view as a SparkDataFrame. The temporary view must have
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same here.

"""Recover all the partitions of the given table and update the catalog."""
"""Recovers all the partitions of the given table and update the catalog.
Only works with a partitioned table, and not a temporary view.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

a temporary view -> a view

Both temporary and persistent views are not supported. We will detect and issue exceptions.


/**
* Recovers all the partitions in the directory of a table and update the catalog.
* Only works with a partitioned table, and not a temporary view.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same here.

/**
* Refreshes the cache entry and the associated metadata for all Dataset (if any), that contain
* the given data source path.
* the given data source path. Path matching is by prefix, i.e. "/" would invalidate
Copy link
Member

@gatorsmile gatorsmile Apr 4, 2017

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

invalidate -> invalidate and refresh

We also do the re-cache, but the new version cached lazily.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

For some reason in here, CatalogImpl.scala is very different from Catalog.scala - let me know if you want me to change them - for now I've updated the first sentence.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes. I found this sentence is copied from Catalog.scala. Maybe, we can update them to

Path matching is by prefix, i.e. "/" would invalidate all the cached entries and make the new versions cached lazily.


/**
* Recovers all the partitions in the directory of a table and update the catalog.
* Only works with a partitioned table, and not a temporary view.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The same here.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

not a temporary view.
->
not a view.

@SparkQA
Copy link

SparkQA commented Apr 5, 2017

Test build #75534 has started for PR 17512 at commit 3e0ccd3.

@felixcheung
Copy link
Member Author

Jenkins, retest this please

@SparkQA
Copy link

SparkQA commented Apr 5, 2017

Test build #75536 has finished for PR 17512 at commit 3e0ccd3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

#'
#' Returns the specified Table as a SparkDataFrame. The Table must have already been registered
#' in the SparkSession.
#' Returns the specified table or view as a SparkDataFrame. The table or view must already exists or
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nit: exists -> exist

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fixed, thanks for catching this!

@gatorsmile
Copy link
Member

LGTM except minor comments.

@SparkQA
Copy link

SparkQA commented Apr 5, 2017

Test build #75555 has finished for PR 17512 at commit 5ed1950.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@asfgit asfgit closed this in bccc330 Apr 6, 2017
@felixcheung
Copy link
Member Author

merged to master, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants