Skip to content

[SPARK-2871] [PySpark] Add missing API#1791

Closed
davies wants to merge 24 commits into
apache:masterfrom
davies:api
Closed

[SPARK-2871] [PySpark] Add missing API#1791
davies wants to merge 24 commits into
apache:masterfrom
davies:api

Conversation

@davies

@davies davies commented Aug 5, 2014

Copy link
Copy Markdown
Contributor

Add the following APIs:

SparkContext.conf
SparkContext.isLocal
SparkContext.startTime

@SparkQA

SparkQA commented Aug 5, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17953/consoleFull

@SparkQA

SparkQA commented Aug 5, 2014

Copy link
Copy Markdown

QA results for PR 1791:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17953/consoleFull

@SparkQA

SparkQA commented Aug 5, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17958/consoleFull

@davies davies changed the title [WIP] [PySpark] Add missing API [PySpark] Add missing API Aug 5, 2014
@SparkQA

SparkQA commented Aug 5, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17959/consoleFull

@davies davies changed the title [PySpark] Add missing API [SPARK-2871] [PySpark] Add missing API Aug 6, 2014
@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA results for PR 1791:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/17958/consoleFull

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Convert a RDD of Java objects to and RDD of serialized Python objects
=>
Convert an RDD of Java objects to an RDD of serialized Python objects ?

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18001/consoleFull

@davies

davies commented Aug 6, 2014

Copy link
Copy Markdown
Contributor Author

The histogram() had been implemented in pure Python, it will support integer better, also it will support RDD of strings and other comparable objects.

This was inspired by #1783 et, and much improved.

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA results for PR 1791:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18001/consoleFull

Comment thread python/pyspark/rdd.py Outdated

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This was smart!

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18032/consoleFull

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA results for PR 1791:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds no public classes

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18032/consoleFull

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18055/consoleFull

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA results for PR 1791:
- This patch FAILED unit tests.
- This patch merges cleanly
- This patch adds the following public classes (experimental):
class BoundedFloat(float):

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18055/consoleFull

@SparkQA

SparkQA commented Aug 6, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18058/consoleFull

@davies

davies commented Aug 14, 2014

Copy link
Copy Markdown
Contributor Author

Jenkins, test this please

@SparkQA

SparkQA commented Aug 14, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791. This patch merges cleanly.
View progress: https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18498/consoleFull

@SparkQA

SparkQA commented Aug 14, 2014

Copy link
Copy Markdown

QA results for PR 1791:
- This patch PASSES unit tests.
- This patch merges cleanly
- This patch adds the following public classes (experimental):
class BoundedFloat(float):

For more information see test ouptut:
https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/18498/consoleFull

Comment thread python/pyspark/rdd.py Outdated

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe explain what "comp" is in the doc comment

@mateiz

mateiz commented Aug 14, 2014

Copy link
Copy Markdown
Contributor

@davies I looked over all of this now and made some comments, but you should have Josh check too. Just to be clear though, I don't think this can make it into 1.1, so we can hold off on it for a while while we fix issues for 1.1. But these are great APIs to have.

1. implement lookup(), similar to that in Scala
2. handle None, nan, inf in histogram, add many tests
3. remove collectPartitions()
4. improve docs
@davies

davies commented Aug 14, 2014

Copy link
Copy Markdown
Contributor Author

@mateiz thanks for review this, I had addressed all you comments.

@JoshRosen could you take a look a this again?

@davies

davies commented Aug 14, 2014

Copy link
Copy Markdown
Contributor Author

The description had been updated to list all the added APIs.

Conflicts:
	python/pyspark/rdd.py
	python/pyspark/tests.py
@davies

davies commented Aug 22, 2014

Copy link
Copy Markdown
Contributor Author

@mateiz @JoshRosen some APIs has been splitted out as separated PRs: #2091, #2092, #2093, #2094, #2095

Conflicts:
	core/src/main/scala/org/apache/spark/api/python/PythonRDD.scala
	python/pyspark/rdd.py
@SparkQA

SparkQA commented Aug 26, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791 at commit 28fd368.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Aug 26, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791 at commit 1ac98d6.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Aug 26, 2014

Copy link
Copy Markdown

QA tests have started for PR 1791 at commit 657a09b.

  • This patch merges cleanly.

@SparkQA

SparkQA commented Aug 26, 2014

Copy link
Copy Markdown

QA tests have finished for PR 1791 at commit 28fd368.

  • This patch fails unit tests.
  • This patch merges cleanly.
  • This patch adds the following public classes (experimental):
    • class BoundedFloat(float):

@SparkQA

SparkQA commented Aug 26, 2014

Copy link
Copy Markdown

QA tests have finished for PR 1791 at commit 1ac98d6.

  • This patch fails unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA

SparkQA commented Aug 26, 2014

Copy link
Copy Markdown

QA tests have finished for PR 1791 at commit 657a09b.

  • This patch fails unit tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@davies

davies commented Aug 27, 2014

Copy link
Copy Markdown
Contributor Author

Most of useful parts have been merged separately, so close this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants