Skip to content

[SPARK-17645][MLLIB][ML][FOLLOW-UP] document minor change#16434

Closed
mpjlu wants to merge 3 commits into
apache:masterfrom
mpjlu:fdr_fwe_update
Closed

[SPARK-17645][MLLIB][ML][FOLLOW-UP] document minor change#16434
mpjlu wants to merge 3 commits into
apache:masterfrom
mpjlu:fdr_fwe_update

Conversation

@mpjlu

@mpjlu mpjlu commented Dec 29, 2016

Copy link
Copy Markdown

What changes were proposed in this pull request?

Add FDR test case in ml/feature/ChiSqSelectorSuite.
Improve some comments in the code.
This is a follow-up pr for #15212.

How was this patch tested?

ut

@SparkQA

SparkQA commented Dec 29, 2016

Copy link
Copy Markdown

Test build #70721 has finished for PR 16434 at commit 17a7288.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@jkbradley jkbradley left a comment

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the PR! Responding to your comment in the other PR, I do think it's worth creating a new test set (or replacing the current one?) which lets us test FDR.

Comment thread python/pyspark/ml/feature.py Outdated

Chi-Squared feature selection, which selects categorical features to use for predicting a
categorical label.
Creates a ChiSquared feature selector.

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This line is now duplicated info, so I'd remove it.

@jkbradley

Copy link
Copy Markdown
Member

Also, could you please change the PR description to be self-contained (rather than just referencing another PR)? The description becomes the commit message.

@SparkQA

SparkQA commented Dec 30, 2016

Copy link
Copy Markdown

Test build #70744 has finished for PR 16434 at commit 2d87dd3.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@mpjlu

mpjlu commented Jan 5, 2017

Copy link
Copy Markdown
Author

Hi @jkbradley , I have updated this PR per your comments. Thanks.

chisq.test(x2,y)
chisq.test(x3,y)
/*
* Contingency tables

Copy link
Copy Markdown
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It's kind of nice seeing these tables, but for validation, it's much easier for users to check using R. Could you please include a section for verifying the results with R, like there was before?

@jkbradley

Copy link
Copy Markdown
Member

Thanks @mpjlu ! The changes look good, except that I'd like to have a code snippet for verifying with R.

@SparkQA

SparkQA commented Jan 6, 2017

Copy link
Copy Markdown

Test build #70957 has finished for PR 16434 at commit 840ddfc.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@mpjlu

mpjlu commented Jan 6, 2017

Copy link
Copy Markdown
Author

Thanks @jkbradley @srowen , I have added a code snippet for verifying with R.
The statics and p-value data in the comment is generated by R.

@srowen

srowen commented Jan 10, 2017

Copy link
Copy Markdown
Member

Merged to master

@asfgit asfgit closed this in 32286ba Jan 10, 2017
uzadude pushed a commit to uzadude/spark that referenced this pull request Jan 27, 2017
## What changes were proposed in this pull request?
Add FDR test case in ml/feature/ChiSqSelectorSuite.
Improve some comments in the code.
This is a follow-up pr for apache#15212.

## How was this patch tested?
ut

Author: Peng, Meng <peng.meng@intel.com>

Closes apache#16434 from mpjlu/fdr_fwe_update.
cmonkey pushed a commit to cmonkey/spark that referenced this pull request Feb 15, 2017
## What changes were proposed in this pull request?
Add FDR test case in ml/feature/ChiSqSelectorSuite.
Improve some comments in the code.
This is a follow-up pr for apache#15212.

## How was this patch tested?
ut

Author: Peng, Meng <peng.meng@intel.com>

Closes apache#16434 from mpjlu/fdr_fwe_update.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants