Feature/extensible parsers by blrnw3 · Pull Request #1 · quartethealth/spark-csv

blrnw3 · 2016-02-11T23:02:03Z

Currently, a lot of functionality is hidden behind private access modifiers, making it difficult to extend the parsers without copying large chunks of code. These small changes make reuse much easier.

I was able to reinstate some of the access modifiers by instead extracting the extensible code from those methods into new overridable methods. I also added some hierarchy to the parsers/readers so they can be switched out.
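
As a rough illustration of the pattern described above (shared logic kept in a base class, with the format-specific step pulled out into an overridable method), here is a minimal sketch. All names (`LineReader`, `splitLine`, `DelimitedReader`, `FixedWidthReader`) are hypothetical and are not the actual spark-csv classes:

```scala
// Hypothetical names throughout; this is not the actual spark-csv code.
// Base reader: the entry point is fixed, the per-line step is overridable.
abstract class LineReader {
  // Hook that subclasses override to change how a single line is split.
  protected def splitLine(line: String): Array[String]

  // Shared logic lives in one place, so subclasses do not have to copy it.
  final def parseAll(lines: Seq[String]): Seq[Array[String]] =
    lines.filter(_.nonEmpty).map(splitLine)
}

// Delimiter-based variant.
class DelimitedReader(delimiter: Char) extends LineReader {
  override protected def splitLine(line: String): Array[String] =
    line.split(delimiter)
}

// Fixed-width variant: fields are cut at the given (start, end) offsets.
class FixedWidthReader(offsets: Seq[(Int, Int)]) extends LineReader {
  override protected def splitLine(line: String): Array[String] =
    offsets.map { case (start, end) =>
      // Clamp both offsets so short lines do not throw.
      val from = math.min(start, line.length)
      val to = math.min(end, line.length)
      line.substring(from, to).trim
    }.toArray
}
```

Because both readers share the `LineReader` type, a caller can switch one out for the other without touching the shared parsing loop.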

mustafashabib · 2016-02-11T23:11:04Z

@thyming @pheuter Hopefully you connected with @blrnw3 about this because I think I utterly confused myself - but I think if this is merged then you can import it into your projects and all is right in the world.

blrnw3 · 2016-02-12T16:46:14Z

Sorry, that was my fault for not explaining it well at all. I'm going to merge this in now since it is the same PR as the one for databricks-spark-csv: databricks#259

So if you guys have any comments they'd be better off going on that PR.

blrnw3 · 2016-02-12T20:42:00Z

@mustafashabib Please can you give me write access to this repo? Thanks

```
remote: Permission to quartethealth/spark-csv.git denied to blrnw3.
fatal: unable to access 'https://github.com/quartethealth/spark-csv.git/': The requested URL returned error: 403
```

mustafashabib · 2016-02-12T21:08:02Z

Sorry about that Ben - can you try again?

blrnw3 · 2016-02-12T21:09:56Z

Works now. thanks!

thyming · 2016-02-17T15:06:44Z

+}
+
+/**
+ * Allows for greater extensibility


This comment and the one above don't really add anything IMO

thyming · 2016-02-17T15:11:37Z

Can you create tests for this?

thyming · 2016-02-17T15:21:52Z

 import org.apache.spark.rdd.RDD

-private[csv] object TextFile {
+object TextFile {


Why is this access modifier changed?

blrnw3 · 2016-02-17

I need it here: https://github.com/quartethealth/spark-fixedwidth/blob/feature/fixed-width-parsing/src/main/scala/com/quartethealth/spark/fixedwidth/package.scala#L18
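
For context, a minimal sketch of what dropping the `private[csv]` qualifier changes. Nested objects stand in for packages so the snippet runs as a single script, and the object bodies and the name `InternalTextFile` are hypothetical, not the real spark-csv code:

```scala
// Nested objects stand in for packages here so the snippet runs as one script.
object csv {
  // Restricted: only code inside `csv` can refer to this object.
  private[csv] object InternalTextFile {
    def describe: String = "restricted"
  }

  // Unrestricted: any code can refer to this object.
  object TextFile {
    def describe: String = "public"
  }

  // Code inside `csv` can still use the restricted object.
  def internalDescribe: String = InternalTextFile.describe
}

object fixedwidth {
  // Compiles because csv.TextFile carries no access qualifier.
  val fromPublic: String = csv.TextFile.describe

  // Would not compile: InternalTextFile is private[csv].
  // val fromInternal: String = csv.InternalTextFile.describe
}
```

With the qualifier in place, a separate project such as the fixed-width parser would presumably have to copy the object rather than reference it.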

thyming · 2016-02-17T15:24:16Z

Is there a separate PR that you're preparing for dealing with fixed-width data?

blrnw3 · 2016-02-17T15:28:45Z

Thanks for the comments. I'll work on those changes now.
I'm not sure there's any value in extra tests for this, as the changes are pretty low-level / implementation-oriented. I've verified that the existing tests run ok.

The fixed-width parser PR: quartethealth/spark-fixedwidth#1

blrnw3 and others added 2 commits February 10, 2016 17:59

Change a few access modifiers to improve extensibility.

67cf5fd

Currently a lot of functionality is hidden behind private access modifiers making it difficult to extend the functionality of the parsers without copying large chunks of code. These small changes make re-use much easier.

Further extensibility improvements.

be52806

Was able to reinstate some of the access modifiers and instead extracted out extensible code from those methods into new overridable methods. Added some hierarchy to the parsers/readers so they can be switched out.

blrnw3 pushed a commit that referenced this pull request Feb 12, 2016

Merge pull request #1 from blrnw3/feature/extensible_parsers

6a9e50c

Feature/extensible parsers

blrnw3 merged commit 6a9e50c into quartethealth:master Feb 12, 2016

thyming reviewed Feb 17, 2016