To make the 'cascading' gem available to your JRuby scripts, install it as usual:
jruby -S gem install cascading.jruby
However, this does not download the dependencies you need (Cascading and Hadoop), and it does not let you run the samples, which live in the cascading.jruby repository but are not packaged with the gem. The following sections will get you set up to run the samples locally.
Samples
Prerequisites
Java
Ant
JRuby
Bundler
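Before going further, it can help to confirm that the prerequisite commands are actually on your PATH. The following is a minimal sketch, not part of cascading.jruby: the helper name missing_tools is hypothetical, and it assumes a Unix-like system where commands are plain executable files named java, ant, and jruby. Bundler is left out because the commands on this page invoke it through jruby -S rather than directly.

```ruby
# Hypothetical helper (not part of cascading.jruby): return the subset of
# `tools` that cannot be found as an executable file in any directory of `path`.
def missing_tools(tools, path = ENV.fetch("PATH", ""))
  dirs = path.split(File::PATH_SEPARATOR).reject(&:empty?)
  tools.reject do |tool|
    dirs.any? do |dir|
      candidate = File.join(dir, tool)
      File.file?(candidate) && File.executable?(candidate)
    end
  end
end

if __FILE__ == $PROGRAM_NAME
  # Assumed command names for the prerequisites listed above.
  missing = missing_tools(%w[java ant jruby])
  if missing.empty?
    puts "All prerequisite commands found on PATH."
  else
    puts "Missing from PATH: #{missing.join(', ')}"
  end
end
```

This only checks that the commands exist; it does not verify versions.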
With the prerequisites in place, install the development dependencies from inside a checkout of the cascading.jruby repository, which will also allow you to run the samples:
jruby -S bundle install
Running the Samples
The cascading.jruby repository comes with a fairly extensive set of example jobs that do not ship with the gem.
You can run them all with the command below. The first run downloads the Cascading and Hadoop jars into build/lib, which only needs to happen once:
jruby -S bundle exec rake samples
Or run one individually, after the command above has been run at least once to fetch the dependencies:
./samples/group_by.rb
Given all this setup, you can finally paste the word count example from the README into a file named "wordcount.rb" and run it like this:
jruby -J-cp "build/lib/*" wordcount.rb README.md && less output/wordcount
The salient point is that once all the dependencies you need are on the CLASSPATH, cascading.jruby scripts behave like any other JRuby script.
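To make the CLASSPATH point concrete, here is a small sketch that builds an explicit classpath string from the jars in a directory, as an alternative to the "build/lib/*" wildcard. The helper name classpath_from is hypothetical, and the sketch assumes the jars sit directly in build/lib, as described above.

```ruby
# Hypothetical helper (not part of cascading.jruby): join every .jar file
# directly inside `lib_dir` into a single classpath string, in sorted order.
def classpath_from(lib_dir)
  jars = Dir.glob(File.join(lib_dir, "*.jar")).sort
  jars.join(File::PATH_SEPARATOR)
end

if __FILE__ == $PROGRAM_NAME
  cp = classpath_from("build/lib")
  if cp.empty?
    # Nothing downloaded yet; the rake samples task fetches the jars.
    warn "No jars found in build/lib; run the samples task once first."
  else
    puts cp
  end
end
```

The printed string should be usable as the argument to -J-cp in place of the wildcard, and an empty result is a quick sign that the jars have not been downloaded yet.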