Skip to content

Demonstration of Google Cloud Dataproc for running Spark jobs with Java

License

Notifications You must be signed in to change notification settings

garystafford/dataproc-java-demo

Repository files navigation

Google Cloud Dataproc Java/Spark Demo

Code repository for post, Big Data Analytics with Java and Python, using Cloud Dataproc, Google’s Fully-Managed Spark and Hadoop Service.

Run with Arguments

To run InternationalLoansAppDataproc.java use the following arguments, locally:

"data"
"ibrd-statement-of-loans-latest-available-snapshot.csv"
"ibrd-small-spark"

.master("yarn") must be changes to .master("local[*]")

To run InternationalLoansAppDataproc.java on Dataproc:

"gs://dataproc-demo-bucket"
"ibrd-statement-of-loans-historical-data.csv"
"ibrd-large-spark"

About

Demonstration of Google Cloud Dataproc for running Spark jobs with Java

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages