From the course: Scala Essential Training for Data Science

Summary of Scala and Spark DataFrames

- [Instructor] DataFrames are a really useful data structure for data scientists working with Spark and Scala. DataFrames are table-like data structures, and in Spark it's very easy to load data into them from comma-separated value (CSV) files or JSON files; several other formats are supported as well. One especially useful feature of DataFrames is that we can use SQL statements to filter and aggregate the data. We can also join DataFrames to create new DataFrames from the data we already have.
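To make the three ideas in this summary concrete (loading CSV and JSON, querying with SQL, and joining), here is a minimal sketch. It is not the course's own exercise code: the object name `DataFrameSummary`, the helper `writeTemp`, the sample employee and department rows, and the column names are all made up for illustration. It assumes Spark SQL is on the classpath and runs Spark in local mode, writing its sample data to temporary files so that it does not depend on any existing dataset.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.Files
import org.apache.spark.sql.{DataFrame, SparkSession}

object DataFrameSummary {

  // Hypothetical helper: writes sample text to a temporary file and returns its path.
  private def writeTemp(suffix: String, content: String): String = {
    val path = Files.createTempFile("df_summary_", suffix)
    Files.write(path, content.getBytes(StandardCharsets.UTF_8))
    path.toString
  }

  def highEarnersByDept(spark: SparkSession): DataFrame = {
    // Made-up sample data, standing in for real CSV and JSON files.
    val csvPath = writeTemp(".csv",
      "name,dept_id,salary\nAna,1,72000\nBen,1,48000\nCho,2,91000\nDev,2,65000\n")
    val jsonPath = writeTemp(".json",
      """{"dept_id": 1, "dept_name": "Analytics"}
        |{"dept_id": 2, "dept_name": "Engineering"}
        |""".stripMargin)

    // Load a CSV file, using the first line as column names and inferring types.
    val employees = spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv(csvPath)

    // Load a JSON file (Spark expects one JSON object per line by default).
    val departments = spark.read.json(jsonPath)

    // Register the DataFrame as a temporary view so it can be queried with SQL.
    employees.createOrReplaceTempView("employees")

    // Filter and aggregate with a SQL statement.
    val summary = spark.sql(
      """SELECT dept_id, COUNT(*) AS headcount, AVG(salary) AS avg_salary
        |FROM employees
        |WHERE salary > 50000
        |GROUP BY dept_id""".stripMargin)

    // Join the aggregated result with the departments DataFrame on dept_id.
    summary.join(departments, Seq("dept_id"))
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("DataFrameSummary")
      .master("local[*]")
      .getOrCreate()
    try highEarnersByDept(spark).show()
    finally spark.stop()
  }
}
```

With real data you would pass your own file paths to `spark.read.csv` and `spark.read.json` instead of the temporary files. The join here uses the shared column name `dept_id`, which keeps a single copy of that column in the result.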
