From the course: Scala Essential Training for Data Science
Summary of Scala and Spark DataFrames - Scala Tutorial
Summary of Scala and Spark DataFrames
- [Instructor] DataFrames are a really useful data structure for data scientists working with Spark and Scala. DataFrames are table-like data structures, and in Spark it's very easy to load data from either comma-separated value (CSV) files or JSON files. Several other formats are supported as well. One of the especially useful features of DataFrames is that we can use SQL statements to filter and aggregate the data. We can also join DataFrames to create new DataFrames based on data that we already have in existing DataFrames.
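As a recap, the three ideas above (loading CSV and JSON, querying with SQL, and joining DataFrames) can be sketched in one small program. This is a minimal illustration, not the course's exercise code: the employee and department data, the column names, the view name `employees`, and the object name `DataFrameSummarySketch` are all made up for the example, and it assumes Spark SQL is on the classpath and run in local mode.

```scala
import java.nio.charset.StandardCharsets
import java.nio.file.Files
import org.apache.spark.sql.{DataFrame, SparkSession}

object DataFrameSummarySketch {

  // Write hypothetical sample data to a temp file so the sketch
  // does not depend on any external data set.
  private def writeTemp(suffix: String, content: String): String = {
    val path = Files.createTempFile("sketch", suffix)
    Files.write(path, content.getBytes(StandardCharsets.UTF_8))
    path.toString
  }

  def build(spark: SparkSession): (DataFrame, DataFrame) = {
    val csvPath = writeTemp(".csv",
      """id,name,dept_id,salary
        |1,Ana,10,5000
        |2,Raj,20,6000
        |3,Li,10,7000
        |""".stripMargin)

    // Spark's JSON reader expects one JSON object per line by default.
    val jsonPath = writeTemp(".json",
      """{"dept_id":10,"dept_name":"Engineering"}
        |{"dept_id":20,"dept_name":"Sales"}
        |""".stripMargin)

    // Load a CSV file, using the first row as column names and
    // letting Spark infer the column types.
    val employees = spark.read
      .option("header", "true")
      .option("inferSchema", "true")
      .csv(csvPath)

    // Load a JSON file; the schema is inferred from the records.
    val departments = spark.read.json(jsonPath)

    // Register the DataFrame as a temporary view so it can be
    // filtered and aggregated with a SQL statement.
    employees.createOrReplaceTempView("employees")
    val avgByDept = spark.sql(
      """SELECT dept_id, AVG(salary) AS avg_salary
        |FROM employees
        |WHERE salary > 4000
        |GROUP BY dept_id""".stripMargin)

    // Join two DataFrames on a shared column to create a new one.
    val joined = employees.join(departments, "dept_id")

    (avgByDept, joined)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("DataFrameSummarySketch")
      .master("local[*]") // local mode, assumed for this sketch
      .getOrCreate()

    val (avgByDept, joined) = build(spark)
    avgByDept.show()
    joined.show()

    spark.stop()
  }
}
```

Registering a temporary view is what makes the DataFrame visible to `spark.sql`; the same filter and aggregation could also be written with DataFrame methods such as `filter` and `groupBy`, so which style to use is largely a matter of preference.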