PySpark Cookbook
上QQ阅读APP看书,第一时间看更新

Creating DataFrames

A Spark DataFrame is an immutable collection of data distributed within a cluster. The data inside a DataFrame is organized into named columns that can be compared to tables in a relational database.

In this recipe, we will learn how to create Spark DataFrames.