PySpark Cookbook

Abstracting Data with DataFrames

In this chapter, you will learn about the following recipes:

  • Creating DataFrames
  • Accessing underlying RDDs
  • Performance optimizations
  • Inferring the schema using reflection
  • Specifying the schema programmatically
  • Creating a temporary table
  • Using SQL to interact with DataFrames
  • Overview of DataFrame transformations
  • Overview of DataFrame actions