A DataFrame is a distributed collection of data, which is organized into named columns. Conceptually, it is equivalent to relational tables with good optimization techniques.
id | name | age |
---|---|---|
1201 | phil | 25 |
1202 | barbara | 28 |
1203 | jon | 39 |
1204 | dirk | 23 |