Introduction to Polars
Polars is a fast, efficient dataframe library designed to work with large datasets with a lower memory footprint compared to pandas. It’s ideal for scenarios where performance and scalability are crucial. However, it is a newer library, so documentation may be sparse.PolarsReader and PLLazyFrameReader Classes
Gluestick now supports two additional reader classes specifically designed for Polars:PolarsReader: Reads sync output into Polars DataframesPLLazyFrameReader: Reads sync output into Polars Lazyframes
Reader class.
For more information, see the Polars docs.
PLLazyFrameReader
A Lazyframe is an abstraction of a dataframe that streams your data from your sync output, applies relevant transformations, and then writes to your export format without ever loading the entire dataset into memory.PolarsReader
While Lazyframes are more efficient, certain operations can be trickier to implement, For small-data operations, you may be better of using the standard Polars Dataframe withPolarsReader: