datareactor.reactor module¶
Classes
|
Transform datasets by generating derived columns. |
-
class
datareactor.reactor.DataReactor(atoms=None, sieve=None)[source]¶ Bases:
objectTransform datasets by generating derived columns.
The DataReactor class provides methods for transforming relational datasets by creating derived columns with known lineage.
Methods
transform(source, destination)Read, transform, and write the dataset.
-
atoms¶ A list of Atom objects to apply to generate columns.
-
sieve¶ The sieve to use to filter columns.
-
transform(source, destination)[source]¶ Read, transform, and write the dataset.
This function reads the dataset from the source location, generates derived columns using the atoms, filters the derived columns using a sieve, and writes the modified dataset to the destination location.
- Parameters
source (str) – The dataset path to read from.
destination (str) – The dataset path to write to.
-