datareactor.reactor module¶
Classes
|
Transform datasets by generating derived columns. |
-
class
datareactor.reactor.
DataReactor
(atoms=None, sieve=None)[source]¶ Bases:
object
Transform datasets by generating derived columns.
The DataReactor class provides methods for transforming relational datasets by creating derived columns with known lineage.
Methods
transform
(source, destination)Read, transform, and write the dataset.
-
atoms
¶ A list of Atom objects to apply to generate columns.
-
sieve
¶ The sieve to use to filter columns.
-
transform
(source, destination)[source]¶ Read, transform, and write the dataset.
This function reads the dataset from the source location, generates derived columns using the atoms, filters the derived columns using a sieve, and writes the modified dataset to the destination location.
- Parameters
source (str) – The dataset path to read from.
destination (str) – The dataset path to write to.
-