datareactor.reactor module

Classes

DataReactor([atoms, sieve])

Transform datasets by generating derived columns.

class datareactor.reactor.DataReactor(atoms=None, sieve=None)[source]

Bases: object

Transform datasets by generating derived columns.

The DataReactor class provides methods for transforming relational datasets by creating derived columns with known lineage.

Methods

transform(source, destination)

Read, transform, and write the dataset.

atoms

A list of Atom objects to apply to generate columns.

sieve

The sieve to use to filter columns.

transform(source, destination)[source]

Read, transform, and write the dataset.

This function reads the dataset from the source location, generates derived columns using the atoms, filters the derived columns using a sieve, and writes the modified dataset to the destination location.

Parameters
  • source (str) – The dataset path to read from.

  • destination (str) – The dataset path to write to.