gfw.common.beam.pipeline.LinearDag#
- class LinearDag(sources=(), core=None, side_inputs=None, sinks=())[source]#
A linear DAG implementation for Apache Beam pipelines.
- This DAG:
Applies multiple sources PTransforms and merges outputs into a single PCollection.
Applies a single core PTransform, with an optional side inputs PTransform.
Applies multiple sinks PTransforms.
- Parameters:
sources (Tuple[PTransform[Any, Any], ...]) – A list of PTransforms that read input data.
core (PTransform[Any, Any] | None) – The core PTransform that processes the data.
side_inputs (PTransform[Any, Any] | None) – A PTransform used to read side inputs that will be injected into the core transform.
sinks (Tuple[PTransform[Any, Any], ...]) – A list of PTransforms that write the output data.
- output_paths#
A list of output paths for each sink, if they contain the path attribute.
Methods
Applies the linear DAG implementation to an Apache Beam pipeline.
Attributes
Resolves and returns a list of output paths for each sink in the pipeline.