AAG-project’s Documentation

Usage:

Add the initial data to a “data” folder inside the pipeline folder. This is the initial data provided by AAG that has to be cleaned. Using the provided conda environment, run the pipeline to create a final parquet file that is used for the next part, clustering.

Indices and tables