AAG-project’s Documentation
Usage:
Place the raw data provided by AAG, which still has to be cleaned, in a “data” folder inside the pipeline folder. Then, using the provided conda environment, run the pipeline. It produces a final Parquet file that serves as the input to the next part, clustering.
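Before running the pipeline, it can help to confirm that the input data is where the pipeline expects it. The following is a minimal sketch of such a check, not part of the project itself: the function name `check_input_data` is hypothetical, and it assumes only that the input files sit directly inside a folder named "data" under the pipeline folder.

```python
from pathlib import Path


def check_input_data(pipeline_dir):
    """Return the sorted list of files found in <pipeline_dir>/data.

    Hypothetical helper: raises FileNotFoundError if the "data" folder
    is missing or contains no files.
    """
    data_dir = Path(pipeline_dir) / "data"
    if not data_dir.is_dir():
        raise FileNotFoundError(f"Expected a data folder at {data_dir}")
    # Only regular files directly inside the folder are counted (assumption).
    files = sorted(p for p in data_dir.iterdir() if p.is_file())
    if not files:
        raise FileNotFoundError(f"No input files found in {data_dir}")
    return files
```

Calling `check_input_data` with the path to the pipeline folder either returns the files that were found or fails early with a message naming the missing location, which is cheaper than discovering the problem partway through a run.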