Step-1 : Search for DeltaMax on Google Cloud Marketplace
Step-1 : Search for DeltaMax on Google Cloud Marketplace
1) You will need to create a project on Google Cloud
2) The project should be associated with your corporate billing account.
Step-2 : Deploy the DeltaMax Virtual Machine in your Organization
Getting the DeltaMax VM provisioned and configured within your Organization Settings. Check the settings for
VPC
SubNet
Zones
Firewalls
Step-3 : Ensure that VM is updated, if required install python3-full latest version, venv
Step-4 : Start running the codes - DeltaMax_synthetic_data_generator.py
Step-5 : Install jupyter nbcovert to convert python note books to python text files for running on CLI environment
Step-6 : Install pandas for running python moudles
Step-7 : Run M1_data loading and Initial Checks.txt
Step-8 : Run M2_Column Type Analysis and Filetering.txt
Step-9 : Run M3_Anomaly Detection (IQR and Isolation Forest).txt
Step-10 : Run M4_Variance Analysis.txt
Step-11 : Run M5_Inter-File Comparison(T-Tests)
Step-12 : Run M6_Inter-File Comparison(PSI)
Step-13: Run M7_Decimal Formatting Mismatches
Step-14: Optional Run M8_Bit field changes (Skipping this step)
Step-15: Run M9_String Length Mismatches.txt
Step-16: Run M10_Data Type Mismatch Detection.txt
Step-17: Run M11_Data Preprocessing and Imputation.txt
Step-18: Run M12_Business Unique Analysis.txt
Step-19: Run M13_Create A or B Merged file with col_A and col_B.txt
Step-20: Create a gcloud storage and load the csv files
Step-21: Load the csv files to Big Query Tables
Step-22: Run the SQL query to create A_or_B_with_Anomalies table
Step-23: Load the Big Query Tables to Looker and create reports and Visualizations as needed
Step-24: Reach out to Katalyst Street Professional Services for Custom Visualizations and Reports