Python Data Anonymisation Pipeline

Build and run a step-by-step anonymisation pipeline using Python techniques. Upload or generate sample data, apply techniques, and download the anonymised result.

Anonymisation Pipeline

Follow the pipeline from raw CSV input through anonymisation techniques to secure output. Click "Run Pipeline" to execute each step and see the transformation.

Raw Dataset
CSV Input
Pseudo-IDs
uuid.uuid4()
Text Shifting
ord(char) + 2
Data Masking
***masked***
Generalisation
age → range
Drop Columns
.drop()
Export
.to_csv()

Step 1: Raw Dataset (CSV Input)

Start with a sample dataset or paste your own CSV. This simulates loading a raw dataset with personally identifiable information (PII).

RAW Original Dataset
Python import pandas as pd

# Load raw dataset
df = pd.read_csv('raw_data.csv')
print(df.head())

Step 2: Select Anonymisation Techniques

Choose which techniques to apply to the dataset. Each technique targets different types of PII with different Python implementations.