Skip to content

Naming Conventions

To maintain clarity across the multiple transformation steps (Splitting → Spy Injection → Augmentation), PAYN uses specific prefixes for all DataFrame artifacts.

Prefix Meaning Scientific Context
true_ Ground Truth The original, unmodified data. Includes known positives and negatives (if available). Used for validation only.
Example: true_train_data, true_test_data
spy_inf_ Spy Infused Training data where a fraction of positives have been masked and injected into the unlabeled pool. Used to train the Spy Model.
Example: spy_inf_train_data
augm_ Augmented Data that has been processed by the Spy Model. Contains predicted probabilities and the "Reliable Negative" classification labels.
Example: augm_validation_data