Skip to content

Latest commit

 

History

History
42 lines (21 loc) · 1.82 KB

File metadata and controls

42 lines (21 loc) · 1.82 KB

Note that all mathematical formulas are written for only a layer in a given neural network instead of the whole neural network.

Original Rule-based Method: Determine whether the provenance of input is learned
Potential Method 1: Compute SP (Sample Probability) for each neuron & Determine whether l1-distance between SP(LP) and provenance of input is close enough
Potential Method 2: Compute SP (Sample Probability) for each neuron & Determine whether l1-distance between SP(LP) and provenance of input is close enough, where we filter out neurons that is relatively close (< beta).
Potential Method 3: Compute the probability that each neuron to be benign & Multiple all probabilities of neurons to determine whether a given input is benign/adversarial by a probability threshold value.
Potential Method 4: Compute the probability that each neuron to be benign & Multiple all probabilities of neurons to determine whether a given input is benign/adversarial by a probability threshold value, where we filter out neurons with probabilities relatively ambigious (e.g., 0.3 - 0.7)