Security and Privacy in Data Science (CS 763)
September 04, 2019
Remember: definitions are tricky things!
Hope: find a few things that interest you
You must attend lectures and participate
Signup for slots and projects here
A triumph for machine learning contests!
A new approach to formulating privacy goals: the risk to one’s privacy, or in general, any type of risk… should not substantially increase as a result of participating in a statistical database. This is captured by differential privacy.
A query Q is (\varepsilon, \delta)-differentially private if for every two databases db, db' that differ in one individual’s record, and for every subset S of outputs, we have:
\Pr[ Q(db) \in S ] \leq e^\varepsilon \cdot \Pr[ Q(db') \in S ] + \delta
Output of program doesn’t depend too much on any single person’s data