Security and Privacy in Data Science (CS 763)
September 02, 2020
If you wouldn’t do it in a real classroom, you probably shouldn’t do it virtually.
Let me know ASAP if you are remote so I can set you up with paper reviews
Remember: definitions are tricky things!
Hope: find a few things that interest you
You should attend/watch all lectures
Sign up for slots and projects here
A triumph for machine learning contests!
A new approach to formulating privacy goals: the risk to one’s privacy, or in general, any type of risk… should not substantially increase as a result of participating in a statistical database. This is captured by differential privacy.
A query Q is (\varepsilon, \delta)-differentially private if for every two databases db, db' that differ in one individual’s record, and for every subset S of outputs, we have:
\Pr[ Q(db) \in S ] \leq e^\varepsilon \cdot \Pr[ Q(db') \in S ] + \delta
Output of program doesn’t depend too much on any single person’s data