(a) Is the dataset 2-anonymous using the quasi-identifier (age, gender)? If your answer is "no", which rows are (uniquely) quasi-identified?
(b) Is the dataset 2-anonymous using the quasi-identifier (age, occupation)? If your answer is "no", which rows are (uniquely) quasi-identified?
(c) What is the maximum value of f for which the dataset could be l-diverse with respect to the sensitive attribute illness?
id | age | gender | occupation | illness |
1 | 25 | F | 1 | no |
2 | 25 | F | 1 | yes |
3 | 25 | F | 4 | yes |
4 | 25 | M | 2 | yes |
5 | 25 | M | 2 | no |
6 | 49 | F | 3 | yes |
7 | 49 | F | 3 | yes |
8 | 49 | F | 4 | no |
9 | 49 | M | 5 | no |
10 | 49 | M | 5 | no |