PersAD: adversarial protection against text-based personality inference
As text-based personality inference approaches increasingly mature, personality privacy invasion has become a new threat applied to human-central malicious activities in recent years. When facing text-based personality inference attacks, the adversarial example method is a protection pathway that mo...
Saved in:
Published in | Data mining and knowledge discovery Vol. 39; no. 5; p. 61 |
---|---|
Main Authors | , , , , , |
Format | Journal Article |
Language | English |
Published |
New York
Springer US
01.09.2025
Springer Nature B.V |
Subjects | |
Online Access | Get full text |
ISSN | 1384-5810 1573-756X |
DOI | 10.1007/s10618-025-01144-0 |
Cover
Loading…
Summary: | As text-based personality inference approaches increasingly mature, personality privacy invasion has become a new threat applied to human-central malicious activities in recent years. When facing text-based personality inference attacks, the adversarial example method is a protection pathway that modifies the original texts to convert the personality characteristics. However, most studies focus on the local word or phrase perturbations instead of the global document semantics, making it challenging to ensure content semantic consistency while effectively reversing personality signals. Simultaneously, most methods are difficult to defend against various personality inference models without psychological knowledge. To address these challenges, we propose an adversarial protection method,
PersAD
for short. In our method, we extract the knowledge from personality psychology books to construct knowledge-enhanced prompts. Then, the forward-backward adversarial generation is implemented to convert original posts to protected posts with high semantic similarity and low personality relevancy via the LLM. Moreover, the protect success rate (PSR) and cumulative flipping rate (CFR) are proposed to validate the PersAD and compare the performance with other adversarial example methods. The extensive experiments demonstrate the protection performance of PersAD over various text-based personality inference models. Under the protection of PersAD, the average PSR improves in a range of 41.75% to 45.91%, exceeding 3 times the performance of traditional methods. For the CFR of two or more personality dimensions, PersAD achieves results between 84.26 and 87.50%, which is over 40% higher than other baseline methods. Our data & code are available at the Github repository (
https://github.com/Fr4n13Z/PersAD
). |
---|---|
Bibliography: | ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 |
ISSN: | 1384-5810 1573-756X |
DOI: | 10.1007/s10618-025-01144-0 |