PersAD: adversarial protection against text-based personality inference

As text-based personality inference approaches increasingly mature, personality privacy invasion has become a new threat applied to human-central malicious activities in recent years. When facing text-based personality inference attacks, the adversarial example method is a protection pathway that mo...

Full description

Saved in:
Bibliographic Details
Published inData mining and knowledge discovery Vol. 39; no. 5; p. 61
Main Authors Qiu, Houjie, Ma, Xingkong, Liu, Bo, Cai, Yiqing, Peng, Baoyun, Huang, Dan
Format Journal Article
LanguageEnglish
Published New York Springer US 01.09.2025
Springer Nature B.V
Subjects
Online AccessGet full text
ISSN1384-5810
1573-756X
DOI10.1007/s10618-025-01144-0

Cover

Loading…
More Information
Summary:As text-based personality inference approaches increasingly mature, personality privacy invasion has become a new threat applied to human-central malicious activities in recent years. When facing text-based personality inference attacks, the adversarial example method is a protection pathway that modifies the original texts to convert the personality characteristics. However, most studies focus on the local word or phrase perturbations instead of the global document semantics, making it challenging to ensure content semantic consistency while effectively reversing personality signals. Simultaneously, most methods are difficult to defend against various personality inference models without psychological knowledge. To address these challenges, we propose an adversarial protection method, PersAD for short. In our method, we extract the knowledge from personality psychology books to construct knowledge-enhanced prompts. Then, the forward-backward adversarial generation is implemented to convert original posts to protected posts with high semantic similarity and low personality relevancy via the LLM. Moreover, the protect success rate (PSR) and cumulative flipping rate (CFR) are proposed to validate the PersAD and compare the performance with other adversarial example methods. The extensive experiments demonstrate the protection performance of PersAD over various text-based personality inference models. Under the protection of PersAD, the average PSR improves in a range of 41.75% to 45.91%, exceeding 3 times the performance of traditional methods. For the CFR of two or more personality dimensions, PersAD achieves results between 84.26 and 87.50%, which is over 40% higher than other baseline methods. Our data & code are available at the Github repository ( https://github.com/Fr4n13Z/PersAD ).
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
ISSN:1384-5810
1573-756X
DOI:10.1007/s10618-025-01144-0