Robust Truth Inference in Crowdsourcing under Adversarial Attacks via Graph Neural Networks

Main Article Content

Arif Dağ
Simay Sahin
Mehmet Karaköse

Abstract

Crowdsourcing is widely used to obtain labeled data for machine learning, but the openness of these platforms makes label quality vulnerable to noise and adversarial manipulation. Coordinated behaviors such as collusion and Sybil-style participation can create a consistent yet incorrect consensus, causing classical truth inference methods to amplify attacker-induced agreement and degrade downstream model performance. This paper proposes a graph neural network (GNN) framework for robust truth inference under adversarial attacks. We represent the worker–task ecosystem as a bipartite interaction graph in which edges correspond to submitted labels and node embeddings are learned via message passing, allowing the model to capture higherorder agreement and co-labeling structure. To avoid being dominated by correlated malicious behavior, the method incorporates task-content features for image and text tasks as an external semantic anchor that stabilizes learning when observed labels are heavily corrupted. The framework is trained with a joint objective that (i) predicts task labels and (ii) estimates worker trustworthiness, enabling simultaneous aggregation and risk-aware worker profiling. We evaluate the method in a controlled simulation setting on multiple datasets and attack strategies, including random labeling, single-class attacks, label flipping, and coordinated collusion. Experiments analyze label-aggregation accuracy, robustness under increasing adversarial participation, and malicious-worker detection under varying attack rates and task difficulty. Results indicate that combining relational signals with content anchoring improves robustness in adversarial label aggregation, especially under correlated attacks that target hard tasks, while producing conservative trust scores that support worker-risk scoring and downstream quality control. Overall, the proposed approach offers a unified graph-based solution for robust aggregation and adversary-aware monitoring in crowdsourced labeling workflows.

Article Details

Section

Regular Paper

How to Cite

Robust Truth Inference in Crowdsourcing under Adversarial Attacks via Graph Neural Networks. (2026). International Journal of Management and Data Analytics (IJMADA), 6(1), 163-178. https://ijmada.com/index.php/ijmada/article/view/127

References

Y. Jin et al. “A technical survey on statistical modelling and design methods for crowdsourcing quality control”. In: Artificial Intelligence 287 (2020), p. 103351.

Y. Zheng et al. “Truth inference in crowdsourcing: Is the problem solved?” In: Proceedings of the VLDB Endowment 10.5 (2017), pp. 541–552.

U. Gadiraju et al. “Understanding malicious behavior in crowdsourcing platforms: The case of online surveys”. In: Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems (CHI). 2015, pp. 1631–1640.

A. Checco, J. Bates, and G. Demartini. “Adversarial attacks on crowdsourcing quality control”. In: Journal of Artificial Intelligence Research 67 (2020), pp. 375–408.

P. P. Chen, H. L. Sun, and Z. Chen. “Data poisoning attacks on crowdsourcing learning”. In: APWeb/WAIM Joint International Conference on Web and Big Data. Springer, 2021, pp. 164–179.

F. Tahmasebian et al. “Crowdsourcing under data poisoning attacks: A comparative study”. In: IFIP Annual Conference on Data and Applications Security and Privacy. Springer, 2020, pp. 310–332.

C. Miao et al. “Attack under disguise: An intelligent data poisoning attack mechanism in crowdsourcing”. In: Proceedings of the 2018 World Wide Web Conference (WWW). 2018, pp. 13–22.

P. P. Chen et al. “Collusion-proof result inference in crowdsourcing”. In: Journal of Computer Science and Technology 33 (2018), pp. 351–365.

Y. Wang, K. Wang, and C. Miao. “Truth discovery against strategic sybil attack in crowdsourcing”. In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD). 2020, pp. 95–104.

D. Tao et al. “Domain-weighted majority voting for crowdsourcing”. In: IEEE Transactions on Neural Networks and Learning Systems 30.1 (2019), pp. 163–174. doi: 10.1109/TNNLS.2018.2836969.

M. Karaköse, H. Yetiş, and M. Çeçen. “A New Approach for Effective Medical Deepfake Detection in Medical Images”. In: IEEE Access 12 (2024), pp. 52205–52214. doi: 10. 1109/ACCESS.2024.3386644.

T. N. Kipf and M. Welling. “Semi-supervised classification with graph convolutional networks”. In: International Conference on Learning Representations (ICLR). arXiv:1609.02907. 2017.

J. Gilmer et al. “Neural message passing for quantum chemistry”. In: Proceedings of the 34th International Conference on Machine Learning (ICML). 2017, pp. 1263–1272.

Z. Wu et al. “A comprehensive survey on graph neural networks”. In: IEEE Transactions on Neural Networks and Learning Systems 32.1 (2021), pp. 4–24.

J. Zhou et al. “Graph neural networks: A review of methods and applications”. In: AI Open 1 (2020), pp. 57–81.

K. He et al. “Deep residual learning for image recognition”. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 2016, pp. 770–778.

A. P. Dawid and A. M. Skene. “Maximum likelihood estimation of observer error-rates using the EM algorithm”. In: Journal of the Royal Statistical Society: Series C (Applied Statistics) 28.1 (1979), pp. 20–28.

J. Whitehill et al. “Whose vote should count more: Optimal integration of labels from labelers of unknown expertise”. In: Advances in Neural Information Processing Systems (NeurIPS). 2009.

D. Karger, S. Oh, and D. Shah. “Iterative learning for reliable crowdsourcing systems”. In: Advances in Neural Information Processing Systems (NeurIPS). 2011.

Qianqian Ma and Alex Olshevsky. “Adversarial Crowdsourcing Through Robust RankOne Matrix Completion”. In: Advances in Neural Information Processing Systems (NeurIPS). 2020. arXiv: 2010.12181 [cs.LG]. url: https://arxiv.org/abs/2010. 12181.

C. Song, K. Liu, and X. Zhang. “Collusion Detection and Ground Truth Inference in Crowdsourcing for Labeling Tasks”. In: Journal of Machine Learning Research 22.190 (2021), pp. 1–45.

A. R. Kurup and G. P. Sajeev. “Classifying workers for mitigating adversarial attacks in crowdsourcing”. In: IEEE Access 13 (2025), pp. 142713–142727. doi: 10.1109/ ACCESS.2025.3598463.

H. Kim et al. “Graph Anomaly Detection with Graph Neural Networks: Current Status and Challenges”. In: IEEE Access 10 (2022). also available as arXiv:2209.14930, pp. 111820–111829. doi: 10.1109/ACCESS.2022.3211306.

T. Luo et al. “Graph Neural Networks for Trust Evaluation: Criteria, State-of-the-Art, and Future Directions”. In: IEEE Network 39.4 (2025), pp. 37–46. doi: 10.1109/ MNET.2025.3551068.

S. Heeb, A. Plesner, and R. Wattenhofer. “Sybil detection using graph neural networks”. In: arXiv preprint arXiv:2409.08631 (2024). arXiv: 2409.08631.

Hanlu Wu et al. “Exploiting Heterogeneous Graph Neural Networks with Latent Worker/Task Correlation Information for Label Aggregation in Crowdsourcing”. In: ACM Transactions on Knowledge Discovery from Data 16.2 (2022), 27:1–27:18. doi: 10.1145/3460865.

Zijian Ying et al. “A Little Truth Injection But a Big Reward: Label Aggregation With Graph Neural Networks”. In: IEEE Transactions on Pattern Analysis and Machine Intelligence 46.5 (2024), pp. 3169–3182. doi: 10.1109/TPAMI.2023.3338216.

Dmitry Ustalov, Nikita Pavlichenko, and Boris Tseitlin. “Learning from Crowds with Crowd-Kit”. In: Journal of Open Source Software 9.96 (2024), p. 6227. doi: 10.21105/ joss.06227.

Z. Tan and M. Karaköse. “On-Policy Deep Reinforcement Learning Approach to Multi Agent Problems”. In: Interdisciplinary Research in Technology and Management. CRC Press, 2021, pp. 369–376. doi: 10.1201/9781003202240-58.

N. F. Bar and M. Karaköse. “Collaborative approach for swarm robot systems based on distributed DRL”. In: Engineering Science and Technology, an International Journal 53 (2024), p. 101701. doi: 10.1016/j.jestch.2024.101701.

A. Krizhevsky. Learning Multiple Layers of Features from Tiny Images. Tech. rep. Computer Science Department, University of Toronto, 2009.

A. L. Maas et al. “Learning Word Vectors for Sentiment Analysis”. In: Proceedings of the ACL-HLT. 2011, pp. 142–150.

K. Lang. “NewsWeeder: Learning to Filter Netnews”. In: Proceedings of the International Conference on Machine Learning (ICML). 1995, pp. 331–339.

Similar Articles

You may also start an advanced similarity search for this article.

Most read articles by the same author(s)