Evaluating the Performance of Vision Transformers and Convolutional Neural Networks for Hostile Image Detection

Authors

  • Zakir Hossain College of Engineering and Computer Science, California State University
  • Md Emran Hossain College of Technology and Engineering, Westcliff University
  • Nisher Ahmed College of Technology and Engineering, Westcliff University
  • Md Farhad Kabir Marshall School of Business, University of Southern California
  • Iffat Sania Hossain Martin V. Smith School of Business and Economics, California State University

DOI:

https://doi.org/10.55927/ijar.v4i1.13681

Keywords:

Malicious Images, ViTs (Vision Transformers), CNNs (Convolutional Neural Networks), Adversarial Perturbations, Image Classification

Abstract

Detecting malicious or adversarial images, for example in security and surveillance systems, is an important problem in computer vision. These results highlight the effectiveness of ViTs when compared to CNNs when confronting hostile images. However, CNNs have stiff competition from ViTs and have been the go-to architecture for image classification and object detection for many years, due to the existence of spatial hierarchies in images. Using benchmark datasets containing a combination of adversarial and clean images, this study compares the ability of both models to (i) detect hostile images, (ii) generalize to unseen dataset, and (iii) the overall computational efficiency of both models. While ViTs can be even more computationally expensive than incurred with task3 input, we demonstrate that, in fact, our architecture generalizes truncation -- both in power and action -- exceptionally well and can simply outperform performance-per-dollar in more robust pattern recognition tasks, especially under adversarial perturbations. In contrast, CNNs are faster to inference and less likely to overfit on small data. This finding informed decisions showing trade-offs between the two architectures, including a potential path for hybrid approaches and future enhancements in the adversarial defense against hostile image detection.

Downloads

Download data is not yet available.

References

Arthan, N., Kacheru, G., & Bajjuru, R. (2019). Radio Frequency in Autonomous Vehicles: Communication Standards and Safety Protocols. Revista de Inteligencia Artificial en Medicina, 10(1), 449478.

Chen, J., Lu, X., & Wang, Z. (2020). Deep learning for cardiovascular imaging: A review. Journal of Cardiovascular Magnetic Resonance, 22(1), 115. https://doi.org/10.1186/s12968020006207

Dalal, A. (2018). Cybersecurity And Artificial Intelligence: How AI Is Being Used in Cybersecurity To Improve Detection And Response To Cyber Threats. Turkish Journal of Computer and Mathematics Education (TURCOMAT), 9(3), 14161423.

Dalal, A., & Mahjabeen, F. (2011). Public Key Infrastructure for Enhanced Enterprise Security: Implementation Challenges in the US, Canada, and Japan. Revista de Inteligencia Artificial en Medicina, 2(1), 110.

Dalal, A., & Mahjabeen, F. (2011). Strengthening Cybersecurity Infrastructure in the US and Canada: A Comparative Study of Threat Detection Models. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 2(1), 19.

Dalal, A., & Mahjabeen, F. (2012). Cloud Storage Security: Balancing Privacy and Security in the US, Canada, EU, and Asia. Revista de Inteligencia Artificial en Medicina, 3(1), 1927.

Dalal, A., & Mahjabeen, F. (2012). Cybersecurity Challenges and Solutions in SAP ERP Systems: Enhancing Application Security, GRC, and Audit Controls. Revista de Inteligencia Artificial en Medicina, 3(1), 118.

Dalal, A., & Mahjabeen, F. (2012). Managing Bring Your Own Device (BYOD) Security: A Comparative Study in the US, Australia, and Asia. Revista de Inteligencia Artificial en Medicina, 3(1), 1930.

Dalal, A., & Mahjabeen, F. (2013). Securing Critical Infrastructure: Cybersecurity for Industrial Control Systems in the US, Canada, and the EU. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 4(1), 1828.

Dalal, A., & Mahjabeen, F. (2013). Strengthening SAP and ERP Security for US and European Enterprises: Addressing Emerging Threats in Critical Systems. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 4(1), 117.

Dalal, A., & Mahjabeen, F. (2014). Enhancing SAP Security in Cloud Environments: Challenges and Solutions. Revista de Inteligencia Artificial en Medicina, 5(1), 119.

Dalal, A., & Mahjabeen, F. (2015). The Rise of Ransomware: Mitigating Cyber Threats in the US, Canada, Europe, and Australia. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 6(1), 2131.

Dalal, A., & Mahjabeen, F. (2015). Securing CloudBased Applications: Addressing the New Wave of Cyber Threats.

Dalal, A., & Roy, R. (2021). CYBERSECURITY AND PRIVACY: BALANCING SECURITY AND INDIVIDUAL RIGHTS IN THE DIGITAL AGE. JOURNAL OF BASIC SCIENCE AND ENGINEERING, 18(1).

Dalal, A., Abdul, S., & Mahjabeen, F. (2016). Ensuring ERP Security in Edge Computing Deployments: Challenges and Innovations for SAP Systems. Revista de Inteligencia Artificial en Medicina, 7(1), 117.

Dalal, A., Abdul, S., & Mahjabeen, F. (2016). Leveraging Artificial Intelligence for Cyber Threat Intelligence: Perspectives from the US, Canada, and Japan. Revista de Inteligencia Artificial en Medicina, 7(1), 1828.

Dalal, A., Abdul, S., & Mahjabeen, F. (2018). Blockchain Applications for Data Integrity and Privacy: A Comparative Analysis in the US, EU, and Asia. International Journal of Advanced Engineering Technologies and Innovations, 1(4), 2535.

Dalal, A., Abdul, S., & Mahjabeen, F. (2019). Defending Machine Learning Systems: Adversarial Attacks and Robust Defenses in the US and Asia. International Journal of Advanced Engineering Technologies and Innovations, 1(1), 102109.

Dalal, A., Abdul, S., & Mahjabeen, F. (2020). AI Powered Threat Hunting in SAP and ERP Environments: Proactive Approaches to Cyber Defense. International Journal of Advanced Engineering Technologies and Innovations, 1(2), 95112.

Dalal, A., Abdul, S., & Mahjabeen, F. (2021). Quantum Safe Strategies for SAP and ERP Systems: Preparing for the Future of Data Protection. International Journal of Advanced Engineering Technologies and Innovations, 1(2), 127141.

Dalal, A., Abdul, S., Kothamali, P. R., & Mahjabeen, F. (2015). Cybersecurity Challenges for the Internet of Things: Securing IoT in the US, Canada, and EU. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 6(1), 5364.

Dalal, A., Abdul, S., Kothamali, P. R., & Mahjabeen, F. (2017). Integrating Blockchain with ERP Systems: Revolutionizing Data Security and Process Transparency in SAP. Revista de Inteligencia Artificial en Medicina, 8(1), 6677.

Dalal, A., Abdul, S., Mahjabeen, F., & Kothamali, P. R. (2018). Advanced Governance, Risk, and Compliance Strategies for SAP and ERP Systems in the US and Europe: Leveraging Automation and Analytics. International Journal of Advanced Engineering Technologies and Innovations, 1(2), 3043.

Dalal, A., Abdul, S., Mahjabeen, F., & Kothamali, P. R. (2019). Leveraging Artificial Intelligence and Machine Learning for Enhanced Application Security. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 10(1), 8299.

Datta, R., Halimuzzaman, M., & Honey, S. (2024). A Comparative Analysis of Safety Performance in Commercial and Residential Construction: Unraveling Critical Insights. Journal of Control & Instrumentation, 15(01), 110.

Datta, R., Pankaj Sarker, K., Shikdar, L., Halimuzzaman, M., & Rezaul Karim, M. (2024). Mobile Applications for Enhancing Safety Audits in Healthcare Construction Sites. Journal of Angiotherapy, 8(9), 16.

Habib, H. (2015). Awareness about special education in Hyderabad. International Journal of Science and Research (IJSR), 4(5), 12961300.

Habib, H., & Janae, J. (2024). Breaking Barriers: How AI is Transforming Special Education Classrooms. Bulletin of Engineering Science and Technology, 1(02), 86108.

Habib, H., Jelani, S. A. K., & Najla, S. (2022). Revolutionizing Inclusion: AI in Adaptive Learning for Students with Disabilities. Multidisciplinary Science Journal, 1(01), 111.

Habib, H., Jelani, S. A. K., & Rasheed, N. T. (2021). Tailored Education: AI in the Development of Individualized Education Programs (IEPs). Multidisciplinary Science Journal, 1(01), 818.

Habib, H., Jelani, S. A. K., Ali, S. S., & Kadari, J. (2023). From Assessment to Empowerment: The Role of AI in Special Education Progress Monitoring. Journal of Multidisciplinary Research, 9(01), 6798.

Habib, H., Jelani, S. A. K., Alizzi, M., & Numair, H. (2020). Personalized Learning Paths: AI Applications in Special Education. Journal of Multidisciplinary Research, 6(01).

Habib, H., Jelani, S. A. K., Numair, H., & Mubeen, S. (2019). Enhancing Communication Skills: AI Technologies for Students with Speech and Language Needs. Journal of Multidisciplinary Research, 5(01).

Halimuzzaman, M., & Sharma, J. (2022). Applications of accounting information system (AIS) under Enterprise resource planning (ERP): A comprehensive review. International Journal of Early Childhood Special Education (INTJECSE), 14(2), 68016806.

Halimuzzaman, M., & Sharma, J. (2024). The Role of Enterprise Resource Planning (ERP) in Improving the Accounting Information System for Organizations. In Revolutionizing the AIDigital Landscape (pp. 263274). Productivity Press.

Halimuzzaman, M., Khaiar, M. A., & Hoque, M. M. (2014). An analysis of progress of rural development scheme (RDS) by IBBL: A study on Kushtia Branch. Bangla Vision, 13(1), 169180.

Halimuzzaman, M., Sharma, D. J., Bhattacharjee, T., Mallik, B., Rahman, R., Rezaul Karim, M., ... & Fokhrul Islam, M. (2024). Blockchain technology for integrating electronic records of digital healthcare system. Journal of Angiotherapy, 8(7).

Halimuzzaman, M., Sharma, J., & Khang, A. (2024). Enterprise Resource Planning and Accounting Information Systems: Modeling the Relationship in Manufacturing. In Machine Vision and Industrial Robotics in Manufacturing (pp. 418434). CRC Press.

Halimuzzaman, M., Sharma, J., Hossain, M. I., Akand, F., Islam, M. N., Ikram, M. M., & Khan, N. N. Healthcare Service Quality Digitization with Enterprise Resource Planning.

Halimuzzaman, M., Sharma, J., Islam, D., Habib, F., & Ahmed, S. S. FINANCIAL IMPACT OF ENTERPRISE RESOURCE PLANNING (ERP) ON ACCOUNTING INFORMATION SYSTEMS (AIS): A STUDY ON PETROLEUM COMPANIES IN BANGLADESH.

Halimuzzaman, M., Sharma, J., Karim, M. R., Hossain, M. R., Azad, M. A. K., & Alam, M. M. (2024). Enhancement of Organizational Accounting Information Systems and Financial Control through Enterprise Resource Planning. In Synergy of AI and Fintech in the Digital Gig Economy (pp. 315331). CRC Press.

Hasan, A. S., Debu, S. S. S. D., Eti, I. J., Halimuzzaman, M., & Rezaul, M. Machine Learning Models for Predicting Risky Pregnancies in Early Clinical Interventions.

Hossain, M. A., & Rahman, T. Y. (2024). Human factors and employee resistance to adopting new cybersecurity protocols and technologies. Bulletin of Engineering Science and Technology, 1(03), 175-199.

Islam, M. F., Debnath, S., Das, H., Hasan, F., Sultana, S., Datta, R., ... & Halimuzzaman, M. (2024). Impact of Rapid Economic Development with Rising Carbon Emissions on Public Health and Healthcare Costs in Bangladesh. Journal of Angiotherapy, 8(7), 19.

Islam, M. F., Eity, S. B., Barua, P., & Halimuzzaman, M. (2023). Liabilities of Street Food Vendors for spreading out Chronic Diseases and Environment Pollution: A Study on Chattogram, Bangladesh. JETIR, 10 (11), Article 11.

Kacheru, G., Bajjuru, R., & Arthan, N. (2019). Security Considerations When Automating Software Development. Revista de Inteligencia Artificial en Medicina, 10(1), 598617.

Kacheru, G., Bajjuru, R., & Arthan, N. (2022). Surge of Cyber Scams during the COVID19 Pandemic: Analyzing the Shift in Tactics. BULLET: Jurnal Multidisiplin Ilmu, 1(02), 192202.

Leiner, T., Rueckert, D., Suinesiaputra, A., et al. (2019). Machine learning in cardiovascular magnetic resonance: Basic concepts and applications. Journal of Cardiovascular Magnetic Resonance, 21(1), 61. https://doi.org/10.1186/s129680190575y

Litjens, G., Kooi, T., Bejnordi, B. E., et al. (2017). A survey on deep learning in medical image analysis. Medical Image Analysis, 42, 6088. https://doi.org/10.1016/j.media.2017.07.005

Muhammad, S., Meerjat, F., Meerjat, A., & Dalal, A. (2024). Safeguarding Data Privacy: Enhancing Cybersecurity Measures for Protecting Personal Data in the United States. International Journal of Machine Learning Research in Cybersecurity and Artificial Intelligence, 15(1), 141176.

Muhammad, S., Meerjat, F., Meerjat, A., Dalal, A., & Abdul, S. (2023). Enhancing cybersecurity measures for blockchain: Securing transactions in decentralized systems. Unique Endeavor in Business & Social Sciences, 2(1), 120141.

Muhammad, S., Meerjat, F., Meerjat, A., Naz, S., & Dalal, A. (2023). Strengthening Mobile Platform Cybersecurity in the United States: Strategies and Innovations. Revista de Inteligencia Artificial en Medicina, 14(1), 84112.

Muhammad, S., Meerjat, F., Meerjat, A., Naz, S., & Dalal, A. (2024). Enhancing Cybersecurity Measures for Robust Fraud Detection and Prevention in US Online Banking. International Journal of Advanced Engineering Technologies and Innovations, 1(3), 510541.

Rana, M. M., Kalam, A., & Halimuzzaman, M. (2012). CO RPO RATE SO C IAL RESPO NSIBILITY (C SR) OF DUTC HBANG LA BANK LIMITED: A CASE STUDY.

RASEL, M., Bommu, R., Shovon, R. B., & Islam, M. A. (2022). BlockchainEnabled Secure Interoperability: Advancing Electronic Health Records (EHR) Data Exchange. International Journal of Advanced Engineering Technologies and Innovations, 1(2), 193211.

RASEL, M., Bommu, R., Shovon, R. B., & Islam, M. A. (2023). Ensuring Data Security in Interoperable EHR Systems: Exploring Blockchain Solutions for Healthcare Integration. International Journal of Advanced Engineering Technologies and Innovations, 1(01), 212232.

Rasel, M., Salam, M. A., & Mohammad, A. (2023). Safeguarding Media Integrity: Cybersecurity Strategies for Resilient Broadcast Systems and Combatting Fake News. Unique Endeavor in Business & Social Sciences, 2(1), 7293.

Rieke, N., Hancox, J., Li, W., et al. (2020). The future of digital health with federated learning. npj Digital Medicine, 3(1), 17. https://doi.org/10.1038/s41746020003231

Sohel, M. S., Shi, G., Zaman, N. T., Hossain, B., Halimuzzaman, M., Akintunde, T. Y., & Liu, H. (2022). Understanding the food insecurity and coping strategies of indigenous households during COVID19 crisis in Chittagong hill tracts, Bangladesh: A qualitative study. Foods, 11(19), 3103.

Tamraparani, V. (2019). A Practical Approach to Model Risk Management and Governance in Insurance: A Practitioner’s Perspective. Journal of Computational Analysis and Applications, 27(7).

Tamraparani, V. (2019). DataDriven Strategies for Reducing Employee Health Insurance Costs: A Collaborative Approach with Carriers and Brokers. International Journal of Advanced Engineering Technologies and Innovations, 1(1), 110127.

Tamraparani, V. (2020). Automating Invoice Processing in Fund Management: Insights from RPA and Data Integration Techniques. Journal of Computational Analysis and Applications, 28(6).

Tamraparani, V. (2021). Cloud and Data Transformation in Banking: Managing Middle and Back Office Operations Using Snowflake and Databricks. Journal of Computational Analysis and Applications, 29(4).

Tamraparani, V. (2022). Enhancing Cybersecurity and Firm Resilience Through Data Lineage: Best Practices and ML Ops for AutoDetection. International Journal of Advanced Engineering Technologies and Innovations, 1(2), 415427.

Tamraparani, V. (2023). Leveraging AI for Fraud Detection in Identity and Access Management: A Focus on LargeScale Customer Data. Journal of Computational Analysis and Applications, 31(4).

Tamraparani, V. (2024). Applying Robotic Process Automation & AI techniques to reduce time to market for medical devices compliance & provisioning. Revista de Inteligencia Artificial en Medicina, 15(1).

Tamraparani, V. (2024). Revolutionizing payments infrastructure with AI & ML to enable secure cross border payments. Journal of Multidisciplinary Research, 10(02), 4970.

Tamraparani, V., & Dalal, A. (2022). Developing a robust CRM Analytics strategy for Hedge Fund institutions to improve investment diversification. Unique Endeavor in Business & Social Sciences, 5(1), 110.

Tamraparani, V., & Dalal, A. (2023). Self generating & self healing test automation scripts using AI for automating regulatory & compliance functions in financial institutions. Revista de Inteligencia Artificial en Medicina, 14(1), 784796.

Tamraparani, V., & Islam, M. A. (2021). Improving Accuracy of Fraud Detection Models in Health Insurance Claims Using Deep Learning/AI. International Journal of Advanced Engineering Technologies and Innovations, 1(4).

Tamraparani, V., & Islam, M. A. (2023). Enhancing data privacy in healthcare with deep learning models & AI personalization techniques. International Journal of Advanced Engineering Technologies and Innovations, 1(01), 397418.

Tamraparani, Venugopal. (2022). Ethical Implications of Implementing AI in Wealth Management for Personalized Investment Strategies. International Journal of Science and Research (IJSR). 11. 16251633. 10.21275/SR220309091129.

Tjoa, E., & Guan, C. (2020). A survey on explainable artificial intelligence (XAI): Toward transparent AI. IEEE Access, 8, 220712220742. https://doi.org/10.1109/ACCESS.2020.3026739

Topol, E. J. (2019). Highperformance medicine: The convergence of human and artificial intelligence. Nature Medicine, 25(1), 4456. https://doi.org/10.1038/s4159101803007

Venaik, U., Dalal, A., Mittal, M., Kushwaha, A., & Kumar, L. (2024). NLP Project Report: Textual EmotionCause Pair Extraction in Conversations. Journal of Computational Analysis and Applications, 33(7).

Yang, G., Ye, Q., & Xia, J. (2019). Unbox AI: Explaining artificial intelligence for medical image analysis. IEEE Transactions on Medical Imaging, 39(4), 10241035. https://doi.org/10.1109/TMI.2019.2940363

Downloads

Published

2025-01-31

How to Cite

Hossain, Z., Hossain, M. E. ., Ahmed, N. ., Kabir, M. F. ., & Hossain, I. S. . (2025). Evaluating the Performance of Vision Transformers and Convolutional Neural Networks for Hostile Image Detection. Indonesian Journal of Advanced Research, 4(1), 111–130. https://doi.org/10.55927/ijar.v4i1.13681