<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//TaxonX//DTD Taxonomic Treatment Publishing DTD v0 20100105//EN" "../../nlm/tax-treatment-NS0.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:tp="http://www.plazi.org/taxpub" article-type="research-article" dtd-version="3.0" xml:lang="en">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">69</journal-id>
      <journal-id journal-id-type="index">urn:lsid:arphahub.com:pub:8D21F818-6EEF-540F-91C7-D50E3E5A13E0</journal-id>
      <journal-title-group>
        <journal-title xml:lang="en">Maandblad voor Accountancy en Bedrijfseconomie</journal-title>
        <abbrev-journal-title xml:lang="en">MAB</abbrev-journal-title>
      </journal-title-group>
      <issn pub-type="ppub">0924-6304</issn>
      <issn pub-type="epub">2543-1684</issn>
      <publisher>
        <publisher-name>Amsterdam University Press</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.5117/mab.99.149299</article-id>
      <article-id pub-id-type="publisher-id">149299</article-id>
      <article-categories>
        <subj-group subj-group-type="heading">
          <subject>Research Article</subject>
        </subj-group>
        <subj-group subj-group-type="scientific_subject">
          <subject>Accountantscontrole (Auditing)</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>﻿Generative AI and cybersecurity: Exploring opportunities and threats at their intersection</article-title>
      </title-group>
      <contrib-group content-type="authors">
        <contrib contrib-type="author" corresp="yes">
          <name name-style="western">
            <surname>Orpak</surname>
            <given-names>Kunter</given-names>
          </name>
          <email xlink:type="simple">kunterorpak@gmail.com</email>
          <xref ref-type="aff" rid="A1">1</xref>
        </contrib>
      </contrib-group>
      <aff id="A1">
        <label>1</label>
        <addr-line content-type="verbatim">University of Amsterdam, Amstelveen, Netherlands</addr-line>
        <institution>University of Amsterdam</institution>
        <addr-line content-type="city">Amstelveen</addr-line>
        <country>Netherlands</country>
      </aff>
      <author-notes>
        <fn fn-type="corresp">
          <p>Corresponding author: Kunter Orpak (<email xlink:type="simple">kunterorpak@gmail.com</email>).</p>
        </fn>
        <fn fn-type="edited-by">
          <p>Academic editor: Annemarie Oord</p>
        </fn>
      </author-notes>
      <pub-date pub-type="collection">
        <year>2025</year>
      </pub-date>
      <pub-date pub-type="epub">
        <day>11</day>
        <month>09</month>
        <year>2025</year>
      </pub-date>
      <volume>99</volume>
      <issue>4</issue>
      <fpage>221</fpage>
      <lpage>230</lpage>
      <uri content-type="arpha" xlink:href="http://openbiodiv.net/03A1148E-8D48-510E-9B43-BEEBEA0D552E">03A1148E-8D48-510E-9B43-BEEBEA0D552E</uri>
      <uri content-type="zenodo_dep_id" xlink:href="https://zenodo.org/record/17111813">17111813</uri>
      <history>
        <date date-type="received">
          <day>10</day>
          <month>02</month>
          <year>2025</year>
        </date>
        <date date-type="accepted">
          <day>16</day>
          <month>04</month>
          <year>2025</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>Kunter Orpak</copyright-statement>
        <license license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by-nc-nd/4.0/" xlink:type="simple">
          <license-p>This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY-NC-ND 4.0), which permits to copy and distribute the article for non-commercial purposes, provided that the article is not altered or modified and the original author and source are credited.</license-p>
        </license>
      </permissions>
      <abstract>
        <label>﻿Abstract</label>
        <p>Generative AI, particularly large language models (<abbrev xlink:title="large language models" id="ABBRID0EGC">LLMs</abbrev>), is reshaping the cybersecurity landscape by enabling both innovative defense mechanisms and novel forms of attack. This article explores the dual role of generative AI in both offensive and defensive cybersecurity operations. While GenAI offers significant advancements in defensive capabilities, it is also being leveraged by nation-state actors to enhance the sophistication and success rates of cyberattacks. The article analyzes how <abbrev xlink:title="large language models" id="ABBRID0EKC">LLMs</abbrev> are applied in offensive engagements such as red teaming, penetration testing, and threat intelligence, while also identifying emerging technical, operational, and strategic risks associated with their deployment. Special attention is given to the cybersecurity challenges of generative AI systems themselves, highlighting limitations in conventional frameworks and proposing governance-oriented mitigations such as model evaluation, human-in-the-loop oversight, GenAI-specific red teaming, and the structured dissemination of threat intelligence derived from GenAI-enabled security practices.</p>
      </abstract>
      <kwd-group>
        <label>Keywords</label>
        <kwd>Generative AI</kwd>
        <kwd>cybersecurity</kwd>
        <kwd>AI risk management</kwd>
        <kwd>LLM security</kwd>
        <kwd>AI governance</kwd>
        <kwd>cyber threat intelligence</kwd>
        <kwd>adversarial AI attacks</kwd>
        <kwd>penetration testing</kwd>
        <kwd>AI in offensive security</kwd>
        <kwd>AI in cyber defense</kwd>
        <kwd>AI red teaming</kwd>
        <kwd>AI compliance in audit</kwd>
      </kwd-group>
      <funding-group>
        <award-group>
          <funding-source>
            <named-content content-type="funder_name">Universiteit van Amsterdam</named-content>
            <named-content content-type="funder_identifier">501100001827</named-content>
            <named-content content-type="funder_ror">https://ror.org/04dkp9463</named-content>
            <named-content content-type="funder_doi">http://doi.org/10.13039/501100001827</named-content>
          </funding-source>
        </award-group>
      </funding-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="﻿Relevance to practice" id="SECID0E4C">
      <title>﻿Relevance to practice</title>
      <p>As generative AI systems rapidly integrate into business and IT environments, internal auditors, internal control specialists, and IT audit professionals should understand their cybersecurity implications. This article explores the intersection of generative AI and cybersecurity, providing insights into both opportunities and risks. By examining AI-driven offensive and defensive security applications, associated threats, and mitigation strategies, the article equips professionals with the knowledge to assess and manage AI-related cyber risks in organizations.</p>
    </sec>
    <sec sec-type="﻿1. Introduction" id="SECID0EDD">
      <title>﻿1. Introduction</title>
      <p>Generative AI differs from other AI models primarily in its ability to generate novel content rather than just analyzing or acting on existing data. Traditional AI models typically use specific data to solve specific problems and generate specific answers based on input data. In contrast, generative AI models, like large language models (<abbrev xlink:title="large language models" id="ABBRID0EJD">LLMs</abbrev>), are capable of creating new and original content by mapping input information into a high-dimensional latent space and driving stochastic behavior to produce novel outputs even with the same input stimuli (<xref ref-type="bibr" rid="B10">Corchado et al. 2023</xref>). LLM-based agents have demonstrated significant potential in attaining human-like intelligence by leveraging comprehensive training datasets and a substantial number of model parameters. These agents possess more comprehensive internal world knowledge compared to traditional reinforcement learning models, enabling more informed actions without specific domain training. Furthermore, LLM-based agents offer natural language interfaces, providing flexible and explainable interactions with human operators (<xref ref-type="bibr" rid="B45">Wang et al. 2024</xref>). For the purposes of this article, the term ‘<italic>LLM</italic>’ will be used to refer to any generative AI model that accepts various forms of input and produces new content as output. The scope of this article is primarily focused on the application of generative AI – particularly <abbrev xlink:title="large language models" id="ABBRID0EXD">LLMs</abbrev> – in the domain of cybersecurity. This article is intended to raise awareness of the capabilities, risks, and implications of <abbrev xlink:title="large language models" id="ABBRID0E2D">LLMs</abbrev> in cybersecurity. It does not aim to provide guidance for malicious use but rather to inform cybersecurity, audit, and risk professionals about emerging threats and responsibilities.</p>
      <p>In this article, “<italic>offensive cybersecurity</italic>” refers to proactive security testing methods such as penetration testing, red teaming and threat simulation, aimed at identifying vulnerabilities before malicious actors exploit them. In contrast, “<italic>defensive cybersecurity</italic>” encompasses technologies and processes focused on prevention, detection, response, and recovery from cyber threats.</p>
      <p>This article is structured as follows: Section 2.1 explores the role of Generative AI in offensive cybersecurity, while Section 2.2 examines its applications in cyber defense. Section 3 highlights key risks associated with AI-driven security practices, followed by Section 4 discussing governance and mitigation strategies. Finally, the conclusion summarizes insights for internal audit, IT audit and internal control professionals.</p>
    </sec>
    <sec sec-type="﻿2. Impact of Generative AI in cybersecurity domain" id="SECID0EGE">
      <title>﻿2. Impact of Generative AI in cybersecurity domain</title>
      <p>The relationship between Generative AI and cybersecurity can be outlined across four distinct categories (<xref ref-type="bibr" rid="B15">Gupta et al. 2023</xref>): 1.) Using Gen AI in cybersecurity offensive domain, 2.) Using Gen AI in cyber defense operations, 3.) Risks associated with Gen AI and 4.) Cybersecurity of Gen AI models.</p>
      <sec sec-type="﻿2.1. Using Generative AI in cybersecurity offensive domain" id="SECID0EQE">
        <title>﻿2.1. Using Generative AI in cybersecurity offensive domain</title>
        <p>Generative AI has significantly impacted offensive cybersecurity by increasing the sophistication and scale of cyber threats by allowing more complex and varied types of cyber-attacks (<xref ref-type="bibr" rid="B31">Palani et al. 2024</xref>). This dual nature of Generative AI highlights its role as a double-edged sword in cybersecurity. <abbrev xlink:title="large language models" id="ABBRID0E1E">LLMs</abbrev> can be a valuable tool for cyber security professionals aiding in tasks such as penetration testing and developing security solutions (<xref ref-type="bibr" rid="B2">Al-Hawawreh et al. 2023</xref>). The use of <abbrev xlink:title="large language models" id="ABBRID0ECF">LLMs</abbrev> has been shown to automate cyber-attacks effectively, with language models like ChatGPT able to generate executable attack code and script fragments (<xref ref-type="bibr" rid="B20">Iturbe et al. 2024</xref>). Generative AI-powered attacks achieve a 67% higher success rate and a 72% reduction in operational complexity, enhancing the effectiveness of cyber offensive operations by simulating sophisticated attack scenarios (<xref ref-type="bibr" rid="B37">Reddem 2024</xref>). Microsoft (<xref ref-type="bibr" rid="B22">Karamthulla et al. 2024</xref>) observed that threat actors like Forest Blizzard, Emerald Sleet, Crimson Sandstorm, Charcoal Typhoon, and Salmon Typhoon are increasingly looking to AI, including Generative AI, to enhance their attacks. The analysis revealed that these LLM-enhanced attack campaigns correspond to several tactics described in the MITRE ATT&amp;CK framework. During the reconnaissance phase, threat actors utilized <abbrev xlink:title="large language models" id="ABBRID0ESF">LLMs</abbrev> to scan and collect victim-specific information, thereby tailoring their attacks more effectively. In the resource development phase, <abbrev xlink:title="large language models" id="ABBRID0EWF">LLMs</abbrev> was applied to enhance malware capabilities and develop supporting infrastructure. For initial access, attackers launched targeted spear phishing campaigns, where <abbrev xlink:title="large language models" id="ABBRID0E1F">LLMs</abbrev> generated highly convincing and context-specific content. Defense evasion was observed through the use of AI-generated obfuscation techniques to bypass detection mechanisms on compromised systems. Finally, during the collection phase, adversaries employed generative models to extract information from sensitive data repositories, often targeting high-profile individuals or critical thematic domains.</p>
        <p>Traditional cybersecurity offensive testing methods, like red teaming and penetration testing, are often time and resource-intensive, necessitating the adoption of specialized tools and algorithms for improved efficiency. Integrating <abbrev xlink:title="large language models" id="ABBRID0EAG">LLMs</abbrev> into the red team testing process offers new opportunities to enhance efficiency, precision, and cost-effectiveness by automating complex tasks, improving decision-making, and providing real-time insights during engagements (<xref ref-type="bibr" rid="B33">Patil et al. 2024</xref>). <abbrev xlink:title="large language models" id="ABBRID0EIG">LLMs</abbrev> can significantly reduce the time required for each testing phase by rapidly processing extensive datasets and proposing tailored actions (<xref ref-type="bibr" rid="B50">Zaydi and Maleh 2024</xref>). It allows even inexperienced IT operations staff to execute tests by providing a more efficient and accessible approach to penetration testing, potentially reducing costs and increasing the frequency of security assessments for organizations, particularly small and medium enterprises that may lack the budget for professional security testing services (Valea and Oprişa 2020).</p>
        <p>The use of <abbrev xlink:title="large language models" id="ABBRID0ESG">LLMs</abbrev> in the threat intelligence phases of offensive security tests significantly enhances the accuracy and speed of information extraction and analysis. <abbrev xlink:title="large language models" id="ABBRID0EWG">LLMs</abbrev> can automate the extraction and summarization of important information from large datasets, such as historical cyber incident reports, thereby improving the accuracy of threat intelligence and the ability to forecast future threats (<xref ref-type="bibr" rid="B41">Sufi 2024</xref>). Overall, <abbrev xlink:title="large language models" id="ABBRID0E5G">LLMs</abbrev> provide deep insights that help offensive cybersecurity professionals respond more efficiently to emerging threats and risks (<xref ref-type="bibr" rid="B17">Hassanin and Moustafa 2024</xref>).</p>
        <p><abbrev xlink:title="large language models" id="ABBRID0EIH">LLMs</abbrev>, particularly ChatGPT, have high potential to enhance cyberattacks done by individuals with entry-level skills (<xref ref-type="bibr" rid="B49">Yigit et al. 2024</xref>). Using generative AI in cybersecurity, particularly in Capture the Flag (<abbrev xlink:title="Capture the Flag" id="ABBRID0EQH">CTF</abbrev>) exercises, has significant potential (<xref ref-type="bibr" rid="B7">Chamberlain and Casey 2024</xref>). <abbrev xlink:title="large language models" id="ABBRID0EYH">LLMs</abbrev> can automate the generation of attack scenarios, provide personalized feedback, and simulate real-world threat actors, enhancing the realism and effectiveness of <abbrev xlink:title="Capture the Flag" id="ABBRID0E3H">CTF</abbrev> exercises. When <abbrev xlink:title="large language models" id="ABBRID0EBAAC">LLMs</abbrev> are properly fine-tuned and combined with prompt engineering techniques like Chain of Thought (<abbrev xlink:title="Chain of Thought" id="ABBRID0EFAAC">CoT</abbrev>) and Optimization by PROmpting (<abbrev xlink:title="Optimization by PROmpting" id="ABBRID0EJAAC">OPRO</abbrev>), they can effectively automate threat modelling, involving simulating attacks to identify vulnerabilities (<xref ref-type="bibr" rid="B47">Yang et al. 2024</xref>).</p>
        <p>LLM agents are valuable tools for the reconnaissance phase of penetration tests (<xref ref-type="bibr" rid="B43">Temara 2023</xref>). In the reconnaissance phase, LLM agents generate detailed reports, enhancing initial information gathering (<xref ref-type="bibr" rid="B18">Hilario et al. 2024</xref>). That means that they provide insightful information, like technology stack, domain names, SSL/TLS configurations, ports and services used, that can be directly used for planning the next phase of a penetration test, offering meaningful insights that previously required multiple tools to obtain. During scanning, it automated test scenario generation, streamlining vulnerability detection. In exploitation, the LLM quickly responded to vulnerabilities, providing strategic exploitation options (<xref ref-type="bibr" rid="B18">Hilario et al. 2024</xref>). LLM-generated phishing and spam emails crafted to be more sophisticated and realistic, often bypassing the keyword-based and heuristic approaches that traditional spam detectors rely on. These spam detectors struggle with zero-shot and few-shot rephrased learning scenarios, where the emails are designed to evade detection by mimicking legitimate communication more closely (<xref ref-type="bibr" rid="B1">Afane et al. 2024</xref>).</p>
        <p>Through neural machine translation, <abbrev xlink:title="large language models" id="ABBRID0EFBAC">LLMs</abbrev> can effectively generate syntactically and semantically correct software exploits from natural language descriptions, though minor errors prevent full automation, indicating great potential (<xref ref-type="bibr" rid="B25">Liguori et al. 2021</xref>). MITRE ATT&amp;CK top tactics, where <abbrev xlink:title="large language models" id="ABBRID0ENBAC">LLMs</abbrev> are effective in generating successful executable code fragments, are “initial access” (TA0001), “defense evasion” (TA0005), and “discovery” (TA0007). The tactics “persistence” (TA0003), “privilege escalation” (TA0004), and “exfiltration” (TA0010) also showed satisfactory outcomes (<xref ref-type="bibr" rid="B20">Iturbe et al. 2024</xref>).</p>
        <p><abbrev xlink:title="large language models" id="ABBRID0EXBAC">LLMs</abbrev> can effectively facilitate cyber offensive attacks, specifically generating viruses and polymorphic malwares (<xref ref-type="bibr" rid="B15">Gupta et al. 2023</xref>). They can be leveraged to generate code that targets CPU vulnerabilities, such as those that allow viruses to read kernel memory, thereby gaining control over the system. Additionally, <abbrev xlink:title="large language models" id="ABBRID0E6BAC">LLMs</abbrev> can be leveraged to generate polymorphic malware, which is designed to alter its code with each execution to evade detection by traditional antivirus systems.</p>
        <p>There are some early successful examples of LLM applications illustrating the potential of LLM-based automation to transform cybersecurity by reducing manual effort, enhancing accuracy, and enabling comprehensive threat assessment. PTHelper (Gracia and Sánchez-Macián 2024) streamlines the penetration testing process by automating transitions between phases, using modules for scanning, exploiting, natural language processing, and reporting, demonstrating effectiveness in both black-box and controlled environments. PentestGPT (<xref ref-type="bibr" rid="B12">Deng et al. 2023a</xref>) simplifies testing by guiding <abbrev xlink:title="large language models" id="ABBRID0EJCAC">LLMs</abbrev> through micro-steps, reducing reliance on domain expertise, though expert oversight remains essential for accuracy. AutoAttacker (<xref ref-type="bibr" rid="B46">Xu et al. 2024</xref>) leverages GPT-4 for automating post-breach cyber-attack stages, excelling in lateral movement and credential gathering with modular components for planning and navigation. GAIL-PT (<xref ref-type="bibr" rid="B8">Chen et al. 2022</xref>) uses generative adversarial imitation learning to address the challenges of high-dimensional action spaces, integrating expert knowledge to improve decision-making in penetration testing.</p>
        <p>Figure <xref ref-type="fig" rid="F1">1</xref> illustrates how <abbrev xlink:title="large language models" id="ABBRID0E2CAC">LLMs</abbrev> can be integrated into different phases of offensive cybersecurity operations, including penetration testing, red teaming, and threat intelligence-based simulations. The figure emphasizes how <abbrev xlink:title="large language models" id="ABBRID0E6CAC">LLMs</abbrev> not only streamline traditional workflows but also enable new capabilities, such as automated scenario generation, enhanced reconnaissance, and dynamic exploitation options. This visual supports the argument that <abbrev xlink:title="large language models" id="ABBRID0EDDAC">LLMs</abbrev> can significantly improve the efficiency, scalability, and accessibility of offensive security testing.</p>
        <fig id="F1" position="float" orientation="portrait">
          <object-id content-type="arpha">A2EABCAF-B637-53B0-BE24-EE02C97E6E17</object-id>
          <label>Figure 1.</label>
          <caption>
            <p>Key applications of <abbrev xlink:title="large language models" id="ABBRID0EPDAC">LLMs</abbrev> across phases of cyber offensive security testing.</p>
          </caption>
          <graphic xlink:href="mab-99-221-g001.jpg" position="float" orientation="portrait" xlink:type="simple" id="oo_1412360.jpg">
            <uri content-type="original_file">https://binary.pensoft.net/fig/1412360</uri>
          </graphic>
        </fig>
      </sec>
      <sec sec-type="﻿2.2 Using Gen AI on cyber defense operations" id="SECID0EYDAC">
        <title>﻿2.2 Using Gen AI on cyber defense operations</title>
        <p>In the context of cyber defense, <abbrev xlink:title="large language models" id="ABBRID0E5DAC">LLMs</abbrev> excel in tasks such as threat detection, vulnerability analysis, and automated defense mechanisms (<xref ref-type="bibr" rid="B52">Zhou et al. 2024</xref>). They offer adaptive and intelligent technologies that can dynamically create and deploy actionable defense mechanisms, thereby increasing the efficiency of security operations. <abbrev xlink:title="large language models" id="ABBRID0EGEAC">LLMs</abbrev> enhance cybersecurity by analyzing historical and real-time data to accurately predict future threats and vulnerabilities, enabling organizations to implement proactive security measures and strengthen their defenses (<xref ref-type="bibr" rid="B28">Metta et al. 2024</xref>). Furthermore, their capability to automate routine cyber operations tasks like threat analysis and incident response, allowing cybersecurity professionals to dedicate more time to strategic decision-making and complex investigations.</p>
        <p><abbrev xlink:title="large language models" id="ABBRID0EQEAC">LLMs</abbrev> hold transformative potential in the field of cybersecurity defensive operations, offering significant advancements across various applications including threat intelligence, cybersecurity risk monitoring, vulnerability management, static malware analysis, dynamic debugging, anomaly detection and behavior analysis, web content security, phishing and spam detection, digital forensic, fuzz testing, program repairing, secure code generation, honeypots, and incident response and recovery (<xref ref-type="bibr" rid="B51">Zhang et al. 2025</xref>). Additionally, by being trained on frameworks like MITRE ATT&amp;CK and D3FEND, <abbrev xlink:title="large language models" id="ABBRID0EYEAC">LLMs</abbrev> can provide comprehensive insights into both attack techniques and corresponding defense procedures, facilitating a more robust cybersecurity posture. This dual capability of <abbrev xlink:title="large language models" id="ABBRID0E3EAC">LLMs</abbrev> not only accelerates the detection and response to cyber threats but also empowers cybersecurity professionals to develop more sophisticated defense strategies (<xref ref-type="bibr" rid="B3">Alotaibi et al. 2024</xref>).</p>
        <p>A recent comprehensive review by <xref ref-type="bibr" rid="B14">Ding et al. (2025)</xref> highlights that the integration of <abbrev xlink:title="large language models" id="ABBRID0EKFAC">LLMs</abbrev> into cyber defense operations offers significant advancements in managing and enhancing cybersecurity posture. By analyzing extensive datasets, <abbrev xlink:title="large language models" id="ABBRID0EOFAC">LLMs</abbrev> can extract valuable features and information, providing strategic recommendations to mitigate cyberattacks and effectively detect threats. Their application in security datasets allows for the generation of human-like text, which aids in threat and risk detection, facilitating rapid responses to potential threats. However, the successful deployment of <abbrev xlink:title="large language models" id="ABBRID0ESFAC">LLMs</abbrev> necessitates access to comprehensive datasets, including security data, network traffic, and log files, which are crucial for accurate threat detection and risk mitigation. Organizations must exercise caution in several areas before implementing <abbrev xlink:title="large language models" id="ABBRID0EWFAC">LLMs</abbrev> in their cyber operations. Ensuring data privacy and protection is critical, given the large-scale datasets involved. Awareness of the potential vulnerabilities and threats that <abbrev xlink:title="large language models" id="ABBRID0E1FAC">LLMs</abbrev> might introduce is essential to maintaining a robust cybersecurity posture. Compliance with regulatory requirements is also vital to avoid legal and operational challenges. Furthermore, <abbrev xlink:title="large language models" id="ABBRID0E5FAC">LLMs</abbrev> should be seamlessly integrated with existing cybersecurity systems to maximize their effectiveness. Lastly, ethical and responsible use of <abbrev xlink:title="large language models" id="ABBRID0ECGAC">LLMs</abbrev> is crucial to prevent misuse and maintain trust in their outputs, ensuring that these advanced tools contribute positively to cybersecurity efforts (<xref ref-type="bibr" rid="B14">Ding et al. 2025</xref>).</p>
        <p>In the current cyber security landscape various real-world cybersecurity products are being used in the cybersecurity operations that leverage Generative AI to enhance security measures (<xref ref-type="bibr" rid="B38">Sai et al. 2024</xref>). Google Cloud Security AI Workbench and Microsoft Security Copilot are designed to enhance threat detection and response. SentinelOne Purple AI focuses on addressing emerging threats with advanced AI techniques. Talon Enterprise Browser integrates with Microsoft Azure OpenAI Service to provide enterprise-grade access to Generative AI tools like ChatGPT, enhancing data protection and productivity. SlashNext Generative Human AI defends against advanced threats such as business email compromise and financial fraud by mimicking human threat researchers. Recorded Future AI leverages over a decade of threat analysis data to provide real-time threat landscape analysis and improve analyst efficiency. SecurityScorecard integrates with OpenAI’s GPT-4 to enhance its cybersecurity assessments, providing more comprehensive insights into potential vulnerabilities.</p>
        <p>Figure <xref ref-type="fig" rid="F2">2</xref> illustrates how <abbrev xlink:title="large language models" id="ABBRID0EWGAC">LLMs</abbrev> and LLM integrated products can support cyber security operations. This figure illustrates also the increasing role of generative AI in operational cyber defense, showing how LLM-integrated tools support threat identification, detection, protection, response and recovery.</p>
        <fig id="F2" position="float" orientation="portrait">
          <object-id content-type="arpha">48AF7BEA-2BC3-56DD-AEB6-AFE9AECC91F7</object-id>
          <label>Figure 2.</label>
          <caption>
            <p>Gen AI on cyber defense operations.</p>
          </caption>
          <graphic xlink:href="mab-99-221-g002.jpg" position="float" orientation="portrait" xlink:type="simple" id="oo_1412361.jpg">
            <uri content-type="original_file">https://binary.pensoft.net/fig/1412361</uri>
          </graphic>
        </fig>
      </sec>
    </sec>
    <sec sec-type="﻿3. Risks associated with Gen AI" id="SECID0EGHAC">
      <title>﻿3. Risks associated with Gen AI</title>
      <p>The risks associated with Generative AI can be classified under 3 categories particularly operational, technical and lastly systemic and strategic risk. These categories are derived from a synthesis of academic literature in this article.</p>
      <p>As shown in Figure <xref ref-type="fig" rid="F3">3</xref>, the risks associated with generative AI span operational, technical, and systemic &amp; strategic categories. Each type of risk requires a different set of mitigation strategies, as discussed in the following sections. While many of the risks associated with generative AI are systemic in nature, operational and technical risks, such as hallucinations, jailbreak vulnerabilities, or model theft, can manifest in both offensive and defensive cybersecurity operations contexts.</p>
      <fig id="F3" position="float" orientation="portrait">
        <object-id content-type="arpha">976AD0C2-8533-5F80-8786-179CB11855AA</object-id>
        <label>Figure 3.</label>
        <caption>
          <p>Risks associated with Generative AI.</p>
        </caption>
        <graphic xlink:href="mab-99-221-g003.jpg" position="float" orientation="portrait" xlink:type="simple" id="oo_1412362.jpg">
          <uri content-type="original_file">https://binary.pensoft.net/fig/1412362</uri>
        </graphic>
      </fig>
      <sec sec-type="﻿3.1. Operational risks" id="SECID0E4HAC">
        <title>﻿3.1. Operational risks</title>
        <p>While <abbrev xlink:title="large language models" id="ABBRID0EDIAC">LLMs</abbrev> have significant potential in cybersecurity, particularly in threat intelligence process, they are not yet perfectly accurate due to hallucination where <abbrev xlink:title="large language models" id="ABBRID0EHIAC">LLMs</abbrev> generate false information (<xref ref-type="bibr" rid="B34">Patsakis et al. 2024</xref>). LLM models can produce biased and unreliable content, and their increased consistency might make such content more credible and potentially more dangerous (<xref ref-type="bibr" rid="B10">Corchado et al. 2023</xref>). They may exhibit biases due to their training datasets, potentially leading to inaccurate or skewed recommendations (<xref ref-type="bibr" rid="B50">Zaydi and Maleh 2024</xref>). Another common operational concern is data poisoning, where malicious actors can corrupt the training datasets, leading to compromised AI performance and decision-making (<xref ref-type="bibr" rid="B26">Maryam et al. 2024</xref>). Attackers can manipulate the training data to cause the algorithm to make incorrect decisions, like misclassified cyber threats, misleading events/alerts and incorrect mitigations. The complexity of integrating AI systems with existing cybersecurity infrastructure poses a significant hurdle, as it requires substantial skills, resources and expertise (<xref ref-type="bibr" rid="B21">Jana et al. 2024</xref>).</p>
      </sec>
      <sec sec-type="﻿3.2. Technical risks" id="SECID0E6IAC">
        <title>﻿3.2. Technical risks</title>
        <p>First major challenge is the potential for adversarial attacks, where malicious actors can exploit LLM models by feeding them deceptive inputs to manipulate their outputs (<xref ref-type="bibr" rid="B21">Jana et al. 2024</xref>). Adversarial attacks, particularly prompt injection attacks, pose a significant risk, as they involve manipulating LLM inputs to generate unauthorized or harmful outputs, which can be exploited to simulate adversarial tactics and test system defenses against such manipulations (<xref ref-type="bibr" rid="B42">Taghavi and Feyzi 2024</xref>). A well-known example is Morris II, the first worm specifically designed to target Generative AI ecosystems using adversarial self-replicating prompts, highlighting a novel attack vector that exploits the interconnected nature of Generative AI-powered applications (<xref ref-type="bibr" rid="B9">Cohen et al. 2024</xref>). This example revealed the potential for adversarial attacks to compromise Generative AI systems, leading to malicious activities such as spamming, data exfiltration, and phishing.</p>
        <p>Model theft is another critical risk, where unauthorized entities gain access to and replicate AI models, undermining proprietary technologies and security protocols (<xref ref-type="bibr" rid="B26">Maryam et al. 2024</xref>). Model theft may pose significant losses on organizations, reputational risks, including using the stolen model for malicious purposes (<xref ref-type="bibr" rid="B42">Taghavi and Feyzi 2024</xref>).</p>
        <p>Although many developers have adopted AI technology in their workflows and generally find the code provided by AI to be usable and fairly accurate, there is caution regarding AI-generated code being insecure and inaccurate in coding and scripting practices (<xref ref-type="bibr" rid="B39">Sergeyuk et al. 2024</xref>). The use of AI-powered tools in cyber security operations may lead to the production of insecure code, posing significant risks (<xref ref-type="bibr" rid="B30">Oh et al. 2023</xref>). This can result in cyber security practitioners unknowingly incorporating insecure code into their systems, potentially compromising the integrity, effectiveness and security of their operations.</p>
        <p>Despite the deployment of undisclosed defenses by service providers, LLM agents are vulnerable to jailbreak attacks, where malicious prompts can manipulate these models to bypass their safeguards and generate harmful or sensitive content (<xref ref-type="bibr" rid="B13">Deng et al. 2023b</xref>). Users could manipulate the LLM by crafting jailbreak prompts that bypass internal controls, leading the model to generate unauthorized or harmful information.</p>
      </sec>
      <sec sec-type="﻿3.3. Systemic and strategic risks" id="SECID0ELKAC">
        <title>﻿3.3. Systemic and strategic risks</title>
        <p>An important concern regarding the systemic risks of <abbrev xlink:title="large language models" id="ABBRID0ERKAC">LLMs</abbrev> is about their self-replication capability. AI systems driven by <abbrev xlink:title="large language models" id="ABBRID0EVKAC">LLMs</abbrev> such as Meta’s Llama31-70B-Instruct and Alibaba’s Qwen25-72B-Instruct were demonstrated to be able to autonomously create separate copies of themselves, which could lead to the uncontrolled proliferation of AI systems, potentially forming independent networks that might act against its usage purpose (<xref ref-type="bibr" rid="B32">Pan et al. 2024</xref>). There are also concerns about the ethical aspects and potential misuse of AI-driven cyber offensive applications (<xref ref-type="bibr" rid="B36">Raman et al. 2024</xref>) and AI-generated content (<xref ref-type="bibr" rid="B21">Jana et al. 2024</xref>), which can be used to create convincing cyber-attacks or deepfakes, complicating the detection of genuine threats. Furthermore, the use of <abbrev xlink:title="large language models" id="ABBRID0EFLAC">LLMs</abbrev> in cybersecurity processes poses significant data leakage risks due to their need for accessing sensitive system information, which can lead to unauthorized access, especially in cloud-hosted environments (<xref ref-type="bibr" rid="B50">Zaydi and Maleh 2024</xref>). Lastly, some regulatory challenges have been defined in using <abbrev xlink:title="large language models" id="ABBRID0ENLAC">LLMs</abbrev> for cybersecurity (<xref ref-type="bibr" rid="B38">Sai et al. 2024</xref>), including the risk of intellectual property violations due to content generation similar to proprietary research, and the need for quality control and standardization to ensure consistent AI-generated advice. Additional challenges include defining data ownership, ensuring continuous monitoring and validation of AI performance, obtaining informed consent from users, maintaining interpretability and transparency of AI decision-making processes, and preventing over-reliance on LLM models, which could diminish human expertise.</p>
        <p>The inherent black-box nature of <abbrev xlink:title="large language models" id="ABBRID0EXLAC">LLMs</abbrev> presents significant challenges in understanding and controlling their operations, which raises critical concerns about transparency and accountability (<xref ref-type="bibr" rid="B4">Barman et al. 2024</xref>). Due to their complex and opaque internal workings, it is difficult for users and developers to predict or explain the outputs generated by these models. The lack of explainability in <abbrev xlink:title="large language models" id="ABBRID0E6LAC">LLMs</abbrev> limits their utility as coding and scripting tools, which could hinder the understanding and mitigation of security risks during cyber operations (<xref ref-type="bibr" rid="B23">Khoury et al. 2023</xref>). This lack of explainability and transparency can make it also difficult to assess the reliability and accuracy of the AI’s threat detection and exploit capabilities (<xref ref-type="bibr" rid="B29">Mohammed 2024</xref>).</p>
        <p>Overreliance on content generated by <abbrev xlink:title="large language models" id="ABBRID0ENMAC">LLMs</abbrev> poses significant risks, particularly due to the difficulty in detecting incorrect or misleading information produced by these models (<xref ref-type="bibr" rid="B48">Yao et al. 2024</xref>). <abbrev xlink:title="large language models" id="ABBRID0EVMAC">LLMs</abbrev> are capable of generating highly convincing text that can easily be mistaken for accurate cyber intelligence information, leading to incorrect actions and conclusions.</p>
      </sec>
    </sec>
    <sec sec-type="﻿4. Cybersecurity of Gen AI models" id="SECID0EZMAC">
      <title>﻿4. Cybersecurity of Gen AI models</title>
      <p>The wide use of LLM based agents in the IT landscape has widened the exploit surface available to attackers. This expansion is driven by several risk factors outlined in Section 3 of this article, including vulnerabilities inherent to generative AI models, the increased feasibility of adversarial and jailbreak attacks, and the misuse of <abbrev xlink:title="large language models" id="ABBRID0E6MAC">LLMs</abbrev> for generating potentially harmful or exploitable code. It is necessary to address cybersecurity risks specific to generative AI systems, on the top of traditional cybersecurity practices.</p>
      <p><abbrev xlink:title="large language models" id="ABBRID0EFNAC">LLMs</abbrev> can amplify existing security risks and introduce new ones, emphasizing the need for a thorough understanding of the system’s capabilities and applications. About the amplified cybersecurity risks, Microsoft has published its early lessons learned from its red teaming of 100 generative AI products (<xref ref-type="bibr" rid="B6">Bullwinkel et al. 2025</xref>). Cyber security professionals and LLM practitioners are advised to implement system-level mitigations, such as input sanitization, and model-level improvements, like instruction hierarchies, to manage and prioritize instructions effectively. Model-level evaluation is a critical AI governance infrastructure, providing insights into the safety and alignment of models, particularly about responsible training, responsible deployment, transparency and security (<xref ref-type="bibr" rid="B40">Shevlane et al. 2023</xref>). Microsoft’s publication also warns that <abbrev xlink:title="large language models" id="ABBRID0ERNAC">LLMs</abbrev> exposed to untrusted inputs may produce arbitrary outputs, including private information, emphasizing the necessity for robust input validation and data handling protocols. Additionally, the involvement of subject matter experts has been considered crucial for evaluating LLM outputs in specialized domains, including cyber security, where <abbrev xlink:title="large language models" id="ABBRID0EVNAC">LLMs</abbrev> may not be reliable. Regular AI red teaming practices is recommended to enhance the communication of methods and findings, thereby improving the overall security posture.</p>
      <p>Figure <xref ref-type="fig" rid="F4">4</xref> provides a structured overview of key cybersecurity practices tailored to generative AI systems, particularly <abbrev xlink:title="large language models" id="ABBRID0E6NAC">LLMs</abbrev>. This approach reflects the broader message of this section: securing generative AI requires controls beyond conventional cybersecurity frameworks.</p>
      <fig id="F4" position="float" orientation="portrait">
        <object-id content-type="arpha">66E4DDDF-F4F9-5B6E-8B47-1FF8CF5F758F</object-id>
        <label>Figure 4.</label>
        <caption>
          <p>Cybersecurity approach for Gen AI Models.</p>
        </caption>
        <graphic xlink:href="mab-99-221-g004.jpg" position="float" orientation="portrait" xlink:type="simple" id="oo_1412363.jpg">
          <uri content-type="original_file">https://binary.pensoft.net/fig/1412363</uri>
        </graphic>
      </fig>
      <p>A critical enabler of this is AI governance, which plays a key role in ensuring secure and ethical AI deployment. It addresses systemic concerns such as algorithmic bias, data privacy, transparency, and responsible use of AI technologies (<xref ref-type="bibr" rid="B29">Mohammed 2024</xref>). Within this context, model-level evaluation is essential for limiting the creation, deployment, and proliferation of generative AI systems that may pose risks to organizations (<xref ref-type="bibr" rid="B40">Shevlane et al. 2023</xref>). Such evaluations help identify whether a Gen AI model possesses potentially harmful capabilities or a tendency to apply these capabilities inappropriately.</p>
      <p>Human oversight in Gen AI operations remains equally vital, since it involves assessing Gen AI safety questions that require emotional intelligence and understanding the full range of interactions users might have with Gen AI systems (<xref ref-type="bibr" rid="B6">Bullwinkel et al. 2025</xref>). Only human subject matter experts can evaluate model responses within specific domains and judge whether outputs are inappropriate, misleading, or harmful. Existing cybersecurity frameworks, such as NIST CSF 2.0, COBIT 2019, ISO 27001:2022, and the latest ISO 42001:2023, still exhibit significant gaps in addressing the multifaceted risks associated with <abbrev xlink:title="large language models" id="ABBRID0E6OAC">LLMs</abbrev>, necessitating enhancements and the integration of human-expert-in-the-loop validation processes to ensure secure and compliant LLM integration (<xref ref-type="bibr" rid="B27">McIntosh et al. 2024</xref>). In this context, cybersecurity professionals in the fields of internal audit and IT audit can play a pivotal role by reviewing and validating LLM-generated outputs, particularly in high-risk or regulated environments, thereby reinforcing trust, accuracy, and accountability in AI-driven cybersecurity operations.</p>
      <p>Furthermore, given that <abbrev xlink:title="large language models" id="ABBRID0EJPAC">LLMs</abbrev> can amplify existing security risks and introduce new ones, Gen AI red teaming is a crucial practice for assessing the safety and security of Gen AI systems, as it pushes beyond model-level safety benchmarks by emulating real-world attacks against end-to-end systems (<xref ref-type="bibr" rid="B6">Bullwinkel et al. 2025</xref>).</p>
      <p>Finally, aligning various cybersecurity efforts, including Gen AI red teaming, with real-world risks is indispensable, which necessitates the dissemination of insights and threat intelligence gathered from extensive cybersecurity practices (<xref ref-type="bibr" rid="B6">Bullwinkel et al. 2025</xref>).</p>
      <p>A new dimension in securing <abbrev xlink:title="large language models" id="ABBRID0EZPAC">LLMs</abbrev> is the integration of <abbrev xlink:title="large language models" id="ABBRID0E4PAC">LLMs</abbrev> into red teaming practices, such as an automated red teaming LLM agent that simulates adversarial conversations with <abbrev xlink:title="large language models" id="ABBRID0ECAAE">LLMs</abbrev>, leveraging multiple adversarial prompting techniques, allowing for scalable and efficient stress-testing of known vulnerabilities, thus freeing human testers to explore new risk areas (<xref ref-type="bibr" rid="B35">Pavlova et al. 2024</xref>). This approach enhances productivity and efficiency by automating prompt generation, conversion, and response scoring, allowing for extensive coverage of potential risks (<xref ref-type="bibr" rid="B16">Haider et al. 2024</xref>). However, manual red-teaming on Generative AI systems remains still crucial for capturing issues that automated methods might miss, particularly in complex, nuanced interactions (<xref ref-type="bibr" rid="B5">Bengio et al. 2025</xref>).</p>
    </sec>
    <sec sec-type="﻿5. Conclusions" id="SECID0ESAAE">
      <title>﻿5. Conclusions</title>
      <p>Generative AI, particularly <abbrev xlink:title="large language models" id="ABBRID0EYAAE">LLMs</abbrev>, has rapidly emerged as a powerful tool in cybersecurity, benefiting both cyber defenders and adversaries. On one hand, cybersecurity professionals leverage <abbrev xlink:title="large language models" id="ABBRID0E3AAE">LLMs</abbrev> to enhance penetration testing, red teaming, and threat intelligence-driven security tests, enabling faster, more sophisticated, and cost-effective offensive security operations. On the other hand, malicious actors exploit the same technology to automate cyberattacks, craft advanced phishing campaigns, and develop polymorphic malware, expanding the cyber threat landscape. The cybersecurity community maintains a balanced perspective on the adoption of <abbrev xlink:title="large language models" id="ABBRID0EABAE">LLMs</abbrev>, recognizing both their value in strengthening defense operations and the significant challenges, risks, and potential for misuse they introduce. By openly addressing both the offensive and defensive capabilities of generative AI, this article aims to equip professionals with the knowledge to anticipate threats, not to support their misuse. Responsible innovation and risk-informed governance remain essential.</p>
      <p>As AI-driven cyber security applications evolve, so do the risks and regulatory challenges associated with their use. Generative AI introduces vulnerabilities such as model exploitation, adversarial attacks, jailbreak exploits, and biased or unreliable outputs, which could undermine security efforts if not properly managed. While AI provides remarkable efficiencies, it also increases organizations’ exploit surfaces, requiring new control frameworks and continuous risk assessment.</p>
      <p>For internal auditors, IT auditors, and internal control professionals, generative AI is not just an IT concern but a governance and risk management issue. To mitigate the risks associated with generative AI, these professionals can play a key role by assessing whether appropriate AI governance frameworks are in place and integrating AI-specific risks into enterprise risk management and audit plans. These professionals should understand the implications of generative AI in cybersecurity, ensuring that organizations harness AI’s benefits while mitigating its risks. By balancing innovation with security, they can contribute to the responsible adoption of AI, strengthen ethical AI governance, and ensure compliance with evolving regulatory standards. As generative AI continues to shape the cybersecurity domain, the key challenge will be ensuring AI remains an asset rather than a liability. By proactively addressing the risks and opportunities of AI in security, professionals across cybersecurity, audit, and internal control fields can play a pivotal role in securing the AI-driven future.</p>
      <sec sec-type="﻿Future research opportunities" id="SECID0EGBAE">
        <title>﻿Future research opportunities</title>
        <p>This article aimed to highlight the intersection between generative AI and cybersecurity, focusing on both opportunities and associated risks. Future research could further explore how audit, risk, and internal control functions can enhance the cybersecurity assurance of GenAI systems. This includes examining control frameworks, audit methodologies, and regulatory compliance strategies tailored to the unique characteristics of AI-based technologies.</p>
        <boxed-text id="box1" position="float" orientation="portrait">
          <p><bold>K. Orpak RE CISSP CCSP CISA CIA ISO27001LA CSX-F CDPO CFSA CCSA – Kunter</bold>, Senior Supervision Officer – DORA TLPT / TIBER-EU Test Manager, Dutch Authority for the Financial Markets (AFM). PhD Researcher, Faculty of Economics and Business, University of Amsterdam. <italic>This article has been written within the scope of his academic affiliation with the University of Amsterdam</italic>.</p>
        </boxed-text>
        <boxed-text id="box2" position="float" orientation="portrait">
          <p>The author confirms having no financial interests or conflicts of interest related to the subject matter or materials discussed in this article.</p>
        </boxed-text>
      </sec>
    </sec>
  </body>
  <back>
    <ack>
      <title>﻿Acknowledgements</title>
      <p>The author acknowledges the use of ChatGPT-4o, an advanced language model, to assist in the linguistic refinement and structural improvements of this manuscript. The tool was used solely for linguistic and structural refinement; all conceptual contributions, critical analysis, and findings are entirely the author’s own.</p>
    </ack>
    <ref-list>
      <title>﻿References</title>
      <ref id="B1">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Afane</surname><given-names>K</given-names></name><name name-style="western"><surname>Wei</surname><given-names>W</given-names></name><name name-style="western"><surname>Mao</surname><given-names>Y</given-names></name><name name-style="western"><surname>Farooq</surname><given-names>J</given-names></name><name name-style="western"><surname>Chen</surname><given-names>J</given-names></name></person-group> (<year>2024</year>) Next-generation phishing: how LLM agents empower cyber attackers. arXiv. <ext-link xlink:href="10.1109/BigData62323.2024.10825018" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/BigData62323.2024.10825018</ext-link></mixed-citation>
      </ref>
      <ref id="B2">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Al-Hawawreh</surname><given-names>M</given-names></name><name name-style="western"><surname>Aljuhani</surname><given-names>A</given-names></name><name name-style="western"><surname>Jararweh</surname><given-names>Y</given-names></name></person-group> (<year>2023</year>) <article-title>ChatGPT for cybersecurity: practical applications, challenges, and future directions.</article-title><source>Cluster Computing</source><volume>26</volume>: <fpage>3421</fpage>–<lpage>3436</lpage>. <ext-link xlink:href="10.1007/s10586-023-04124-5" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1007/s10586-023-04124-5</ext-link></mixed-citation>
      </ref>
      <ref id="B3">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Alotaibi</surname><given-names>L</given-names></name><name name-style="western"><surname>Seher</surname><given-names>S</given-names></name><name name-style="western"><surname>Mohammad</surname><given-names>N</given-names></name></person-group> (<year>2024</year>) Cyberattacks using ChatGPT: exploring malicious content generation through prompt engineering. 2024 ASU Int Conf Emerg Technol Sustain Intell Syst (ICETSIS) 00: 1304–1311 (ICETSIS) 00: 1304–1311 (2024). <ext-link xlink:href="10.1109/ICETSIS61505.2024.10459698" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/ICETSIS61505.2024.10459698</ext-link></mixed-citation>
      </ref>
      <ref id="B4">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Barman</surname><given-names>D</given-names></name><name name-style="western"><surname>Guo</surname><given-names>Z</given-names></name><name name-style="western"><surname>Conlan</surname><given-names>O</given-names></name></person-group> (<year>2024</year>) The dark side of language models: exploring the potential of LLMs in multimedia disinformation generation and dissemination. Machine Learning with Applications 16: 100545. <ext-link xlink:href="10.1016/j.mlwa.2024.100545" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.mlwa.2024.100545</ext-link></mixed-citation>
      </ref>
      <ref id="B5">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Bengio</surname><given-names>Y</given-names></name><name name-style="western"><surname>Mindermann</surname><given-names>S</given-names></name><name name-style="western"><surname>Privitera</surname><given-names>D</given-names></name><name name-style="western"><surname>Besiroglu</surname><given-names>T</given-names></name><name name-style="western"><surname>Bommasani</surname><given-names>R</given-names></name><name name-style="western"><surname>Casper</surname><given-names>S</given-names></name><name name-style="western"><surname>Choi</surname><given-names>Y</given-names></name><name name-style="western"><surname>Fox</surname><given-names>P</given-names></name><name name-style="western"><surname>Garfinkel</surname><given-names>B</given-names></name><name name-style="western"><surname>Goldfarb</surname><given-names>D</given-names></name><name name-style="western"><surname>Heidari</surname><given-names>H</given-names></name><name name-style="western"><surname>Ho</surname><given-names>A</given-names></name><name name-style="western"><surname>Kapoor</surname><given-names>S</given-names></name><name name-style="western"><surname>Khalatbari</surname><given-names>L</given-names></name><name name-style="western"><surname>Longpre</surname><given-names>S</given-names></name><name name-style="western"><surname>Manning</surname><given-names>S</given-names></name><name name-style="western"><surname>Mavroudis</surname><given-names>V</given-names></name><name name-style="western"><surname>Mazeika</surname><given-names>M</given-names></name><name name-style="western"><surname>Michael</surname><given-names>J</given-names></name><name name-style="western"><surname>Newman</surname><given-names>J</given-names></name><name name-style="western"><surname>Ng</surname><given-names>KY</given-names></name><name name-style="western"><surname>Okolo</surname><given-names>CT</given-names></name><name name-style="western"><surname>Raji</surname><given-names>D</given-names></name><name name-style="western"><surname>Sastry</surname><given-names>G</given-names></name><name name-style="western"><surname>Seger</surname><given-names>E</given-names></name><name name-style="western"><surname>Skeadas</surname><given-names>T</given-names></name><name name-style="western"><surname>South</surname><given-names>T</given-names></name><name name-style="western"><surname>Strubell</surname><given-names>E</given-names></name><name name-style="western"><surname>Tramèr</surname><given-names>F</given-names></name><name name-style="western"><surname>Velasco</surname><given-names>L</given-names></name><name name-style="western"><surname>Wheeler</surname><given-names>N</given-names></name><name name-style="western"><surname>Acemoglu</surname><given-names>D</given-names></name><name name-style="western"><surname>Adekanmbi</surname><given-names>O</given-names></name><name name-style="western"><surname>Dalrymple</surname><given-names>D</given-names></name><name name-style="western"><surname>Dietterich</surname><given-names>TG</given-names></name><name name-style="western"><surname>Felten</surname><given-names>EW</given-names></name><name name-style="western"><surname>Fung</surname><given-names>P</given-names></name><name name-style="western"><surname>Gourinchas</surname><given-names>P-O</given-names></name><name name-style="western"><surname>Heintz</surname><given-names>F</given-names></name><name name-style="western"><surname>Hinton</surname><given-names>G</given-names></name><name name-style="western"><surname>Jennings</surname><given-names>N</given-names></name><name name-style="western"><surname>Krause</surname><given-names>A</given-names></name><name name-style="western"><surname>Leavy</surname><given-names>S</given-names></name><name name-style="western"><surname>Liang</surname><given-names>P</given-names></name><name name-style="western"><surname>Ludermir</surname><given-names>T</given-names></name><name name-style="western"><surname>Marda</surname><given-names>V</given-names></name><name name-style="western"><surname>Margetts</surname><given-names>H</given-names></name><name name-style="western"><surname>McDermid</surname><given-names>J</given-names></name><name name-style="western"><surname>Munga</surname><given-names>J</given-names></name><name name-style="western"><surname>Narayanan</surname><given-names>A</given-names></name><name name-style="western"><surname>Nelson</surname><given-names>A</given-names></name><name name-style="western"><surname>Neppel</surname><given-names>C</given-names></name><name name-style="western"><surname>Oh</surname><given-names>A</given-names></name><name name-style="western"><surname>Ramchurn</surname><given-names>G</given-names></name><name name-style="western"><surname>Russell</surname><given-names>S</given-names></name><name name-style="western"><surname>Schaake</surname><given-names>M</given-names></name><name name-style="western"><surname>Schölkopf</surname><given-names>B</given-names></name><name name-style="western"><surname>Song</surname><given-names>D</given-names></name><name name-style="western"><surname>Soto</surname><given-names>A</given-names></name><name name-style="western"><surname>Tiedrich</surname><given-names>L</given-names></name><name name-style="western"><surname>Varoquaux</surname><given-names>G</given-names></name><name name-style="western"><surname>Yao</surname><given-names>A</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>Y-Q</given-names></name><name name-style="western"><surname>Albalawi</surname><given-names>F</given-names></name><name name-style="western"><surname>Alserkal</surname><given-names>M</given-names></name><name name-style="western"><surname>Ajala</surname><given-names>O</given-names></name><name name-style="western"><surname>Avrin</surname><given-names>G</given-names></name><name name-style="western"><surname>Busch</surname><given-names>C</given-names></name><name name-style="western"><surname>de Leon Ferreira de Carvalho</surname><given-names>ACP</given-names></name><name name-style="western"><surname>Fox</surname><given-names>B</given-names></name><name name-style="western"><surname>Gill</surname><given-names>AS</given-names></name><name name-style="western"><surname>Hatip</surname><given-names>AH</given-names></name><name name-style="western"><surname>Heikkilä</surname><given-names>J</given-names></name><name name-style="western"><surname>Jolly</surname><given-names>G</given-names></name><name name-style="western"><surname>Katzir</surname><given-names>Z</given-names></name><name name-style="western"><surname>Kitano</surname><given-names>H</given-names></name><name name-style="western"><surname>Krüger</surname><given-names>A</given-names></name><name name-style="western"><surname>Johnson</surname><given-names>C</given-names></name><name name-style="western"><surname>Khan</surname><given-names>SM</given-names></name><name name-style="western"><surname>Lee</surname><given-names>KM</given-names></name><name name-style="western"><surname>Ligot</surname><given-names>DV</given-names></name><name name-style="western"><surname>Molchanovskyi</surname><given-names>O</given-names></name><name name-style="western"><surname>Monti</surname><given-names>A</given-names></name><name name-style="western"><surname>Mwamanzi</surname><given-names>N</given-names></name><name name-style="western"><surname>Nemer</surname><given-names>M</given-names></name><name name-style="western"><surname>Oliver</surname><given-names>N</given-names></name><name name-style="western"><surname>Portillo</surname><given-names>JRL</given-names></name><name name-style="western"><surname>Ravindran</surname><given-names>B</given-names></name><name name-style="western"><surname>Rivera</surname><given-names>RP</given-names></name><name name-style="western"><surname>Riza</surname><given-names>H</given-names></name><name name-style="western"><surname>Rugege</surname><given-names>C</given-names></name><name name-style="western"><surname>Seoighe</surname><given-names>C</given-names></name><name name-style="western"><surname>Sheehan</surname><given-names>J</given-names></name><name name-style="western"><surname>Sheikh</surname><given-names>H</given-names></name><name name-style="western"><surname>Wong</surname><given-names>D</given-names></name><name name-style="western"><surname>Zeng</surname><given-names>Y</given-names></name></person-group> (<year>2025</year>) International AI safety report. arXiv. <ext-link xlink:href="10.48550/arXiv.2501.17805" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arXiv.2501.17805</ext-link></mixed-citation>
      </ref>
      <ref id="B6">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Bullwinkel</surname><given-names>B</given-names></name><name name-style="western"><surname>Minnich</surname><given-names>A</given-names></name><name name-style="western"><surname>Chawla</surname><given-names>S</given-names></name><name name-style="western"><surname>Lopez</surname><given-names>G</given-names></name><name name-style="western"><surname>Pouliot</surname><given-names>M</given-names></name><name name-style="western"><surname>Maxwell</surname><given-names>W</given-names></name><name name-style="western"><surname>de Gruyter</surname><given-names>J</given-names></name><name name-style="western"><surname>Pratt</surname><given-names>K</given-names></name><name name-style="western"><surname>Qi</surname><given-names>S</given-names></name><name name-style="western"><surname>Chikanov</surname><given-names>N</given-names></name><name name-style="western"><surname>Lutz</surname><given-names>R</given-names></name><name name-style="western"><surname>Dheekonda</surname><given-names>RSR</given-names></name><name name-style="western"><surname>Jagdagdorj</surname><given-names>B-E</given-names></name><name name-style="western"><surname>Kim</surname><given-names>E</given-names></name><name name-style="western"><surname>Song</surname><given-names>J</given-names></name><name name-style="western"><surname>Hines</surname><given-names>K</given-names></name><name name-style="western"><surname>Jones</surname><given-names>D</given-names></name><name name-style="western"><surname>Severi</surname><given-names>G</given-names></name><name name-style="western"><surname>Lundeen</surname><given-names>R</given-names></name><name name-style="western"><surname>Vaughan</surname><given-names>S</given-names></name><name name-style="western"><surname>Westerhoff</surname><given-names>V</given-names></name><name name-style="western"><surname>Bryan</surname><given-names>P</given-names></name><name name-style="western"><surname>Kumar</surname><given-names>RSS</given-names></name><name name-style="western"><surname>Zunger</surname><given-names>Y</given-names></name><name name-style="western"><surname>Kawaguchi</surname><given-names>C</given-names></name><name name-style="western"><surname>Russinovich</surname><given-names>M</given-names></name></person-group> (<year>2025</year>) Lessons from red teaming 100 generative AI products. arXiv. <ext-link xlink:href="10.48550/arXiv.2501.07238" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arXiv.2501.07238</ext-link></mixed-citation>
      </ref>
      <ref id="B7">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Chamberlain</surname><given-names>D</given-names></name><name name-style="western"><surname>Casey</surname><given-names>E</given-names></name></person-group> (<year>2024</year>) <article-title>Capture the flag with ChatGPT: security testing with AI chatbots.</article-title><source>International Conference on Cyber Warfare and Security</source><volume>19</volume>: <fpage>43</fpage>–<lpage>54</lpage>. <ext-link xlink:href="10.34190/iccws.19.1.2171" ext-link-type="doi" xlink:type="simple">https://doi.org/10.34190/iccws.19.1.2171</ext-link></mixed-citation>
      </ref>
      <ref id="B8">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Chen</surname><given-names>J</given-names></name><name name-style="western"><surname>Hu</surname><given-names>S</given-names></name><name name-style="western"><surname>Zheng</surname><given-names>H</given-names></name><name name-style="western"><surname>Xing</surname><given-names>C</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>G</given-names></name></person-group> (<year>2022</year>) GAIL-PT: a generic intelligent penetration testing framework with generative adversarial imitation learning. arXiv. <ext-link xlink:href="10.1016/j.cose.2022.103055" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.cose.2022.103055</ext-link></mixed-citation>
      </ref>
      <ref id="B9">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Cohen</surname><given-names>S</given-names></name><name name-style="western"><surname>Bitton</surname><given-names>R</given-names></name><name name-style="western"><surname>Nassi</surname><given-names>B</given-names></name></person-group> (<year>2024</year>) Here comes the AI worm: unleashing zero-click worms that target GenAI-powered applications. arXiv. <ext-link xlink:href="10.48550/arxiv.2403.02817" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2403.02817</ext-link></mixed-citation>
      </ref>
      <ref id="B10">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Corchado</surname><given-names>JM</given-names></name><name name-style="western"><surname>Garcia</surname><given-names>SR</given-names></name><name name-style="western"><surname>Núñez</surname><given-names>VJM</given-names></name><name name-style="western"><surname>López</surname><given-names>FS</given-names></name><name name-style="western"><surname>Chamoso</surname><given-names>P</given-names></name></person-group> (<year>2023</year>) Generative artificial intelligence: fundamentals. Advances in Distributed Computing and Artificial Intelligence Journal 12(1): e31704. <ext-link xlink:href="10.14201/adcaij.31704" ext-link-type="doi" xlink:type="simple">https://doi.org/10.14201/adcaij.31704</ext-link></mixed-citation>
      </ref>
      <ref id="B11">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>De Gracia</surname><given-names>JC</given-names></name><name name-style="western"><surname>Sánchez-Macián</surname><given-names>A</given-names></name></person-group> (<year>2024</year>) PTHelper: an open source tool to support the penetration testing process. arXiv. <ext-link xlink:href="10.48550/arxiv.2406.08242" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2406.08242</ext-link></mixed-citation>
      </ref>
      <ref id="B12">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Deng</surname><given-names>G</given-names></name><name name-style="western"><surname>Liu</surname><given-names>Y</given-names></name><name name-style="western"><surname>Mayoral-Vilches</surname><given-names>V</given-names></name><name name-style="western"><surname>Liu</surname><given-names>P</given-names></name><name name-style="western"><surname>Li</surname><given-names>Y</given-names></name><name name-style="western"><surname>Xu</surname><given-names>Y</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>T</given-names></name><name name-style="western"><surname>Liu</surname><given-names>Y</given-names></name><name name-style="western"><surname>Pinzger</surname><given-names>M</given-names></name><name name-style="western"><surname>Rass</surname><given-names>S</given-names></name></person-group> (<year>2023a</year>) PentestGPT: an LLM-empowered automatic penetration testing tool. arXiv. <ext-link xlink:href="10.48550/arxiv.2308.06782" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2308.06782</ext-link></mixed-citation>
      </ref>
      <ref id="B13">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Deng</surname><given-names>G</given-names></name><name name-style="western"><surname>Liu</surname><given-names>Y</given-names></name><name name-style="western"><surname>Li</surname><given-names>Y</given-names></name><name name-style="western"><surname>Wang</surname><given-names>K</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>Y</given-names></name><name name-style="western"><surname>Li</surname><given-names>Z</given-names></name><name name-style="western"><surname>Wang</surname><given-names>H</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>T</given-names></name><name name-style="western"><surname>Liu</surname><given-names>Y</given-names></name></person-group> (<year>2023b</year>) Jailbreaker: automated jailbreak across multiple large language model chatbots. arXiv. <ext-link xlink:href="10.14722/ndss.2024.24188" ext-link-type="doi" xlink:type="simple">https://doi.org/10.14722/ndss.2024.24188</ext-link></mixed-citation>
      </ref>
      <ref id="B14">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Ding</surname><given-names>W</given-names></name><name name-style="western"><surname>Abdel-Basset</surname><given-names>M</given-names></name><name name-style="western"><surname>Ali</surname><given-names>AM</given-names></name><name name-style="western"><surname>Moustafa</surname><given-names>N</given-names></name></person-group> (<year>2025</year>) Large language models for cyber resilience: a comprehensive review, challenges, and future perspectives. Applied Soft Computing 170: 112663. <ext-link xlink:href="10.1016/j.asoc.2024.112663" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.asoc.2024.112663</ext-link></mixed-citation>
      </ref>
      <ref id="B15">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Gupta</surname><given-names>M</given-names></name><name name-style="western"><surname>Akiri</surname><given-names>C</given-names></name><name name-style="western"><surname>Aryal</surname><given-names>K</given-names></name><name name-style="western"><surname>Parker</surname><given-names>E</given-names></name><name name-style="western"><surname>Praharaj</surname><given-names>L</given-names></name></person-group> (<year>2023</year>) <article-title>From ChatGPT to ThreatGPT: impact of generative AI in cybersecurity and privacy.</article-title><source>IEEE Access</source><volume>11</volume>: <fpage>80218</fpage>–<lpage>80245</lpage>. <ext-link xlink:href="10.1109/ACCESS.2023.3300381" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/ACCESS.2023.3300381</ext-link></mixed-citation>
      </ref>
      <ref id="B16">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Haider</surname><given-names>E</given-names></name><name name-style="western"><surname>Perez-Becker</surname><given-names>D</given-names></name><name name-style="western"><surname>Portet</surname><given-names>T</given-names></name><name name-style="western"><surname>Madan</surname><given-names>P</given-names></name><name name-style="western"><surname>Garg</surname><given-names>A</given-names></name><name name-style="western"><surname>Ashfaq</surname><given-names>A</given-names></name><name name-style="western"><surname>Majercak</surname><given-names>D</given-names></name><name name-style="western"><surname>Wen</surname><given-names>W</given-names></name><name name-style="western"><surname>Kim</surname><given-names>D</given-names></name><name name-style="western"><surname>Yang</surname><given-names>Z</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>J</given-names></name><name name-style="western"><surname>Sharma</surname><given-names>H</given-names></name><name name-style="western"><surname>Bullwinkel</surname><given-names>B</given-names></name><name name-style="western"><surname>Pouliot</surname><given-names>M</given-names></name><name name-style="western"><surname>Minnich</surname><given-names>A</given-names></name><name name-style="western"><surname>Chawla</surname><given-names>S</given-names></name><name name-style="western"><surname>Herrera</surname><given-names>S</given-names></name><name name-style="western"><surname>Warreth</surname><given-names>S</given-names></name><name name-style="western"><surname>Engler</surname><given-names>M</given-names></name><name name-style="western"><surname>Lopez</surname><given-names>G</given-names></name><name name-style="western"><surname>Chikanov</surname><given-names>N</given-names></name><name name-style="western"><surname>Dheekonda</surname><given-names>RSR</given-names></name><name name-style="western"><surname>Jagdagdorj</surname><given-names>B-E</given-names></name><name name-style="western"><surname>Lutz</surname><given-names>R</given-names></name><name name-style="western"><surname>Lundeen</surname><given-names>R</given-names></name><name name-style="western"><surname>Westerhoff</surname><given-names>T</given-names></name><name name-style="western"><surname>Bryan</surname><given-names>P</given-names></name><name name-style="western"><surname>Seifert</surname><given-names>C</given-names></name><name name-style="western"><surname>Kumar</surname><given-names>RSS</given-names></name><name name-style="western"><surname>Berkley</surname><given-names>A</given-names></name><name name-style="western"><surname>Kessler</surname><given-names>A</given-names></name></person-group> (<year>2024</year>) Phi-3 safety post-training: aligning language models with a “break-fix” cycle. arXiv. <ext-link xlink:href="10.48550/arxiv.2407.13833" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2407.13833</ext-link></mixed-citation>
      </ref>
      <ref id="B17">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Hassanin</surname><given-names>M</given-names></name><name name-style="western"><surname>Moustafa</surname><given-names>N</given-names></name></person-group> (<year>2024</year>) A comprehensive overview of large language models (LLMs) for cyber defences: opportunities and directions. arXiv. <ext-link xlink:href="10.48550/arxiv.2405.14487" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2405.14487</ext-link></mixed-citation>
      </ref>
      <ref id="B18">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Hilario</surname><given-names>E</given-names></name><name name-style="western"><surname>Azam</surname><given-names>S</given-names></name><name name-style="western"><surname>Sundaram</surname><given-names>J</given-names></name><name name-style="western"><surname>Mohammed</surname><given-names>KI</given-names></name><name name-style="western"><surname>Shanmugam</surname><given-names>B</given-names></name></person-group> (<year>2024</year>) Generative AI for pentesting: the good, the bad, the ugly. International Journal of Information Security 1–23. <ext-link xlink:href="10.1007/s10207-024-00835-x" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1007/s10207-024-00835-x</ext-link></mixed-citation>
      </ref>
      <ref id="B19">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Huang</surname><given-names>J</given-names></name><name name-style="western"><surname>Zhu</surname><given-names>Q</given-names></name></person-group> (<year>2024</year>) PenHeal: a two-stage LLM framework for automated pentesting and optimal remediation. arXiv. <ext-link xlink:href="10.2139/ssrn.4941478" ext-link-type="doi" xlink:type="simple">https://doi.org/10.2139/ssrn.4941478</ext-link></mixed-citation>
      </ref>
      <ref id="B20">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Iturbe</surname><given-names>E</given-names></name><name name-style="western"><surname>Llorente-Vazquez</surname><given-names>O</given-names></name><name name-style="western"><surname>Rego</surname><given-names>A</given-names></name><name name-style="western"><surname>Rios</surname><given-names>E</given-names></name><name name-style="western"><surname>Toledo</surname><given-names>N</given-names></name></person-group> (<year>2024</year>) Unleashing offensive artificial intelligence: automated attack technique code generation. Computers and Security 147: 104077. <ext-link xlink:href="10.1016/j.cose.2024.104077" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.cose.2024.104077</ext-link></mixed-citation>
      </ref>
      <ref id="B21">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Jana</surname><given-names>S</given-names></name><name name-style="western"><surname>Biswas</surname><given-names>R</given-names></name><name name-style="western"><surname>Banerjee</surname><given-names>C</given-names></name><name name-style="western"><surname>Patra</surname><given-names>T</given-names></name><name name-style="western"><surname>Pal</surname><given-names>M</given-names></name><name name-style="western"><surname>Pal</surname><given-names>K</given-names></name></person-group> (<year>2024</year>) Leveraging artificial intelligence for enhancing cybersecurity: a comprehensive review and analysis. International Journal of Advanced Research in Science, Communication and Technology (IJARSCT): 173–183. <ext-link xlink:href="10.48175/IJARSCT-19030" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48175/IJARSCT-19030</ext-link></mixed-citation>
      </ref>
      <ref id="B22">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Karamthulla</surname><given-names>MJ</given-names></name><name name-style="western"><surname>Tadimarri</surname><given-names>A</given-names></name><name name-style="western"><surname>Tillu</surname><given-names>R</given-names></name><name name-style="western"><surname>Muthusubramanian</surname><given-names>M</given-names></name></person-group> (<year>2024</year>) Navigating the future: AI-driven project management in the digital era. International Journal For Multidisciplinary Research 6(2). <ext-link xlink:href="10.36948/ijfmr.2024.v06i02.15295" ext-link-type="doi" xlink:type="simple">https://doi.org/10.36948/ijfmr.2024.v06i02.15295</ext-link></mixed-citation>
      </ref>
      <ref id="B23">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Khoury</surname><given-names>R</given-names></name><name name-style="western"><surname>Avila</surname><given-names>AR</given-names></name><name name-style="western"><surname>Brunelle</surname><given-names>J</given-names></name><name name-style="western"><surname>Camara</surname><given-names>BM</given-names></name></person-group> (<year>2023</year>) How secure is code generated by ChatGPT? arXiv. <ext-link xlink:href="10.1109/SMC53992.2023.10394237" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/SMC53992.2023.10394237</ext-link></mixed-citation>
      </ref>
      <ref id="B24">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Lanka</surname><given-names>P</given-names></name><name name-style="western"><surname>Gupta</surname><given-names>K</given-names></name><name name-style="western"><surname>Varol</surname><given-names>C</given-names></name></person-group> (<year>2024</year>) Intelligent threat detection – AI-driven analysis of honeypot data to counter cyber threats. Electronics 13: 2465. <ext-link xlink:href="10.3390/electronics13132465" ext-link-type="doi" xlink:type="simple">https://doi.org/10.3390/electronics13132465</ext-link></mixed-citation>
      </ref>
      <ref id="B25">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Liguori</surname><given-names>P</given-names></name><name name-style="western"><surname>Al-Hossami</surname><given-names>E</given-names></name><name name-style="western"><surname>Orbinato</surname><given-names>V</given-names></name><name name-style="western"><surname>Natella</surname><given-names>R</given-names></name><name name-style="western"><surname>Shaikh</surname><given-names>S</given-names></name><name name-style="western"><surname>Cotroneo</surname><given-names>D</given-names></name><name name-style="western"><surname>Cukic</surname><given-names>B</given-names></name></person-group> (<year>2021</year>) EVIL: exploiting software via natural language. arXiv. <ext-link xlink:href="10.1109/ISSRE52982.2021.00042" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/ISSRE52982.2021.00042</ext-link></mixed-citation>
      </ref>
      <ref id="B26">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Maryam</surname><given-names>R</given-names></name><name name-style="western"><surname>Mahir</surname><given-names>RK</given-names></name><name name-style="western"><surname>Natalie</surname><given-names>NS</given-names></name></person-group> (<year>2024</year>) <article-title>Navigating AI cybersecurity: evolving landscape and challenges.</article-title><source>Journal of Intelligent Learning Systems and Applications</source><volume>16</volume>: <fpage>155</fpage>–<lpage>174</lpage>. <ext-link xlink:href="10.4236/jilsa.2024.163010" ext-link-type="doi" xlink:type="simple">https://doi.org/10.4236/jilsa.2024.163010</ext-link></mixed-citation>
      </ref>
      <ref id="B27">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>McIntosh</surname><given-names>TR</given-names></name><name name-style="western"><surname>Susnjak</surname><given-names>T</given-names></name><name name-style="western"><surname>Liu</surname><given-names>T</given-names></name><name name-style="western"><surname>Watters</surname><given-names>P</given-names></name><name name-style="western"><surname>Nowrozy</surname><given-names>R</given-names></name><name name-style="western"><surname>Halgamuge</surname><given-names>MN</given-names></name></person-group> (<year>2024</year>) From COBIT to ISO 42001: evaluating cybersecurity frameworks for opportunities, risks, and regulatory compliance in commercializing large language models. Computers and Security 144: 103964. <ext-link xlink:href="10.1016/j.cose.2024.103964" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.cose.2024.103964</ext-link></mixed-citation>
      </ref>
      <ref id="B28">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Metta</surname><given-names>S</given-names></name><name name-style="western"><surname>Chang</surname><given-names>I</given-names></name><name name-style="western"><surname>Parker</surname><given-names>J</given-names></name><name name-style="western"><surname>Roman</surname><given-names>MP</given-names></name><name name-style="western"><surname>Ehuan</surname><given-names>AF</given-names></name></person-group> (<year>2024</year>) Generative AI in cybersecurity. arXiv. <ext-link xlink:href="10.48550/arxiv.2405.01674" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2405.01674</ext-link></mixed-citation>
      </ref>
      <ref id="B29">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Mohammed</surname><given-names>B</given-names></name></person-group> (<year>2024</year>) The impact of Artificial Intelligence on cyberspace security and market dynamics. Brazilian Journal of Technology 7(4): e74677. <ext-link xlink:href="10.38152/bjtv7n4-019" ext-link-type="doi" xlink:type="simple">https://doi.org/10.38152/bjtv7n4-019</ext-link></mixed-citation>
      </ref>
      <ref id="B30">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Oh</surname><given-names>S</given-names></name><name name-style="western"><surname>Lee</surname><given-names>K</given-names></name><name name-style="western"><surname>Park</surname><given-names>S</given-names></name><name name-style="western"><surname>Kim</surname><given-names>D</given-names></name><name name-style="western"><surname>Kim</surname><given-names>H</given-names></name></person-group> (<year>2023</year>) Poisoned ChatGPT finds work for idle hands: exploring developers’ coding practices with insecure suggestions from poisoned AI models. arXiv. <ext-link xlink:href="10.1109/SP54263.2024.00046" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/SP54263.2024.00046</ext-link></mixed-citation>
      </ref>
      <ref id="B31">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Palani</surname><given-names>K</given-names></name><name name-style="western"><surname>Kethar</surname><given-names>J</given-names></name><name name-style="western"><surname>Prasad</surname><given-names>S</given-names></name><name name-style="western"><surname>Torremocha</surname><given-names>V</given-names></name></person-group> (<year>2024</year>) Impact of AI and Generative AI in transforming Cybersecurity. Journal of Student Research 13(2). <ext-link xlink:href="10.47611/jsrhs.v13i2.6710" ext-link-type="doi" xlink:type="simple">https://doi.org/10.47611/jsrhs.v13i2.6710</ext-link></mixed-citation>
      </ref>
      <ref id="B32">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Pan</surname><given-names>X</given-names></name><name name-style="western"><surname>Dai</surname><given-names>J</given-names></name><name name-style="western"><surname>Fan</surname><given-names>Y</given-names></name><name name-style="western"><surname>Yang</surname><given-names>M</given-names></name></person-group> (<year>2024</year>) Frontier AI systems have surpassed the self-replicating red line. arXiv. <ext-link xlink:href="10.48550/arxiv.2412.12140" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2412.12140</ext-link></mixed-citation>
      </ref>
      <ref id="B33">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Patil</surname><given-names>M</given-names></name><name name-style="western"><surname>Thakare</surname><given-names>D</given-names></name><name name-style="western"><surname>Bhure</surname><given-names>A</given-names></name><name name-style="western"><surname>Kaundanyapure</surname><given-names>S</given-names></name><name name-style="western"><surname>Mune</surname><given-names>DA</given-names></name></person-group> (<year>2024</year>) <article-title>An AI-based approach for automating penetration testing.</article-title><source>International Journal For Research in Applied Science and Engineering Technology</source><volume>12</volume>: <fpage>5019</fpage>–<lpage>5028</lpage>. <ext-link xlink:href="10.22214/ijraset.2024.61113" ext-link-type="doi" xlink:type="simple">https://doi.org/10.22214/ijraset.2024.61113</ext-link></mixed-citation>
      </ref>
      <ref id="B34">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Patsakis</surname><given-names>C</given-names></name><name name-style="western"><surname>Casino</surname><given-names>F</given-names></name><name name-style="western"><surname>Lykousas</surname><given-names>N</given-names></name></person-group> (<year>2024</year>) Assessing LLMs in malicious code deobfuscation of real-world malware campaigns. Expert Systems with Applications 256: 124912. <ext-link xlink:href="10.1016/j.eswa.2024.124912" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.eswa.2024.124912</ext-link></mixed-citation>
      </ref>
      <ref id="B35">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Pavlova</surname><given-names>M</given-names></name><name name-style="western"><surname>Brinkman</surname><given-names>E</given-names></name><name name-style="western"><surname>Iyer</surname><given-names>K</given-names></name><name name-style="western"><surname>Albiero</surname><given-names>V</given-names></name><name name-style="western"><surname>Bitton</surname><given-names>J</given-names></name><name name-style="western"><surname>Nguyen</surname><given-names>H</given-names></name><name name-style="western"><surname>Li</surname><given-names>J</given-names></name><name name-style="western"><surname>Ferrer</surname><given-names>CC</given-names></name><name name-style="western"><surname>Evtimov</surname><given-names>I</given-names></name><name name-style="western"><surname>Grattafiori</surname><given-names>A</given-names></name></person-group> (<year>2024</year>) Automated red teaming with GOAT: the generative offensive agent tester. arXiv. <ext-link xlink:href="10.48550/arxiv.2410.01606" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2410.01606</ext-link></mixed-citation>
      </ref>
      <ref id="B36">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Raman</surname><given-names>R</given-names></name><name name-style="western"><surname>Calyam</surname><given-names>P</given-names></name><name name-style="western"><surname>Achuthan</surname><given-names>K</given-names></name></person-group> (<year>2024</year>) ChatGPT or Bard: who is a better certified ethical hacker? Computers &amp; Security 140: 103804. <ext-link xlink:href="10.1016/j.cose.2024.103804" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.cose.2024.103804</ext-link></mixed-citation>
      </ref>
      <ref id="B37">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Reddem</surname><given-names>P</given-names></name></person-group> (<year>2024</year>) The rise of AI-powered cybercrime: A data-driven analysis of emerging threats. IJFMR 2582–2160. <ext-link xlink:href="10.36948/ijfmr.2024.v06i06.30744" ext-link-type="doi" xlink:type="simple">https://doi.org/10.36948/ijfmr.2024.v06i06.30744</ext-link></mixed-citation>
      </ref>
      <ref id="B38">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Sai</surname><given-names>S</given-names></name><name name-style="western"><surname>Yashvardhan</surname><given-names>U</given-names></name><name name-style="western"><surname>Chamola</surname><given-names>V</given-names></name><name name-style="western"><surname>Sikdar</surname><given-names>B</given-names></name></person-group> (<year>2024</year>) Generative AI for cyber security: analyzing the potential of ChatGPT, DALL-E and other models for enhancing the security space. IEEE Access PP (99): 1–1. <ext-link xlink:href="10.1109/ACCESS.2024.3385107" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/ACCESS.2024.3385107</ext-link></mixed-citation>
      </ref>
      <ref id="B39">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Sergeyuk</surname><given-names>A</given-names></name><name name-style="western"><surname>Golubev</surname><given-names>Y</given-names></name><name name-style="western"><surname>Bryksin</surname><given-names>T</given-names></name><name name-style="western"><surname>Ahmed</surname><given-names>I</given-names></name></person-group> (<year>2024</year>) Using AI-based coding assistants in practice: state of affairs, perceptions, and ways forward. arXiv. <ext-link xlink:href="10.2139/ssrn.4900362" ext-link-type="doi" xlink:type="simple">https://doi.org/10.2139/ssrn.4900362</ext-link></mixed-citation>
      </ref>
      <ref id="B40">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Shevlane</surname><given-names>T</given-names></name><name name-style="western"><surname>Farquhar</surname><given-names>S</given-names></name><name name-style="western"><surname>Garfinkel</surname><given-names>B</given-names></name><name name-style="western"><surname>Phuong</surname><given-names>M</given-names></name><name name-style="western"><surname>Whittlestone</surname><given-names>J</given-names></name><name name-style="western"><surname>Leung</surname><given-names>J</given-names></name><name name-style="western"><surname>Kokotajlo</surname><given-names>D</given-names></name><name name-style="western"><surname>Marchal</surname><given-names>N</given-names></name><name name-style="western"><surname>Anderljung</surname><given-names>M</given-names></name><name name-style="western"><surname>Kolt</surname><given-names>N</given-names></name><name name-style="western"><surname>Ho</surname><given-names>L</given-names></name><name name-style="western"><surname>Siddarth</surname><given-names>D</given-names></name><name name-style="western"><surname>Avin</surname><given-names>S</given-names></name><name name-style="western"><surname>Hawkins</surname><given-names>W</given-names></name><name name-style="western"><surname>Kim</surname><given-names>B</given-names></name><name name-style="western"><surname>Gabriel</surname><given-names>I</given-names></name><name name-style="western"><surname>Bolina</surname><given-names>V</given-names></name><name name-style="western"><surname>Clark</surname><given-names>J</given-names></name><name name-style="western"><surname>Bengio</surname><given-names>Y</given-names></name><name name-style="western"><surname>Christiano</surname><given-names>P</given-names></name><name name-style="western"><surname>Dafoe</surname><given-names>A</given-names></name></person-group> (<year>2023</year>) Model evaluation for extreme risks. arXiv. <ext-link xlink:href="10.48550/arxiv.2305.15324" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2305.15324</ext-link></mixed-citation>
      </ref>
      <ref id="B41">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Sufi</surname><given-names>F</given-names></name></person-group> (<year>2024</year>) An innovative GPT-based open-source intelligence using historical cyber incident reports. Natural Language Processing Journal 7: 100074. <ext-link xlink:href="10.1016/j.nlp.2024.100074" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.nlp.2024.100074</ext-link></mixed-citation>
      </ref>
      <ref id="B42">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Taghavi</surname><given-names>SM</given-names></name><name name-style="western"><surname>Feyzi</surname><given-names>F</given-names></name></person-group> (<year>2024</year>) Using large language models to better detect and handle software vulnerabilities and cyber security threats. <ext-link xlink:href="10.21203/rs.3.rs-4387414/v1" ext-link-type="doi" xlink:type="simple">https://doi.org/10.21203/rs.3.rs-4387414/v1</ext-link></mixed-citation>
      </ref>
      <ref id="B43">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Temara</surname><given-names>S</given-names></name></person-group> (<year>2023</year>) Maximizing penetration testing success with effective reconnaissance techniques using ChatGPT. arXiv. <ext-link xlink:href="10.22541/au.167947026.68710739/v1" ext-link-type="doi" xlink:type="simple">https://doi.org/10.22541/au.167947026.68710739/v1</ext-link></mixed-citation>
      </ref>
      <ref id="B44">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Valea</surname><given-names>O</given-names></name><name name-style="western"><surname>Oprișa</surname><given-names>C</given-names></name></person-group> (<year>2020</year>) Towards pentesting automation using the Metasploit framework. 2020 IEEE 16<sup>th</sup> International Conference on Intelligent Computer Communication and Processing (ICCP): 171–178. <ext-link xlink:href="10.1109/ICCP51029.2020.9266234" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1109/ICCP51029.2020.9266234</ext-link></mixed-citation>
      </ref>
      <ref id="B45">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Wang</surname><given-names>L</given-names></name><name name-style="western"><surname>Ma</surname><given-names>C</given-names></name><name name-style="western"><surname>Feng</surname><given-names>X</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>Z</given-names></name><name name-style="western"><surname>Yang</surname><given-names>H</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>J</given-names></name><name name-style="western"><surname>Chen</surname><given-names>Z</given-names></name><name name-style="western"><surname>Tang</surname><given-names>J</given-names></name><name name-style="western"><surname>Chen</surname><given-names>X</given-names></name><name name-style="western"><surname>Lin</surname><given-names>Y</given-names></name><name name-style="western"><surname>Zhao</surname><given-names>WX</given-names></name><name name-style="western"><surname>Wei</surname><given-names>Z</given-names></name><name name-style="western"><surname>Wen</surname><given-names>J</given-names></name></person-group> (<year>2024</year>) A survey on large language model based autonomous agents. Frontiers of Computer Science 18: 186345. <ext-link xlink:href="10.1007/s11704-024-40231-1" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1007/s11704-024-40231-1</ext-link></mixed-citation>
      </ref>
      <ref id="B46">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Xu</surname><given-names>J</given-names></name><name name-style="western"><surname>Stokes</surname><given-names>JW</given-names></name><name name-style="western"><surname>McDonald</surname><given-names>G</given-names></name><name name-style="western"><surname>Bai</surname><given-names>X</given-names></name><name name-style="western"><surname>Marshall</surname><given-names>D</given-names></name><name name-style="western"><surname>Wang</surname><given-names>S</given-names></name><name name-style="western"><surname>Swaminathan</surname><given-names>A</given-names></name><name name-style="western"><surname>Li</surname><given-names>Z</given-names></name></person-group> (<year>2024</year>) AutoAttacker: a large language model guided system to implement automatic cyber-attacks. arXiv. <ext-link xlink:href="10.48550/arxiv.2403.01038" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2403.01038</ext-link></mixed-citation>
      </ref>
      <ref id="B47">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Yang</surname><given-names>S</given-names></name><name name-style="western"><surname>Yang</surname><given-names>S</given-names></name><name name-style="western"><surname>Liu</surname><given-names>S</given-names></name><name name-style="western"><surname>Nguyen</surname><given-names>D</given-names></name><name name-style="western"><surname>Jang</surname><given-names>S</given-names></name><name name-style="western"><surname>Abuadbba</surname><given-names>A</given-names></name></person-group> (<year>2024</year>) ThreatModeling-LLM: automating threat modeling using large language models for banking system. arXiv. <ext-link xlink:href="10.48550/arxiv.2411.17058" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2411.17058</ext-link></mixed-citation>
      </ref>
      <ref id="B48">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Yao</surname><given-names>Y</given-names></name><name name-style="western"><surname>Duan</surname><given-names>J</given-names></name><name name-style="western"><surname>Xu</surname><given-names>K</given-names></name><name name-style="western"><surname>Cai</surname><given-names>Y</given-names></name><name name-style="western"><surname>Sun</surname><given-names>Z</given-names></name><name name-style="western"><surname>Zhang</surname><given-names>Y</given-names></name></person-group> (<year>2024</year>) A survey on large language model (LLM) security and privacy: the good, the bad, and the ugly. High-Confidence Computing 4: 100211. <ext-link xlink:href="10.1016/j.hcc.2024.100211" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1016/j.hcc.2024.100211</ext-link></mixed-citation>
      </ref>
      <ref id="B49">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Yigit</surname><given-names>Y</given-names></name><name name-style="western"><surname>Buchanan</surname><given-names>WJ</given-names></name><name name-style="western"><surname>Tehrani</surname><given-names>MG</given-names></name><name name-style="western"><surname>Maglaras</surname><given-names>L</given-names></name></person-group> (<year>2024</year>) Review of generative AI methods in cybersecurity. arXiv. <ext-link xlink:href="10.48550/arxiv.2403.08701" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2403.08701</ext-link></mixed-citation>
      </ref>
      <ref id="B50">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Zaydi</surname><given-names>M</given-names></name><name name-style="western"><surname>Maleh</surname><given-names>Y</given-names></name></person-group> (<year>2024</year>) Empowering red teams with generative AI: transforming penetration testing through adaptive intelligence. EDPACS ahead-of-print: 1–26. <ext-link xlink:href="10.1080/07366981.2024.2439628" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1080/07366981.2024.2439628</ext-link></mixed-citation>
      </ref>
      <ref id="B51">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Zhang</surname><given-names>J</given-names></name><name name-style="western"><surname>Bu</surname><given-names>H</given-names></name><name name-style="western"><surname>Wen</surname><given-names>H</given-names></name><name name-style="western"><surname>Liu</surname><given-names>Y</given-names></name></person-group> (<year>2025</year>) When LLMs meet cybersecurity: a systematic literature review. Cybersecurity 8: 55. <ext-link xlink:href="10.1186/s42400-025-00361-w" ext-link-type="doi" xlink:type="simple">https://doi.org/10.1186/s42400-025-00361-w</ext-link></mixed-citation>
      </ref>
      <ref id="B52">
        <mixed-citation xlink:type="simple"><person-group><name name-style="western"><surname>Zhou</surname><given-names>Y</given-names></name><name name-style="western"><surname>Cheng</surname><given-names>G</given-names></name><name name-style="western"><surname>Du</surname><given-names>K</given-names></name><name name-style="western"><surname>Chen</surname><given-names>Z</given-names></name></person-group> (<year>2024</year>) Toward intelligent and secure cloud: large language model empowered proactive defense. arXiv. <ext-link xlink:href="10.48550/arxiv.2412.21051" ext-link-type="doi" xlink:type="simple">https://doi.org/10.48550/arxiv.2412.21051</ext-link></mixed-citation>
      </ref>
    </ref-list>
  </back>
</article>
