AttackER: towards enhancing cyber-attack attribution with a named entity recognition dataset

  • Pritam Deka
  • , Sampath Rajapaksha
  • , Ruby Rani
  • , Amirah Almutairi
  • , Erisa Karafili*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

Cyber-attack attribution is an important process that allows experts to put in place attacker-oriented countermeasures and legal actions. The analysts mainly perform attribution manually, given the complex nature of this task. AI and, more specifically, Natural Language Processing (NLP) techniques can be leveraged to support cybersecurity analysts during the attribution process. However powerful these techniques may be, they must address the lack of datasets in the attack attribution domain. In this work, we will fill this gap and will provide, to the best of our knowledge, the first dataset on cyber-attack attribution. We designed our dataset with the primary goal of extracting attack attribution information from cybersecurity texts, utilizing named entity recognition (NER) methodologies from the field of NLP. Unlike other cybersecurity NER datasets, ours offers a rich set of annotations with contextual details, including some that span phrases and sentences. We conducted extensive experiments and applied NLP techniques to demonstrate the dataset’s effectiveness for attack attribution. These experiments highlight the potential of Large Language Models (LLMs) capabilities to improve the NER tasks in cybersecurity datasets for cyber-attack attribution.

Original languageEnglish
Title of host publicationWeb Information Systems Engineering - WISE 2024 - 25th International Conference: Proceedings
EditorsMahmoud Barhamgi, Hua Wang, Xin Wang
PublisherSpringer Singapore
Pages255-270
Number of pages16
ISBN (Electronic)9789819605767
ISBN (Print)9789819605750
DOIs
Publication statusPublished - 26 Nov 2024
Externally publishedYes
Event25th International Conference on Web Information Systems Engineering, WISE 2024 - Doha, Qatar
Duration: 02 Dec 202405 Dec 2024

Publication series

NameLecture Notes in Computer Science (LNCS)
Volume15440
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference25th International Conference on Web Information Systems Engineering, WISE 2024
Country/TerritoryQatar
CityDoha
Period02/12/202405/12/2024

Keywords

  • attribution
  • dataset
  • LLMs
  • Named entity recognition
  • NLP

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'AttackER: towards enhancing cyber-attack attribution with a named entity recognition dataset'. Together they form a unique fingerprint.

Cite this