Automated patent classification for crop protection via domain adaptation

Dimitrios Christofidellis*, Marzena Maria Lehmann, Torsten Luksch, Marco Stenta, Matteo Manica

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

52 Downloads (Pure)

Abstract

Patents show how technology evolves in most scientific fields over time. The best way to use this valuable knowledge base is to use efficient and effective information retrieval and searches for related prior art. Patent classification, that is, assigning a patent to one or more predefined categories, is a fundamental step towards synthesizing the information content of an invention. To this end, architectures based on Transformers, especially those derived from the BERT family have already been proposed in the literature and they have shown remarkable results by setting a new state-of-the-art performance for the classification task. Here, we study how domain adaptation can push the performance boundaries in patent classification by rigorously evaluating and implementing a collection of recent transfer learning techniques, for example, domain-adaptive pretraining and adapters. Our analysis shows how leveraging these advancements enables the development of state-of-the-art models with increased precision, recall, and F1-score. We base our evaluation on both standard patent classification datasets derived from patent offices-defined code hierarchies and more practical real-world use-case scenarios containing labels from the agrochemical industrial domain. The application of these domain adapted techniques to patent classification in a multilingual setting is also examined and evaluated.

Original languageEnglish
Article numbere80
Number of pages15
JournalApplied AI Letters
Volume4
Issue number1
DOIs
Publication statusPublished - 27 Feb 2023

Keywords

  • BERT
  • LETTER
  • LETTERS
  • NLP
  • domain‐adaption
  • patent analysis
  • patent classification
  • transformers

Fingerprint

Dive into the research topics of 'Automated patent classification for crop protection via domain adaptation'. Together they form a unique fingerprint.

Cite this