Text-image-video summary generation using joint integer linear programming

Anubhav Jangra, Adam Jatowt*, Mohammad Hasanuzzaman, Sriparna Saha

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

20 Citations (Scopus)

Abstract

Automatically generating a summary for asynchronous data can help users to keep up with the rapid growth of multi-modal information on the Internet. However, the current multi-modal systems usually generate summaries composed of text and images. In this paper, we propose a novel research problem of text-image-video summary generation (TIVS). We first develop a multi-modal dataset containing text documents, images and videos. We then propose a novel joint integer linear programming multi-modal summarization (JILP-MMS) framework. We report the performance of our model on the developed dataset.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 42nd European Conference on IR Research, ECIR 2020, Proceedings
EditorsJoemon M. Jose, Emine Yilmaz, João Magalhães, Pablo Castells, Nicola Ferro, Mário J. Silva, Flávio Martins
PublisherSpringer Singapore
Pages190-198
Number of pages9
ISBN (Electronic)9783030454425
ISBN (Print)9783030454418
DOIs
Publication statusPublished - 08 Apr 2020
Externally publishedYes
Event42nd European Conference on IR Research, ECIR 2020 - Lisbon, Portugal
Duration: 14 Apr 202017 Apr 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12036 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference42nd European Conference on IR Research, ECIR 2020
Country/TerritoryPortugal
CityLisbon
Period14/04/202017/04/2020

Bibliographical note

Funding Information:
Acknowledgement. Dr. Sriparna Saha would like to acknowledge the support of Early Career Research Award of Science and Engineering Research Board (SERB) of Department of Science and Technology, India to carry out this research. Mohammed Hasanuzzaman would like to acknowledge ADAPT Centre for Digital Content Technology which is funded under the SFI Research Centres Programme (Grant 13/RC/2106).

Publisher Copyright:
© Springer Nature Switzerland AG 2020.

Keywords

  • Integer Linear Programming
  • Multi-modal summarization

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'Text-image-video summary generation using joint integer linear programming'. Together they form a unique fingerprint.

Cite this