Models to perform neural summarization (extractive and abstractive) using machine learning transformers and a tool to convert abstractive summarization datasets to the extractive task.
Updated May 26, 2025 - Python
This repository contains the code, data, and models of the paper titled "XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages" published in Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021.
Gazeta: Dataset for automatic summarization of Russian news / Газета: набор данных для автоматического реферирования на русском языке
[TACL, EMNLP 2025 Oral] Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation"
Dataset and Codes for our EMNLP 2022 Main Conference Long Paper titled "ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts"
Klexikon: A German Dataset for Joint Summarization and Simplification
Code and data for the Dreyer et al. (2023) paper on abstractiveness and factuality in abstractive summarization
Thai Crosslingual Summarization Datasets.
[EACL 2021] - Unsupervised Abstractive Summarization of Bengali Text Documents.
This is the official PyTorch codebase for the ACL 2023 paper: "What are the Desired Characteristics of Calibration Sets? Identifying Correlates on Long Form Scientific Summarization".
This repository contains the implementation of a Transformer-based model for abstractive text summarization and a rule-based approach for extractive text summarization.
[Computer Speech & Language, Elsevier] - Neural Sentence Fusion for Diversity Driven Abstractive Multi-Document Summarization.
M3LS: Multi-lingual Multi-modal summarization dataset
A deep learning NLP model for summarizing text.
Evaluating summarization algorithms on the BigPatent dataset
Dataset for abstractive summarization of long multimodal presentations
Fine-tuning T5-Small on BBC's article summarization dataset.
Specific-Aspect Summarization on News According to Social Sentiments on Twitter
This repository contains the evaluation script for all the LLMs evaluated with iCOPERNICUS for testing In-Context Personalization Learning with respect to summarization.