Article | Text Data Augmentation Using Generative Adversarial Networks – A Systematic Review

Article

Text Data Augmentation Using Generative Adversarial Networks – A Systematic Review

Details

Citation

Silva K, Can B, Sarwar R, Blain F & Mitkov R (2023) Text Data Augmentation Using Generative Adversarial Networks – A Systematic Review. Journal of Computational and Applied Linguistics (JCAL), 1, pp. 6-38. https://ojs.nbu.bg/index.php/JCAL; https://doi.org/10.33919/JCAL.23.1.1

Abstract
Insufficient data is one of the main drawbacks in natural language processing tasks, and the most prevalent solution is to collect a decent amount of data that will be enough for the optimisation of the model. However, recent research directions are strategically moving towards increasing training examples due to the nature of the data-hungry neural models. Data augmentation is an emerging area that aims to ensure the diversity of data without attempting to collect new data exclusively to boost a model's performance. 7 Limitations in data augmentation, especially for textual data, are mainly due to the nature of language data, which is precisely discrete. Generative Ad-versarial Networks (GANs) were initially introduced for computer vision applications , aiming to generate highly realistic images by learning the image representations. Recent research has focused on using GANs for text generation and augmentation. This systematic review aims to present the theoretical background of GANs and their use for text augmentation alongside a systematic review of recent textual data augmentation applications such as sentiment analysis, low resource language generation, hate speech detection and fraud review analysis. Further, a notion of challenges in current research and future directions of GAN-based text augmentation are discussed in this paper to pave the way for researchers especially working on low-text resources.

Keywords
Text Data Augmentation; Generative Adversarial Networks; Adversarial Training; Text Generation

Journal
Journal of Computational and Applied Linguistics (JCAL): Volume 1

Status	Published
Publication date	01/06/2023
Publication date online	01/06/2023
Date accepted by journal	01/02/2023
URL
Publisher URL
ISSN	2815-4967

People (1)

Dr Burcu Can Buglalilar

Lecturer in Computing Science, Computing Science

Files (1)

1 33 Compressed

我要吃瓜

Text Data Augmentation Using Generative Adversarial Networks – A Systematic Review

Details

People (1)

Files (1)