Splitfed learning methods for natural language processing

Amna Faisal, N. Z. Jhanjhi, Sayan Kumar Ray, H. L. Gururaj, Farzeen Ashfaq, Shampa Rani Das

    Research output: Chapter in Book/Report/Conference proceeding (Chapter)

    Abstract

    The growing importance of data privacy has spurred the development of novel techniques for training natural language processing (NLP) models without compromising user confidentiality. This chapter explores two such techniques: federated learning (FL) and splitfed learning (SFL). FL enables distributed training on private datasets across devices, sharing only model updates with a central server. SFL, which builds on FL, goes a step further by splitting the model itself between local devices and a central server, which exchange only intermediate results. The chapter focuses on SFL for privacy-preserving NLP tasks such as text categorization and question answering (QA). Traditional approaches often require centralized data storage, which raises privacy concerns. FL and SFL address this by enabling distributed model training on user devices without sharing raw data. We discuss the benefits and shortcomings of each approach, highlighting FL’s ability to handle complex models while acknowledging its potential communication overhead and performance limitations. We note that SFL is relatively new and that research on its application to NLP tasks is still at an early stage. Finally, we explore potential areas for future work, including reducing communication overhead, investigating optimal model architectures, and developing robust methods for handling non-IID data. Overcoming these challenges would give FL and SFL techniques a promising future in NLP, enabling powerful model development while safeguarding user privacy.

    Original language: English
    Title of host publication: Split Federated Learning for Secure IoT Applications
    Subtitle of host publication: Concepts, frameworks, applications and case studies
    Publisher: Institution of Engineering and Technology
    Pages: 47-65
    Number of pages: 19
    ISBN (Electronic): 9781839539466
    ISBN (Print): 9781839539459
    Publication status: Published - 01-01-2024

    All Science Journal Classification (ASJC) codes

    • General Computer Science
