Skip to main navigation Skip to search Skip to main content

Hierarchical Attention Mechanism for Domain-invariant Spoken Language Identification via Inter-domain Alignment and Intra-domain Language Discrimination

  • Urvashi Goswami*
  • , H. Muralikrishna
  • , Sujeet Kumar
  • , A. D. Dileep
  • , Veena Thenkanidiyoor
  • *Corresponding author for this work

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    Modern methods for spoken language identification (LID) have demonstrated promising results when trained on large datasets. However, their effectiveness is considerably impacted by the discrepancies between the training and testing distributions. Unsupervised domain adaptation (UDA) addresses the domain discrepancies by aligning feature distributions across source and target domains without using labeled target data, thereby ensuring the model performs consistently across both domains. This alignment, however, may come at the cost of reduced model discriminability. To enhance performance under domainmismatched conditions, a LID system should learn domaininvariant representations of speech while maintaining language discriminability at every level, from initial segment-level representations to final utterance-level representations. To address this issue, in this paper, we propose a hierarchical attention network that simultaneously enhances model domain invariance through inter-domain alignment and improves model's language discriminability using intra-domain contrastive discrimination. The performance on the target domain shows that the proposed method exceeds the effectiveness of the top-performing existing approaches.

    Original languageEnglish
    Title of host publication2024 IEEE Conference on Engineering Informatics, ICEI 2024
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    ISBN (Electronic)9798331505776
    DOIs
    Publication statusPublished - 2024
    Event2024 IEEE Conference on Engineering Informatics, ICEI 2024 - Melbourne, Australia
    Duration: 20-11-202428-11-2024

    Publication series

    Name2024 IEEE Conference on Engineering Informatics, ICEI 2024

    Conference

    Conference2024 IEEE Conference on Engineering Informatics, ICEI 2024
    Country/TerritoryAustralia
    CityMelbourne
    Period20-11-2428-11-24

    All Science Journal Classification (ASJC) codes

    • Fluid Flow and Transfer Processes
    • Artificial Intelligence
    • Computer Vision and Pattern Recognition
    • Information Systems
    • Mechanical Engineering
    • Health Informatics
    • Media Technology
    • Control and Optimization

    Fingerprint

    Dive into the research topics of 'Hierarchical Attention Mechanism for Domain-invariant Spoken Language Identification via Inter-domain Alignment and Intra-domain Language Discrimination'. Together they form a unique fingerprint.

    Cite this