Embedding Hallucination for Few-Shot Language Fine-tuning

less than 1 minute read

Introduction

  • What is the name of the EmbedHalluc paper?

    Embedding Hallucination for Few-Shot Language Fine-tuning

    Yiren JianChongyang GaoSoroush Vosoughi

    • Same authors as SupCON paper
  • What are the main contributions of the EmbedHalluc paper?
    • cWGAN based augmentation in the embedding space
  • The Wasserstein GAN uses Wasserstein distance as the objective function to stabilize the training of GAN.

Method

  • Hallucinated embedding in HallucEmbed are pseudolabeled by a teacher model

  • HallucEmbed trains a GAN discriminator to distinguish between real and hallucinated embeddings

    Untitled

Results

  • EmbedHalluc roBERTa large results pretty good few shot performance
    • not as good as my DE method

    Untitled

Reference

@misc{https://doi.org/10.48550/arxiv.2205.01307,
  doi = {10.48550/ARXIV.2205.01307},
  
  url = {https://arxiv.org/abs/2205.01307},
  
  author = {Jian, Yiren and Gao, Chongyang and Vosoughi, Soroush},
  
  keywords = {Computation and Language (cs.CL), Artificial Intelligence (cs.AI), FOS: Computer and information sciences, FOS: Computer and information sciences},
  
  title = {Embedding Hallucination for Few-Shot Language Fine-tuning},
  
  publisher = {arXiv},
  
  year = {2022},
  
  copyright = {arXiv.org perpetual, non-exclusive license}
}

Updated: