Project

Anti-Noise Fine-Grained Entity Typing

Fine-Grained Entity Typing

Date

    06/2017 - Present

Sponsored by

    National Natural Science Foundation of China (NSFC) No.61572469

Advisor:

    Yantao Jia, Yuanzhuo Wang

Project Objectives:

    Fine-grained entity typing aims to assign entity mentions in the free text with types (e.g., person, politician, etc.), which are arranged in a hierarchical structure. Traditional distant supervision based methods employ a structured data source as a weak supervision and do not need hand-labeled data, but they neglect the label noise in the automatically labeled training corpus. For example, the entity ''Donald Trump'' has types ''person", ''businessman'' and ''politician'' in KBs, thus all three types are annotated for its mentions in the training corpora. But in sentence ''Donald Trump announced his candidacy for President of US.'', only ''person'' and ''politician'' are correct types, while ''businessman'' can not be deduced from the sentence, so serves as noise.
    The project aims to improve the performance of fine-grained entity typing with reducing noise.

My Work 1

  • Assisted in designing the Path-based Attention Neural model (PAN) to dynamically reduce the weights of wrong labeled sentences for each type. Specifically, propose to formulate path as semantic composition of all the types on the path by different composition operators.
  • Implemented baseline (Shimaoka et al. 2016) by PyTorch, and then assisted in conducting experiments on two datasets to demonstrate the superiority of PAN in reducing noise.
  • PAN achieved more than 4% improvement of typing accuracy.Co-authored paper was submitted as a poster to AAAI 2018, and is under review now.

My Work 2

  • Came up with the idea to use Gaussian to capture the hierarchical structures of types.
  • Designed Gaussian loss function to learn Gaussian embedding, and to reduce noise by using a possibility to select partial labels. (in progress)