A Simple Method for Commonsense Reasoning #18

Open
flrngel opened this issue Jun 24, 2018 · 0 comments

https://arxiv.org/abs/1806.02847
Google Brain

Abstract

  • the paper's model outperforms previous state-of-the-art methods on the Winograd Schema Challenge
  • the model is a large RNN language model (LM)

1. Introduction

  • previous models perform poorly on the Winograd Schema Challenge, e.g. ([O] = correct, [X] = incorrect):
    • "The trophy doesn't fit in the suitcase because it is too big. What is too big?"
      • Answer0: the trophy [O]
      • Answer1: the suitcase [X]
    • "The trophy doesn't fit in the suitcase because it is too small. What is too small?"
      • Answer0: the trophy [X]
      • Answer1: the suitcase [O]

2. Related Work

3. Methods

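  • substitute the pronoun in the original sentence with each of the candidate choices
    • this turns the task into something an LM can solve: score each substituted sentence and pick the most probable one
  • 2 methods for scoring (full and partial)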
    image
    image
    image
  • Negative sampling
    image
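
The substitute-and-score idea above can be sketched in a few lines. This is a minimal illustration, not the paper's implementation: `ToyBigramLM` is a hypothetical add-one-smoothed bigram model standing in for the large RNN LM, and `substitute`, `full_score`, `partial_score`, and `resolve` are names invented here. Full scoring is assumed to sum the log-probabilities of every token in the substituted sentence, while partial scoring sums only the tokens that follow the substituted candidate.

```python
import math
from collections import Counter


class ToyBigramLM:
    """Hypothetical stand-in for the RNN LM: add-one smoothed bigram model."""

    def __init__(self, corpus):
        self.unigrams = Counter()
        self.bigrams = Counter()
        for sentence in corpus:
            toks = ["<s>"] + sentence.lower().split()
            self.unigrams.update(toks)
            self.bigrams.update(zip(toks, toks[1:]))
        # +1 reserves one bucket for unseen words (a toy simplification).
        self.vocab_size = len(self.unigrams) + 1

    def log_prob(self, prev, word):
        num = self.bigrams[(prev, word)] + 1
        den = self.unigrams[prev] + self.vocab_size
        return math.log(num / den)

    def token_log_probs(self, tokens):
        """Return log P(token_i | token_{i-1}) for every token, in order."""
        toks = ["<s>"] + [t.lower() for t in tokens]
        return [self.log_prob(p, w) for p, w in zip(toks, toks[1:])]


def substitute(tokens, pronoun_index, candidate):
    """Replace the pronoun with the candidate; also return the index just past it."""
    cand = candidate.split()
    new_tokens = tokens[:pronoun_index] + cand + tokens[pronoun_index + 1:]
    return new_tokens, pronoun_index + len(cand)


def full_score(lm, tokens, end_of_candidate):
    """Log-probability of the whole substituted sentence."""
    return sum(lm.token_log_probs(tokens))


def partial_score(lm, tokens, end_of_candidate):
    """Log-probability of only the tokens after the substituted candidate."""
    return sum(lm.token_log_probs(tokens)[end_of_candidate:])


def resolve(lm, sentence, pronoun, candidates, score_fn):
    """Score every candidate substitution and return (best candidate, all scores)."""
    tokens = sentence.split()
    idx = tokens.index(pronoun)
    scores = {}
    for cand in candidates:
        new_tokens, end = substitute(tokens, idx, cand)
        scores[cand] = score_fn(lm, new_tokens, end)
    return max(scores, key=scores.get), scores


if __name__ == "__main__":
    # Tiny made-up corpus, only to make the sketch runnable.
    lm = ToyBigramLM([
        "the trophy is too big",
        "the trophy is big",
        "the suitcase was small",
    ])
    sentence = "The trophy does not fit in the suitcase because it is too big"
    for fn in (full_score, partial_score):
        print(fn.__name__, resolve(lm, sentence, "it", ["the trophy", "the suitcase"], fn))
```

A bigram model only sees one word of context, so it cannot capture the long-range cue ("too big" vs. "too small") that makes Winograd schemas hard; the sketch shows the scoring mechanics only.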

5. Main results

image
image

6. Analysis

6.2 Partial scoring is better than full scoring

image
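
For reference, the two scores can be written out as follows. This is a hedged reconstruction under assumed notation (the original formulas were images and are not reproduced here): the sentence is $w_1, \dots, w_n$, the pronoun sits at position $k$, and $c$ is the candidate substituted for it.

```latex
\begin{align}
\mathrm{Score}_{\text{full}}(w_k \leftarrow c)
  &= P(w_1, \dots, w_{k-1}, c, w_{k+1}, \dots, w_n) \\
\mathrm{Score}_{\text{partial}}(w_k \leftarrow c)
  &= P(w_{k+1}, \dots, w_n \mid w_1, \dots, w_{k-1}, c) \\
\mathrm{Score}_{\text{full}}(w_k \leftarrow c)
  &= P(w_1, \dots, w_{k-1}, c) \cdot \mathrm{Score}_{\text{partial}}(w_k \leftarrow c)
\end{align}
```

The last line is just the chain rule. It suggests one plausible reading of the result: the full score carries the extra factor $P(w_1, \dots, w_{k-1}, c)$, which depends on how frequent the candidate itself is, while the partial score drops that factor.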

Appendix

  • the authors tested 2 versions of the model
    • both use an LSTM model
    • word level
      • 1024 dimensions
    • character level
      • each word has shape (50, 16), with <start of word> and <end of word> markers
      • uses convolution and 2 fully connected layers to encode each word (1024 dimensions)
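
The character-level input shape can be illustrated with a small sketch. It assumes that (50, 16) means 50 character slots per word, each mapped to a 16-dimensional embedding; that reading, the byte-based character ids, the special ids `PAD`, `START_OF_WORD`, `END_OF_WORD`, and the random embedding table are all assumptions made here, not details from the notes. The convolution and fully connected layers that produce the 1024-dimensional word vector are left out.

```python
import random

MAX_WORD_LEN = 50  # character slots per word (from the notes)
CHAR_DIM = 16      # assumed to be the per-character embedding size

# Hypothetical special ids; the notes only say that word boundary markers exist.
PAD, START_OF_WORD, END_OF_WORD = 0, 1, 2
OFFSET = 3  # real characters (here: UTF-8 bytes) start after the special ids


def word_to_char_ids(word):
    """Map a word to exactly MAX_WORD_LEN ids: <start>, characters, <end>, padding."""
    raw = list(word.encode("utf-8"))[: MAX_WORD_LEN - 2]  # leave room for markers
    ids = [START_OF_WORD] + [b + OFFSET for b in raw] + [END_OF_WORD]
    return ids + [PAD] * (MAX_WORD_LEN - len(ids))


def make_embedding_table(num_ids=256 + OFFSET, seed=0):
    """Random table standing in for learned character embeddings."""
    rng = random.Random(seed)
    return [[rng.uniform(-0.1, 0.1) for _ in range(CHAR_DIM)] for _ in range(num_ids)]


def embed_word(word, table):
    """Return a MAX_WORD_LEN x CHAR_DIM matrix (list of lists) for one word."""
    return [table[i] for i in word_to_char_ids(word)]


if __name__ == "__main__":
    table = make_embedding_table()
    matrix = embed_word("suitcase", table)
    print(len(matrix), len(matrix[0]))
```

In a full model, this matrix would be the input to the convolution and the two fully connected layers mentioned above.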

Note
