Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bidirectional LSTM-CRF for Clinical Concept Extraction #150

Closed
agitter opened this issue Nov 28, 2016 · 7 comments
Closed

Bidirectional LSTM-CRF for Clinical Concept Extraction #150

agitter opened this issue Nov 28, 2016 · 7 comments

Comments

@agitter
Copy link
Collaborator

agitter commented Nov 28, 2016

https://arxiv.org/abs/1611.08373

Automated extraction of concepts from patient clinical records is an essential facilitator of clinical research. For this reason, the 2010 i2b2/VA Natural Language Processing Challenges for Clinical Records introduced a concept extraction task aimed at identifying and classifying concepts into predefined categories (i.e., treatments, tests and problems). State-of-the-art concept extraction approaches heavily rely on handcrafted features and domain-specific resources which are hard to collect and define. For this reason, this paper proposes an alternative, streamlined approach: a recurrent neural network (the bidirectional LSTM with CRF decoding) initialized with general-purpose, off-the-shelf word embeddings. The experimental results achieved on the 2010 i2b2/VA reference corpora using the proposed framework outperform all recent methods and ranks closely to the best submission from the original 2010 i2b2/VA challenge.

@cgreene
Copy link
Member

cgreene commented Dec 22, 2016

This is a short paper that employs an LTSM network with random, Word2Vec, and GloVe embeddings. Performance is generally strong without hand-tuning or hand-constructed features applied on top of embeddings, though even the best LTSM method was pareto dominated by existing methods (i.e. there are methods that are better in both precision and recall). I'll probably mention it but it's definitely not transformative over existing approaches. Of note, the authors provide source code for the analyses.

@gwaybio
Copy link
Contributor

gwaybio commented Dec 22, 2016

Of note, the authors provide source code for the analyses.

This could be something we stress heavily in the discussion about progressing the field. Made me think of a tweet I saw today about #159

@cgreene
Copy link
Member

cgreene commented Dec 22, 2016

@gwaygenomics : I like the idea of noting exactly which contributions have provided source code. That might be loads of work though...

@cgreene
Copy link
Member

cgreene commented Dec 22, 2016

@gwaygenomics : can you also note that tweet in the #159 discussion?

@cgreene
Copy link
Member

cgreene commented Dec 22, 2016

Might as well provide the link to the code here in case someone comes along later and wants it:

https://github.com/raghavchalapathy/Bidirectional-LSTM-CRF-for-Clinical-Concept-Extraction

@cgreene
Copy link
Member

cgreene commented Dec 22, 2016

I guess this means we can tag @raghavchalapathy also!

@cgreene
Copy link
Member

cgreene commented Dec 23, 2016

Discussed in #167

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants