This repo implements an NER model in Julia with Flux (GloVe + BiLSTM + softmax). It currently reaches 78% chunk accuracy on the CoNLL-2003 dataset.
An introduction to the Julia language (in Chinese): https://www.oschina.net/news/99104/what-is-julia
Please see the REQUIRE file for the package dependencies.
Given a sentence, assign a tag to each word (including punctuation). The classic application is Named Entity Recognition. Here is an example:
John lives in New York
B-PER O O B-LOC I-LOC
The code related to model construction:
```julia
model = Chain(
    Dense_m(Weight),
    MyBiLSTM(EmbedSize, HiddenSize),
    Dropout(0.5),
    lower_dim(HiddenSize * 2),
    Dense(HiddenSize * 2, ClassNum),
    softmax
)
```
- an embedding layer that maps each word to its GloVe vector (the development set is used to tune the hyperparameter EmbedSize);
- a BiLSTM run over each sentence to extract a contextual representation of each word;
- a dropout layer;
- a fully connected layer with softmax to decode the tags.
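For readers curious how such a stack could be assembled, here is a minimal sketch of a bidirectional LSTM built from two plain Flux LSTMs. The `SketchBiLSTM` name and dimensions are illustrative assumptions, not the repo's actual `MyBiLSTM`/`Dense_m` implementations, and the exact `LSTM` constructor signature depends on the Flux version.

```julia
# Minimal sketch (assumptions, not the repo's code): run one LSTM
# left-to-right and one right-to-left over a sequence of word vectors,
# then concatenate their outputs at each time step.
using Flux

struct SketchBiLSTM
    fwd
    bwd
end

# Build the two directional LSTMs from the embedding and hidden sizes.
SketchBiLSTM(in::Integer, out::Integer) =
    SketchBiLSTM(LSTM(in, out), LSTM(in, out))

function (m::SketchBiLSTM)(xs)
    ys_fwd = [m.fwd(x) for x in xs]                  # left-to-right pass
    ys_bwd = reverse([m.bwd(x) for x in reverse(xs)]) # right-to-left pass
    [vcat(f, b) for (f, b) in zip(ys_fwd, ys_bwd)]   # 2*HiddenSize per word
end
```

Each output vector has twice the hidden size, which is why the decoding layer above takes `HiddenSize * 2` inputs.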
- Download the initial data (the CoNLL-2003 dataset), or clone the repo:

```shell
git clone https://github.com/GGchencan/JuJu.git
```

The initial CoNLL-2003 data is included in the demo folder.
- Use the preprocessing program to preprocess the data:

```shell
julia data_preprocess_custom.jl train.txt test.txt dev.txt
```

This program builds six data files from the dataset, used for training, evaluation, and testing. The parameters are, in order, the paths to the train data, test data, and evaluation data.
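As a rough illustration of the input the preprocessing step consumes, the following pure-Julia sketch parses CoNLL-style lines (one `token tag` pair per line, blank line between sentences). The function name and return shape are assumptions for illustration, not the repo's API.

```julia
# Sketch: parse CoNLL-style lines into sentences of (token, tag) pairs.
# Assumes the token is the first column and the tag is the last column.
function read_conll(lines)
    sentences = Vector{Vector{Tuple{String,String}}}()
    current = Tuple{String,String}[]
    for line in lines
        s = strip(line)
        if isempty(s)
            # Blank line ends the current sentence.
            isempty(current) || push!(sentences, current)
            current = Tuple{String,String}[]
        else
            parts = split(s)
            push!(current, (String(parts[1]), String(parts[end])))
        end
    end
    isempty(current) || push!(sentences, current)
    return sentences
end
```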
- cd to the repo and simply run main.jl:

```shell
julia main.jl
```

After several minutes the training process will be finished.
- Make sure the generated model file exists in the JuJu folder, then run demo.jl to show the result of your training:

```shell
julia demo/demo.jl
```

The result is expected to look like:
The training data must follow the CoNLL-2003 data format.
A default data example:

```
John B-PER
lives O
in O
New B-LOC
York I-LOC
. O

This O
is O
another O
sentence O
```
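Chunk accuracy is measured over entity spans rather than individual tags, so BIO tags must be decoded into chunks first. The sketch below (function name and return shape are assumptions, not the repo's API) shows one way to do that:

```julia
# Sketch: decode BIO tags into (entity_type, start, stop) chunks.
# "B-X" opens a chunk of type X; following "I-X" tags extend it;
# anything else closes the open chunk.
function bio_chunks(tags)
    chunks = Tuple{String,Int,Int}[]
    start, typ = 0, ""
    for (i, t) in enumerate(tags)
        if startswith(t, "B-")
            start > 0 && push!(chunks, (typ, start, i - 1))
            start, typ = i, t[3:end]
        elseif startswith(t, "I-") && start > 0 && t[3:end] == typ
            # Same entity continues; keep the chunk open.
        else
            start > 0 && push!(chunks, (typ, start, i - 1))
            start, typ = 0, ""
        end
    end
    start > 0 && push!(chunks, (typ, start, length(tags)))
    return chunks
end
```

For the example sentence above, `["B-PER", "O", "O", "B-LOC", "I-LOC"]` decodes to one PER chunk (word 1) and one LOC chunk (words 4–5).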
After you prepare your own data and split it into train, test, and eval sets, use step 2 in Getting Started to preprocess it and run the training.
The results are shown below:
- example 1: epoch = 10, dim (word embedding dimension) = 50
- example 2: epoch = 10, dim = 300
- example 3: epoch = 20, dim = 300