About

This is an experiment using Julia to do sequence-based NLP tasks.

Status

This project is not in a ready state for use. Several tests are set up for forward and backward propagation, but the training loop is not established yet.

Issues

There are performance problems on the GPU; the forward pass is GPU-optimized and takes ~150 ms, but the backward pass performed either as

Flux.train!(Loss, θ, [(X, Y)], opt)

or as

grads = Flux.tracker.gradient(() -> Loss(X, Y), θ)
Flux.Tracker.update!(opt, θ, grads)

takes >10 seconds to complete on the GPU, compared with ~1.8 seconds on the CPU. Help with this from someone more experienced with Julia, Flux and/or CuArrays would be appreciated.

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
models/AttentionEmbed		models/AttentionEmbed
src		src
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Status

Issues

About

Releases

Packages

Languages

License

brainsqueeze/SequenceModels.jl

Folders and files

Latest commit

History

Repository files navigation

About

Status

Issues

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages