v0.0.2
Here we added major improvements:
- Now we avoid calculation of dense full-size matrix via batching calculation of log-likelihood;
- We implemented cyclic coordinate descent.
Learning rate is still fixed, in our tests this approach is good enough for most of the situations (line search is very computationally expensive)