a much faster longest_common_prefix for words #19322

videlec · 2015-10-01T02:47:25Z

I had to do some computations of the following kind, which are damn slow:

sage: w = words.FibonacciWord()[:10000]
sage: %time L = [[len(w[n:].longest_common_prefix(w[n+fibonacci(i):])) for i in range(5,20)] for n in range(1,1000)]
CPU times: user 20.6 s, sys: 44 ms, total: 20.7 s
Wall time: 20.5 s

sage: w = Words([0,1])(list(words.FibonacciWord()[:10000]))
sage: %time L = [[len(w[n:].longest_common_prefix(w[n+fibonacci(i):])) for i in range(5,20)] for n in range(1,1000)]
CPU times: user 7.99 s, sys: 56 ms, total: 8.04 s
Wall time: 7.93 s

And with the branch:

sage: w = Words([0,1])(list(words.FibonacciWord()[:10000]))
sage: %time L = [[len(w[n:].longest_common_prefix(w[n+fibonacci(i):])) for i in range(5,20)] for n in range(1,1000)]
CPU times: user 172 ms, sys: 0 ns, total: 172 ms
Wall time: 168 ms
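
For readers without Sage at hand, the workload being timed can be sketched in plain Python. This is only an illustration of what the benchmark computes, not the code in the branch: `fibonacci_word`, `fib` and `lcp_length` are hypothetical helper names, the word is assumed to be stored as a plain list, and the suffixes are compared by index instead of being sliced.

```python
def fibonacci_word(n):
    """Return the first n letters of the Fibonacci word over {0, 1}."""
    # The Fibonacci word is the fixed point of the substitution 0 -> 01, 1 -> 0.
    w = [0]
    while len(w) < n:
        w = [x for a in w for x in ((0, 1) if a == 0 else (0,))]
    return w[:n]


def fib(i):
    """Fibonacci numbers with fib(0) = 0 and fib(1) = 1."""
    a, b = 0, 1
    for _ in range(i):
        a, b = b, a + b
    return a


def lcp_length(w, i, j):
    """Length of the longest common prefix of w[i:] and w[j:], without slicing."""
    n = len(w)
    k = 0
    while i + k < n and j + k < n and w[i + k] == w[j + k]:
        k += 1
    return k


# Scaled-down version of the nested loop from the timings above.
w = fibonacci_word(1000)
L = [[lcp_length(w, n, n + fib(i)) for i in range(5, 12)] for n in range(1, 50)]
```

Comparing by index avoids creating two new suffix objects per call, which is one plausible source of overhead in a loop of this shape.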

We also implement longest_common_suffix and fix the following annoying feature of has_prefix:

sage: W = Words([0,1,2])
sage: w = W([0,1,0,2])
sage: w.has_prefix(words.FibonacciWord())
False

CC: @seblabbe

Component: combinatorics

Author: Vincent Delecroix

Branch/Commit: ebbc28d

Reviewer: Nathann Cohen

Issue created by migration from https://trac.sagemath.org/ticket/19322


videlec · 2015-10-01T02:49:48Z

Commit: fe306f9

videlec · 2015-10-01T02:49:48Z

Branch: u/vdelecroix/19322

videlec · 2015-10-01T02:49:48Z

New commits:

`fe306f9`	`Trac 19322: faster longest_common_prefix`

nathanncohen · 2015-10-02T08:03:58Z

comment:2

Hello Vincent,

Looks good. A couple of remarks:

1. You can use min/max in Cython code. Contrary to Python :-P
2. longest_suffix/Python case: what about 'caching' len(other) instead of recomputing it at every test?
3. longest prefix/Python case: the code generated by python for this 'slice' is scary. Isn't it possible to reimplement it without it? Plus if you need to import 'islice' it is maybe better to do so at module level, it's not thaaat bad either and it may happen that the prefix be one character long, in which case importing stuff could be comparatively more expensive (?).
4. Have you considered returning a 'new word' (through new_c) even when that new word is equal to one of self and other? It would simplify the code, and I don't know if it matters as it does not allocate new memory anyway?
5. I do not understand your 'not able to initialize a word from [..]'. The code does not try to initialize a word, does it? It's more something like "unsupported type"?..
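
The remarks about the Python-case code paths (cache `len(other)`, avoid building a slice, import at module level) can be illustrated with a minimal pure-Python sketch. It is not the Cython code from the branch: the function names are hypothetical, and the inputs are assumed to be ordinary iterables (for the prefix) or indexable sequences (for the suffix).

```python
from itertools import takewhile  # imported once, at module level


def common_prefix_length(u, v):
    """Length of the longest common prefix of two iterables."""
    # zip stops at the shorter input, so no slice is ever built.
    return sum(1 for _ in takewhile(lambda p: p[0] == p[1], zip(u, v)))


def common_suffix_length(u, v):
    """Length of the longest common suffix of two sequences."""
    lu, lv = len(u), len(v)  # lengths computed once, outside the loop
    m = min(lu, lv)
    k = 0
    while k < m and u[lu - 1 - k] == v[lv - 1 - k]:
        k += 1
    return k
```

Because `zip` is lazy, `common_prefix_length` also terminates when one of the two arguments is an endless iterator, as long as the other one is finite.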

Nathann

sagetrac-git · 2015-10-04T22:33:15Z

Branch pushed to git repo; I updated commit sha1. New commits:

`ebbc28d`	`Trac 19322: reviewer comments`

sagetrac-git · 2015-10-04T22:33:15Z

Changed commit from fe306f9 to ebbc28d

videlec · 2015-10-04T22:34:52Z

comment:4

Hello Nathann,

I implemented your remarks except number 4. Calling _new_c does allocate memory: the memory needed for a Python object. But I don't know whether it is justified or not (i.e. the trade-off between "simple code" and "efficient code").

Vincent

nathanncohen · 2015-10-05T07:17:09Z

Reviewer: Nathann Cohen

nathanncohen · 2015-10-05T07:17:09Z

comment:5

Yoooooooooo !

I implemented your remarks except number 4. Calling _new_c does allocate memory: the memory needed for a Python object. But I don't know whether it is justified or not (i.e. the trade-off between "simple code" and "efficient code").

Okay okay, you decide, just felt like bringing it up when I reviewed this code.

Stamped, and good to go.

Nathann

videlec · 2015-10-05T10:06:20Z

comment:6

Thanks!

vbraun · 2015-10-12T22:52:51Z

Changed branch from u/vdelecroix/19322 to ebbc28d

videlec added this to the sage-6.9 milestone Oct 1, 2015

videlec added c: combinatorics labels Oct 1, 2015

videlec added the s: needs review label Oct 1, 2015

nathanncohen mannequin added s: positive review and removed s: needs review labels Oct 5, 2015

vbraun removed the s: positive review label Oct 12, 2015

vbraun closed this as completed in da02e9f Oct 12, 2015
