Debug: 1. cache refresh at every init; 2. compute cardinality only once #9

MarkDana · 2021-12-23T17:17:05Z

Bugs solved:

About cache refresh:

Manually refresh cache at every init. Added one refresh line at SkeletonDiscovery.py:L145 (for PC) and FCI.py:L615 (for FCI).
Deprecate hash(data.tobytes()) - it's slow. @chenweiDelight also mentions a faster hash(str(data)) - but let's just init refresh and use no hash.

About passing cardinality chisq or gsq:

Added cardinalities and is_discrete at Fas.py:L10, and calculate cardinalities only once at SkeletonDiscovery.py:L165 (for PC) and FCI.py:L625 (for FCI). No need to np.max() every time.

TODO: still debugging:

Now PC results with/without fas (change PC.py:L71) respectively:

data (#nodes/#edges) time (sec)	without fas time	with fas time	SHD(without fas, with fas)
cancer 5/4	0.009	0.009	0
earthquake 5/4	0.012	0.01	0
survey 6/6	0.014	0.015	0
asia 8/8	0.024	0.025	0
sachs 11/17	0.142	0.148	0
child 20/25	0.618	0.699	4
insurance 27/52	1.45	1.823	3
water 32/66	0.321	0.365	0
alarm 37/46	0.873	1.015	1
barley 48/84	3.466	4.95	0
hailfinder 56/66	0.938	1.588	0
hepar2 70/123	9.482	11.949	3
win95pts 76/112	3.381	4.504	0
andes 223/338	26.741	45.722	0

So two problems:

Still small SHD difference at datasets e.g. child.
Still time difference, e.g. andes 26s vs 45s - though before solving cardinalities issue, it's ~300s.
See profiling stat: callings count on chisq is different on with/without fas.

MarkDana · 2021-12-26T04:58:53Z

Closed this pr - the two issues (cache for different data & pass cardinality to chisq/gsq) has already been covered in @chenweiDelight 's newest commit (732b9d1).

MarkDana added 3 commits December 23, 2021 12:03

Debug: 1. cache refresh at every init; 2. compute cardinality only once

76aaa03

Package the search cache + CI test into a function in FAS

a255562

Added Fas.citest_cache to cg_1.citest_cache, for UCSepset.uc_sepset

ab597c0

MarkDana closed this Dec 26, 2021

MarkDana mentioned this pull request Jul 2, 2022

Rewrite CITests as a class && re-use covariance matrix for fisherz #46

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Debug: 1. cache refresh at every init; 2. compute cardinality only once #9

Debug: 1. cache refresh at every init; 2. compute cardinality only once #9

Uh oh!

MarkDana commented Dec 23, 2021

Uh oh!

MarkDana commented Dec 26, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Debug: 1. cache refresh at every init; 2. compute cardinality only once #9

Debug: 1. cache refresh at every init; 2. compute cardinality only once #9

Uh oh!

Conversation

MarkDana commented Dec 23, 2021

Bugs solved:

TODO: still debugging:

Uh oh!

MarkDana commented Dec 26, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant