
Feature step dependencies are a pain to work with #2

Open
larskotthoff opened this issue Jul 16, 2015 · 46 comments

@larskotthoff

The way feature steps are currently implicitly encoded is a pain. First, you have to read the spec very carefully to understand the semantics (which are the opposite of what at least I would intuitively expect), and modifying the features/feature steps (e.g. for feature filtering) is a complex and error-prone operation.

In particular, to remove a feature step, you have to check all the other feature steps to see whether they contain features that are also provided by the removed step, and if so remove those features as well.

Another (albeit minor) niggle is that the format of description.txt is unnecessarily hard to parse and write because the key-value convention is broken for the feature steps (the key is not a primitive value but is constructed from other things).

I propose two changes. First, use YAML for description.txt, which will introduce only minor changes but allow us to use off-the-shelf libraries for parsing and writing rather than having to write custom code. Second, encode feature step dependencies explicitly through requires and provides keys.

Example:

scenario_id: SAT11-HAND
performance_measures: runtime
maximize: false
performance_type: runtime
algorithm_cutoff_time: 5000
algorithm_cutoff_memory: ?
features_cutoff_time: 5000
features_cutoff_memory: ?
features_deterministic: nvarsOrig,nclausesOrig,nvars,nclauses,reducedVars,reducedClauses,vars_clauses_ratio,POSNEG_RATIO_CLAUSE_mean,POSNEG_RATIO_CLAUSE_coeff_variation,POSNEG_RATIO_CLAUSE_min,POSNEG_RATIO_CLAUSE_max,POSNEG_RATIO_CLAUSE_entropy,VCG_CLAUSE_mean,VCG_CLAUSE_coeff_variation,VCG_CLAUSE_min,VCG_CLAUSE_max,VCG_CLAUSE_entropy,UNARY,BINARYp,TRINARYp,VCG_VAR_mean,VCG_VAR_coeff_variation,VCG_VAR_min,VCG_VAR_max,VCG_VAR_entropy,POSNEG_RATIO_VAR_mean,POSNEG_RATIO_VAR_stdev,POSNEG_RATIO_VAR_min,POSNEG_RATIO_VAR_max,POSNEG_RATIO_VAR_entropy,HORNY_VAR_mean,HORNY_VAR_coeff_variation,HORNY_VAR_min,HORNY_VAR_max,HORNY_VAR_entropy,horn_clauses_fraction,VG_mean,VG_coeff_variation,VG_min,VG_max,CG_mean,CG_coeff_variation,CG_min,CG_max,CG_entropy,cluster_coeff_mean,cluster_coeff_coeff_variation,cluster_coeff_min,cluster_coeff_max,cluster_coeff_entropy,DIAMETER_mean,DIAMETER_coeff_variation,DIAMETER_min,DIAMETER_max,DIAMETER_entropy,cl_num_mean,cl_num_coeff_variation,cl_num_min,cl_num_max,cl_num_q90,cl_num_q10,cl_num_q75,cl_num_q25,cl_num_q50,cl_size_mean,cl_size_coeff_variation,cl_size_min,cl_size_max,cl_size_q90,cl_size_q10,cl_size_q75,cl_size_q25,cl_size_q50,SP_bias_mean,SP_bias_coeff_variation,SP_bias_min,SP_bias_max,SP_bias_q90,SP_bias_q10,SP_bias_q75,SP_bias_q25,SP_bias_q50,SP_unconstraint_mean,SP_unconstraint_coeff_variation,SP_unconstraint_min,SP_unconstraint_max,SP_unconstraint_q90,SP_unconstraint_q10,SP_unconstraint_q75,SP_unconstraint_q25,SP_unconstraint_q50,saps_BestSolution_Mean,saps_BestSolution_CoeffVariance,saps_FirstLocalMinStep_Mean,saps_FirstLocalMinStep_CoeffVariance,saps_FirstLocalMinStep_Median,saps_FirstLocalMinStep_Q10,saps_FirstLocalMinStep_Q90,saps_BestAvgImprovement_Mean,saps_BestAvgImprovement_CoeffVariance,saps_FirstLocalMinRatio_Mean,saps_FirstLocalMinRatio_CoeffVariance,gsat_BestSolution_Mean,gsat_BestSolution_CoeffVariance,gsat_FirstLocalMinStep_Mean,gsat_FirstLocalMinStep_CoeffVariance,gsat_FirstLocalMinStep_Median,gsat_FirstLocalMinStep_Q10,gsat_FirstLocalMinStep_Q90,gsat_BestAvgImprovement_Mean,gsat_BestAvgImprovement_CoeffVariance,gsat_FirstLocalMinRatio_Mean,gsat_FirstLocalMinRatio_CoeffVariance,lobjois_mean_depth_over_vars,lobjois_log_num_nodes_over_vars
features_stochastic: 
algorithms_deterministic: MPhaseSAT_2011-02-15,Sol_2011-04-04,QuteRSat_2011-05-12_fixed_,CryptoMiniSat_Strange-Night2-st_fixed_,PicoSAT_941,glucose_2,clasp_2.0-R4092-crafted,SAT07referencesolverminisat_SAT2007,jMiniSat_2011,RestartSAT_B95,SAT09referencesolverclasp_1.2.0-SAT09-32,sathys_2011-04-01,SApperloT2010_2011-05-15_fixed_,sattime+_2011-03-02,sattime_2011-03-02 
algorithms_stochastic: 
number_of_feature_steps: 10
default_steps: Pre, Basic, KLB, CG
feature_steps:
  - name: Pre

  - name: Basic
    provides: vars_clauses_ratio,POSNEG_RATIO_CLAUSE_mean,POSNEG_RATIO_CLAUSE_coeff_variation,POSNEG_RATIO_CLAUSE_min,POSNEG_RATIO_CLAUSE_max,POSNEG_RATIO_CLAUSE_entropy,VCG_CLAUSE_mean,VCG_CLAUSE_coeff_variation,VCG_CLAUSE_min,VCG_CLAUSE_max,VCG_CLAUSE_entropy,UNARY,BINARYp,TRINARYp
    requires: Pre

  - name: KLB
    provides: VCG_VAR_mean,VCG_VAR_coeff_variation,VCG_VAR_min,VCG_VAR_max,VCG_VAR_entropy,POSNEG_RATIO_VAR_mean,POSNEG_RATIO_VAR_stdev,POSNEG_RATIO_VAR_min,POSNEG_RATIO_VAR_max,POSNEG_RATIO_VAR_entropy,HORNY_VAR_mean,HORNY_VAR_coeff_variation,HORNY_VAR_min,HORNY_VAR_max,HORNY_VAR_entropy,horn_clauses_fraction,VG_mean,VG_coeff_variation,VG_min,VG_max
    requires: Pre

  - name: CG
    provides: CG_mean,CG_coeff_variation,CG_min,CG_max,CG_entropy,cluster_coeff_mean,cluster_coeff_coeff_variation,cluster_coeff_min,cluster_coeff_max,cluster_coeff_entropy
    requires: Pre

  - name: DIAMETER
    provides: DIAMETER_mean,DIAMETER_coeff_variation,DIAMETER_min,DIAMETER_max,DIAMETER_entropy
    requires: Pre

  - name: cl
    provides: cl_num_mean,cl_num_coeff_variation,cl_num_min,cl_num_max,cl_num_q90,cl_num_q10,cl_num_q75,cl_num_q25,cl_num_q50,cl_size_mean,cl_size_coeff_variation,cl_size_min,cl_size_max,cl_size_q90,cl_size_q10,cl_size_q75,cl_size_q25,cl_size_q50
    requires: Pre

  - name: sp
    provides: SP_bias_mean,SP_bias_coeff_variation,SP_bias_min,SP_bias_max,SP_bias_q90,SP_bias_q10,SP_bias_q75,SP_bias_q25,SP_bias_q50,SP_unconstraint_mean,SP_unconstraint_coeff_variation,SP_unconstraint_min,SP_unconstraint_max,SP_unconstraint_q90,SP_unconstraint_q10,SP_unconstraint_q75,SP_unconstraint_q25,SP_unconstraint_q50
    requires: Pre

  - name: ls_saps
    provides: saps_BestSolution_Mean,saps_BestSolution_CoeffVariance,saps_FirstLocalMinStep_Mean,saps_FirstLocalMinStep_CoeffVariance,saps_FirstLocalMinStep_Median,saps_FirstLocalMinStep_Q10,saps_FirstLocalMinStep_Q90,saps_BestAvgImprovement_Mean,saps_BestAvgImprovement_CoeffVariance,saps_FirstLocalMinRatio_Mean,saps_FirstLocalMinRatio_CoeffVariance
    requires: Pre

  - name: ls_gsat
    provides: gsat_BestSolution_Mean,gsat_BestSolution_CoeffVariance,gsat_FirstLocalMinStep_Mean,gsat_FirstLocalMinStep_CoeffVariance,gsat_FirstLocalMinStep_Median,gsat_FirstLocalMinStep_Q10,gsat_FirstLocalMinStep_Q90,gsat_BestAvgImprovement_Mean,gsat_BestAvgImprovement_CoeffVariance,gsat_FirstLocalMinRatio_Mean,gsat_FirstLocalMinRatio_CoeffVariance
    requires: Pre

  - name: lobjois
    provides: lobjois_mean_depth_over_vars,lobjois_log_num_nodes_over_vars
    requires: Pre

This makes it intuitively clear what Pre does and that it doesn't actually provide any features on its own. It also makes the number_of_feature_steps attribute redundant, so it could be removed.
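To illustrate the benefit, here is a rough sketch (Python with PyYAML; the helper and file name are just for illustration, not part of any existing code) of how removing a feature step would become a simple local operation under the explicit encoding:

import yaml

def remove_feature_step(description, step_name):
    # `description` is the dict obtained from yaml.safe_load() on a
    # description.txt in the proposed format (hypothetical helper).
    steps = description["feature_steps"]
    removed = next(s for s in steps if s["name"] == step_name)
    remaining = [s for s in steps if s["name"] != step_name]

    # Features that some remaining step still provides.
    still_provided = {
        f for s in remaining
        for f in (s.get("provides") or "").split(",") if f
    }
    # Features only the removed step provided; these columns have to go too.
    orphaned = [
        f for f in (removed.get("provides") or "").split(",")
        if f and f not in still_provided
    ]

    description["feature_steps"] = remaining
    return orphaned

with open("description.txt") as fh:
    desc = yaml.safe_load(fh)
orphaned_features = remove_feature_step(desc, "CG")

Any remaining step whose requires mentions the removed step would of course also have to be handled, but that check is now a plain lookup rather than an inference from the implicit encoding.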

@mlindauer @berndbischl

@berndbischl

I guess I agree. How do we parse YAML in R?

@larskotthoff

@berndbischl

ok sorry :)

@larskotthoff

No worries :)
As far as I can see, this would give us exactly the same data structure as we have at the moment (apart from the feature steps, of course), so any changes should be minimal.

@mlindauer

Hi Lars,

I agree that the current format of the feature groups is an issue.
I also like the idea of "provide" and "requires".

However, please note first of all that your example is wrong.
Pre provides the following features: "reducedVars,nvars,nclausesOrig,nvarsOrig,nclauses,reducedClauses".

Furthermore, I don't like YAML so much.
We use it for one of our homepages and it is always a pain to edit the YAML files.
Aren't there any better alternatives?

Cheers,
Marius

@larskotthoff

Ok, the fact that the example is wrong would have been much clearer in the new format :)

I don't see how editing YAML is more painful than editing a non-standard format.

@mlindauer

In the end, I can live with YAML.
However, there is no way to specify this with ARFF, is there?
If possible, I would like to avoid using two different standard formats.

@larskotthoff

Again, I don't see how using two different standard formats is worse than using a standard and a non-standard format. In principle I don't have a problem with using YAML for everything.

@mlindauer

How would one of the other files look in YAML?
I read on Wikipedia that every JSON file is also a valid YAML (>= 1.2) file.
I like JSON, but I don't know whether this is really user-friendly.

@larskotthoff

Hmm, I guess something like

- instance_id: bla
  repetition: 1
  feature_x: foo

I don't really see a problem with user-friendliness here -- you're not supposed to edit/write those files manually.

@mlindauer

Such a format would blow up our files by more than a factor of 2, I guess.

The description.txt is a file I always write manually.

@berndbischl

You can forget ARFF for such files immediately.

@larskotthoff

Yes, everything would be much larger. But as I said, I'm not opposed to keeping everything but description.txt in arff. We also have citation.bib which is in yet another standard format.

@mlindauer

OK.

I also asked Matthias whether he likes this new format, and he agreed.
So, please go on and make the changes.

Cheers,
Marius

@larskotthoff

Ok, what's your feeling on making the lists proper YAML lists as well? I.e. instead of comma-separated they would be

provides:
  - CG_mean
  - CG_coeff_variation
  - etc.

@mlindauer

I like the comma-separated version more, since I can look up the feature step corresponding to a feature by looking one line above (and not n lines above).
To have proper YAML (1.2) that is similar to what we have right now, we could use

[CG_mean, CG_coeff_variation,...]

However, we should then change the entire file consistently, so for example also algorithms_deterministic.

@larskotthoff

Ok, but presumably you're not going to parse the YAML yourself but use a library? And yes, that would apply for everything -- if the data structure is serialized by a YAML library we may not even be able to control which type of list we get (and don't need to care).

So I guess my real question is whether you're planning to use a library to parse/write the YAML.
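For reference, a small PyYAML sketch (illustrative only, not tied to our code) showing that the list style is actually a dump-time option, so a library round-trip can produce either form:

import yaml

data = {"provides": ["CG_mean", "CG_coeff_variation", "CG_min"]}

# Block style: one "- item" line per element.
print(yaml.dump(data, default_flow_style=False))

# Flow style: everything inline, e.g. {provides: [CG_mean, CG_coeff_variation, CG_min]}
print(yaml.dump(data, default_flow_style=True))

Either way, yaml.safe_load() reads both forms back into the same Python list.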

@berndbischl

Parsing: of course.

But I would prefer it if people could still manually write (smaller) files without programming.

Can we do that?

@mlindauer

I often have a look into the description.txt files to get a better feeling for the scenarios, e.g., which algorithms are used, how many features are used, and how the features are distributed across the feature groups.
I could write scripts for such things, but looking into the files is often faster.
So I would prefer that I can easily read the files.

@berndbischl

Well, I find that argument slightly strange. Why not use the EDA overview?

@larskotthoff

Of course you can still read/write the files manually and that shouldn't even be much more difficult than it is now. But it would be much easier to parse/write programmatically because we can just use YAML libraries.

@berndbischl

I mean, we invested lots of time to write scripts for exactly that purpose... web-based...

@larskotthoff

Which, come to think of it, we should rerun to update the web pages at some point.

@berndbischl

Proposal: Use Travis for that. People open PRs for a new scenario, then Travis builds all the EDA stuff. This even checks the validity of the scenario files. Only then do we merge. The only thing we would then have to run manually might be the selector benchmarks.

@larskotthoff

+1

@mlindauer

I mean, we invested lots of time to write scripts for exactly that purpose... web-based...

  1. I'm not always online.
  2. I'm faster with my local files than finding the URL and then clicking through the web interface.

@larskotthoff

Ok, so you think that

- name: Basic
  provides:
    - vars_clauses_ratio
    - POSNEG_RATIO_CLAUSE_mean
    - POSNEG_RATIO_CLAUSE_coeff_variation
    - POSNEG_RATIO_CLAUSE_min
    - POSNEG_RATIO_CLAUSE_max
    - POSNEG_RATIO_CLAUSE_entropy
    - VCG_CLAUSE_mean
    - VCG_CLAUSE_coeff_variation
    - VCG_CLAUSE_min
    - VCG_CLAUSE_max
    - VCG_CLAUSE_entropy
    - UNARY
    - BINARYp
    - TRINARYp
  requires: Pre

is harder to read than

- name: Basic
  provides: vars_clauses_ratio,POSNEG_RATIO_CLAUSE_mean,POSNEG_RATIO_CLAUSE_coeff_variation,POSNEG_RATIO_CLAUSE_min,POSNEG_RATIO_CLAUSE_max,POSNEG_RATIO_CLAUSE_entropy,VCG_CLAUSE_mean,VCG_CLAUSE_coeff_variation,VCG_CLAUSE_min,VCG_CLAUSE_max,VCG_CLAUSE_entropy,UNARY,BINARYp,TRINARYp
  requires: Pre

@mlindauer

Yes, but in the end, I don't feel strongly about this.
So, I can also live with the first format if we don't have a nice way to automatically generate the second format.

larskotthoff pushed a commit to coseal/aslib-r that referenced this issue Jul 29, 2015
larskotthoff pushed a commit to coseal/aslib_data that referenced this issue Jul 29, 2015
@larskotthoff

Ok, I've updated the spec, converted all the scenarios and updated the R code.

@mlindauer Could you please update the Python code/checker?

@mlindauer

I'm on vacation for the next two weeks. I will do it afterwards.

@larskotthoff

Ok, thanks. No rush :)

@larskotthoff

It just occurred to me that we should also have a look at the feature_runstatus.arff files for instances that are presolved. The spec doesn't say what should happen to dependent feature steps in this case, and the data is inconsistent. For example, for ASP, feature steps that depend on one that presolved the instance seem to be listed as "presolved" as well, but the costs aren't given, implying that they weren't actually run. For the SAT data sets, the runstatus of feature steps that depend on one that presolved is listed as "unknown" (which probably makes more sense in this case).

@mlindauer

Hi,

I started to implement the new description.txt parser and I found an issue.
According to the spec, "performance_measures" specifies a list.
But looking at some of the description.txt files, e.g., ASP-POTASSCO, it is only a string:
performance_measures: runtime

So, the format according to YAML should be:
performance_measures:
- runtime

The same issue holds for "maximize" and "performance_type".
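If we wanted the parser to be tolerant instead of (or in addition to) fixing the files, a tiny hypothetical helper could coerce such scalars into one-element lists:

def as_list(value):
    # Coerce a scalar YAML value into a one-element list; hypothetical helper
    # for fields the spec declares as lists (performance_measures, maximize,
    # performance_type, requires) but that some files give as a plain scalar.
    if value is None:
        return []
    return value if isinstance(value, list) else [value]

# as_list("runtime") -> ["runtime"]; as_list(["runtime"]) -> ["runtime"]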

@mlindauer

The same issue applies to feature_step->"requires" in some scenarios.
In ASP-POTASSCO it is fine:

    Dynamic-1:
        requires:
            - Static

In SAT11-HAND it is not OK:

    Basic:
      requires: Pre

@mlindauer

I updated the checker tool (and flexfolio).
Right now, the checker tool complains about the issues raised above.

@larskotthoff

Thanks, good catch. Could you fix the files please?

@mlindauer

Hi, I fixed it. All scenarios in the master branch are now compatible with the checker tool again.

However, I found another issue.
At some point, we agreed that we need an order of the feature steps. This was implicitly given by the order of the feature steps in description.txt.
Since we use YAML now, we encode the "feature_steps" as dictionaries:

feature_steps:
    Pre:
      provides:
        - nvarsOrig
       [...]
    Basic:
      requires: 
        - Pre

Parsing this file (at least with Python) will give you a dictionary without a defined order of the feature steps. So we either have to change "feature_steps" to a list (which would look unintuitive and ugly imho), or we add another list, such as "feature_step_order".
What do you think?

Cheers,
Marius

@larskotthoff

Just remind me: what is the order needed for? You can derive any ordering constraints from the provides/requires, right?
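For example, one valid order can be derived from requires alone with a standard topological sort -- a sketch (Python 3.9+, using the feature_steps dictionary layout quoted above; purely illustrative):

from graphlib import TopologicalSorter

def step_order(feature_steps):
    # feature_steps is the dict under the "feature_steps" key, e.g.
    # {"Pre": {...}, "Basic": {"requires": ["Pre"], ...}, ...}.
    graph = {
        name: set((spec or {}).get("requires") or [])
        for name, spec in feature_steps.items()
    }
    # static_order() yields prerequisites before the steps that require them;
    # it raises graphlib.CycleError if the requirements are cyclic.
    return list(TopologicalSorter(graph).static_order())

This gives one admissible order, not necessarily the order in which the data was actually generated.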

@mlindauer

If I remember correctly, the problem was the presolved feature steps.

  1. The features were computed in an (unknown) order, and if a feature step presolved an instance, the remaining feature steps were not computed (at least this is true for the ASP and SAT scenarios).
  2. We discussed the definition of the oracle at some point. If we want to include the feature steps as a possible algorithm for solving an instance (important for some scenarios) in the oracle definition, we have to know the right order of the feature steps, or else we have to solve an NP-hard problem (i.e., try all possible orders of feature steps) to find an optimal order.
    (3. There is more than one possible order if we only consider "requires".)

@larskotthoff

  1. Sounds to me like we should have a feature runstatus "not computed" then -- using the order to derive this is quite similar to how the dependencies were encoded: not at all obvious or intuitive, and bound to trip somebody up.
  2. I remember -- what conclusion did we come to? It seems fair enough to me that the oracle would be able to change the ordering of feature steps.
  3. I don't see that as a disadvantage.

@mlindauer

  1. Saying some features are "not computed" sounds even more unintuitive. Without an order of the feature steps, it is not explained why they were not computed. And so far we assume that the data is complete as long as there are no good reasons for missing data.
  2. I think we postponed the discussion and simply used the old definition of the oracle without considering feature steps.

@larskotthoff

Ok, so let's have a feature status "not computed because instance presolved by previous feature step". We don't need to know what that feature step was, do we?

@mlindauer

OK, I agree that we should have something like "not computed because instance presolved by previous feature step".
However, if we have such a status, I still think we should provide some more information about the order of the feature steps - at least the order in which they were generated; the user can still decide to use another order.
The arguments for such information are:

  1. We would know which step was responsible for this new status.
  2. The optimal order of the feature steps (-> presolved status) is exactly the order in which the features were generated. I don't see why users should have to figure this out by themselves if we already know it. (In the same way, it is also important for a new oracle definition, as mentioned before.)

@larskotthoff

Should the order of the feature steps used when generating the data for the scenarios be part of the metadata?

@mlindauer

Yes?

@larskotthoff

Ok, then let's do that.
