SMTS submission 2025 #172

Tomaqa · 2025-06-12T09:10:16Z

No description provided.

github-actions · 2025-06-12T09:19:48Z

Summary of modified submissions

SMTS

3 authors
website: https://github.com/usi-verification-and-security/SMTS
Participations
- Parallel
  - QF_Equality
    - all
  - QF_Equality+LinearArith
    - ~~QF_UFDTLIA~~
    - ~~QF_UFDTLIRA~~
  - QF_LinearIntArith
    - ~~QF_LIRA~~
  - QF_LinearRealArith
    - all

#189: UltimateEliminator submission 2025 #188: Z3-Siri Submission 2025 #187: OSTRICH version 2 #186: yicesQS submission to the 2025 SMT comp #185: Bitwuzla 2025 submission. #184: Yices2 Submission SMTCOMP 2025 #183: cvc5 for SMT-COMP 2025 #182: Create iProver #181: Z3-Owl Submission 2025 #179: Z3-alpha SMT-COMP 2025 #178: Z3-Noodler-Mocha Submission for SMT-COMP 2025 #177: `bv_decide` submission 2025 #176: OpenSMT (min-ucore) submission 2025 #175: Z3-Noodler submission 2025 #172: SMTS submission 2025 #171: Bitwuzla-MachBV Submission for SMT-COMP 2025 #170: Z3-Parti-Z3++ Submission for SMT-COMP 2025 #169: STP-Parti-Bitwuzla Submission for SMT-COMP 2025 #168: SMTInterpol submission 2025 #167: OpenSMT submission 2025 #165: Amaya 2025 #164: SMT-RAT submission #163: COLIBRI submission #162: [Submission] colibri2 #156: upload z3-inc-z3++

martinjonas · 2025-06-27T06:18:16Z

@Tomaqa Thanks for submitting SMTS to SMT-COMP 2025! We have tried running test runs and we ran into few technical problems.

The path to the executable in the command field has to be relative to the directory where the archive was unpacked and the arguments should be separate items in the list. Ideally also do not call python3 executable, but directly your solver's entry point. And there is no variable <path>. When I changed the command in your submission from
```
"command": ["python3 ./server/smts.py -l -p -o 256", "-fp <path>"],
```
to
```
"command": ["./SMTS/server/smts.py", "-l", "-p", "-o 256", "-fp"],
```
I was able to execute your solver on our infrastructure. Can you make such a change in your submission? Thanks!
We do not have git installed on the worker machines in our cluster. So the import from version import version in server/smts.py fails because it calls version.sh, which calls git, which fails. Can you remove the dependency on git? We are not administrators of the cluster and cannot install git to all of the machines.
When I fixed that, I was able to execute the solver and get the results. You can find the results here: https://www.fi.muni.cz/~xjonas/smtcomp/tables/smts_parallel.table.html#/table

Unfortunately, the solver currently does not produce the output in SMT-LIB compliant format. For example, on the benchmark QF_UFIDL/QF_UFIDL_20210312-Bouvier_vlsat3_f10.smt2 , the output is
```
/tmp/vcloud_worker_vcloud-master_on_vcloud-master/run_dir_94a9aa27-0d02-4702-8db4-c6e781b870d1/unpack/1f46314b1d932a7a15eb21c43350fa85a0a85f4ad82c5d522615a4c528eb8da5/SMTS/server/utils.py:184: SyntaxWarning: invalid escape sequence '\{'
  s = re.sub('(\{[0-9]*\})', lambda x: strings[x.group(1)], s)
scrambled16421.smt2 unsat 24.13
```
instead of just
```
unsat
```

We are going to announce a deadline extension until the end of Sunday (GMT) soon. If you update the submission by then, I can rerun the test runs.

Tomaqa · 2025-06-27T14:30:35Z

I hope I fixed all your comments. Thank you for re-running the tests!

martinjonas · 2025-06-27T18:13:35Z

@Tomaqa Thanks a lot for all of the changes! Good news is that it solves almost all of the problems. The execution still does not work, but if you split all the command line arguments, the execution goes through (we have to document the JSON better next year and also provide CI for the Parallel track). I.e., please change

"command": ["./SMTS/server/smts.py", "-l", "-p", "-pt 2", "-o 256", "-fp"],

to

"command": ["./SMTS/server/smts.py", "-l", "-p", "-pt", "2", "-o", "256", "-fp"],

I have done that and ran the test runs. The results are here: https://www.fi.muni.cz/~xjonas/smtcomp/tables/smts_parallel.table.html#/table

The worse news is that the solver returns 4 incorrect results, i.e., returns unsat (false) on benchmarks with expected status sat (true). You can find the benchmarks in the table marked by the status in red font.

For the investigation, you can find the scrambled benchmarks here: https://www.fi.muni.cz/~xjonas/smtcomp/benchmarks/parallel.tar.gz . The archive contains for each original benchmark a file ORIGINAL_NAME.yml, in which you can find the name of the scrambled benchmark.

If you think that there is an issue on our side, please let me know.

Tomaqa · 2025-06-27T22:22:30Z

Thank you for the update. I indeed reproduced one of the bugs and observed that it works properly when using just 64 solvers. We will need to investigate - after the competition.
I updated the submission accordingly. Could you please run the tests once again? I hope it will be fine now.
Thanks.

martinjonas · 2025-06-28T14:55:39Z

Thanks for the update, you can find the new results on the same link as before. The execution is working fine now. Unfortunately, there are still two incorrect answers remaining. :/

Again, if you think that the problem is on our side or the benchmark has wrong expected result, let me know. Also let me know if you want another test run with a fixed version. We still have some time left and the test executions are quite quick.

Tomaqa · 2025-06-29T10:59:15Z

I tried the above link (https://www.fi.muni.cz/~xjonas/smtcomp/tables/smts_parallel.table.html#/table) but even though it shows the date 06-28, there are more incorrect results than two, and the one that I supposedly fixed does not seem to be fixed (QF_LRA_miplib_danoint-66). Are the results at that link up-to-date or not?
If not, can you tell me on which two instances it failed?

martinjonas · 2025-06-29T12:29:31Z

Sorry for the confusion, I must have uploaded some inconsistent mix of two result sets. To be sure, I ran the tests again and updated the results. There is still one incorrect result in the latest run.

Tomaqa · 2025-06-29T22:01:09Z

I hope I found the culprit. Even if not, let's use this anyway.

Tomaqa force-pushed the smts25 branch 3 times, most recently from c562d3c to 5d5d6a1 Compare June 12, 2025 14:24

bobot added the submission Submissions for SMT-COMP label Jun 14, 2025

Tomaqa force-pushed the smts25 branch from 5d5d6a1 to 74bf0d9 Compare June 25, 2025 08:58

Tomaqa added 2 commits June 25, 2025 11:05

SMTS submission 2025

f617926

SMTS submission 2025 - final

a7bf544

Tomaqa force-pushed the smts25 branch from 74bf0d9 to a7bf544 Compare June 25, 2025 09:30

Tomaqa force-pushed the smts25 branch from e09c4fd to ed43416 Compare June 27, 2025 22:18

Tomaqa force-pushed the smts25 branch from ed43416 to 8c04e09 Compare June 29, 2025 21:55

SMTS submission 2025 - final (corrected version)

08d741f

Tomaqa force-pushed the smts25 branch from 8c04e09 to 08d741f Compare June 29, 2025 21:59

martinjonas merged commit c2d2cf9 into SMT-COMP:master Jun 30, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

SMTS submission 2025 #172

SMTS submission 2025 #172

Uh oh!

Tomaqa commented Jun 12, 2025

Uh oh!

github-actions bot commented Jun 12, 2025

SMTS

Uh oh!

martinjonas commented Jun 27, 2025

Uh oh!

Tomaqa commented Jun 27, 2025

Uh oh!

martinjonas commented Jun 27, 2025

Uh oh!

Tomaqa commented Jun 27, 2025

Uh oh!

martinjonas commented Jun 28, 2025

Uh oh!

Tomaqa commented Jun 29, 2025 •

edited

Loading

Uh oh!

martinjonas commented Jun 29, 2025

Uh oh!

Tomaqa commented Jun 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

SMTS submission 2025 #172

SMTS submission 2025 #172

Uh oh!

Conversation

Tomaqa commented Jun 12, 2025

Uh oh!

github-actions bot commented Jun 12, 2025

SMTS

Uh oh!

martinjonas commented Jun 27, 2025

Uh oh!

Tomaqa commented Jun 27, 2025

Uh oh!

martinjonas commented Jun 27, 2025

Uh oh!

Tomaqa commented Jun 27, 2025

Uh oh!

martinjonas commented Jun 28, 2025

Uh oh!

Tomaqa commented Jun 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

martinjonas commented Jun 29, 2025

Uh oh!

Tomaqa commented Jun 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Tomaqa commented Jun 29, 2025 •

edited

Loading