Ankou fails during initialization #2

Rumata888 · 2020-09-10T23:31:30Z

Hi. I'm using the latest version of AFLPlusPlus with afl-clang-fast. The compiled harness definitely accepts input. However, for some reason ankou gives the following error during initialization for each case:
Problem when writing in control pipe: invalid argument
I run it with the following command:
./Ankou -app ./fuzzed_app -threads 1 -dict my.dict -i fuzz/ankou_input -o fuzz/ankou_output
Any ideas what could be the issue here?
Running with regular afl-fuzz works

Jiliac · 2020-09-11T05:18:51Z

Hey @Rumata888 . Thanks for trying out Ankou!

This is a problem of communication between Ankou and the compiled binary. Ankou intends to reproduce the way AFL works but there may be issues. Could you give more details on how you compiled this binary so I can reproduce and debug the issue?

vanhauser-thc · 2020-09-11T06:58:39Z

afl++ has an advanced feature where there can be a back-and-forth communication between target and afl-fuzz.
likely it was just an unlucky value that afl-fuzz sent and with retrying it might just work.

But I just added a check to only accept an afl-fuzz transfer request if it was compiled with LTO.
you have to checkout the dev branch though and recompile your target.

Rumata888 · 2020-09-11T11:45:06Z

HI. I don't see a dev branch @vanhauser-thc . It seems you haven't pushed the commit.
I had problems with compiling with lto, so I just used regular afl-clang-fast.
@Jiliac Tested with an example vulnerable app: https://raw.githubusercontent.com/mykter/afl-training/master/quickstart/vulnerable.c
Compiled it with afl. Same probllem.
And thank you. I really like the idea behind Ankou. That's why I wanted to try it out. Hope, I will be able to.

Rumata888 · 2020-09-11T12:18:40Z

Ah, @vanhauser-thc, sorry. You meant in afl++. Will try that.

Rumata888 · 2020-09-11T12:45:01Z

Still getting the same problem. Tried with afl-gcc and everything worked. Any ideas?

Jiliac · 2020-09-11T12:53:44Z

Just to confirm my understanding: Using vanilla AFL (not AFL++), you compiled with afl-gcc and it worked, but using afl-clang-fast Ankou crashed with the control pipe error mentioned above? afl-gcc was mostly used during the development of Ankou, so it's possible there is a problem with the clang version of the compiler. I will look at this more closely over the weekend.

vanhauser-thc · 2020-09-11T12:56:25Z

I tried a compiled binary with afl++ 3.00a with vanilla afl-fuzz from google and as expected this works fine.
seems to be an ankou issue

Rumata888 · 2020-09-11T13:00:40Z

I was using AFL++, not vanilla AFL (wanted to try ankou with laf-intel). afl-clang-fast failed on me, afl-gcc worked.

Jiliac · 2020-09-12T12:07:36Z

Actually, the whole communication with the binary is broken. The fork server doesn't even start correctly. I think we never tried to use afl-clang-fast as a compiler actually, so this might not be due to a specific feature of afl++.

2020/09/12 11:52:32 Fork server did not start correctly: signal 47.                                            
2020/09/12 11:52:32 Problem in starting frk srv                                                                
2020/09/12 11:52:32 Problem when writing in control pipe: invalid argument

This is the function doing the initialization in Ankou. As I remember, it was just a translation of the equivalent init_forkserver of vanilla AFL. I don't see that there is a special case of afl-clang-fast in AFL code. Do you know what I am missing @vanhauser-thc ?

vanhauser-thc · 2020-09-14T12:52:19Z

it looks to me like the forkserver control in fuzz-loop.go is implemented wrong.
intially there must be a read from afl-fuzz once the target is started (so a write happens in the target to the pipe).
then comes the endless loop afl-fuzz -> target (ready to send testcase), afl-fuzz <- target (sends pid), afl-fuzz <- target (done, return code)

the initial first read is not made and that might be the reason.

Jiliac · 2020-09-14T17:59:21Z

Thanks a lot for looking at it ❤️ ! Will try to get this fix before next week.

Jiliac · 2020-09-20T14:11:19Z

I got back in the code. Actually the read is done, but in the initForkserver function. With the gcc binary, this read goes along correctly; The status returned is 0. But in the case of the clang binary, the status returned is 0xc000002f, which as far as I can figure out with the man 2 wait page, means the fork server was killed with signal 47? But then, there is no signal 47 defined AFAIK?

vanhauser-thc · 2020-09-21T07:48:27Z

Ah that is the issue.
you interpret the value of the first received int. that int has no sensible value. it can be unassigned/random or just a zero.
afl++ uses this to inform afl-fuzz about activated features.
vanilla afl and spin-offs ignore the value of the byte so they dont have an issue.
your implementation assess the value it seems it checks it if it is 0 which is unnecessary.

The value returned by the fork server at the first read should not be considered at all. Before, Ankou was in effect panic-ing if the value was not nil. See this issue: #2

Jiliac · 2020-09-21T09:45:28Z

Ok thanks a ton for the help! Pushed a commit which removes this check. @Rumata888 tell me if now it works for you 😀.

vanhauser-thc · 2020-09-21T10:00:07Z

If you removed the check just for the first read on the forkserver pipe, but all other reads do the crash check - then it is the correct fix :)

vanhauser-thc · 2020-09-21T10:03:29Z

off-topic @Jiliac if you want to improve your benchmarks on fuzzbench then just use afl++ as the compiler for the targets. I recommend CLASSIC + CTX + laf-intel for that

export AFL_LLVM_LAF_ALL=1
export AFL_LLVM_INSTRUMENT=CLASSIC,CTX

Jiliac · 2020-09-21T10:30:00Z

Very good point on fuzzbench settings! I should include them. But fuzzbench doesn't accept Ankou in its benchmarks so far because Ankou gets too memory hungry when the PCA mode starts. I think it might be doable to optimize, but it'd be a lot of work.

Rumata888 · 2020-09-21T12:59:24Z

Ok thanks a ton for the help! Pushed a commit which removes this check. @Rumata888 tell me if now it works for you .

Yes, it works, thanks.

Rumata888 · 2020-09-21T12:59:51Z

off-topic @Jiliac if you want to improve your benchmarks on fuzzbench then just use afl++ as the compiler for the targets. I recommend CLASSIC + CTX + laf-intel for that
export AFL_LLVM_LAF_ALL=1
export AFL_LLVM_INSTRUMENT=CLASSIC,CTX

Why not LTO? Isn't it faster supposedly?

vanhauser-thc · 2020-09-21T13:19:04Z

LTO and pcguard in afl++ are collision free, which is an amazing thing, however requires that the fuzzer knows how large the map of the target is. that is why we transfer that special value in the beginning that made ankou crash.
no other fuzzer supports that currently so LTO and afl++'s pcguard compiled binaries can only be used with afl++'s afl-fuzz

(actually not fully true, if the target has less than 64k edges then they work fine with any afl spin-off. however there will be no speed increase. although the coverage will not be colliding anymore)

Rumata888 · 2020-09-22T14:38:33Z

LTO and pcguard in afl++ are collision free, which is an amazing thing, however requires that the fuzzer knows how large the map of the target is. that is why we transfer that special value in the beginning that made ankou crash.
no other fuzzer supports that currently so LTO and afl++'s pcguard compiled binaries can only be used with afl++'s afl-fuzz

(actually not fully true, if the target has less than 64k edges then they work fine with any afl spin-off. however there will be no speed increase. although the coverage will not be colliding anymore)

Thanks for the explanation.
Btw @Jiliac have you tested ANKOU in a multithreaded setting? In the paper only 1-core setting is mentioned. I seem to be having a problem with CPU utilization. It's at 70%. Does ANKOU analyze the results of each test 1 by 1?

Jiliac · 2020-09-22T20:08:42Z

Yes Ankou was developped first for multicore, and then adapted to run on only 1 core to be able to benchmark the same way other fuzzing paper did. Can you give more details on your setting? How many cores you are using? All of them are at 70% for long period of time? Is the memory full?

Rumata888 · 2020-09-23T08:59:02Z

I'm using it in a Linux Virtual machine with 8 cores dedicated to it out of 8 cores on the host. When I'm running 8 instances of afl or libfuzzer in fork mode with 8 threads, the CPU utilization is close to 100%. When I'm running ANKOU with 8 threads it averages around 70%. 7 GB of 32 GB of memory dedicated to the virtual machine are in use. It seems that there is a bottleneck.
What makes it worse is that Ankou has a tendency to increase test case length. As a result throughput suffers.

Jiliac · 2020-09-23T09:21:08Z

Very weird, I never had this problem before. What's your target? The one you mentioned above? Is it from the beginning, or after some time, like 5 to 10 minutes, when the PCA mode activates?

Rumata888 · 2020-09-23T10:22:31Z

A mail parsing library. When the PCA mode activates. It's been fuzzing for more than 30h now, but CPU usage rarely goes higher than 80%. Also, a large chunk of that usage is ANKOU itself, not the fuzzed application

Jiliac · 2020-09-23T15:00:44Z

Then probably the PCA process is the bottleneck. It can potentially happen for programs with too many branches but I have never seen it happen in practice until now. Would have to experiment directly on the same target you are to be sure.

Rumata888 · 2020-09-28T11:49:55Z

Well, it's a pity. I think it could be enhanced by using SIMD on x64 targets. Also, it seems, that for test cases where token repetitions result in loops it quickly becomes ineffective because of test case growth. So while the idea is nice, you can't use it everywhere.

Jiliac · 2020-09-28T12:42:12Z

Yes I looked into SIMD, but I didn't see a simple way to benefit from it for Go code. Seems like all the operations should be written manually :/

The "loop problem" should be handle by the "log transformation" used by AFL, shouldn't it? From AFL doc:

In addition to detecting new tuples, the fuzzer also considers coarse tuple
hit counts. These are divided into several buckets:

  1, 2, 3, 4-7, 8-15, 16-31, 32-127, 128+

This way each new iteration in a loop is less meaningful than the previous ones. Did you see this problem in practice?

Rumata888 · 2020-09-28T13:50:47Z

Log buckets prevent AFL from adding test-cases with slight increases in particular edge count. Ankou determines whether a test-case should be added by how different it is from the current basis. The problem with that is that it will be much more likely that a test-case is distant enough from the current basis if one of its independent edges has a maximized hit count. But the hit count is logarithmic, so the test-case grows larger.
EDIT: I think, it could actually help a little, if you translated them back to linear scale.

Jiliac · 2020-09-29T06:12:19Z

Ah very good point! Actually, for the implementation, we used the AFL insight in the PCA. When we create the vector on which is used in Ankou algorithm (PCA + fitness function), we actually use the logarithm of the hit count. You can see this in the code (tr is the hit count).

Rumata888 · 2020-10-02T11:53:56Z

Yup. I looked at it once again and it seems the problem was that initially aflplusplus generated several long test cases. Since ANKOU added them and they had unique coverage, it made it harder to add smaller inputs (they have to trigger very different edges to be inserted into the corpus when they don't have loops). I think Ankou would benefit from an initial check and a disclaimer, that its performance can degrade when presented with such initial corpus.

Jiliac added a commit that referenced this issue Sep 21, 2020

Fix bug in fork server initialization

4ef8c45

The value returned by the fork server at the first read should not be considered at all. Before, Ankou was in effect panic-ing if the value was not nil. See this issue: #2

Jiliac closed this as completed Oct 9, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Ankou fails during initialization #2

Ankou fails during initialization #2

Rumata888 commented Sep 10, 2020

Jiliac commented Sep 11, 2020

vanhauser-thc commented Sep 11, 2020

Rumata888 commented Sep 11, 2020

Rumata888 commented Sep 11, 2020

Rumata888 commented Sep 11, 2020

Jiliac commented Sep 11, 2020

vanhauser-thc commented Sep 11, 2020

Rumata888 commented Sep 11, 2020 •

edited

Jiliac commented Sep 12, 2020

vanhauser-thc commented Sep 14, 2020

Jiliac commented Sep 14, 2020

Jiliac commented Sep 20, 2020

vanhauser-thc commented Sep 21, 2020

Jiliac commented Sep 21, 2020

vanhauser-thc commented Sep 21, 2020

vanhauser-thc commented Sep 21, 2020

Jiliac commented Sep 21, 2020

Rumata888 commented Sep 21, 2020

Rumata888 commented Sep 21, 2020

vanhauser-thc commented Sep 21, 2020 •

edited

Rumata888 commented Sep 22, 2020

Jiliac commented Sep 22, 2020 •

edited

Rumata888 commented Sep 23, 2020 •

edited

Jiliac commented Sep 23, 2020

Rumata888 commented Sep 23, 2020

Jiliac commented Sep 23, 2020

Rumata888 commented Sep 28, 2020

Jiliac commented Sep 28, 2020

Rumata888 commented Sep 28, 2020 •

edited

Jiliac commented Sep 29, 2020

Rumata888 commented Oct 2, 2020

Ankou fails during initialization #2

Ankou fails during initialization #2

Comments

Rumata888 commented Sep 10, 2020

Jiliac commented Sep 11, 2020

vanhauser-thc commented Sep 11, 2020

Rumata888 commented Sep 11, 2020

Rumata888 commented Sep 11, 2020

Rumata888 commented Sep 11, 2020

Jiliac commented Sep 11, 2020

vanhauser-thc commented Sep 11, 2020

Rumata888 commented Sep 11, 2020 • edited

Jiliac commented Sep 12, 2020

vanhauser-thc commented Sep 14, 2020

Jiliac commented Sep 14, 2020

Jiliac commented Sep 20, 2020

vanhauser-thc commented Sep 21, 2020

Jiliac commented Sep 21, 2020

vanhauser-thc commented Sep 21, 2020

vanhauser-thc commented Sep 21, 2020

Jiliac commented Sep 21, 2020

Rumata888 commented Sep 21, 2020

Rumata888 commented Sep 21, 2020

vanhauser-thc commented Sep 21, 2020 • edited

Rumata888 commented Sep 22, 2020

Jiliac commented Sep 22, 2020 • edited

Rumata888 commented Sep 23, 2020 • edited

Jiliac commented Sep 23, 2020

Rumata888 commented Sep 23, 2020

Jiliac commented Sep 23, 2020

Rumata888 commented Sep 28, 2020

Jiliac commented Sep 28, 2020

Rumata888 commented Sep 28, 2020 • edited

Jiliac commented Sep 29, 2020

Rumata888 commented Oct 2, 2020

Rumata888 commented Sep 11, 2020 •

edited

vanhauser-thc commented Sep 21, 2020 •

edited

Jiliac commented Sep 22, 2020 •

edited

Rumata888 commented Sep 23, 2020 •

edited

Rumata888 commented Sep 28, 2020 •

edited