Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Media captions missed by new Android Whatsapp parser and UFDR chat parser #1555

Closed
gisms opened this issue Feb 28, 2023 · 13 comments
Closed
Assignees
Labels

Comments

@gisms
Copy link

gisms commented Feb 28, 2023

Hey!!

We decode a Whatsapp base from a ufdr file using both the internal and external parser. Extraction referring to iphone 14, Cellebrite - PA [7.60.1.]

We noticed that when decoding a forwarded message that contains image and text, the external decoder does not parse the text. The internal decoder works correctly, as shown in the image below.

image

@lfcnassif
Copy link
Member

lfcnassif commented Feb 28, 2023

What IPED version are you using? If 4.0 or later, could you send me the report.xml file in root folder of the UFDR (it's a zip) privately so I can reproduce?

@lfcnassif
Copy link
Member

lfcnassif commented Feb 28, 2023

Although I need the asked information, I took a look at one recent UFDR case here. Are you sure it isn't the opposite? Internal WhatsAppParser missing attachment titles and the external/UFED one is printing them? Seems to me it occurs when the message has some kind of text body. I think it is not related to ForwardedMessages, but it could be another issue, so sharing your triggering sample would help.

PS1: I detected the same thing with internal TelegramParser missing some attachment titles when text body already exists.

PS2: Conclusions above assume the UFED/UFDR results are correct, it isn't always true... Just analyzing the original WhatsApp/Telegram database to be sure.

@gisms
Copy link
Author

gisms commented Feb 28, 2023

Hi, I'm simulating the occurrence on a test cell phone here so that I can send you the ufdr file.

@lfcnassif
Copy link
Member

Thank you!

@gisms
Copy link
Author

gisms commented Mar 1, 2023

What IPED version are you using? If 4.0 or later, could you send me the report.xml file in root folder of the UFDR (it's a zip) privately so I can reproduce?

For the iphone, we tested both in 4.1 and 4.07 and in both the parser that came the text/legend of the photo was the iped parser. But we are searching for more images to see if its a pattern or not.
I'm doing tests on an LG and the behavior is different, and that one i will (can) send to you with more questions.

@gisms
Copy link
Author

gisms commented Mar 1, 2023

Update:

we confirm that we did not find a different behavior for the iphone. the parser that came the text/legend of the photo was the iped parser.

I sent you the email with the files about the LG test phone. (behavior is different)

@lfcnassif
Copy link
Member

Thank you very much @gisms for creating test data to reproduce the issue! I'll take a look soon.

@lfcnassif
Copy link
Member

lfcnassif commented Mar 9, 2023

Hi @gisms, I'm sorry for the delay here. I found some issues (like media caption missing) and some non implemented features in the internal parser for the "new" Android database schema. I'm trying to fix them and I'll notice it here.

PS: The issues I found are within the IPED parser, not with the Cellebrite one.

@lfcnassif lfcnassif self-assigned this Mar 9, 2023
@lfcnassif lfcnassif changed the title External Parser - Ufdr Whatsapp decoding issue New Android Whatsapp parser not decoding media captions Mar 9, 2023
@lfcnassif lfcnassif changed the title New Android Whatsapp parser not decoding media captions New android Whatsapp schema parser not decoding media captions Mar 9, 2023
@lfcnassif
Copy link
Member

lfcnassif commented Mar 9, 2023

Actually, the "new" schema is changing... There was no media_caption column in the database and captions were saved in text_data column of message table. In your test sample (thanks for creating it), there is a new media_caption column in message_media table, but it is always empty and captions are still into text_data. Possibly the new column will be used in the near future...

@lfcnassif
Copy link
Member

lfcnassif commented Mar 9, 2023

The issues I found are with the IPED parser, not with the Cellebrite one.

Update: there are a few (not all of them) messages from external parser missing captions indeed. And almost all media messages are duplicated, I'll open a separate issue for the duplication issue.

@lfcnassif lfcnassif changed the title New android Whatsapp schema parser not decoding media captions Media captions missed by new Android Whatsapp parser and UFED chat parser Mar 10, 2023
@lfcnassif
Copy link
Member

lfcnassif commented Mar 10, 2023

Hi @gisms, I think I fixed the issue you reported and some I've found, please test if this snapshot works for your real case:
https://github.com/sepinf-inc/IPED/actions/runs/4380667802
https://github.com/sepinf-inc/IPED/actions/runs/4381083188

Thanks

@lfcnassif
Copy link
Member

lfcnassif commented Mar 10, 2023

I tested again all fixes above, they look fine. Please test the last snapshot pointed out above and reopen if it doesn't work for your real case, thank you for reporting and providing test data to reproduce!

@gisms
Copy link
Author

gisms commented Mar 10, 2023 via email

@lfcnassif lfcnassif changed the title Media captions missed by new Android Whatsapp parser and UFED chat parser Media captions missed by new Android Whatsapp parser and UFDR chat parser Mar 13, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants