Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dehashed - junk data returns each time along with the valid results #50

Closed
emanuelstravinsky opened this issue Feb 11, 2020 · 3 comments
Labels

Comments

@emanuelstravinsky
Copy link

  • h8mail version:latest
  • Python version:
  • Operating System:Linux

Description

I configured the API keys for the dehashed.com service, and it all works well, but each time I make a query(any, email or domain) it returns a bunch of junk data(1000+) along with some valid results at the beginning.

@kkris9
Copy link

kkris9 commented Feb 12, 2020

Same here. The output shows every e-mail address from dehashed which contains the word "email".
fyi:
When you search for a domain on the dehashed website like they suggest (email:"*@domain.com"), you also receive over a million results (junk) which seems to be the same output like in h8mail. But when you search for *domain.com the results are correct.

@khast3x
Copy link
Owner

khast3x commented Feb 14, 2020

Hello all,

Yes the Dehashed API returns quite a lot of data. h8mail already does a first pass of cleaning blank results, but some results do still have a very verbose structure.
I had contacted the support while developing this version to mention the blank fields, and they answered the following:

The reason we keep the structure intact is because some people can't parse them in/out. We've had them removed before but people requested we keep the structure there so we decided to serve it even if it's blank/null.

I am not sure if this also applies to the junk data but it might. Going to notify support about this issue, will update.

@kkris9 noted for the globing, I'll check if it applies to the API too.

In the meantime, you can check the raw data returned by the API using --debug.

Cheers!

@khast3x
Copy link
Owner

khast3x commented Aug 5, 2020

Hello,

Fixed with PR #97 .
Run pip3 install --upgrade h8mail to update h8mail to the latest version (2.5.4).

Cheers!

@khast3x khast3x closed this as completed Aug 5, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants