Dangerous network data parsing using eval() #30

RemiCardona · 2012-11-13T14:27:12Z

Using eval() to parse data received from the network is a huge security hole. There are 2 ways to fix this:

the ast.literal_eval() function should happily parse Solr's pythonic output,
use the json module on all python versions since mysolr seems to support python 2.6 and up.

I have opted for the second solution since the code being already there, I assumed it worked properly on newer python versions.

NB: I have tested this patch on python 2.6 against solr 3.6, on Debian Squeeze.

The text was updated successfully, but these errors were encountered:

the 'json' module is in python's standard library since 2.6 and using eval() is a huge security risk.

moliware · 2012-11-13T14:47:55Z

Hi,

Thanks for contributing! We started to use eval because evaluating a python expression is much faster than parsing a json or xml.

It's true that is less secure but we gain a lot of speed, so I think I will use your first way that it is also safer than using eval directly.

Thanks!!

RemiCardona · 2012-11-13T15:47:38Z

Hi Miguel,

I just did a couple of tests on a 5000-row resultset coming from solr. A friend of mine suggested I add anyjson to the list. Here's what I got:

eval(): ~0.5s
ast.literal_eval(): ~1s
json.loads(): ~1.2s
anyjson: ~0.1s

That last test just blew me off, so I triple-checked and anyjson is just a simple wrapper over various json libraries and it used simplejson on my test machine.

So I would definitely suggest using anyjson (which is available on pypi) or simplejson directly using a simple try/except ImportError.

Cheers

moliware · 2012-11-13T15:51:32Z

Wow, I'm going to test it and include anyjson.

Thank you so much for your contribution

RemiCardona · 2012-11-13T16:27:16Z

The updated patch works fine on my system. Cheers.

moliware · 2012-11-14T14:06:55Z

Hi,

From anyjson doc:

"Anyjson loads whichever is the fastest JSON module"

So, I will try each json library for seeing what library is the fastest in order to include this in the documentation. Then I will merge the pull request.

Thanks!

RemiCardona pushed a commit to RemiCardona/mysolr that referenced this issue Nov 13, 2012

Never use eval() to parse data from the network, closes RedTuna#30

ef323fc

the 'json' module is in python's standard library since 2.6 and using eval() is a huge security risk.

RemiCardona closed this as completed Nov 13, 2012

RemiCardona reopened this Nov 13, 2012

moliware closed this as completed in dead720 Nov 24, 2012

marts mentioned this issue Mar 23, 2014

Release new version to pypi #37

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dangerous network data parsing using eval() #30

Dangerous network data parsing using eval() #30

RemiCardona commented Nov 13, 2012

moliware commented Nov 13, 2012

RemiCardona commented Nov 13, 2012

moliware commented Nov 13, 2012

RemiCardona commented Nov 13, 2012

moliware commented Nov 14, 2012

Dangerous network data parsing using eval() #30

Dangerous network data parsing using eval() #30

Comments

RemiCardona commented Nov 13, 2012

moliware commented Nov 13, 2012

RemiCardona commented Nov 13, 2012

moliware commented Nov 13, 2012

RemiCardona commented Nov 13, 2012

moliware commented Nov 14, 2012