Refactor requirements file parsing #2697

qwcode · 2015-04-18T21:00:25Z

Two major changes:

1. Re-use the optparse options in pip.cmdoptions and build a standard optparse parser, instead of maintaining a custom parser. This improves consistency and reduces bugs when we make option changes, and should make the eventual click-based overhaul of our cli easier.
2. As a result of change 1, the call stack is simpler:
   - from: parse_requirements -> parse_content -> parse_line
   - to: parse_requirements -> process_line
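To make the first change concrete, here is a minimal sketch of building a standard optparse parser from shared option factories and using it to process one requirements-file line. The factory functions, `SUPPORTED_OPTIONS`, `build_parser`, and the body of `process_line` are hypothetical stand-ins for illustration, not the code in this PR.

```python
import optparse
import shlex


# Hypothetical stand-ins for the shared option definitions in pip.cmdoptions.
def index_url():
    return optparse.Option("-i", "--index-url", dest="index_url", metavar="URL")


def find_links():
    return optparse.Option("-f", "--find-links", dest="find_links",
                           action="append", metavar="URL")


SUPPORTED_OPTIONS = [index_url, find_links]


def build_parser():
    """Build a plain optparse parser from the shared option factories."""
    parser = optparse.OptionParser(add_help_option=False)
    for factory in SUPPORTED_OPTIONS:
        parser.add_option(factory())
    return parser


def process_line(line):
    """Split one requirements-file line into (options, requirement string)."""
    opts, args = build_parser().parse_args(shlex.split(line))
    return opts, " ".join(args)


opts, req = process_line("--index-url https://example.invalid/simple")
print(opts.index_url, repr(req))
```

Because the option objects come from one place, a change to an option definition would apply to both the command line and requirements files, which is the consistency benefit described above.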

Changes beyond 1 & 2:

- Adjusting the tests to match
- Adding more tests (specifically, testing every supported option, and a functional test that confirms #2537, "Add --install-options and --global-options to the requirements file parser")
- Minor cosmetics

There are some behavior changes:

- The parsing is now consistent with the cli parsing. For example, we previously allowed things like "-i=url", and there may have been other subtle differences.
- We now allow multiple instances of the same option (but not different options) on the same line, largely because there is no easy way to detect this using optparse. Is there a way?
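As an illustration of both behavior changes, the sketch below shows how a plain optparse parser treats "-i=url" and a repeated option. The two options are defined locally for the example and are only assumed to resemble the ones in pip.cmdoptions.

```python
import optparse

parser = optparse.OptionParser(add_help_option=False)
parser.add_option("-i", "--index-url", dest="index_url")
parser.add_option("-f", "--find-links", dest="find_links", action="append")

# optparse does not treat "=" as a separator after a short option,
# so the "=" stays at the front of the value.
short_opts, _ = parser.parse_args(["-i=https://example.invalid/simple"])

# The long form does accept "=" as a separator.
long_opts, _ = parser.parse_args(["--index-url=https://example.invalid/simple"])

# Repeating the same "append" option on one line is accepted and collected.
repeated_opts, _ = parser.parse_args(["-f", "a", "-f", "b"])

print(repr(short_opts.index_url))
print(repr(long_opts.index_url))
print(repeated_opts.find_links)
```

Since optparse itself does not report how often an option occurred, detecting repeats would need extra bookkeeping, for example a custom callback action.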

two major changes:

1) re-use the optparse options in pip.cmdoptions instead of maintaining a custom parser
2) as a result of #1, simplify the call stack
   from: parse_requirements -> parse_content -> parse_line
   to: parse_requirements -> process_line

beyond #1/#2, minor cosmetics and adjusting the tests to match


qwcode · 2015-04-21T02:45:51Z

@pfmoore thoughts? since you looked at this code recently?

pfmoore · 2015-04-21T08:46:36Z

I've had a quick look. I don't have much time over the next few days to do a full review, but the new code looks a lot clearer than the previous version, which is great. Processing looks fine.

There's a lot of changes to the tests, which I haven't looked at in detail but which seem to be mostly because they are testing internal functions of the old code, and so are no longer applicable. I'm not sure to what extent (either before or after this patch) we test the "higher level" aspects of requirement file processing (i.e. for various requirement files, does the parsing give the overall results that are expected), but that's not directly relevant to this patch - it may be useful to review as part of the more general "improve our tests" process. But for this patch, I'm assuming (for now) that test coverage remains sufficient.

I'll try to get time to do a more thorough review, but it might not be for a couple of weeks, so don't wait on me. Overall, though, the change looks good to me.

qwcode · 2015-04-21T16:12:08Z

thanks @pfmoore . I think the tests are overall improved in this PR. Specifically, the functional test now confirms the override behavior for real using "--prefix", i.e. it confirms a prefix value in the req file overrides the value in the cli. Also, there are now tests for every supported option.

thoughts @gvalkov ?

gvalkov · 2015-04-21T17:57:21Z

pip/req/req_file.py

+
+    if args:
+        # don't allow multiple requirements
+        if len(args) > 1:


Wouldn't this prevent requirement lines like the following from working?

    req1 >= 1.0  # args = ['req1', '>=', '1.0']

I think all positional arguments should be consumed first (i.e. the requirement specifier) and then only the options are to be passed to parse_args(). Something along the lines of:

    if ' --' in line:
        req, args = line.split(' --', 1)
        args = shlex.split('--%s' % args)
        opts, args = parser.parse_args(args)
    else:
        req = line
        opts = None
    if opts:
        ...
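A self-contained version of that suggestion might look like the following. The function name `split_requirement_and_options` and the locally defined `--install-option` are assumptions made for the example; this is a sketch of the idea, not code from the PR.

```python
import optparse
import shlex


def split_requirement_and_options(line, parser):
    """Treat everything before the first ' --' as the requirement specifier."""
    if " --" in line:
        req, rest = line.split(" --", 1)
        # Restore the "--" that split() consumed before tokenizing.
        opts, _ = parser.parse_args(shlex.split("--" + rest))
    else:
        req, opts = line, None
    return req.strip(), opts


parser = optparse.OptionParser(add_help_option=False)
# Hypothetical per-requirement option, defined here only for the example.
parser.add_option("--install-option", dest="install_options", action="append")

# The problem being described: naive tokenizing splits a spaced specifier.
print(shlex.split("req1 >= 1.0"))

req, opts = split_requirement_and_options(
    'req1 >= 1.0 --install-option="--prefix=/tmp"', parser)
print(req, opts.install_options)
```

The trade-off is that this only recognizes long options, since it keys on " --".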

qwcode · 2015-04-21

good catch. I think I'll remove the ">1" check altogether, and just pass all the args (as a joined string) into the InstallRequirement constructor. If someone really did try to put multiple requirements on one line, it will fail at that point. that's good enough I think.

and I'll certainly add a test for this.

gvalkov · 2015-04-21

That can work, yes. Hopefully nobody figures out that arguments and options can be interspersed :)

    pillow --install-option one >= --global-option=two 2.8.1
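The joined-string approach, and the interspersing it permits, can be sketched as follows. The helper `parse_line` and the two locally defined options are assumptions for illustration; the real code passes the joined string on to the InstallRequirement constructor, which is not modeled here.

```python
import optparse
import shlex

parser = optparse.OptionParser(add_help_option=False)
# Options defined locally for the example.
parser.add_option("--install-option", dest="install_options", action="append")
parser.add_option("--global-option", dest="global_options", action="append")


def parse_line(line):
    """Parse options anywhere on the line; rejoin what is left as the requirement."""
    opts, args = parser.parse_args(shlex.split(line))
    return " ".join(args), opts


# Options after the specifier: the spaced specifier survives the round trip.
req, opts = parse_line("pillow >= 2.8.1 --install-option one")

# Options interspersed with the specifier tokens are accepted just the same,
# because optparse allows interspersed arguments by default.
mixed_req, mixed_opts = parse_line(
    "pillow --install-option one >= --global-option=two 2.8.1")

print(req, opts.install_options)
print(mixed_req, mixed_opts.install_options, mixed_opts.global_options)
```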

qwcode · 2015-04-23

I'm conflicted here, but I guess I'm biased towards keeping it simple, and letting this possibility exist.

The custom parsing could get more complicated if we ever support short options, since we'd be dealing with both "--" and "-".

pfmoore · 2015-04-23

Agreed - keep it simple. If this does become a genuine issue, I'd suggest something more along the lines of

    opts, args = parser.parse_args(orig_args)
    if len(args) > 1:
        positions = [orig_args.index(a) for a in args]
        if (the values in positions aren't all next to each other):
            raise an error

But that's a costly check, so I wouldn't bother unless it's a genuine issue.
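For reference, the adjacency check in that pseudocode could be written out roughly like this. The helper name `positionals_are_contiguous` and the two options are hypothetical, and nothing in this thread indicates the check was adopted.

```python
import optparse


def positionals_are_contiguous(orig_args, args):
    """Return True if the positional args sit next to each other in orig_args."""
    if len(args) < 2:
        return True
    # Like the sketch above, index() finds the first match only, so a
    # repeated token could be located at the wrong position.
    positions = [orig_args.index(a) for a in args]
    return all(b - a == 1 for a, b in zip(positions, positions[1:]))


parser = optparse.OptionParser(add_help_option=False)
# Options defined locally for the example.
parser.add_option("--install-option", dest="install_options", action="append")
parser.add_option("--global-option", dest="global_options", action="append")

interspersed = ["pillow", "--install-option", "one", ">=",
                "--global-option=two", "2.8.1"]
_, interspersed_args = parser.parse_args(list(interspersed))

grouped = ["pillow", ">=", "2.8.1", "--install-option", "one"]
_, grouped_args = parser.parse_args(list(grouped))

print(positionals_are_contiguous(interspersed, interspersed_args))
print(positionals_are_contiguous(grouped, grouped_args))
```

A caller that wanted to enforce the rule would raise an error whenever the helper returns False.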

gvalkov · 2015-04-21T18:29:15Z

I really like this - it's definitely an improvement/simplification over the previous implementation.

Using spaces in the requirement specifier is allowed, right? I think I use it like that all the time (i.e. pillow >= 2.8.1). This PR would mandate that all specifiers be single words (i.e. pillow>=2.8.1). It's surprising that there wasn't a test to catch this.

Happy to hear that pip is considering click for its command-line interface - I find the sub-parser handling in it nicely done, but I'm not too excited about functions with up to 20 arguments (as would be the case for the install command).

qwcode · 2015-04-21T19:53:43Z

> Using spaces in the requirement specifier is allowed, right?

yes, the only caveat is the need to quote them when using them as cli arguments in the shell


From the [pip 7.0.0 release notes](https://pip.pypa.io/en/latest/news.html):

> - **BACKWARD INCOMPATIBLE** Requirements in requirements files containing markers must now be quoted due to parser changes from ([PR #2697](pypa/pip#2697)) and ([PR #2725](pypa/pip#2725)). For example, use `"SomeProject; python_version < '2.7'"`, not simply `SomeProject; python_version < '2.7'`
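One plausible reason for the quoting requirement is shell-style tokenizing of each line. The sketch below assumes the line is split with `shlex.split`, as in the review suggestion earlier in this thread; the thread itself does not show the final parser code, so treat this as an illustration only.

```python
import shlex

unquoted = "SomeProject; python_version < '2.7'"
quoted = "\"SomeProject; python_version < '2.7'\""

# Without outer quotes, the line is split on whitespace and the inner
# quotes around 2.7 are consumed by the tokenizer.
unquoted_tokens = shlex.split(unquoted)

# With outer double quotes, the whole requirement stays one token and the
# inner single quotes are preserved.
quoted_tokens = shlex.split(quoted)

print(unquoted_tokens)
print(quoted_tokens)
```

Rejoining the unquoted tokens yields a marker whose version string is no longer quoted, which would explain why such lines stopped working.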

qwcode added 12 commits April 16, 2015 22:10

- 4378718 process_line tests for setting attributes on the finder, and the fixes for problems that were found
- 49e9ac1 tests for option variants
- bed77a9 remove duplicate test
- 9c66633 fixes to TestOptionVarants; consistent use of the finder fixture
- b590ab4 tests for: joining lines, skipping regex, and appending options
- 285f71b Test --install-option in requirements file overrides same option in cli
- 2c5be94 inline the logic from the get_options_dest function; report the option string when using an incorrect option with a requirement
- 0a265de make the requirements file exceptions "InstallationError"'s so they don't error with tracebacks
- cfd6961 pep8 fix
- 5c4632f pep8 fixes
- 1ca1f10 pep8 fixes

gvalkov reviewed Apr 21, 2015

qwcode added 3 commits April 23, 2015 01:31

- 55e7bd3 handle requirement specifiers with spaces, e.g. "pkg >= 1"
- fdd10ad only calculate the dest strings once
- b911339 Merge remote-tracking branch 'pypa/develop' into refactor_req_file (Conflicts: pip/req/req_file.py)

qwcode added a commit that referenced this pull request Apr 23, 2015

- 31eb67d Merge pull request #2697 from qwcode/refactor_req_file (Refactor requirements file parsing)

qwcode merged commit 31eb67d into pypa:develop Apr 23, 2015

aapa mentioned this pull request May 31, 2015

pip7 migrate (requirements.txt marker format changed) lepinkainen/pyfibot#172

Closed

aapa mentioned this pull request May 31, 2015

Update requirements.txt to reflect syntax changes in pip 7.0.0 lepinkainen/pyfibot#173

Merged

rodcloutier mentioned this pull request Jul 8, 2015

Regression using backslash in requirements file with version 7.x #2966

Closed

jayvdb mentioned this pull request Aug 4, 2015

reported line numbers incorrect in 7.x #3009

Closed

lock bot added the auto-locked Outdated issues that have been locked by automation label Jun 4, 2019

lock bot locked as resolved and limited conversation to collaborators Jun 4, 2019
