HTTP Crawler: don't expect page object for msg #16668

sempervictus · 2022-06-11T01:45:24Z

The `crawler_process_page` method in HttpCrawler assumes that the
`page` object passed into it is never nil when formatting the
`msg` string for printing to the console.
Address that assumption with a ternary check, keeping the `|| "ERR"`
fallback for a nil `page.code` inside the branch that is taken
when `page` is not nil.

Testing:
`Error accessing page undefined method '[]' for nil:NilClass` is
no longer raised when scanning an odd HTTP service.
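
To illustrate the ternary described above, here is a minimal sketch. It is not the code from the PR: `StubPage`, `format_crawl_msg`, and the message layout are hypothetical stand-ins, assuming only that the page object exposes `code` and `url`.

```ruby
# Hypothetical stand-in for the crawler's page object (assumption: it has code and url).
StubPage = Struct.new(:code, :url)

# Hypothetical helper; the real method builds a longer console message.
def format_crawl_msg(page, cnt)
  # Ternary guards against a nil page; `|| "ERR"` still covers a nil page.code.
  code = page ? (page.code || "ERR") : "ERR"
  url  = page ? page.url : "(no page)"
  format("[%05d] %s - %s", cnt, code, url)
end

puts format_crawl_msg(StubPage.new(200, "http://example.test/"), 1)
puts format_crawl_msg(StubPage.new(nil, "http://example.test/odd"), 2)
puts format_crawl_msg(nil, 3)
```

With this shape, a nil page and a page with a nil code both fall back to "ERR" instead of raising NoMethodError.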

lib/msf/core/auxiliary/http_crawler.rb

gwillcox-r7 · 2022-06-15T16:03:23Z

This is actually a bug in two places in the code, as it seems that modules/auxiliary/scanner/http/crawler.rb overrides the definition at

metasploit-framework/modules/auxiliary/scanner/http/crawler.rb

Lines 57 to 65 in 024da20

    #
    # The main callback from the crawler, redefines crawler_process_page() as
    # defined by Msf::Auxiliary::HttpCrawler
    #
    # Data we will report:
    # - The path of any URL found by the crawler (web.uri, :path => page.path)
    # - The occurence of any form (web.form :path, :type (get|post|path_info), :params)
    #
    def crawler_process_page(t, page, cnt)

and also has the same bug that is noted in the library.
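
The reason a library-only fix does not reach the module can be sketched as follows. This is a simplified illustration, not the framework code: `CrawlerLib`, `CrawlerModule`, and `DemoPage` are hypothetical names, and the method takes one argument instead of the real three.

```ruby
# Hypothetical page stand-in.
DemoPage = Struct.new(:url)

# Stands in for the library mixin, with the nil check already applied.
module CrawlerLib
  def crawler_process_page(page)
    return "skipped" if page.nil?
    "lib: #{page.url}"
  end
end

# Stands in for the scanner module: it includes the mixin but redefines the
# method, so the mixin's nil check is never reached.
class CrawlerModule
  include CrawlerLib

  def crawler_process_page(page)
    "module: #{page.url}" # raises NoMethodError when page is nil
  end
end

begin
  CrawlerModule.new.crawler_process_page(nil)
rescue NoMethodError => e
  puts "override still fails: #{e.class}"
end
```

Because Ruby resolves the method on the class before the included module, both definitions need the nil handling.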

gwillcox-r7 · 2022-06-15T16:31:09Z

Alright, I think the best solution to this would be to add a guard clause. The change you implemented is also valid for handling an invalid page.code, so I'll keep that in.
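
A guard clause of the kind proposed here might look like the sketch below. The names `GuardedPage` and `process_page` are hypothetical, and the early return value is an assumption; the actual change is in commit d20fa45.

```ruby
# Hypothetical page stand-in (assumption: it has code and url).
GuardedPage = Struct.new(:code, :url)

# Hypothetical simplified callback.
def process_page(page, cnt)
  # Guard clause: return early instead of dereferencing a nil page.
  return nil if page.nil?

  # The existing fallback still handles a page whose code is nil.
  code = page.code || "ERR"
  format("[%05d] %s - %s", cnt, code, page.url)
end

puts process_page(GuardedPage.new(404, "http://example.test/missing"), 7)
puts process_page(nil, 8).inspect
```

The guard keeps the rest of the method free of repeated nil checks, while the `|| "ERR"` fallback remains for the separate case of a nil status code.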

gwillcox-r7 · 2022-07-21T21:35:03Z

> This actually is a bug in two places in the code as it seems that modules/auxiliary/scanner/http/crawler.rb actually overrides the definition at metasploit-framework/modules/auxiliary/scanner/http/crawler.rb (lines 57 to 65 in 024da20) and also has the same bug that is noted in the library.

Fixed this in d20fa45

gwillcox-r7 · 2022-07-21T23:40:19Z

Release Notes

A bug has been fixed in the HTTP crawler module and its associated library whereby the code expected an object to be populated when it may not be. This has been fixed with additional validation.

adfoster-r7 reviewed Jun 11, 2022 (lib/msf/core/auxiliary/http_crawler.rb)

gwillcox-r7 self-assigned this Jun 13, 2022

Add in guard clause to check that page isn't nil before trying to use it for processing pages (d20fa45)

adfoster-r7 reviewed Jun 15, 2022 (lib/msf/core/auxiliary/http_crawler.rb)

gwillcox-r7 added the module, library, and bug labels Jun 16, 2022

gwillcox-r7 added the rn-fix release notes fix label Jul 21, 2022

gwillcox-r7 merged commit abe90c1 into rapid7:master Jul 21, 2022

gwillcox-r7 mentioned this pull request Jul 21, 2022

Revert "HTTP Crawler: don't expect page object for msg" #16808

Closed
