JS: add query js/cleartext-logging · Pull Request #73 · github/codeql

ghost · 2018-08-20T06:39:52Z

This PR adds a taint analysis query that flags logging of sensitive data in clear text. Logged sensitive data may be stored, so this query is really just a sibling of js/clear-text-storage-of-sensitive-data, they therefore share a qhelp file.

Programmers construct log messages in very different ways, so this query is much more conservative that its sibling. In particular, his query considers "passwords" to be the only source of sensitive data.

Evaluation on our standard benchmarks reveal two true positives, and nothing else.

The performance seems fine for a new security query, here are the numbers for a comparison on the security suite: https://git.semmle.com/gist/esben/3e07f35b8aab1caff35eeab990615d19

xiemaisi

Great stuff! Looks like you put a lot of work into fine-tuning the heuristics.

A few suggestions, but overall the results speak for themselves.

xiemaisi · 2018-08-20T08:06:26Z

change-notes/1.18/analysis-javascript.md


 | **Query**                   | **Tags**  | **Purpose**                                                        |
 |-----------------------------|-----------|--------------------------------------------------------------------|
+| Clear text logging of sensitive information (`js/cleartext-logging`) | security, external/cwe/cwe-312, external/cwe/cwe-315, external/cwe/cwe-359 | Highlights logging of sensitive information, indicating a violation of [CWE-312](https://cwe.mitre.org/data/definitions/312.html). Results shown on lgtm by default. |


Here and below: "clear-text logging" should probably have a dash between "clear" and "text". Also, upper-case LGTM.

xiemaisi · 2018-08-20T08:08:11Z

javascript/ql/src/Security/CWE-312/CleartextLogging.ql

+/**
+ * Holds if `tl` is used in a browser environment.
+ */
+predicate inBrowserEnvironment(TopLevel tl) {


OOI, could this lead to false negatives for Electron apps?

I do not think that is a false negative.
This predicate is used to exclude alerts for logging that occurs on the user's own computer since that is innocent in practice.
Both browsers and electron apps (that use browser features) are run on the user's own computer.

Ah, you're right.

xiemaisi · 2018-08-20T08:08:46Z

javascript/ql/src/Security/CWE-312/CleartextLogging.ql

+/**
+ * Holds if `sink` only is reachable in a "test" environment.
+ */
+predicate inTestEnvironment(Sink sink) {


Couldn't we leave it to our file classification to hide results in test code?

This does not prevent results in test code, it prevents results like this:

if (environment.isTestEnv()) { console.log("Password is: " + password); // OK }

See the test:
https://github.com/Semmle/ql/pull/73/files#diff-48bc5394f61be303fa92728aec0a85a8R92

I see. To be honest, though, this predicate doesn't make me very happy. It looks very ad-hoc and brittle. Could we come up with something slightly more rigorous?

I have removed the check for now, it does not make a difference on our default benchmarks.

xiemaisi · 2018-08-20T08:09:54Z

javascript/ql/src/semmle/javascript/frameworks/Logging.qll

+}
+
+private string getAStandardLoggerMethodName() {
+  // log level names used in RFC5424, `npm`, `console`


Turn this into qldoc, since we have it anyway?

xiemaisi · 2018-08-20T08:10:08Z

javascript/ql/src/semmle/javascript/frameworks/Logging.qll

+  result = "info" or
+  result = "log" or
+  result = "notice" or
+  result = "silly" or


Yep.
https://docs.npmjs.com/misc/config#loglevel: "silent", "error", "warn", "notice", "http", "timing", "info", "verbose", "silly"

Oh, good. My favourite in this list is actually "http", though. Well done for not including it in this predicate.

xiemaisi · 2018-08-20T08:12:41Z