Skip to content

Releases: pygments/pygments

Compare
Choose a tag to compare
  • Added lexers:

  • Updated lexers:

    • Ada: support Ada 2022 (#2121); disable recognition of namespaces because it disturbs lexing of aspects (#2125)
    • Agda: allow straight quotes in module names (#2163)
    • C family (C, C++ and many others): allow comments between elements of function headers, e.g. between the arguments and the opening brace for the body (#1891)
    • C++: Resolve several cases of Error tokens (#2207, #2208)
    • Coq: add some common keywords, improve recognition of Set and qualified identifiers (#2158)
    • F*: Allow C-style comments anywhere in a line
    • Fortran: fix catastrophic backtracking with backslashes in strings (#2194)
    • Go: add support for generics (#2167)
    • Inform: update for version 6.40 (#2190)
    • Isabelle: recognize cartouches (#2089)
    • Java: support multiline strings aka. text blocks (#2132)
    • Kotlin: add value modifier (#2142)
    • LilyPond: add some missing builtins
    • Macaulay2: update builtins (#2139)
    • Matlab session: fix traceback when a line continuation ellipsis appears in the output (#2166)
    • .NET: add aliases for LibreOffice Basic, OpenOfficeBasic and
      StarOffice Basic (#2170)
    • Nim: use Name.Builtin instead of Keyword.Type (#2136)
    • PHP: fix \"$var\" inside strings (#2105)
    • Python: only recognize \N, \u and \U escape sequences in string literals, but not in bytes literals where they are not supported (#2204)
    • Tcl: support ${name} variables (#2145)
    • Terraform: accept leading whitespace for << heredoc delimiters (#2162)
    • Teraterm: Various improvements (#2165)
    • Spice: add support for the recently added features including more builtin functions and bin, oct, hex number formats (#2206)
  • Added styles:

  • Pygments now tries to use the importlib.metadata module to discover plugins instead of the slower pkg_resources (#2155). In particular, this largely speeds up the pygmentize script when the lexer is not specified.

    importlib.metadata is only available in the Python standard library since Python 3.8. For older versions, there exists an importlib_metadata backport on PyPI. For this reason, Pygments now defines a packaging extra plugins, which adds a requirement on importlib_metadata if the Python version is older than 3.8. Thus, in order to install Pygments with optimal plugin support even for old Python versions, you should do:

    pip install pygments[plugins]
    

    Pygments still falls back on pkg_resources if neither importlib.metadata nor importlib_metadata is found, but it will be slower.

  • Silently ignore BrokenPipeError in the command-line interface (#2193).

  • The HtmlFormatter now uses the linespans attribute for anchorlinenos if the lineanchors attribute is unset (#2026).

  • The highlight, lex and format functions no longer wrongly report "argument must be a lexer/formatter instance, not a class" in some cases where this is not the actual problem (#2123).

  • Fix warnings in doc build (#2124).

  • The codetagify filter now recognizes FIXME tags by default (#2150).

  • The pygmentize command now recognizes if the COLORTERM environment variable is set to a value indicating that true-color support is available. In that case, it uses the TerminalTrueColorFormatter by default (#2160)

  • Remove redundant caches for filename patterns (#2153)

  • Use new non-deprecated Pillow API for text bounding box in ImageFormatter
    (#2198)

  • Remove default_style (#930, #2183)

  • Stop treating DeprecationWarnings as errors in the unit tests (#2196)

Compare
Choose a tag to compare
  • Added lexers:

    • Cplint (#2045)
    • Macaulay2 (#1791)
    • Minecraft (#2107)
    • Qlik (#1925)
    • UnixConfigLexer for "colon-separated" config files, like /etc/passwd (#2112)
  • Updated lexers:

    • Agda: Update keyword list (#2017)

    • C family: Fix identifiers after case statements (#2084)

    • Clojure: Highlight ratios (#2042)

    • Csound: Update to 6.17 (#2064)

    • CSS: Update the list of properties (#2113)

    • Elpi:

    • Futhark: Add missing tokens (#2118)

    • Gherkin: Add But (#2046)

    • Inform6: Update to 6.36 (#2050)

    • JSON: Support C comments in JSON (#2049). Note: This doesn't mean the JSON parser now supports JSONC or JSON5 proper, just that it doesn't error out when seeing a /* */ or // style comment. If you need proper comment handling, consider using the JavaScript lexer.

    • LilyPond:

      • Fix incorrect lexing of names containing a built-in (#2071)
      • Fix properties containing dashes (#2099)
    • PHP: Update builtin function and keyword list (#2054, #2056)

    • Scheme: Various improvements (#2060)

    • Spice: Update the keyword list, add new types (#2063, #2067)

    • Terraform:

      • Support non-idiomatic comments (#2065, #2066)
      • Fix class name lexing (#2097)
  • Add plugins argument to get_all_lexers().

  • Bump minimal Python version to 3.6 (#2059)

  • Fix multiple lexers marking whitespace as Text (#2025)

  • Remove various redundant uses of re.UNICODE (#2058)

  • Associate .resource with the Robot framework (#2047)

  • Associate .cljc with Clojure (#2043)

  • Associate .tpp with C++ (#2031)

  • Remove traces of Python 2 from the documentation (#2039)

  • The native style was updated to meet the WCAG AAA contrast guidelines (#2038)

  • Fix various typos (#2030)

  • Fix Groff formatter not inheriting token styles correctly (#2024)

  • Various improvements to the CI (#2036)

  • The Ada lexer has been moved to a separate file (#2117)

  • When linenos=table is used, the <table> itself is now wrapped with a <div class="highlight"> tag instead of placing it inside the <td class="code"> cell (#632.) With this change, the output matches the documented behavior.
    ⚠️ If you have subclassed HtmlFormatter.wrap, you may have to adjust the logic.

Compare
Choose a tag to compare
  • Updated lexers:

    • C-family: Fix incorrect handling of labels (#2022, #1996, #1182)
    • Java: Fixed an issue with record keywords result in Error tokens in some cases (#2018)
  • Fix links to line numbers not working correctly (#2014)

  • Remove underline from Whitespace style in the Tango theme (#2020)

  • Fix IRC and Terminal256 formatters not backtracking correctly for custom token types, resulting in some unstyled tokens (#1986)

Compare
Choose a tag to compare
  • Updated lexers:

    • C-family: Handle return types with multiple tokens (e.g. unsigned int) (#2008)
    • JSON: Fix a regression which caused whitespace before : to result in Error tokens (#2010)
    • SPICE: Various improvements (#2009)
Compare
Choose a tag to compare
  • Added lexers:

  • Updated lexers:

    • ABNF: Allow one-character rules (#1804)
    • Assembly: Fix incorrect token endings (#1895, #1961)
    • Bibtex: Distinguish between comment and commentary (#1899, #1806)
    • C family: Support unicode identifiers (#1848)
    • CDDL: Fix slow lexing speed (#1959)
    • Debian control: Add missing fields (#1946)
    • Devicetree: Recognize hexadecimal addresses for nodes (#1949)
    • GDScript: Add void data type (#1948)
    • GSQL
      • Fix comment handling (#2002)
      • Fix catastrophic backtracking (#2006)
    • HTML, XML: Improve comment handling (#1896)
    • Java: Add yield (#1941) and sealed classes/record (#1902)
    • Makefiles (#1860, #1898)
    • objdump-nasm: Improve handling of --no-show-raw-insn dumps (#1981)
    • Prolog: Support escaped \ inside quoted strings (#1479)
    • Python:
      • Support ~ in tracebacks (#2004)
      • Support the pattern matching keywords (#1797, #1994)
    • RobotFramework: Improve empty brace handling (#1921, #1922)
    • Terraform
      • Add the 'set' type (#1909)
      • Support heredocs (#1909)
  • Added styles:

  • There is new infrastructure in place to improve style accessibility. The default style has been updated to conform to WCAG recommendations. All styles are now checked for sufficient contrast by default to prevent regressions. (#1919, #1937, #1938, #1940)

  • Clean up unused imports (#1887)

  • Fix multiple lexers producing repeated single-character tokens

  • Fix multiple lexers marking whitespace as Text (#1237, #1905, #1908, #1914, #1911, #1923, #1939, #1957, #1978)

  • Remove duplicated assignments in the Paraiso style (#1934)

  • pygmentize supports JSON output for the various list functions now, making it easier to consume them from scripts. (#1437, #1890)

  • Use the shell lexer for kshrc files (#1947)

  • Use the ruby lexer for Vagrantfile files (#1936)

  • Use the C lexer for .xbm and .xpm files (#1802)

  • Add a groff formatter (#1873)

  • Update documentation (#1928)

  • Line anchors now link to themselves (#1973)

  • Add official support for Python 3.10 (#1917)

  • Fix several missing colors in dark styles: Gruvbox dark, Monokai, Rrt, Sas, Strata dark (#1955)

  • Associate more file types with man pages

  • The HtmlFormatter can now emit tooltips for each token to ease debugging of lexers (#1822)

  • Add f90 as an alias for fortran (#2000)

Compare
Choose a tag to compare
  • Added lexers:

  • Updated lexers:

  • Fix assert statements in TNT lexer.

  • Token types across all lexers have been unified (using the most common token
    type name) (#1816, #1819)

  • Improve Jasmin min score analysis (#1619)

  • Add new alias for Go files (#1827)

  • Fix multi-line console highlighting (#1833)

  • Add a new trivial lexer which outputs everything as Text.Generic.Output (#1835, #1836)

  • Use the .ini lexer for systemd files (#1849)

  • Fix a FutureWarning related to words() (#1854)

  • pwsh is now recognized as an alias for PowerShell (#1876)

Compare
Choose a tag to compare
  • Added lexers:

  • Updated lexers:

  • Add Pango formatter (#1727)

  • Autopygmentize uses file first instead of pygments -N (#1786)

  • Fix links (#1716)

  • Fix issue with LaTeX formatter and minted (#1734, #1735, #1736, #1737)

  • Improve alias order (#1780)

  • Improve line number colors (#1779, #1778)

  • Fix CTag related issue (#1724)

  • Recognize .leex as Elixir templates

  • Updated filename handling in HTML formatter if linenos='table' (#1757)

    • Previously the filename would be emitted within the <td> holding the code, but outside the <pre>. This would invariably break the a lignment with line numbers.
    • Now if filename is specified, a separate <tr> is emitted before the table content which contains a single <th> with colspan=2 so it
      spans both the line number and code columns. The filename is still within <span class="filename">...</span> so any existing styles
      should still apply, although the CSS path may need to change.
    • For an example of the new output format see table_cls_step_1_start_1_special_0_noanchor_filename.html
      in the tests/html_linenos_expected_output/ directory.
    • For more details and discussion see the issue #1757
  • Added styles:

    • Gruvbox light+dark (#1763)
Compare
Choose a tag to compare
Compare
Choose a tag to compare
  • Added lexers:

  • Updated lexers:

    • AutoIt: Support single quoted strings (#1667, #1663)

    • C/C++ & related: Fix mishandling '*/' (#1695)

    • Cocoa: Add builtin types (#1703)

    • Console (#1672)

    • Eiffel: Fix performance issues (#1658)

    • Fortran: Improve combined keyword detection (#1677, #1188)

    • J: Fix operator ? lexing (#1700, #1149)

    • JavaScript/TypeScript: Fix escapes in backtick strings (#1679, #1686)

    • Kotlin: Improve string interpolation, modifier keyword handling, and various small issues (#1699)

    • LESS: Support single-line comments (#1046)

    • Matlab:

      • Add support for class properties (#1466)
      • Update builtin functions (#1705)
      • Various cleanups (#1673)
    • Python: Improve handling of raw f-strings (#1681, #1683)

    • Ruby: Better method name handling (#1531)

    • Stata: Updated keywords (#1470)

  • Added styles:

  • The pygmentize script now uses argparse, all options should work
    as before

  • Add pygmentize -C option to guess a lexer from content

  • With this release, Pygments moves to a new internal testing system (#1649.) See Contributing.md for details. The main advantage of this new change is a much better test coverage of all existing example lexers. It also makes it much easier to add new test snippets.

  • Make guessing prefer Python 3 lexer

  • Do not guess MIME or SQL without reason

  • Changed setuptools to use a declarative config through setup.cfg. Building Pygments now requires setuptools 39.2+.

  • Add markdown to MarkdownLexer aliases (#1687)

  • Change line number handling

    • In <table> based output, the td.linenos element will have either a normal or special class attached. Previously, only special line
      numbers got a class. This prevents styles from getting applied twice - once via <pre>, once via <span class="special">. This also means
      that td.linenos pre is no longer styled, instead, use td.linenos .normal and td.linenos .special.
    • In the "inline" style, the DOM element order was changed. The line number is added first, then the line is wrapped is wrapped by the highlighter. This fixes lines not being fully highlighted.
    • The visual output for inline and non-inline line numbers & highlighting, as well as class-based and inline styling is now consistent.
    • Line number styles are set to background-color: transparent and color: inherit by default. This works much better with dark styles which don't have colors set for line numbers.
  • Remove "raw" alias from RawTokenLexer, so that it cannot be selected by alias.

  • Fix RawTokenLexer to work in Python 3 and handle exceptions.

  • Add prompt colors to the Solarized theme (#1529)

  • Image formatter supports background colors now (#1374)

  • Add support for anchors in conjunction with inline line numbers (#1591)

  • Modernize the codebase using pyupgrade (#1622)

  • Add support for line numbers to the terminal256 formatter (#1674, #1653)

  • Improve analyze_text logic for ECL (#1610)

  • Improve analyze_text logic for CBM Basic V2 (#1607)

  • Improve LaTeX formatter (#1708, #1709)

Compare
Choose a tag to compare
  • Updated lexers:

    • Apache configurations: Improve handling of malformed tags (#1656)

    • CSS: Add support for variables (#1633, #1666)

    • Crystal (#1650, #1670)

    • Coq (#1648)

    • Fortran: Add missing keywords (#1635, #1665)

    • Ini (#1624)

    • JavaScript and variants (#1647 -- missing regex flags, #1651)

    • Markdown (#1623, #1617)

    • Shell

      • Lex trailing whitespace as part of the prompt (#1645)
      • Add missing in keyword (#1652)
    • SQL - Fix keywords (#1668)

    • Typescript: Fix incorrect punctuation handling (#1510, #1511)

  • Fix infinite loop in SML lexer (#1625)

  • Fix backtracking string regexes in JavaScript/TypeScript, Modula2
    and many other lexers (#1637)

  • Limit recursion with nesting Ruby heredocs (#1638)

  • Fix a few inefficient regexes for guessing lexers

  • Fix the raw token lexer handling of Unicode (#1616)

  • Revert a private API change in the HTML formatter (#1655) --
    please note that private APIs remain subject to change!

  • Fix several exponential/cubic-complexity regexes found by
    Ben Caller/Doyensec (#1675)

  • Fix incorrect MATLAB example (#1582)

Thanks to Google's OSS-Fuzz project for finding many of these bugs.