Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Special characters in entrypoint paths are always forcibly replaced #1362

Closed
airhorns opened this issue Jun 10, 2021 · 5 comments
Closed

Special characters in entrypoint paths are always forcibly replaced #1362

airhorns opened this issue Jun 10, 2021 · 5 comments

Comments

@airhorns
Copy link

airhorns commented Jun 10, 2021

Currently, esbuild replaces certain characters in entrypoint filenames for the corresponding output filenames when bundle is false. I assume this is desirable in some circumstances, but, if there's anything else that is expecting those file names to be consistent, or if those special characters have special meaning to other systems, this replacement causes some strife!

For me, I am using esbuild to build a bunch of TypeScript files for a nodejs project into JS files for deployment to production. We use a next.js style filepath-based routing system, where we have a file named GET-/foo.ts that the router discovers and registers as a route at /foo in our webserver. We glob our whole package for .ts files and then submit all of the found paths as entrypoints to esbuild with bundle: false, and esbuild builds them into a mirror image of the same directory structure. The problem comes from esbuild's special character replacement -- our filepath based router also supports wildcards, like GET-*.ts, or dynamic parameter segments, like POST-:application_slug.ts. esbuild replaces the : character and the * character in the output filenames such that the semantic meaning of them to the downstream filepath based router changes.

We're currently working around this by asking for the metafile from esbuild, building a mapping of entrypoint to output file, and moving the written outputs back to place they started so that the only thing that's different is the extension and not the filename itself. This is annoying and tricky though!

Could we add an option to disable the special character replacement? Or to pass a smaller list of characters that will be replaced?

PS: Thanks as always for an amazing piece of open source software!

@evanw
Copy link
Owner

evanw commented Jun 10, 2021

I have designed esbuild to be cross-platform. It deliberately avoids doing things that work only on some platforms but not others. This is one of those things. Colon characters in path names don't work on Windows according to https://docs.microsoft.com/en-us/windows/win32/fileio/naming-a-file:

  • Use any character in the current code page for a name, including Unicode characters and characters in the extended character set (128–255), except for the following:

    • The following reserved characters:

      • < (less than)
      • > (greater than)
      • : (colon)
      • " (double quote)
      • / (forward slash)
      • \ (backslash)
      • | (vertical bar or pipe)
      • ? (question mark)
      • * (asterisk)
    • Integer value zero, sometimes referred to as the ASCII NUL character.

    • Characters whose integer representations are in the range from 1 through 31, except for alternate data streams where these characters are allowed. For more information about file streams, see File Streams.

    • Any other character that the target file system does not allow.

If you want to have cross-platform incompatible file names, you are welcome to use write: false and then write out the files yourself. It sounds like you are already doing that.

@airhorns
Copy link
Author

airhorns commented Jun 11, 2021

I have designed esbuild to be cross-platform.

This makes sense, and yeah, there is a workaround, but, it is a little surprising that most but not all files end up with just a changed extension. For the unawares like me it was easy to assume that all files would follow the rule. Totally happy to chalk that up to me being thick and leave esbuild as is.

That said, is it important to you that the output is identical cross platform? Or just that esbuild works on windows? One option would be to turn on the character replacement on windows where it is necessary, but leave it off for *nix where it isn't. In my case, my input files wouldn't even be allowed on windows such that I don't think I could even feed esbuild these files in the first place. At first blush it doesn't seem that crazy to assume entrypoints with special characters mean that outputs with special characters wouldn't break anything on the machine esbuild is running on. If folks copied outputs between OSes it'd break though.

I actually ended up doing write: false and manually write the output files to the place where I want them to be, but it ends up with yet more downstream complexity trying to move the sourcemaps as well as the compiled outputs into the right spot. It is a little regrettable I have to stream all the content back into node to then write it out to the FS, but not the end of the world either. Here's what I have so far before getting the sourcemap part working just to demonstrate:

  const workspaceRoot = path.resolve(path.join(__dirname, "..", "..", ".."));

  const result = await build({
    platform: "node",
    format: "cjs",
    target: ["node14"],
    write: false,
    outdir: path.join(workspaceRoot, "packages"),
    bundle: false,
    sourcemap: true,
    metafile: true,
    entryPoints: await globby(
      [
        "packages/{@starscream,@gadgetinc,api,app-sandbox,docs,state-trees,api-client,gelly,web}/**/*.{ts,tsx}",
        "!**/node_modules",
        "!packages/api/spec/integration",
      ],
      { cwd: workspaceRoot }
    ),
    absWorkingDir: workspaceRoot,
  });

  const outputsByDestinationPath = keyBy(result.outputFiles, "path");

  /**
   * esbuild rewrites special characters out of filenames, replacing them with an underscore. The specific filenames are important to use for Starscream's routing syntax, so we need to write files to exactly the same path as the input file. By mapping the metafile that connects input files to output files we can ensure we keep the original filenames for any that might have been changed by esbuild.
   */
  const writes = [];
  for (const [destinationString, details] of Object.entries(result.metafile.outputs)) {
    if (details.entryPoint) {
      const entrypoint = path.parse(details.entryPoint);
      const absoluteDestination = path.join(workspaceRoot, destinationString);
      const buildResult = outputsByDestinationPath[absoluteDestination];
      if (!buildResult) {
        throw new Error(`no build result found for path ${absoluteDestination}`);
      }
      const esbuildDestination = path.parse(destinationString);

      const newDestination = path.join(workspaceRoot, entrypoint.dir, entrypoint.name + esbuildDestination.ext);
      writes.push(fs.writeFile(newDestination, buildResult.contents, "utf-8"));
    }
  }

  await Promise.all(writes);

If you are unconvinced feel free to close! Thanks for your consideration!

@nettybun
Copy link

nettybun commented Jun 21, 2021

@airhorns Did you try entryNames? https://esbuild.github.io/api/#entry-names Tthe documentation says:

[name]
This is the original file name of the entry point without the extension.

I'd hope "original" means unchanged original, and if not then maybe the documentation should be updated to say "This is the normalized file name ..."

@evanw not to advocate for feature creep but a backwards-compatible way to support this could be [name:unchanged] and [name:normalized] where [name] defaults to normalized.

evanw added a commit that referenced this issue Oct 17, 2021
@evanw
Copy link
Owner

evanw commented Oct 17, 2021

I think it's actually already possible to do this using the alternate form of entryPoints that allows you to override the automatically-generated name with a custom name. For example:

$ touch 'GET-*.ts'
$ esbuild 'GET-*.ts' --outdir=out

  out/GET-_.js  25b 

$ esbuild 'GET-*=GET-*.ts' --outdir=out

  out/GET-*.js  25b 

This is documented here: https://esbuild.github.io/api/#entry-points. Closing as this is an edge case and there is a straightforward workaround.

@evanw evanw closed this as completed Oct 17, 2021
@airhorns
Copy link
Author

Nice, that worked well for me, thanks!

eduardoboucas pushed a commit to netlify/esbuild that referenced this issue Dec 3, 2021
* fix evanw#1327: improve lowered template literals

* fix(linker): order of css imported from js (evanw#1342)

* release notes for evanw#1342

* update compat-table

* fix for "export default class" transform (evanw#1346)

* publish 0.12.6 to npm

* add support for es5-style identifiers (evanw#1349)

* runtime: remove "__platform" flag

* runtime: remove "__profiler" flag

* runtime: check "for-of" not "=>" for es6 support

* fix evanw#1349: quote modern unicode object properties

* fix evanw#1355: ignore tsconfig.json in node_modules

* fix evanw#1357: "--metafile" with "--watch"

* fix(linker): add missing esm flag (evanw#1338)

* Allow OnResolve plugins to mark modules as side effect free (evanw#1313)

* publish 0.12.7 to npm

* fix evanw#1358: remove warning about source map comment

* publish 0.12.8 to npm

* fix evanw#1361: allow "this" with "--define"

* fix evanw#1372: css minification bug with !important

* publish 0.12.9 to npm

* avoid checking "browser" for other platforms

* add an "es2021" target

* Avoid exporting a pointer to a loop variable in linker (evanw#1389)

The Bazel nogo (Go lint config) errored when I tried to compile esbuild:

    compilepkg: nogo: errors found by nogo during build-time code analysis:
    external/com_github_evanw_esbuild/internal/bundler/linker.go:3309:27:
     exporting a pointer for the loop variable stmt (export_loop_ref)

The simplified code nogo complains about is:

    for _, stmt := range partStmts {
      stmt.Data = &js_ast.SImport{
        StarNameLoc: &stmt.Loc,
      }
    }

The problem is `&stmt.Loc` points to the mutated loop variable `stmt`.  After
the loop iteration ends, all stored pointers will point to the last value of
`partStmts[-1].Loc`.

An alternative solution is to shadow `stmt` at the beginning of the loop, but
this felt cleaner:

    stmt := stmt

The lint rule is defined by https://github.com/kyoh86/exportloopref.

* feat: mangle Infinity (evanw#1385)

* add support for shorten transform/translate3d (evanw#1390)

* css: implement minification for all matrix forms

* fix evanw#1397: support "s" in css attribute selectors

* publish 0.12.10 to npm

* fix evanw#1399: avoid "os.MkdirAll" to fix WebAssembly

* fix evanw#1396: improve invalid loader error message

* improve sync performance of js api by ~20x (evanw#1000)

* fix windows issues

* publish 0.12.11 to npm

* move unique key prefix from compile to scan phase

* add "C" to unique keys for chunks

* fix evanw#1044: correct relative paths for file loader

* fix a windows path issue

* publish 0.12.12 to npm

* Fix using JS synchronous API from from non-main threads (evanw#1411)

* publish 0.12.13 to npm

* keep wasm tests self-contained

* factor out some code related to "outfile"

* pull out relative-to-outbase code

* fix evanw#1404: "file" loader always copies to "outdir"

* publish 0.12.14 to npm

* fix evanw#1421: bug with css color lowering and "var()"

* avoid "var()" issues with other css minifications

* publish 0.12.15 to npm

* update the compat table

* allow out-of-range tagged template unicode escapes

* fix evanw#1426: remove warning about bad CSS "@" rules

* fix evanw#1470: allow "ES2021" in "tsconfig.json"

* fix evanw#1462: avoid worker_threads in node <v12.17.0

* fix evanw#1466: paths with "node:" prefix are external

* Consider `\` and `/` to be the same in file paths (evanw#1472)

* publish 0.12.16 to npm

* fix evanw#1455: bundler hoisting bug with var+for loops

* fix evanw#1418: private fields and logical assignment

* Abort esbuild if stdin is closed when serving (evanw#1449)

* release notes for evanw#1449

* fix evanw#1424: always generate private method names

* publish 0.12.17 to npm

* fix evanw#1483: UTF-8 and utf-8 are the same @charset

* improve error about missing sub-condition (evanw#1484)

* refactor(deno): use denoflate instead of compress (evanw#1482)

deno.land/x/denoflate is about 10% smaller, and a lot more polished and
up to date than deno.land/x/compress.

* fix evanw#1493: nullish coalescing assignment edge case

* fix evanw#1489: do not warn about "es3" in node_modules

* fix evanw#1497: "this" before "super()" when minifying

* avoid shadowing "expr" in "lowerClass"

* fix evanw#1498: variable shadowing broke class lowering

* fix: CSS import relative paths (evanw#1494)

* add release notes for evanw#1494

* publish 0.12.18 to npm

* move source map code to source map module

* css: add location info to rules

* css: printer returns result object

* move span object to logger

* css: add support for source maps

* add extension to source map tests

* add a basic css source map test

* fix evanw#519: release notes for css source maps

* fix evanw#1507: wrong ts class field side effect order

* publish 0.12.19 to npm

* avoid printing "</style" in CSS code (evanw#1509)

* attempt to fix flaky test

* update browser compat data

* fix evanw#1512: asi issue with "." and type parameters

* fix evanw#1509: make `</script` escape case-insensitive

* publish 0.12.20 to npm

* update to go version 1.17.0

* fix evanw#995: windows arm64 support

* run go format from go 1.17.0

* css: terminate source map comment before "*/"

* add windows 64-bit arm build to installer (evanw#995)

* publish 0.12.21 to npm

* fix evanw#1536: http range requests now use less memory

* fix evanw#1538: minify bug for "var()" and "box-shadow"

* publish 0.12.22 to npm

* fix evanw#1553: rest bindings in TypeScript arrow types

* fix evanw#1545: "watch" is not allowed with "buildSync"

* fix evanw#1552: keep names + minify + nested functions

* forbid "watch" w/ "buildSync" w/o "worker_threads"

* publish 0.12.23 to npm

* fix direct "eval" variable renaming edge case

* publish 0.12.24 to npm

* fix evanw#1560: bug with "!" after "new" in TypeScript

* capture and report parser panics

* fix parser panic due to "#a in #b in c"

* class static blocks are a parse error

* illumos 64-bit support (evanw#1562)

* release notes for evanw#1562

* publish 0.12.25 to npm

* feat: Optimizing the __require function (evanw#1580)

* release notes for evanw#1579

* move "NO_COLOR" handling into the logger itself

* add an "analyze metafile" api

* add import paths to analysis

* add a "verbose" flag to analysis

* fix evanw#1568: release notes for "--analyze"

* upgrade "golang.org/x/sys" (evanw#1572)

* publish 0.12.26 to npm

* fix evanw#1594: update manual compat table overrides

* replace math.MaxInt usage (evanw#1585)

This constant is only available in go >= 1.17, so I've inlined its value
so dependents don't have to upgrade their go version.

reference implementation: https://cs.opensource.google/go/go/+/refs/tags/go1.17:src/math/const.go;l=38

* fix evanw#1589: server "stop()" waits for active builds

* use "math.MaxUint32" not "math.MaxInt"

* update go 1.17.0 => go 1.17.1

* publish 0.12.27 to npm

* fix evanw#1599: U+30FB and U+FF65 in ES5 vs. ES6+

* fix evanw#1600: "++" and "--" on class private fields

* publish 0.12.28 to npm

* fix evanw#1614: proxy from "__require" to "require"

* fix evanw#1623: ignore class fields marked "abstract"

* "typeof identifier" has no side effects

* fix "__require" to have no side effects

* fix mangle syntax edge case with "==" and "!="

* fix missing return in "IsNumericValue"

* add "--analyze" to cli help text

* publish 0.12.29 to npm

* no side effects for "typeof x != undefined && x"

* separate "ignore annotations" from "tree shaking" (evanw#1625)

* install using "optionalDependencies" (evanw#1621)

* release notes

* publish 0.13.0 to npm

* fix release gh action to ignore nested headers

* fix the "esbuild" package in yarn 2+

* yarn pnp compat: copy binary into the current pkg

* publish 0.13.1 to npm

* fix evanw#1628: "export {}" with "--tree-shaking=true"

* fix cache condition in iswin_wasm (evanw#1630)

* publish 0.13.2 to npm

* add "preferUnplugged: false" to binary packages

This is a yarn-specific "package.json" flag and is being added at the recommendation of the Yarn team. Even though esbuild's binary packages are listed as optional dependencies of the main package, Yarn still installs all of them (even though only one applies to the current platform). And unlike npm, which always installs a given package into a directory on the file system, Yarn can represent a given package either as a zip file or as a directory of files. So ideally as many packages as possible are represented as zip files to minimize wasted space on the file system (since zip files are compressed). One of the heuristics that Yarn uses is to represent a package as a directory if it contains a file ending in ".exe" so unfortunately esbuild's three Windows packages are always stored as directories instead of as zip files, which means they are uncompressed and are larger than necessary. Specifying "preferUnplugged: false" should avoid this. Hopefully someday Yarn won't even install these packages on the file system in the first place to eliminate the wasted space completely.

See also:

* https://yarnpkg.com/configuration/manifest/#preferUnplugged
* yarnpkg/berry#3317 (comment)

* support type-only import/export specifiers (evanw#1637)

* publish 0.13.3 to npm

* fix evanw#1642: permission issues with install script

* basic support for ".mts" and ".cts" from TS 4.5

* fix evanw#1647: add a fallback for "npm --no-optional"

* make pnpapi workaround platform-specific (evanw#1656)

I'm not sure if this will fix anything, but it probably couldn't hurt.

* no optimizations with yarn 1 just in case (evanw#1656)

* fix evanw#1657: invalid css transform of margin/padding

* remove ".mts" and ".cts" from resolve extensions

* publish 0.13.4 to npm

* fix evanw#1113: improve watch mode accuracy (evanw#1676)

* disallow certain "<" in ".mts/.cts" files

* fix evanw#1665: don’t remove empty @Keyframes (evanw#1669)

* release notes for evanw#1665

* Don't emit "duplicate label" error across function scopes. (evanw#1671)

* release notes for evanw#1671

* publish 0.13.5 to npm

* Add NetBSD amd64 binary (evanw#1624)

* https in changelog, rebalance makefile

* Allow bundled esbuild with ESBUILD_BINARY_PATH (evanw#1678)

* feat: drop catch binding when optional catch binding is supported (evanw#1660)

* fix subtle minify issues with eval

* ts: forbid "declare" fields from being initialized

* ts: forbid "declare" on non-field class properties

* fix evanw#1675: run decorators for "declare" fields

* avoid direct eval retaining unused imports in ts

* publish 0.13.6 to npm

* update parcel 2 version in benchmark

* remove now-unnecessary "@parcel/transformer-typescript-tsc"

* remove old bundler versions

* update rollup and webpack too

* update benchmark image

* fix evanw#1682: always use the shortest css alpha value

* fix evanw#1680: match node's core module behavior

* update go 1.17.1 => 1.17.2

* fix wasm on go 1.17.2 (evanw#1684)

* update rollup tests so they work on node v16.11.1

* publish 0.13.7 to npm

* fix evanw#1425: super inside arrow inside lowered async

* add "and CSS" to package description

* fix evanw#1661: remove implicit trailing "/" in "[dir]"

* add a test for evanw#1362

* publish 0.13.8 to npm

* fix evanw#1702: invalid css transform of border-radius

* make yaml formatting consistent

* add simple end-to-end tests

* fix evanw#1703: handle silent "rename" syscall failure

* add pnpm end-to-end tests

* check end-to-end test output

* resolver: rename "pe" => "pj"

* remove unused range

* fix evanw#1691: support "imports" in "package.json"

* publish 0.13.9 to npm

* yarn berry end-to-end test

* try running end-to-end tests on github

* check that esbuild builds on go 1.13

* Use `io.SeekStart` instead of deprecated `os.SEEK_SET` (evanw#1701)

`os.SEEK_SET` has been deprecated since Go 1.7.
Ref: https://pkg.go.dev/os#pkg-constants

* add "check out code" to old go version ci

* update @next targets for npm and yarn

* link from code to docs for vs code autocomplete

* remove invalid "es7" option in tsconfig parser

* update the compat table

* Allow target for ES-Version to be uppercase (evanw#1718)

* fix evanw#1539: implement legal comments for css

* update to unicode 14

* add ".mts" and ".cts" to exports kind checking

* publish 0.13.10 to npm

* reorder some functions

* get tests working on node 17+

* also run async transform tests un-transformed

* run tests w/ node 16 not 14 to avoid hard-crash

Node 14 has some bug that results in an "unreachable code" panic. For the record, the traceback is as follows:

     1: node::NodePlatform::GetStackTracePrinter()::$_3::__invoke()
     2: V8_Fatal(char const*, ...)
     3: v8::internal::interpreter::BytecodeGenerator::VisitCompoundAssignment(v8::internal::CompoundAssignment*)
     4: v8::internal::interpreter::BytecodeGenerator::VisitNoStackOverflowCheck(v8::internal::AstNode*)
     5: v8::internal::interpreter::BytecodeGenerator::GenerateBytecodeBody()
     6: v8::internal::interpreter::BytecodeGenerator::GenerateBytecode(unsigned long)
     7: v8::internal::interpreter::InterpreterCompilationJob::ExecuteJobImpl()
     8: v8::internal::(anonymous namespace)::ExecuteSingleUnoptimizedCompilationJob(v8::internal::ParseInfo*, v8::internal::FunctionLiteral*, v8::internal::AccountingAllocator*, std::__1::vector<v8::internal::FunctionLiteral*, std::__1::allocator<v8::internal::FunctionLiteral*> >*)
     9: v8::internal::(anonymous namespace)::IterativelyExecuteAndFinalizeUnoptimizedCompilationJobs(v8::internal::Isolate*, v8::internal::Handle<v8::internal::SharedFunctionInfo>, v8::internal::Handle<v8::internal::Script>, v8::internal::ParseInfo*, v8::internal::AccountingAllocator*, v8::internal::IsCompiledScope*, std::__1::vector<v8::internal::FinalizeUnoptimizedCompilationData, std::__1::allocator<v8::internal::FinalizeUnoptimizedCompilationData> >*)
    10: v8::internal::Compiler::Compile(v8::internal::Handle<v8::internal::SharedFunctionInfo>, v8::internal::Compiler::ClearExceptionFlag, v8::internal::IsCompiledScope*)
    11: v8::internal::Compiler::Compile(v8::internal::Handle<v8::internal::JSFunction>, v8::internal::Compiler::ClearExceptionFlag, v8::internal::IsCompiledScope*)
    12: v8::internal::Runtime_CompileLazy(int, unsigned long*, v8::internal::Isolate*)
    13: Builtins_CEntry_Return1_DontSaveFPRegs_ArgvOnStack_NoBuiltinExit
    14: Builtins_CompileLazy
    15: Builtins_InterpreterEntryTrampoline

* also run class lowering tests untransformed

* test coverage for super and object methods

* fix writing to a "super" property

* also handle "super" inside static class fields

* implement class static blocks (evanw#1729)

* publish 0.13.11 to npm

* enable tree shaking for empty "static {}" blocks

* fix evanw#1730: crash with legal comment and @import

* enable tree shaking of "Reflect" static methods

* implement "calc()" reduction for css (evanw#1731)

* publish 0.13.12 to npm

* fix evanw#1739: tree shaking bug with "var exports"

* border radius tests: use length instead of number

* Add css to help text for --loader (evanw#1744)

* allow empty string for CLI string arrays

* move "main fields" logic to a separate function

* make debug meta available to the entire resolver

* say if "main" is missing from main fields (evanw#1754)

* fix evanw#1755: merge adjacent selectors with same body

* add spack to benchmarks (not ready due to bugs)

* Shorten "top", "right" properties into "inset" property (evanw#1758)

* add credit to changelog

* publish 0.13.13 to npm

* chore: make build pass

* chore: update publishing scripts

Co-authored-by: Evan Wallace <evan.exe@gmail.com>
Co-authored-by: dmitrage <dmitrage@gmail.com>
Co-authored-by: Liu Bowen <Mr_lbw@outlook.com>
Co-authored-by: Chris Casola <chriscasola@gmail.com>
Co-authored-by: Joe Schafer <joe@jschaf.com>
Co-authored-by: Gusted <williamzijl7@hotmail.com>
Co-authored-by: Weilin Shi <934587911@qq.com>
Co-authored-by: José Valim <jose.valim@gmail.com>
Co-authored-by: Luca Casonato <hello@lcas.dev>
Co-authored-by: Rongjian Zhang <pd4d10@gmail.com>
Co-authored-by: Dominik Hassler <hadfl@omnios.org>
Co-authored-by: FM <15306225869@163.com>
Co-authored-by: David Zukowski <david@zuko.me>
Co-authored-by: John Doe <you@example.com>
Co-authored-by: Georges Varouchas <georges.varouchas@gmail.com>
Co-authored-by: Pig Fang <g-plane@hotmail.com>
Co-authored-by: Eelco Lempsink <eelco@framer.com>
Co-authored-by: Nevkontakte <nevkontakte@users.noreply.github.com>
Co-authored-by: Greg Troxel <gdt@lexort.com>
Co-authored-by: Piotr Krawiec <piotr@krawiec.me>
Co-authored-by: 翠 / green <green@sapphi.red>
Co-authored-by: y-yagi <yuuji.yaginuma@gmail.com>
Co-authored-by: timse <tim.sebastian@gmail.com>
Co-authored-by: Dan Rosén <danr42@gmail.com>
Co-authored-by: Netlify Team Account 1 <netlify-team-account-1@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants