Feature: Load ESM over https #127

viktor-ku · 2018-06-09T12:42:58Z

Hi,

Just closed my PR to the Node.js core about this feature. I'd like to propose a loader that could do this thing.

Example

import sum from 'https://gitsomething.com/origin/v1.1.0/main.js'

console.log(sum(1, 1)) // 2

How I think it should work

Import over https reached
Convert this https path (without any filename) to the hash representation (e.g. 'https://gitsomething.com/origin/v1.1.0' to 70b56bcaff85f7836cc4688021cfbfe385550fbbeb6e5e801eb16bd9e66d3134)
Go to the main location with stored node modules over http (in PR it uses /tmp/ or the path specified by NODE_HTTP_MODULES env. Essential is that it is some base dir like node_modules)
Look for the folder with this hashed name or download the package
Look for the file matching the loading one (e.g. main.js in this case)
Load the file content as usual esm

What I don't know yet

How to resolve package dependencies

The text was updated successfully, but these errors were encountered:

xtuc · 2018-06-09T12:50:26Z

I'm curious to know what do you think about allowing URL in Module specifier in WASM WebAssembly/esm-integration#11

devsnek · 2018-06-09T13:01:39Z

once again i think we need to do this the standardised way: html.spec.whatwg.org/multipage/webappapis.html#fetching-scripts

there are 1000000000000 different things that servers do with compression, transport security, content types, etc and node trying to figure all that out itself won't help anyone.

also why do you want to cache files?

xtuc · 2018-06-09T13:06:18Z

@devsnek Yes that's right, just wanted to bring that up because I think that's important to all agree on it.

also why do you want to cache files?

Sorry where's mentioned caching?

devsnek · 2018-06-09T13:08:21Z

@xtuc steps 3-5 are working on files cached in /tmp/ (which won't work on plenty of oses)

i'm unclear as to why source text is leaving node's memory

benjamingr · 2018-06-09T13:30:07Z

@devsnek presumably for the same reason ry caches files in deno - so it’s fast and consistent between loads and doesn’t delay startup time.

devsnek · 2018-06-09T15:00:52Z

@benjamingr wouldn't you just download the files in that case? i would expect an https import to hit the server every time.

ljharb · 2018-06-09T15:06:02Z

It must; in case there’s redirects or updated content. It might get a 304, of course.

viktor-ku · 2018-06-09T15:36:12Z

The reason behind caching is to avoid going to the internet if you already have this module downloaded. Why do you expect node to re-download module every time?

If you specify url with certain version (see example) then you wouldn't need it to be re-fetched
If you use master on un-versioned url then you will have this kind of issue, but then we can have a similar to ry idea with --reload command

viktor-ku · 2018-06-09T15:38:36Z

About tmp:

Guys, it is a draft. I am not gonna leave it as is. We simply need some convenient place to store this modules. Perhaps we can specify this place with flag or env (like in PR).

So you specify env variable and everything goes there.

guybedford · 2018-06-09T15:47:40Z

There’s nothing to stop this from being implemented in a loader already today. Basically a resolve hook can detect http requests, check if it is in the cache, downloading if necessary, then resolve to the file url of the stored cache entry through the resolve promise. Please do give it a go and share your findings, and just ask if you have any questions on the implementation or feedback on the loader hooks in the process too.

…

On Sat, 09 Jun 2018 at 17:38, Viktor Kuroljov ***@***.***> wrote: About *tmp*: Guys, it is a draft. I am not gonna leave it as is. We simply need some convenient place to store this modules. Perhaps we can specify this place with flag or env (like in PR). So you specify env variable and everything goes there. — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#127 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AAkiyt3iYvN4r9QRaiOnJrahoqNMk4aNks5t6-v8gaJpZM4UhUb5> .

viktor-ku · 2018-06-09T16:25:45Z

@guybedford I don't know how to create a loader at the moment. But I will start to research that

ljharb · 2018-06-09T16:26:25Z

A url is designed to have its contents cached, but urls always still hit the network (service workers aside). I would expect the same in node.

viktor-ku · 2018-06-09T18:05:06Z

@ljharb so you will use --reload a lot I guess ⌨️

ljharb · 2018-06-09T18:16:04Z

No, I’d expect that to be the default.

benjamingr · 2018-06-09T19:19:50Z

Guys, it is a draft.

Prefer folks, people etc.

benjamingr · 2018-06-09T19:20:24Z

@kuroljov see https://nodejs.org/api/esm.html#esm_loader_hooks

viktor-ku · 2018-06-11T08:21:27Z

Prefer folks, people etc.

@benjamingr Okay. I will try to remember that. Bad habits I guess

SMotaal · 2018-06-12T11:25:43Z

@ljharb on the topic of service workers, could abstracting away https implementation into a similar mechanism make sense.

Say I have xyz origins, I create and register a worklet for those (I can use my imaginary favourite npm package to do so), and when a URL is requested, all node has to do is delegate to the worker and expect a Response-like object with content-type and body. It will involve a bit of origin trials... etc. so that node does delegate to the right worklet. It can also be statically defined for packages to use the specific worklet entry point specifiers.

Fishrock123 · 2018-06-20T19:02:43Z

I think this feature would be dangerous to implement.

If people would like to hot-load code from a network we should not make it easy for them. Node.js is too un-sandboxed to make this a viable default or built-in option.

SMotaal · 2018-06-27T11:42:30Z

@bmek One thing I keep hitting with the 20+ isolated experiments I did with loaders is the fact that sometimes the resolved URL to a module's actual file on disk is not the same as the URL of the module it intends to resolve as. For instance, if a loader transpiles and caches a module in a temporary file, that URL is the resolved URL, however it is should not be the import.meta.url nor should it be the parentURL for it's static linking calls to Loader#<resolve, import, …>(specifier, parentURL).

I believe this will be common beyond experimental, which imho applies to this kind of loader.

@viktor-ku did you explore this further with --loader and if so, were there any other pain-points came across?

bmeck · 2018-06-27T13:50:20Z

@SMotaal for now you can insert import.meta.url = ${JSON.stringify(desiredURL)} at top of your transpiled script.

However this is not enough, will continue why in #140

guybedford · 2018-06-27T13:52:39Z

Note this contextual handling is exactly why a translate hook is useful.

bmeck · 2018-06-27T14:22:26Z

@guybedford I don't think this implies we need a separate hook / translate, if you could explain that a bit it would help here.

guybedford · 2018-06-27T14:25:45Z

Translation allows setting the JS source of a resource at a given path, thereby maintining the correct referrer context to that given path.

The alternatives involve large-scale re-resolving all modules that depend on that module into a new ID space, that will then not match the file system.

bmeck · 2018-06-27T14:27:01Z

@guybedford couldn't that be rolled into the same hook? I'm asking because that matches the JS spec closer and seems to alleviate a series of issues.

guybedford · 2018-06-27T14:45:56Z

@bmeck do you mean combining with resolve or combining with load? A load hook can certainly work as an alternative to any translate hook.

bmeck · 2018-06-27T14:52:54Z

there is too much tribal knowledge in those terms of "load" vs "translate" for me to understand, I've been looking at having:

resolve(referrer : string, specifier : string, hostDefined: *) => Promise<{
  key? : string, // insertion point into cache
  body? : B, // the body, unclear on if this actually needs to be loaded ahead of return, can be on demand?
  data? : * // any message passing between loaders
}>

guybedford · 2018-06-27T14:56:03Z

Could you not just return a Blob URL directly in such a case?

Blobs can work certainly, but are a bit more work as any importers need to be able to look up the mapping from the file URL to the blob for that resource, which is avoided with a translate.

What I mean by load is basically a fetch hook that can choose to do translation (load effectively covering both concepts).

guybedford · 2018-06-27T14:57:26Z

Also the issue with resolvers getting data as you've written is that it's a many to one function, so precedence on body / data would apply.

bmeck · 2018-06-27T15:00:57Z

@guybedford blob: URLs are insufficient for cycles: w3c/FileAPI#97 . We want full Blobs with ability to separate reservation of the key so that it can be done ahead of content creation.

Also the issue with resolvers getting data as you've written is that it's a many to one function, so precedence on body / data would apply.

I'm unclear on this, data is purely used as a message passing system for loaders, it wouldn't affect any default loader I would think.

SMotaal · 2018-06-27T15:12:44Z

@guybedford I am +1 for a translation hook as long as chaining it is handled appropriately (which in my opinion is different from resolve's chaining).

Could you not just return a Blob URL directly in such a case?

My biggest concern in all my testing so far has been unchecked potential for redundant "stringing" garbage and mishandling outdated references to objects because we all do that sometimes. My second one is determinism of resulting URL, ie URL.createObjectURL does not allow for deterministic mapping for deadlock dependencies. So maybe the open web requires this kind of after-the-fact security however, Node's Loader specifically should meter both security and performance overhead.

devsnek added interoperability esm features web-platform labels Jun 9, 2018

GeoffreyBooth closed this as completed Jun 21, 2018

SMotaal mentioned this issue Jun 27, 2018

Loaders: resolved vs. referencing URL #140

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature: Load ESM over https #127

Feature: Load ESM over https #127

viktor-ku commented Jun 9, 2018

xtuc commented Jun 9, 2018 •

edited

Loading

devsnek commented Jun 9, 2018

xtuc commented Jun 9, 2018

devsnek commented Jun 9, 2018 •

edited

Loading

benjamingr commented Jun 9, 2018 •

edited

Loading

devsnek commented Jun 9, 2018

ljharb commented Jun 9, 2018

viktor-ku commented Jun 9, 2018

viktor-ku commented Jun 9, 2018

guybedford commented Jun 9, 2018 via email

viktor-ku commented Jun 9, 2018

ljharb commented Jun 9, 2018

viktor-ku commented Jun 9, 2018

ljharb commented Jun 9, 2018

benjamingr commented Jun 9, 2018

benjamingr commented Jun 9, 2018

viktor-ku commented Jun 11, 2018

SMotaal commented Jun 12, 2018 •

edited

Loading

Fishrock123 commented Jun 20, 2018

SMotaal commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

SMotaal commented Jun 27, 2018 •

edited

Loading

Feature: Load ESM over https #127

Feature: Load ESM over https #127

Comments

viktor-ku commented Jun 9, 2018

How I think it should work

What I don't know yet

xtuc commented Jun 9, 2018 • edited Loading

devsnek commented Jun 9, 2018

xtuc commented Jun 9, 2018

devsnek commented Jun 9, 2018 • edited Loading

benjamingr commented Jun 9, 2018 • edited Loading

devsnek commented Jun 9, 2018

ljharb commented Jun 9, 2018

viktor-ku commented Jun 9, 2018

viktor-ku commented Jun 9, 2018

guybedford commented Jun 9, 2018 via email

viktor-ku commented Jun 9, 2018

ljharb commented Jun 9, 2018

viktor-ku commented Jun 9, 2018

ljharb commented Jun 9, 2018

benjamingr commented Jun 9, 2018

benjamingr commented Jun 9, 2018

viktor-ku commented Jun 11, 2018

SMotaal commented Jun 12, 2018 • edited Loading

Fishrock123 commented Jun 20, 2018

SMotaal commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

guybedford commented Jun 27, 2018

guybedford commented Jun 27, 2018

bmeck commented Jun 27, 2018

SMotaal commented Jun 27, 2018 • edited Loading

xtuc commented Jun 9, 2018 •

edited

Loading

devsnek commented Jun 9, 2018 •

edited

Loading

benjamingr commented Jun 9, 2018 •

edited

Loading

SMotaal commented Jun 12, 2018 •

edited

Loading

SMotaal commented Jun 27, 2018 •

edited

Loading