Rework runtime #535

dcodeIO · 2019-03-08T23:49:33Z

As mentioned a few times, I'm relatively unhappy with our internal runtime. More precisely, the current runtime stacked stuff on top of other stuff at random places in order to get things going, starting at memory allocation level, putting ArrayBuffers, Strings and other classes on top, introducing cumbersome macro helpers, and then there's GC.

So this PR aims at designing something better, a common runtime that can be used for all sorts of tasks, using a common header for all objects, providing common helpers for internal use, eliminating unnecessary helpers, simplifying buffer layout and so on.

See std/assembly/runtime.ts for the proposed implementation.

Differences are:

All objects have a common hidden HEADER with their class id (for instanceof etc.) and their payload size (for easy realloc and length/byteLength computation).
The LOAD and STORE macros are gone. Strings and ArrayBuffers don't have their own header anymore and start with their data immediately, making it possible to just load and store from/to them. Actually one can, theoretically, even cast between a String and an ArrayBuffer now.
The xyzUnsafe helpers become mostly redundant and most of them will be removed (still need to think about copyUnsafe etc.).
Objects can always be reallocated, and can be discarded as long as they are not registered yet.
Downside: Whenever an object is meant to become managed by GC, i.e. when returned from a standard library function, it must be registered or it will leak. Upside: One can work with scratch objects and throw them away if necessary.
The GC interface has been extended to eventually support reference counting GCs as well
The MM interface has been formalized
Various issues in standard library code have been fixed in the process
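
To illustrate the common-header idea from the list above, here is a minimal sketch in plain TypeScript that simulates linear memory with a DataView. The 8-byte header size, the field order, the bump allocator, and the names `allocate`, `classIdOf`, `payloadSizeOf` and `STRING_ID` are all assumptions made for illustration; they are not taken from std/assembly/runtime.ts.

```typescript
// Hypothetical layout: [classId: u32][payloadSize: u32][payload ...]
const HEADER_SIZE = 8; // assumed size, not taken from the PR

const memory = new DataView(new ArrayBuffer(1024));
let offset = 0; // simple bump pointer, stands in for the real allocator

// Allocates a payload of `size` bytes and returns a pointer to the payload,
// i.e. the header sits immediately before the returned address.
function allocate(classId: number, size: number): number {
  const header = offset;
  memory.setUint32(header, classId, true);
  memory.setUint32(header + 4, size, true);
  offset = (header + HEADER_SIZE + size + 7) & ~7; // keep 8-byte alignment
  return header + HEADER_SIZE;
}

// Reads the class id from the hidden header (what instanceof would use).
function classIdOf(ptr: number): number {
  return memory.getUint32(ptr - HEADER_SIZE, true);
}

// Reads the payload size from the hidden header.
function payloadSizeOf(ptr: number): number {
  return memory.getUint32(ptr - HEADER_SIZE + 4, true);
}

// A "string" is just its UTF-16 data, so its length falls out of the header.
const STRING_ID = 1; // hypothetical class id
const str = allocate(STRING_ID, 2 * 5);
const strLength = payloadSizeOf(str) >>> 1; // byte size / 2 = code units
```

Because the payload starts right at the returned pointer, plain loads and stores relative to `str` reach the data directly, which is the point of dropping the per-class headers.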

dcodeIO · 2019-03-11T22:31:32Z

The more I work on this, the more I feel like redoing the entire standard library from scratch would be a better approach. I've already stumbled over 20 places where stuff wasn't properly linked with GC and whatnot.

MaxGraey · 2019-03-11T23:09:01Z

std/assembly/internal/sort.ts

@@ -42,7 +33,7 @@ export function COMPARATOR<T>(): (a: T, b: T) => i32 {
      if (!alen && !blen) return 0;
      if (!alen) return -1;
      if (!blen) return  1;
-      return compareUnsafe(<string>a, 0, <string>b, 0, <usize>min(alen, blen));
+      return String.cmp(<string>a, 0, <string>b, 0, <usize>min(alen, blen));


Hmm, what do you think about adding String.ord(a: string, b: string): i32 instead, and leaving compareUnsafe as is?

dcodeIO · 2019-04-15T09:40:54Z

Last commit integrates purerc and tlsf into one package (currently just named asrt for AssemblyScript Runtime), now with a single runtime header.

In the process I actually stumbled upon a couple strange things in TLSF (for instance get tail and set tail loaded from strange offsets, and tail merging seemed weird) that could have caused corruption when memory is grown (maybe related to #548). There's still a fixme on SL layout that I need to check and I haven't run a single test yet.

Also turns out that the common header (block overhead in TLSF) is exactly 16 bytes in WASM32, which is great. Doesn't hold true for WASM64, but considering that this isn't anywhere on the horizon this is fine with me.

dcodeIO · 2019-04-16T10:42:44Z

Still debugging TLSF. There's something very odd when picking fl/sl on allocations and determining maximum block size. Takes a while because I need to build a visual memory debugger for this to know for sure.
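
For readers unfamiliar with fl/sl: in TLSF as it is generally described, free blocks are binned by a first-level index (the power-of-two size class) and a second-level index (a linear subdivision within that class). The sketch below shows that mapping in plain TypeScript. `SL_BITS = 4` and the function name `mapping` are assumptions for illustration, small blocks are not handled, and this is not necessarily how asrt computes it.

```typescript
const SL_BITS = 4; // assumed: 16 second-level lists per first-level class

// Maps a block size to its (fl, sl) free-list indices.
// Only valid for sizes >= 2^SL_BITS in this simplified sketch.
function mapping(size: number): { fl: number; sl: number } {
  // fl = floor(log2(size)), i.e. the position of the highest set bit
  const fl = 31 - Math.clz32(size);
  // sl = the SL_BITS bits directly below the highest set bit
  const sl = (size >>> (fl - SL_BITS)) ^ (1 << SL_BITS);
  return { fl, sl };
}
```

For example, a size of 1500 (binary 10111011100) has its highest bit at position 10, and the four bits below it are 0111, so it lands in fl 10, sl 7.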

MaxGraey · 2019-04-16T10:45:55Z

I remember someone already built a visual debugger, and as far as I recall, memory under TLSF was very fragmented.

dcodeIO · 2019-04-16T10:47:27Z

What I'm after specifically is something that shows fl/sl/heads visually, with a way to allocate and free by clicking, on top of a debug build of asrt that logs stuff :)

jtenner · 2019-04-16T11:27:03Z

Is there any way to query all the allocations? I have a few ideas.

dcodeIO · 2019-04-16T11:45:00Z

No, TLSF doesn't keep lists of "used" blocks, just free blocks. The way this works is that it picks a suitable block from free blocks, possibly splitting a larger one (leaving the remainder as a free block), and giving the allocated block to the user, forgetting about it. Once that's free'd again, it's inserted back into free lists and possibly merged with adjacent free blocks. I think the visualizer I'm working on will explain the concept quite well :)
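
The split-and-merge behaviour described above can be sketched roughly as follows, in plain TypeScript. This is a deliberately simplified model, not TLSF itself: it uses a single first-fit free list instead of fl/sl bins, and the caller passes the size to `freeBlock` where a real allocator would read it from the block header. The names `FreeBlock`, `freeList`, `allocBlock` and `freeBlock` are hypothetical.

```typescript
interface FreeBlock { start: number; size: number; }

// Only free blocks are tracked; allocated blocks are forgotten.
const freeList: FreeBlock[] = [{ start: 0, size: 256 }];

function allocBlock(size: number): number {
  const i = freeList.findIndex(b => b.size >= size); // first fit, for brevity
  if (i < 0) throw new Error("out of memory");
  const block = freeList[i];
  const ptr = block.start;
  if (block.size > size) {
    // Split: the remainder stays behind as a free block.
    block.start += size;
    block.size -= size;
  } else {
    freeList.splice(i, 1);
  }
  return ptr;
}

// The real allocator reads the size from the block header; here it is passed in.
function freeBlock(ptr: number, size: number): void {
  let block: FreeBlock = { start: ptr, size };
  // Merge with any adjacent free blocks (left or right neighbour).
  for (let i = freeList.length - 1; i >= 0; i--) {
    const other = freeList[i];
    if (other.start + other.size === block.start) {
      block = { start: other.start, size: other.size + block.size };
      freeList.splice(i, 1);
    } else if (block.start + block.size === other.start) {
      block = { start: block.start, size: block.size + other.size };
      freeList.splice(i, 1);
    }
  }
  freeList.push(block);
}
```

Allocating three 64-byte blocks and freeing them again leaves a single 256-byte free block, which is the merging step at work.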

dcodeIO · 2019-04-16T15:34:42Z

To try it out, run npm run build && npm run server in tests/runtime. Not really polished yet, though, but helped me to find some issues.

dcodeIO · 2019-04-18T06:00:32Z

Ok I think the allocator should be fairly robust now, but feel free to play with the visualizer and break it. Let me know if you somehow manage to :)

Now thinking how to visualize RC. Maybe similar to the fuzz stuff on the purerc repo, with buttons to allocate objects and arrays of various sizes and dragging/dropping them or pushing retain/release buttons - but not sure if that's worth it and if I should instead wire stuff into the compiler right away.

dcodeIO · 2019-04-18T10:17:01Z

The idea now is that some sort of runtime must be present in every program in the future, be it our TLSF/PureRC runtime or something custom. The difference from before is that it's no longer the individual allocators and collectors that are pluggable but the runtime as a whole, which will be fixed to ARC (no tracing anymore because the cognitive and code overhead simply isn't worth it imo). It'll still be possible to have a stub runtime without a GC and just the arena allocator, of course, as long as it implements the RT interface.

Some info on the proposed interface is here.
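
The linked interface description is not reproduced in this thread, so the following is only a hypothetical shape of what "a stub runtime that implements the RT interface" could look like, written in plain TypeScript. The interface name `Runtime`, its three methods, the starting address and the 16-byte alignment are all assumptions for illustration.

```typescript
// Hypothetical shape of a pluggable runtime; names are illustrative only.
interface Runtime {
  alloc(size: number, classId: number): number;
  retain(ptr: number): number;
  release(ptr: number): void;
}

// Stub runtime: arena (bump) allocation only, nothing is ever freed.
class StubRuntime implements Runtime {
  private top = 16; // first usable address, chosen arbitrarily

  alloc(size: number, _classId: number): number {
    const ptr = this.top;
    this.top = (ptr + size + 15) & ~15; // 16-byte alignment, as an assumption
    return ptr;
  }

  // Reference counting calls are accepted but do nothing.
  retain(ptr: number): number { return ptr; }
  release(_ptr: number): void {}
}
```

Code compiled against such an interface would not need to know whether retain/release actually count anything, which is what makes the runtime swappable as a unit.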

dcodeIO · 2019-05-09T14:08:24Z

As you might have guessed, rethinking/rewriting this stuff over and over again turned out to be somewhat exhausting on my end. Current state is that I'm looking at completely red code that came from removing the old memory allocators, garbage collectors and previous runtime efforts, while trying to wrap my head around how reference counting (here: automatic insertion of runtime calls like retain/release) can integrate with stdlib which sometimes needs to use these interfaces manually by means of changetype. Not being able to test anything before everything is at least somewhat in place makes this quite a challenge.

dcodeIO · 2019-05-12T10:21:55Z

Quick update: Code isn't red anymore, integrates with updated stdlib and uses the new runtime API, but still lacks some of the RC integration when it comes to locals. Essentially, whenever a reference is provided as a function argument, it must be retained pre-call and released post-call (possibly triggering free), plus I still have to investigate retain/release calls on locals when assigning references between them. From what I've learned so far, much of the work on Swift, which uses ARC, revolves around static elimination of retain/release calls where refcount can be statically proven to remain unchanged, because each increment and decrement may involve a cache miss. From my current perspective, getting something working out the door first has priority ofc.
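
The retain-before, release-after rule for arguments can be modelled roughly like this in plain TypeScript. Reference counts live in a Map rather than in an object header, pointers are plain numbers, and `retain`, `release`, `takeRef` and `callTakeRef` are hypothetical stand-ins for what the compiler would insert; none of this is the actual runtime API.

```typescript
const refCounts = new Map<number, number>(); // ptr -> refcount
const freed: number[] = [];                  // records pointers that were freed

function retain(ptr: number): number {
  refCounts.set(ptr, (refCounts.get(ptr) ?? 0) + 1);
  return ptr;
}

function release(ptr: number): void {
  const rc = (refCounts.get(ptr) ?? 0) - 1;
  if (rc <= 0) {
    // Last reference gone: this is where free would be triggered.
    refCounts.delete(ptr);
    freed.push(ptr);
  } else {
    refCounts.set(ptr, rc);
  }
}

// The callee only observes the count while the call is in flight.
function takeRef(ptr: number): number {
  return refCounts.get(ptr) ?? 0;
}

// What the compiler would conceptually emit for `takeRef(obj)`:
function callTakeRef(obj: number): number {
  retain(obj);               // pre-call
  const seen = takeRef(obj);
  release(obj);              // post-call, possibly triggering free
  return seen;
}
```

An object with no other owner is freed right after the call, while one that is already retained elsewhere simply returns to its previous count. Every such pair costs an increment and a decrement, which is why static elimination matters.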

dcodeIO · 2019-05-14T07:38:37Z

So I've come to the conclusion that whenever I try to be smart about avoiding unnecessary retain/release calls, this tends to result in situations that can't be solved without further code analysis down the road, which we don't do but usually ask Binaryen to do after transforming to SSA form and similar. Currently looks like the compiler itself must be super paranoid about everything reference counting at first and then rely on optimization passes anyway. Then doing it the paranoid way leads to the return value problem again, where since we don't have a stack there must be some mechanism to make sure that these don't become released before the caller possibly retains them. I think Obj-C calls this autorelease pools or something, need to investigate.

dcodeIO · 2019-05-14T14:11:16Z

Turns out I might actually be able to be smarter than expected about this, completely eliminating retains/releases of locals and avoiding the need for autorelease pools, by using special temp locals with an AUTORELEASE property. Chances are that I'm overlooking something again, of course, but looks promising:

class Ref {}

// Returning a reference first retains it on return and then
// is tracked by the caller's surrounding scope:

function returnRef(): Ref {
  return /* __retain( */ changetype<Ref>(0) /* ) */;
}

export function testReturnRef(): void {
  /* TEMP = */ returnRef();
  // __release(TEMP)
}

// Taking a reference doesn't insert anything. If there was an
// allocation, it would be handled by the caller.

function takeRef(ref: Ref): void {
}

export function testTakeRef(): void {
  takeRef(changetype<Ref>(0));
}

// Allocating a reference keeps track of it in the surrounding
// scope and releases it on exit.

export function newRef(): void {
  /* TEMP = */ new Ref();
  // __release(TEMP)
}

// Assigning a reference to a global retains it, releasing the old value.

var glo: Ref;

export function assignGlobal(): void {
  glo = /* __retainRelease( */ changetype<Ref>(0) /* , glo) */;
}

// Assigning a reference to a field retains it, releasing the old value.

class Target { fld: Ref; }

export function assignField(): void {
  changetype<Target>(0).fld = /* __retainRelease( */ changetype<Ref>(0) /* , fld) */;
}
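
For reference, the __retainRelease(new, old) helper named in the comments above could behave roughly as in the following plain TypeScript model. The semantics shown (skip when both values are identical, otherwise retain the new value before releasing the old one, treat 0 as null) are an assumption for illustration, as are the names `counts`, `incRef`, `decRef` and `retainRelease`.

```typescript
// ptr -> refcount, a stand-in for a refcount field in the object header
const counts = new Map<number, number>();

function incRef(ptr: number): void {
  if (ptr !== 0) counts.set(ptr, (counts.get(ptr) ?? 0) + 1);
}

function decRef(ptr: number): void {
  if (ptr === 0) return; // null reference, nothing to release
  const rc = (counts.get(ptr) ?? 0) - 1;
  if (rc <= 0) counts.delete(ptr); // would free the object here
  else counts.set(ptr, rc);
}

// Assumed semantics: retain the new value first, then release the old one,
// and do nothing when the same reference is assigned again.
function retainRelease(newPtr: number, oldPtr: number): number {
  if (newPtr !== oldPtr) {
    incRef(newPtr);
    decRef(oldPtr);
  }
  return newPtr;
}

let glo = 0;                  // null reference
glo = retainRelease(8, glo);  // object at 8 now has one owner
glo = retainRelease(8, glo);  // same value, counts are untouched
glo = retainRelease(16, glo); // 16 gains an owner, 8 loses its last one
```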

Remains the question: What am I missing this time? :)

Edit: Doesn't work, again.

dcodeIO · 2019-05-26T01:08:26Z

Closing in favor of #592

dcodeIO added 18 commits March 8, 2019 22:04

Runtime experiment

80fb450

fix

878ee3f

simplify

979a0b8

simplify more

dd5430a

design

911a4bb

integrate

0ad9d56

make it test again

4f1a971

test a few things

0c537c3

refactor

661e239

refactor

5c25b0c

what std/string would look like

5a2ab3d

unsafe, stub

cb77760

fix

85b740d

update std/string

f076826

unshiftify padEnd

ce82e54

untrampoline padEnd

ebe92c6

same for padStart

1198e71

arraybufferview

d9a5681

it's all broken

146dfdb

MaxGraey reviewed Mar 11, 2019

dcodeIO added 9 commits March 12, 2019 02:14

I'm lost

8e95867

slowly but steadily

36d54d6

more

e78c4c7

dataview

37d361b

more

707f2da

fix

e581f25

hmm

6f70826

more

e38f627

take a step back

6163a73

dcodeIO added 3 commits April 15, 2019 12:41

backport tlsf fixes, add asrt allocator test

586ca8b

16b alignment, cleanup

aee3a3e

implications of 16b alignment

ceffc18

asrt debugger, fixes

3e08e5c

dcodeIO added 3 commits April 16, 2019 19:15

crunch functions

0ba0d8a

seems useful to keep the number of runtime functions in a user's binary to a minimum

polish

504e207

realloc on mm level

ffdda4b

refactor into stdlib

8216cf3

dcodeIO added 4 commits April 18, 2019 12:53

unify, stub impl

18c3f0c

more

2b0a165

fix export star module exports, shiftify

2dec529

load/store Block

dd2bdd0

jtenner mentioned this pull request May 8, 2019

Refactor unsafe #477

Closed

MaxGraey mentioned this pull request May 10, 2019

radix in toString #586

Closed

dcodeIO closed this May 26, 2019

dcodeIO deleted the runtime branch September 20, 2019 09:06
