0.9.5
More Performance
- The core dispatch loop is 60% faster
- GC instructions are roughly an order of magnitude faster (10x speedup in local tests)
- Most instructions have been dialled in towards their theoretical minimum execution time given the current architecture
Removal of garbage collection during invocation
Allocations from GC instructions are now deallocated together with the store, rather than at runtime when they fall off the stack. This was done to allow pushing values to the stack without boxing, which was slowing the runtime down considerably. I may introduce a lightweight GC or reference counting solution in the future, but the majority of workloads run on wasm at present are small and bursty, and in these scenarios it's fine to sweep the memory after invocation.
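
As a rough illustration of the lifetime change (a conceptual sketch, not the runtime's actual code), the model below keeps every GC allocation in a store and puts only plain integer handles on the value stack. The names `Store`, `alloc`, `sweep` and `invoke` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass, field


@dataclass
class Store:
    """Hypothetical store that owns every GC allocation made during an invocation."""

    heap: list = field(default_factory=list)

    def alloc(self, fields):
        # Allocate an object in the store and return a plain integer handle to it.
        self.heap.append(list(fields))
        return len(self.heap) - 1

    def sweep(self):
        # Release every allocation at once and report how many were freed.
        freed = len(self.heap)
        self.heap.clear()
        return freed


def invoke(store):
    # The value stack holds bare handles only; nothing on it tracks object lifetime.
    stack = []
    for i in range(3):
        stack.append(store.alloc([i + 1, (i + 1) * 10]))

    # Popping handles off the stack does not free the objects they refer to.
    stack.pop()
    stack.pop()
    live_after_pops = len(store.heap)

    # The remaining handle still resolves to its object in the store.
    top = stack.pop()
    result = store.heap[top][1]
    return result, live_after_pops
```

Under this model the caller runs `invoke(store)` and then calls `store.sweep()` (or simply drops the store) once the invocation has returned. Memory use grows for the duration of a call, which is the trade-off accepted above for small, bursty workloads.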