- May 17, 2019
-
-
Paul Sokolovsky authored
For modules I initially created or made substantial contributions to.
-
- Dec 20, 2018
-
-
Paul Sokolovsky authored
The older "bool has_finaliser" gets recast as GC_ALLOC_FLAG_HAS_FINALISER=1 so this is a backwards compatible change to the signature. Since bool gets implicitly converted to 1 this patch doesn't include conversion of all calls.
-
- Aug 14, 2018
-
-
Damien George authored
Otherwise there is the possibility that n_free starts out non-zero from the previous iteration, which may have found a few (but not enough) free blocks at the end of the heap. If this is the case, and if the very first blocks that are scanned the second time around (starting at gc_last_free_atb_index) are found to give enough memory (including the blocks at the end of the heap from the previous iteration that left n_free non-zero), then memory will be allocated starting before the location that gc_last_free_atb_index points to, most likely leading to corruption.

This serious bug did not manifest itself in the past because a gc_collect always resets gc_last_free_atb_index to point to the start of the GC heap, and the first block there is almost always allocated to a long-lived object (eg entries from sys.path, or mounted filesystem objects), which means that n_free would be reset at the start of the search loop.

But with threading enabled and the GIL disabled it is possible to trigger the bug via the following sequence of events:
1. Thread A runs gc_alloc, fails to find enough memory, and has a non-zero n_free at the end of the search.
2. Thread A calls gc_collect and frees a bunch of blocks on the GC heap.
3. Just after gc_collect finishes in thread A, thread B takes gc_mutex and does an allocation, moving gc_last_free_atb_index to point to the interior of the heap, to a place where there is most likely a run of available blocks.
4. Thread A regains gc_mutex and does its second search for free memory, starting with a non-zero n_free. Since it's likely that the first block it searches is available it will allocate memory which overlaps with the memory before gc_last_free_atb_index.
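A simplified sketch of the fix, using hypothetical helper names (block_is_free, block_to_ptr) rather than the real ATB macros: the free-run counter is reset at the start of every search pass, so blocks left over from a failed previous pass can never be counted.

    #include <stddef.h>
    #include <stdbool.h>

    bool block_is_free(size_t block);        // hypothetical ATB accessor
    void *block_to_ptr(size_t first_block);  // hypothetical block->pointer helper

    // One search pass over the allocation table. The key point from this
    // commit: n_free starts at 0 for every pass, so a partial run of free
    // blocks found at the end of the heap in a previous (failed) pass can't
    // be merged with blocks found at the start of this pass.
    void *find_free_run(size_t start_block, size_t total_blocks, size_t n_needed) {
        size_t n_free = 0;
        for (size_t i = start_block; i < total_blocks; i++) {
            if (block_is_free(i)) {
                if (++n_free == n_needed) {
                    return block_to_ptr(i + 1 - n_needed);
                }
            } else {
                n_free = 0;
            }
        }
        return NULL; // caller may gc_collect() and retry the search
    }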
-
- Aug 02, 2018
-
-
Damien George authored
DEBUG_printf and MICROPY_DEBUG_PRINTER are now used instead of normal printf, and a fault is fixed in mp_obj_class_lookup with debugging enabled; see issue #3999. Debugging can now be enabled on all ports, including when nan-boxing is used.
-
- Jun 12, 2018
-
-
Damien George authored
This patch adds the gc_sweep_all() function, which does a garbage collection without tracing any root pointers, so it frees all the memory and, most importantly, runs any remaining finalisers. This helps primarily for soft reset: it will close any open files and sockets, and help to get the system back to a clean state.
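Conceptually (a simplified sketch, with helper names modelled on the collection start/end split but not the real implementation), a "sweep all" is a collection cycle whose mark phase traces nothing, so every block is treated as unreachable and its finaliser runs:

    void gc_collect_start(void);  // begin a collection cycle
    void gc_collect_end(void);    // sweep unmarked blocks, running finalisers

    void gc_sweep_all_sketch(void) {
        gc_collect_start();
        // deliberately mark nothing: no root pointers are traced
        gc_collect_end();  // so every live block is swept and finalised
    }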
-
- May 21, 2018
-
-
Damien George authored
Without this, if the GC threshold is hit and there is not enough memory left to satisfy the request, gc_collect() will run a second time, and the search for memory will happen again and fail again. Thanks to @adritium for pointing out this issue, see #3786.
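A simplified sketch of the idea (hypothetical helper names, not the real gc_alloc code): remember whether this allocation has already run a collection, and don't run a second one that is bound to find nothing new.

    #include <stddef.h>
    #include <stdbool.h>

    void *try_alloc(size_t n_bytes);   // hypothetical: search the heap once
    void gc_collect(void);
    bool threshold_reached(void);      // hypothetical: GC threshold check

    void *alloc_sketch(size_t n_bytes) {
        bool collected = false;
        if (threshold_reached()) {
            gc_collect();          // threshold collection before searching
            collected = true;
        }
        void *p = try_alloc(n_bytes);
        if (p == NULL && !collected) {
            gc_collect();          // collect at most once per allocation
            p = try_alloc(n_bytes);
        }
        return p;
    }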
-
- May 13, 2018
-
-
Damien George authored
This patch moves the start of the root pointer section in mp_state_ctx_t so that it skips entries that are not pointers and don't need scanning.

Previously, the start of the root pointer section was at the very beginning of the mp_state_ctx_t struct (which is the beginning of mp_state_thread_t). This was because the original assembler version of the NLR code was hard-coded to have the nlr_top pointer at the start of this state structure. But now that the NLR code is partially written in C there is no longer this restriction on the location of nlr_top (and a comment to this effect has been removed in this patch).

So now the root pointer section starts part way through the mp_state_thread_t structure, after the entries which are not root pointers. This patch also moves the non-pointer entries for MICROPY_ENABLE_SCHEDULER outside the root pointer section.

Moving non-pointer entries out of the root pointer section helps to make the GC more precise and should help to prevent some cases of collectable garbage being kept. This patch also gives a measurable improvement in performance of the pystone.py benchmark: on unix x86-64 and stm32 there was an improvement of roughly 0.6% (tested with both gcc 7.3 and gcc 8.1).
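A simplified illustration of the layout idea (field names are made up, not the real mp_state_ctx_t): non-pointer fields sit before the root pointer section, and only the section itself is passed to the root scan.

    #include <stddef.h>

    typedef struct _state_sketch_t {
        size_t stack_top;    // not a heap pointer: excluded from scanning
        size_t sched_state;  // not a heap pointer: excluded from scanning
        void *dict_locals;   // root pointer section starts here
        void *dict_globals;
        void *loaded_modules;
    } state_sketch_t;

    void gc_collect_root(void **ptrs, size_t len);  // as declared in py/gc.h

    void scan_roots(state_sketch_t *s) {
        // Trace only from the first real root pointer to the end of the
        // struct, skipping the non-pointer fields above it.
        gc_collect_root((void **)&s->dict_locals,
            (sizeof(state_sketch_t) - offsetof(state_sketch_t, dict_locals)) / sizeof(void *));
    }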
-
- Feb 19, 2018
-
-
Damien George authored
-
Ayke van Laethem authored
This saves a bit in code size, and saves some precious .bss RAM:

                       .text  .bss
    minimal CROSS=1:     -28    -4
    unix (64-bit):       -64    -8
-
Ayke van Laethem authored
This saves a bit in code size:

    minimal CROSS=1:  -44
    unix:             -96
-
Ayke van Laethem authored
This macro is written out explicitly in the two locations that it is used and then the code is optimised, opening possibilities for further optimisations and reducing code size:

    unix:             -48
    minimal CROSS=1:  -32
    stm32:            -32
-
- Dec 11, 2017
-
-
Damien George authored
This patch introduces the MICROPY_ENABLE_PYSTACK option (disabled by default) which enables a "Python stack" that allows memory to be allocated and freed in a scoped, or Last-In-First-Out (LIFO), way, similar to alloca().

A new memory allocation API is introduced along with this Py-stack. It includes both "local" and "nonlocal" LIFO allocation. Local allocation is intended to be equivalent to using alloca(), whereby the same function must free the memory. Nonlocal allocation is where another function may free the memory, so long as it's still LIFO.

Follow-up patches will convert all uses of alloca() and VLA to the new scoped allocation API. The old behaviour (using alloca()) will still be available, but when MICROPY_ENABLE_PYSTACK is enabled then alloca() is no longer required or used.

The benefits of enabling this option are (or will be once subsequent patches are made to convert alloca()/VLA):
- Toolchains without alloca() can use this feature to obtain correct and efficient scoped memory allocation (compared to using the heap instead of alloca(), which is slower).
- Even if alloca() is available, enabling the Py-stack gives slightly more efficient use of stack space when calling nested Python functions, due to the way that compilers implement alloca().
- Enabling the Py-stack with the stackless mode allows for even more efficient stack usage, as well as retaining high performance (because the heap is no longer used to build and destroy stackless code states).
- With Py-stack and stackless enabled, Python-calling-Python is no longer recursive in the C mp_execute_bytecode function.

The micropython.pystack_use() function is included to measure usage of the Python stack.
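A toy sketch of the LIFO allocation idea (not the actual Py-stack API or implementation): allocation bumps a pointer into a fixed buffer, and freeing must happen in reverse order, like alloca() but without using the C stack.

    #include <stddef.h>
    #include <stdint.h>

    static uint8_t pystack_buf[4096];
    static uint8_t *pystack_cur = pystack_buf;

    void *pystack_alloc_sketch(size_t n_bytes) {
        n_bytes = (n_bytes + 7) & ~(size_t)7;  // keep 8-byte alignment
        if (pystack_cur + n_bytes > pystack_buf + sizeof(pystack_buf)) {
            return NULL;  // the real code would raise an exception instead
        }
        void *p = pystack_cur;
        pystack_cur += n_bytes;  // bump allocation: O(1), no fragmentation
        return p;
    }

    void pystack_free_sketch(void *p) {
        // LIFO requirement: p must be the most recently allocated live chunk,
        // so freeing simply rewinds the current pointer.
        pystack_cur = (uint8_t *)p;
    }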
-
- Dec 08, 2017
-
-
Paul Sokolovsky authored
Or it will be truncated on a 64-bit platform.
-
Paul Sokolovsky authored
To allow it to be overridden more easily for custom tracing.
-
- Dec 07, 2017
-
-
Paul Sokolovsky authored
Accessing them will crash immediately, instead of still working for some time until overwritten by some other data, which leads to much less deterministic crashes.
-
- Nov 29, 2017
-
-
Damien George authored
These checks are assumed to be true in all cases where gc_realloc is called with a valid pointer, so no need to waste code space and time checking them in a non-debug build.
-
- Oct 04, 2017
-
-
Damien George authored
Header files that are considered internal to the py core and should not normally be included directly are:

    py/nlr.h - internal nlr configuration and declarations
    py/bc0.h - contains bytecode macro definitions
    py/runtime0.h - contains basic runtime enums

Instead, the top-level header files to include are one of:

    py/obj.h - includes runtime0.h and defines everything to use the mp_obj_t type
    py/runtime.h - includes mpstate.h and hence nlr.h, obj.h, runtime0.h, and defines everything to use the general runtime support functions

Additional, specific headers (eg py/objlist.h) can be included if needed.
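For example (assuming a file inside the MicroPython source tree), the preferred include pattern after this change looks like:

    // Preferred: pull in a top-level header, which brings the internal ones
    // with it (runtime.h includes mpstate.h, and hence nlr.h, obj.h, runtime0.h).
    #include "py/runtime.h"

    // A specific header can still be added when its declarations are needed:
    #include "py/objlist.h"

    // Avoid including the internal headers directly:
    //   #include "py/runtime0.h"
    //   #include "py/bc0.h"
    //   #include "py/nlr.h"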
-
- Aug 15, 2017
-
-
Stefan Naumann authored
It enables all the DEBUG_printf outputs in the py/ source code.
-
- Jul 31, 2017
-
-
Alexander Steffen authored
There were several different spellings of MicroPython present in comments, when there should be only one.
-
- Jul 12, 2017
-
-
Damien George authored
gc_free() expects either NULL or a valid pointer into the heap, so the checks for a valid pointer can be turned into assertions.
-
- Apr 12, 2017
-
-
Damien George authored
If a finaliser raises an exception then it must not propagate through the GC sweep function. This patch protects against such a thing by running finaliser code via the mp_call_function_1_protected call. This patch also adds scheduler lock/unlock calls around the finaliser execution to further protect against any possible reentrancy issues: the memory manager is already locked when doing a collection, but we also don't want to allow any scheduled code to run, KeyboardInterrupts to interrupt the code, nor threads to switch.
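A sketch of the protection pattern (simplified, not the exact sweep code; assumes the MicroPython source tree, and the scheduler lock calls are only available when MICROPY_ENABLE_SCHEDULER is on):

    #include "py/runtime.h"

    static void run_finaliser_protected(mp_obj_t obj) {
        mp_obj_t dest[2];
        mp_load_method_maybe(obj, MP_QSTR___del__, dest);
        if (dest[0] != MP_OBJ_NULL) {
            // Stop scheduled callbacks and thread switches while the
            // finaliser runs, on top of the GC lock already held.
            mp_sched_lock();
            // Any exception raised by __del__ is caught and reported here,
            // rather than propagating out of the GC sweep.
            mp_call_function_1_protected(dest[0], dest[1]);
            mp_sched_unlock();
        }
    }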
-
- Aug 26, 2016
-
-
Damien George authored
There can be stray pointers in memory blocks that are not properly zero'd after allocation. This patch adds a new config option to always zero all allocated memory (via gc_alloc and gc_realloc) and hence help to eliminate stray pointers. See issue #2195.
-
- Jul 20, 2016
-
-
Paul Sokolovsky authored
Currently, MicroPython runs the GC when it could not allocate a block of memory, which happens when the heap is exhausted. However, that policy can't work well with "infinite" heaps, e.g. backed by virtual memory - there will be a lot of swap thrashing long before the virtual memory is exhausted. Instead, in such cases an "allocation threshold" policy is used: a GC is run after some number of allocations have been made. Details vary; for example, the number or total amount of allocations can be used, the threshold may be self-adjusting based on GC outcome, etc.

This change implements a simple variant of such a policy for MicroPython. The amount of memory allocated so far is used for the threshold, to make it useful for the typical finite-size, and small, heaps used with MicroPython ports. And such a GC policy is indeed useful for these types of heaps too, as it allows fragmentation to be controlled better. For example, if the threshold is set to half the size of the heap, then for an application which usually makes a big number of small allocations, that will (try to) keep half of the heap memory in a nice defragmented state for an occasional large allocation. For an application which doesn't exhibit such behaviour, there won't be any visible effects, except for the GC running more frequently, which however may affect performance.

To address this, the GC threshold is configurable, and by default is off so far. It's configured with a gc.threshold(amount_in_bytes) call (and can be queried by calling it without an argument).
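A simplified sketch of the threshold policy (illustrative names, not the real allocator): keep a running count of memory allocated since the last collection and collect once it crosses the configured threshold.

    #include <stddef.h>

    static size_t gc_alloc_amount;     // allocated since the last collection
    static size_t gc_alloc_threshold;  // 0 means the threshold policy is off

    void gc_collect(void);
    void *heap_try_alloc(size_t n_bytes);  // hypothetical plain heap search

    void *alloc_with_threshold(size_t n_bytes) {
        if (gc_alloc_threshold != 0 && gc_alloc_amount >= gc_alloc_threshold) {
            gc_collect();          // collect early, before the heap is full
            gc_alloc_amount = 0;
        }
        void *p = heap_try_alloc(n_bytes);
        if (p != NULL) {
            gc_alloc_amount += n_bytes;
        }
        return p;
    }

From Python code the threshold introduced by this change is set with gc.threshold(amount_in_bytes), for example half the heap size, and read back by calling gc.threshold() with no argument.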
-
- Jun 30, 2016
-
-
Paul Sokolovsky authored
Like the maximum allocated block size, it's reported in allocation units (not bytes).
-
Paul Sokolovsky authored
Previously, if there was a chain of allocated blocks ending with the last block of the heap, it wasn't included in the 1/2-block or max block size stats.
-
- Jun 28, 2016
-
-
Damien George authored
There is no need since the GIL already makes gc and qstr operations atomic.
-
Damien George authored
GC_EXIT() can cause a pending thread (waiting on the mutex) to be scheduled right away. This other thread may trigger a garbage collection. If the pointer to the newly-allocated block (allocated by the original thread) is not computed before the switch (so it's just left as a block number) then the block will be wrongly reclaimed. This patch makes sure the pointer is computed before allowing any thread switch to occur.
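A simplified sketch of the ordering fix (hypothetical helper names; gc_mutex_exit() stands in for GC_EXIT()):

    #include <stddef.h>

    void *block_to_pointer(size_t block);  // hypothetical block->pointer helper
    void gc_mutex_exit(void);              // stands in for GC_EXIT()

    void *finish_alloc_sketch(size_t start_block) {
        // Compute the pointer while the GC mutex is still held: once it is a
        // real pointer (reachable on this thread's stack) another thread's
        // collection can see it as a root and won't reclaim the block.
        void *ret_ptr = block_to_pointer(start_block);
        gc_mutex_exit();  // only now may another thread run and collect
        return ret_ptr;
    }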
-
Damien George authored
-
Damien George authored
By using a single, global mutex, all memory-related functions (alloc, free, realloc, collect, etc) are made thread safe. This means that only one thread can be in such a function at any one time.
-
Damien George authored
-
- May 12, 2016
-
-
Paul Sokolovsky authored
The address printed was truncated anyway and in general confusing to an outsider. A line which dumps it is still left in the source, commented out, for peculiar cases when it may be needed (e.g. when running under a debugger).
-
Paul Sokolovsky authored
'=' is a pretty natural character for a tail, and gives a less dense picture where it's easier to see what object types are actually there.
-
- May 11, 2016
-
-
Paul Sokolovsky authored
-
Paul Sokolovsky authored
These are typical consumers of large chunks of memory, so it's useful to see at least their number (how much memory they use isn't clearly shown, as the data for these objects is allocated elsewhere).
-
- Dec 27, 2015
-
-
Paul Sokolovsky authored
Previously, mark operations weren't logged at all, while it's quite useful to see the cascade of marks in case of over-marking (and in other cases too). Previously, a sweep was logged for each block of an object in memory, but that doesn't make much sense and just led to longer output which is harder for a human to parse. Instead, log a sweep only once per object. This is similar to other memory manager operations, e.g. an object is allocated, then freed; or an object is allocated, then marked, otherwise swept (one log entry per operation, with the same memory address in each case).
-
- Dec 18, 2015
-
-
Damien George authored
Ideally we'd use %zu for size_t args, but that's unlikely to be supported by all runtimes, and we would then need to implement it in mp_printf. So the simplest and most portable option is to use %u and cast the argument to uint (= unsigned int). Note: the reason for the change is that UINT_FMT can be %llu (sized for mp_uint_t), which is wider than size_t and prints incorrect results.
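For example (assuming the MicroPython tree, with mp_printf and mp_plat_print from py/mpprint.h):

    #include <stddef.h>
    #include "py/mpprint.h"

    static void print_total(size_t total_bytes) {
        // Portable: cast size_t to unsigned int (uint) and print with %u,
        // since %zu isn't supported by mp_printf and UINT_FMT may be wider
        // than size_t.
        mp_printf(&mp_plat_print, "total: %u bytes\n", (unsigned int)total_bytes);
    }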
-
- Dec 17, 2015
-
-
Damien George authored
size_t is the correct type to use to count things related to the size of the address space. Using size_t (instead of mp_uint_t) is important for the efficiency of ports that configure mp_uint_t to larger than the machine word size.
-
Damien George authored
-
Damien George authored
The GC should search for pointers within the heap. This patch makes a difference when an object is larger than a pointer (eg 64-bit NaN boxing).
-
- Dec 02, 2015
-
-
Paul Sokolovsky authored
-