Commits · f64a3e296e1c696b1c8a09bac68f7e075f419c8b · card10 / micropython

Mar 23, 2017
- py/lexer: Remove obsolete comment, since lexer can now raise exceptions. · f64a3e29
  Damien George authored 8 years ago
  
  f64a3e29
Mar 14, 2017

py: Allow lexer to raise exceptions during construction. · 1831034b

Damien George authored 8 years ago

This patch refactors the error handling in the lexer, to simplify it (ie
reduce code size).

A long time ago, when the lexer/parser/compiler were first written, the
lexer and parser were designed so they didn't use exceptions (ie nlr) to
report errors but rather returned an error code.  Over time that has
gradually changed, the parser in particular has more and more ways of
raising exceptions.  Also, the lexer never really handled all errors without
raising, eg there were some memory errors which could raise an exception
(and in these rare cases one would get a fatal nlr-not-handled fault).

This patch accepts the fact that the lexer can raise exceptions in some
cases and allows it to raise exceptions to handle all its errors, which are
for the most part just out-of-memory errors during construction of the
lexer.  This makes the lexer a bit simpler, and also the persistent code
stuff is simplified.

What this means for users of the lexer is that calls to it must be wrapped
in a nlr handler.  But all uses of the lexer already have such an nlr
handler for the parser (and compiler) so that doesn't put any extra burden
on the callers.

1831034b

Feb 17, 2017

py/lexer: Convert mp_uint_t to size_t where appropriate. · 5124a940
Damien George authored 8 years ago

5124a940

py: Do adjacent str/bytes literal concatenation in lexer, not compiler. · 534b7c36

Damien George authored 8 years ago

It's much more efficient in RAM and code size to do implicit literal string
concatenation in the lexer, as opposed to the compiler.

RAM usage is reduced because the concatenation can be done right away in the
tokeniser by just accumulating the string/bytes literals into the lexer's
vstr.  Prior to this patch adjacent strings/bytes would create a parse tree
(one node per string/bytes) and then in the compiler a whole new chunk of
memory was allocated to store the concatenated string, which used more than
double the memory compared to just accumulating in the lexer.

This patch also significantly reduces code size:

bare-arm: -204
minimal:  -204
unix x64: -328
stmhal:   -208
esp8266:  -284
cc3200:   -224

534b7c36

py/lexer: Simplify handling of line-continuation error. · 773278ec

Damien George authored 8 years ago

Previous to this patch there was an explicit check for errors with line
continuation (where backslash was not immediately followed by a newline).

But this check is not necessary: if there is an error then the remaining
logic of the tokeniser will reject the backslash and correctly produce a
syntax error.

773278ec

py/lexer: Use strcmp to make keyword searching more efficient. · ae436797

Damien George authored 8 years ago

Since the table of keywords is sorted, we can use strcmp to do the search
and stop part way through the search if the comparison is less-than.

Because all tokens that are names are subject to this search, this
optimisation will improve the overall speed of the lexer when processing
a script.

The change also decreases code size by a little bit because we now use
strcmp instead of the custom str_strn_equal function.

ae436797

Feb 16, 2017
- py/lexer: Move check for keyword to name-tokenising block. · a68c7546
  Damien George authored 8 years ago
  
  Keywords only needs to be searched for if the token is a MP_TOKEN_NAME, so we can move the seach to the part of the code that does the tokenising for MP_TOKEN_NAME.
  a68c7546
- py/lexer: Simplify handling of indenting of very first token. · 98b3072d
  Damien George authored 8 years ago
  
  98b3072d
- py/lexer: Don't generate string representation for period or ellipsis. · c2644147
  Damien George authored 8 years ago
  
  It's not needed.
  c2644147
Jan 30, 2017
- extmod/vfs_fat: Remove MICROPY_READER_FATFS component. · 8beba731
  Damien George authored 8 years ago
  
  8beba731
Jan 27, 2017

extmod: Add generic VFS sub-system. · dcb9ea72

Damien George authored 8 years ago

This provides mp_vfs_XXX functions (eg mount, open, listdir) which are
agnostic to the underlying filesystem type, and just require an object with
the relevant filesystem-like methods (eg .mount, .open, .listidr) which can
then be mounted.

These mp_vfs_XXX functions would typically be used by a port to implement
the "uos" module, and mp_vfs_open would be the builtin open function.

This feature is controlled by MICROPY_VFS, disabled by default.

dcb9ea72

Dec 21, 2016

py/lexer: Permanently disable the mp_lexer_show_token function. · c305ae32

Damien George authored 8 years ago

The lexer is very mature and this debug function is no longer used.  If
it's really needed one can uncomment it and recompile.

c305ae32

py/lexer: Remove unnecessary check for EOF in lexer's next_char func. · f4aebafe

Damien George authored 8 years ago

This check always fails (ie chr0 is never EOF) because the callers of this
function never call it past the end of the input stream. And even if they
did it would be harmless because 1) reader.readbyte must continue to
return an EOF char if the stream is exhausted; 2) next_char would just
count the subsequent EOF's as characters worth 1 column.

f4aebafe

py/lexer: Remove unreachable code in string tokeniser. · b9c47832
Damien George authored 8 years ago

b9c47832
tests/basics/lexer: Add a test for newline-escaping within a string. · adccafb4
Damien George authored 8 years ago

adccafb4

Nov 16, 2016
- py/lexer: Make lexer use an mp_reader as its source. · 5bdf1650
  Damien George authored 8 years ago
  
  5bdf1650
- py/lexer: Rewrite mp_lexer_new_from_fd in terms of mp_reader. · 66d955c2
  Damien George authored 8 years ago
  
  66d955c2
- py/lexer: Provide generic mp_lexer_new_from_file based on mp_reader. · e5ef15a9
  Damien George authored 8 years ago
  
  If a port defines MICROPY_READER_POSIX or MICROPY_READER_FATFS then lexer.c now provides an implementation of mp_lexer_new_from_file using the mp_reader_new_file function.
  e5ef15a9
- py/lexer: Rewrite mp_lexer_new_from_str_len in terms of mp_reader_mem. · 511c0838
  Damien George authored 8 years ago
  
  511c0838
Oct 12, 2016

py/lexer: Remove unnecessary code, and unreachable code. · 31101d91

Damien George authored 8 years ago

Setting emit_dent=0 is unnecessary because arriving in that part of the
if-logic will guarantee that emit_dent is already zero.

The block to check indent_top(lex)>0 is unreachable because a newline is
always inserted an the end of the input stream, and hence dedents are
always processed before EOF.

31101d91

Sep 19, 2016

py/vstr: Remove vstr.had_error flag and inline basic vstr functions. · 5da0d29d

Damien George authored 8 years ago

The vstr.had_error flag was a relic from the very early days which assumed
that the malloc functions (eg m_new, m_renew) returned NULL if they failed
to allocate.  But that's no longer the case: these functions will raise an
exception if they fail.

Since it was impossible for had_error to be set, this patch introduces no
change in behaviour.

An alternative option would be to change the malloc calls to the _maybe
variants, which return NULL instead of raising, but then a lot of code
will need to explicitly check if the vstr had an error and raise if it
did.

The code-size savings for this patch are, in bytes: bare-arm:188,
minimal:456, unix(NDEBUG,x86-64):368, stmhal:228, esp8266:360.

5da0d29d

May 20, 2016

py: Declare constant data as properly constant. · 3ff16ff5

Damien George authored 8 years ago

Otherwise some compilers (eg without optimisation) will put this read-only
data in RAM instead of ROM.

3ff16ff5

Apr 13, 2016

py: add async/await/async for/async with syntax · 81ebba7e

pohmelie authored 9 years ago

They are sugar for marking function as generator, "yield from"
and pep492 python "semantically equivalents" respectively.

@dpgeorge was the original author of this patch, but @pohmelie made
changes to implement `async for` and `async with`.

81ebba7e

Feb 25, 2016

py: Add MICROPY_DYNAMIC_COMPILER option to config compiler at runtime. · ea235204

Damien George authored 9 years ago

This new compile-time option allows to make the bytecode compiler
configurable at runtime by setting the fields in the mp_dynamic_compiler
structure.  By using this feature, the compiler can generate bytecode
that targets any MicroPython runtime/VM, regardless of the host and
target compile-time settings.

Options so far that fall under this dynamic setting are:
- maximum number of bits that a small int can hold;
- whether caching of lookups is used in the bytecode;
- whether to use unicode strings or not (lexer behaviour differs, and
  therefore generated string constants differ).

ea235204

Dec 18, 2015

py: Add MICROPY_ENABLE_COMPILER and MICROPY_PY_BUILTINS_EVAL_EXEC opts. · dd5353a4

Damien George authored 9 years ago

MICROPY_ENABLE_COMPILER can be used to enable/disable the entire compiler,
which is useful when only loading of pre-compiled bytecode is supported.
It is enabled by default.

MICROPY_PY_BUILTINS_EVAL_EXEC controls support of eval and exec builtin
functions.  By default they are only included if MICROPY_ENABLE_COMPILER
is enabled.

Disabling both options saves about 40k of code size on 32-bit x86.

dd5353a4

Sep 07, 2015
- py/lexer: Properly classify floats that look like hex numbers. · 2b000474
  Damien George authored 9 years ago
  
  Eg 0e0 almost looks like a hex number but in fact is a float.
  2b000474
- py/lexer: Raise SyntaxError when unicode char point out of range. · 0be3c70c
  Damien George authored 9 years ago
  
  0be3c70c
- py/lexer: Raise NotImplError for unicode name escape, instead of assert. · 081f9325
  Damien George authored 9 years ago
  
  081f9325
Jul 23, 2015
- py/lexer: Raise SyntaxError when str hex escape sequence is malformed. · d241c2a5
  Damien George authored 9 years ago
  
  Addresses issue #1390.
  d241c2a5
Jun 22, 2015
- py: Cast argument for printf to int, to be compatible with more ports. · 7f19a39a
  Damien George authored 9 years ago
  
  This allows stmhal to be compiled with MICROPY_DEBUG_PRINTERS.
  7f19a39a
Jun 09, 2015
- py: Support unicode (utf-8 encoded) identifiers in Python source. · 7ed58cb6
  Damien George authored 9 years ago
  
  Enabled simply by making the identifier lexing code 8-bit clean.
  7ed58cb6
May 20, 2015
- extmod: Add ubinascii.unhexlify · 3ad94d60
  Dave Hylands authored 9 years ago
  
  This also pulls out hex_digit from py/lexer.c and makes unichar_hex_digit
  3ad94d60
Mar 19, 2015
- py: Allow to compile with extra warnings (sign-compare, unused-param). · 2e2e404f
  Damien George authored 10 years ago
  
  2e2e404f
Feb 08, 2015

py: Parse big-int/float/imag constants directly in parser. · 7d414a1b

Damien George authored 10 years ago

Previous to this patch, a big-int, float or imag constant was interned
(made into a qstr) and then parsed at runtime to create an object each
time it was needed.  This is wasteful in RAM and not efficient.  Now,
these constants are parsed straight away in the parser and turned into
objects.  This allows constants with large numbers of digits (so
addresses issue #1103) and takes us a step closer to #722.

7d414a1b

Jan 30, 2015

py: Convert CR to LF and CR LF to LF in lexer. · 32bade19

Damien George authored 10 years ago

Only noticeable difference is how newlines are encoded in triple-quoted
strings.  The behaviour now matches CPython3.

32bade19

Jan 28, 2015
- py: Be more precise about unicode type and disabled unicode behaviour. · 16677ce3
  Damien George authored 10 years ago
  
  16677ce3
Jan 16, 2015
- py, unix: Allow to compile with -Wsign-compare. · 963a5a3e
  Damien George authored 10 years ago
  
  See issue #699.
  963a5a3e
Jan 07, 2015

py: Put all global state together in state structures. · b4b10fd3

Damien George authored 10 years ago

This patch consolidates all global variables in py/ core into one place,
in a global structure.  Root pointers are all located together to make
GC tracing easier and more efficient.

b4b10fd3

Jan 01, 2015
- py: Move to guarded includes, everywhere in py/ core. · 51dfcb4b
  Damien George authored 10 years ago
  
  Addresses issue #1022.
  51dfcb4b
Dec 05, 2014
- py: Fix printing of size_t entity; fix qemu-arm for changes to lexer. · 451a0870
  Damien George authored 10 years ago
  
  451a0870