yap-6.3/C/scanner.c

/*************************************************************************
*                   *
*   YAP Prolog                *
*                   *
*  Yap Prolog was developed at NCCUP - Universidade do Porto   *
*                   *
* Copyright L.Damas, V.S.Costa and Universidade do Porto 1985-2003   *
*                   *
**************************************************************************
*                   *
* File:    %W% %G%                     *
* Last rev:  22-1-03               *
* mods:                   *
* comments:  Prolog's scanner           *
*                   *
*************************************************************************/

/*
 * Description:
 *
 * This module produces a list of tokens for use by the parser. The calling
 * program should supply a routine int nextch(charpos) int *charpos; which,
 * when called should produce the next char or -1 if none availlable. The
 * scanner will stop producing tokens when it either finds an end of file
 * (-1) or a token consisting of just '.' followed by a blank or control
 * char. Scanner errors will be signalled by the scanner exiting with a non-
 * zero  ErrorMsg and ErrorPos. Note that, even in this case, the scanner
 * will try to find the end of the term. A function char
 * *AllocScannerMemory(nbytes) should be supplied for allocating (temporary)
 * space for strings and for the table of prolog variables occurring in the
 * term.
 *
 */

/**

@defgroup Formal_Syntax Syntax of Terms
@ingroup Syntax
@{


Prolog tokens are grouped into the following categories:

+ Numbers

Numbers can be further subdivided into integer and floating-point numbers.

  + Integers

Integer numbers
are described by the following regular expression:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

<integer> := {<digit>+<single-quote>|0{xXo}}<alpha_numeric_char>+

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

where {...} stands for optionality, \a + optional repetition (one or
more times), \a \\\<digit\\\> denotes one of the characters 0 ... 9, \a |
denotes or, and \a \\\<single-quote\\\> denotes the character "'". The digits
before the \a \\\<single-quote\\\> character, when present, form the number
basis, that can go from 0, 1 and up to 36. Letters from `A` to
`Z` are used when the basis is larger than 10.

Note that if no basis is specified then base 10 is assumed. Note also
that the last digit of an integer token can not be immediately followed
by one of the characters 'e', 'E', or '.'.

Following the ISO standard, YAP also accepts directives of the
form `0x` to represent numbers in hexadecimal base and of the form
`0o` to represent numbers in octal base. For usefulness,
YAP also accepts directives of the form `0X` to represent
numbers in hexadecimal base.

Example:
the following tokens all denote the same integer

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
10  2'1010  3'101  8'12  16'a  36'a  0xa  0o12
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Numbers of the form `0'a` are used to represent character
constants. So, the following tokens denote the same integer:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
0'd  100
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

YAP (version 6.3.4) supports integers that can fit
the word size of the machine. This is 32 bits in most current machines,
but 64 in some others, such as the Alpha running Linux or Digital
Unix. The scanner will read larger or smaller integers erroneously.

  + Floats

Floating-point numbers are described by:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   <float> := <digit>+{<dot><digit>+}
               <exponent-marker>{<sign>}<digit>+
            |<digit>+<dot><digit>+
               {<exponent-marker>{<sign>}<digit>+}
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

where \a \\\<dot\\\> denotes the decimal-point character '.',
\a \\\<exponent-marker\\\> denotes one of 'e' or 'E', and \a \\\<sign\\\>
denotes
one of '+' or '-'.

Examples:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
10.0   10e3   10e-3   3.1415e+3
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Floating-point numbers are represented as a double in the target
machine. This is usually a 64-bit number.

+ Strings Character Strings

Strings are described by the following rules:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  string --> '"' string_quoted_characters '"'
  string --> '`' string_quoted_characters '`'

  string_quoted_characters --> '"' '"' string_quoted_characters
  string_quoted_characters --> '`' '`' string_quoted_characters
string_quoted_characters --> '\'
                          escape_sequence string_quoted_characters
  string_quoted_characters -->
                          string_character string_quoted_characters

  escape_sequence --> 'a' | 'b' | 'r' | 'f' | 't' | 'n' | 'v'
  escape_sequence --> '\' | '"' | ''' | '`'
  escape_sequence --> at_most_3_octal_digit_seq_char '\'
  escape_sequence --> 'x' at_most_2_hexa_digit_seq_char '\'
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
where `string_character` in any character except the double quote
and escape characters.

Examples:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
""   "a string"   "a double-quote:"""
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The first string is an empty string, the last string shows the use of
double-quoting. The implementation of YAP represents strings as
lists of integers. Since YAP 4.3.0 there is no static limit on string
size.

Escape sequences can be used anf include the non-printable characters
`a` (alert), `b` (backspace), `r` (carriage return),
`f` (form feed), `t` (horizontal tabulation), `n` (new
line), and `v` (vertical tabulation). Escape sequences also be
include the meta-characters `\\`, `"`, `'`, and
```. Last, one can use escape sequences to include the characters
either as an octal or hexadecimal number.

The next examples demonstrates the use of escape sequences in YAP:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
"\x0c\" "\01\" "\f" "\\"
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

The first three examples return a list including only character 12 (form
feed). The last example escapes the escape character.

Escape sequences were not available in C-Prolog and in original
versions of YAP up to 4.2.0. Escape sequences can be disable by using:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
:- yap_flag(character_escapes,false).
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Since 6.3.4 YAP supports compact strings, that are not represented as
lists of codes, but instead as a sequence of UTF-8 encoded characters
in the execution stack. These strings do not require allocating a
symbol, as atoms do, but are much more compact than using lists of
codes.


+ Atoms Atoms

Atoms are defined by one of the following rules:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   atom --> solo-character
   atom --> lower-case-letter name-character*
   atom --> symbol-character+
   atom --> single-quote  single-quote
   atom --> ''' atom_quoted_characters '''

  atom_quoted_characters --> ''' ''' atom_quoted_characters
  atom_quoted_characters --> '\' atom_sequence string_quoted_characters
  atom_quoted_characters --> character string_quoted_characters
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
where:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   <solo-character>     denotes one of:    ! ;
   <symbol-character>   denotes one of:    # & * + - . / : <
                                           = > ? @ \ ^ ~ `
   <lower-case-letter>  denotes one of:    a...z
   <name-character>     denotes one of:    _ a...z A...Z 0....9
   <single-quote>       denotes:           '
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

and `string_character` denotes any character except the double quote
and escape characters. Note that escape sequences in strings and atoms
follow the same rules.

Examples:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
a   a12x   '$a'   !   =>  '1 2'
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Version `4.2.0` of YAP removed the previous limit of 256
characters on an atom. Size of an atom is now only limited by the space
available in the system.

+ Variables Variables

Variables are described by:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
   <variable-starter><variable-character>+
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
where

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  <variable-starter>   denotes one of:    _ A...Z
  <variable-character> denotes one of:    _ a...z A...Z
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

If a variable is referred only once in a term, it needs not to be named
and one can use the character `_` to represent the variable. These
variables are known as anonymous variables. Note that different
occurrences of `_` on the same term represent <em>different</em>
anonymous variables.

+ Punctuation Tokens

Punctuation tokens consist of one of the following characters:

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
( ) , [ ] { } |
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

These characters are used to group terms.

@subsection Layout Layout
Any characters with ASCII code less than or equal to 32 appearing before
a token are ignored.

All the text appearing in a line after the character \a % is taken to
be a comment and ignored (including \a %).  Comments can also be
inserted by using the sequence `/\*` to start the comment and
`\*` followed by `/` to finish it. In the presence of any sequence of comments
or
layout characters, the YAP parser behaves as if it had found a
single blank character. The end of a file also counts as a blank
character for this purpose.

+ Encoding Wide Character Support

YAP now implements a SWI-Prolog compatible interface to wide
characters and the Universal Character Set (UCS). The following text
was adapted from the SWI-Prolog manual.

YAP now  supports wide characters, characters with character
codes above 255 that cannot be represented in a single byte.
<em>Universal Character Set</em> (UCS) is the ISO/IEC 10646 standard
that specifies a unique 31-bits unsigned integer for any character in
any language.  It is a superset of 16-bit Unicode, which in turn is
a superset of ISO 8859-1 (ISO Latin-1), a superset of US-ASCII.  UCS
can handle strings holding characters from multiple languages and
character classification (uppercase, lowercase, digit, etc.) and
operations such as case-conversion are unambiguously defined.

For this reason YAP, following SWI-Prolog, has two representations for
atoms. If the text fits in ISO Latin-1, it is represented as an array
of 8-bit characters.  Otherwise the text is represented as an array of
wide chars, which may take 16 or 32 bits.  This representational issue
is completely transparent to the Prolog user.  Users of the foreign
language interface sometimes need to be aware of these issues though.

Character coding comes into view when characters of strings need to be
read from or written to file or when they have to be communicated to
other software components using the foreign language interface. In this
section we only deal with I/O through streams, which includes file I/O
as well as I/O through network sockets.

  + Stream_Encoding Wide character encodings on streams

Although characters are uniquely coded using the UCS standard
internally, streams and files are byte (8-bit) oriented and there are a
variety of ways to represent the larger UCS codes in an 8-bit octet
stream. The most popular one, especially in the context of the web, is
UTF-8. Bytes 0...127 represent simply the corresponding US-ASCII
character, while bytes 128...255 are used for multi-byte
encoding of characters placed higher in the UCS space. Especially on
MS-Windows the 16-bit Unicode standard, represented by pairs of bytes is
also popular.

Prolog I/O streams have a property called <em>encoding</em> which
specifies the used encoding that influence `get_code/2` and
`put_code/2` as well as all the other text I/O predicates.

The default encoding for files is derived from the Prolog flag
`encoding`, which is initialized from the environment.  If the
environment variable `LANG` ends in "UTF-8", this encoding is
assumed. Otherwise the default is `text` and the translation is
left to the wide-character functions of the C-library (note that the
Prolog native UTF-8 mode is considerably faster than the generic
`mbrtowc()` one).  The encoding can be specified explicitly in
load_files/2 for loading Prolog source with an alternative
encoding, `open/4` when opening files or using `set_stream/2` on
any open stream (not yet implemented). For Prolog source files we also
provide the `encoding/1` directive that can be used to switch
between encodings that are compatible to US-ASCII (`ascii`,
`iso_latin_1`, `utf8` and many locales).


For
additional information and Unicode resources, please visit
<http://www.unicode.org/>.

YAP currently defines and supports the following encodings:

  + octet
Default encoding for <em>binary</em> streams.  This causes
the stream to be read and written fully untranslated.

  + ascii
7-bit encoding in 8-bit bytes.  Equivalent to `iso_latin_1`,
but generates errors and warnings on encountering values above
127.

  + iso_latin_1
8-bit encoding supporting many western languages.  This causes
the stream to be read and written fully untranslated.

  + text
C-library default locale encoding for text files.  Files are read and
written using the C-library functions `mbrtowc()` and
`wcrtomb()`.  This may be the same as one of the other locales,
notably it may be the same as `iso_latin_1` for western
languages and `utf8` in a UTF-8 context.

  + utf8
Multi-byte encoding of full UCS, compatible to `ascii`.
See above.

  + unicode_be
Unicode Big Endian.  Reads input in pairs of bytes, most
significant byte first.  Can only represent 16-bit characters.

  + unicode_le
Unicode Little Endian.  Reads input in pairs of bytes, least
significant byte first.  Can only represent 16-bit characters.


Note that not all encodings can represent all characters. This implies
that writing text to a stream may cause errors because the stream
cannot represent these characters. The behaviour of a stream on these
errors can be controlled using `open/4` or `set_stream/2` (not
implemented). Initially the terminal stream write the characters using
Prolog escape sequences while other streams generate an I/O exception.

    + BOM BOM: Byte Order Mark

From Stream Encoding, you may have got the impression that
text-files are complicated. This section deals with a related topic,
making live often easier for the user, but providing another worry to
the programmer.   *BOM* or <em>Byte Order Marker</em> is a technique
for identifying Unicode text-files as well as the encoding they
use. Such files start with the Unicode character `0xFEFF`, a
non-breaking, zero-width space character. This is a pretty unique
sequence that is not likely to be the start of a non-Unicode file and
uniquely distinguishes the various Unicode file formats. As it is a
zero-width blank, it even doesn't produce any output. This solves all
problems, or ...

Some formats start of as US-ASCII and may contain some encoding mark to
switch to UTF-8, such as the `encoding="UTF-8"` in an XML header.
Such formats often explicitly forbid the the use of a UTF-8 BOM. In
other cases there is additional information telling the encoding making
the use of a BOM redundant or even illegal.

The BOM is handled by the `open/4` predicate. By default, text-files are
probed for the BOM when opened for reading. If a BOM is found, the
encoding is set accordingly and the property `bom(true)` is
available through stream_property/2. When opening a file for
writing, writing a BOM can be requested using the option
`bom(true)` with `open/4`.

                               */

#include "Yap.h"
#include "YapHeap.h"
#include "Yatom.h"
#include "alloc.h"
#include "eval.h"
#include "yapio.h"
/* stuff we want to use in standard YAP code */
#include "YapText.h"
#if _MSC_VER || defined(__MINGW32__)
#if HAVE_FINITE == 1
#undef HAVE_FINITE
#endif
#include <windows.h>
#endif
#include "iopreds.h"
#if HAVE_STRING_H
#include <string.h>
#endif
#if HAVE_WCTYPE_H
#include <wctype.h>
#endif
#if O_LOCALE
#include "locale.h"
#endif

/* You just can't trust some machines */
#define my_isxdigit(C, SU, SL)                                                 \
  (chtype(C) == NU || (C >= 'A' && C <= (SU)) || (C >= 'a' && C <= (SL)))
#define my_isupper(C) (C >= 'A' && C <= 'Z')
#define my_islower(C) (C >= 'a' && C <= 'z')

static Term float_send(char *, int);
static Term get_num(int *, int *, struct stream_desc *, char *, UInt, int);

static void Yap_setCurrentSourceLocation(struct stream_desc *s) {
  CACHE_REGS
#if HAVE_SOCKET
  if (s->status & Socket_Stream_f)
    LOCAL_SourceFileName = AtomSocket;
  else
#endif
      if (s->status & Pipe_Stream_f)
    LOCAL_SourceFileName = AtomPipe;
  else if (s->status & InMemory_Stream_f)
    LOCAL_SourceFileName = AtomCharsio;
  else
    LOCAL_SourceFileName = s->name;
  LOCAL_SourceFileLineno = s->linecount;
}

/* token table with some help from Richard O'Keefe's PD scanner */
char_kind_t Yap_chtype0[NUMBER_OF_CHARS + 1] = {
    EF,
    /* nul soh stx etx eot enq ack bel  bs  ht  nl  vt  np  cr  so  si
           */
    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,

    /* dle dc1 dc2 dc3 dc4 nak syn etb can  em sub esc  fs  gs  rs  us
           */
    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,

    /* sp   !   "   #   $   %   &   '   (   )   *   +   ,   -   .   / */
    BS, SL, DC, SY, SY, CC, SY, QT, BK, BK, SY, SY, BK, SY, SY, SY,

    /* 0   1   2   3   4   5   6   7   8   9   :   ;   <   =   >   ? */
    NU, NU, NU, NU, NU, NU, NU, NU, NU, NU, SY, SL, SY, SY, SY, SY,

    /* @   A   B   C   D   E   F   G   H   I   J   K   L   M   N   O */
    SY, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC,

    /* P   Q   R   S   T   U   V   W   X   Y   Z   [   \   ]   ^   _ */
    UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, BK, SY, BK, SY, UL,

    /* `   a   b   c   d   e   f   g   h   i   j   k   l   m   n   o */
    SY, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC,

    /* p   q   r   s   t   u   v   w   x   y   z   {   |   }   ~ del */
    LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, BK, BK, BK, SY, BS,

    /* 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143
           */
    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,

    /* 144 145    147 148 149 150 151 152 153 154 155 156 157 158 159
           */
    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,

    /*     ¡   ¢   £   ¤   ¥   ¦   §   ¨   ©   ª   «   ¬      ®   ¯ */
    BS, SY, SY, SY, SY, SY, SY, SY, SY, SY, LC, SY, SY, SY, SY, SY,

    /* °   ±   ²   ³   ´   µ   ¶   ·   ¸   ¹   º   »   ¼   ½   ¾   ¿ */
    SY, SY, LC, LC, SY, SY, SY, SY, SY, LC, LC, SY, SY, SY, SY, SY,

    /* À   Á   Â   Ã   Ä   Å   Æ   Ç   È   É   Ê   Ë   Ì   Í   Î   Ï */
    UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC,

/* Ð   Ñ   Ò   Ó   Ô   Õ   Ö   ×   Ø   Ù   Ú   Û   Ü   Ý   Þ   ß    */
#ifdef vms
    UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, LC,
#else
    UC, UC, UC, UC, UC, UC, UC, SY, UC, UC, UC, UC, UC, UC, UC, LC,
#endif
    /* à   á   â   ã   ä   å   æ   ç   è   é   ê   ë   ì   í   î   ï */
    LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC,

/* ð   ñ   ò   ó   ô   õ   ö   ÷   ø   ù   ú   û   ü   cannot write the last
 * three because of lcc    */
#ifdef vms
    LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC
#else
    LC, LC, LC, LC, LC, LC, LC, SY, LC, LC, LC, LC, LC, LC, LC, LC
#endif
};

typedef struct scanner_internals {
  StreamDesc *t;
  TokEntry *ctok;
  char *_ScannerStack; // = (char *)TR;
  char *ScannerExtraBlocks;
  CELL *CommentsTail;
  CELL *Comments;
  CELL *CommentsNextChar;
  wchar_t *CommentsBuff;
  size_t CommentsBuffLim;
} scanner_internals;

// standard get char, uses conversion table
// and converts to wide
#define getchr(inp) inp->stream_wgetc_for_read(inp - GLOBAL_Stream)
// get char for quoted data, eg, quoted atoms and so on
// converts to wide
#define getchrq(inp) inp->stream_wgetc(inp - GLOBAL_Stream)
// get char for UTF-8 quoted data, eg, quoted strings
// reads bytes
#define getchru(inp) inp->stream_getc_utf8(inp - GLOBAL_Stream)

/* in case there is an overflow */
typedef struct scanner_extra_alloc {
  struct scanner_extra_alloc *next;
  void *filler;
} ScannerExtraBlock;

#define CodeSpaceError(t, p, l) CodeSpaceError__(t, p, l PASS_REGS)
static TokEntry *CodeSpaceError__(TokEntry *t, TokEntry *p,
                                  TokEntry *l USES_REGS) {
  LOCAL_Error_TYPE = RESOURCE_ERROR_HEAP;
  LOCAL_ErrorMessage = "Code Space Overflow";
  if (t) {
    t->Tok = eot_tok;
    t->TokInfo = TermOutOfHeapError;
  }
  /* serious error now */
  return l;
}

#define TrailSpaceError(t, l) TrailSpaceError__(t, l PASS_REGS)
static TokEntry *TrailSpaceError__(TokEntry *t, TokEntry *l USES_REGS) {
  LOCAL_ErrorMessage = "Trail Overflow";
  LOCAL_Error_TYPE = RESOURCE_ERROR_TRAIL;
  if (t) {
    t->Tok = eot_tok;
    t->TokInfo = TermOutOfTrailError;
  }
  return l;
}

#define AuxSpaceError(p, l, msg) AuxSpaceError__(p, l, msg PASS_REGS)
static TokEntry *AuxSpaceError__(TokEntry *p, TokEntry *l,
                                 const char *msg USES_REGS) {
  /* huge atom or variable, we are in trouble */
  LOCAL_ErrorMessage = (char *)msg;
  LOCAL_Error_TYPE = RESOURCE_ERROR_AUXILIARY_STACK;
  Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
  if (p) {
    p->Tok = eot_tok;
    p->TokInfo = TermOutOfAuxspaceError;
  }
  /* serious error now */
  return l;
}

static void InitScannerMemory(void) {
  CACHE_REGS
  LOCAL_ErrorMessage = NULL;
  LOCAL_Error_Size = 0;
  LOCAL_ScannerStack = (char *)TR;
  LOCAL_ScannerExtraBlocks = NULL;
}

static char *AllocScannerMemory(unsigned int size) {
  CACHE_REGS
  char *AuxSpScan;

  AuxSpScan = LOCAL_ScannerStack;
  size = AdjustSize(size);
  if (LOCAL_ScannerExtraBlocks) {
    struct scanner_extra_alloc *ptr;

    if (!(ptr = (struct scanner_extra_alloc *)malloc(
              size + sizeof(ScannerExtraBlock)))) {
      return NULL;
    }
    ptr->next = LOCAL_ScannerExtraBlocks;
    LOCAL_ScannerExtraBlocks = ptr;
    return (char *)(ptr + 1);
  } else if (LOCAL_TrailTop <= AuxSpScan + size) {
    UInt alloc_size = sizeof(CELL) * K16;

    if (size > alloc_size)
      alloc_size = size;
    if (!Yap_growtrail(alloc_size, TRUE)) {
      struct scanner_extra_alloc *ptr;

      if (!(ptr = (struct scanner_extra_alloc *)malloc(
                size + sizeof(ScannerExtraBlock)))) {
        return NULL;
      }
      ptr->next = LOCAL_ScannerExtraBlocks;
      LOCAL_ScannerExtraBlocks = ptr;
      return (char *)(ptr + 1);
    }
  }
  LOCAL_ScannerStack = AuxSpScan + size;
  return AuxSpScan;
}

static void PopScannerMemory(char *block, unsigned int size) {
  CACHE_REGS
  if (block == LOCAL_ScannerStack - size) {
    LOCAL_ScannerStack -= size;
  } else if (block == (char *)(LOCAL_ScannerExtraBlocks + 1)) {
    struct scanner_extra_alloc *ptr = LOCAL_ScannerExtraBlocks;

    LOCAL_ScannerExtraBlocks = ptr->next;
    free(ptr);
  }
}

char *Yap_AllocScannerMemory(unsigned int size) {
  /* I assume memory has been initialized */
  return AllocScannerMemory(size);
}

extern double atof(const char *);

static Term float_send(char *s, int sign) {
  Float f = (Float)(sign * atof(s));
#if HAVE_ISFINITE || defined(isfinite)
  if (trueGlobalPrologFlag(ISO_FLAG)) { /* iso */
    if (!isfinite(f)) {
      CACHE_REGS
      LOCAL_ErrorMessage = "Float overflow while scanning";
      return (MkEvalFl(f));
    }
  }
#elif HAVE_FINITE
  if (trueGlobalPrologFlag(ISO_FLAG)) { /* iso */
    if (!finite(f)) {
      LOCAL_ErrorMessage = "Float overflow while scanning";
      return (MkEvalFl(f));
    }
  }
#endif
  {
    CACHE_REGS
    return (MkEvalFl(f));
  }
}

/* we have an overflow at s */
static Term read_int_overflow(const char *s, Int base, Int val, int sign) {
#ifdef USE_GMP
  /* try to scan it as a bignum */
  mpz_t new;
  Term t;

  mpz_init_set_str(new, s, base);
  if (sign < 0)
    mpz_neg(new, new);
  t = Yap_MkBigIntTerm(new);
  mpz_clear(new);
  return t;
#else
  CACHE_REGS
  /* try to scan it as a float */
  return MkIntegerTerm(val);
#endif
}

static int send_error_message(char s[]) {
  CACHE_REGS
  LOCAL_ErrorMessage = s;
  return 0;
}

static wchar_t read_quoted_char(int *scan_nextp,
                                struct stream_desc *inp_stream) {
  int ch;

/* escape sequence */
do_switch:
  ch = getchrq(inp_stream);
  switch (ch) {
  case 10:
    return 0;
  case '\\':
    return '\\';
  case 'a':
    return '\a';
  case 'b':
    return '\b';
  case 'c':
    while (chtype((ch = getchrq(inp_stream))) == BS)
      ;
    {
      if (ch == '\\') {
        goto do_switch;
      }
      return ch;
    }
  case 'd':
    return 127;
  case 'e':
    return '\x1B'; /* <ESC>, a.k.a. \e */
  case 'f':
    return '\f';
  case 'n':
    return '\n';
  case 'r':
    return '\r';
  case 's': /* space */
    return ' ';
  case 't':
    return '\t';
  case 'u': {
    int i;
    wchar_t wc = '\0';

    for (i = 0; i < 4; i++) {
      ch = getchrq(inp_stream);
      if (ch >= '0' && ch <= '9') {
        wc += (ch - '0') << ((3 - i) * 4);
      } else if (ch >= 'a' && ch <= 'f') {
        wc += ((ch - 'a') + 10) << ((3 - i) * 4);
      } else if (ch >= 'A' && ch <= 'F') {
        wc += ((ch - 'A') + 10) << ((3 - i) * 4);
      } else {
        return send_error_message("invalid escape sequence");
      }
    }
    return wc;
  }
  case 'U': {
    int i;
    wchar_t wc = '\0';

    for (i = 0; i < 8; i++) {
      ch = getchrq(inp_stream);
      if (ch >= '0' && ch <= '9') {
        wc += (ch - '0') << ((7 - i) * 4);
      } else if (ch >= 'a' && ch <= 'f') {
        wc += ((ch - 'a') + 10) << ((7 - i) * 4);
      } else if (ch >= 'A' && ch <= 'F') {
        wc += ((ch - 'A') + 10) << ((7 - i) * 4);
      } else {
        return send_error_message("invalid escape sequence");
      }
    }
    return wc;
  }
  case 'v':
    return '\v';
  case 'z': /* Prolog end-of-file */
    return send_error_message("invalid escape sequence \\z");
  case '\'':
    return '\'';
  case '"':
    return '"';
  case '`':
    return '`';
  case '^':
    if (trueGlobalPrologFlag(ISO_FLAG)) {
      return send_error_message("invalid escape sequence");
    } else {
      ch = getchrq(inp_stream);
      if (ch == '?') { /* delete character */
        return 127;
      } else if (ch >= 'a' && ch < 'z') { /* hexa */
        return ch - 'a';
      } else if (ch >= 'A' && ch < 'Z') { /* hexa */
        return ch - 'A';
      } else {
        return '^';
      }
    }
  case '0':
  case '1':
  case '2':
  case '3':
  case '4':
  case '5':
  case '6':
  case '7':
    /* character in octal: maximum of 3 digits, terminates with \ */
    /* follow ISO */
    {
      unsigned char so_far = ch - '0';
      ch = getchrq(inp_stream);
      if (ch >= '0' && ch < '8') { /* octal */
        so_far = so_far * 8 + (ch - '0');
        ch = getchrq(inp_stream);
        if (ch >= '0' && ch < '8') { /* octal */
          so_far = so_far * 8 + (ch - '0');
          ch = getchrq(inp_stream);
          if (ch != '\\') {
            return send_error_message("invalid octal escape sequence");
          }
          return so_far;
        } else if (ch == '\\') {
          return so_far;
        } else {
          return send_error_message("invalid octal escape sequence");
        }
      } else if (ch == '\\') {
        return so_far;
      } else {
        return send_error_message("invalid octal escape sequence");
      }
    }
  case 'x':
    /* hexadecimal character (YAP allows empty hexadecimal  */
    {
      unsigned char so_far = 0;
      ch = getchrq(inp_stream);
      if (my_isxdigit(ch, 'f', 'F')) { /* hexa */
        so_far =
            so_far * 16 + (chtype(ch) == NU
                               ? ch - '0'
                               : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
        ch = getchrq(inp_stream);
        if (my_isxdigit(ch, 'f', 'F')) { /* hexa */
          so_far =
              so_far * 16 + (chtype(ch) == NU
                                 ? ch - '0'
                                 : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
          ch = getchrq(inp_stream);
          if (ch == '\\') {
            return so_far;
          } else {
            return send_error_message("invalid hexadecimal escape sequence");
          }
        } else if (ch == '\\') {
          return so_far;
        } else {
          return send_error_message("invalid hexadecimal escape sequence");
        }
      } else if (ch == '\\') {
        return so_far;
      } else {
        return send_error_message("invalid hexadecimal escape sequence");
      }
    }
  default:
    /* accept sequence. Note that the ISO standard does not
     consider this sequence legal, whereas SICStus would
     eat up the escape sequence. */
    return send_error_message("invalid escape sequence");
  }
}

static int num_send_error_message(char s[]) {
  CACHE_REGS
  LOCAL_ErrorMessage = s;
  return TermNil;
}

/* reads a number, either integer or float */

static Term get_num(int *chp, int *chbuffp, StreamDesc *inp_stream, char *s,
                    UInt max_size, int sign) {
  char *sp = s;
  int ch = *chp;
  Int val = 0L, base = ch - '0';
  int might_be_float = TRUE, has_overflow = FALSE;
  const unsigned char *decimalpoint;

  *sp++ = ch;
  ch = getchr(inp_stream);
  /*
   * because of things like 00'2, 03'2 and even better 12'2, I need to
   * do this (have mercy)
   */
  if (chtype(ch) == NU) {
    *sp++ = ch;
    if (--max_size == 0) {
      return num_send_error_message("Number Too Long");
    }
    base = 10 * base + ch - '0';
    ch = getchr(inp_stream);
  }
  if (ch == '\'') {
    if (base > 36) {
      return num_send_error_message("Admissible bases are 0..36");
    }
    might_be_float = FALSE;
    if (--max_size == 0) {
      return num_send_error_message("Number Too Long");
    }
    *sp++ = ch;
    ch = getchr(inp_stream);
    if (base == 0) {
      CACHE_REGS
      wchar_t ascii = ch;
      int scan_extra = TRUE;

      if (ch == '\\' &&
          Yap_GetModuleEntry(CurrentModule)->flags & M_CHARESCAPE) {
        ascii = read_quoted_char(&scan_extra, inp_stream);
      }
      /* a quick way to represent ASCII */
      if (scan_extra)
        *chp = getchr(inp_stream);
      if (sign == -1) {
        return MkIntegerTerm(-ascii);
      }
      return MkIntegerTerm(ascii);
    } else if (base >= 10 && base <= 36) {
      int upper_case = 'A' - 11 + base;
      int lower_case = 'a' - 11 + base;

      while (my_isxdigit(ch, upper_case, lower_case)) {
        Int oval = val;
        int chval =
            (chtype(ch) == NU ? ch - '0'
                              : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
        if (--max_size == 0) {
          return num_send_error_message("Number Too Long");
        }
        *sp++ = ch;
        val = oval * base + chval;
        if (oval != (val - chval) / base) /* overflow */
          has_overflow = (has_overflow || TRUE);
        ch = getchr(inp_stream);
      }
    }
  } else if (ch == 'x' && base == 0) {
    might_be_float = FALSE;
    if (--max_size == 0) {
      return num_send_error_message("Number Too Long");
    }
    *sp++ = ch;
    ch = getchr(inp_stream);
    while (my_isxdigit(ch, 'F', 'f')) {
      Int oval = val;
      int chval =
          (chtype(ch) == NU ? ch - '0'
                            : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
      if (--max_size == 0) {
        return num_send_error_message("Number Too Long");
      }
      *sp++ = ch;
      val = val * 16 + chval;
      if (oval != (val - chval) / 16) /* overflow */
        has_overflow = TRUE;
      ch = getchr(inp_stream);
    }
    *chp = ch;
  } else if (ch == 'o' && base == 0) {
    might_be_float = FALSE;
    base = 8;
    ch = getchr(inp_stream);
  } else if (ch == 'b' && base == 0) {
    might_be_float = FALSE;
    base = 2;
    ch = getchr(inp_stream);
  } else {
    val = base;
    base = 10;
  }
  while (chtype(ch) == NU) {
    Int oval = val;
    if (!(val == 0 && ch == '0') || has_overflow) {
      if (--max_size == 0) {
        return num_send_error_message("Number Too Long");
      }
      *sp++ = ch;
    }
    if (ch - '0' >= base) {
      CACHE_REGS
      if (sign == -1)
        return MkIntegerTerm(-val);
      return MkIntegerTerm(val);
    }
    val = val * base + ch - '0';
    if (val / base != oval || val - oval * base != ch - '0') /* overflow */
      has_overflow = TRUE;
    ch = getchr(inp_stream);
  }
  if (might_be_float && (ch == '.' || ch == 'e' || ch == 'E')) {
    int has_dot = (ch == '.');
    if (has_dot) {
      unsigned char *dp;
      int dc;

      if (chtype(ch = getchr(inp_stream)) != NU) {
        if (ch == 'e' || ch == 'E') {
          if (trueGlobalPrologFlag(ISO_FLAG))
            return num_send_error_message(
                "Float format not allowed in ISO mode");
        } else { /* followed by a letter, end of term? */
          CACHE_REGS
          sp[0] = '\0';
          *chbuffp = '.';
          *chp = ch;
          if (has_overflow)
            return read_int_overflow(s, base, val, sign);
          if (sign == -1)
            return MkIntegerTerm(-val);
          return MkIntegerTerm(val);
        }
      }
#if O_LOCALE
      if ((decimalpoint = (unsigned char *)(localeconv()->decimal_point)) ==
          NULL)
#endif
        decimalpoint = (const unsigned char *)".";
      dp = (unsigned char *)decimalpoint;
      /* translate . to current locale */
      while ((dc = *dp++) != '\0') {
        *sp++ = dc;
        if (--max_size == 0) {
          return num_send_error_message("Number Too Long");
        }
      }
      /* numbers after . */
      if (chtype(ch) == NU) {
        do {
          if (--max_size == 0) {
            return num_send_error_message("Number Too Long");
          }
          *sp++ = ch;
        } while (chtype(ch = getchr(inp_stream)) == NU);
      }
    }
    if (ch == 'e' || ch == 'E') {
      if (--max_size == 0) {
        return num_send_error_message("Number Too Long");
      }
      *sp++ = ch;
      ch = getchr(inp_stream);
      if (ch == '-') {
        if (--max_size == 0) {
          return num_send_error_message("Number Too Long");
        }
        *sp++ = '-';
        ch = getchr(inp_stream);
      } else if (ch == '+') {
        ch = getchr(inp_stream);
      }
      if (chtype(ch) != NU) {
        CACHE_REGS
        if (has_dot)
          return float_send(s, sign);
        return MkIntegerTerm(sign * val);
      }
      do {
        if (--max_size == 0) {
          return num_send_error_message("Number Too Long");
        }
        *sp++ = ch;
      } while (chtype(ch = getchr(inp_stream)) == NU);
    }
    *sp = '\0';
    *chp = ch;
    return float_send(s, sign);
  } else if (has_overflow) {
    *sp = '\0';
    /* skip base */
    *chp = ch;
    if (s[0] == '0' && s[1] == 'x')
      return read_int_overflow(s + 2, 16, val, sign);
    else if (s[0] == '0' && s[1] == 'o')
      return read_int_overflow(s + 2, 8, val, sign);
    else if (s[0] == '0' && s[1] == 'b')
      return read_int_overflow(s + 2, 2, val, sign);
    if (s[1] == '\'')
      return read_int_overflow(s + 2, base, val, sign);
    if (s[2] == '\'')
      return read_int_overflow(s + 3, base, val, sign);
    return read_int_overflow(s, base, val, sign);
  } else {
    CACHE_REGS
    *chp = ch;
    return MkIntegerTerm(val * sign);
  }
}

/* given a function getchr scan until we  either find the number
   or end of file */
Term Yap_scan_num(StreamDesc *inp) {
  CACHE_REGS
  Term out;
  int sign = 1;
  int ch, cherr;
  char *ptr, *mp;
  int kind;
  void *old_tr = TR;

  InitScannerMemory();
  LOCAL_VarTable = LOCAL_AnonVarTable = NULL;
  if (!(ptr = AllocScannerMemory(4096))) {
    LOCAL_ErrorMessage = "Trail Overflow";
    LOCAL_Error_TYPE = RESOURCE_ERROR_TRAIL;
    return 0;
  }
  TokEntry *tokptr = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
  tokptr->TokPos = GetCurInpPos(inp);
  while ((ch = getchr(inp)) == BS)
    ;
  if (ch == '-') {
    sign = -1;
    ch = getchr(inp);
  } else if (ch == '+') {
    ch = getchr(inp);
  }
  if (chtype(ch) == NU) {
    cherr = '\0';
    if (ASP - HR < 1024) {
      Yap_clean_tokenizer(old_tr, NULL, NULL);
      LOCAL_ErrorMessage = "Stack Overflow";
      LOCAL_Error_TYPE = RESOURCE_ERROR_STACK;
      return 0;
    }
    out = get_num(&ch, &cherr, inp, ptr, 4096, sign); /*  */
  }
  if (LOCAL_ErrorMessage != NULL || ch != -1 || cherr) {
    CACHE_REGS
    char *s = ptr;
    int sign = 1;

    out = 0;
    if (s[0] == '+') {
      s++;
    }
    if (s[0] == '-') {
      s++;
      sign = -1;
    }
    if (strcmp(s, "inf") == 0) {
      if (sign > 0) {
        out = MkFloatTerm(INFINITY);
      } else {
        out = MkFloatTerm(-INFINITY);
      }
    }
    if (strcmp(s, "nan") == 0) {
      if (sign > 0) {
        out = MkFloatTerm(NAN);
      } else {
        out = MkFloatTerm(-NAN);
      }
    }
    if (out == 0) {
      TokEntry *e, *ef;
      size_t len = strlen(ptr);
      mp = AllocScannerMemory(len + 1);
      tokptr->Tok = Ord(kind = String_tok);
      tokptr->TokInfo = Unsigned(mp);
      e = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
      ef = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
      tokptr->TokNext = e;
      e->Tok = Error_tok;
      if (!LOCAL_ErrorMessage) {
        LOCAL_ErrorMessage =
            "syntax error while converting from a string to a number";
      }
      e->TokInfo = MkAtomTerm(Yap_LookupAtom(LOCAL_ErrorMessage));
      e->TokPos = GetCurInpPos(inp);
      e->TokNext = ef;
      ef->Tok = Ord(kind = eot_tok);
      ef->TokInfo = TermSyntaxError;
      ef->TokPos = GetCurInpPos(inp);
      ef->TokNext = NULL;
      LOCAL_tokptr = tokptr;
      LOCAL_toktide = e;
      LOCAL_ErrorMessage = NULL;
      LOCAL_Error_Term = Yap_syntax_error(e, inp - GLOBAL_Stream);
      LOCAL_Error_TYPE = SYNTAX_ERROR;
    }
  }
  PopScannerMemory(ptr, 4096);
  Yap_clean_tokenizer(old_tr, NULL, NULL);
  return out;
}

#define CHECK_SPACE()                                                          \
  if (ASP - HR < 1024) {                                                       \
    LOCAL_ErrorMessage = "Stack Overflow";                                     \
    LOCAL_Error_TYPE = RESOURCE_ERROR_STACK;                                   \
    LOCAL_Error_Size = 0L;                                                     \
    if (p) {                                                                   \
      p->Tok = Ord(kind = eot_tok);                                            \
      p->TokInfo = TermOutOfStackError;                                        \
    }                                                                          \
    /* serious error now */                                                    \
    return l;                                                                  \
  }

const char *Yap_tokRep(TokEntry *tokptr, encoding_t encoding) {
  CACHE_REGS
  Term info = tokptr->TokInfo;
  char *buf = LOCAL_FileNameBuf2;
  size_t length;
  UInt flags = 0;

  switch (tokptr->Tok) {
  case Name_tok:
    if (IsWideAtom((Atom)info)) {
      wchar_t *wc = RepAtom((Atom)info)->WStrOfAE;
      Term s = Yap_WCharsToString(wc PASS_REGS);
      return StringOfTerm(s);
    }
    return RepAtom((Atom)info)->StrOfAE;
  case Number_tok:
    return Yap_TermToString(info, &length, encoding, flags);
  case Var_tok: {
    VarEntry *varinfo = (VarEntry *)info;
    varinfo->VarAdr = TermNil;
    return varinfo->VarRep;
  }
  case String_tok:
  case BQString_tok:
    return (char *)info;
  case WString_tok:
  case WBQString_tok: {
    wchar_t *op = (wchar_t *)info;
    wchar_t c;
    unsigned char *bp = (unsigned char *)buf;
    while ((c = *op++)) {
      bp += put_utf8(bp, c);
    }
    bp[0] = '\0';
    return buf;
  }
  case Error_tok:
    return "<ERR>";
  case eot_tok:
    return "<EOT>";
  case Ponctuation_tok: {
    buf[1] = '\0';
    if ((info) == 'l') {
      buf[0] = '(';
    } else {
      buf[0] = (char)info;
    }
  }
    return buf;
  case QuasiQuotes_tok:
  case WQuasiQuotes_tok:
    return "<QQ>";
  }
}

static void open_comment(int ch, StreamDesc *inp_stream USES_REGS) {
  CELL *h0 = HR;
  HR += 5;
  h0[0] = AbsAppl(h0 + 2);
  h0[1] = TermNil;
  if (!LOCAL_CommentsTail) {
    /* first comment */
    LOCAL_Comments = AbsPair(h0);
  } else {
    /* extra comment */
    *LOCAL_CommentsTail = AbsPair(h0);
  }
  LOCAL_CommentsTail = h0 + 1;
  h0 += 2;
  h0[0] = (CELL)FunctorMinus;
  h0[1] = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
  h0[2] = TermNil;
  LOCAL_CommentsNextChar = h0 + 2;
  LOCAL_CommentsBuff = (wchar_t *)malloc(1024 * sizeof(wchar_t));
  LOCAL_CommentsBuffLim = 1024;
  LOCAL_CommentsBuff[0] = ch;
  LOCAL_CommentsBuffPos = 1;
}

static void extend_comment(int ch USES_REGS) {
  LOCAL_CommentsBuff[LOCAL_CommentsBuffPos] = ch;
  LOCAL_CommentsBuffPos++;
  if (LOCAL_CommentsBuffPos == LOCAL_CommentsBuffLim - 1) {
    LOCAL_CommentsBuff = (wchar_t *)realloc(
        LOCAL_CommentsBuff, sizeof(wchar_t) * (LOCAL_CommentsBuffLim + 4096));
    LOCAL_CommentsBuffLim += 4096;
  }
}

static void close_comment(USES_REGS1) {
  LOCAL_CommentsBuff[LOCAL_CommentsBuffPos] = '\0';
  *LOCAL_CommentsNextChar = Yap_WCharsToString(LOCAL_CommentsBuff PASS_REGS);
  free(LOCAL_CommentsBuff);
  LOCAL_CommentsBuff = NULL;
  LOCAL_CommentsBuffLim = 0;
}

// mark that we reached EOF,
// next  token will be end_of_file)
static void mark_eof(struct stream_desc *inp_stream) {
  inp_stream->status |= Push_Eof_Stream_f;
}

static wchar_t *ch_to_wide(char *base, char *charp) {
  CACHE_REGS
  int n = charp - base, i;
  wchar_t *nb = (wchar_t *)base;

  if ((nb + n) + 1024 > (wchar_t *)AuxSp) {
    LOCAL_Error_TYPE = RESOURCE_ERROR_AUXILIARY_STACK;
    LOCAL_ErrorMessage =
        "Heap Overflow While Scanning: please increase code space (-h)";
    return NULL;
  }
  for (i = n; i > 0; i--) {
    nb[i - 1] = (unsigned char)base[i - 1];
  }
  return nb + n;
}

#define add_ch_to_buff(ch)                                                     \
  if (wcharp) {                                                                \
    *wcharp++ = (ch);                                                          \
    if (wcharp >= (wchar_t *)AuxSp - 1024)                                     \
      goto huge_var_error;                                                     \
    charp = (char *)wcharp;                                                    \
  } else {                                                                     \
    if (ch > MAX_ISO_LATIN1 && !wcharp) {                                      \
      /* does not fit in ISO-LATIN */                                          \
      wcharp = ch_to_wide(TokImage, charp);                                    \
      if (!wcharp)                                                             \
        goto huge_var_error;                                                   \
      *wcharp++ = (ch);                                                        \
      charp = (char *)wcharp;                                                  \
    } else {                                                                   \
      if (charp >= (char *)AuxSp - 1024)                                       \
        goto huge_var_error;                                                   \
      *charp++ = ch;                                                           \
    }                                                                          \
  }

#define add_ch_to_utf8_buff(ch)                                                \
  {                                                                            \
    if ((ch & 0xff) == ch) {                                                   \
      *charp++ = ch;                                                           \
    } else {                                                                   \
      charp = _PL__put_utf8(charp, chr);                                       \
    }                                                                          \
  }

TokEntry *Yap_tokenizer(struct stream_desc *inp_stream, bool store_comments,
                        Term *tposp) {

  CACHE_REGS
  TokEntry *t, *l, *p;
  enum TokenKinds kind;
  int solo_flag = TRUE;
  int ch;
  wchar_t *wcharp;
  struct qq_struct_t *cur_qq = NULL;

  InitScannerMemory();
  LOCAL_VarTable = NULL;
  LOCAL_AnonVarTable = NULL;
  l = NULL;
  p = NULL; /* Just to make lint happy */
  ch = getchr(inp_stream);
  while (chtype(ch) == BS) {
    ch = getchr(inp_stream);
  }
  *tposp = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
  Yap_setCurrentSourceLocation(inp_stream);
  LOCAL_StartLineCount = inp_stream->linecount;
  LOCAL_StartLinePos = inp_stream->linepos;
  do {
    wchar_t och, pch;
    int quote, isvar;
    char *charp, *mp;
    unsigned int len;
    char *TokImage = NULL;

    t = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
    t->TokNext = NULL;
    if (t == NULL) {
      return TrailSpaceError(p, l);
    }
    if (!l)
      l = t;
    else
      p->TokNext = t;
    p = t;
  restart:
    while (chtype(ch) == BS) {
      ch = getchr(inp_stream);
    }
    t->TokPos = GetCurInpPos(inp_stream);

    switch (chtype(ch)) {

    case CC:
      if (store_comments) {
        CHECK_SPACE();
        open_comment(ch, inp_stream PASS_REGS);
      continue_comment:
        while ((ch = getchr(inp_stream)) != 10 && chtype(ch) != EF) {
          CHECK_SPACE();
          extend_comment(ch PASS_REGS);
        }
        CHECK_SPACE();
        extend_comment(ch PASS_REGS);
        if (chtype(ch) != EF) {
          ch = getchr(inp_stream);
          if (chtype(ch) == CC) {
            extend_comment(ch PASS_REGS);
            goto continue_comment;
          }
        }
        close_comment(PASS_REGS1);
      } else {
        while ((ch = getchr(inp_stream)) != 10 && chtype(ch) != EF)
          ;
      }
      if (chtype(ch) != EF) {
        /* blank space */
        if (t == l) {
          /* we found a comment before reading characters */
          while (chtype(ch) == BS) {
            ch = getchr(inp_stream);
          }
          CHECK_SPACE();
          *tposp = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
          Yap_setCurrentSourceLocation(inp_stream);
        }
        goto restart;
      } else {
        t->Tok = Ord(kind = eot_tok);
        mark_eof(inp_stream);
        t->TokInfo = TermEof;
      }
      break;

    case UC:
    case UL:
    case LC:
      och = ch;
      ch = getchr(inp_stream);
    scan_name:
      TokImage = (char *)((AtomEntry *)(Yap_PreAllocCodeSpace()))->StrOfAE;
      charp = TokImage;
      wcharp = NULL;
      isvar = (chtype(och) != LC);
      add_ch_to_buff(och);
      for (; chtype(ch) <= NU; ch = getchr(inp_stream)) {
        if (charp == (char *)AuxSp - 1024) {
        huge_var_error:
          return AuxSpaceError(p, l, "Code Space Overflow due to huge atom");
          /* huge atom or variable, we are in trouble */
        }
        add_ch_to_buff(ch);
      }
      while (ch == '\'' && isvar &&
             trueGlobalPrologFlag(VARIABLE_NAMES_MAY_END_WITH_QUOTES_FLAG)) {
        if (charp == (char *)AuxSp - 1024) {
          goto huge_var_error;
        }
        add_ch_to_buff(ch);
        ch = getchr(inp_stream);
      }
      add_ch_to_buff('\0');
      if (!isvar) {
        Atom ae;
        /* don't do this in iso */
        if (wcharp) {
          ae = Yap_LookupWideAtom((wchar_t *)TokImage);
        } else {
          ae = Yap_LookupAtom(TokImage);
        }
        if (ae == NIL) {
          return CodeSpaceError(t, p, l);
        }
        t->TokInfo = Unsigned(ae);
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        if (ch == '(')
          solo_flag = FALSE;
        t->Tok = Ord(kind = Name_tok);
      } else {
        VarEntry *ve = Yap_LookupVar(TokImage);
        t->TokInfo = Unsigned(ve);
        if (cur_qq) {
          ve->refs++;
        }
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        t->Tok = Ord(kind = Var_tok);
      }
      break;

    case NU: {
      int cherr;
      int cha = ch;
      char *ptr;

      cherr = 0;
      if (!(ptr = AllocScannerMemory(4096))) {
        return TrailSpaceError(t, l);
      }
      CHECK_SPACE();
      if ((t->TokInfo = get_num(&cha, &cherr, inp_stream, ptr, 4096, 1)) ==
          0L) {
        if (p) {
          p->Tok = eot_tok;
          t->TokInfo = TermError;
        }
        /* serious error now */
        return l;
      }
      PopScannerMemory(ptr, 4096);
      ch = cha;
      if (cherr) {
        TokEntry *e;
        t->Tok = Number_tok;
        t->TokPos = GetCurInpPos(inp_stream);
        e = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
        if (e == NULL) {
          return TrailSpaceError(p, l);

        } else {
          e->TokNext = NULL;
        }
        t->TokNext = e;
        t = e;
        p = e;
        switch (cherr) {
        case 'e':
        case 'E':
          och = cherr;
          goto scan_name;
          break;
        case '=':
        case '_':
          /* handle error while parsing a float */
          {
            TokEntry *e2;

            t->Tok = Ord(Var_tok);
            t->TokInfo = Unsigned(Yap_LookupVar("E"));
            t->TokPos = GetCurInpPos(inp_stream);
            e2 = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
            if (e2 == NULL) {
              return TrailSpaceError(p, l);
            } else {
              e2->TokNext = NULL;
            }
            t->TokNext = e2;
            t = e2;
            p = e2;
            if (cherr == '=')
              och = '+';
            else
              och = '-';
          }
          goto enter_symbol;
        case '+':
        case '-':
          /* handle error while parsing a float */
          {
            TokEntry *e2;

            t->Tok = Name_tok;
            if (ch == '(')
              solo_flag = FALSE;
            t->TokInfo = Unsigned(AtomE);
            t->TokPos = GetCurInpPos(inp_stream);
            e2 = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
            if (e2 == NULL) {
              return TrailSpaceError(p, l);
            } else {
              e2->TokNext = NULL;
            }
            t->TokNext = e2;
            t = e2;
            p = e2;
          }
        default:
          och = cherr;
          goto enter_symbol;
        }
      } else {
        t->Tok = Ord(kind = Number_tok);
      }
    } break;

    case QT:
    case DC:
    quoted_string:
      TokImage = (char *)((AtomEntry *)(Yap_PreAllocCodeSpace()))->StrOfAE;
      charp = TokImage;
      quote = ch;
      len = 0;
      ch = getchrq(inp_stream);
      wcharp = NULL;

      while (TRUE) {
        if (charp + 1024 > (char *)AuxSp) {
          LOCAL_Error_TYPE = RESOURCE_ERROR_AUXILIARY_STACK;
          LOCAL_ErrorMessage =
              "Heap Overflow While Scanning: please increase code space (-h)";
          break;
        }
        if (ch == 10 && trueGlobalPrologFlag(ISO_FLAG)) {
          /* in ISO a new line terminates a string */
          LOCAL_ErrorMessage = "layout character \n inside quotes";
          break;
        }
        if (ch == quote) {
          ch = getchrq(inp_stream);
          if (ch != quote)
            break;
          add_ch_to_buff(ch);
          ch = getchrq(inp_stream);
        } else if (ch == '\\' &&
                   Yap_GetModuleEntry(CurrentModule)->flags & M_CHARESCAPE) {
          int scan_next = TRUE;
          if ((ch = read_quoted_char(&scan_next, inp_stream))) {
            add_ch_to_buff(ch);
          }
          if (scan_next) {
            ch = getchrq(inp_stream);
          }
        } else if (chtype(ch) == EF && ch <= MAX_ISO_LATIN1) {
          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
          mark_eof(inp_stream);
          t->Tok = Ord(kind = eot_tok);
          t->TokInfo = TermEof;
          break;
        } else {
          add_ch_to_buff(ch);
          ch = getchrq(inp_stream);
        }
        ++len;
        if (charp > (char *)AuxSp - 1024) {
          /* Not enough space to read in the string. */
          return AuxSpaceError(
              t, l, "not enough space to read in string or quoted atom");
        }
      }
      if (wcharp) {
        *wcharp = '\0';
      } else {
        *charp = '\0';
      }
      if (quote == '"' || quote == '`') {
        if (wcharp) {
          mp = AllocScannerMemory(sizeof(wchar_t) * (len + 1));
        } else {
          mp = AllocScannerMemory(len + 1);
        }
        if (mp == NULL) {
          LOCAL_ErrorMessage =
              "not enough heap space to read in string or quoted atom";
          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
          t->Tok = Ord(kind = eot_tok);
          t->TokInfo = TermOutOfHeapError;
          return l;
        }
        if (wcharp) {
          wcscpy((wchar_t *)mp, (wchar_t *)TokImage);
        } else {
          strcpy(mp, TokImage);
        }
        t->TokInfo = Unsigned(mp);
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        if (quote == '"') {
          if (wcharp) {
            t->Tok = Ord(kind = WString_tok);
          } else {
            t->Tok = Ord(kind = String_tok);
          }
        } else {
          if (wcharp) {
            t->Tok = Ord(kind = WBQString_tok);
          } else {
            t->Tok = Ord(kind = BQString_tok);
          }
        }
      } else {
        if (wcharp) {
          t->TokInfo = Unsigned(Yap_LookupWideAtom((wchar_t *)TokImage));
        } else {
          t->TokInfo = Unsigned(Yap_LookupAtom(TokImage));
        }
        if (!(t->TokInfo)) {
          return CodeSpaceError(t, p, l);
        }
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        t->Tok = Ord(kind = Name_tok);
        if (ch == '(')
          solo_flag = false;
      }
      break;

    case BS:
      if (ch == '\0') {
        t->Tok = Ord(kind = eot_tok);
        if (chtype(pch) == EF) {
          mark_eof(inp_stream);
          t->TokInfo = TermEof;
        } else {
          t->TokInfo = TermNewLine;
        }
        t->TokInfo = TermEof;
        return l;
      } else
        ch = getchr(inp_stream);
      break;
    case SY:
      if (ch == '.' && (pch = Yap_peek(inp_stream - GLOBAL_Stream)) &&
          (chtype(pch) == BS || chtype(pch) == EF || pch == '%')) {
        t->Tok = Ord(kind = eot_tok);
        // consume...
        if (pch == '%') {
	  t->TokInfo = TermNewLine;
          return l;
	}
        ch = getchr(inp_stream);
        if (chtype(ch) == EF) {
          mark_eof(inp_stream);
          t->TokInfo = TermEof;
        } else {
          t->TokInfo = TermNewLine;
        }
        return l;
      }
      if (ch == '`')
        goto quoted_string;
      och = ch;
      ch = getchr(inp_stream);
      if (och == '.') {
        if (chtype(ch) == BS || chtype(ch) == EF || ch == '%') {
          t->Tok = Ord(kind = eot_tok);
          if (ch == '%') {
            t->TokInfo = TermNewLine;
            return l;
	  }
          if (chtype(ch) == EF) {
            mark_eof(inp_stream);
	    t->TokInfo = TermEof;           
	  } else {
            t->TokInfo = TermNewLine;
          }
          return l;
        }
      }
      if (och == '/' && ch == '*') {
        if (store_comments) {
          CHECK_SPACE();
          open_comment('/', inp_stream PASS_REGS);
          while ((och != '*' || ch != '/') && chtype(ch) != EF) {
            och = ch;
            CHECK_SPACE();
            extend_comment(ch PASS_REGS);
            ch = getchr(inp_stream);
          }
          if (chtype(ch) != EF) {
            CHECK_SPACE();
            extend_comment(ch PASS_REGS);
          }
          close_comment(PASS_REGS1);
        } else {
          while ((och != '*' || ch != '/') && chtype(ch) != EF) {
            och = ch;
            ch = getchr(inp_stream);
          }
        }
        if (chtype(ch) == EF) {
          t->Tok = Ord(kind = eot_tok);
          t->TokInfo = TermEof;
          break;
        } else {
          /* leave comments */
          ch = getchr(inp_stream);
          if (t == l) {
            /* we found a comment before reading characters */
            while (chtype(ch) == BS) {
              ch = getchr(inp_stream);
            }
            CHECK_SPACE();
            *tposp = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
            Yap_setCurrentSourceLocation(inp_stream);
          }
        }
        goto restart;
      }
    enter_symbol:
      if (och == '.' && (chtype(ch) == BS || chtype(ch) == EF || ch == '%')) {
        t->Tok = Ord(kind = eot_tok);
	if (ch == '%') {
          t->TokInfo = TermNewLine;
          return l;
        }
	if (chtype(ch) == EF) {
          mark_eof(inp_stream);
          t->TokInfo = TermEof;
        } else {
          t->TokInfo = TermNl;
        }
        return l;
      } else {
        Atom ae;

        TokImage = (char *)((AtomEntry *)(Yap_PreAllocCodeSpace()))->StrOfAE;
        charp = TokImage;
        wcharp = NULL;
        add_ch_to_buff(och);
        for (; chtype(ch) == SY; ch = getchr(inp_stream)) {
          if (charp == (char *)AuxSp - 1024) {
            goto huge_var_error;
          }
          add_ch_to_buff(ch);
        }
        add_ch_to_buff('\0');
        if (wcharp) {
          ae = Yap_LookupWideAtom((wchar_t *)TokImage);
        } else {
          ae = Yap_LookupAtom(TokImage);
        }
        if (ae == NIL) {
          return CodeSpaceError(t, p, l);
        }
        t->TokInfo = Unsigned(ae);
        if (t->TokInfo == (CELL)NIL) {
          return CodeSpaceError(t, p, l);
        }
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        t->Tok = Ord(kind = Name_tok);
        if (ch == '(')
          solo_flag = false;
        else
          solo_flag = true;
      }
      break;

    case SL: {
      char chs[2];
      chs[0] = ch;
      chs[1] = '\0';
      ch = getchr(inp_stream);
      t->TokInfo = Unsigned(Yap_LookupAtom(chs));
      t->Tok = Ord(kind = Name_tok);
      if (ch == '(')
        solo_flag = FALSE;
    } break;

    case BK:
      och = ch;
      ch = getchr(inp_stream);
      t->TokInfo = och;
      if (och == '(') {
        while (chtype(ch) == BS) {
          ch = getchr(inp_stream);
        };
        if (ch == ')') {
          t->TokInfo = Unsigned(AtomEmptyBrackets);
          t->Tok = Ord(kind = Name_tok);
          ch = getchr(inp_stream);
          solo_flag = FALSE;
          break;
        } else if (!solo_flag) {
          t->TokInfo = 'l';
          solo_flag = TRUE;
        }
      } else if (och == '[') {
        while (chtype(ch) == BS) {
          ch = getchr(inp_stream);
        };
        if (ch == ']') {
          t->TokInfo = Unsigned(AtomNil);
          t->Tok = Ord(kind = Name_tok);
          ch = getchr(inp_stream);
          solo_flag = FALSE;
          break;
        }
      } else if (och == '{') {
        if (ch == '|') {
          qq_t *qq = (qq_t *)calloc(sizeof(qq_t), 1);
          if (!qq) {
            LOCAL_ErrorMessage = "not enough heap space to read in quasi quote";
            Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
            t->Tok = Ord(kind = eot_tok);
            t->TokInfo = TermOutOfHeapError;
            return l;
          }
          if (cur_qq) {
            LOCAL_ErrorMessage = "quasi quote in quasi quote";
            Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
            t->Tok = Ord(kind = eot_tok);
            t->TokInfo = TermOutOfHeapError;
            free(qq);
            return l;
          } else {
            cur_qq = qq;
          }
          t->TokInfo = (CELL)qq;
          if (inp_stream->status & Seekable_Stream_f) {
            qq->start.byteno = fseek(inp_stream->file, 0, 0);
          } else {
            qq->start.byteno = inp_stream->charcount - 1;
          }
          qq->start.lineno = inp_stream->linecount;
          qq->start.linepos = inp_stream->linepos - 1;
          qq->start.charno = inp_stream->charcount - 1;
          t->Tok = Ord(kind = QuasiQuotes_tok);
          ch = getchr(inp_stream);
          solo_flag = FALSE;
          break;
        }
        while (chtype(ch) == BS) {
          ch = getchr(inp_stream);
        };
        if (ch == '}') {
          t->TokInfo = Unsigned(AtomBraces);
          t->Tok = Ord(kind = Name_tok);
          ch = getchr(inp_stream);
          solo_flag = FALSE;
          break;
        }
      } else if (och == '|' && ch == '|') {
        qq_t *qq = cur_qq;
        if (!qq) {
          LOCAL_ErrorMessage = "quasi quoted's || without {|";
          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
          free(cur_qq);
          cur_qq = NULL;
          t->Tok = Ord(kind = eot_tok);
          t->TokInfo = TermError;
          return l;
        }
        cur_qq = NULL;
        t->TokInfo = (CELL)qq;
        if (inp_stream->status & Seekable_Stream_f) {
          qq->mid.byteno = fseek(inp_stream->file, 0, 0);
        } else {
          qq->mid.byteno = inp_stream->charcount - 1;
        }
        qq->mid.lineno = inp_stream->linecount;
        qq->mid.linepos = inp_stream->linepos - 1;
        qq->mid.charno = inp_stream->charcount - 1;
        t->Tok = Ord(kind = QuasiQuotes_tok);
        ch = getchr(inp_stream);

        TokImage = Yap_PreAllocCodeSpace();
        if (!TokImage) {
          LOCAL_ErrorMessage =
              "not enough heap space to read in a quasi quoted atom";
          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
          t->Tok = Ord(kind = eot_tok);
          t->TokInfo = TermError;
          return l;
        }
        charp = TokImage;
        quote = ch;
        len = 0;
        ch = getchrq(inp_stream);
        wcharp = NULL;

        while (TRUE) {
          if (ch == '|') {
            ch = getchrq(inp_stream);
            if (ch != '}') {
            } else {
              charp = (char *)put_utf8((unsigned char *)charp, och);
              charp = (char *)put_utf8((unsigned char *)charp, ch);
              /* we're done */
              break;
            }
          } else if (chtype(ch) == EF) {
            Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
            mark_eof(inp_stream);
            t->Tok = Ord(kind = eot_tok);
            t->TokInfo = TermOutOfHeapError;
            break;
          } else {
            charp = (char *)put_utf8((unsigned char *)charp, ch);
            ch = getchrq(inp_stream);
          }
          if (charp > (char *)AuxSp - 1024) {
            /* Not enough space to read in the string. */
            return AuxSpaceError(
                t, l, "not enough space to read in string or quoted atom");
          }
        }
        len = charp - TokImage;
        mp = malloc(len + 1);
        if (mp == NULL) {
          LOCAL_ErrorMessage = "not enough heap space to read in quasi quote";
          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
          t->Tok = Ord(kind = eot_tok);
          t->TokInfo = TermOutOfHeapError;
          return l;
        }
        strncpy(mp, TokImage, len + 1);
        qq->text = (unsigned char *)mp;
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        if (inp_stream->status & Seekable_Stream_f) {
          qq->end.byteno = fseek(inp_stream->file, 0, 0);
        } else {
          qq->end.byteno = inp_stream->charcount - 1;
        }
        qq->end.lineno = inp_stream->linecount;
        qq->end.linepos = inp_stream->linepos - 1;
        qq->end.charno = inp_stream->charcount - 1;
        if (!(t->TokInfo)) {
          return CodeSpaceError(t, p, l);
        }
        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
        solo_flag = FALSE;
        ch = getchr(inp_stream);
        break;
      }
      t->Tok = Ord(kind = Ponctuation_tok);
      break;
    case EF:
      mark_eof(inp_stream);
      t->Tok = Ord(kind = eot_tok);
      t->TokInfo = TermEof;
      return l;

    default: {
      char err[1024];
      snprintf(err, 1023, "\n++++ token: unrecognised char %c (%d), type %c\n",
               ch, ch, chtype(ch));
    }
      t->Tok = Ord(kind = eot_tok);
      t->TokInfo = TermEof;
    }
#if DEBUG
    if (GLOBAL_Option[2])
      fprintf(stderr, "[Token %d %s]", Ord(kind),
              Yap_tokRep(t, inp_stream->encoding));
#endif
    if (LOCAL_ErrorMessage) {
      /* insert an error token to inform the system of what happened */
      TokEntry *e = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
      if (e == NULL) {
        return TrailSpaceError(p, l);
      }
      p->TokNext = e;
      e->Tok = Error_tok;
      e->TokInfo = MkAtomTerm(Yap_LookupAtom(LOCAL_ErrorMessage));
      e->TokPos = GetCurInpPos(inp_stream);
      e->TokNext = NULL;
      LOCAL_ErrorMessage = NULL;
      p = e;
    }
  } while (kind != eot_tok);

  return (l);
}

void Yap_clean_tokenizer(TokEntry *tokstart, VarEntry *vartable,
                         VarEntry *anonvartable) {
  CACHE_REGS
  struct scanner_extra_alloc *ptr = LOCAL_ScannerExtraBlocks;
  while (ptr) {
    struct scanner_extra_alloc *next = ptr->next;
    free(ptr);
    ptr = next;
  }
  TR = (tr_fr_ptr)tokstart;
  LOCAL_Comments = TermNil;
  LOCAL_CommentsNextChar = LOCAL_CommentsTail = NULL;
  if (LOCAL_CommentsBuff) {
    free(LOCAL_CommentsBuff);
    LOCAL_CommentsBuff = NULL;
  }
  LOCAL_CommentsBuffLim = 0;
}

/// @}
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								/*************************************************************************
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								*                   *
 								*   YAP Prolog                *
 								*                   *
 								*  Yap Prolog was developed at NCCUP - Universidade do Porto   *
 								*                   *
 								* Copyright L.Damas, V.S.Costa and Universidade do Porto 1985-2003   *
 								*                   *
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								**************************************************************************
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								*                   *
 								* File:    %W% %G%                     *
 								* Last rev:  22-1-03               *
 								* mods:                   *
 								* comments:  Prolog's scanner           *
 								*                   *
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								*************************************************************************/
 								/*
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								 * Description:
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								 *
 								 * This module produces a list of tokens for use by the parser. The calling
 								 * program should supply a routine int nextch(charpos) int *charpos; which,
 								 * when called should produce the next char or -1 if none availlable. The
 								 * scanner will stop producing tokens when it either finds an end of file
 								 * (-1) or a token consisting of just '.' followed by a blank or control
 								 * char. Scanner errors will be signalled by the scanner exiting with a non-
 								 * zero  ErrorMsg and ErrorPos. Note that, even in this case, the scanner
 								 * will try to find the end of the term. A function char
 								 * *AllocScannerMemory(nbytes) should be supplied for allocating (temporary)
 								 * space for strings and for the table of prolog variables occurring in the
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								 * term.
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								 *
 								 */
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								/**
 								@defgroup Formal_Syntax Syntax of Terms
 								@ingroup Syntax
 								@{
 								Prolog tokens are grouped into the following categories:
 								+ Numbers
 								Numbers can be further subdivided into integer and floating-point numbers.
 								  + Integers
 								Integer numbers
 								are described by the following regular expression:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								<integer> := {<digit>+<single-quote>|0{xXo}}<alpha_numeric_char>+
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								where {...} stands for optionality, \a + optional repetition (one or
 								more times), \a \\\<digit\\\> denotes one of the characters 0 ... 9, \a |
 								denotes or, and \a \\\<single-quote\\\> denotes the character "'". The digits
 								before the \a \\\<single-quote\\\> character, when present, form the number
 								basis, that can go from 0, 1 and up to 36. Letters from `A` to
 								`Z` are used when the basis is larger than 10.
 								Note that if no basis is specified then base 10 is assumed. Note also
 								that the last digit of an integer token can not be immediately followed
 								by one of the characters 'e', 'E', or '.'.
 								Following the ISO standard, YAP also accepts directives of the
 								form `0x` to represent numbers in hexadecimal base and of the form
 								`0o` to represent numbers in octal base. For usefulness,
 								YAP also accepts directives of the form `0X` to represent
 								numbers in hexadecimal base.
 								Example:
 								the following tokens all denote the same integer
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 2'1010  3'101  8'12  16'a  36'a  0xa  0o12
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								Numbers of the form `0'a` are used to represent character
 								constants. So, the following tokens denote the same integer:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 'd  100
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								YAP (version 6.3.4) supports integers that can fit
 								the word size of the machine. This is 32 bits in most current machines,
 								but 64 in some others, such as the Alpha running Linux or Digital
 								Unix. The scanner will read larger or smaller integers erroneously.
 								  + Floats
 								Floating-point numbers are described by:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								   <float> := <digit>+{<dot><digit>+}
 								               <exponent-marker>{<sign>}<digit>+
 								            |<digit>+<dot><digit>+
 								               {<exponent-marker>{<sign>}<digit>+}
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								where \a \\\<dot\\\> denotes the decimal-point character '.',
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								\a \\\<exponent-marker\\\> denotes one of 'e' or 'E', and \a \\\<sign\\\>
 								denotes
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								one of '+' or '-'.
 								Examples:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 .0   10e3   10e-3   3.1415e+3
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								Floating-point numbers are represented as a double in the target
 								machine. This is usually a 64-bit number.
 								+ Strings Character Strings
 								Strings are described by the following rules:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								  string --> '"' string_quoted_characters '"'
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  string --> '`' string_quoted_characters '`'
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
 								  string_quoted_characters --> '"' '"' string_quoted_characters
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  string_quoted_characters --> '`' '`' string_quoted_characters
 								string_quoted_characters --> '\'
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								                          escape_sequence string_quoted_characters
 								  string_quoted_characters -->
 								                          string_character string_quoted_characters
 								  escape_sequence --> 'a' | 'b' | 'r' | 'f' | 't' | 'n' | 'v'
 								  escape_sequence --> '\' | '"' | ''' | '`'
 								  escape_sequence --> at_most_3_octal_digit_seq_char '\'
 								  escape_sequence --> 'x' at_most_2_hexa_digit_seq_char '\'
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								where `string_character` in any character except the double quote
 								and escape characters.
 								Examples:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								""   "a string"   "a double-quote:"""
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								The first string is an empty string, the last string shows the use of
 								double-quoting. The implementation of YAP represents strings as
 								lists of integers. Since YAP 4.3.0 there is no static limit on string
 								size.
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								Escape sequences can be used anf include the non-printable characters
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								`a` (alert), `b` (backspace), `r` (carriage return),
 								`f` (form feed), `t` (horizontal tabulation), `n` (new
 								line), and `v` (vertical tabulation). Escape sequences also be
 								include the meta-characters `\\`, `"`, `'`, and
 								```. Last, one can use escape sequences to include the characters
 								either as an octal or hexadecimal number.
 								The next examples demonstrates the use of escape sequences in YAP:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								"\x0c\" "\01\" "\f" "\\"
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								The first three examples return a list including only character 12 (form
 								feed). The last example escapes the escape character.
 								Escape sequences were not available in C-Prolog and in original
 								versions of YAP up to 4.2.0. Escape sequences can be disable by using:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								:- yap_flag(character_escapes,false).
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								Since 6.3.4 YAP supports compact strings, that are not represented as
 								lists of codes, but instead as a sequence of UTF-8 encoded characters
 								in the execution stack. These strings do not require allocating a
 								symbol, as atoms do, but are much more compact than using lists of
 								codes.
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								+ Atoms Atoms
 								Atoms are defined by one of the following rules:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								   atom --> solo-character
 								   atom --> lower-case-letter name-character*
 								   atom --> symbol-character+
 								   atom --> single-quote  single-quote
 								   atom --> ''' atom_quoted_characters '''
 								  atom_quoted_characters --> ''' ''' atom_quoted_characters
 								  atom_quoted_characters --> '\' atom_sequence string_quoted_characters
 								  atom_quoted_characters --> character string_quoted_characters
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								where:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								   <solo-character>     denotes one of:    ! ;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								   <symbol-character>   denotes one of:    # & * + - . / : <
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								                                           = > ? @ \ ^ ~ `
 								   <lower-case-letter>  denotes one of:    a...z
 								   <name-character>     denotes one of:    _ a...z A...Z 0....9
 								   <single-quote>       denotes:           '
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								and `string_character` denotes any character except the double quote
 								and escape characters. Note that escape sequences in strings and atoms
 								follow the same rules.
 								Examples:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								a   a12x   '$a'   !   =>  '1 2'
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								Version `4.2.0` of YAP removed the previous limit of 256
 								characters on an atom. Size of an atom is now only limited by the space
 								available in the system.
 								+ Variables Variables
 								Variables are described by:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								   <variable-starter><variable-character>+
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								where
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								  <variable-starter>   denotes one of:    _ A...Z
 								  <variable-character> denotes one of:    _ a...z A...Z
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								If a variable is referred only once in a term, it needs not to be named
 								and one can use the character `_` to represent the variable. These
 								variables are known as anonymous variables. Note that different
 								occurrences of `_` on the same term represent <em>different</em>
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								anonymous variables.
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
 								+ Punctuation Tokens
 								Punctuation tokens consist of one of the following characters:
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								( ) , [ ] { } |
 								~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
 								These characters are used to group terms.
 								@subsection Layout Layout
 								Any characters with ASCII code less than or equal to 32 appearing before
 								a token are ignored.
 								All the text appearing in a line after the character \a % is taken to
 								be a comment and ignored (including \a %).  Comments can also be
 								inserted by using the sequence `/\*` to start the comment and
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								`\*` followed by `/` to finish it. In the presence of any sequence of comments
 								or
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								layout characters, the YAP parser behaves as if it had found a
 								single blank character. The end of a file also counts as a blank
 								character for this purpose.
 								+ Encoding Wide Character Support
 								YAP now implements a SWI-Prolog compatible interface to wide
 								characters and the Universal Character Set (UCS). The following text
 								was adapted from the SWI-Prolog manual.
 								YAP now  supports wide characters, characters with character
 								codes above 255 that cannot be represented in a single byte.
 								<em>Universal Character Set</em> (UCS) is the ISO/IEC 10646 standard
 								that specifies a unique 31-bits unsigned integer for any character in
 								any language.  It is a superset of 16-bit Unicode, which in turn is
 								a superset of ISO 8859-1 (ISO Latin-1), a superset of US-ASCII.  UCS
 								can handle strings holding characters from multiple languages and
 								character classification (uppercase, lowercase, digit, etc.) and
 								operations such as case-conversion are unambiguously defined.
 								For this reason YAP, following SWI-Prolog, has two representations for
 								atoms. If the text fits in ISO Latin-1, it is represented as an array
 								of 8-bit characters.  Otherwise the text is represented as an array of
 								wide chars, which may take 16 or 32 bits.  This representational issue
 								is completely transparent to the Prolog user.  Users of the foreign
 								language interface sometimes need to be aware of these issues though.
 								Character coding comes into view when characters of strings need to be
 								read from or written to file or when they have to be communicated to
 								other software components using the foreign language interface. In this
 								section we only deal with I/O through streams, which includes file I/O
 								as well as I/O through network sockets.
 								  + Stream_Encoding Wide character encodings on streams
 								Although characters are uniquely coded using the UCS standard
 								internally, streams and files are byte (8-bit) oriented and there are a
 								variety of ways to represent the larger UCS codes in an 8-bit octet
 								stream. The most popular one, especially in the context of the web, is
 								UTF-8. Bytes 0...127 represent simply the corresponding US-ASCII
 								character, while bytes 128...255 are used for multi-byte
 								encoding of characters placed higher in the UCS space. Especially on
 								MS-Windows the 16-bit Unicode standard, represented by pairs of bytes is
 								also popular.
 								Prolog I/O streams have a property called <em>encoding</em> which
 								specifies the used encoding that influence `get_code/2` and
 								`put_code/2` as well as all the other text I/O predicates.
 								The default encoding for files is derived from the Prolog flag
-												init spelling

											
										
										
											2015-11-05 15:28:39 +00:00
+								`encoding`, which is initialized from the environment.  If the
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								environment variable `LANG` ends in "UTF-8", this encoding is
 								assumed. Otherwise the default is `text` and the translation is
 								left to the wide-character functions of the C-library (note that the
 								Prolog native UTF-8 mode is considerably faster than the generic
 								`mbrtowc()` one).  The encoding can be specified explicitly in
 								load_files/2 for loading Prolog source with an alternative
 								encoding, `open/4` when opening files or using `set_stream/2` on
 								any open stream (not yet implemented). For Prolog source files we also
 								provide the `encoding/1` directive that can be used to switch
 								between encodings that are compatible to US-ASCII (`ascii`,
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								`iso_latin_1`, `utf8` and many locales).
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
 								For
 								additional information and Unicode resources, please visit
 								<http://www.unicode.org/>.
 								YAP currently defines and supports the following encodings:
 								  + octet
 								Default encoding for <em>binary</em> streams.  This causes
 								the stream to be read and written fully untranslated.
 								  + ascii
 -bit encoding in 8-bit bytes.  Equivalent to `iso_latin_1`,
 								but generates errors and warnings on encountering values above
 .
 								  + iso_latin_1
 -bit encoding supporting many western languages.  This causes
 								the stream to be read and written fully untranslated.
 								  + text
 								C-library default locale encoding for text files.  Files are read and
 								written using the C-library functions `mbrtowc()` and
 								`wcrtomb()`.  This may be the same as one of the other locales,
 								notably it may be the same as `iso_latin_1` for western
 								languages and `utf8` in a UTF-8 context.
 								  + utf8
 								Multi-byte encoding of full UCS, compatible to `ascii`.
 								See above.
 								  + unicode_be
 								Unicode Big Endian.  Reads input in pairs of bytes, most
 								significant byte first.  Can only represent 16-bit characters.
 								  + unicode_le
 								Unicode Little Endian.  Reads input in pairs of bytes, least
 								significant byte first.  Can only represent 16-bit characters.
 								Note that not all encodings can represent all characters. This implies
 								that writing text to a stream may cause errors because the stream
 								cannot represent these characters. The behaviour of a stream on these
 								errors can be controlled using `open/4` or `set_stream/2` (not
 								implemented). Initially the terminal stream write the characters using
 								Prolog escape sequences while other streams generate an I/O exception.
 								    + BOM BOM: Byte Order Mark
 								From Stream Encoding, you may have got the impression that
 								text-files are complicated. This section deals with a related topic,
 								making live often easier for the user, but providing another worry to
 								the programmer.   *BOM* or <em>Byte Order Marker</em> is a technique
 								for identifying Unicode text-files as well as the encoding they
 								use. Such files start with the Unicode character `0xFEFF`, a
 								non-breaking, zero-width space character. This is a pretty unique
 								sequence that is not likely to be the start of a non-Unicode file and
 								uniquely distinguishes the various Unicode file formats. As it is a
 								zero-width blank, it even doesn't produce any output. This solves all
 								problems, or ...
 								Some formats start of as US-ASCII and may contain some encoding mark to
 								switch to UTF-8, such as the `encoding="UTF-8"` in an XML header.
 								Such formats often explicitly forbid the the use of a UTF-8 BOM. In
 								other cases there is additional information telling the encoding making
 								the use of a BOM redundant or even illegal.
 								The BOM is handled by the `open/4` predicate. By default, text-files are
 								probed for the BOM when opened for reading. If a BOM is found, the
 								encoding is set accordingly and the property `bom(true)` is
 								available through stream_property/2. When opening a file for
 								writing, writing a BOM can be requested using the option
 								`bom(true)` with `open/4`.
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								                               */
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#include "Yap.h"
-												change Heap.h to YapHeap.h
rename $live to prolog/0 (for SWI compatibility).

											
										
										
											2009-10-23 14:22:17 +01:00
+								#include "YapHeap.h"
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								#include "Yatom.h"
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#include "alloc.h"
 								#include "eval.h"
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								#include "yapio.h"
-												more integration and cleanup work: allow irect access from YAP to swi global and local, to
check flags quickly.

											
										
										
											2013-11-13 12:57:52 +00:00
+								/* stuff we want to use in standard YAP code */
-												more string fixes and get_code

											
										
										
											2013-12-08 19:12:24 +00:00
+								#include "YapText.h"
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								#if _MSC_VER || defined(__MINGW32__)
 								#if HAVE_FINITE == 1
-												allow compilation on WIN32 platforms.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@577 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2002-08-28 15:02:35 +01:00
+								#undef HAVE_FINITE
 								#endif
 								#include <windows.h>
 								#endif
-												fix report on initial and token line for parser.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@540 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2002-06-17 16:28:01 +01:00
+								#include "iopreds.h"
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#if HAVE_STRING_H
 								#include <string.h>
 								#endif
-												get rid of debugging info.
routines are only define with wctype.h

											
										
										
											2010-05-05 12:51:38 +01:00
+								#if HAVE_WCTYPE_H
 								#include <wctype.h>
 								#endif
-												improve locale support in scanner and write: basically, just hide current locale

											
										
										
											2013-11-23 12:34:49 +00:00
+								#if O_LOCALE
 								#include "locale.h"
 								#endif
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
 								/* You just can't trust some machines */
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								#define my_isxdigit(C, SU, SL)                                                 \
 								  (chtype(C) == NU || (C >= 'A' && C <= (SU)) || (C >= 'a' && C <= (SL)))
 								#define my_isupper(C) (C >= 'A' && C <= 'Z')
 								#define my_islower(C) (C >= 'a' && C <= 'z')
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
-												get rid of PROTO

											
										
										
											2013-04-25 23:15:04 +01:00
+								static Term float_send(char *, int);
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								static Term get_num(int *, int *, struct stream_desc *, char *, UInt, int);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								static void Yap_setCurrentSourceLocation(struct stream_desc *s) {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#if HAVE_SOCKET
 								  if (s->status & Socket_Stream_f)
 								    LOCAL_SourceFileName = AtomSocket;
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								  else
 								#endif
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      if (s->status & Pipe_Stream_f)
 								    LOCAL_SourceFileName = AtomPipe;
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								  else if (s->status & InMemory_Stream_f)
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_SourceFileName = AtomCharsio;
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								  else
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_SourceFileName = s->name;
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  LOCAL_SourceFileLineno = s->linecount;
 								}
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
 								/* token table with some help from Richard O'Keefe's PD scanner */
-												gcc warnings

											
										
										
											2015-10-20 08:06:46 +01:00
+								char_kind_t Yap_chtype0[NUMBER_OF_CHARS + 1] = {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    EF,
 								    /* nul soh stx etx eot enq ack bel  bs  ht  nl  vt  np  cr  so  si
 								           */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* dle dc1 dc2 dc3 dc4 nak syn etb can  em sub esc  fs  gs  rs  us
 								           */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* sp   !   "   #   $   %   &   '   (   )   *   +   ,   -   .   / */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    BS, SL, DC, SY, SY, CC, SY, QT, BK, BK, SY, SY, BK, SY, SY, SY,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* 0   1   2   3   4   5   6   7   8   9   :   ;   <   =   >   ? */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    NU, NU, NU, NU, NU, NU, NU, NU, NU, NU, SY, SL, SY, SY, SY, SY,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* @   A   B   C   D   E   F   G   H   I   J   K   L   M   N   O */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    SY, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* P   Q   R   S   T   U   V   W   X   Y   Z   [   \   ]   ^   _ */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, BK, SY, BK, SY, UL,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* `   a   b   c   d   e   f   g   h   i   j   k   l   m   n   o */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    SY, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* p   q   r   s   t   u   v   w   x   y   z   {   |   }   ~ del */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, BK, BK, BK, SY, BS,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143
 								           */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* 144 145    147 148 149 150 151 152 153 154 155 156 157 158 159
 								           */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS, BS,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /*     ¡   ¢   £   ¤   ¥   ¦   §   ¨   ©   ª   «   ¬      ®   ¯ */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    BS, SY, SY, SY, SY, SY, SY, SY, SY, SY, LC, SY, SY, SY, SY, SY,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* °   ±   ²   ³   ´   µ   ¶   ·   ¸   ¹   º   »   ¼   ½   ¾   ¿ */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    SY, SY, LC, LC, SY, SY, SY, SY, SY, LC, LC, SY, SY, SY, SY, SY,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								    /* À   Á   Â   Ã   Ä   Å   Æ   Ç   È   É   Ê   Ë   Ì   Í   Î   Ï */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								/* Ð   Ñ   Ò   Ó   Ô   Õ   Ö   ×   Ø   Ù   Ú   Û   Ü   Ý   Þ   ß    */
 								#ifdef vms
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, UC, LC,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#else
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    UC, UC, UC, UC, UC, UC, UC, SY, UC, UC, UC, UC, UC, UC, UC, LC,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#endif
 								    /* à   á   â   ã   ä   å   æ   ç   è   é   ê   ë   ì   í   î   ï */
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC,
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
 								/* ð   ñ   ò   ó   ô   õ   ö   ÷   ø   ù   ú   û   ü   cannot write the last
 								 * three because of lcc    */
 								#ifdef vms
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC, LC
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#else
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    LC, LC, LC, LC, LC, LC, LC, SY, LC, LC, LC, LC, LC, LC, LC, LC
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#endif
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								};
-												Avoid complaining from strlen

`

											
										
										
											2015-09-21 23:05:36 +01:00
+								typedef struct scanner_internals {
 								  StreamDesc *t;
 								  TokEntry *ctok;
 								  char *_ScannerStack; // = (char *)TR;
 								  char *ScannerExtraBlocks;
 								  CELL *CommentsTail;
 								  CELL *Comments;
 								  CELL *CommentsNextChar;
 								  wchar_t *CommentsBuff;
 								  size_t CommentsBuffLim;
 								} scanner_internals;
 								// standard get char, uses conversion table
 								// and converts to wide
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#define getchr(inp) inp->stream_wgetc_for_read(inp - GLOBAL_Stream)
-												Avoid complaining from strlen

`

											
										
										
											2015-09-21 23:05:36 +01:00
+								// get char for quoted data, eg, quoted atoms and so on
 								// converts to wide
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#define getchrq(inp) inp->stream_wgetc(inp - GLOBAL_Stream)
-												Avoid complaining from strlen

`

											
										
										
											2015-09-21 23:05:36 +01:00
+								// get char for UTF-8 quoted data, eg, quoted strings
 								// reads bytes
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								#define getchru(inp) inp->stream_getc_utf8(inp - GLOBAL_Stream)
-												read works now.

											
										
										
											2011-02-15 07:39:27 +00:00
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								/* in case there is an overflow */
 								typedef struct scanner_extra_alloc {
 								  struct scanner_extra_alloc *next;
 								  void *filler;
 								} ScannerExtraBlock;
-												thread compilation

											
										
										
											2016-02-22 13:01:09 +00:00
+								#define CodeSpaceError(t, p, l) CodeSpaceError__(t, p, l PASS_REGS)
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								static TokEntry *CodeSpaceError__(TokEntry *t, TokEntry *p,
 								                                  TokEntry *l USES_REGS) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								  LOCAL_Error_TYPE = RESOURCE_ERROR_HEAP;
 								  LOCAL_ErrorMessage = "Code Space Overflow";
 								  if (t) {
 								    t->Tok = eot_tok;
 								    t->TokInfo = TermOutOfHeapError;
 								  }
 								  /* serious error now */
 								  return l;
 								}
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
-												thread compilation

											
										
										
											2016-02-22 13:01:09 +00:00
+								#define TrailSpaceError(t, l) TrailSpaceError__(t, l PASS_REGS)
 								static TokEntry *TrailSpaceError__(TokEntry *t, TokEntry *l USES_REGS) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								  LOCAL_ErrorMessage = "Trail Overflow";
 								  LOCAL_Error_TYPE = RESOURCE_ERROR_TRAIL;
 								  if (t) {
 								    t->Tok = eot_tok;
 								    t->TokInfo = TermOutOfTrailError;
 								  }
 								  return l;
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								}
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
-												thread compilation

											
										
										
											2016-02-22 13:01:09 +00:00
+								#define AuxSpaceError(p, l, msg) AuxSpaceError__(p, l, msg PASS_REGS)
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								static TokEntry *AuxSpaceError__(TokEntry *p, TokEntry *l,
 								                                 const char *msg USES_REGS) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								  /* huge atom or variable, we are in trouble */
 								  LOCAL_ErrorMessage = (char *)msg;
 								  LOCAL_Error_TYPE = RESOURCE_ERROR_AUXILIARY_STACK;
 								  Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								  if (p) {
 								    p->Tok = eot_tok;
 								    p->TokInfo = TermOutOfAuxspaceError;
 								  }
 								  /* serious error now */
 								  return l;
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								}
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								static void InitScannerMemory(void) {
 								  CACHE_REGS
 								  LOCAL_ErrorMessage = NULL;
 								  LOCAL_Error_Size = 0;
 								  LOCAL_ScannerStack = (char *)TR;
 								  LOCAL_ScannerExtraBlocks = NULL;
 								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static char *AllocScannerMemory(unsigned int size) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  char *AuxSpScan;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
-												change LOCALS macros.

											
										
										
											2011-05-04 10:11:41 +01:00
+								  AuxSpScan = LOCAL_ScannerStack;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  size = AdjustSize(size);
-												change LOCALS macros.

											
										
										
											2011-05-04 10:11:41 +01:00
+								  if (LOCAL_ScannerExtraBlocks) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    struct scanner_extra_alloc *ptr;
 								    if (!(ptr = (struct scanner_extra_alloc *)malloc(
 								              size + sizeof(ScannerExtraBlock)))) {
 								      return NULL;
 								    }
 								    ptr->next = LOCAL_ScannerExtraBlocks;
 								    LOCAL_ScannerExtraBlocks = ptr;
 								    return (char *)(ptr + 1);
 								  } else if (LOCAL_TrailTop <= AuxSpScan + size) {
 								    UInt alloc_size = sizeof(CELL) * K16;
 								    if (size > alloc_size)
 								      alloc_size = size;
 								    if (!Yap_growtrail(alloc_size, TRUE)) {
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								      struct scanner_extra_alloc *ptr;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								      if (!(ptr = (struct scanner_extra_alloc *)malloc(
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								                size + sizeof(ScannerExtraBlock)))) {
 								        return NULL;
 								      }
-												change LOCALS macros.

											
										
										
											2011-05-04 10:11:41 +01:00
+								      ptr->next = LOCAL_ScannerExtraBlocks;
 								      LOCAL_ScannerExtraBlocks = ptr;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								      return (char *)(ptr + 1);
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  }
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  LOCAL_ScannerStack = AuxSpScan + size;
-												Use Doug Lea's malloc as an alternative to YAP's standard malloc
don't use TR directly in scanner/parser, this avoids trouble with ^C while
consulting large files.
pass gcc -mno-cygwin to library compilation in cygwin environment (cygwin should
compile out of the box now).


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1168 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-10-28 21:12:23 +01:00
+								  return AuxSpScan;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static void PopScannerMemory(char *block, unsigned int size) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  if (block == LOCAL_ScannerStack - size) {
-												change LOCALS macros.

											
										
										
											2011-05-04 10:11:41 +01:00
+								    LOCAL_ScannerStack -= size;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  } else if (block == (char *)(LOCAL_ScannerExtraBlocks + 1)) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    struct scanner_extra_alloc *ptr = LOCAL_ScannerExtraBlocks;
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_ScannerExtraBlocks = ptr->next;
 								    free(ptr);
 								  }
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								char *Yap_AllocScannerMemory(unsigned int size) {
-												init spelling

											
										
										
											2015-11-05 15:28:39 +00:00
+								  /* I assume memory has been initialized */
-												all global symbols should now start with _YAP
global functions should not be called from within file (bug in
binutils/WIN32).


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@675 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2002-11-11 17:38:10 +00:00
+								  return AllocScannerMemory(size);
 								}
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								extern double atof(const char *);
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static Term float_send(char *s, int sign) {
 								  Float f = (Float)(sign * atof(s));
-												fix isfinite

											
										
										
											2013-11-23 14:51:56 +00:00
+								#if HAVE_ISFINITE || defined(isfinite)
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								  if (trueGlobalPrologFlag(ISO_FLAG)) { /* iso */
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (!isfinite(f)) {
 								      CACHE_REGS
 								      LOCAL_ErrorMessage = "Float overflow while scanning";
 								      return (MkEvalFl(f));
-												continue cleanup

											
										
										
											2013-11-16 00:27:02 +00:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  }
-												continue cleanup

											
										
										
											2013-11-16 00:27:02 +00:00
+								#elif HAVE_FINITE
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								  if (trueGlobalPrologFlag(ISO_FLAG)) { /* iso */
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (!finite(f)) {
 								      LOCAL_ErrorMessage = "Float overflow while scanning";
 								      return (MkEvalFl(f));
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  }
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#endif
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  {
 								    CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    return (MkEvalFl(f));
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  }
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								}
 								/* we have an overflow at s */
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static Term read_int_overflow(const char *s, Int base, Int val, int sign) {
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#ifdef USE_GMP
 								  /* try to scan it as a bignum */
-												support new interface between YAP and GMP, so that we don't rely on our own
allocation routines.
Several big fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1490 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-01-02 02:16:19 +00:00
+								  mpz_t new;
 								  Term t;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  mpz_init_set_str(new, s, base);
-												fix case where very big negative integer could have two representations.

											
										
										
											2009-01-12 15:08:26 +00:00
+								  if (sign < 0)
 								    mpz_neg(new, new);
-												support new interface between YAP and GMP, so that we don't rely on our own
allocation routines.
Several big fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1490 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-01-02 02:16:19 +00:00
+								  t = Yap_MkBigIntTerm(new);
-												avoid sideffects from MkBigInt


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1524 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-01-18 15:34:54 +00:00
+								  mpz_clear(new);
-												support new interface between YAP and GMP, so that we don't rely on our own
allocation routines.
Several big fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1490 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-01-02 02:16:19 +00:00
+								  return t;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#else
-												no GMP compilation

											
										
										
											2013-09-06 23:03:24 +01:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  /* try to scan it as a float */
 								  return MkIntegerTerm(val);
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								#endif
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static int send_error_message(char s[]) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  LOCAL_ErrorMessage = s;
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  return 0;
 								}
-												read works now.

											
										
										
											2011-02-15 07:39:27 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								static wchar_t read_quoted_char(int *scan_nextp,
 								                                struct stream_desc *inp_stream) {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  int ch;
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								/* escape sequence */
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								do_switch:
-												fix \c in quoted strings

											
										
										
											2014-08-07 02:02:28 +01:00
+								  ch = getchrq(inp_stream);
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
+								  switch (ch) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  case 10:
 								    return 0;
 								  case '\\':
 								    return '\\';
 								  case 'a':
 								    return '\a';
 								  case 'b':
 								    return '\b';
 								  case 'c':
 								    while (chtype((ch = getchrq(inp_stream))) == BS)
 								      ;
 								    {
 								      if (ch == '\\') {
 								        goto do_switch;
-												support for UNICODE, and other bug fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1725 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-11-27 17:42:03 +00:00
+								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      return ch;
 								    }
 								  case 'd':
 								    return 127;
 								  case 'e':
 								    return '\x1B'; /* <ESC>, a.k.a. \e */
 								  case 'f':
 								    return '\f';
 								  case 'n':
 								    return '\n';
 								  case 'r':
 								    return '\r';
 								  case 's': /* space */
 								    return ' ';
 								  case 't':
 								    return '\t';
 								  case 'u': {
 								    int i;
 								    wchar_t wc = '\0';
 								    for (i = 0; i < 4; i++) {
 								      ch = getchrq(inp_stream);
 								      if (ch >= '0' && ch <= '9') {
 								        wc += (ch - '0') << ((3 - i) * 4);
 								      } else if (ch >= 'a' && ch <= 'f') {
 								        wc += ((ch - 'a') + 10) << ((3 - i) * 4);
 								      } else if (ch >= 'A' && ch <= 'F') {
 								        wc += ((ch - 'A') + 10) << ((3 - i) * 4);
 								      } else {
 								        return send_error_message("invalid escape sequence");
-												support for UNICODE, and other bug fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1725 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-11-27 17:42:03 +00:00
+								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    }
 								    return wc;
 								  }
 								  case 'U': {
 								    int i;
 								    wchar_t wc = '\0';
 								    for (i = 0; i < 8; i++) {
 								      ch = getchrq(inp_stream);
 								      if (ch >= '0' && ch <= '9') {
 								        wc += (ch - '0') << ((7 - i) * 4);
 								      } else if (ch >= 'a' && ch <= 'f') {
 								        wc += ((ch - 'a') + 10) << ((7 - i) * 4);
 								      } else if (ch >= 'A' && ch <= 'F') {
 								        wc += ((ch - 'A') + 10) << ((7 - i) * 4);
 								      } else {
 								        return send_error_message("invalid escape sequence");
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
+								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    }
 								    return wc;
 								  }
 								  case 'v':
 								    return '\v';
 								  case 'z': /* Prolog end-of-file */
 								    return send_error_message("invalid escape sequence \\z");
 								  case '\'':
 								    return '\'';
 								  case '"':
 								    return '"';
 								  case '`':
 								    return '`';
 								  case '^':
 								    if (trueGlobalPrologFlag(ISO_FLAG)) {
 								      return send_error_message("invalid escape sequence");
 								    } else {
 								      ch = getchrq(inp_stream);
 								      if (ch == '?') { /* delete character */
 								        return 127;
 								      } else if (ch >= 'a' && ch < 'z') { /* hexa */
 								        return ch - 'a';
 								      } else if (ch >= 'A' && ch < 'Z') { /* hexa */
 								        return ch - 'A';
 								      } else {
 								        return '^';
 								      }
 								    }
 								  case '0':
 								  case '1':
 								  case '2':
 								  case '3':
 								  case '4':
 								  case '5':
 								  case '6':
 								  case '7':
 								    /* character in octal: maximum of 3 digits, terminates with \ */
 								    /* follow ISO */
 								    {
 								      unsigned char so_far = ch - '0';
 								      ch = getchrq(inp_stream);
 								      if (ch >= '0' && ch < '8') { /* octal */
 								        so_far = so_far * 8 + (ch - '0');
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								        ch = getchrq(inp_stream);
 								        if (ch >= '0' && ch < '8') { /* octal */
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          so_far = so_far * 8 + (ch - '0');
 								          ch = getchrq(inp_stream);
 								          if (ch != '\\') {
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								            return send_error_message("invalid octal escape sequence");
 								          }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          return so_far;
 								        } else if (ch == '\\') {
 								          return so_far;
 								        } else {
 								          return send_error_message("invalid octal escape sequence");
 								        }
 								      } else if (ch == '\\') {
 								        return so_far;
 								      } else {
 								        return send_error_message("invalid octal escape sequence");
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
+								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    }
 								  case 'x':
 								    /* hexadecimal character (YAP allows empty hexadecimal  */
 								    {
 								      unsigned char so_far = 0;
 								      ch = getchrq(inp_stream);
 								      if (my_isxdigit(ch, 'f', 'F')) { /* hexa */
 								        so_far =
 								            so_far * 16 + (chtype(ch) == NU
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								                               ? ch - '0'
 								                               : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        ch = getchrq(inp_stream);
 								        if (my_isxdigit(ch, 'f', 'F')) { /* hexa */
 								          so_far =
 								              so_far * 16 + (chtype(ch) == NU
 								                                 ? ch - '0'
 								                                 : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
 								          ch = getchrq(inp_stream);
 								          if (ch == '\\') {
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								            return so_far;
 								          } else {
 								            return send_error_message("invalid hexadecimal escape sequence");
 								          }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        } else if (ch == '\\') {
 								          return so_far;
 								        } else {
 								          return send_error_message("invalid hexadecimal escape sequence");
 								        }
 								      } else if (ch == '\\') {
 								        return so_far;
 								      } else {
 								        return send_error_message("invalid hexadecimal escape sequence");
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
+								      }
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  default:
 								    /* accept sequence. Note that the ISO standard does not
 								     consider this sequence legal, whereas SICStus would
 								     eat up the escape sequence. */
 								    return send_error_message("invalid escape sequence");
 								  }
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
 								static int num_send_error_message(char s[]) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  LOCAL_ErrorMessage = s;
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  return TermNil;
 								}
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								/* reads a number, either integer or float */
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								static Term get_num(int *chp, int *chbuffp, StreamDesc *inp_stream, char *s,
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								                    UInt max_size, int sign) {
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								  char *sp = s;
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
+								  int ch = *chp;
-												cosmetic

											
										
										
											2015-01-18 01:32:13 +00:00
+								  Int val = 0L, base = ch - '0';
 								  int might_be_float = TRUE, has_overflow = FALSE;
-												improve locale support in scanner and write: basically, just hide current locale

											
										
										
											2013-11-23 12:34:49 +00:00
+								  const unsigned char *decimalpoint;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
 								  *sp++ = ch;
-												read works now.

											
										
										
											2011-02-15 07:39:27 +00:00
+								  ch = getchr(inp_stream);
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  /*
 								   * because of things like 00'2, 03'2 and even better 12'2, I need to
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								   * do this (have mercy)
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								   */
-												add SWI's char and code_type/2.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@2049 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2007-12-29 12:26:41 +00:00
+								  if (chtype(ch) == NU) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    *sp++ = ch;
 								    if (--max_size == 0) {
 								      return num_send_error_message("Number Too Long");
-												First step to actually handling scanner overflows the right way


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1185 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-11-22 22:28:06 +00:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    base = 10 * base + ch - '0';
 								    ch = getchr(inp_stream);
 								  }
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  if (ch == '\'') {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (base > 36) {
 								      return num_send_error_message("Admissible bases are 0..36");
 								    }
 								    might_be_float = FALSE;
 								    if (--max_size == 0) {
 								      return num_send_error_message("Number Too Long");
 								    }
 								    *sp++ = ch;
 								    ch = getchr(inp_stream);
 								    if (base == 0) {
 								      CACHE_REGS
 								      wchar_t ascii = ch;
 								      int scan_extra = TRUE;
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      if (ch == '\\' &&
 								          Yap_GetModuleEntry(CurrentModule)->flags & M_CHARESCAPE) {
 								        ascii = read_quoted_char(&scan_extra, inp_stream);
-												syntax error handling

											
										
										
											2015-07-28 04:22:44 +01:00
+								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      /* a quick way to represent ASCII */
 								      if (scan_extra)
 								        *chp = getchr(inp_stream);
 								      if (sign == -1) {
 								        return MkIntegerTerm(-ascii);
 								      }
 								      return MkIntegerTerm(ascii);
 								    } else if (base >= 10 && base <= 36) {
 								      int upper_case = 'A' - 11 + base;
 								      int lower_case = 'a' - 11 + base;
 								      while (my_isxdigit(ch, upper_case, lower_case)) {
 								        Int oval = val;
 								        int chval =
 								            (chtype(ch) == NU ? ch - '0'
 								                              : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
 								        if (--max_size == 0) {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								          return num_send_error_message("Number Too Long");
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        *sp++ = ch;
 								        val = oval * base + chval;
 								        if (oval != (val - chval) / base) /* overflow */
 								          has_overflow = (has_overflow || TRUE);
 								        ch = getchr(inp_stream);
 								      }
 								    }
 								  } else if (ch == 'x' && base == 0) {
 								    might_be_float = FALSE;
 								    if (--max_size == 0) {
 								      return num_send_error_message("Number Too Long");
 								    }
 								    *sp++ = ch;
 								    ch = getchr(inp_stream);
 								    while (my_isxdigit(ch, 'F', 'f')) {
 								      Int oval = val;
 								      int chval =
 								          (chtype(ch) == NU ? ch - '0'
 								                            : (my_isupper(ch) ? ch - 'A' : ch - 'a') + 10);
 								      if (--max_size == 0) {
 								        return num_send_error_message("Number Too Long");
 								      }
-												cosmetic

											
										
										
											2015-01-18 01:32:13 +00:00
+								      *sp++ = ch;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      val = val * 16 + chval;
 								      if (oval != (val - chval) / 16) /* overflow */
 								        has_overflow = TRUE;
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								      ch = getchr(inp_stream);
-												fix case where very big negative integer could have two representations.

											
										
										
											2009-01-12 15:08:26 +00:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    *chp = ch;
 								  } else if (ch == 'o' && base == 0) {
 								    might_be_float = FALSE;
 								    base = 8;
 								    ch = getchr(inp_stream);
 								  } else if (ch == 'b' && base == 0) {
 								    might_be_float = FALSE;
 								    base = 2;
 								    ch = getchr(inp_stream);
 								  } else {
 								    val = base;
 								    base = 10;
 								  }
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  while (chtype(ch) == NU) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    Int oval = val;
 								    if (!(val == 0 && ch == '0') || has_overflow) {
 								      if (--max_size == 0) {
 								        return num_send_error_message("Number Too Long");
 								      }
 								      *sp++ = ch;
 								    }
 								    if (ch - '0' >= base) {
 								      CACHE_REGS
 								      if (sign == -1)
 								        return MkIntegerTerm(-val);
 								      return MkIntegerTerm(val);
 								    }
 								    val = val * base + ch - '0';
 								    if (val / base != oval || val - oval * base != ch - '0') /* overflow */
 								      has_overflow = TRUE;
 								    ch = getchr(inp_stream);
 								  }
 								  if (might_be_float && (ch == '.' || ch == 'e' || ch == 'E')) {
 								    int has_dot = (ch == '.');
 								    if (has_dot) {
 								      unsigned char *dp;
 								      int dc;
 								      if (chtype(ch = getchr(inp_stream)) != NU) {
 								        if (ch == 'e' || ch == 'E') {
 								          if (trueGlobalPrologFlag(ISO_FLAG))
 								            return num_send_error_message(
 								                "Float format not allowed in ISO mode");
 								        } else { /* followed by a letter, end of term? */
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								          CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          sp[0] = '\0';
 								          *chbuffp = '.';
 								          *chp = ch;
 								          if (has_overflow)
 								            return read_int_overflow(s, base, val, sign);
 								          if (sign == -1)
 								            return MkIntegerTerm(-val);
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          return MkIntegerTerm(val);
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      }
-												improve locale support in scanner and write: basically, just hide current locale

											
										
										
											2013-11-23 12:34:49 +00:00
+								#if O_LOCALE
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      if ((decimalpoint = (unsigned char *)(localeconv()->decimal_point)) ==
 								          NULL)
-												improve locale support in scanner and write: basically, just hide current locale

											
										
										
											2013-11-23 12:34:49 +00:00
+								#endif
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        decimalpoint = (const unsigned char *)".";
 								      dp = (unsigned char *)decimalpoint;
 								      /* translate . to current locale */
 								      while ((dc = *dp++) != '\0') {
 								        *sp++ = dc;
 								        if (--max_size == 0) {
 								          return num_send_error_message("Number Too Long");
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      }
 								      /* numbers after . */
 								      if (chtype(ch) == NU) {
 								        do {
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          if (--max_size == 0) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            return num_send_error_message("Number Too Long");
 								          }
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          *sp++ = ch;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        } while (chtype(ch = getchr(inp_stream)) == NU);
 								      }
 								    }
 								    if (ch == 'e' || ch == 'E') {
 								      if (--max_size == 0) {
 								        return num_send_error_message("Number Too Long");
 								      }
 								      *sp++ = ch;
 								      ch = getchr(inp_stream);
 								      if (ch == '-') {
 								        if (--max_size == 0) {
 								          return num_send_error_message("Number Too Long");
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        *sp++ = '-';
 								        ch = getchr(inp_stream);
 								      } else if (ch == '+') {
 								        ch = getchr(inp_stream);
 								      }
 								      if (chtype(ch) != NU) {
 								        CACHE_REGS
 								        if (has_dot)
 								          return float_send(s, sign);
 								        return MkIntegerTerm(sign * val);
 								      }
 								      do {
 								        if (--max_size == 0) {
 								          return num_send_error_message("Number Too Long");
 								        }
 								        *sp++ = ch;
 								      } while (chtype(ch = getchr(inp_stream)) == NU);
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    *sp = '\0';
 								    *chp = ch;
 								    return float_send(s, sign);
 								  } else if (has_overflow) {
 								    *sp = '\0';
 								    /* skip base */
 								    *chp = ch;
 								    if (s[0] == '0' && s[1] == 'x')
 								      return read_int_overflow(s + 2, 16, val, sign);
 								    else if (s[0] == '0' && s[1] == 'o')
 								      return read_int_overflow(s + 2, 8, val, sign);
 								    else if (s[0] == '0' && s[1] == 'b')
 								      return read_int_overflow(s + 2, 2, val, sign);
 								    if (s[1] == '\'')
 								      return read_int_overflow(s + 2, base, val, sign);
 								    if (s[2] == '\'')
 								      return read_int_overflow(s + 3, base, val, sign);
 								    return read_int_overflow(s, base, val, sign);
 								  } else {
 								    CACHE_REGS
 								    *chp = ch;
 								    return MkIntegerTerm(val * sign);
 								  }
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								}
-												read works now.

											
										
										
											2011-02-15 07:39:27 +00:00
+								/* given a function getchr scan until we  either find the number
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								   or end of file */
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								Term Yap_scan_num(StreamDesc *inp) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  Term out;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  int sign = 1;
-												WIN32 support


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1790 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2007-01-28 14:26:37 +00:00
+								  int ch, cherr;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  char *ptr, *mp;
 								  int kind;
 								  void *old_tr = TR;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  InitScannerMemory();
 								  LOCAL_VarTable = LOCAL_AnonVarTable = NULL;
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								  if (!(ptr = AllocScannerMemory(4096))) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_ErrorMessage = "Trail Overflow";
 								    LOCAL_Error_TYPE = RESOURCE_ERROR_TRAIL;
 								    return 0;
 								  }
 								  TokEntry *tokptr = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								  tokptr->TokPos = GetCurInpPos(inp);
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								  while ((ch = getchr(inp)) == BS)
 								    ;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  if (ch == '-') {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    sign = -1;
 								    ch = getchr(inp);
 								  } else if (ch == '+') {
 								    ch = getchr(inp);
 								  }
 								  if (chtype(ch) == NU) {
 								    cherr = '\0';
 								    if (ASP - HR < 1024) {
 								      Yap_clean_tokenizer(old_tr, NULL, NULL);
 								      LOCAL_ErrorMessage = "Stack Overflow";
 								      LOCAL_Error_TYPE = RESOURCE_ERROR_STACK;
 								      return 0;
 								    }
 								    out = get_num(&ch, &cherr, inp, ptr, 4096, sign); /*  */
 								  }
 								  if (LOCAL_ErrorMessage != NULL || ch != -1 || cherr) {
 								    CACHE_REGS
 								    char *s = ptr;
 								    int sign = 1;
 								    out = 0;
 								    if (s[0] == '+') {
 								      s++;
 								    }
 								    if (s[0] == '-') {
 								      s++;
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								      sign = -1;
 								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (strcmp(s, "inf") == 0) {
 								      if (sign > 0) {
 								        out = MkFloatTerm(INFINITY);
 								      } else {
 								        out = MkFloatTerm(-INFINITY);
 								      }
 								    }
 								    if (strcmp(s, "nan") == 0) {
 								      if (sign > 0) {
 								        out = MkFloatTerm(NAN);
 								      } else {
 								        out = MkFloatTerm(-NAN);
 								      }
 								    }
 								    if (out == 0) {
 								      TokEntry *e, *ef;
 								      size_t len = strlen(ptr);
 								      mp = AllocScannerMemory(len + 1);
 								      tokptr->Tok = Ord(kind = String_tok);
 								      tokptr->TokInfo = Unsigned(mp);
 								      e = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								      ef = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								      tokptr->TokNext = e;
 								      e->Tok = Error_tok;
-												init spelling

											
										
										
											2015-11-05 15:28:39 +00:00
+								      if (!LOCAL_ErrorMessage) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        LOCAL_ErrorMessage =
 								            "syntax error while converting from a string to a number";
-												init spelling

											
										
										
											2015-11-05 15:28:39 +00:00
+								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      e->TokInfo = MkAtomTerm(Yap_LookupAtom(LOCAL_ErrorMessage));
 								      e->TokPos = GetCurInpPos(inp);
 								      e->TokNext = ef;
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								      ef->Tok = Ord(kind = eot_tok);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								      ef->TokInfo = TermSyntaxError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      ef->TokPos = GetCurInpPos(inp);
 								      ef->TokNext = NULL;
 								      LOCAL_tokptr = tokptr;
 								      LOCAL_toktide = e;
 								      LOCAL_ErrorMessage = NULL;
 								      LOCAL_Error_Term = Yap_syntax_error(e, inp - GLOBAL_Stream);
 								      LOCAL_Error_TYPE = SYNTAX_ERROR;
 								    }
-												clean_tok should reset ASP properly

											
										
										
											2015-10-08 10:18:47 +01:00
+								  }
-												some extra bug fixes for trail overflows: some cannot be recovered that easily,
some can.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1219 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-12-28 22:20:37 +00:00
+								  PopScannerMemory(ptr, 4096);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  Yap_clean_tokenizer(old_tr, NULL, NULL);
-												WIN32 support


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1790 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2007-01-28 14:26:37 +00:00
+								  return out;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								#define CHECK_SPACE()                                                          \
 								  if (ASP - HR < 1024) {                                                       \
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_ErrorMessage = "Stack Overflow";                                     \
 								    LOCAL_Error_TYPE = RESOURCE_ERROR_STACK;                                   \
 								    LOCAL_Error_Size = 0L;                                                     \
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								    if (p) {                                                                   \
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      p->Tok = Ord(kind = eot_tok);                                            \
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								      p->TokInfo = TermOutOfStackError;                                        \
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								    }                                                                          \
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    /* serious error now */                                                    \
 								    return l;                                                                  \
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  }
-												support for UNICODE, and other bug fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1725 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-11-27 17:42:03 +00:00
-												win64 support: encodings

											
										
										
											2016-04-10 14:21:17 +01:00
+								const char *Yap_tokRep(TokEntry *tokptr, encoding_t encoding) {
-												syntax_eror support

											
										
										
											2015-08-18 20:59:24 +01:00
+								  CACHE_REGS
 								  Term info = tokptr->TokInfo;
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								  char *buf = LOCAL_FileNameBuf2;
 								  size_t length;
-												syntax_eror support

											
										
										
											2015-08-18 20:59:24 +01:00
+								  UInt flags = 0;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
-												syntax_eror support

											
										
										
											2015-08-18 20:59:24 +01:00
+								  switch (tokptr->Tok) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  case Name_tok:
-												Fix encoding

											
										
										
											2016-02-18 12:10:58 +00:00
+								    if (IsWideAtom((Atom)info)) {
 								      wchar_t *wc = RepAtom((Atom)info)->WStrOfAE;
 								      Term s = Yap_WCharsToString(wc PASS_REGS);
 								      return StringOfTerm(s);
 								    }
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    return RepAtom((Atom)info)->StrOfAE;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  case Number_tok:
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    return Yap_TermToString(info, &length, encoding, flags);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  case Var_tok: {
 								    VarEntry *varinfo = (VarEntry *)info;
-												- add debugging code to memory allocation
= atom to number: cleanup function name as this is more than just supporting SWI
- SrcMask only applies to rules.
- add auxiliary type/error function
- fix typos
- fix enumeration mode in current_flag
- get rid of TrueName routines, use AbsoluteFileName instead.
- detail synatx error source

											
										
										
											2016-02-11 14:00:56 +00:00
+								    varinfo->VarAdr = TermNil;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    return varinfo->VarRep;
 								  }
 								  case String_tok:
 								  case BQString_tok:
 								    return (char *)info;
 								  case WString_tok:
 								  case WBQString_tok: {
 								    wchar_t *op = (wchar_t *)info;
 								    wchar_t c;
 								    unsigned char *bp = (unsigned char *)buf;
 								    while ((c = *op++)) {
 								      bp += put_utf8(bp, c);
-												syntax_eror support

											
										
										
											2015-08-18 20:59:24 +01:00
+								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    bp[0] = '\0';
 								    return buf;
 								  }
 								  case Error_tok:
 								    return "<ERR>";
 								  case eot_tok:
 								    return "<EOT>";
 								  case Ponctuation_tok: {
 								    buf[1] = '\0';
 								    if ((info) == 'l') {
 								      buf[0] = '(';
 								    } else {
 								      buf[0] = (char)info;
 								    }
 								  }
 								    return buf;
 								  case QuasiQuotes_tok:
 								  case WQuasiQuotes_tok:
 								    return "<QQ>";
-												syntax_eror support

											
										
										
											2015-08-18 20:59:24 +01:00
+								  }
 								}
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								static void open_comment(int ch, StreamDesc *inp_stream USES_REGS) {
-												Windows fixes, mainly H -> HB

											
										
										
											2014-01-19 21:15:05 +00:00
+								  CELL *h0 = HR;
 								  HR += 5;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  h0[0] = AbsAppl(h0 + 2);
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  h0[1] = TermNil;
 								  if (!LOCAL_CommentsTail) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    /* first comment */
 								    LOCAL_Comments = AbsPair(h0);
 								  } else {
 								    /* extra comment */
 								    *LOCAL_CommentsTail = AbsPair(h0);
 								  }
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  LOCAL_CommentsTail = h0 + 1;
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  h0 += 2;
 								  h0[0] = (CELL)FunctorMinus;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  h0[1] = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  h0[2] = TermNil;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  LOCAL_CommentsNextChar = h0 + 2;
 								  LOCAL_CommentsBuff = (wchar_t *)malloc(1024 * sizeof(wchar_t));
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  LOCAL_CommentsBuffLim = 1024;
 								  LOCAL_CommentsBuff[0] = ch;
 								  LOCAL_CommentsBuffPos = 1;
 								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static void extend_comment(int ch USES_REGS) {
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  LOCAL_CommentsBuff[LOCAL_CommentsBuffPos] = ch;
 								  LOCAL_CommentsBuffPos++;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  if (LOCAL_CommentsBuffPos == LOCAL_CommentsBuffLim - 1) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_CommentsBuff = (wchar_t *)realloc(
 								        LOCAL_CommentsBuff, sizeof(wchar_t) * (LOCAL_CommentsBuffLim + 4096));
 								    LOCAL_CommentsBuffLim += 4096;
 								  }
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static void close_comment(USES_REGS1) {
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  LOCAL_CommentsBuff[LOCAL_CommentsBuffPos] = '\0';
-												new text conversion and string code (big changes, take care please)

											
										
										
											2013-12-02 14:49:41 +00:00
+								  *LOCAL_CommentsNextChar = Yap_WCharsToString(LOCAL_CommentsBuff PASS_REGS);
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								  free(LOCAL_CommentsBuff);
-												improve BigNum handling.

											
										
										
											2011-06-21 15:11:07 +01:00
+								  LOCAL_CommentsBuff = NULL;
-												fixes to scanner code.

											
										
										
											2011-06-14 08:58:51 +01:00
+								  LOCAL_CommentsBuffLim = 0;
-												support for option comments(X) in read_term/1.

											
										
										
											2011-06-12 17:23:10 +01:00
+								}
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								// mark that we reached EOF,
 								// next  token will be end_of_file)
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								static void mark_eof(struct stream_desc *inp_stream) {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								  inp_stream->status |= Push_Eof_Stream_f;
 								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								static wchar_t *ch_to_wide(char *base, char *charp) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  int n = charp - base, i;
-												support for UNICODE, and other bug fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1725 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-11-27 17:42:03 +00:00
+								  wchar_t *nb = (wchar_t *)base;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  if ((nb + n) + 1024 > (wchar_t *)AuxSp) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    LOCAL_Error_TYPE = RESOURCE_ERROR_AUXILIARY_STACK;
 								    LOCAL_ErrorMessage =
 								        "Heap Overflow While Scanning: please increase code space (-h)";
 								    return NULL;
 								  }
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  for (i = n; i > 0; i--) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    nb[i - 1] = (unsigned char)base[i - 1];
 								  }
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  return nb + n;
-												support for UNICODE, and other bug fixes.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1725 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2006-11-27 17:42:03 +00:00
+								}
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								#define add_ch_to_buff(ch)                                                     \
 								  if (wcharp) {                                                                \
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    *wcharp++ = (ch);                                                          \
 								    if (wcharp >= (wchar_t *)AuxSp - 1024)                                     \
 								      goto huge_var_error;                                                     \
 								    charp = (char *)wcharp;                                                    \
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  } else {                                                                     \
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (ch > MAX_ISO_LATIN1 && !wcharp) {                                      \
 								      /* does not fit in ISO-LATIN */                                          \
 								      wcharp = ch_to_wide(TokImage, charp);                                    \
 								      if (!wcharp)                                                             \
 								        goto huge_var_error;                                                   \
 								      *wcharp++ = (ch);                                                        \
 								      charp = (char *)wcharp;                                                  \
 								    } else {                                                                   \
 								      if (charp >= (char *)AuxSp - 1024)                                       \
 								        goto huge_var_error;                                                   \
 								      *charp++ = ch;                                                           \
 								    }                                                                          \
-												improvements to language support.

											
										
										
											2010-05-05 12:45:11 +01:00
+								  }
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								#define add_ch_to_utf8_buff(ch)                                                \
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  {                                                                            \
 								    if ((ch & 0xff) == ch) {                                                   \
 								      *charp++ = ch;                                                           \
 								    } else {                                                                   \
 								      charp = _PL__put_utf8(charp, chr);                                       \
 								    }                                                                          \
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  }
-												quasi quote

											
										
										
											2013-11-25 10:24:13 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								TokEntry *Yap_tokenizer(struct stream_desc *inp_stream, bool store_comments,
 								                        Term *tposp) {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
 								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  TokEntry *t, *l, *p;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  enum TokenKinds kind;
 								  int solo_flag = TRUE;
-												WIN32 support


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1790 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2007-01-28 14:26:37 +00:00
+								  int ch;
 								  wchar_t *wcharp;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  struct qq_struct_t *cur_qq = NULL;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  InitScannerMemory();
-												Some global variables(Yap_thread_gl) in C/init.c are now declared in misc/LOCALS

											
										
										
											2011-05-23 16:19:47 +01:00
+								  LOCAL_VarTable = NULL;
 								  LOCAL_AnonVarTable = NULL;
-												First step to actually handling scanner overflows the right way


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1185 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-11-22 22:28:06 +00:00
+								  l = NULL;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								  p = NULL; /* Just to make lint happy */
-												read works now.

											
										
										
											2011-02-15 07:39:27 +00:00
+								  ch = getchr(inp_stream);
-												mainbranch fixes to:
- fix quoted characters output
- fix line position in read_term and friends
- make messages look a bit better
- CLP(BN) EM improvements.

											
										
										
											2008-10-23 22:17:45 +01:00
+								  while (chtype(ch) == BS) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    ch = getchr(inp_stream);
 								  }
 								  *tposp = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								  Yap_setCurrentSourceLocation(inp_stream);
-												make $ SY from very beginning

											
										
										
											2015-07-23 01:26:40 +01:00
+								  LOCAL_StartLineCount = inp_stream->linecount;
 								  LOCAL_StartLinePos = inp_stream->linepos;
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  do {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								    wchar_t och, pch;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    int quote, isvar;
 								    char *charp, *mp;
 								    unsigned int len;
 								    char *TokImage = NULL;
 								    t = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								    t->TokNext = NULL;
 								    if (t == NULL) {
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								      return TrailSpaceError(p, l);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    }
 								    if (!l)
 								      l = t;
 								    else
 								      p->TokNext = t;
 								    p = t;
 								  restart:
 								    while (chtype(ch) == BS) {
 								      ch = getchr(inp_stream);
 								    }
 								    t->TokPos = GetCurInpPos(inp_stream);
 								    switch (chtype(ch)) {
 								    case CC:
 								      if (store_comments) {
 								        CHECK_SPACE();
 								        open_comment(ch, inp_stream PASS_REGS);
 								      continue_comment:
 								        while ((ch = getchr(inp_stream)) != 10 && chtype(ch) != EF) {
 								          CHECK_SPACE();
 								          extend_comment(ch PASS_REGS);
 								        }
 								        CHECK_SPACE();
 								        extend_comment(ch PASS_REGS);
 								        if (chtype(ch) != EF) {
 								          ch = getchr(inp_stream);
 								          if (chtype(ch) == CC) {
 								            extend_comment(ch PASS_REGS);
 								            goto continue_comment;
 								          }
 								        }
 								        close_comment(PASS_REGS1);
 								      } else {
 								        while ((ch = getchr(inp_stream)) != 10 && chtype(ch) != EF)
 								          ;
 								      }
 								      if (chtype(ch) != EF) {
 								        /* blank space */
 								        if (t == l) {
 								          /* we found a comment before reading characters */
 								          while (chtype(ch) == BS) {
 								            ch = getchr(inp_stream);
 								          }
 								          CHECK_SPACE();
 								          *tposp = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
 								          Yap_setCurrentSourceLocation(inp_stream);
 								        }
 								        goto restart;
 								      } else {
 								        t->Tok = Ord(kind = eot_tok);
 								        mark_eof(inp_stream);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								        t->TokInfo = TermEof;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      }
 								      break;
 								    case UC:
 								    case UL:
 								    case LC:
 								      och = ch;
 								      ch = getchr(inp_stream);
 								    scan_name:
 								      TokImage = (char *)((AtomEntry *)(Yap_PreAllocCodeSpace()))->StrOfAE;
 								      charp = TokImage;
 								      wcharp = NULL;
 								      isvar = (chtype(och) != LC);
 								      add_ch_to_buff(och);
 								      for (; chtype(ch) <= NU; ch = getchr(inp_stream)) {
 								        if (charp == (char *)AuxSp - 1024) {
 								        huge_var_error:
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          return AuxSpaceError(p, l, "Code Space Overflow due to huge atom");
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          /* huge atom or variable, we are in trouble */
 								        }
 								        add_ch_to_buff(ch);
 								      }
 								      while (ch == '\'' && isvar &&
 								             trueGlobalPrologFlag(VARIABLE_NAMES_MAY_END_WITH_QUOTES_FLAG)) {
 								        if (charp == (char *)AuxSp - 1024) {
 								          goto huge_var_error;
 								        }
 								        add_ch_to_buff(ch);
 								        ch = getchr(inp_stream);
 								      }
 								      add_ch_to_buff('\0');
 								      if (!isvar) {
 								        Atom ae;
 								        /* don't do this in iso */
 								        if (wcharp) {
 								          ae = Yap_LookupWideAtom((wchar_t *)TokImage);
 								        } else {
 								          ae = Yap_LookupAtom(TokImage);
 								        }
 								        if (ae == NIL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return CodeSpaceError(t, p, l);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        }
 								        t->TokInfo = Unsigned(ae);
 								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        if (ch == '(')
 								          solo_flag = FALSE;
 								        t->Tok = Ord(kind = Name_tok);
 								      } else {
 								        VarEntry *ve = Yap_LookupVar(TokImage);
 								        t->TokInfo = Unsigned(ve);
 								        if (cur_qq) {
 								          ve->refs++;
 								        }
 								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        t->Tok = Ord(kind = Var_tok);
 								      }
 								      break;
 								    case NU: {
 								      int cherr;
 								      int cha = ch;
 								      char *ptr;
 								      cherr = 0;
 								      if (!(ptr = AllocScannerMemory(4096))) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								        return TrailSpaceError(t, l);
 								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      CHECK_SPACE();
 								      if ((t->TokInfo = get_num(&cha, &cherr, inp_stream, ptr, 4096, 1)) ==
 L) {
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								        if (p) {
 								          p->Tok = eot_tok;
 								          t->TokInfo = TermError;
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        /* serious error now */
 								        return l;
 								      }
 								      PopScannerMemory(ptr, 4096);
 								      ch = cha;
 								      if (cherr) {
 								        TokEntry *e;
 								        t->Tok = Number_tok;
 								        t->TokPos = GetCurInpPos(inp_stream);
 								        e = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								        if (e == NULL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return TrailSpaceError(p, l);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        } else {
 								          e->TokNext = NULL;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        t->TokNext = e;
 								        t = e;
 								        p = e;
 								        switch (cherr) {
 								        case 'e':
 								        case 'E':
 								          och = cherr;
 								          goto scan_name;
 								          break;
 								        case '=':
 								        case '_':
 								          /* handle error while parsing a float */
 								          {
 								            TokEntry *e2;
 								            t->Tok = Ord(Var_tok);
 								            t->TokInfo = Unsigned(Yap_LookupVar("E"));
 								            t->TokPos = GetCurInpPos(inp_stream);
 								            e2 = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								            if (e2 == NULL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								              return TrailSpaceError(p, l);
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								            } else {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								              e2->TokNext = NULL;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								            }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            t->TokNext = e2;
 								            t = e2;
 								            p = e2;
 								            if (cherr == '=')
 								              och = '+';
 								            else
 								              och = '-';
 								          }
 								          goto enter_symbol;
 								        case '+':
 								        case '-':
 								          /* handle error while parsing a float */
 								          {
 								            TokEntry *e2;
 								            t->Tok = Name_tok;
 								            if (ch == '(')
 								              solo_flag = FALSE;
 								            t->TokInfo = Unsigned(AtomE);
 								            t->TokPos = GetCurInpPos(inp_stream);
 								            e2 = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								            if (e2 == NULL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								              return TrailSpaceError(p, l);
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								            } else {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								              e2->TokNext = NULL;
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								            }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            t->TokNext = e2;
 								            t = e2;
 								            p = e2;
 								          }
 								        default:
 								          och = cherr;
 								          goto enter_symbol;
 								        }
 								      } else {
 								        t->Tok = Ord(kind = Number_tok);
 								      }
 								    } break;
 								    case QT:
 								    case DC:
 								    quoted_string:
 								      TokImage = (char *)((AtomEntry *)(Yap_PreAllocCodeSpace()))->StrOfAE;
 								      charp = TokImage;
 								      quote = ch;
 								      len = 0;
 								      ch = getchrq(inp_stream);
 								      wcharp = NULL;
 								      while (TRUE) {
 								        if (charp + 1024 > (char *)AuxSp) {
 								          LOCAL_Error_TYPE = RESOURCE_ERROR_AUXILIARY_STACK;
 								          LOCAL_ErrorMessage =
 								              "Heap Overflow While Scanning: please increase code space (-h)";
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          break;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        }
 								        if (ch == 10 && trueGlobalPrologFlag(ISO_FLAG)) {
 								          /* in ISO a new line terminates a string */
 								          LOCAL_ErrorMessage = "layout character \n inside quotes";
 								          break;
 								        }
 								        if (ch == quote) {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								          ch = getchrq(inp_stream);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          if (ch != quote)
 								            break;
 								          add_ch_to_buff(ch);
 								          ch = getchrq(inp_stream);
 								        } else if (ch == '\\' &&
 								                   Yap_GetModuleEntry(CurrentModule)->flags & M_CHARESCAPE) {
 								          int scan_next = TRUE;
 								          if ((ch = read_quoted_char(&scan_next, inp_stream))) {
 								            add_ch_to_buff(ch);
 								          }
 								          if (scan_next) {
 								            ch = getchrq(inp_stream);
 								          }
 								        } else if (chtype(ch) == EF && ch <= MAX_ISO_LATIN1) {
 								          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								          mark_eof(inp_stream);
 								          t->Tok = Ord(kind = eot_tok);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          t->TokInfo = TermEof;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          break;
 								        } else {
 								          add_ch_to_buff(ch);
 								          ch = getchrq(inp_stream);
 								        }
 								        ++len;
 								        if (charp > (char *)AuxSp - 1024) {
 								          /* Not enough space to read in the string. */
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return AuxSpaceError(
 								              t, l, "not enough space to read in string or quoted atom");
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      }
 								      if (wcharp) {
 								        *wcharp = '\0';
 								      } else {
 								        *charp = '\0';
 								      }
 								      if (quote == '"' || quote == '`') {
 								        if (wcharp) {
 								          mp = AllocScannerMemory(sizeof(wchar_t) * (len + 1));
 								        } else {
 								          mp = AllocScannerMemory(len + 1);
 								        }
 								        if (mp == NULL) {
 								          LOCAL_ErrorMessage =
 								              "not enough heap space to read in string or quoted atom";
 								          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								          t->Tok = Ord(kind = eot_tok);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          t->TokInfo = TermOutOfHeapError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          return l;
 								        }
 								        if (wcharp) {
 								          wcscpy((wchar_t *)mp, (wchar_t *)TokImage);
 								        } else {
 								          strcpy(mp, TokImage);
 								        }
 								        t->TokInfo = Unsigned(mp);
 								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        if (quote == '"') {
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								          if (wcharp) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            t->Tok = Ord(kind = WString_tok);
 								          } else {
 								            t->Tok = Ord(kind = String_tok);
 								          }
 								        } else {
 								          if (wcharp) {
 								            t->Tok = Ord(kind = WBQString_tok);
 								          } else {
 								            t->Tok = Ord(kind = BQString_tok);
 								          }
 								        }
 								      } else {
 								        if (wcharp) {
 								          t->TokInfo = Unsigned(Yap_LookupWideAtom((wchar_t *)TokImage));
 								        } else {
 								          t->TokInfo = Unsigned(Yap_LookupAtom(TokImage));
 								        }
 								        if (!(t->TokInfo)) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return CodeSpaceError(t, p, l);
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        t->Tok = Ord(kind = Name_tok);
 								        if (ch == '(')
 								          solo_flag = false;
 								      }
 								      break;
 								    case BS:
 								      if (ch == '\0') {
 								        t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								        if (chtype(pch) == EF) {
 								          mark_eof(inp_stream);
 								          t->TokInfo = TermEof;
 								        } else {
 								          t->TokInfo = TermNewLine;
 								        }
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								        t->TokInfo = TermEof;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        return l;
 								      } else
 								        ch = getchr(inp_stream);
 								      break;
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								    case SY:
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								      if (ch == '.' && (pch = Yap_peek(inp_stream - GLOBAL_Stream)) &&
 								          (chtype(pch) == BS || chtype(pch) == EF || pch == '%')) {
 								        t->Tok = Ord(kind = eot_tok);
-												fix regression: checking for EOT should first peek, and if happy consume the blank after the dot.

											
										
										
											2016-02-18 22:10:40 +00:00
+								        // consume...
-												make modules remember which operators  they ever defined.
Make modules import operators from other modules

											
										
										
											2016-04-28 15:01:16 +01:00
+								        if (pch == '%') {
 									  t->TokInfo = TermNewLine;
-												fix .%

											
										
										
											2016-04-26 15:26:35 +01:00
+								          return l;
-												make modules remember which operators  they ever defined.
Make modules import operators from other modules

											
										
										
											2016-04-28 15:01:16 +01:00
+									}
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								        ch = getchr(inp_stream);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								        if (chtype(ch) == EF) {
 								          mark_eof(inp_stream);
 								          t->TokInfo = TermEof;
 								        } else {
 								          t->TokInfo = TermNewLine;
 								        }
 								        return l;
 								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      if (ch == '`')
 								        goto quoted_string;
-												small

- do not call goal expansion on meta-calls (that is done by undef).
- docs updates
- fix init code

											
										
										
											2015-12-15 09:28:43 +00:00
+								      och = ch;
 								      ch = getchr(inp_stream);
 								      if (och == '.') {
 								        if (chtype(ch) == BS || chtype(ch) == EF || ch == '%') {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          t->Tok = Ord(kind = eot_tok);
-												make modules remember which operators  they ever defined.
Make modules import operators from other modules

											
										
										
											2016-04-28 15:01:16 +01:00
+								          if (ch == '%') {
 								            t->TokInfo = TermNewLine;
-												fix .%

											
										
										
											2016-04-26 15:26:35 +01:00
+								            return l;
-												make modules remember which operators  they ever defined.
Make modules import operators from other modules

											
										
										
											2016-04-28 15:01:16 +01:00
+									  }
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          if (chtype(ch) == EF) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            mark_eof(inp_stream);
-												make modules remember which operators  they ever defined.
Make modules import operators from other modules

											
										
										
											2016-04-28 15:01:16 +01:00
+									    t->TokInfo = TermEof;
 									  } else {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								            t->TokInfo = TermNewLine;
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          return l;
 								        }
 								      }
 								      if (och == '/' && ch == '*') {
 								        if (store_comments) {
 								          CHECK_SPACE();
 								          open_comment('/', inp_stream PASS_REGS);
 								          while ((och != '*' || ch != '/') && chtype(ch) != EF) {
 								            och = ch;
 								            CHECK_SPACE();
 								            extend_comment(ch PASS_REGS);
 								            ch = getchr(inp_stream);
 								          }
 								          if (chtype(ch) != EF) {
 								            CHECK_SPACE();
 								            extend_comment(ch PASS_REGS);
 								          }
 								          close_comment(PASS_REGS1);
 								        } else {
 								          while ((och != '*' || ch != '/') && chtype(ch) != EF) {
 								            och = ch;
 								            ch = getchr(inp_stream);
 								          }
 								        }
 								        if (chtype(ch) == EF) {
 								          t->Tok = Ord(kind = eot_tok);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          t->TokInfo = TermEof;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          break;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        } else {
 								          /* leave comments */
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          ch = getchr(inp_stream);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          if (t == l) {
 								            /* we found a comment before reading characters */
 								            while (chtype(ch) == BS) {
 								              ch = getchr(inp_stream);
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
+								            }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            CHECK_SPACE();
 								            *tposp = Yap_StreamPosition(inp_stream - GLOBAL_Stream);
 								            Yap_setCurrentSourceLocation(inp_stream);
 								          }
 								        }
 								        goto restart;
 								      }
 								    enter_symbol:
 								      if (och == '.' && (chtype(ch) == BS || chtype(ch) == EF || ch == '%')) {
 								        t->Tok = Ord(kind = eot_tok);
-												make modules remember which operators  they ever defined.
Make modules import operators from other modules

											
										
										
											2016-04-28 15:01:16 +01:00
+									if (ch == '%') {
 								          t->TokInfo = TermNewLine;
 								          return l;
 								        }
 									if (chtype(ch) == EF) {
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          mark_eof(inp_stream);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          t->TokInfo = TermEof;
 								        } else {
 								          t->TokInfo = TermNl;
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        return l;
 								      } else {
 								        Atom ae;
 								        TokImage = (char *)((AtomEntry *)(Yap_PreAllocCodeSpace()))->StrOfAE;
 								        charp = TokImage;
 								        wcharp = NULL;
 								        add_ch_to_buff(och);
 								        for (; chtype(ch) == SY; ch = getchr(inp_stream)) {
 								          if (charp == (char *)AuxSp - 1024) {
 								            goto huge_var_error;
 								          }
 								          add_ch_to_buff(ch);
 								        }
 								        add_ch_to_buff('\0');
 								        if (wcharp) {
 								          ae = Yap_LookupWideAtom((wchar_t *)TokImage);
 								        } else {
 								          ae = Yap_LookupAtom(TokImage);
 								        }
 								        if (ae == NIL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return CodeSpaceError(t, p, l);
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        t->TokInfo = Unsigned(ae);
 								        if (t->TokInfo == (CELL)NIL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return CodeSpaceError(t, p, l);
 								        }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        t->Tok = Ord(kind = Name_tok);
 								        if (ch == '(')
-												small

- do not call goal expansion on meta-calls (that is done by undef).
- docs updates
- fix init code

											
										
										
											2015-12-15 09:28:43 +00:00
+								          solo_flag = false;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        else
-												small

- do not call goal expansion on meta-calls (that is done by undef).
- docs updates
- fix init code

											
										
										
											2015-12-15 09:28:43 +00:00
+								          solo_flag = true;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      }
 								      break;
-												indent + EOF work in progress

											
										
										
											2015-07-06 11:49:56 +01:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    case SL: {
 								      char chs[2];
 								      chs[0] = ch;
 								      chs[1] = '\0';
 								      ch = getchr(inp_stream);
 								      t->TokInfo = Unsigned(Yap_LookupAtom(chs));
 								      t->Tok = Ord(kind = Name_tok);
 								      if (ch == '(')
 								        solo_flag = FALSE;
 								    } break;
 								    case BK:
 								      och = ch;
 								      ch = getchr(inp_stream);
 								      t->TokInfo = och;
 								      if (och == '(') {
 								        while (chtype(ch) == BS) {
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          ch = getchr(inp_stream);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        };
 								        if (ch == ')') {
 								          t->TokInfo = Unsigned(AtomEmptyBrackets);
 								          t->Tok = Ord(kind = Name_tok);
 								          ch = getchr(inp_stream);
 								          solo_flag = FALSE;
 								          break;
 								        } else if (!solo_flag) {
 								          t->TokInfo = 'l';
 								          solo_flag = TRUE;
 								        }
 								      } else if (och == '[') {
 								        while (chtype(ch) == BS) {
 								          ch = getchr(inp_stream);
 								        };
 								        if (ch == ']') {
 								          t->TokInfo = Unsigned(AtomNil);
 								          t->Tok = Ord(kind = Name_tok);
 								          ch = getchr(inp_stream);
 								          solo_flag = FALSE;
 								          break;
 								        }
 								      } else if (och == '{') {
 								        if (ch == '|') {
 								          qq_t *qq = (qq_t *)calloc(sizeof(qq_t), 1);
 								          if (!qq) {
 								            LOCAL_ErrorMessage = "not enough heap space to read in quasi quote";
 								            Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								            t->Tok = Ord(kind = eot_tok);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								            t->TokInfo = TermOutOfHeapError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            return l;
 								          }
 								          if (cur_qq) {
 								            LOCAL_ErrorMessage = "quasi quote in quasi quote";
 								            Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								            t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								            t->TokInfo = TermOutOfHeapError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            free(qq);
 								            return l;
 								          } else {
 								            cur_qq = qq;
 								          }
 								          t->TokInfo = (CELL)qq;
 								          if (inp_stream->status & Seekable_Stream_f) {
 								            qq->start.byteno = fseek(inp_stream->file, 0, 0);
 								          } else {
 								            qq->start.byteno = inp_stream->charcount - 1;
 								          }
 								          qq->start.lineno = inp_stream->linecount;
 								          qq->start.linepos = inp_stream->linepos - 1;
 								          qq->start.charno = inp_stream->charcount - 1;
 								          t->Tok = Ord(kind = QuasiQuotes_tok);
 								          ch = getchr(inp_stream);
 								          solo_flag = FALSE;
 								          break;
 								        }
 								        while (chtype(ch) == BS) {
 								          ch = getchr(inp_stream);
 								        };
 								        if (ch == '}') {
 								          t->TokInfo = Unsigned(AtomBraces);
 								          t->Tok = Ord(kind = Name_tok);
 								          ch = getchr(inp_stream);
 								          solo_flag = FALSE;
 								          break;
 								        }
 								      } else if (och == '|' && ch == '|') {
 								        qq_t *qq = cur_qq;
 								        if (!qq) {
 								          LOCAL_ErrorMessage = "quasi quoted's || without {|";
 								          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								          free(cur_qq);
 								          cur_qq = NULL;
 								          t->Tok = Ord(kind = eot_tok);
-												cleanups on error handling and fix EOF.

											
										
										
											2016-01-31 10:19:08 +00:00
+								          t->TokInfo = TermError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          return l;
 								        }
 								        cur_qq = NULL;
 								        t->TokInfo = (CELL)qq;
 								        if (inp_stream->status & Seekable_Stream_f) {
 								          qq->mid.byteno = fseek(inp_stream->file, 0, 0);
 								        } else {
 								          qq->mid.byteno = inp_stream->charcount - 1;
 								        }
 								        qq->mid.lineno = inp_stream->linecount;
 								        qq->mid.linepos = inp_stream->linepos - 1;
 								        qq->mid.charno = inp_stream->charcount - 1;
 								        t->Tok = Ord(kind = QuasiQuotes_tok);
 								        ch = getchr(inp_stream);
 								        TokImage = Yap_PreAllocCodeSpace();
 								        if (!TokImage) {
 								          LOCAL_ErrorMessage =
 								              "not enough heap space to read in a quasi quoted atom";
 								          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								          t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          t->TokInfo = TermError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          return l;
 								        }
 								        charp = TokImage;
 								        quote = ch;
 								        len = 0;
 								        ch = getchrq(inp_stream);
 								        wcharp = NULL;
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        while (TRUE) {
 								          if (ch == '|') {
 								            ch = getchrq(inp_stream);
 								            if (ch != '}') {
 								            } else {
 								              charp = (char *)put_utf8((unsigned char *)charp, och);
 								              charp = (char *)put_utf8((unsigned char *)charp, ch);
 								              /* we're done */
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								              break;
 								            }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          } else if (chtype(ch) == EF) {
 								            Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								            mark_eof(inp_stream);
 								            t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								            t->TokInfo = TermOutOfHeapError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								            break;
 								          } else {
 								            charp = (char *)put_utf8((unsigned char *)charp, ch);
 								            ch = getchrq(inp_stream);
 								          }
 								          if (charp > (char *)AuxSp - 1024) {
 								            /* Not enough space to read in the string. */
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								            return AuxSpaceError(
 								                t, l, "not enough space to read in string or quoted atom");
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          }
 								        }
 								        len = charp - TokImage;
 								        mp = malloc(len + 1);
 								        if (mp == NULL) {
 								          LOCAL_ErrorMessage = "not enough heap space to read in quasi quote";
 								          Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
-												blanks and backslashed strings

											
										
										
											2015-02-10 00:03:02 +00:00
+								          t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          t->TokInfo = TermOutOfHeapError;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								          return l;
 								        }
 								        strncpy(mp, TokImage, len + 1);
 								        qq->text = (unsigned char *)mp;
 								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        if (inp_stream->status & Seekable_Stream_f) {
 								          qq->end.byteno = fseek(inp_stream->file, 0, 0);
 								        } else {
 								          qq->end.byteno = inp_stream->charcount - 1;
 								        }
 								        qq->end.lineno = inp_stream->linecount;
 								        qq->end.linepos = inp_stream->linepos - 1;
 								        qq->end.charno = inp_stream->charcount - 1;
 								        if (!(t->TokInfo)) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								          return CodeSpaceError(t, p, l);
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								        }
 								        Yap_ReleasePreAllocCodeSpace((CODEADDR)TokImage);
 								        solo_flag = FALSE;
 								        ch = getchr(inp_stream);
 								        break;
 								      }
 								      t->Tok = Ord(kind = Ponctuation_tok);
 								      break;
 								    case EF:
 								      mark_eof(inp_stream);
 								      t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								      t->TokInfo = TermEof;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      return l;
-												new scanner.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@736 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2003-01-08 16:45:35 +00:00
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								    default: {
 								      char err[1024];
 								      snprintf(err, 1023, "\n++++ token: unrecognised char %c (%d), type %c\n",
 								               ch, ch, chtype(ch));
 								    }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      t->Tok = Ord(kind = eot_tok);
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								      t->TokInfo = TermEof;
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    }
-												Windows Threaded Patches

											
										
										
											2013-11-13 17:04:34 +00:00
+								#if DEBUG
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (GLOBAL_Option[2])
-												Fixes for processing write to buffer: should b more robust, but we still may have memory leaks

											
										
										
											2016-04-22 18:19:58 +01:00
+								      fprintf(stderr, "[Token %d %s]", Ord(kind),
 								              Yap_tokRep(t, inp_stream->encoding));
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								#endif
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								    if (LOCAL_ErrorMessage) {
 								      /* insert an error token to inform the system of what happened */
 								      TokEntry *e = (TokEntry *)AllocScannerMemory(sizeof(TokEntry));
 								      if (e == NULL) {
-												don't consume char after '.', do peek instead.

early errors should leave early.

											
										
										
											2016-02-14 04:11:55 +00:00
+								        return TrailSpaceError(p, l);
 								      }
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								      p->TokNext = e;
 								      e->Tok = Error_tok;
 								      e->TokInfo = MkAtomTerm(Yap_LookupAtom(LOCAL_ErrorMessage));
 								      e->TokPos = GetCurInpPos(inp_stream);
 								      e->TokNext = NULL;
 								      LOCAL_ErrorMessage = NULL;
 								      p = e;
 								    }
 								  } while (kind != eot_tok);
-												This commit was generated by cvs2svn to compensate for changes in r4,
which included commits to RCS files with non-trunk default branches.


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@5 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2001-04-09 20:54:03 +01:00
+								  return (l);
 								}
-												improved support for threads and code area allocation using malloc


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@965 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-02-05 16:57:02 +00:00
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								void Yap_clean_tokenizer(TokEntry *tokstart, VarEntry *vartable,
-												cmake & text support

											
										
										
											2015-06-19 01:30:13 +01:00
+								                         VarEntry *anonvartable) {
-												cache abstra machine register in threaded mode.

											
										
										
											2011-03-07 16:02:55 +00:00
+								  CACHE_REGS
-												fix get_num init issues

											
										
										
											2015-10-18 11:40:12 +01:00
+								  struct scanner_extra_alloc *ptr = LOCAL_ScannerExtraBlocks;
 								  while (ptr) {
 								    struct scanner_extra_alloc *next = ptr->next;
 								    free(ptr);
 								    ptr = next;
 								  }
 								  TR = (tr_fr_ptr)tokstart;
 								  LOCAL_Comments = TermNil;
 								  LOCAL_CommentsNextChar = LOCAL_CommentsTail = NULL;
 								  if (LOCAL_CommentsBuff) {
 								    free(LOCAL_CommentsBuff);
 								    LOCAL_CommentsBuff = NULL;
 								  }
 								  LOCAL_CommentsBuffLim = 0;
-												improved support for threads and code area allocation using malloc


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@965 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2004-02-05 16:57:02 +00:00
+								}
-												fix memory leak in scanner
instrument memory allocation with default_malloc


git-svn-id: https://yap.svn.sf.net/svnroot/yap/trunk@1445 b08c6af1-5177-4d33-ba66-4b1c6b8b522a

											
										
										
											2005-11-08 13:51:15 +00:00
-												improve docs

											
										
										
											2014-09-15 09:13:50 +01:00
+								/// @}