This is the HISTORY file for the Yale SML/NJ CVS repository.

An entry should be made for _every_ commit to the repository.
The entries in this file will be used when creating the README
for new versions, so keep that in mind when writing the description.

The form of an entry should be:

Name:
Date: yyyy/mm/dd
Tag:
Description:

----------------------------------------------------------------------
Name: John Reppy
Date: 2005/07/20
Tag:
Description:

Added changes from Dominic Evans (oldmanuk (at) gmail (dot) com) to
support HPUX 11.

----------------------------------------------------------------------
Name: John Reppy
Date: 2005/07/06
Tag:
Description:

Changes to the SML/NJ library. See smlnj-lib/CHANGES for details.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/07/06 00:45:00 CDT
Tag: blume-20050706-slice-copy
Description:

Fixed reversed logic for deciding whether to "copy up" or "copy down"
in *-array-slice.sml.

----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2005/05/31 17:00:00 EST
Tag: leunga-20050531-cygwin-fault-2
Description:

A typo in the cygwin code fixed.

----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2005/05/31 16:47:00 EST
Tag: leunga-20050531-cygwin-fault
Description:

Updated Cygwin's fault/signal handling to match the Windows version.
Updated the export list.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/18 13:35:00 CDT
Tag: Release_110_54
Description:

New working version (110.54).  NEW BOOTFILES!
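
Note on the blume-20050706-slice-copy entry above: the direction of an
overlapping copy matters because copying in the wrong direction
overwrites source elements before they are read. The following is a
minimal sketch of that idea only. copyWithin is a hypothetical helper
written for this note, not the code from *-array-slice.sml, and the
entry does not say which direction the source calls "copy up".

```sml
(* Hypothetical helper (assumption, not the library code): copy len
 * elements inside one array from index src to index dst. *)
fun copyWithin (a : int array, src : int, dst : int, len : int) : unit =
  if dst <= src
  then
    (* Destination starts at or below the source: walk the indices
     * upward so every element is read before it can be overwritten. *)
    let fun up i =
          if i < len
          then (Array.update (a, dst + i, Array.sub (a, src + i)); up (i + 1))
          else ()
    in up 0 end
  else
    (* Destination starts above the source: walk the indices downward. *)
    let fun down i =
          if i >= 0
          then (Array.update (a, dst + i, Array.sub (a, src + i)); down (i - 1))
          else ()
    in down (len - 1) end

val a = Array.fromList [1, 2, 3, 4, 5]
val () = copyWithin (a, 0, 1, 4)   (* overlapping ranges 0..3 and 1..4 *)
val result = Array.foldr (op ::) [] a
(* result = [1, 1, 2, 3, 4] *)
```

With the two branches swapped, the same call would smear the first
element across the whole destination range.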
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/18 11:58:00 CDT
Tag: blume-20050518-installer
Description:

Added support scripts for Mac OS X PackageMaker and modified
config/install.sh so that it supports re-dumping a heap image
after customization.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/18 10:55:00 CDT
Tag: blume-20050518-realdiv-noovld
Description:

Un-overloaded / to work around bug in overloading resolution code.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/16 23:50:00 CDT
Tag: blume-20050516-redump-heap
Description:

Added mechanism for re-creating a heap file for the interactive system
after configuration variables have been changed.

   CM.redump_heap : string -> unit

This is much like SMLofNJ.exportML, but starting from the resulting
heap does not return to the caller of CM.redump_heap but restarts
the interactive system from scratch.
The original call of CM.redump_heap does not return but ends the
interactive session.  Thus, CM.redump_heap is a lot like SMLofNJ.exportFn.

Internally, redump_heap winds the dynamic execution context back to the
point where the original heap image was created and re-executes the
heap image generation code in the boot code.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/09 21:55:00 CDT
Tag: blume-20050509-word64
Description:

Added a hack to the existing hack known as Word64 to make fromString
behave correctly.  I am still not sure whether Word64.scan will work
as specified with respect to the interaction of radix and prefix.
----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2005/05/04 11:50:00 EST
Tag: leunga-20050504-checkgc
Description:

Added a gc protocol checking phase.  This phase is enabled with
the flag "check-gc".  "debug-check-gc" turns on the verbose mode.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/04 10:45:00 CDT
Tag: blume-20050504-intinf
Description:

Fixed a bug in the implementation of div and mod for IntInf.
Thanks to Neophytos Michael for reporting the problem.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/05/04 10:35:00 CDT
Tag: blume-20050504-join
Description:

Added a "join" combinator to the ParserComb module in smlnj-lib.cm.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/02/28 23:40:00 CST
Tag: blume-20050228-mVar
Description:

Fixed serious bug (brown paper bag variety) in new implementation
of structure Atom in CML.  (I had accidentally used a mailbox instead
of an mvar, leaving the door open for races.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/02/25 15:00:00 CST
Tag: Release_110_53
Description:

New working version (110.53).  NEW BOOTFILES!

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/02/25 14:50:00 CST
Tag: blume-20050225-susp
Description:

Brought back SMLofNJ.Susp.  The underlying suspension type
is the one implemented in Core -- which means that it is the same
as the one used by the lazy extension.
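
A small usage sketch for the SMLofNJ.Susp entry above. It assumes the
structure exposes delay : (unit -> 'a) -> 'a susp and
force : 'a susp -> 'a; the entry itself does not list the signature,
so treat those names as an assumption to check against your
installation. The counter is a hypothetical probe added here to
observe how often the delayed computation runs.

```sml
(* Assumed interface: SMLofNJ.Susp.delay and SMLofNJ.Susp.force. *)
val runs = ref 0                       (* hypothetical probe *)

val answer : int SMLofNJ.Susp.susp =
  SMLofNJ.Susp.delay (fn () => (runs := !runs + 1; 6 * 7))

(* Nothing has been computed yet; forcing triggers the computation. *)
val first  = SMLofNJ.Susp.force answer
val second = SMLofNJ.Susp.force answer

(* Both forces yield the same value.  If the suspension memoizes its
 * result, as suspensions used for lazy evaluation normally do, !runs
 * stays at 1 after the second force. *)
```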
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/02/24 16:50:00 CST
Tag: blume-20050224-cml-atom
Description:

Simpler and at the same time more general implementation of
structure Atom in CML.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/02/15 17:35:00 CST
Tag: blume-20050215-tools
Description:

Created new "tools" directory under "src" and moved "TraceDebugProf"
there.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/02/10 17:55:00 CST
Tag: blume-20050210-longlong
Description:

Implemented "long long" arguments and results for NLFFI.
(Only the PPC/MacOS implementation is complete, the other backends
still need to be updated.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/01/24 17:40:00 CST
Tag: blume-20050124-mlyacc
Description:

Minor cleanup in ML-Yacc rule printing mechanism.  This should fix a
problem with certain "as" patterns which previously got rendered using
incorrect syntax.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/01/18 12:00:00 CST
Tag: blume-20050118-profile
Description:

Made time profiling code (interrupt handler) in runtime system aware
of new array representation.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/01/14 18:00:00 CST
Tag: blume-20050114-heap2exec
Description:

Implemented new (but still experimental) heap2exec facility.
This is tested under Mac OS X and should work under Linux (will
test shortly).  It will probably also work on the Sparc (will test
some time later).
- removed old "HACKED_STANDALONE" hack from runtime

To be able to test this, uncomment the request for "heap2asm" in
config/targets prior to installation.  (Notice that this is different
from "heap2exec" mentioned below.  Not a typo.)

To perform an actual test, run the command

   $ bin/heap2exec heapfile execfile

(You can put heap2exec on your shell's path.)

For example, run

   $ bin/heap2exec bin/.heap/ml-yacc.ppc-darwin mly

This will create a standalone executable called "mly" which you
can then invoke directly as a command.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2005/01/07 11:44:00 CST
Tag: blume-20050107-mlstring
Description:

fixed off-by-one error in ML_STRING macro (globals.c)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/23 18:00:00 CST
Tag: blume-20041223-santa
Description:

Made ml-build script "smarter" (but only very little).

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/21 15:05:00 CST
Tag: blume-20041221-longlong
Description:

* Implemented access to signed and unsigned long long data in NLFFI.
  (The parameter-passing part of the picture is not complete.  But
  data structure access seems to work.)

* Fixed CM's incorrect assumption that the PPC is little-endian.
  (On the Mac, it is big-endian.  And that's currently our only PPC
  platform.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/21 12:50:00 CST
Tag: blume-20041221-memory
Description:

Some cleanup in the $c/memory.cm library: separated some concerns by
moving allocation code and memory access code each into their own files.
----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2004/12/17 16:12:00 EST
Tag: leunga-20041217-cygwin-smlnj-home
Description:

The Unix I/O library of SML/NJ on cygwin does not understand Windows style
pathname, so problems arise when SMLNJ_HOME is set to a Windows style
pathname.  _run-sml now converts SMLNJ_HOME to a POSIX pathname on cygwin.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/16 13:35:00 CST
Tag: Release_110_52
Description:

Last-minute changes incorporated into 110.52.  Release tag moved.
The changes:

  - HashString.hashString' -> HashString.hashSubstring
  - bug fix in UnivariateStats

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/15 23:40:00 CST
Tag: blume-20041215-hashSubstring
Description:

  - HashString.hashString' -> HashString.hashSubstring
  - corresponding changes in atom.sml
  - "de-compressed" (aka. un-obfuscated) code for UnivariateStats
    and added some comments

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/15 15:30:00 CST
Tag: (Release_110_52)
Description:

New working version (110.52).  NEW BOOTFILES!

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/15 12:45:00 CST
Tag: blume-20041215-spaces
Description:

More on the space problem (this time for Win32).

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/14 17:30:00 CST
Tag: blume-20041214-spaces
Description:

Hacked some of the scripts (in particular: the installer) to cope
with spaces in filenames a bit better.
But beware: the current "solution" is likely still full of bugs and
inherently incomplete.  (We need to do away with those shell scripts
for a comprehensive solution.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/13 14:45:00 CST
Tag: blume-20041213-ml-makedepend
Description:

Fixed bug in code for ml-makedepend.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/09 16:30:00 CST
Tag: blume-20041209-statistics
Description:

Added two simple but potentially useful statistics modules to
SML/NJ Library.  (See CHANGES file there.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/12/01 16:50:00 CST
Tag: blume-20041201-atom
Description:

smlnj-lib:
Added function HashString.hashString' for substrings.
Hand-inlined CharVector.foldl into HashString (for speed).
Modified implementation of structure Atom to avoid extracting
strings from substrings unless necessary.
(Also see CHANGES file for smlnj-lib.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/24 22:15:00 CST
Tag: blume-20041124-cml
Description:

Made sure CML compiles when Position = Int64.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/24 14:30:00 CST
Tag: blume-20041124-position
Description:

The compiler can now be compiled in a mode that makes structure
Position equal to Int64.  The default, however, is unchanged
(Position = Int31) for the time being.

To enable 64-bit positions, use the following procedure:

  1. Start sml
  2. Autoload $smlnj/cmb.cm (if not already autoloaded)
  3. Type
        #set (CMB.symval "USE_64_BIT_POSITIONS") (SOME 1);
  4. Run CMB.make() as usual.

This is barely tested.
The only test so far was a little SML program counting the number of
characters in an 8-gigabyte file by reading it character-by-character.
That test was successful.

In support of 64-bit positions, a number of new functions have been
added to the runtime system.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/23 14:45:00 CST
Tag: blume-20041123-useFile
Description:

Fixed a problem with unhelpful error messages related to problems
with .cm- or .sml files that appear as part of the sml command line.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/18 15:40:00 CST
Tag: Release_110_51
Description:

New working version (110.51).  NEW BOOTFILES!

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/18 15:35:00 CST
Tag:
Description:

Enabled dlopen and friends for FreeBSD (as recommended by Johannes 5
Joemann).

----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2004/11/17 16:05:21 EST 2004
Tag: leunga-20041117-mlrisc-live-kill
Description:

Added support for MLTree constructs LIVE and KILL to all the
architectures.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/13 00:20:00 CST
Tag: blume-20041113-versiontool
Description:

- Stripped down the versiontool: It now only handles the version number.
  The date string is generated at bootstrap time (during makeml).

- In a previous commit, fixed a minor issue with how polyequal is being
  translated.  In particular, the code now "looks through" abstractions.
  This results in slightly fewer polyEqual warnings and hopefully
  slightly more efficient code.
  Important examples for where this matters are the new int64 and
  word64 types.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/12 00:30:00 CST
Tag: blume-20041112-int64
Description:

Structure Int64 fully hooked in.  (The implementation is not very
efficient, though.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/11 17:30:00 CST
Tag: blume-20041111-more64
Description:

All the pieces of Word64 are now there, with the exception of the
conversions from and to LargeWord.  (Eventually these need to be
identities, but for the time being they don't even make sense because
LargeWord is 32-bit wide.)

Also started to add similar support for Int64, but major pieces of that
are still missing.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/11 00:15:00 CST
Tag: blume-20041111-word64
Description:

Structure Word64 is now (almost) complete, word literals and patterns
seem to work.  There are a few odd pieces missing.  In particular,
I didn't do the {from,to}LargeWord parts because LargeWord is still
Word32 at the moment.

Making Word64 official would mean that LargeWord becomes Word64.  But
this requires extreme care because most word-word conversions have to
go through LargeWord, so making a mistake means loss of efficiency or
worse.  Eventually there will be a solution similar to (but actually
simpler than) what I did with IntInf.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/10 18:12:00 CST
Tag: blume-20041110-64bit
Description:

More 64-bit hacking (but still not even half-way there yet).
Also, some assorted improvements to the handling of 8-bit words.
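
To illustrate the Word64 literals and patterns mentioned in the
blume-20041111-word64 entry above, here is a short sketch. It assumes
a system in which structure Word64 is available with the usual WORD
operations (+ and toString); the function name isBig is hypothetical
and exists only for this example.

```sml
(* A literal that needs more than 32 bits; the type annotation selects
 * Word64 for the overloaded word literal. *)
val big : Word64.word = 0wx100000000          (* 2^32 *)

val sum = Word64.+ (big, 0w1)

(* toString renders a word in hexadecimal, without a prefix. *)
val shown = Word64.toString sum

(* A word literal used as a pattern (hypothetical function). *)
fun isBig (0wx100000000 : Word64.word) = true
  | isBig _ = false
```

Here shown is expected to be "100000001", and isBig big is true while
isBig sum is false.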
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/11/09 17:50:00 CST
Tag:
Description:

Started adding some infrastructure for supporting 64-bit int- and
word-types.  (Still in its very early stages.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/28 10:45:00 CDT
Tag: Release_110_50
Description:

New working version (110.50).  NEW BOOTFILES!
=====================

Also:
  - Changed config/srcarchiveurl from a file just containing the URL
    string into a file containing shell script code.  The code has
    access to the $VERSION variable.
  - Made corresponding changes to config/install.sh and config/unpack.
  - Default contents of config/srcarchiveurl uses $VERSION and normally
    does not have to be edited to reflect a version change.
    (As a result, a version change can be done by just editing
    config/version, the rest is now automatic.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/27 17:50:00 CDT
Tag: blume-20041027-btrace-msg
Description:

BackTrace.monitor now also reports the source of the exception that
triggered the trace.
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/27 17:20:00 CDT
Tag: blume-20041027-x86-c-calls
Description:

This is the HISTORY entry for two earlier commits, both concerning
the x86 c-calls code in MLRISC:

  - added a missing LOAD in the code that deals with struct arguments
  - made sure the caller does not add the wrong number of bytes to
    the stack pointer after a call of a function returning a struct
    (the callee already pops the implicit argument which points to
    the space reserved for the result)

----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2004/10/24 14:00:00 EST
Tag: leunga-20041024-x86-gas-fucomip
Description:

John discovered a bug in the syntax of fucomip.  The opcodes FU?COMIP?
have been changed to

   fu?comip? %st(i), %st

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/20 15:06:00 CDT
Tag: blume-20041020-standalone-backtrace
Description:

Added a mechanism for getting back-trace information from standalone
programs.  Here is how it works:

  1. The part of the program from which you want to get backtrace
     information (usually the whole program) should be wrapped with
     BackTrace.monitor.  This is a (unit->'a)->'a function, and your
     main program could be modified from something like

        fun main (pgm, args) = ...

     to

        fun main (pgm, args) = BackTrace.monitor (fn () => ...)

  2. To be able to access BackTrace.monitor, you have to add library
     $smlnj-tdp/plugins.cm to the .cm file that contains your main
     function.

  3. Remove all compiled code (i.e., all the .cm/ subdirectories that
     CM might have created in the past for your project).

  4.
Build the system using this command line:

        ml-build -Ctdp.instrument=true \$smlnj-tdp/back-trace.cm \
                 myprog.cm MyProg.main myprog

     instead of the usual

        ml-build myprog.cm MyProg.main myprog

I changed a library name:

   $/trace-debug-profile.cm  -->  $smlnj-tdp/plugins.cm

New libraries:

   $smlnj-tdp/back-trace.cm  -- when loaded causes the back-trace
                                plugin to be installed
   $smlnj-tdp/coverage.cm    -- when loaded causes the coverage
                                plugin to be installed

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/18 16:45:00 CDT
Tag: blume-20041018-groupowner
Description:

Added an "obsolete" warning for the "group owner" syntax to CM's
parser.  Eliminated group owner specs from .cm files throughout the
source tree.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/15 15:45:00 CDT
Tag: blume-20041015-coverage
Description:

* Test coverage tool added!

* Further reorganization of tracing-, debugging-, and profiling
  support:

  - moved original BTImp -- now called BackTrace -- into a separate
    library called $/trace-debug-profile.cm
  - eliminated all mentions of BTrace from SMLofNJ.Internals
  - only the instrumentation mechanism is now left in the compiler
    proper
  - BackTrace module is a plugin which is NOT plugged in by default
  - Coverage module is another such plugin

To get the benefits of any of these plugin modules, the code in
question must be compiled with tdp instrumentation turned on.
This can be done by setting SMLofNJ.Internals.TDP.mode to true.
(The ref cell is also controlled via the -Ctdp.instrument=... switch.)

Plugins are selected at link time.  (Pre-compiled instrumented code
can be re-loaded with different plugins in effect.)  When an
instrumented module is linked, whatever plugins are at that time
enabled will come into effect for that module.
To enable the back-trace plugin, load library $/trace-debug-profile.cm
and invoke BackTrace.install() (e.g., from the interactive prompt).
To enable the coverage plugin, load the same library and invoke
Coverage.install().

Back-traces are generated automatically on uncaught exceptions and
when the code in question explicitly invokes BackTrace.trigger().
Coverage (and execution frequency-) information must be queried
explicitly by calling Coverage.not_covered and Coverage.hot_spots.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/14 17:40:00 CDT
Tag: blume-20041014-tdp-core
Description:

Snapshot of a significant overhaul of how the trace/debug/profile
support is hooked into the system (specifically: Core and
SMLofNJ.Internals).

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/13 16:34:00 CDT
Tag: blume-20041013-tdp
Description:

Some rationalization of names:

   structure BTrace -> structure TDPInstrument
   etc.

This is in preparation for using the original back-trace
instrumentation for other purposes.  "TDP" stands for
Trace/Debug/Profile.

The control flag controlling whether instrumentation is on or off
is now registered under a different name, so instead of running
sml as

   sml -Cinstrument.btrace-mode=true

one has to say

   sml -Ctdp.instrument=true

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/11 16:37:00 CDT
Tag: blume-20041011-regions
Description:

Made some minor modifications to elabcore.sml to have source regions
be propagated more tightly -- resulting in better (i.e., smaller)
regions being reported in error- and debug messages.
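
The plugin steps described in the blume-20041015-coverage entry above
can be put together as an interactive session along the following
lines. This is a sketch under assumptions: it uses the library name
from that entry ($/trace-debug-profile.cm, renamed to
$smlnj-tdp/plugins.cm in a later commit), it uses CM.autoload to load
the library, and myprog.cm is a hypothetical project file. The order
matters because plugins take effect when instrumented code is linked.

```sml
(* 1. Turn on tdp instrumentation for code compiled from here on
 *    (equivalent to starting sml with -Ctdp.instrument=true). *)
SMLofNJ.Internals.TDP.mode := true;

(* 2. Make the plugin modules available (library name as of the
 *    2004/10/15 entry). *)
CM.autoload "$/trace-debug-profile.cm";

(* 3. Enable the back-trace plugin before linking the program. *)
BackTrace.install ();

(* 4. Compile and link the (hypothetical) project; its modules are
 *    now instrumented and pick up the plugin enabled in step 3. *)
CM.make "myprog.cm";
```

Replacing step 3 with Coverage.install () selects the coverage plugin
instead; its results are then queried with Coverage.not_covered and
Coverage.hot_spots as the entry describes.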
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/08 22:50:00 CDT
Tag: blume-20041008-cmkw
Description:

Fixed handling of keywords in .cm files:  After seeing "is" the lexer
treats subsequent occurrences of "group", "library", "source", "is",
"*", and "-" as ordinary identifiers rather than keywords.

Most seriously, this fixes a problem with CM's "shell" tool.  The tool
is supposed to accept a tool argument called "source", but this did
not work because of the clash with the keyword.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/07 16:00:00 CDT
Tag: blume-20041007-cleanup
Description:

Assorted cleanup work:

  - got rid of intstrmap in favor of using the library's hash table
    implementation
  - threw out most of the pathnames stuff, as it was not used anyway
  - simplified tokentable implementation
  - fixed some minor spelling errors

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/06 15:15:15 CDT
Tag: blume-20041006-handler
Description:

Cleaned up the absyn to reflect the invariant that HANDLE always
carries a FNexp as part of the type definition.  This eliminates
some superfluous sanity checks at runtime down the road.

Some minor cleanup of the btrace code.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/10/01 10:20:30 CDT
Tag: blume-20041001-slave
Description:

Added hack to make slave mode work in the presence of the version tool.
(Still, since the master does two passes over the code for CMB.make,
the release number gets bumped twice when slaves are attached.  I don't
know if this is worth fixing...)
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/30 10:55:00 CDT
Tag: blume-20040930-version
Description:

* Moved the "version" magic into its own little library under
  src/system/smlnj/internal.  This avoids expensive reconstruction
  of a stable src/compiler/core.cm.

* At the same time, structure CompilerVersion is now known as
  structure SMLNJVersion.

* Arranged for the version tool to NOT kick in when rebuilding the
  system (makeml -rebuild, fixpt).  Otherwise one would never reach
  a fixpoint.  Also, loading the versiontool does not work when
  rebuilding the system because CM is not properly initialized at
  that time.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/29 14:00:00 CDT
Tag: blume-20040929-autoversion
Description:

Implemented some CM magic to have file
src/compiler/TopLevel/main/version.sml generated automagically.

The version is taken from two files: config/version and
config/release.  The first is expected to contain a two-part version
number such as 110.49.  The second should contain a single number,
but it may be missing.

If the environment variable VERSIONTOOL_BUMP_RELEASE is defined
at the time the version tool is loaded (which is the first time
you say CMB.make), then the tool will increment the value stored
in config/release every time CMB.make is invoked.

The binfile format is now insensitive to anything beyond the first
two components of a version number, so bumping the release does not
render binfiles incompatible.  Auto-bumping can be used to keep track
of versions during development without invalidating existing binfiles.

In any case, every CMB.make updates the date information in
version.sml.  (This is the date that is printed in the banner.)
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/28 10:53:00 CDT
Tag: blume-20040928-controls
Description:

Some cleanup of the controls code.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/27 22:08:00 CDT
Tag: blume-20040927-controls
Description:

Added two pieces of functionality to the Controls interface:

  1. val save'restore: 'a control -> unit -> unit
     grabs the current value of the control in stage 1 and restores
     it in stage 2.

  2. val set' : 'a control * 'a -> unit -> unit
     stores the given value into the control in stage 2 (i.e.,
     delayed) but does all error checking in stage 1.  (This is for
     string controls that need to parse their argument -- something
     that might fail.  In some cases, notably in CM, one already knows
     the intended argument but wants to delay the actual assignment
     until a time when error recovery would be more difficult.)

Changed the handling of controls in tool arguments to classes "sml"
and "lazysml":

  - use Controls.save'restore as a more robust way of restoring the
    old value (in particular: without having to re-parse the string)
  - use controls to handle the "overload" keyword in the init group
    (I believe this change actually fixes a long-standing obscure bug.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/27 17:00:00 CDT
Tag: blume-20040927-lazysml
Description:

Added a new tool class called "lazysml" to CM's tool chest.  The only
difference to "sml" is that compilation is done with Control.lazysml
set to true.  A source of class "lazysml" is automatically recognized
by a file name suffix of ".lml".

In addition to the above feature, the original class "sml" now also
supports a tool argument "lazy" which has the same effect.
As a result, the following three lines are equivalent:

   foo.sml : lazysml
   foo.sml : sml (lazy)
   foo.sml (lazy)

The setting goes into effect both during parsing and during
compilation.  The original setting is restored right after parsing
and after compilation, respectively.

In addition to all the above, there is also a general mechanism to
set ANY of the "controls" that are available at the command line via
"-C..." on a per-sml-file basis.  The same rules that apply for "lazy"
apply as well.  (In fact, "lazy" is implemented as a special case of
the general mechanism.)

The .cm file syntax uses a new keyword tool argument called "with".
There are several ways of indicating the desired settings:

   foo.sml (with:parser.quotations=true)
   foo.sml (with:(name:parser.quotations value:true))
   foo.sml (with:(name:name1 value:value1 name:name2 value:value2 ...))
   foo.sml (with:(name1=value1 name2=value2 ...))
   foo.sml (with:(name1=value1 name:name2 value:value2 name3=value3 ...))
   etc.

Another possible abbreviation is to leave out the =v or value:v part
if the name refers to a boolean control (in which case the value is
taken to be true).  Thus, one could get lazy sml also by saying:

   foo.sml (with:parser.lazy-keyword=true)
   foo.sml (with:parser.lazy-keyword)
   foo.sml (with:(name:parser.lazy-keyword value:true))
   foo.sml (with:(name:parser.lazy-keyword))

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/24 16:22:00 CDT
Tag: blume-20040924-ppc-long-branch
Description:

Turned message about "emiting long form of branch" off by default.
Added a control flag to turn it back on when desired.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/24 16:05:00 CDT
Tag: blume-20040924-rounding
Description:

Applied patch for setting rounding modes under Mac OS X.
Thanks to Melissa O'Neill for providing the code!
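
The two-stage style of Controls.save'restore and Controls.set' from the
blume-20040927-controls entry can be pictured with the following
sketch. It is a stand-in, not the Controls implementation: a control
is modelled as a plain ref cell, and setFromString is a hypothetical
variant of set' that takes a parser explicitly so that the "check in
stage 1, assign in stage 2" split is visible.

```sml
(* Stand-in model (assumption): a control is just a ref cell. *)
type 'a control = 'a ref

(* Stage 1 grabs the current value; stage 2 (the returned thunk)
 * writes it back, with no re-parsing involved. *)
fun save'restore (c : 'a control) : unit -> unit =
  let val saved = !c
  in fn () => c := saved end

(* Hypothetical variant of set': parsing (and any failure) happens in
 * stage 1; the assignment itself is delayed to stage 2. *)
fun setFromString (parse : string -> 'a option)
                  (c : 'a control, s : string) : unit -> unit =
  case parse s of
      NONE => raise Fail ("bad control value: " ^ s)
    | SOME v => (fn () => c := v)

val lazyKeyword : bool control = ref false

val restore = save'restore lazyKeyword                      (* stage 1 *)
val apply   = setFromString Bool.fromString (lazyKeyword, "true")
val ()      = apply ()                                      (* stage 2 *)
val during  = !lazyKeyword      (* true while the setting is active *)
val ()      = restore ()                                    (* stage 2 *)
val after   = !lazyKeyword      (* back to false *)
```

Because every fallible step runs before either thunk is called, a
caller can collect the thunks first and apply them later at a point
where raising an exception would be harder to recover from.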
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/23 17:30:00 CDT
Tag: blume-20040923-envvars
Description:

1. Changed definition of type ControlRegistry.registry_tree to
   include control_info (i.e., the name of the controlling
   environment variable).

2. Added command-line flags -e and -E to print the names of
   environment variables that can be used to control internal
   settings.  (This uses the new API mentioned in 1.)

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/13 16:50:00 CDT
Tag: Release_110_49
Description:

New working version (110.49).  NEW BOOTFILES!

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2004/09/13 16:20:00 CDT
Tag: blume-20040913-config-mlrisc
Description:

Put target "mlrisc" back into the default list.  (There is no harm
in having it, and some users have expressed their wish to have
"mlrisc" included by default.)

----------------------------------------------------------------------
Name: John Reppy
Date: 2004/09/13
Tag: jhr-20040913-signals
Description:

Fixed the signal masking code to properly nest mask/unmask operations
on a per-signal basis.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2004/09/08 13:20:00 CDT
Tag: blume-20040908-heap-magic
Description:

Bumped the heap magic number to 0x09082004 to account for the changed
layout of the ML frame under MacOS X.

----------------------------------------------------------------------
Name: Allen Leung (leunga (at) reservoir (dot) com)
Date: 2004/09/03 11:26:00 EST
Tag: leunga-20040903-cygwin-install
Description:

Added a patch to _arch-n-opsys to enable the Cygwin runtime.  The
Cygwin runtime is turned on by setting the environment variable
SMLNJ_CYGWIN_RUNTIME to 1.
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/31 17:15:00 CDT Tag: blume-20040831-core Description: Added some exports to src/compiler/core.cm upon request by J. Joemann. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/30 17:55:00 CDT Tag: blume-20040830-installer Description: Upon request by Johannes Joemann: - improved ML code of installer to fall back to copying when renaming fails (i.e., when source and target are on different file systems); the code compiles but has yet to be tested in anger - removed mlrisc from list of default targets (config/targets) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/27 17:20:00 CDT Tag: blume-20040827-ptreql Description: Added ptreql primop to structure InlineT (upon request from Larry Paulson). ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/08/15 21:21:00 EST Tag: leunga-110_48-udgraph Description: Another bug fix from Carl Hauser: diff /net/niflab/smlnj48/src/MLRISC/graphs/udgraph.sml udgraph.sml 48c48 < | rmv((e as (k,_))::es,L) = rmv(es,if k = i then es else e::L) --- > | rmv((e as (k,_))::es,L) = rmv(es,if k = i then L else e::L) Without this, any deletion of an edge in an undirected graph does severe violence to the graph. ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/08/10 23:35:00 EST Tag: leunga-110_48-ppc Description: The IBM/MacOS syntax switch on PPC was incorrectly swapped.
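Note on leunga-110_48-udgraph: the patch above can be read as a fix to an accumulator-style list filter. The following is a minimal, self-contained sketch of the corrected pattern only. The names removeKey and acc are illustrative assumptions and do not come from udgraph.sml; only the shape of rmv follows the diff.

```sml
(* Remove every edge whose key equals i from an adjacency list.
   Hypothetical helper: only the inner rmv mirrors the patched code. *)
fun removeKey (i : int) (edges : (int * 'a) list) : (int * 'a) list =
  let
    (* acc collects the edges that are kept (in reverse order). *)
    fun rmv ([], acc) = acc
      | rmv ((e as (k, _)) :: es, acc) =
          (* Corrected: on a match, keep the accumulator unchanged.
             The buggy version returned es here, which discarded
             everything accumulated so far. *)
          rmv (es, if k = i then acc else e :: acc)
  in
    rmv (edges, [])
  end

val kept = removeKey 2 [(1, "a"), (2, "b"), (3, "c")]
```

As in the original code, the surviving edges come back in reverse order, which is harmless for an unordered adjacency list.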
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/10 12:00:00 CDT Tag: Release_110_48 Description: New working version (110.48). NEW BOOTFILES! ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/08/09 12:21:00 EST Tag: leunga-110_47-dijsktra Description: Bug fix from Carl Hauser: single_source_shortest_paths in dijkstra.sml was observed to get wrong answers (by comparing to single_source_shortest_paths in bellman-ford.sml). The problem is that following the expression A.update(dist,s,Num.zero) it is necessary to update the priority queue using Q.decreaseWeight(Q,s). ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/06 18:10:00 CDT Tag: blume-20040806-cmdline Description: Fiddled with handling of command-line options: * sml now quits after processing the command line if -H, -S, -h, or -s appears as the last command-line argument * a new option -q terminates the session when encountered on the command line; subsequent arguments will be ignored * bug fixes: short (erroneous) arguments are no longer ignored completely ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/08/04 18:17:00 EST Tag: leunga-110_47-ppc-ibm-asm Description: - Added minimal IBM assembly syntax support for PowerPC. - Cygwin: manually changed the file cygwin.def. Some exported symbols have been altered in the runtime. We need an automatic way to keep the file in sync. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/04 14:00:00 CDT Tag: Release_110_47 Description: New working version (110.47). NEW BOOTFILES! 
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/03 14:25:00 CDT Tag: blume-20040803-callingconv Description: Added low-level support for choosing C calling conventions by twiddling the type of rawccall. (See src/compiler/Semant/types/cproto.sml for details.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/02 15:55:00 CDT Tag: blume-20040802-backout Description: Backed out of change to win32-filesys.c. The earlier patch to get_file_time caused CM to produce files with the wrong time stamp. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/08/02 14:45:00 CDT Tag: blume-20040802-nlffi-win32 Description: Added NLFFI support for Win32, adapted from a patch provided by David Hansel. This is currently completely untested. Also, the issue concerning stdcall vs. ccall is still unresolved. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/07/30 17:55:00 CDT Tag: blume-20040730-various Description: Gearing up towards 110.47... - various minor bugfixes to ml-nlffigen - a beginning of a manual for nlffi - eliminated 'export name=value' in config/install.sh as this does not work with certain versions of /bin/sh (Thanks to David King at Motorola for catching this.) 
- several bugfixes provided or suggested by David Hansel at Reactive Systems: - added a test for tm==NULL to gmtime.c and localtime.c - applied patch for incorrect GetFileTime under win32 - toSeconds -> toMilliseconds in Win32/win32-process.sml ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/07/21 18:20:00 CDT Tag: blume-20040721-nlffigen Description: - Fixed minor issue in ml-nlffigen: Now generate structure T_foo for a typedef to an incomplete type, but leave out the "typ" member. (This is just for consistency.) - Started to produce what is supposed to become better (i.e., comprehensive) documentation of what ml-nlffigen does and produces. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/07/14 16:25:00 CDT Tag: blume-20040714-union Description: Added C_UNION to c-calls/c-types.sml and updated the machinery (ml-nlffigen, cproto.sml) that conveys C function interface information to the code generator. However, the actual architecture-specific implementation of function arguments and results that are C unions is still not implemented. ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/07/14 14:38:00 EST Tag: leunga-110_46_1-ppc-lwzu Description: Added these instructions to the PowerPC architecture: LBZU(X), LHZU(X), LWZU(X), STWU(X), STFDU, STFSU etc... Note: I haven't added their instruction encoding into the description. ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/07/13 15:04:00 EST Tag: leunga-110_46_1-ppc-lwarx Description: Added the two instructions LWARX and STWCX to the PowerPC instruction set. An (untested) rewrite of loop-structure.sml. The old version is completely broken.
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/07/13 13:50:00 CDT Tag: blume-20040713-nlffi Description: - use paramAlloc to report c-calls with too many arguments (for PPC version where parameter area is pre-allocated) - added ccall_maxargspace to machspec (to implement the above) - made "make" command in CM's "make" tool configurable - added option (default: on) for passing the name of SML/NJ's "bin" directory to "make"; the call looks like this: make SMLNJ_BINDIR= This can be used by the Makefile to, e.g., pick the "right" version of ml-nlffigen. - minor code tweaks ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/07/12 22:50:00 CDT Tag: blume-110_46_1-macosx-nlffi Description: NLFFI under Mac OS X now working (sort of). This is largely untested, though. Note: 1. You have to make a new, clean build of the runtime system. 2. There are new BOOTFILES, you have to use them! (Doing the bootstrap process yourself would be *very* painful! If you absolutely have to do it, build the system under a different architecture and then cross-compile.) Version bumped to 110.46.1 to account for runtime data format changes. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/06/18 14:30:00 CDT Tag: blume-20040618-unix Description: Changed the implementation of structure Unix so that the same stream is returned every time one of the {text,bin}{In,Out}streamOf functions is invoked on the same proc. This is not what the spec currently says -- although IMO it arguably should. (See discussion below.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/06/17 18:15:00 CDT Tag: Release_110_46 Description: New working version (110.46). NEW BOOTFILES!
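Note on blume-20040618-unix: a small sketch of how client code might use the Unix structure once the {text,bin}{In,Out}streamOf functions return the same stream on every call. It assumes a Unix-like system where /bin/cat exists (an assumption of this example, not something stated in the entry) and uses only the Basis functions Unix.execute, Unix.textInstreamOf, Unix.textOutstreamOf, and Unix.reap.

```sml
(* The explicit type annotation fixes the otherwise ungeneralizable
   type variables of the proc value. *)
val p : (TextIO.instream, TextIO.outstream) Unix.proc =
  Unix.execute ("/bin/cat", [])   (* assumes /bin/cat is available *)

(* Send one line to the child and close its stdin so that cat exits. *)
val out = Unix.textOutstreamOf p
val () = TextIO.output (out, "hello\n")
val () = TextIO.closeOut out

(* A later call to textInstreamOf on the same proc continues reading
   from the same stream state rather than from a fresh stream. *)
val line = TextIO.inputLine (Unix.textInstreamOf p)

(* reap waits for the child and closes whatever streams are still open. *)
val _ = Unix.reap p
```

Holding on to the stream through the proc value, as this entry describes, is what lets reap close it without the caller passing it back in.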
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/06/17 17:20:00 CDT Tag: blume-20040617-timer-unix Description: Changed the interface of structures Timer and Unix to match the most recent Basis spec. In the case of Unix there still seems to be an open/weird issue: The {text,bin}{In,Out}streamOf functions are supposed to create fresh streams whenever they are called -- as opposed to have them return the same stream every time. This design is supposed to prevent space leaks caused by proc values hanging on to streams. The reap function, on the other hand, is supposed to close the streams. This cannot be done without having a handle on the stream in proc after all... I took the liberty to implement the following stopgap solution: The proc value hangs on to the most recently created stream(s). Reap closes those. If either or both of the two streams hadn't been created at all yet, then reap will close the corresponding file descriptors directly. PS: I don't understand the original space leak argument anymore. If a proc hangs on to the imperative stream, then I/O operations on those will advance the state of the cached stream and avoid the space leak. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/28 16:45:00 CDT Tag: blume-20040528-basis Description: Added signature PACK_REAL and exported functor PrimIO. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/25 16:00:00 CDT Tag: blume-20040525-group-owner Description: CM now ignores (but still accepts) the "owner" information in group descriptions. The owner of a group is its next enclosing library. Each group must have a unique owner. (There is a virtual "toplevel" library that owns groups which are not nested within a real library.) 
Previously, each group had to explicitly declare its owner, and CM would check that such a declaration is correct. The new scheme is to have CM check that for each group there is precisely one owning library. The advantage of the new scheme is that the programmer no longer needs to maintain the somewhat annoying owner information. The downside is that CM cannot enforce the ownership rule across multiple runs of CM.make. Fortunately, enclosing the same group in two different libraries A and B which are not part of the same program does not cause real problems. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/20 16:00:00 CDT Tag: blume-20040520-win32 Description: Made the win32 version work again. (Strangely, a misplaced comma had slipped into win32-process.c which prevented the runtime from being compiled correctly.) Also, included a minor addition to ml-build.bat analogous to what was done in blume-20040519-ml-build. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/19 22:10:00 CDT Tag: blume-20040519-ml-build Description: Arranged for ml-build to clean up after itself a little bit better. The script generates a temporary SML source file and compiles it using CM, so CM generates metadata (GUID, SKEL, objectfile) for it. It now gets rid of those at the end, so they don't accumulate under .cm. This required a minor change to install.sh because the name of the metadata directory (default: .cm) is actually configurable at installation time. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/18 15:50:00 CDT Tag: blume-20040518-mkreader Description: Added Posix.IO.mk{Bin,Text}{Reader,Writer} by lifting their respective implementations from internal modules PosixBinPrimIO and PosixTextPrimIO. 
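Note on blume-20040518-mkreader: one possible use of the newly exported Posix.IO.mkTextReader is to wrap a raw file descriptor as a TextIO.instream. This is a sketch under the assumption that the record fields are fd, name, and initBlkMode as in the Basis specification; the helper name instreamOfFd is made up for this example.

```sml
(* Wrap a POSIX file descriptor as a buffered text input stream.
   instreamOfFd is a hypothetical helper, not part of the Basis. *)
fun instreamOfFd (fd : Posix.IO.file_desc, name : string) : TextIO.instream =
  let
    (* initBlkMode = true: the descriptor is assumed to be blocking. *)
    val rd = Posix.IO.mkTextReader {fd = fd, name = name, initBlkMode = true}
  in
    (* Start with an empty buffer; all data comes from the reader. *)
    TextIO.mkInstream (TextIO.StreamIO.mkInstream (rd, ""))
  end

(* Usage: read a line that was written into a pipe. *)
val {infd, outfd} = Posix.IO.pipe ()
val _ = Posix.IO.writeVec (outfd, Word8VectorSlice.full (Byte.stringToBytes "hi\n"))
val () = Posix.IO.close outfd
val pipeLine = TextIO.inputLine (instreamOfFd (infd, "<pipe>"))
```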
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/11 14:35:00 CDT Tag: blume-20040511-win32sock Description: Added previously missing support for many socket-related functions under win32. Thanks to David Hansel for the voluminous patch! (I have not tested this patch under win32 yet.) Here is David's e-mail: Hi, Attached to this email you find a diff against sml/nj 110.45 that will enable socket support under Windows. To apply the patch (using unix or cygwin) 1) gunzip runtime.diff.gz 2) "cd" into "src/runtime" in the source tree of a fresh 110.45 installation. 3) patch -p 1 < [your/path/to]runtime.diff The code compiles fine but has NOT yet been extensively tested. I only ran a few tests for basic socket client functionality (which worked fine). Especially the functions that use ioctl are not tested at all and might not work (see below). I implemented this since we want to move to a newer version of sml/nj but need socket support in order to use it. This is the first time I even had a look at the sml/nj source, so please review my changes before making this part of the distribution! Here are a few issues that I think might be better for someone to solve who is more familiar with the sml/nj source (and socket programming): - getnetbyaddr.c and getnetbyname.c will raise a "not implemented" exception since I could not figure out what the windows equivalent of these functions is - In sockets-osdep.h there are a some #include statements that are only used in a few files that include sockets-osdep.h - In smlnj-sock-lib.c, function init_fn() calls WSAStartup() but does not process its return value since I don't know how to report an error upwards. - It would probably be good to have a call to WSACleanup() when the library is unloaded (if there is such a possibility). Otherwise I think Windows will take care of this automatically when the process finishes. 
- I used ioctlsocket() as a replacement for ioctl() but I have no idea if that is actually the proper replacement on Windows. - All these issues are marked in the code by "FIXME" comments. We use sml/nj extensively in our products and are quite happy with it. I hope this contribution will help you. Keep up the good work! David ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/11 14:20:00 CDT Tag: blume-20040511-installml Description: Fixed two bugs in installml script. (Thanks to Vesa A. Norrman for the patch.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/11 14:05:00 CDT Tag: blume-20040511-nlffi-netbsd Description: Added support for nlffi under netbsd. (Thanks to Vesa A. Norrman for the patch.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/11 12:05:00 CDT Tag: blume-20040511-exports Description: As per request by Adam Chlipala, extended various export lists in compiler-related .cm-files. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/05/11 11:35:00 CDT Tag: blume-20040511-allsource Description: The installer now honors the "src-smlnj" target again, although its meaning has changed from "all sources required for the compiler" to "all sources the installer knows about". In other words, if you enable "src-smlnj" in the "targets" file, then the installer will pull in sources for everything. (Notice that this refers to source code only. Compiled code is still only installed for modules that were requested explicitly or which are required for other modules that were requested explicitly.)
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/04/23 17:40:00 CDT Tag: blume-20040423-ieee-scan Description: Fixed IEEEReal.scan (and .fromString) so that if there is an overflow in the exponent calculation we get INF or ZERO (depending on the mantissa and the sign of the exponent). ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/04/23 10:40:00 CDT Tag: blume-20040423-ml-build Description: The ml-build script now terminates with a non-0 status when something goes wrong. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/04/22 16:35:00 CDT Tag: blume-20040422-Option Description: Made exception Option to be the same as exception Option.Option (as it should be). ---------------------------------------------------------------------- Name: Allen Leung (leunga (at) reservoir (dot) com) Date: 2004/03/19 14:40:00 EST Tag: leunga-20040319-cygwin-nlffi Description: Fixed the runtime so that ml-nlffi-lib runs on the cygwin version of SML/NJ. The problem is that lib = dlopen(NULL, ...) f = dlsym(lib, "malloc"); does not work on Windows unless we explicitly export symbols such as 'malloc' during linking. We fixed this by explicitly exporting the required symbols with the magic gcc incantation: -Wl,--export-all cygwin.def where cygwin.def is a file containing all the symbols that we wish to export. I suspect this is a Windows problem and we'll have to do the same (somehow with windows compilers) when we build the native win32 version with the system calls LoadLibrary/GetProcAddress. 
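Note on blume-20040422-Option: with the top-level exception Option now identical to Option.Option, a handler written against either name catches both. A minimal sketch (the function name getOrElse is illustrative only):

```sml
(* Return the payload of an option, or a default when it is NONE.
   The handler names the top-level exception Option. *)
fun getOrElse (default : 'a) (opt : 'a option) : 'a =
  Option.valOf opt handle Option => default

(* Raising via the qualified name is caught by the unqualified one,
   because both names denote the same exception. *)
val sameExn = (raise Option.Option) handle Option => true
```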
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/03/04 16:35:00 CST Tag: blume-20040304-intinf-fmt Description: Fixed problem with IntInf.fmt (sign would show up on the right instead of on the left for BIN, OCT, and HEX). ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/03/04 11:25:00 CST Tag: blume-20040304-symlinks Description: Fixed problem with installer script (unix only) where bin/ml-yacc and friends pointed (via symlinks) to absolute locations instead of just .run-sml. This was reported by Vesa A Norrman. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/02/13 14:50:00 CST Tag: Release_110_45 Description: New working version (110.45). New bootfiles. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/01/26 15:15:15 CST Tag: blume-20040126-toplevel Description: Improved handling of exceptions at the interactive toplevel. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2004/01/26 11:25:00 Tag: blume-20040126-app Description: Type of top-level "app" corrected. Added code for setting vp_limitPtrMask to Win32-specific runtime. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/11/18 17:10 CST Tag: blume-20031118-basis-fiddle Description: - changed Timer interface to what might become the spec - POSIX_FLAGS -> BIT_FLAGS according to spec - some other minor discrepancies wrt. 
spec eliminated ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/11/06 12:00:00 CST Tag: Release_110_44 Description: New working version (110.44). New bootfiles. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/11/04 11:50:00 CST Tag: blume-20031104-move-libraries Description: Eliminated the "dont_move_libraries" directive in config/targets. (The mechanism was broken and could not be fixed easily. Moreover, there does not seem to be any reason not to move all libraries into lib during installation. I originally implemented this directive as a backward-compatibility feature when I first introduced the new CM. Now that things have been stable for a long time and going back to the old CM is not an option, there is no reason to keep it around.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/11/03 16:00:00 CST Tag: blume-20031103-installdir Description: Made installer honor INSTALLDIR variable again. (Thanks to Chris Richards for pointing out the problem and providing the solution.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/10/01 17:05:00 CDT Tag: blume-20031001-lal-mlrisc Description: MLRISC bug fix from Lal. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/30 16:10:00 CDT Tag: blume-20030930-primio-bat Description: 1. Added openVector, nullRd, and nullWr to PRIM_IO. 2. Improved .bat files (for Win32 port) to make things work under Win95. (thanks to Aaron S. 
Hawley for this one) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/26 16:05:00 CDT Tag: blume-20030926-wrappriv Description: Added missing wrapper for privilege "primitive" in $smlnj/viscomp/core.cm. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/26 15:00:00 CDT Tag: blume-20030926-110_43_3 Description: - additional cleanup - version number bump, NEW BOOTFILES ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/26 12:00:00 CDT Tag: blume-20030926-ppautoload Description: I modified the read-eval-print loop so that the autoloader gets invoked whenever the prettyprinter tries to look up a symbol that is not currently defined in the toplevel environment but which appears in CM's autoload registry. As a result, we see far fewer of those ?.Foo.Bar.xxx names in the prettyprinter's output. In addition to this I tried to clean up some pieces of the Basis implementation (e.g., Socket, Word8Array) in order to prevent other instances of these ?.Foo.Bar.xxx names from being printed. The mechanism that picks names for types still needs some work, though. (Right now it seems that if there is a type A.t which is defined to be B.u, but B is unavailable at toplevel, then A.t gets printed as "?.B.u" although the perhaps more sensible solution would be to use "A.t" in this case. In other words, the prettyprinter should follow a chain of DEFtycs not farther than there are corresponding toplevel names in the current environment.) 
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/24 16:31:00 CDT Tag: blume-20030924-installer Description: Another installer tweak: All the ML code for the installer is now compiled during CMB.make and put into a little library called $smlnj/installer.cm. The installation then simply invokes sml -m $smlnj/installer.cm and everything happens automagically. Win32: ML code senses value of environment variable SMLNJ_HOME. Unix: ML code senses values of environment variables ROOT, CONFIGDIR, and BINDIR. The new scheme guarantees that the ML code responsible for the installation is in sync with the APIs of the main system. Also, the installer is somewhat faster because the installer script is precompiled. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/24 15:35:00 CDT Tag: blume-20030924-synsock Description: Added a signature SYNCHRONOUS_SOCKET to basis.cm. This is like SOCKET but excludes all non-blocking operations. Defined SOCKET (in Basis) and CML_SOCKET in terms of SYNCHRONOUS_SOCKET. Removed superfluous implementations of non-blocking operations from CML's Socket structure. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/24 15:10:05 CDT Tag: blume-20030924-sockets Description: 1. Fixed SOCKET API and implementation to match Basis spec. This required changing the internal representation of sockets to one that remembers (for each socket file descriptor) whether it is currently blocking or non-blocking. This state is maintained lazily (i.e., a system call is made only if the state actually needs to change). 2. OS-specific details of sockets were moved into separate files, thus making it possible to unify the bulk of the socket implementations between Unix and Win32. 3. CML's socket API changed accordingly. 
(Note that we need to remove non-blocking functions from this API since they are redundant in the case of CML!) 4. CML's socket implementation now makes use of non-blocking functions provided by Basis, thus removing all OS-dependent code from this part of CML. 5. Changed Real64.precision from 52 to 53. Minor cleanup in Real64 code. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/22 12:10:00 CDT Tag: blume-20030922-110_43_2 Description: Made a new interim version and bootfiles for developer's bootstrapping convenience. 110.43.2 -- NEW BOOTFILES ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/19 15:55:00 CDT Tag: blume-20030919-cmdir Description: 1. new-install.sh -> install.sh 2. changed default CM "metadata" directory name to ".cm" (instead of "CM") 3. tweaked installer so that another name instead of .cm can be chosen at install time (by setting the CM_DIR_ARC environment variable during installation); once installation is complete, the name is fixed ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/18 16:00:00 CDT Tag: blume-20030918-110_43_1 Description: Made a new interim version and bootfiles for developer's bootstrapping convenience. 110.43.1 -- NEW BOOTFILES ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/18 15:20:00 CDT Tag: blume-20030918-misc Description: 1. Exported fractionsPerSecond etc. from TimeImp (but not from Time as this seems to be controversial at the moment) and used those in Posix.ProcEnv.times. 2. Added Time.{from,to}Nanoseconds to Time. 3. Improved Real.{from,to}LargeInt by avoiding needless calculations. 
For example, fromLargeInt never needs to look at more than 3 "big digits" to get its 53 bits of precision. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/17 16:30:00 CDT Tag: blume-20030917-real32-slices Description: Added an entry to the primitive environment (compiler/Semant/statenv/prim.sml) for int32->real64 conversion and added code to compiler/CodeGen/main/mlriscGen.sml to implement it. Removed some of the "magic" constants in real64.sml and replaced them with code that generates these values from their corresponding integer counterparts. Made all(?) the slice-related changes to the Basis and made everything compile again... ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/15 17:45:00 CDT Tag: blume-20030915-rbase Description: Fixed bug in Real.fromLargeInt. ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/13 18:11:00 CDT Tag: blume-20030913-libinstall Description: Minor bugfix in config/libinstall (set anchor with path to standalone tool after installing it, otherwise libraries that need ml-lex or ml-yacc won't compile the first time the installer runs). ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/12 11:45:00 CDT Tag: blume-20030912-various Description: - fixed bug in Real.toLargeInt - fixed bug in Posix.ProcEnv.times - changed inputLine functions to return an option - minor installer improvements / bugfixes - changed default @SMLalloc parameter for x86/celeron to 64k ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/09 22:00:00 CDT Tag: Release_110_43 Description: New working release 110.43. New bootfiles. 
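Note on blume-20030912-various: since the inputLine functions now return an option, end of input is signalled by NONE rather than by an empty string. A minimal sketch of a reading loop written against the new interface (countLines is a made-up name for this example):

```sml
(* Count the lines of a text stream; NONE marks end of input. *)
fun countLines (ins : TextIO.instream) : int =
  let
    fun loop n =
      case TextIO.inputLine ins of
          NONE => n
        | SOME _ => loop (n + 1)
  in
    loop 0
  end

(* Usage with an in-memory stream. *)
val n = countLines (TextIO.openString "a\nb\nc\n")
```

Code written for the old interface, which compared the result against "", needs to be updated to match on NONE and SOME.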
---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/09 19:20:00 CDT Tag: blume-20030909-installer Description: Rewrote large parts of config/install.sh in SML (config/libinstall.sml). Modified config/install.bat to take advantage of it. Also modified config/install.sh (and called it config/new-install.sh) to take advantage of it on Unix systems. (The SML code is (supposed to be) platform-independent.) The installer can now install everything under Win32 as well as under *nix as long as it compiles. Other changes: - made CML compile again under Win32 - made eXene compile under Win32 (by providing a fake structure UnixSock and by using OS.Process.getEnv instead of Posix.ProcEnv.getenv) - fixed a bug in nowhere: it assumed that type OS.Process.status is the same as type int; under Win32 it isn't - fixed some slice-related problems in the win32-specific parts of CML - added a functor argument "sameVol" to os-path-fn.sml in the Basis (under Win32, the volume name is case-insensitive, and the OS.Path code compares volume names for equality) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/08 11:55:00 CDT Tag: blume-20030908-fullpath Description: Made Win32 version of OS.FileSys.fullPath return current directory when given an empty string. This is what the spec says, and incidentally, CM depends on it. (CM otherwise goes into an infinite loop in certain cases when presented with the name of a non-existing .cm file.) ---------------------------------------------------------------------- Name: Matthias Blume (blume (at) tti - c (dot) org) Date: 2003/09/04 16:30:00 CDT Tag: blume-20030905-slices-etc Description: 1. Changed interface to vectors and arrays in Basis to match (draft) Basis spec. 2. Added signatures and implementations of slices according to Basis spec. 3.
Edited source code throughout the system to make it compile again under 1. and 2. (In some cases code had to be added to have it match the new signatures.) 4. MLRISC should be backward-compatible: the copies of the originals of files that needed to change under 3. were retained, the .cm files check the compiler version number and use old versions when appropriate. 5. Changed type of OS.FileSys.readDir and Posix.FileSys.readdir to dirstream -> string option (in accordance with Basis spec). 6. When generating code that counts lines, ml-lex used function CharVector.foldli, taking advantage of its old interface. This has been replaced with the corresponding code from CharVectorSlice. (html-lex must be re-lexed!) 7. BitArray in smlnj-lib/Util has been extended/modified to match the new MONO_ARRAY signature. (Do we need BitArraySlice?) 8. Removed temporary additions (fromInternal, toInternal) from the (now obsolete) IntInf in smlnj-lib/Util. 9. Cleaned up structure Byte. 10. Added localOffset, scan, and fromString to Date (according to spec). Cleaned/corrected implementation of Date. (Still need to check for correctness; implement better canonicalizeDate.) 11. Added "scan" to signature IEEE_REAL. 12. Some improvements to IntInf [in particular: efficiency-hack for mod and rem when second operand is 2 (for parity checks).] 13. Changed representation of type Time.time, using a single IntInf.int value counting microseconds. This considerably simplified the implementation of structure Time. We now support negative time values; scan and fromString handle signs. 14. Functor PrimIO now takes two additional arguments (VectorSlice and ArraySlice). 
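Note on item 6 of blume-20030905-slices-etc: code that used to traverse part of a string through the old CharVector.foldli interface can be expressed with CharVectorSlice instead. The sketch below shows the general idea only; it is not the code that ml-lex generates, and countNewlines is a hypothetical name.

```sml
(* Count newline characters in s, starting at index start.
   slice (s, start, NONE) covers everything from start to the end. *)
fun countNewlines (s : string, start : int) : int =
  let
    val sl = CharVectorSlice.slice (s, start, NONE)
  in
    CharVectorSlice.foldl
      (fn (c, n) => if c = #"\n" then n + 1 else n) 0 sl
  end

val whole = countNewlines ("a\nb\nc", 0)
val tail = countNewlines ("a\nb\nc", 2)
```

The sub-range is now described by the slice value itself rather than by extra arguments to the fold.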
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2003/08/28 17:00:00 CDT
Tag: blume-20030828-intinf
Description:

This is a major update which comes with a version number bump
(110.42.99 -- yes, we are really close to 110.43 :-), NEW BOOTFILES,
and an implementation of IntInf in the Basis.

There are a fairly large number of related changes and updates
throughout the system:

Basis:
  - Implemented IntInf.
  - Made LargeInt a projection of IntInf (by filtering through INTEGER).
  - Added some missing Real64 operations, most notably Real.toLargeInt.
  - Added FixedInt as a synonym for Int32.

compiler:
  * Added support for a built-in intinf type.
    - literals
    - pattern matching
    - conversion shortcuts (Int32.fromLarge o Int.toLarge etc.)
    - overloading on literals and operations
    This required adding a primitive type intinf, some additional
    primops, and implementations for several non-trivial intinf
    operations in Core.  (The intinf type is completely abstract to
    the compiler; all operations get delegated back to the Core.)

  * Intinf equality is handled by polyequal.  However, the compiler
    does not print its usual warning in this case (since polyequal is
    the right thing to do there).

  * Improved the organization of structure InlineT.

  * A word about conversion primops:
    If conversions involving intinf do not cancel out during CPS
    contract, then the compiler must insert calls to Core functions.
    Since all core access must be resolved already during the FLINT
    translate phase, it would be too late at the time of CPS contract
    to add new Core calls.  For this reason, conversion primops for
    intinf carry two arguments:
      1. the numeric argument that they are supposed to convert, and
      2. the Core function that can help with this conversion if
         necessary.
    If CPS contract eliminates a primop, then the associated Core
    function becomes dead and goes away.
    Intinf conversion primops that do not get eliminated by CPS
    contract get rewritten into calls of their core functions by a
    separate, new phase.

interactive system:
  - Control.Print.intinfDepth controls max length of intinf constants
    being printed.  (Analogous to Control.Print.stringDepth.)
  - Cleanup in printutil and pputil: got rid of unused stuff and
    duplicates; replaced some of the code with code that makes better
    use of library functionality.

CM:
  Bugfix: parse-errors in init group (system/smlnj/init/init.cmi) are
  no longer silent.

CKIT:
  Fixed mismatched uses of Int32 and LargeInt.  I always decided in
  favor of LargeInt -- which is now the same as IntInf.
  CKIT-knowledgeable people should check whether this is what's
  intended and otherwise change things back to using Int32 or
  FixedInt.

Throughout the code:
  Started using IntInf.int literals and built-in operations (e.g.,
  comparison with 0) where this seems appropriate.

----------------------------------------------------------------------
Name: Dave MacQueen (dbm@cs.uchicago.edu)
Date: 2003/08/13 11:36:00 CDT
Tag: dbm-20030813-mcz-merge1
Description:

Merging changes from the mcz-branch development branch into trunk.
These changes involve replacement of the emulated old prettyprinter
interface with direct use of the SML/NJ Lib PP library, and fixing of
a couple of bugs (895, 1186) relating to error messages.  A new
prettyprinter for ast datatypes (Elaborator/print/ppast.{sig,sml})
has been added.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2003/08/11 15:45:00 CDT
Tag: blume-20030811-windows
Description:

Version number bumped to 110.42.9.  NEW BOOTFILES!!!

    http://smlnj.cs.uchicago.edu/dist/working/110.42.9/

This patch restores SML/NJ's ability to run under win32.
There are a number of changes, including fixes for several bugs that had gone unnoticed until now: - uname "CYGWIN_NT*" is recognized as win32 (This is relevant only when trying to run the win32 version from within cygwin.) - There are a number of simple .bat scripts that substitute for their corresponding Unix shell-scripts. (See below.) - The internals of ml-build have been modified slightly. The main difference is that instead of calling ".link-sml" (or link-sml.bat) using OS.Process.system, the ML process delegates this task back to the script. Otherwise problems arise in mixed environments such as Cygwin where scripts look and work like Unix scripts, but where OS.Process.system cannot run them. - In CM, the srcpath pickler used native pathname syntax -- which is incorrect in the case of cross-compilation. The new pickle format is independent of platform-specific naming conventions. - Path configuration files (such as lib/pathconfig) can now choose between native and standard syntax. Placing a line of the form standard! into the file causes all subsequent paths to be interpreted using CM standard pathname syntax (= Unix conventions); a line native! switches back to native style. This was needed so that path config files can be written portably, see src/system/pathconfig. - Runtime system: - win32-filesys.c: get_file_time and set_file_time now access modification time, not creation time. - I/O code made aware of new array representation. - Bug fixes in X86.prim.masm. - src/system/makeml made aware of win32. (For use under cygwin and other Unix-environments for windows.) - In Basis, fixed off-by-one error in win32-io.sml (function vecF) which caused BinIO.inputAll to fail consistently. .bat scripts: Windows .bat scripts assume that SMLNJ_HOME is defined. - sml.bat, ml-yacc.bat, ml-lex.bat: Driver scripts for standalone applications (sml, ml-yacc, ml-lex). - ml-build.bat: analogous to ml-build. - config\install.bat: Analogous to config/install.sh. 
  This requires that SMLNJ_HOME is set and that Microsoft Visual C is
  ready to use.  (nmake etc. must be on the path, and vcvars32 must
  have been run.)  Moreover, sources for ml-lex and ml-yacc need to
  exist under src, and the bootfile hierarchy must have been unpacked
  under sml.boot.x86-win32.
  The script is very primitive and does a poor job at error checking.
  It only installs the base system, ml-lex, and ml-yacc.  No other
  libraries are being installed (i.e., you get only those that are
  part of the compiler.)
- link-sml.bat: analogous to .link-sml, but not currently used

Unrelated bug fixes:

- ml-nlffigen now exports structures ST_* corresponding to incomplete
  types.
- Added getDevice to PP/src/pp-debug-fn.sml.  (Would not compile
  otherwise.)

----------------------------------------------------------------------
Name: Dave MacQueen (macqueen@cs.uchicago.edu)
Date: 2003/06/17
Tag: macqueen-20030617-bug895
Description:

Modified compiler/Elaborator/print/pptype.sml to fix bug 895.
Tag will be used for new development branch (mcz-branch) for use by
MacQueen, (Lucasz) Zairek, and (George) Cao at uchicago.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2003/05/27 16:55:00 CDT
Tag: blume-20030527-polyeq
Description:

Tried to eliminate most cases of polymorphic equality.

----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2003/05/21 17:45:00 CDT
Tag: blume-20030517-complete
Description:

Two changes:

1. Added a flag for controlling whether non-exhaustive bindings will
   be treated as errors (default is false).
2. Cleaned up the *entire* source tree so that CMB.make goes through
   without a single non-exhaustive match- or bind warning.
----------------------------------------------------------------------
Name: Matthias Blume (blume (at) tti - c (dot) org)
Date: 2003/05/17 10:20:00 CDT
Tag: blume-20030517-absyn
Description:

1. Added cases for IF, WHILE, ANDALSO, and ORELSE to Absyn.  This
   mainly affects the quality of error messages.  However, some of
   the code is now more straightforward than before.  (Treatment of
   the above four constructs in translate.sml is much simpler than
   the "macro-expansion" that was going on before.  Plus, the match
   compiler no longer gets invoked just to be able to compile an
   if-expression.)

2. The ErrorMsg.Error exception is now caught and absorbed by the
   interactive loop.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2003/05/16 13:05:00 CDT
Tag: leunga-20030516-cygwin-runtime
Description:

Ported the runtime system to cygwin, which uses the unix x86-unix bin
files.  Missing/buggy features:

  o getnetbyname, getnetbyaddr: these functions seem to be missing in
    the Cygwin library.
  o Ctrl-C handling may be flaky.
  o Windows system calls and Windows I/O are not supported.

A new set of binfiles is located at:

    http://www.dorsai.org/~leunga/boot.x86-unix.tgz

This is only needed for bootstrapping the cygwin version of smlnj.
Other x86 versions can use the existing binfiles.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2003/04/08 15:42:00 CDT
Tag: blume-20030408-listpair
Description:

1. Added a target 'mlrisc' to installer.
2. Added missing elements to structure ListPair.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2003/01/07 10:40:00 EST
Tag: leunga-20030107-int-rem
Description:

Fixed a bug in Int.rem(x,y) where y is a power of 2 on x86.
The arguments to the SUBL instruction were swapped.
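
The entry above does not show the instruction sequence involved. As an illustration of why operand order matters in a power-of-two remainder, here is a Python sketch of one common formulation (a mask followed by a single corrective subtraction). It is an assumption that the x86 backend does something of this shape; the function names are hypothetical.

```python
def rem_pow2(x, k):
    """Round-to-zero remainder of x by 2**k using a mask and one subtraction."""
    y = 1 << k
    r = x & (y - 1)      # non-negative residue in [0, y)
    if x < 0 and r != 0:
        r = r - y        # order matters: y - r would yield a wrong, positive result
    return r


def rem_reference(x, y):
    """Reference remainder whose sign follows the dividend (like ML's rem)."""
    r = abs(x) % abs(y)
    return -r if x < 0 else r
```

For example, rem_pow2(-5, 2) masks -5 down to 3 and then computes 3 - 4 = -1, which matches the round-to-zero remainder; swapping the subtraction operands would give 1 instead.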
----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/12/12 16:25:00 EST
Tag: blume-20021212-risc-ra
Description:

Fixed a serious bug in the rewrite code for FP spilling/reloading that
sent the RA into an infinite loop when floating point registers get
spilled.  (Because of this bug, e.g., nucleic stopped compiling
between 110.37 and 110.38.)

There was another set of potential problems related to the handling of
MLRISC annotations (but those did not yet cause real problems,
apparently).

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/12/06 22:40:00 EST
Tag: blume-20021206-cm-fileid
Description:

Added a call of SrcPath.sync at the beginning of Parse.parse (in CM).
This fixes the problem of CM getting confused by files that suddenly
change their identity (e.g., by getting unlinked and recreated by some
text editor such as vi).

There might be a better/cheaper/cleaner way of doing this, but for now
this will have to do.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/10/28 09:50:00 EST
Tag: blume-20021028-typecheck
Description:

Exported structure Typecheck from $smlnj/viscomp/core.cm.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/10/17 09:10:00 EDT
Tag: Release_110_42
Description:

In good old tradition, there has been a slight hiccup so that we have
to patch 110.42 after the fact.  The old release tag has been replaced
(see below).

The change solves a problem with two competing approaches to the
configuration problem regarding MacOS 10.1 vs. MacOS 10.2, which got
in each other's way.

This change only affects the runtime system code and the installer
script.  (No new bootfiles.)

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/10/16 12:00:00 EDT
Tag: Release_110_42_removed
Description:

New working release.
New bootfiles. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/10/10 13:10:00 EDT Tag: blume-20021010-ppc-divs Description: The mltree operator DIVS must be implemented with an overflow check on the PPC because the hardware indicates divide-by-zero using "overflow" as well. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/07/23 11:45:00 Tag: blume-20020723-smlnj-home Description: Sml now senses the SMLNJ_HOME environment variable. If this is set, then the bin dir is assumed to be in $SMLNJ_HOME/bin and (unless CM_PATHCONFIG is also set), the path configuration file is assumed to be in $SMLNJ_HOME/lib/pathconfig. This way one can easily move the entire tree to some other place and everything will "just work". (Companion commands such as ml-build and ml-makedepend also sense this variable.) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/07/12 21:19:00 EDT Tag: blume-20020712-liveness Description: Exported two useful "step" functions from liveness module (MLRISC). ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/07/05 16:00 EDT Tag: Release_110_41 Description: New working release. New bootfiles. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/07/05 10:25:00 EDT Tag: blume-20020705-btimp Description: Exported structure BTImp from $smlnj/viscomp/debugprof.cm so that other clients can set up backtracing support. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/06/25 17:23:00 EDT Tag: blume-20020625-fpmax Description: Fixed a bug in translation of INLMAX (and INLMIN) for the floating-point case. (The sense of the isNaN test was reversed -- which made min and max always return their first argument.) 
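
To illustrate how the sense of a single isNaN test decides the result in the fpmax entry above, here is a small Python sketch. It is not the code generated for INLMAX; it assumes the NaN rule of the Basis specification for Real.max (if exactly one argument is NaN, return the other), and both function names are hypothetical.

```python
import math


def real_max(x, y):
    """Maximum that returns the non-NaN argument when exactly one is NaN."""
    if x > y:                 # any comparison involving NaN is False
        return x
    return x if math.isnan(y) else y


def real_max_reversed(x, y):
    """Same shape with the isNaN test inverted, shown only to illustrate the bug."""
    if x > y:
        return x
    return y if math.isnan(y) else x
```

With ordinary arguments, real_max_reversed(1.0, 2.0) returns 1.0: whenever the second argument is a number, the inverted test hands back the first argument, which is consistent with the symptom described in the entry.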
---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/06/11 Tag: blume-20020611-unixpath Description: Back-ported OS.Path.{from,to}UnixPath from idlbasis-devel branch. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/06/10 16:35:00 EDT Tag: blume-20020610-ieeereal Description: I back-ported my implementation of IEEEReal.fromString from the idlbasis-devel branch so that we can test it. Another small change is that ppDec tries to give more information than just "" in the case of functors. However, this code is broken in some mysterious way if the functor's body's signature has not been declared by ascription but gets inferred from the implementation. This needs fixing... ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/31 Tag: blume-20020531-btrace-mode Description: Resurrected SMLofNJ.Internals.BTrace.mode. (It accidentally fell by the wayside when I switched over to using Controls everywhere.) ---------------------------------------------------------------------- Name: Lal George Date: 2002/05/23 12:21:40 EDT Tag: george-20020523-visual-labels Description: Labels are now displayed in the graphical output to make the fall-through and target blocks obvious. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/22 11:03:00 EDT Tag: blume-20020522-shrink Description: John tweaked yesterday's fix for 1131 to handle an out-of-memory situation that comes up when allocating huge arrays. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/21 16:00:00 EDT Tag: Release_110_40 Description: New working release (110.40). New bootfiles. [Also: John Reppy fixed GC bug 1131.] 
---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/21 12:35:00 EDT Tag: blume-20020521-cmdoc Description: CM documentation update. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/21 10:55:00 EDT Tag: blume-20020521-misc Description: - John tweaked runtime to be silent on heap export (except when GC messages are on). - I added a few more things (cross-compiling versions of CMB) to config/preloads (as suggestions). ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/20 22:25:00 EDT Tag: blume-20020520-controls Description: - Added ControlUtil structure to control-lib.cm. - Use it throughout. - Used Controls facility to define MLRISC controls (as opposed to registering MLRISC control ref cells with Controls after the fact) - Fixed messed-up controls priorities. * Removed again all the stuff from config/preloads that one wouldn't be able to preload at the time the initial heap image is built. (Many libraries, e.g., CML, do not exist yet at this time. The only libraries that can be preloaded via config/preloads are those that come bundled with the bootfiles.) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/20 10:59:00 EDT Tag: blume-20020520-preloads Description: Added a lot of commented-out suggestions for things to be included in config/preloads. ---------------------------------------------------------------------- Name: Allen Leung Date: 2002/05/18 14:20:00 EDT Tag: leunga-20020518-mdl Description: o Made the mdl tool stuff compile and run again. o I've disabled all the stuff that depends on RTL specifications; they are all badly broken anyway. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/17 16:49:00 EDT Tag: blume-20020517-controls Description: 0. 
John Reppy made several modifications to the SML/NJ library. In particular, there is a shiny new controls-lib.cm. 1. Pushed new controls interface through compiler so that everything compiles again. 2. Added FormatComb and FORMAT_COMB to the CML version of the SML/NJ library (so that CML compiles again). 3. Modified init scripts because XXX_DEFAULT environment variables are no longer with us. (Boot-time initialization is now done using the same environment variables that are also used for startup-time initialization of controls.) ---------------------------------------------------------------------- Name: Lal George Date: 2002/05/15 09:20:10 EDT Tag: george-20020515-pseudo-op-decls Description: All pseudo-ops emitted before the first segment declaration such as TEXT, DATA, and BSS directives are assumed to be global declarations and are emitted first in the assembly file. This is useful in a number of situations where one has pseudo-ops that are not specific to any segment, and also works around the constraint that one cannot have client pseudo-ops in the TEXT segment. Because no segment is associated with these declarations it is an error to allocate any space or objects before the first segment directive and an exception will be raised. However, we cannot make this check for client pseudo-ops. These top level declarations are a field in the CFG graph_info. In theory you can continue to add to this field after the CFG has been built -- provided you know what you are doing;-) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/05/13 16:40:00 EDT Tag: blume-20020513-pp-etc Description: A few minor bugfixes: - Stopgap measure for bug recently reported by Elsa Gunter (ppDec). (Bogus printouts for redefined bindings still occur. Compiler bug should no longer occur now. We need to redo the prettyprinter from scratch.) 
- CM pathname printer now also adds escape sequences for ( and )
- comment and documentation fixes for ml-nlffi

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/05/10 16:40:00 EDT
Tag: blume-20020510-erg-textio
Description:

Applied the following bugfix provided by Emden Gansner:

    Output is corrupted when outputSubstr is used rather than output.

    The problem occurs when a substring

        ss = (s, dataStart, dataLen)

    where dataStart > 0, fills a stream buffer with avail bytes left.
    avail bytes of s, starting at index dataStart, are copied into the
    buffer, the buffer is flushed, and then the remaining
    dataLen-avail bytes of ss are copied into the beginning of the
    buffer.  Instead of starting this copy at index dataStart+avail in
    s, the current code starts the copy at index avail.

    Fix:
    In text-io-fn.sml, change line 695 from
        val needsFlush = copyVec(v, avail, dataLen-avail, buf, 0)
    to
        val needsFlush = copyVec(v, dataStart+avail, dataLen-avail, buf, 0)

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/04/12 13:55:00 EDT
Tag: blume-20020412-assyntax
Description:

1. Grabbed newer assyntax.h from the XFree86 project.
2. Fiddled with how to compile X86.prim.asm without warnings.
3. (Very) Minor cleanup in CM.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/04/01 (no joke!) 17:07:00 EST
Tag: blume-20020401-x86div
Description:

Added full support for div/mod/rem/quot on the x86, using the machine
instruction's two results (without clumsily recomputing the remainder)
directly where appropriate.

Some more extensive power-of-two support was added to the x86
instruction selector (avoiding expensive divs, mods, and muls where
they can be replaced with cheaper shifts and masks).  However, this
sort of thing ought to be done earlier, e.g., within the CPS optimizer
so that all architectures benefit from it.
The compiler compiles to a fixed point, but changes might be somewhat fragile nevertheless. Please, report any strange things that you might see wrt. div/mod/quot/rem... ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/29 17:22:00 Tag: blume-20020329-div Description: Fixed my broken div/mod logic. Unfortunately, this means that the inline code for div/mod now has one more comparison than before. Fast paths (quotient > 0 or remainder = 0) are not affected, though. The problem was with quotient = 0, because that alone does not tell us which way the rounding went. One then has to look at whether remainder and divisor have the same sign... :( Anyway, I replaced the bootfiles with fresh ones... ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/29 14:10:00 EST Tag: blume-20020329-inlprims Description: NEW BOOTFILES!!! Version number bumped to 110.39.3. Primops have changed. This means that the bin/boot-file formats have changed as well. To make sure that there is no confusion, I made a new version. CHANGES: * removed REMT from mltree (remainder should never overflow). * added primops to deal with divisions of all flavors to the frontend * handled these primops all the way through so they map to their respective MLRISC support * used these primops in the implementation of Int, Int32, Word, Word32 * removed INLDIV, INLMOD, and INLREM as they are no longer necessary * parameterized INLMIN, INLMAX, and INLABS by a numkind * translate.sml now deals with all flavors of INL{MIN,MAX,ABS}, including floating point * used INL{MIN,MAX,ABS} in the implementation of Int, Int32, Word, Word32, and Real (but Real.abs maps to a separate floating-point-only primop) TODO items: * Hacked Alpha32 instruction selection, disabling the selection of REMx instructions because the machine instruction encoder cannot handle them. 
(Hppa, PPC, and Sparc instruction selection did not handle REM in the first place, and REM is supported by the x86 machine coder.) * Handle DIV and MOD with DIV_TO_NEGINF directly in the x86 instruction selection phase. (The two can be streamlined because the hardware delivers both quotient and remainder at the same time anyway.) * Think about what to do with "valOf(Int32.minInt) div ~1" and friends. (Currently the behavior is inconsistent both across architectures and wrt. the draft Basis spec.) * Word8 should eventually be handled natively, too. * There seems to be one serious bug in mltree-gen.sml. It appears, though, as if there currently is no execution path that could trigger it in SML/NJ. (The assumptions underlying functions arith and promotable do not hold for things like multiplication and division.) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/27 16:27:00 EST Tag: blume-20020327-mlrisc-divisions Description: Added support for all four division operations (ML's div, mod, quot, and rem) to MLRISC. In the course of doing so, I also rationalized the naming (no more annoying switch-around of DIV and QUOT), by parameterizing the operation by div_rounding_mode (which can be either DIV_TO_ZERO or DIV_TO_NEGINF). The generic MLTreeGen functor takes care of compiling all four operations down to only round-to-zero div. Missing pieces: * Doing something smarter than relying on MLTreeGen on architectures like, e.g., the x86 where hardware division delivers both quotient and remainder at the same time. With this, the implementation of the round-to-neginf operations could be further streamlined. * Remove inlining support for div/mod/rem from the frontend and replace it with primops that get carried through to the backend. Do this for all int and word types. 
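
The reduction of the round-to-neginf operations (DIV_TO_NEGINF) to the round-to-zero ones (DIV_TO_ZERO) mentioned above can be sketched in Python. This is only an illustration of the arithmetic, not the MLTreeGen code, and the names quot_rem and div_mod are hypothetical. The correction is needed exactly when the remainder is nonzero and its sign differs from the divisor's sign.

```python
def quot_rem(x, y):
    """Round-to-zero quotient and remainder (ML's quot and rem)."""
    q = abs(x) // abs(y)
    if (x < 0) != (y < 0):
        q = -q
    return q, x - q * y


def div_mod(x, y):
    """Round-to-negative-infinity quotient and remainder (ML's div and mod),
    derived from the round-to-zero pair by one sign test."""
    q, r = quot_rem(x, y)
    if r != 0 and (r < 0) != (y < 0):
        return q - 1, r + y
    return q, r
```

Python's own // and % already round toward negative infinity, so they serve as a convenient cross-check: div_mod(-7, 2) gives (-4, 1), while quot_rem(-7, 2) gives (-3, -1).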
----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/25 17:25:00 EST
Tag: blume-20020325-divmod
Description:

I improved (hopefully without breaking them) the implementation of
Int.div, Int.mod, and Int.rem.  For this, the code in translate.sml
now takes advantage of the following observations:

  Let  q = x quot y
       r = x rem y
       d = x div y
       m = x mod y

where "quot" is the round-to-zero version of integer division that
hardware usually provides.  Then we have:

       r = x - q * y
           where neither the * nor the - will overflow
       d = if q >= 0 orelse x = q * y then q else q - 1
           where neither the * nor the - will overflow
       m = if q >= 0 orelse r = 0 then r else r + y
           where the + will not overflow

This results in substantial simplification of the generated code.
The following table shows the number of CFG nodes and edges generated
for

       fun f (x, y) = x OPER y
       (* with OPER \in div, mod, quot, rem *)

    OPER | nodes(old) | edges(old) | nodes(new) | edges(new)
    --------------------------------------------------------
    div  |         24 |         39 |         12 |         16
    mod  |         41 |         71 |         12 |         16
    quot |          8 |         10 |          8 |         10
    rem  |         10 |         14 |          8 |         10

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/25 22:06:00 EST
Tag: blume-20020325-cprotobug
Description:

Fixed a bug in cproto (c prototype decoder).

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/25 16:00:00 EST
Tag: blume-20020325-raw-primops
Description:

I did some cleanup to Allen's new primop code and replaced yesterday's
bootfiles with new ones.  (But they are stored in the same place.)

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/24 22:40:00 EST
Tag: blume-20020324-bootfiles
Description:

Made the bootfiles that Allen asked for.
----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/03/23 15:50:00 EST
Tag: leunga-20020323-flint-cps-rcc-primops
Description:

1. Changes to FLINT primops:

    (* make a call to a C-function;
     * The primop carries C function prototype information and specifies
     * which of its (ML-) arguments are floating point. C prototype
     * information is for use by the backend, ML information is for
     * use by the CPS converter. *)
  | RAW_CCALL of { c_proto: CTypes.c_proto,
                   ml_args: ccall_type list,
                   ml_res_opt: ccall_type option,
                   reentrant : bool
                 } option
    (* Allocate uninitialized storage on the heap.
     * The record is meant to hold short-lived C objects, i.e., they
     * are not ML pointers.  With the tag, the representation is
     * the same as RECORD with tag tag_raw32 (sz=4), or tag_fblock (sz=8)
     *)
  | RAW_RECORD of {tag:bool,sz:int}
  and ccall_type = CCALL_INT32 | CCALL_REAL64 | CCALL_ML_PTR

2. These CPS primops are now overloaded:

       rawload of {kind:numkind}
       rawstore of {kind:numkind}

   The one argument form is:

       rawload {kind} address

   The two argument form is:

       rawload {kind} [ml object, byte-offset]

3. RAW_CCALL/RCC now takes two extra arguments:

   a. The first is whether the C call is reentrant, i.e., whether
      ML state should be saved and restored.
   b. The second argument is a string argument specifying the name of
      the library and the C function.

   These things are not yet handled in the code generator.

4. In CProto,

   an encoding type of "bool" means "ml object" and is mapped into a
   C prototype of PTR.  Note that "bool" is different from "string",
   even though "string" is also mapped into PTR, because "bool" is
   assigned a CPS type of BOGt, while "string" is assigned INT32t.

5. Pickler/unpickler

   Changed to handle RAW_RECORD and newest RAW_CCALL

6. MLRiscGen,

   1. Changed to handle the new rawload/rawstore/rawrecord operators.
   2. Code for handling C Calls has been moved to a new module
      CPSCCalls, in the file CodeGen/cpscompile/cps-c-calls.sml

7.
   Added the conditional move operator

       condmove of branch

   to cps.  Generation of this is still buggy so it is currently
   disabled.

----------------------------------------------------------------------
Name: Lal George
Date: 2002/03/22 14:18:25 EST
Tag: george-20020322-cps-branch-prob
Description:

Implemented the Ball-Larus branch prediction-heuristics, and
incorporated graphical viewers for control flow graphs.

Ball-Larus Heuristics:
---------------------
See the file compiler/CodeGen/cpscompile/cpsBranchProb.sml.

By design it uses the Dempster-Shafer theory for combining
probabilities.  For example, in the function:

    fun f(n,acc) = if n = 0 then acc else f(n-1, n*acc)

the Ball-Larus heuristics predict that n=0 is unlikely
(OH-heuristic), and the 'then' branch is unlikely because of the
RH-heuristic -- giving the 'then' branch an even lower combined
probability using the Dempster-Shafer theory.

Finally, John Reppy's loop analysis in MLRISC further lowers the
probability of the 'then' branch because of the loop in the else
branch.

Graphical Viewing:
------------------
I merely plugged in Allen's graphical viewers into the compiler.  The
additional code is not much.  At the top level, saying:

    Control.MLRISC.getFlag "cfg-graphical-view" := true;

will display the graphical view of the control flow graph just before
back-patching.  daVinci must be in your path for this to work.  If
daVinci is not available, then the default viewer can be changed
using:

    Control.MLRISC.getString "viewer"

which can be set to "dot" or "vcg" for the corresponding viewers.  Of
course, these viewers must be in your path.

The above will display the compilation unit at the level of clusters,
many of which are small, boring, and uninteresting.  Also setting:

    Control.MLRISC.getInt "cfg-graphical-view_size"

will display clusters that are larger than the value set by the above.
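
The Dempster-Shafer combination mentioned in the Ball-Larus part of the entry above can be sketched as follows. This shows the standard two-outcome combination rule only; it is an assumption that cpsBranchProb.sml computes exactly this, and the probabilities 0.16 and 0.28 in the usage note are hypothetical values, not the ones used by the compiler.

```python
def combine(p1, p2):
    """Combine two independent estimates that a branch is taken
    (Dempster-Shafer rule specialized to a two-way branch).
    Undefined when the estimates contradict completely (p1 = 1, p2 = 0)."""
    agree_taken = p1 * p2
    agree_not_taken = (1.0 - p1) * (1.0 - p2)
    return agree_taken / (agree_taken + agree_not_taken)
```

If one heuristic gives the 'then' branch a probability of 0.16 and another gives 0.28, combine(0.16, 0.28) is roughly 0.069, lower than either estimate alone, which is the effect the entry describes.  An estimate of 0.5 carries no information: combine(0.5, p) is p.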
----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/21 22:20:00 EST
Tag: blume-20020321-kmp-bugfix
Description:

Changed the interface to the KMP routine in PreString and fixed
a minor bug in one place where it was used.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/03/21 20:30:00 EST
Tag: leunga-20020321-cfg
Description:

Fixed a potential problem in cfg edge splitting.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/03/21 17:15:00 EST
Tag: leunga-20020321-x86-fp-cfg
Description:

1. Recoded the buggy parts of x86-fp.

   a. All the block reordering code has been removed.
      We now depend on the block placement phases to do this work.

   b. Critical edge splitting code has been simplified and moved into
      the CFG modules, where it belongs.

   Both of these were quite buggy and complex.  The code is now much,
   much simpler.

2. X86 backend.

   a. Added instructions for 64-bit support.  Instruction selection
      for 64-bit has not been committed, however, since that requires
      changes to MLTREE which haven't been approved by Lal and John.

   b. Added support for FUCOMI and FUCOMIP when generating code for
      PentiumPro and above.  We only generate these instructions in
      the fast-fp mode.

   c. Added cases for JP and JNP in X86FreqProps.

3. CFG

   CFG now has a bunch of methods for edge splitting and merging.

4. Machine description.

   John's simplification of MLTREE_BASIS.fcond broke a few machine
   description things:

       rtl-build.{sig,sml} and hppa.mdl fixed.

   NOTE: the machine description stuff in the repository is still
   broken.  Again, I can't put my fixes in because that involves
   changes to MLTREE.
---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/20 15:55:00 EST Tag: blume-20020320-kmp Description: Implemented Knuth-Morris-Pratt string matching in PreString and used it for String.isSubstring, Substring.isSubstring, and Substring.position. (Might need some stress-testing. Simple examples worked fine.) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/19 16:37:00 EST Tag: blume-20020319-witnesses Description: Added a structure C.W and functions convert/Ptr.convert to ml-nlffi-lib. This implements a generic mechanism for changing constness qualifiers anywhere within big C types without resorting to outright "casts". (So far, functions such as C.rw/C.ro or C.Ptr.rw/C.Ptr.ro only let you modify the constness at the outermost level.) The implementation of "convert" is based on the idea of "witness" values -- values that are not used by the operation but whose types "testify" to their applicability. On the implementation side, "convert" is simply a projection (returning its second curried argument). With cross-module inlining, it should not result in any machine code being generated. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/15 16:40:00 EST Tag: blume-20020315-basis Description: Provided (preliminary?) implementations for {String,Substring}.{concatWith,isSuffix,isSubstring} and Substring.full Those are in the Basis spec but they were missing in SML/NJ. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/14 21:30:00 EST Tag: blume-20020314-controls Description: Controls: --------- 1. Factored out the recently-added Controls : CONTROLS stuff and put it into its own library $/controls-lib.cm. The source tree for this is under src/smlnj-lib/Controls. 2. 
Changed the names of types and functions in this interface, so they make a bit more "sense": module -> registry 'a registry -> 'a group 3. The interface now deals in ref cells only. The getter/setter interface is (mostly) gone. 4. Added a function that lets one register an already-existing ref cell. 5. Made the corresponding modifications to the rest of the code so that everything compiles again. 6. Changed the implementation of Controls.MLRISC back to something closer to the original. In particular, this module (and therefore MLRISC) does not depend on Controls. There now is some link-time code in int-sys.sml that registers the MLRISC controls with the Controls module. CM: --- * One can now specify the lambda-split aggressiveness in init.cmi. ---------------------------------------------------------------------- Name: Allen Leung Date: 2002/03/13 17:30:00 EST Tag: leunga-20020313-x86-fp-unary Description: Bug fix for: > leunga@weaselbane:~/Yale/tmp/sml-dist{21} bin/sml > Standard ML of New Jersey v110.39.1 [FLINT v1.5], March 08, 2002 > - fun f(x,(y,z)) = Real.~ y; > [autoloading] > [autoloading done] > fchsl (%eax), 184(%esp) > Error: MLRisc bug: X86MCEmitter.emitInstr > > uncaught exception Error > raised at: ../MLRISC/control/mlriscErrormsg.sml:16.14-16.19 The problem was that the code generator did not generate any fp registers in this case, and the ra didn't know that it needed to run the X86FP phase to translate the pseudo fp instruction. This only happened with unary fp operators in certain situations. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/13 14:00:00 EST Tag: blume-20020313-overload-etc Description: 1. Added _overload as a synonym for overload for backward compatibility. (Control.overloadKW must be true for either version to be accepted.) 2. Fixed bug in install script that caused more things to be installed than what was requested in config/targets. 3. 
Made CM aware of the (_)overload construct so that
autoloading works.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/12 22:03:00 EST
Tag: blume-20020312-url
Description:

Forgot to update BOOT and srcarchiveurl.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/12 17:30:00 EST
Tag: blume-20020312-version110392
Description:

Yet another version number bump (because of small changes to the
binfile format). Version number is now 110.39.2. NEW BOOTFILES!

Changes:

  The new pid generation scheme described a few weeks ago was overly
  complicated. I implemented a new mechanism that is simpler and
  provides a bit more "stability": Once CM has seen a compilation
  unit, it keeps its identity constant (as long as you do not delete
  those crucial CM/GUID/* files). This means that when you change
  an interface, compile, then go back to the old interface, and
  compile again, you arrive at the original pid.

  There now also is a mechanism that instructs CM to use the plain
  environment hash as a module's pid (effectively making its GUID
  the empty string). For this, "noguid" must be specified as an
  option to the .sml file in question within its .cm file.
  This is most useful for code that is being generated by tools such
  as ml-nlffigen (because during development programmers tend to
  erase the tool's entire output directory tree including CM's cached
  GUIDs). "noguid" is somewhat dangerous (since it can be used to
  locally revert to the old, broken behavior of SML/NJ), but in
  specific cases where there is no danger of interface confusion, its
  use is ok (I think).

  ml-nlffigen by default generates "noguid" annotations. They can be
  turned off by specifying -guid in its command line.
----------------------------------------------------------------------
Name: Lal George
Date: 2002/03/12 12 14:42:36 EST
Tag: george-20020312-frequency-computation
Description:

Integrated jump chaining and static block frequency into the
compiler. More details and numbers later.

----------------------------------------------------------------------
Name: Lal George
Date: 2002/03/11 11 22:38:53 EST
Tag: george-20020311-jump-chain-elim
Description:

Tested the jump chain elimination on all architectures (except the
hppa). This is on by default right now and is profitable for the
alpha and x86; however, it may not be profitable for the sparc and ppc
when compiling the compiler.

The gc test will typically jump to a label at the end of the cluster,
where there is another jump to an external cluster containing the actual
code to invoke gc. This is to allow factoring of common gc invocation
sequences. That is to say, we generate:

    f:
        testgc
        ja   L1     % jump if above to L1

    L1:
        jmp  L2

After jump chain elimination the 'ja L1' instruction is converted to
'ja L2'. On the sparc and ppc, many of the 'ja L2' instructions may end
up being implemented in their long form (if L2 is far away) using:

        jbe  L3     % jump if below or equal to L3
        jmp  L2
    L3:
        ...

For large compilation units L2 may be far away.

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/03/11 13:30:00 EST
Tag: blume-20020311-mltreeeval
Description:

A functor parameter was missing.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/03/11 10:30:00 EST
Tag: leunga-20020311-runtime-string0
Description:

The representation of the empty string now points to a
legal null-terminated C string instead of unit. It is now possible
to convert an ML string into a C string with InlineT.CharVector.getData.
This compiles into one single machine instruction.
---------------------------------------------------------------------- Name: Allen Leung Date: 2002/03/10 23:55:00 EST Tag: leunga-20020310-x86-call Description: Added machine generation for CALL instruction (relative displacement mode) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/08 16:05:00 Tag: blume-20020308-entrypoints Description: Version number bumped to 110.39.1. NEW BOOTFILES! Entrypoints: non-zero offset into a code object where execution should begin. - Added the notion of an entrypoint to CodeObj. - Added reading/writing of entrypoint info to Binfile. - Made runtime system bootloader aware of entrypoints. - Use the address of the label of the first function given to mlriscGen as the entrypoint. This address is currently always 0, but it will not be 0 once we turn on block placement. - Removed the linkage cluster code (which was The Other Way(tm) of dealing with entry points) from mlriscGen. ---------------------------------------------------------------------- Name: Allen Leung Date: 2002/03/07 20:45:00 EST Tag: leunga-20020307-x86-cmov Description: Bug fixes for CMOVcc on x86. 1. Added machine code generation for CMOVcc 2. CMOVcc is now generated in preference over SETcc on PentiumPro or above. 3. CMOVcc cannot have an immediate operand as argument. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/03/07 16:15:00 EST Tag: blume-20020307-controls Description: This is a very large but mostly boring patch which makes (almost) every tuneable compiler knob (i.e., pretty much everything under Control.* plus a few other things) configurable via both the command line and environment variables in the style CM did its configuration until now. Try starting sml with '-h' (or, if you are brave, '-H') To this end, I added a structure Controls : CONTROLS to smlnj-lib.cm which implements the underlying generic mechanism. 
The interface to some of the existing such facilities has changed somewhat. For example, the MLRiscControl module now provides mkFoo instead of getFoo. (The getFoo interface is still there for backward-compatibility, but its use is deprecated.) The ml-build script passes -Cxxx=yyy command-line arguments through so that one can now twiddle the compiler settings when using this "batch" compiler. TODO items: We should go through and throw out all controls that are no longer connected to anything. Moreover, we should go through and provide meaningful (and correct!) documentation strings for those controls that still are connected. Currently, multiple calls to Controls.new are accepted (only the first has any effect). Eventually we should make sure that every control is being made (via Controls.new) exactly once. Future access can then be done using Controls.acc. Finally, it would probably be a good idea to use the getter-setter interface to controls rather than ref cells. For the time being, both styles are provided by the Controls module, but getter-setter pairs are better if thread-safety is of any concern because they can be wrapped. ***************************************** One bug fix: The function blockPlacement in three of the MLRISC backpatch files used to be hard-wired to one of two possibilities at link time (according to the value of the placementFlag). But (I think) it should rather sense the flag every time. ***************************************** Other assorted changes (by other people who did not supply a HISTORY entry): 1. the cross-module inliner now works much better (Monnier) 2. 
representation of weights, frequencies, and probabilities in MLRISC
changed in preparation of using those for weighted block placement
(Reppy, George)

----------------------------------------------------------------------
Name: Lal George
Date: 2002/03/07 14:44:24 EST 2002
Tag: george-20020307-weighted-block-placement

Tested the weighted block placement optimization on all architectures
(except the hppa) using AMPL to generate the block and edge frequencies.
Changes were required in the machine properties to correctly
categorize trap instructions. There is an MLRISC flag
"weighted-block-placement" that can be used to enable weighted block
placement, but this will be ineffective without block/edge
frequencies (coming soon).

----------------------------------------------------------------------
Name: Lal George
Date: 2002/03/05 17:24:48 EST
Tag: george-20020305-linkage-cluster

In order to support the block placement optimization, a new cluster
is generated as the very first cluster (called the linkage cluster).
It contains a single jump to the 'real' entry point for the compilation
unit. Block placement has no effect on the linkage cluster itself, but
all the other clusters have full freedom in the manner in which they
reorder blocks or functions.

On the x86 the typical linkage code that is generated is:

   ----------------------
        .align 2
   L0:
        addl    $L1-L0, 72(%esp)
        jmp     L1

        .align  2
   L1:
   ----------------------

72(%esp) is the memory location for the stdlink register. This
must contain the address of the CPS function being called.
In the above example, it contains the address of L0; before
calling L1 (the real entry point for the compilation unit), it must
contain the address for L1, and hence

        addl $L1-L0, 72(%esp)

I have tested this on all architectures except the hppa. The increase
in code size is of course negligible.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/03/03 13:20:00 EST
Tag: leunga-20020303-mlrisc-tools

Added #[ ... ] expressions to mlrisc tools

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/02/27 12:29:00 EST
Tag: blume-20020227-cdebug
Description:

- made types in structure C and C_Debug to be equal
- got rid of code duplication (c-int.sml vs. c-int-debug.sml)
- there no longer is a C_Int_Debug (C_Debug is directly derived from C)

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/02/26 12:00:00 EST
Tag: blume-20020226-ffi
Description:

1. Fixed a minor bug in CM's "noweb" tool:
   If numbering is turned off, then truly don't number (i.e., do not
   supply the -L option to noweb). The previous behavior was to supply
   -L'' -- which caused noweb to use the "default" line numbering scheme.
   Thanks to Chris Richards for pointing this out (and supplying the fix).

2. Once again, I reworked some aspects of the FFI:

   A. The incomplete/complete type business:

   - Signatures POINTER_TO_INCOMPLETE_TYPE and accompanying functors are
     gone!
   - ML types representing an incomplete type are now *equal* to
     ML types representing their corresponding complete types (just like
     in C). This is still safe because ml-nlffigen will not generate
     RTTI for incomplete types, nor will it generate functions that
     require access to such RTTI. But when ML code generated from both
     incomplete and complete versions of the C type meet, the ML types
     are trivially interoperable.
     NOTE: These changes restore the full generality of the translation
     (which was previously lost when I eliminated functorization)!

   B. Enum types:

   - Structure C now has a type constructor "enum" that is similar to
     how the "su" constructor works. However, "enum" is not a phantom
     type because each "T enum" has values (and is isomorphic to
     MLRep.Signed.int).
   - There are generic access operations for enum objects (using
     MLRep.Signed.int).
   - ml-nlffigen will generate a structure E_foo for each "enum foo".
     * The structure contains the definition of type "mlrep" (the ML-side
       representation type of the enum). Normally, mlrep is the same
       as "MLRep.Signed.int", but if ml-nlffigen was invoked with "-ec",
       then mlrep will be defined as a datatype -- thus facilitating
       pattern matching on mlrep values.
       ("-ec" will be suppressed if there are duplicate values in an
        enumeration.)
     * Constructors ("-ec") or values (no "-ec") e_xxx of type mlrep
       will be generated for each C enum constant xxx.
     * Conversion functions m2i and i2m convert between mlrep and
       MLRep.Signed.int. (Without "-ec", these functions are identities.)
     * Conversion functions c and ml convert between mlrep and "tag enum".
     * Access functions (get/set) fetch and store mlrep values.
   - By default (unless ml-nlffigen was invoked with "-nocollect"), unnamed
     enumerations are merged into one single enumeration represented by
     structure E_'.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/02/25 04:45:00 EST
Tag: leunga-20020225-cps-spill

This is a new implementation of the CPS spill phase.
The new phase is in the new file compiler/CodeGen/cpscompile/spill-new.sml
In case of problems, replace it with the old file spill.sml

The current compiler runs into some serious performance problems when
constructing a large record. This can happen when we try to compile a
structure with many items. Even a very simple structure like the following
makes the compiler slow down.
    structure Foo = struct
       val x_1 = 0w1 : Word32.word
       val x_2 = 0w2 : Word32.word
       val x_3 = 0w3 : Word32.word
       ...
       val x_N = 0wN : Word32.word
    end

The following table shows the compile time, from N=1000 to N=4000,
with the old compiler:

    N
    1000   CPS 100 spill       0.04u      0.00s     0.00g
           MLRISC ra           0.06u      0.00s     0.05g
              (spills = 0 reloads = 0)
           TOTAL               0.63u      0.07s     0.21g
    1100   CPS 100 spill       8.25u      0.32s     0.64g
           MLRISC ra           5.68u      0.59s     3.93g
              (spills = 0 reloads = 0)
           TOTAL              14.71u      0.99s     4.81g
    1500   CPS 100 spill      58.55u      2.34s     1.74g
           MLRISC ra           5.54u      0.65s     3.91g
              (spills = 543 reloads = 1082)
           TOTAL              65.40u      3.13s     6.00g
    2000   CPS 100 spill     126.69u      4.84s     3.08g
           MLRISC ra           0.80u      0.10s     0.55g
              (spills = 42 reloads = 84)
           TOTAL             129.42u      5.10s     4.13g
    3000   CPS 100 spill     675.59u     19.03s    11.64g
           MLRISC ra           2.69u      0.27s     1.38g
              (spills = 62 reloads = 124)
           TOTAL             682.48u     19.61s    13.99g
    4000   CPS 100 spill    2362.82u     56.28s    43.60g
           MLRISC ra           4.96u      0.27s     2.72g
              (spills = 85 reloads = 170)
           TOTAL            2375.26u     57.21s    48.00g

As you can see the old cps spill module suffers from some serious
performance problems. But since I cannot decipher the old code fully,
instead of patching the problems up, I'm reimplementing it
with a different algorithm. The new code is more modular,
smaller when compiled, and substantially faster
(O(n log n) time and O(n) space). Timing of the new spill module:

    4000   CPS 100 spill       0.02u      0.00s     0.00g
           MLRISC ra           0.25u      0.02s     0.15g
              (spills=1 reloads=3)
           TOTAL               7.74u      0.34s     1.62g

Implementation details:

As far as I can tell, the purpose of the CPS spill module is to make sure the
number of live variables at any program point (the bandwidth)
does not exceed a certain limit, which is determined by the
size of the spill area.

When the bandwidth is too large, we decrease the register pressure by
packing live variables into spill records. How we achieve this is
completely different than what we did in the old code.

First, there is something about the MLRiscGen code generator
that we should be aware of:

o MLRiscGen performs code motion!
   In particular, it will move floating point computations and
   address computations involving only the heap pointer to
   their use sites (if there is only a single use).
   What this means is that if we have a CPS record construction
   statement

       RECORD(k,vl,w,e)

   we should never count the new record address w as live if w
   has only one use (which is often the case).

   We should do something similar to floating point, but the transformation
   there is much more complex, so I won't deal with that.

Secondly, there are now two new cps primops at our disposal:

 1. rawrecord of record_kind option
    This pure operator allocates some uninitialized storage from the heap.
    There are two forms:

     rawrecord NONE [INT n]  allocates a tagless record of length n
     rawrecord (SOME rk) [INT n] allocates a tagged record of length n
                                 and initializes the tag.

 2. rawupdate of cty
      rawupdate cty (v,i,x)
      Assigns x to the ith component of record v.
      The storelist is not updated.

We use these new primops for both spilling and incremental record construction.

 1. Spilling.

    This is implemented with a linear scan algorithm (but generalized
    to trees). The algorithm will create a single spill record at the
    beginning of the cps function and use rawupdate to spill to it,
    and SELECT or SELp to reload from it. So both spills and reloads
    are fine-grain operations. In contrast, in the old algorithm
    "spills" have to be bundled together in records.

    Ideally, we should sink the spill record construction to where
    it is needed. We can even split the spill record into multiple ones
    at the places where they are needed. But CPS is not a good
    representation for global code motion, so I'll keep it simple and
    am not attempting this.

 2. Incremental record construction (aka record splitting).

    Long records with many component values which are simultaneously live
    (recall that single use record addresses are not considered to
     be live) are constructed with rawrecord and rawupdate.
    We allocate space on the heap with rawrecord first, then gradually
    fill it in with rawupdate. This is the technique suggested to me
    by Matthias.

    Some restrictions on when this is applicable:
    1. It is not a VECTOR record. The code generator currently does not handle
       this case. VECTOR record uses double indirection like arrays.
    2. All the record component values are defined in the same "basic block"
       as the record constructor. This is to prevent speculative
       record construction.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/02/22 01:02:00 EST
Tag: leunga-20020222-mlrisc-tools

Minor bug fixes in the parser and rewriter

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/02/21 20:20:00 EST
Tag: leunga-20020221-peephole

Regenerated the peephole files. Some contained typos in the specification
and some didn't compile because of pretty printing bugs in the old version
of 'nowhere'.

----------------------------------------------------------------------
Name: Allen Leung
Date: 2002/02/19 20:20:00 EST
Tag: leunga-20020219-mlrisc-tools
Description:

Minor bug fixes to the mlrisc-tools library:

1. Fixed up parsing colon suffixed keywords
2. Added the ability to shut the error messages up
3. Reimplemented the pretty printer and fixed up/improved
   the pretty printing of handle and -> types.
4. Fixed up generation of literal symbols in the nowhere tool.
5. Added some SML keywords to sml.sty

----------------------------------------------------------------------
Name: Matthias Blume
Date: 2002/02/19 16:20:00 EST
Tag: blume-20020219-cmffi
Description:

A wild mix of changes, some minor, some major:

* All C FFI-related libraries are now anchored under $c:
    $/c.cm      --> $c/c.cm
    $/c-int.cm  --> $c/internals/c-int.cm
    $/memory.cm --> $c/memory/memory.cm

* "make" tool (in CM) now treats its argument pathname slightly
  differently:
    1.
If the native expansion is an absolute name, then before invoking the "make" command on it, CM will apply OS.Path.mkRelative (with relativeTo = OS.FileSys.getDir()) to it. 2. The argument will be passed through to subsequent phases of CM processing without "going native". In particular, if the argument was an anchored path, then "make" will not lose track of that anchor. * Compiler backends now "know" their respective C calling conventions instead of having to be told about it by ml-nlffigen. This relieves ml-nlffigen from one of its burdens. * The X86Backend has been split into X86CCallBackend and X86StdCallBackend. * Export C_DEBUG and C_Debug from $c/c.cm. * C type encoding in ml-nlffi-lib has been improved to model the conceptual subtyping relationship between incomplete pointers and their complete counterparts. For this, ('t, 'c) ptr has been changed to 'o ptr -- with the convention of instantiating 'o with ('t, 'c) obj whenever the pointer target type is complete. In the incomplete case, 'o will be instantiated with some "'c iobj" -- a type obtained by using one of the functors PointerToIncompleteType or PointerToCompleteType. Operations that work on both incomplete and complete pointer types are typed as taking an 'o ptr while operations that require the target to be known are typed as taking some ('t, 'c) obj ptr. voidptr is now a bit "more concrete", namely "type voidptr = void ptr'" where void is an eqtype without any values. This makes it possible to work on voidptr values using functions meant to operate on light incomplete pointers. * As a result of the above, signature POINTER_TO_INCOMPLETE_TYPE has been vastly simplified. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/02/19 10:48:00 EST Tag: blume-20020219-pqfix Description: Applied Chris Okasaki's bug fix for priority queues. 
---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/02/15 17:05:00 Tag: Release_110_39 Description: Last-minute retagging is becoming a tradition... :-( This is the working release 110.39. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/02/15 16:00:00 EST Tag: Release_110_39-orig Description: Working release 110.39. New bootfiles. (Update: There was a small bug in the installer so it wouldn't work with all shells. So I retagged. -Matthias) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/02/15 14:17:00 EST Tag: blume-20020215-showbindings Description: Added EnvRef.listBoundSymbols and CM.State.showBindings. Especially the latter can be useful for exploring what bindings are available at the interactive prompt. (The first function returns only the list of symbols that are really bound, the second prints those but also the ones that CM's autoloading mechanism knows about.) ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/02/15 12:08:00 EST Tag: blume-20020215-iptrs Description: Two improvements to ml-nlffigen: 1. Write files only if they do not exist or if their current contents do not coincide with what's being written. (That is, avoid messing with the time stamps unless absolutely necessary.) 2. Implement a "repository" mechanism for generated files related to "incomplete pointer types". See the README file for details. ---------------------------------------------------------------------- Name: Matthias Blume Date: 2002/02/14 11:50:00 EST Tag: blume-20020214-quote Description: Added a type 't t_' to tag.sml (in ml-nlffi-lib.cm). This is required because of the new and improved tag generation scheme. (Thanks to Allen Leung for pointing it out.) 
---------------------------------------------------------------------- Name: Lal George Date: 2002/02/14 09:55:27 EST 2002 Tag: george-20020214-isabelle-bug Description: Fixed the MLRISC bug sent by Ma