cdesktopenv

Commit Graph

Author	SHA1	Message	Date
Martijn Dekker	2ecc2575d5	Fix import of float attribute/value from environment (re: `960a1a99`) Bug 1: as of `960a1a99`, floating point literals were no longer recognised when importing variables from the environment. The attribute was still imported but the value reverted to zero: $ (export LC_NUMERIC=C; typeset -xF5 num=7.75; \ ksh -c 'typeset -p num') typeset -x -F 5 num=0.00000 Bug 2 (inherited from 93u+): The code that imported variable attributes from the environment only checked '.' to determine whether the float attribute should be set. It should check the current radix point instead. $ (export LC_NUMERIC=debug; typeset -xF5 num=7,75; \ ksh -c 'typeset -p num') typeset -x -i num=0 ...or, after fixing bug 1 only, the output is: typeset -x -i num=75000 src/cmd/ksh93/sh/arith.c: sh_strnum(): - When importing untrusted env vars at init time, handle not only "base#value" literals using strtonll, but also floating point literals using strtold. This fixes the bug without reallowing arbitary expressions. (re: `960a1a99`) - When not initialising, use sh.radixpoint (see `f0386a87`) instead of '.' to help decide whether to evaluate an arith expression. src/cmd/ksh93/sh/init.c: env_import_attributes(): - Use sh.radixpoint instead of '.' to check for a decimal fraction. (This code is needed because doubles are exported as integers for ksh88 compatibility; see attstore() in name.c.)	2022-06-03 12:18:54 +01:00
Martijn Dekker	f0386a8794	Improve radix point (decimal point/comma) handling The GETDECIMAL(x) macro in src/cmd/ksh93/features/locale uses a global static variable 'lp' to get the localeconv() results. Several functions in ksh use local variables called 'lp'. It's dumb luck that this hasn't conflicted yet; it's a bug waiting to happen. It's also slightly inefficient as it calls localeconv() every time. In addition there is a redundant 'sh.decomma' flag that is set to 1 if the radix point is a comma, but only once, by sh_init(). It is not updated if the locale changes. That is not correct. This commit gets rid of both and implements a new approach instead: store the radix point in sh.radixpoint at init time in sh_init(), and in the put_lang() locale discipline function, reinitialise sh.radixpoint whenever LC_ALL or LC_NUMERIC changes. The rest of the code can then simply use sh.radixpoint without worry.	2022-06-03 12:16:44 +01:00
Martijn Dekker	6dbde5ec7c	Correctly exit from namespace on error (re: `f73b8617`) The referenced commit introduced at least one bug: EXIT traps were not triggered if a special builtin threw an error in a namespace that is within a virtual subshell. src/cmd/ksh93/sh/xec.c: sh_exec(): TNSPACE: - When an error occurs, siglongjmp to the previous saved environment; it is not correct to sh_exit directly. src/cmd/ksh93/tests/namespace.sh: - Remove the forkign workaround and the TODO where I incorrectly blamed this problem on the virtual subshell mechanism.	2022-06-02 03:28:41 +01:00
Martijn Dekker	ea300089a1	New feature: 'typeset -g' as in bash 4.2+ typeset -g allows directly manipulating the attributes of variables at the global level from any context. This feature already exists on bash 4.2 and later. mksh (R50+), yash and zsh have this flag as well, but it's slightly different: it ignores the current local scope, but a parent local scope from a calling function may still be used -- whereas on bash, '-g' always refers to the global scope. Since ksh93 uses static scoping (see III.Q28 at <http://kornshell.com/doc/faq.html>), only the bash behaviour makes sense here. Note that the implementation needs to be done both in nv_setlist() (name.c) and in b_typeset() (typeset.c) because assignments are executed before the typeset built-in itself. Hence also the pre-parsing of typeset options in sh_exec(). src/cmd/ksh93/include/nval.h: - Add new NV_GLOBAL bit flag, using a previously unused bit that still falls within the 32-bit integer range. src/cmd/ksh93/sh/xec.c: sh_exec(): - When pre-parsing typeset flags, make -g pass the NV_GLOBAL flag to the nv_setlist() call that processes shell assignments prior to running the command. src/cmd/ksh93/sh/name.c: nv_setlist(): - When the NV_GLOBAL bit flag is passed, save the current variable tree pointer (sh.var_tree) as well as the current namespace (sh.namespace) and temporarily set the former to the global variable tree (sh.var_base) and the latter to NULL. This makes assignments global and ignores namesapces. src/cmd/ksh93/bltins/typeset.c: - b_typeset(): - Use NV_GLOBAL bit flag for -g. - Allow combining -n with -g, permitting 'typeset -gn var' or 'nameref -g var' to create a global nameref from a function. - Do not allow a nonsensical use of -g when using nested typeset to define member variables of 'typeset -T' types. (Type method functions can still use it as normal.) - setall(): - If NV_GLOBAL is passed, use sh.var_base and deactivate sh.namespace as in nv_setlist(). This allows attributes to be set correctly for global variables. src/cmd/ksh93/tests/{functions,namespace}.sh: - Add regression tests based on reproducers for problems found by @hyenias in preliminary versions of this feature. Resolves: https://github.com/ksh93/ksh/issues/479	2022-06-01 21:07:01 +01:00
Martijn Dekker	f73b8617dd	Restore namespace's parent scope when exiting due to error Reproducer: $ namespace test { x=123; typeset -g x=456; } $ echo $x ${.test.x} 456 123 $ namespace test { typeset -Q; } arch/darwin.i386-64/bin/ksh: typeset: -Q: unknown option [usage message snipped for brevity] $ echo $x ${.test.x} 123 123 <== expected: 123 456 $ x=789 $ echo $x ${.test.x} 789 789 <== expected: 789 456 $ # look at that, we never left the namespace... When prefixing the erroneous 'typeset' with 'command', the problem does not occur. 'command' disables the properties of special built-ins such as exit on error. So, when a special built-in exits on error, the parent scope is not properly resotred. This bug exists in every ksh93 version with SHOPT_NAMESPACE so far. src/cmd/ksh93/sh/xec.c: sh_exec(): - Before entering a namespace, use sh_pushcontext and sigsetjmp to make sure we return here if sh_exit() is called, e.g. when a special builtin throws an error, to ensure the parent scope (oldnspace) is restored. Thanks to @hyenias for making me aware of this bug. Discussion: https://github.com/ksh93/ksh/issues/479#issuecomment-1140468965	2022-05-29 23:05:03 +01:00
Martijn Dekker	8f14514661	set --default: properly restore ksh IFS behaviour (re: `9e2a8c69`) Reproducer: $ (IFS=$'\t\t'; val=$'\tone\t\ttwo\t'; set --posix; \ set -- $val; echo $#; set --noposix; set -- $val; echo $#) 2 4 <== OK $ (IFS=$'\t\t'; val=$'\tone\t\ttwo\t'; set --posix; \ set -- $val; echo $#; set --default; set -- $val; echo $#) 2 2 <== bug The output of the seconnd command line should be like the first. When POSIX mode is turned off using 'set --noposix' (or 'set +o posix'), sh.ifstable is invalidated as it needs to be repopulated on the next field split to restore ksh-specific special handling of a repeated $IFS whitespace character as non-whitespace. However, when 'set --default' is used, this does not happen, which is a bug. src/cmd/ksh93/sh/args.c: sh_argopts(): - While processing --default, when turning off SH_POSIX, call sh_invalidate_ifs() to invalidate sh.ifstable.	2022-05-28 00:13:46 +01:00
Martijn Dekker	83baa27ef9	Fix incorrect typeset -L/-R/-Z on input with spaces (re: `bdb99741`) The typeset output for -L/-R/-Z seems to be wrong when the input has leading/trailing spaces. This started occurring after the dynamic buffer size changes introduced in name.c as part of the fix for <https://github.com/ksh93/ksh/issues/142>. Test script: typeset -L8 s_date1=" 22/02/09 08:25:01"; echo "$s_date1" typeset -R10 s_date1="22/02/09 08:25:01 "; echo "$s_date1" typeset -Z10 s_date1="22/02/09 08:25:01 "; echo "$s_date1" Actual output: 22/02/0 08:25:01 0008:25:01 Expected output: 22/02/09 9 08:25:01 9 08:25:01 src/cmd/ksh93/sh/name.c: nv_newattr(): - Simplify allocation code, replacing the earlier dynamic buffer size calculation with just the greater of the strlen and size. Resolves: https://github.com/ksh93/ksh/issues/476 Co-authored-by: George Lijo <george.lijo@gmail.com>	2022-05-26 00:08:45 +01:00
Martijn Dekker	c2fad38bf8	Fix 'cd -e' regress fails on some UNIXen (re: `e6989853`, `b398f33c`) On some systems, including at least Solaris 10.1 and QNX 6.5.0, the regression tests below occurred. This is because, on these systems, 'cd .' always fails with 'no such file or directory', regardless of flags, if the present working directory no longer exists. This is a legitimate variation in UNIX-like systems so the tests should be compatible. test builtins begins at 2022-05-22+13:08:28 /usr/local/src/ksh/src/cmd/ksh93/tests/builtins.sh[1499]: cd: .: [No such file or directory] builtins.sh[1501]: FAIL: cd -P without -e exits with error status if $PWD doesn't exist (expected 0, got 1) /usr/local/src/ksh/src/cmd/ksh93/tests/builtins.sh[1504]: cd: .: [No such file or directory] builtins.sh[1506]: FAIL: cd -eP doesn't fail if $PWD doesn't exist (expected 1, got 2) test builtins failed at 2022-05-22+13:08:37 with exit code 2 [ 287 tests 2 errors ] src/cmd/ksh93/tests/builtins.sh: - Delete the 'cd -P .' test for a nonexistent PWD. - For the 'cd -eP .' test for a nonexistent PWD, redirect standard error to /dev/null and also accept exit status 2, which we would expect with the '-e' flag if a 'no such file or directory' error is thrown.	2022-05-22 12:49:42 +01:00
vmihalko	2b27f63d64	Fix build with gcc 12 on Fedora 36 (#478 ) Due to missing or incorrectly positioned links with the m library (-lm flag), ksh failed to build with gcc 12 on Fedora 36.	2022-05-22 12:06:36 +01:00
atheik	9bed28c3f9	Fix line continuation within command substitutions In command substitutions of the $(standard) and ${ shared state; } form, backslash line continuation is broken. Reproducer: echo $( echo one two\ three ) Actual output (ksh93, all versions): one two\ three Expected output (every other shell, POSIX spec): one twothree src/cmd/ksh93/sh/lex.c: sh_lex(): case S_REG: - Do not skip new-line joining if we're currently processing a command substitution of one of these forms (i.e., if the lp->lexd.dolparen level is > 0). Background info/analysis: comsub() is called from sh_lex() when S_PAR is the current state. In src/cmd/ksh93/data/lexstates.c, we see that S_PAR is reached in the ST_DOL state table at index 40. Decimal 40 is ( in ASCII. So, the previous skipping of characters was done according to the ST_DOL state table, and the character that stopped it was (. This means we have $(. Alternatively, comsub() may be called from sh_lex() by jumping to the do_comsub label. In brief, that is the case when we have ${. Regardless of which it is from the two, comsub() is now called from sh_lex(). In comsub(), lp->lexd.dolparen is incremented at the beginning and decremented at the end. Between them, we see that sh_lex() is called. So, lp->lexd.dolparen in sh_lex() indicates the depth of nesting $( or ${ statements we're in. Thus, it is also the number of comsub() invocations seen in a backtrace taken in sh_lex(). The codepath for `...` is different (and never had this bug). Co-authored by: Martijn Dekker <martijn@inlv.org> Resolves: https://github.com/ksh93/ksh/issues/367	2022-05-22 00:23:54 +01:00
atheik	40a5c45b48	Allow double quotes within backtick comsub within double quotes The following reproducer causes a spurious syntax error: foo="`: "("`" The nested double quotes are not recognised correctly, causing a syntax error at the '('. Removing the outer double quotes (which are unnecessary) is a workaround, but it's still a bug as every other shell accepts this. This bug has been present since the original Bourne shell. src/cmd/ksh93/sh/lex.c: sh_lex(): case S_QUOTE: - If the current character is '"' and we're in a `...` command substitution (ingrave is true), then do not switch to the old mode but keep using the ST_QUOTE state table. Thanks to @JohnoKing for the report and to @atheik for the fix. Co-authored by: Martijn Dekker <martijn@inlv.org> Resolves: https://github.com/ksh93/ksh/issues/352	2022-05-20 22:48:47 +01:00
Trey Valenta	c86713455c	Fix typo "HOSTYPE" in package.sh documentation (#477 ) This commit fixes typos in the documentation of: - src/cmd/INIT/package.sh: HOSTYPE is corrected as HOSTTYPE	2022-04-30 13:44:09 +02:00
Martijn Dekker	b52edb380c	edit: avoid potential crash with overlapping strings In vi.c, there is a potential use of strcpy(3) on overlapping strings, which is undefined behaviour: > SHOPT_MULTIBYTE == 0 > > # define gencpy(a,b) strcpy((char)(a),(char)(b)) > > . > . > . > > if( mode != 'y' ) > { > gencpy(cp,cp+nchars); Thanks to Heiko Berges for the report. src/cmd/ksh93/edit/{edit,emacs,vi}.c: - Change almost all use of strcpy(3) to libast strcopy(), which is a simple implementation that does not have a problem with overlapping strings. Note that the return value is different (it returns a pointer to the terminating '\0') but that is not relevant in any of these cases. - Same for strncpy(3) => libast strncopy(). src/lib/libast/string/strcopy.c: - Backport a couple of cosmetic tweaks from the 93v- beta.	2022-04-21 03:03:04 +02:00
Martijn Dekker	41db60c6be	Restore build on QNX Neutrino 6.5.0 (re: a874f8c1) The QNX system at polarhome.com seems to be back up, at least temporarily, though polarhome has officially shut down. This allowed me to test ksh on QNX again, discovering that the build was broken since reworking the standards macros handling. src/lib/libast/features/standards: - On QNX, define _QNX_SOURCE=1 to enable all extensions and _FILE_OFFSET_BITS=64 to enable large file support with standard library function names.	2022-04-21 02:43:08 +02:00
atheik	86b94d9feb	libast: optget(3): Fix memory leak in --help/--man info src/lib/libast/misc/optget.c: textout(): case ']': - Before returning, call pop() to free any \f...\f info items that are left. Note that this is a safe no-op if the pointer is null. Resolves: https://github.com/ksh93/ksh/issues/407 Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-03-11 21:24:08 +01:00
Martijn Dekker	fd28da31da	Fix another test/[ corner case bug; add --posix test script This fixes another corner case bug in the horror show that is the test/[ comand. Reproducer: $ ksh --posix -c 'test X -a -n' ksh: test: argument expected Every other shell returns 0 (success) as, POSIXly, this is a test for the strings 'X' and '-n' both being non-empty, combined with the binary -a (logical and) operator. Instead, '-n' was taken as a unary primary operator with a missing argument, which is incorrect. POSIX reference: https://pubs.opengroup.org/onlinepubs/9699919799/utilities/test.html > 3 arguments: > * If $2 is a binary primary, perform the binary test of $1 and $3. src/cmd/ksh93/bltins/test.c: - e3(): If the final argument begins with a dash, always treat it as a test for a non-empty string, therefore return true. Do not limit this to "new flags" only. src/cmd/ksh93/tests/posix.sh: - Added. These are tests for every aspect of the POSIX mode.	2022-03-11 21:23:45 +01:00
Martijn Dekker	9e2a8c6925	posix mode: disable effect of repeating whitespace char in $IFS ksh has a little-known field splitting feature that conflicts with POSIX: if a single-byte whitespace character (cf. isspace(3)) is repated in $IFS, then field splitting is done as if that character wasn't a whitespace character. An exmaple with the tab character: $ (IFS=$'\t'; val=$'\tone\t\ttwo\t'; set -- $val; echo $#) 2 $ (IFS=$'\t\t'; val=$'\tone\t\ttwo\t'; set -- $val; echo $#) 4 The latter being the same as, for example $ (IFS=':'; val='1️⃣2️⃣'; set -- $val; echo $#) 4 However, this is incompatible with the POSIX spec and with every other shell except zsh, in which repeating a character in IFS does not have any effect. So the POSIX mode must disable this. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/sh/init.c: - Add sh_invalidate_ifs() function that invalidates the IFS state table by setting the ifsnp discipline struct member to NULL, which will cause the next get_ifs() call to regenerate it. - get_ifs(): Treat a repeated char as S_DELIM even if whitespace, unless --posix is on. src/cmd/ksh93/sh/args.c: - sh_argopts(): Call sh_invalidate_ifs() when enabling or disabling the POSIX option. This is needed to make the change in field splitting behaviour take immediate effect instead of taking effect at the next assignment to IFS.	2022-03-11 21:22:22 +01:00
Martijn Dekker	fae1932e62	enum: remove arbitrary one-argument limitation b_enum() contains a check that exactly one argument is given: 237: if (error_info.errors \|\| !argv \|\| (argv + 1)) But the subsequent argument handling loop will happily deal with multiple arguments: 246: while(cp = argv++) Every other declaration command supports multiple arguments and I see no reason why enum shouldn't. Simply removing the '(argv + 1)' check allows 'enum' to create more than one type per invocation. src/cmd/ksh93/bltins/enum.c: - b_enum(): Remove check for >1 args as described above. - Update documentation to describe the behaviour of enumeration types in arithmetic expressions and to add an example: a bool type with two enumeration values 'false' (0) and 'true' (1). That type is predefined in ksh 93v- and 2020. We're not going to do that in 93u+m but it's good to document the possibility. src/cmd/ksh93/sh.1: - Make changes parallel to the enum.c self-doc update.	2022-03-11 21:21:23 +01:00
Martijn Dekker	b398f33c49	Another round of accumulated minor fixes and cleanups Only notable changes listed below. */Mamfile: - Do not bother redirecting standard error for 'cmp -s' to /dev/null. Normally, 'cmp -s' on Linux, macOS, BSD, or Solaris do not not print any message. If it does, something unusual is going on and I would want to see the message. - Since we now require a POSIX shell, we can use '!'. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/sh/init.c: - Remove SH_TYPE_PROFILE symbol, unused after the removal of the SHOPT_PFSH code. (re: eabd6453) src/cmd/ksh93/sh/io.c: - piperead(), slowread(): Replace redundant sffileno() calls by the variables already containing their results. (re: 50d342e4) src/cmd/ksh93/bltins/mkservice.c, rc/lib/libcmd/vmstate.c: - If these aren't compiled, define a stub function to silence the ranlib(1) warning that the .o file does not contain symbols.	2022-03-11 21:20:32 +01:00
Johnothan King	8fc8c2f51c	Fix a few minor issues (#473 ) Changes: - Fixed two xtrace test failures introduced in commit cfc8744c. - The definition of _use_ntfork_tcpgrp in xec.c is now dependent on SHOPT_SPAWN being defined (re: 8e9ed5be). - Removed many unnecessary newlines and fixed various typos.	2022-03-11 21:18:42 +01:00
Martijn Dekker	7b99b7cf04	Restore full 'bin/package test' functionality package now uses mamake to run all available regression tests (currently iffe, mamake and ksh93) instead of just the ksh tests. However, as a consequence, passing options or other arguments to the shtests script via bin/package is no longer possible -- run bin/shtests directly for that. src/Mamfile, src//Mamfile: - Make the 'test' virtual target execute subdirectories by including the 'install' and 'all' virtual targets within it. src/cmd/ksh93/Mamfile: - Simplify and update the script in the 'test' virtual target. src/cmd/builtin/Mamfile: - Add dummy 'test' target to avoid an error. bin/package, src/cmd/INIT/package.sh: - Run 'mamake test' from the arch//src directory. This is the one that must be the initial working directory for mamake, though the Mamfiles aren't here; mamake finds them via the VPATH variable. - Update self-doc.	2022-03-11 21:17:03 +01:00
Johnothan King	dccf6b5ea8	Backport ksh93v- regression tests and fix various regression test bugs (#472 ) - tests/*.sh: Backported many additional regression tests and test fixes from the alpha and beta releases of ksh93v-. - tests/alias.sh: Avoid trying to add vi to the hash table, as some platforms do not provide a vi(1) implementation installed as part of the default system. This fixes a regression test failure I was getting in one of my Linux virtual machines. - tests/builtins.sh: Fixed a bug in one of the regression tests that caused an incorrect total error count if any of the tests failed. - tests/sh_match.sh: Fixed a regression test failure on DragonFly BSD caused by the diff command printing an extra 'No differences encountered' line.	2022-03-11 21:15:55 +01:00
Johnothan King	bb3527aea5	Fix infinite loop when posix_spawn fails (re: `0863a8eb`) (#468 ) This commit fixes an infinite loop introduced in commit `0863a8eb` that caused ksh to enter an infinite loop if posix_spawn failed to start a new process after setting the terminal process group. Reproducer (warning: it will cause ksh to crash Wayland sessions and drives up CPU usage by a ton): $ /tmp/this/file/does/not/exist /usr/bin/ksh: /tmp/this/file/does/not/exist: not found $ <Press enter> (ksh now prints $PS1 in a loop until killed with SIGKILL) The first bug fixed is the infinite loop that occurs when posix_spawn fails to execute a command. This was fixed by setting the terminal process group to the main interactive shell. The second bug fixed is related to the signal handling of the SIGTTIN, SIGTTOU and SIGTSTP signals. In sh_ntfork() these signals are set to their default signal handlers (SIG_DFL) before running a command. The signal handlers were only restored to SIG_IGN (ignore signal) when sh_ntfork() successfully ran a command. This could cause a SIGTTOU lockup under strace when a command failed to execute in an interactive shell, while also being one cause of the infinite loop. src/cmd/ksh93/sh/xec.c: sh_ntfork(): - Restore the terminal process group if posix_spawn failed to launch a new process. This is necessary because posix_spawn will set the terminal process group before it attempts to run a command and doesn't restore it on failure.	2022-03-11 21:14:20 +01:00
atheik	2e5fd4d4c1	slowread(): Turn off O_NONBLOCK for stdin if it is on (#471 ) This change turns off O_NONBLOCK for stdin if a previously ran program left it on so that interactive programs that expect it to be off work properly. src/cmd/ksh93/sh/io.c: slowread(): - Turn off O_NONBLOCK for stdin if it is on. Fixes: https://github.com/ksh93/ksh/issues/469	2022-03-11 21:10:59 +01:00
Johnothan King	e87dbebebd	Fix use after free bug when using += (re: `75796a9c`) (#466 ) The previous fix for the += operator introduced a use-after-free bug that could result in a variable pointing to random garbage: $ foo=bar $ foo+=_foo true $ typeset -p foo foo=V V The use after free issue occurs because when nv_clone creates a copy of $foo in the true command's invocation-local scope, it does not duplicate the string $foo points to. As a result, the $foo variable in the parent scope points to the same string as $foo in the invocation-local scope, which causes the use after free bug when cloned $foo variable is freed from memory. src/cmd/ksh93/sh/nvdisc.c: - To fix the use after free bug, allow nv_clone to duplicate the string with memdup or strdup when no flags are passed. src/cmd/ksh93/tests/variables.sh: - Add a regression test for using the += operator with regular commands. src/cmd/ksh93/tests/leaks.sh: - Add a regression test to ensure the bugfix doesn't introduce any memory leaks.	2022-03-11 21:08:57 +01:00
Martijn Dekker	bc6c5dbdd9	path_pwd(): Fix use after free (re: `11177d44`) Of course, we should not free the 'cp' pointer when we still need to use it. Resolves: https://github.com/ksh93/ksh/issues/467 Thanks to @atheik for the report.	2022-02-19 21:50:20 +01:00
Martijn Dekker	f43bb4981f	Improve custom type declaration command parsing (re: c5018f7c) dcl_dehacktivate() actually left a bit of inconsistent state beyond the current line because it did not clear the dummy builtins tree. This caused assignments to variables of types whose definitions were parsed but not executed to throw a "not found" error instead of a syntax error, even beyond the current line. There is also an opportunity for an optimisation. We do not need to initialise and maintain the dummy builtins tree if we're never going to use it. So move that to the check_typedef() function right before the sh_addbuiltin() call: check there if the tree exists and if not create it and open a view. If no 'typeset -T' or 'enum' type definition command is executed, the tree is never created. dcl_hacktivate() now basically does nothing until the tree is needed, but it does still count the recursion level and install the error_info.exit hook because we need this to dcl_dehacktivate() at the correct time when the tree does exist. dcl_dehacktivate() is amended to clear the tree -- except if we're running shcomp, as shcomp parses the script line by line without executing anything, so we need the dummies to persist beyond the sh_parse() invocation for the entire script. (Note that we do not need this workaround for dot scripts or noexec mode, as the script is entirely parsed in a single sh_parse() call in those cases.)	2022-02-18 03:44:19 +00:00
Martijn Dekker	b09ce2fa02	Fix crash when suspending a blocked write to a FIFO Reproducer (symptoms on at least macOS and FreeBSD): $ mkfifo f $ echo foo > f (press Ctrl+Z) ^Zksh: f: cannot create [Interrupted system call] Abort The shell either aborts (dev builds) or crashes with 'Illegal instruction' (release builds). This is consistent with UNREACHABLE() being reached. Backtrace: 0 libsystem_kernel.dylib __kill + 10 1 ksh sh_done + 836 (fault.c:678) 2 ksh sh_fault + 1324 3 libsystem_platform.dylib _sigtramp + 29 4 dyld ImageLoaderMachOCompressed::resolve(ImageLoader::LinkContext const&, char const, unsigned ch 5 libsystem_c.dylib abort + 127 6 ksh sh_redirect + 3576 (io.c:1356) 7 ksh sh_exec + 7231 (xec.c:1308) 8 ksh exfile + 3247 (main.c:607) 9 ksh sh_main + 3551 (main.c:368) 10 ksh main + 38 (pmain.c:45) 11 libdyld.dylib start + 1 This means that UNREACHABLE() is actually reached here: ksh/src/cmd/ksh93/sh/io.c 1351: if((fd=sh_open(tname?tname:fname,o_mode,RW_ALL)) <0) 1352: { 1353: errormsg(SH_DICT,ERROR_system(1),((o_mode&O_CREAT)?e_create:e_open),fname); 1354: UNREACHABLE(); 1355: } The cause is that, in the following section of code in sh_fault(): ksh/src/cmd/ksh93/sh/fault.c 183: #ifdef SIGTSTP 184: if(sig==SIGTSTP) 185: { 186: sh.trapnote \|= SH_SIGTSTP; 187: if(pp->mode==SH_JMPCMD && sh_isstate(SH_STOPOK)) 188: { 189: sigrelease(sig); 190: sh_exit(SH_EXITSIG); 191: return; 192: } 193: } 194: #endif / SIGTSTP */ ...sh_exit() is not getting called and the function will not return because the SH_STOPOK bit is not set while the shell is blocked waiting to write to a FIFO. Even if sh_exit() did get called, that would not fix it, because that function also checks for the SH_STOPOK bit and returns without doing a longjmp if the signal is SIGTSTP and the SH_STOPOK bit is not set. That is direct the reason why UNREACHABLE() was raeched: errormsg() does call sh_exit() but sh_exit() then does not longjmp. src/cmd/ksh93/sh/fault.c: sh_fault(): - To avoid the crash, we simply need to return from sh_fault() if SH_STOPOK is off, so that the code path does not continue, no error message is given on Ctrl+Z, UNREACHABLE() is not reached, and the shell resumes waiting on trying to write to the FIFO. The sh.trapnote flag should not be set if we're not going to process the signal. This makes ksh behave like all other shells. Resolves: https://github.com/ksh93/ksh/issues/464	2022-02-17 20:21:23 +00:00
Martijn Dekker	11177d448d	Fix crash on cd in subshell with PWD unset (re: `5ee290c`) Reproducer: $ ksh -c 'unset PWD; (cd /); :' Memory fault The shell crashes because b_cd() is testing the value of the PWD variable without checking if there is one. src/cmd/ksh93/sh/path.c: path_pwd(): - Never return an unfreeable pointer to e_dot; always return a freeable pointer. This fixes another corner-case crashing bug. - Make sure the PWD variable gets assigned a value if it doesn't have one, even if it's the "." fallback. However, if the PWD is inaccessible but we did inherit a $PWD value that starts with a /, then use the existing $PWD value as this will help the shell fail gracefully. src/cmd/ksh93/bltins/cd_pwd.c: - b_cd(): When checking if the PWD is valid, use the sh.pwd copy instead of the PWD variable. This fixes the crash above. - b_cd(): Since path_pwd() now always returns a freeable value, free sh.pwd unconditionally before setting the new value. - b_pwd(): Not only check that path_pwd() returns a value starting with a slash, but also verify it with test_inode() and error out if it's wrong. This makes the 'pwd' command useful for checking that the PWD is currently accessible. src/cmd/ksh93/data/msg.c: - Change e_pwd error message for accuracy and clarity.	2022-02-17 19:45:37 +00:00
Martijn Dekker	d55e9686d7	Backport 'read -a' and 'read -p' from ksh 93v-/2020 This backports two minor additions to the 'read' built-in from ksh 93v-: '-a' is now the same as '-A' and '-u p' is the same as '-p'. This is for compatibility with some 93v- or ksh2020 scripts. Note that their change to the '-p' option to support both prompts and reading from the coprocess was not backported because we found it to be broken and unfixable. Discussoin at: https://github.com/ksh93/ksh/issues/463 src/cmd/ksh93/bltins/read.c: b_read(): - Backport as described above. - Rename the misleadingly named 'name' variable to 'prompt'. It points to the prompt string, not to a variable name. src/cmd/ksh93/data/builtins.c: sh_optpwd[]: - Add -a as an alterative to -A. All that is needed is adding '\|a' and optget(3) will automatically convert it to 'A'. - Change -u from a '#' (numeric) to ':' option to support 'p'. Note that b_read() now needs a corresponding strtol() to convert file descriptor strings to numbers where applicable. - Tweaks. src/cmd/ksh93/sh.1: - Update accordingly. - Tidy up the unreadable mess that was the 'read' documentation. The options are now shown in a list.	2022-02-17 19:44:54 +00:00
Martijn Dekker	95d695cb5a	Improve and document fast filescan loops (SHOPT_FILESCAN) From README: FILESCAN on Experimental option that allows fast reading of files using while < file;do ...; done and allowing fields in each line to be accessed as positional parameters. As SHOPT_FILESCAN has been enabled by default since ksh 93l 2001-06-01, the filescan loop is now documented in the manual page and the compile-time option is no longer considered experimental. We must disable this at runtime if --posix is active because it breaks a portable use case: POSIXly, 'while <file; do stuff; done' repeatedly excutes 'stuff' while 'file' can successfully be opened for reading, without actually reading from 'file'. This also backports a bugfix from the 93v- beta. Reproducer: $ echo 'one two three' >foo $ while <foo; do printf '[%s] ' "$@"; echo; done [one two three] Expected output: [one] [two] [three] The bug is that "$@" acts like "$*", joining all the positional parameters into one word though it should be generating one word for each. src/cmd/ksh93/sh/macro.c: varsub(): - Backport fix for the bug described above. I do not understand the opaque macro.c code well enough yet to usefully describe the fix. src/cmd/ksh93/sh/xec.c: sh_exec(): - Improved sanity check for filescan loop: do not recognise it if the simple command includes variable assignments, more than one redirection, or an output or append redirection. - Disable filescan loops if --posix is active. - Another 93v- fix: handle interrupts (errno==EINTR) when closing the input file.	2022-02-17 19:43:36 +00:00
Martijn Dekker	82ff91e9d9	Remove SHOPT_ENV dead code and sh.env variable (re: `8d7f616e`) As so often with SHOPT_* compile-time options, not all the relevant code was actually conditional on the option macro. After removing SHOPT_ENV, the arguments to sh_putenv() and env_delete() macro calls are dead code and, after removing that, the sh.env variable is unused.	2022-02-17 19:42:27 +00:00
Johnothan King	5bd18322e0	Remove unnecessary sh.defpathlist variable (re: `e3a1dda9`) (#461 ) src/cmd/ksh93/{sh/path.c,include/shell.h}: - The sh.defpathlist variable is never set once, which makes every use of this variable unnecessary (as it's always null). This commit removes sh.defpathlist while also fixing a possible memory leak (there was another location where defpathinit() was invoked without saving the returned pointer, causing a memory leak).	2022-02-17 19:40:53 +00:00
Martijn Dekker	4886463bb6	Disable broken KEYBD trap for multibyte characters In UTF-8 locales, ksh breaks when a KEYBD trap is active, even a dummy no-op one like 'trap : KEYBD'. Entering multi-byte characters fails (the input is interrupted and a new prompt is displayed) and pasting content with multi-byte characters produces corrupted results. The cause is that the KEYBD trap code is not multibyte-ready. Unfortunately nobody yet understands the edit.c code well enough to implement a proper fix. Pending that, this commit implements a workaround that at least avoids breaking the shell. src/cmd/ksh93/edit/edit.c: ed_getchar(): - When a multi-byte locale is active, do not trigger the the KEYBD trap except for ASCII characters (1-127). Resolves: https://github.com/ksh93/ksh/issues/307	2022-02-17 19:39:42 +00:00
Martijn Dekker	fc5bd8e8c3	tests/sh_match.sh: redirect ulimit to 2>/dev/null macOS 12.2.1 doesn't seem to like the -M, -v or -d ulimit options: sh_match.sh[502]: FAIL: test_xmlfragment1/0/testfile1.xml: Expected empty stderr, got $'test1_script.sh[2]: ulimit: 1048576: limit exceeded [Invalid argument]\ntest1_script.sh[3]: ulimit: 1048576: limit exceeded [Invalid argument]\ntest1_script.sh[4]: ulimit: 1048576: limit exceeded [Invalid argument]' The 'Invalid argument' addition is caused by errno==EINVAL and suggests the OS either doesn't support setting this limit, or support for it was somehow disabled. src/cmd/ksh93/tests/sh_match.sh: - Redirect standard error for ulimit commands to 2>/dev/null. If they fail it's pretty inconsequential and it's not related to actual ${.sh.match} testing at all. Resolves: https://github.com/ksh93/ksh/issues/459 Thanks to @posguy99 for the report.	2022-02-17 19:38:59 +00:00
Martijn Dekker	56c0c24b55	Do not disable completion along with pathname expansion The -f/--noglob shell option is documented simply as: "Disables pathname expansion." But after 'set -f' on an interactive shell, command completion and file name completion also stop working. This is because they internally use the pathname expansion mechanism. But it is not documented anywhere that 'set -f' disables completion; it's just a side effect of an implementation detail. Though ksh has always acted like this, I think it should change because it's not useful or expected behaviour. Other shells like bash, yash or zsh don't act like this. src/cmd/ksh93/sh/expand.c, src/cmd/ksh93/sh/macro.c: - Allow the SH_COMPLETE (command completion) or SH_FCOMPLETE (file name completion) state bit to override SH_NOGLOB in path_generate() and in sh_macexpand().	2022-02-17 19:38:15 +00:00
Martijn Dekker	4fee9d84fe	tests/sigchild.sh: try to fix intermittent fail (re: `dc80f40d`) It probably won't make a difference since the 'sleep' is run in the background, but let's change 'sleep .5 &' back to the original 'sleep 1 &' from the 93u+ 2012-08-01 version and see what happens. See: https://github.com/ksh93/ksh/issues/344#issuecomment-982219206	2022-02-17 19:37:41 +00:00
Martijn Dekker	1cdd963f53	do not save file desc state for subshares (re: `6304dfce`, `fb755163`) The >&- redirection subshell leak fixed in `6304dfce` still existed for shared-state ${ command substitutions; } a.k.a. subshares, which cannot be forked. I previously noticed that sh_subsavefd() saves the FD state even for subshares. That seems logically incorrect as subshares share their state with the invoking environment by definition. src/cmd/ksh93/sh/subshell.c: sh_subsavefd(): - Sure enough, adding a check for !sh.subshare fixes the bug. - Use the sh.subshell counter and not the subshell data pointer to check for a virtual subshell. If the shell is reinitialised in a fork to execute a new script (see 0af81992), any parent virtual subshell data is currently not cleared as it is locally scoped to subshell.c, so that check would be incorrect then. src/cmd/ksh93/sh/io.c: sh_redirect: - Remove now-redundant (and actually incorrectly placed) check for sh.subshare added in `fb755163`.	2022-02-17 19:36:50 +00:00
Martijn Dekker	6304dfce41	Fix corner-case >&- redirection leak out of subshell Reproducer: exec 9>&1 ( { exec 9>&1; } 9>&- ) echo "test" >&9 # => 9: cannot open [Bad file descriptor] The 9>&- incorrectly persists beyond the { } block that it was attached to and beyond the ( ) subshell. This is yet another bug with non-forking subshells; forking it with something like 'ulimit -t unlimited' works around the bug. In over a year we have not been able to find a real fix, but I came up with a workaround that forks a virtual subshell whenever it executes a code block with a >&- or <&- redirection attached. That use case is obscure enough that it should not cause any performance regression except in very rare corner cases. src/cmd/ksh93/sh/xec.c: sh_exec(): TSETIO: - This is where redirections attached to code blocks are handled. Check for a >&- or <&- redirection using bit flaggery from shnodes.h and fork if we're executing such in a virtual subshell. Resolves: https://github.com/ksh93/ksh/issues/161 Thanks to @ko1nksm for the bug report.	2022-02-17 19:35:47 +00:00
Martijn Dekker	14a43a0a88	Yet more misc. cleanups; rm SHOPT_PFSH, SHOPT_TYPEDEF Notable changes: - Remove SHOPT_PFSH compile-time option and associated code. This was meant to work with Solaris rights profiles, see: https://docs.oracle.com/cd/E23824_01/html/821-1461/profiles-1.html#REFMAN1profiles-1 But it has been obsolete for years as Solaris stopped using it in its shipped ksh several OS versions ago, preferring a library-based wrapper around ksh and other shells. Nonetheless I experimented with the option on Solaris 11.4. Result: no external command will run; output of unitialised memory in error message. So it's already fallen victim to bit rot. There's nothing interesting here, so just get rid. - Remove SHOPT_TYPEDEF compile-time option (but keep the code!). Turning it off caused the build to fail. It may be possible to fix it, but the type definition code is integral to ksh now (e.g. 'enum' depends on much of it) so it makes no sense to disable it. This was removed in the ksh 93v- beta version as well. - Remove nv_close() calls and remove nv_close() documentation from the nval.3 man page. This function is a dummy, present without any changes since the beginning of the ast-open-archive repo in 1995. The comment was: "Currently this is a dummy, but someday will be needed for reference counting". 27 or more years later, it's time to admit it's never going to happen. (And of course, nv_close() calls were not being used with anything resembling consistency.) - Add a null nv_close() macro to nval.h for compatibility with third party code that follows the old documentation. - Add a few missing regression tests.	2022-02-10 21:04:56 +00:00
Johnothan King	a00fe6b7fd	Fix one buffer overflow in 'typeset -p .sh.type' (#457 ) This small commit replaces one instance of memcmp with strncmp to fix one of the buffer overflows that causes 'typeset -p .sh.type' to crash (see also https://github.com/ksh93/ksh/issues/456).	2022-02-10 21:04:45 +00:00
Johnothan King	f38494ea1d	Fix multiple bugs in .sh.match (#455 ) This commit backports all of the relevant .sh.match bugfixes from ksh93v-. Most of the .sh.match rewrite is from versions 2012-08-24 and 2012-10-04, with patches from later releases of 93v- and ksh2020 also applied. Note that there are still some remaining bugs in .sh.match, although now the total count of .sh.match bugs should be less that before. These are the relevant changes in the ksh93v- changelog that were backported: 12-08-07 .sh.match no longer gets set for patterns in PS4 during set -x. 12-08-10 Rewrote .sh.match expansions fixing several bugs and improving performance. 12-08-22 .sh.match now handles subpatterns that had no matches with ${var//pattern} correctly. 12-08-21 A bug in setting .sh.match after ${var//pattern/string} when string is empty has been fixed. 12-08-21 A bug in setting .sh.match after [[ string == pattern ]] has been fixed. 12-08-31 A bug that could cause a core dump after typeset -m var=.sh.match has been fixed. 12-09-10 Fixed a bug in typeset -m the .sh.match is being renamed. 12-09-07 Fixed a bug in .sh.match code that coud cause the shell to quitely 13-02-21 The 12-01-16 bug fix prevented .sh.match from being used in the replacement string. The previous code was restored and a different fix which prevented .sh.match from being computed for nested replacement has been used instead. 13-05-28 Fixed two bug for typeset -c and typeset -m for variable .sh.match. Changes: - The SHOPT_2DMATCH option has been removed. This was already the default behavior previously, and now it's documented in the man page. - init.c: Backported the sh_setmatch() rewrite from 93v- 2012-08-24 and 2012-10-04. - Backported the libast 93v- strngrpmatch() function, as the .sh.match rewrite requires this API. - Backported the sh_match regression tests from ksh93v-, with many other sh_match tests backported from ksh2020. Much of the sh_match script is based on code from Roland Mainz: https://marc.info/?l=ast-developers&m=134606574109162&w=2 https://marc.info/?l=ast-developers&m=134490505607093 - tests/{substring,treemove}.sh: Backported other relevant .sh.match fixes, with tests added to the substring and treemove test scripts. - tests/types.sh: One of the (now reverted) memory leak bugfixes introduced a CI test failure in this script, so for that test the error message has been improved. - string/strmatch.c: The original ksh93v- code for the strngrpmatch() changes introduced a crash that could occur because strlen would be used on a null pointer. This has been fixed by avoiding strlen if the string is null. One nice side effect of these changes is a considerable performance improvement in the shbench[1] gsub benchmark (results from 20 iterations with CCFLAGS=-Os): -------------------------------------------------- name /tmp/ksh-current /tmp/ksh-matchfixes -------------------------------------------------- gsub.ksh 0.883 [0.822-0.959] 0.457 [0.442-0.505] -------------------------------------------------- Despite all of the many fixes and improvements in the backported 93v- .sh.match code, there are a few remaining bugs: - .sh.match is printed with a default [0] subscript (see also https://github.com/ksh93/ksh/issues/308#issuecomment-1025016088): $ arch//bin/ksh -c 'echo ${!.sh.match}' .sh.match[0] This bug appears to have been introduced by the changes from ksh93v- 2012-08-24. - The wrong variable name is given for 'parameter not set' errors (from https://marc.info/?l=ast-developers&m=134489094602596): $ arch//bin/ksh -u $ x=1234 $ true "${x//~(X)([012])\|([345])/}" $ compound co $ typeset -m co.array=.sh.match $ printf "%q\n" "${co.array[2][0]}" arch/linux.i386-64/bin/ksh: co.array[2][(null)]: parameter not set - .sh.match leaks out of subshells. Further information and a reproducer can be found here: https://marc.info/?l=ast-developers&m=136292897330187 [1]: https://github.com/ksh-community/shbench	2022-02-10 21:04:23 +00:00
Martijn Dekker	232b7bff30	Fix multiple bugs in executing scripts without a #! path When executing a script without a hashbang path like #!/bin/ksh, ksh forks itself, longjmps back to sh_main(), and then (among other things) calling sh_reinit() which is the function that tries to reinitialise as much of the shell as it can. This is its way of ensuring the child script is run in ksh and not some other shell. However, this appraoch is incredibly buggy. Among other things, changes in built-in commands and custom type definitions survived the reinitialisation, "exporting" variables didn't work properly, and the hash table and ${.sh.stats} weren't reset. As a result, depending on what the invoking script did, the invoked script could easily fail or malfunction. It is not actually possible to reinitialise the shell correctly, because some of the shell state is in locally scoped static variables that cannot simply be reinitialised. There are probably huge memory leaks with this approach as well. At some point, all this is going to need a total redesign. Clearly, the only reliable way involves execve(2) and a start from scratch. For now though, this seems to fix the known bugs at least. I'm sure there are more to be discovered. This commit makes another change: instead of the -h/trackall option (which has been a no-op for decades), the posix option is now inherited by the child script. Since there is no hashbang path from which to decide whether the shell should run in POSIX mode nor not, the best guess is probably the invoking script's setting. src/cmd/ksh93/sh/init.c: sh_reinit(): - Keep the SH_INIT state on during the entire procedure. - Delete remaining non-exported, non-default variables. - Remove attributes from exported variables. In POSIX mode, remove all attributes; otherwise, only remove readonly. - Unset discipline function pointers for variables. - Delete all custom types. - Delete all functions and built-ins, then reinitialise the built-ins table from scatch. - Free the alias values before clearing the alias table. - Same with hash table entries (tracked aliases). - Reset statistics. - Inherit SH_POSIX instead of SH_TRACKALL. - Call user init function last, not somewhere in the middle. src/cmd/ksh93/sh/name.c: sh_envnolocal(): - Be sure to preserve the export attribute of kept variables. Resolves: https://github.com/ksh93/ksh/issues/350	2022-02-10 21:03:43 +00:00
Johnothan King	7d4c7d9156	Fix 'typeset -p' output of compound array types (#453 ) This bugfix was backported from ksh93v- 2012-10-04. The bug fixed by this change is one that causes 'typeset -p' to omit the -C flag when listing compound arrays belonging to a type: $ typeset -T Foo_t=(compound -a bar) $ Foo_t baz $ typeset -p baz.bar typeset -a baz.bar='' # This should be 'typeset -C -a' src/cmd/ksh93/sh/nvtype.c: - Backport change from 93v- 2012-10-04 that sets the array nvalue to a pointer named Null (which is "") in nv_mktype(), then to Empty in fixnode(). - Change the Null name from the 93v- code to AltEmpty to avoid misleading code readers into thinking that it's a null pointer. src/cmd/ksh93/tests/types.sh: - Backport the relevant 93v- changes to the types regression tests. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-02-10 21:03:24 +00:00
Johnothan King	787058bdbf	Fix the output of `typeset -p` for two dimensional indexed arrays (#454 ) In ksh93v- 2012-10-04 the following bugfix is noted in the changelog (this fix was most likely part of ksh93v- 2012-09-27, although that version is not archived anywhere): 12-09-21 A bug in which the output of a two dimensional sparse indexed array would cause the second subscript be treated as an associative array when read back in has been fixed. Elements that are sparse indexed arrays now are prefixed type "typeset -a". Below is a before and after of this change: # Before $ typeset -a foo[1][2]=bar $ typeset -p foo typeset -a foo=([1]=([2]=bar) ) # After $ typeset -a foo[1][2]=bar $ typeset -p foo typeset -a foo=(typeset -a [1]=([2]=bar) ) src/cmd/ksh93/sh/*.c: - Backport changes from ksh93v- to print 'typeset -a' before sparse indexed arrays and properly handle 'typeset -a' in reinput commands from 'typeset -p'. src/cmd/ksh93/tests: - Add two regression tests to arrays.sh for this change. - Update the existing regression tests for compatibility with the new printed typeset output.	2022-02-10 21:01:40 +00:00
Martijn Dekker	e6d0187dd8	Don't allow 'enum' and 'typeset -T' to override special built-ins Special builtins are undeleteable for a reason. But 'enum' and 'typeset -T' allow overriding them, causing an inconsistent state. @JohnoKing writes: \| The behavior is rather buggy, as it appears to successfully \| override normal builtins but fails to delete the special \| builtins, leading to scenarios where both the original builtin \| and type are run: \| \| $ typeset -T eval=(typeset BAD; typeset TYPE) # This should have failed \| $ eval foo=BAD \| /usr/bin/ksh: eval: line 1: foo: not found \| $ enum trap=(BAD TYPE) # This also should have failed \| $ trap foo=BAD \| /usr/bin/ksh: trap: condition(s) required \| $ enum umask=(BAD TYPE) \| $ umask foo=BAD \| $ echo $foo \| BAD \| \| # Examples of general bugginess \| $ trap bar=TYPE \| /usr/bin/ksh: trap: condition(s) required \| $ echo $bar \| TYPE \| $ eval var=TYPE \| /usr/bin/ksh: eval: line 1: var: not found \| $ echo $var \| TYPE This commit fixes the following: The 'enum' and 'typeset -T' commands are no longer allowed to override and replace special built-in commands, except for type definition commands previously created by these commands; these are already (dis)allowed elsewhere. A command like 'typeset -T foo_t' without any assignments no longer creates an incompletely defined 'foo_t' built-in comamnd. Instead, it is now silently ignored for backwards compatibility. This did have a regression test checking for it, but I'm changing it because that's just not a valid use case. An incomplete type definition command does nothing useful and only crashes the shell when run. src/cmd/ksh93/bltins/enum.c: b_enum(): - Do not allow overriding non-type special built-ins. src/cmd/ksh93/sh/name.c: nv_setlist(): - Do not allow 'typeset -T' to override non-type special built-ins. To avoid an inconsistent state, this must be checked for while processing the assignments list before typeset is really invoked. src/cmd/ksh93/bltins_typeset.c: b_typeset(): - Only create a type command if sh.envlist is set, i.e., if some shell assignment(s) were passed to the 'typeset -T' command. Progresses: https://github.com/ksh93/ksh/issues/350	2022-02-10 21:01:00 +00:00
Martijn Dekker	65aff0befb	Fix conditional expansions ${array[i]=value}, ${array[i]?error} $ unset foo $ echo ${foo[42]=bar} (empty line) Instead of the empty line, 'bar' was expected. As foo[42] was unset, the conditional assignment should have worked. $ unset foo $ : ${foo[42]?error: unset} (no output) The expansion should have thrown an error with the given message. This bug was introduced in ksh 93t 2008-10-01. Thanks to @JohnoKing for finding the breaking change. Analysis: The problem was experimenally determined to be in in the following lines of nv_putsub(). If the array member is unset (i.e. null), the value is set to the empty string instead: src/cmd/ksh93/sh/array.c 1250: else 1251: ap->val[size].cp = Empty; It makes some sense: if there is a value (even an empty one), the variable is set and these expansions should behave accordingly. Sure enough, deleting these lines fixes the bug, but at the expense of introducing a lot of other array-related regressions. So we need a way to special-case the affected expansions. Where to do this? If we replace line 1251 with an abort(3) call, we get this stack trace: 0 libsystem_kernel.dylib __pthread_kill + 10 1 libsystem_pthread.dylib pthread_kill + 284 2 libsystem_c.dylib abort + 127 3 ksh nv_putsub + 1411 (array.c:1255) 4 ksh nv_endsubscript + 940 (array.c:1547) 5 ksh nv_create + 4732 (name.c:1066) 6 ksh nv_open + 1951 (name.c:1425) 7 ksh varsub + 4934 (macro.c:1322) [rest omitted] The special-casing needs to be done on line 1250 of array.c, but flagged in varsub() which processes these expansions. So, varsub() calls nv_open() calls nv_create() calls nv_endsubscript() calls nv_putsub(). That's a fairly deep call stack, so passing an extra flag argument does not seem doable. I did try an approach using a couple of new bit flags passed via these functions' flags and mode parameters, but the way this code base uses bit flags is so intricate, it does not seem to be possible to add or change anything without unwanted side effects in all sorts of places. So the only fix I can think of adds yet another global flag variable for a very special case. It's ugly, but it works. An elegant fix would probably involve a fairly comprehensive redesign, which is simply not going to happen. src/cmd/ksh93/include/shell.h: - Add global sh.cond_expan flag. src/cmd/ksh93/sh/array.c: nv_putsub(): - Do not set value to empty string if sh.cond_expan is set. src/cmd/ksh93/sh/macro.c: varsub(): - Set sh.cond_expan flag while calling nv_open() for one of the affected expansions. - Minor refactoring for legibility and to make the fix fit better. - SSOT: Instead of repeating string "REPLY", use the node's nvname. - Do not pointlessly add an extra 0 byte when saving id for error message; sfstruse() already adds this. Thanks to @oguz-ismail for the bug report. Resolves: https://github.com/ksh93/ksh/issues/383	2022-02-05 23:39:16 +00:00
Martijn Dekker	493a31053e	Do not export variables with dot names (re: `8e72608c`) Variables with a dot in their name, such as those declared in namespace { ... } blocks, are usually stored in a separate tree with their actual names not containing any dots. But under some circumstances, including at least direct assignment of a non-preexisting dot variable, dot variables are stored in the main sh.var_tree with names actually containing dots. With allexport active, those could end up exported to the environment. This bug was also present in previous release versions of ksh. src/cmd/ksh93/sh/name.c: pushnam(): - Check for a dot in the name before pushing a variable to export.	2022-02-05 15:08:50 +00:00
Johnothan King	a8dd1bbd9d	typeset -p: fix output of nonexistent [0]= array element (#451 ) This fix was backported from ksh 93v- 2012-10-04. src/cmd/ksh93/sh/nvtree.c: nv_outnode(): - If the array is supposed to be empty, do not continue. This avoids outputting a nonexistent [0]= element for empty arrays. Resolves: https://github.com/ksh93/ksh/issues/420 Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-02-05 13:53:51 +00:00
Johnothan King	fb696ecfae	trap: fix use after free (#446 ) This commit adds a fix for the trap command, backported from a fork of ksh2020: https://github.com/l0stman/ksh/commit/2033375f src/cmd/ksh93/sh/jobs.c: job_chldtrap(): - Fixed a use after free bug in the for loop. The string pointed to by sh.st.trapcom[SIGCHLD] may be freed from memory after sh_trap(), so it must be reobtained each time sh_trap() is called from within the for loop.	2022-02-05 13:53:11 +00:00
Johnothan King	8e72608c1c	Export all variables assigned to while allexport is on (#431 ) All variables that are assigned a value should be exported while the allexport shell option is enabled. This works in most cases, but variables assigned to with ${var:=foo} or $((var=123)) aren't exported while allexport is on. src/cmd/ksh93/sh/name.c: - nv_putval(): This is the central assignment function; all forms of variable assignment end up here. So this is the best place to check for SH_ALLEXPORT and turn on the export attribute. - nv_setlist(): Remove allexport handling, now redundant. src/cmd/ksh93/bltins/read.c: sh_readline(): - Remove allexport handling, now redundant. src/cmd/ksh93/sh/main.c: sh_main(): - nv_putval() is used to initialize PS4 and IFS using nv_putval(); this is after an -a/--allexport specified on the ksh command line has been processed, so temporarily turn that off. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-02-05 13:52:28 +00:00
Johnothan King	0863a8eb29	Support glibc 2.35's posix_spawn_file_actions_addtcsetpgrp_np(3) This commit implements support for the glibc 2.35 posix_spawn_file_actions_addtcsetpgrp_np(3) extension[2][3], updating spawnveg(3) to use the new function for setting the terminal group. This was done with the intention of improving performance in interactive shells without reintroducing previous race conditions[4][5]. [1]: https://sourceware.org/pipermail/libc-alpha/2022-February/136040.html [2]: https://sourceware.org/git/?p=glibc.git;a=commit;h=342cc934 [3]: https://sourceware.org/git/?p=glibc.git;a=commit;h=6289d28d [4]: https://github.com/ksh93/ksh/issues/79 [5]: https://www.mail-archive.com/ast-developers@research.att.com/msg00717.html src/cmd/ksh93/sh/path.c: - Tell spawnveg(3) to set the terminal process group when launching a child process in an interactive shell. src/cmd/ksh93/sh/xec.c: - If posix_spawn_file_actions_addtcsetpgrp_np(3) is available, allow use of spawnveg(3) (via sh_ntfork()) even with job control active. - sh_ntfork(): Reimplement most of the SIGTSTP handling code removed in commit `66c37202`. src/lib/libast/comp/spawnveg.c, src/lib/libast/misc/procopen.c, src/lib/libast/features/sys: - Add support for posix_spawn_file_actions_addtcsetpgrp_np(3). - Allow spawnveg to set the terminal process group when pgid == 0. This was necessary to avoid race conditions when using the new function. src/lib/libast/features/lib: - Detect posix_spawn_file_actions_addtcsetpgrp_np(3). - Do not detect an OS spawnveg(3). With the API changes to spawnveg in this pull request ksh probably can't use the OS's spawnveg function anymore. (That's assuming anything else even provides a spawnveg function to begin with, which is unlikely.) src/lib/libast/features/api, src/cmd/ksh93/include/defs.h: - Bump libast version (20220101 => 20220201) due to the spawnveg(3) API change. src/lib/libast/man/spawnveg.3: - Document the changes to spawnveg(3) in the corresponding man page. Currently, it will only use the new tcfd argument if posix_spawn_file_actions_addtcsetpgrp_np(3) is supported. This could also be implemented for the fork(2) fallback, but for now I've avoided changing that since actually using it in the fork code would likely require a lot of hackery to avoid attempting tcsetpgrp with vfork (the behavior of tcsetpgrp after vfork is not portable) and would only benefit systems that don't have posix_spawn and vfork (I can't recall any off the top of my head that would fall under that category). - Updated the man page to account for spawnveg's change in behavior. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-02-05 13:31:31 +00:00
Martijn Dekker	1375cda934	Default to emacs upon invoking interactive shell If the VISUAL or EDITOR environment variable is not set to a value matching [Vv][Ii] or macs at initialisation time, then ksh does not turn on any line editor. This is user-hostile. New users on Unix-like systems typically have a simple editor like nano preconfigured as their default, or may not have the VISUAL or EDITOR variable set at all. So if they try ksh, they find themselves without basic functionality such as arrow keys and probably go straight back to bash. The emacs line editor is by far the most widely used, especially among new users, so ksh should default to that. Most other shells already do this. src/cmd/ksh93/sh/main.c: sh_main(): - On an interactive shell, if on editor was turned on based on $VISUAL or $EDITOR, turn on emacs before reading input.	2022-02-01 23:47:56 +00:00
Martijn Dekker	d650c73e55	Fix SIGINT handling for external commands run from scripts Reproducer: $ ksh -c 'bash -c '\''kill -s INT $$'\''; echo "$?, continuing"' Expected result: output "258, continuing"; exit status 0. Actual result: no output; exit status 258. The child process sent SIGINT only to itself and not to the process group, so the parent script was wrongly interrupted. Every shell except ksh93 produces the expected result. ksh93 also gave the expected result before version 2008-01-31 93s+, which introduced the code below. Analysis: The problem is in these lines of code in xec.c, sh_exec(), TFORK case, parent branch of fork: 1649: if(!sh_isstate(SH_MONITOR)) 1650: { 1651: if(!(sh.sigflag[SIGINT]&(SH_SIGFAULT\|SH_SIGOFF))) 1652: sh_sigtrap(SIGINT); 1653: sh.trapnote \|= SH_SIGIGNORE; 1654: } [...pipe and I/O handling, wait for command to finish...] 1667: if(!sh_isstate(SH_MONITOR)) 1668: { 1669: sh.trapnote &= ~SH_SIGIGNORE; 1700: if(sh.exitval == (SH_EXITSIG\|SIGINT)) 1701: kill(sh.current_pid,SIGINT); 1702: } When a user presses Ctrl+C, SIGINT is sent to the entire process group. If job control is fully off (i.e., !sh_isstate(SH_MONITOR)), then the process group includes the parent script. Therefore, in a script such as $ ksh -c 'bash -c '\''read x'\''; echo "$?, continuing"' when the user presses Ctrl+C while bash waits for 'read x' input, the parent ksh script should be interrupted as well. Now, the code above ignores SIGINT while bash is running. (This is done using special-casing in sh_fault() to handle that SH_SIGIGNORE flag for SIGINT.) So, when Ctrl+C interrupts the process group, the parent script is not getting interrupted as it should. To compensate for that, the code then detects, using sh.exitval (the child process' exit status), whether the child process was killed by SIGINT. If so, it simply assumes that the signal was meant for the process group including the parent script, so it reissues SIGINT to itself after unignoring it. But, as we can see from the broken reproducer above, that assumption is not valid. Scripts are perfectly free to send SIGINT to themselves only, and that must work as expected. src/cmd/ksh93/sh/xec.c: sh_exec(): TFORK: parent branch: - Instead of ignoring SIGINT, sigblock() it, which delays handling the signal until sigrelease(). (Note that these are macros defined in src/cmd/ksh93/features/sigfeatures according to OS capabilities.) - This makes reissuing SIGINT redundant, so delete that, which fixes the bug. src/cmd/ksh93/sh/fault.c: - Nothing now sets the SH_SIGIGNORE flag in sh.trapnote, so remove special-casing added in 2008-01-31 93s+.	2022-02-01 05:50:10 +00:00
Johnothan King	9a5af738ef	Add support for more keyboard shortcuts (#410 ) Add extra key bindings to the emacs and vi modes This patch adds the following key bindings to the emacs and vi editing modes: - Support for Home key sequences ^[[1~ and ^[[7~ as well as End key sequences ^[[4~ and ^[[8~. - Support for arrow key sequences ^[OA, ^[OB, ^[OC and ^[OD. - Support for the following keyboard shortcuts (if the platform supports the expected escape sequence): - Ctrl-Left Arrow: Go back one word - Alt-Left Arrow: Go back one word (Not supported on Haiku) - Ctrl-Right Arrow: Go forward one word - Alt-Right Arrow: Go forward one word (Not supported on Haiku) - Ctrl-G: Cancel reverse search - Ctrl-Delete: Delete next word (Not supported on Haiku) - Added a key binding for the Insert key, which differs in the emacs and vi editing modes: - In emacs mode, Insert escapes the next character. - In vi mode, Insert will switch the editor to insert mode (like in vim). src/cmd/ksh93/edit/{emacs,vi}.c: - Add support for the <M-Left> and <M-Right> sequences. Like in bash and mksh, these shortcuts move the cursor one word backward or forward (like the <Ctrl-Left> and <Ctrl-Right> shortcuts). - Only attempt to process these shortcuts if the escape sequence begins with $'\E[1;'. src/cmd/ksh93/edit/vi.c: - If the shell isn't doing a reverse search, insert the bell character when Ctrl+G is input. - Add the Ctrl-Delete shortcut as an alias of 'dw'. Calling ed_ungetchar twice does not work for 'dw', so Ctrl-Delete was implemented by using a vp->del_word variable. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-01-31 21:51:50 +00:00
Martijn Dekker	9c313c7fe3	Update copyright years in files changed since 1st Jan 2022	2022-01-30 20:49:04 +00:00
Johnothan King	ad9f9ff13e	Accumulated fixes for minor issues (#442 ) This commit applies various accumulated bugfixes: - Applied some fixes for compiler warnings based off of the following pull requests (with whitespace changes excluded to avoid inflating the size of the diff): https://github.com/att/ast/pull/281 https://github.com/att/ast/pull/283 https://github.com/att/ast/pull/303 https://github.com/att/ast/pull/304 - clone_type(): Two separate variables in this function share the same name. A bugfix from ksh93v- 2013-05-24 was backported to avoid conflict issues. - Backported a minor error message improvement from ksh2020 for when the user attempts to run too many coprocesses. - Backported a ksh2020 bugfix for a file descriptor leak: https://github.com/att/ast/commit/58bc8b56 - Backported ksh2020 bugfixes for unused variable and pointless assignment lint warnings: https://github.com/att/ast/commit/47650fe0 https://github.com/att/ast/commit/df209c0d https://github.com/att/ast/commit/5e417b00 - Applied a few minor improvements to libast from graphviz: https://gitlab.com/graphviz/graphviz/-/commit/e7c03541 https://gitlab.com/graphviz/graphviz/-/commit/969a7cde - dtmemory(): Mark unused parameters with NOT_USED(). Based on: https://gitlab.com/graphviz/graphviz/-/commit/6ac3ad99 - Applied a few documentation/comment tweaks for the NEWS file, printf -v and spawnveg. - Added a missing regression test for using the rm builtin's -f option without additional arguments (this was fixed in ksh93u+ 2012-02-14).	2022-01-30 20:42:59 +00:00
Johnothan King	e3a1dda93a	Fix memory leak in defpathinit() (#441 ) Currently, running ksh under ASan without the ASAN_OPTIONS variable set to 'detect_leaks=0' usually ends with ASan complaining about a memory leak in defpathinit() (this leak doesn't grow in a loop, so no regression test was added to leaks.sh). Reproducer: $ ENV=/dev/null arch//bin/ksh $ cp -? cp: invalid option -- '?' Try 'cp --help' for more information. $ exit ================================================================= ==225132==ERROR: LeakSanitizer: detected memory leaks Direct leak of 85 byte(s) in 1 object(s) allocated from: #0 0x7f6dab42d459 in __interceptor_calloc /build/gcc/src/gcc/libsanitizer/asan/asan_malloc_linux.cpp:154 #1 0x5647b77fe144 in sh_calloc /home/johno/GitRepos/KornShell/ksh/src/cmd/ksh93/sh/init.c:265 #2 0x5647b788fea9 in path_addcomp /home/johno/GitRepos/KornShell/ksh/src/cmd/ksh93/sh/path.c:1567 #3 0x5647b78911ed in path_addpath /home/johno/GitRepos/KornShell/ksh/src/cmd/ksh93/sh/path.c:1705 #4 0x5647b7888e82 in defpathinit /home/johno/GitRepos/KornShell/ksh/src/cmd/ksh93/sh/path.c:442 #5 0x5647b78869f3 in ondefpath /home/johno/GitRepos/KornShell/ksh/src/cmd/ksh93/sh/path.c:67 --- cut --- SUMMARY: AddressSanitizer: 174 byte(s) leaked in 2 allocation(s). Analysis: The previous code was leaking memory because defpathinit() returns a pointer from path_addpath(), which has memory allocated to it in path_addcomp(). This is the code ASan traced as having allocated memory: 442: return(path_addpath((Pathcomp_t)0,(defpath),PATH_PATH)); In path_addpath(): 1705: first = path_addcomp(first,old,cp,type); [...] 1729: return(first); In path_addcomp(): 1567: pp = sh_newof((Pathcomp_t*)0,Pathcomp_t,1,len+1); The ondefpath() function doesn't save a reference to the pointer returned by defpathinit(), which causes the memory leak: 66: if(!defpath) 67: defpathinit(); The changes in this commit avoid this problem by setting the defpath variable without also calling path_addpath(). src/cmd/ksh93/sh/path.c: - Move the code for allocating defpath from defpathinit() into its own dedicated function called std_path(). This function is called by defpathinit() and ondefpath() to obtain the current string stored in the defpath variable. This bugfix is adapted from a fork of ksh2020: https://github.com/l0stman/ksh/commit/db5c83a6	2022-01-30 20:42:15 +00:00
Martijn Dekker	9e525c5bde	array_grow(): fix wrong sizeof() The array_grow() function calculates the size by multiplying with sizeof(union Value*), where sizeof(union Value) was clearly meant. In practice, these are the same size on most (or maybe even all) systems, as no current member of union Value is larger than a pointer -- see name.h. But it's still wrong.	2022-01-30 20:41:29 +00:00
Martijn Dekker	304648d0c5	Another round of accumulated tweaks and cleanups Notable changes: src/cmd/ksh93/.c: - Get rid of all the dtuserdata(FOO,&sh,1) calls backported in `cc492752`. These set pointers to sh in Cdt objects. As of `b590a9f1`, the code does not use any pointers to sh, so these are superfluous. src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - As of ksh 93l 2001-06-01, the -h/trackall option has no effect at all, so trim its documentation. src/lib/libast/man/stk.3, src/lib/libast/man/stak.3: - Correct the documentation on what the ST(A)K_SMALL option bit actually does based on a reading of the code. - Document the STK_NULL option bit. README.md, src/cmd/ksh93/README: - Add a note that -fdiagnostics-color=always will break the build. Ref.: https://github.com/ksh93/ksh/issues/379 src/lib/libast/Mamfile: - Remove a 'rm -f astmath' command -- a file that is never created. But on Cygwin this removes astmath.exe, which is* used. As a result, executing it failed on Cygwin, so the system incorrectly detected that Cygwin needs -lm for math functions.	2022-01-28 21:12:31 +00:00
Martijn Dekker	7259153f1a	Fix resuming external command run from 'eval' (re: `2c35a539`) 'eval' suffers from the same bug. Reproducer: $ eval vi then suspend vi, then try to resume it -- the same as in the reproducer shown in the previous commit. src/cmd/ksh93/bltins/misc.c: b_eval(): - Same fix. Do not turn off SH_MONITOR.	2022-01-28 05:59:54 +00:00
Martijn Dekker	2c35a53964	Fix resuming external command run from POSIX function or dot script This fixes yet another whopper of a bug in job control. And it's been in every version of ksh93 back to 1995, the beginning of ast-open-archive. <sigh> This is also bug number 23 that is fixed by simply removing code. Reproducer: 1. Run vi, or another suspendable program, from a dot script or POSIX function (ksh handles them the same way). So either: $ echo vi >v $ . ./v or: $ v() { vi; } $ v 2. Suspend vi by typing Ctrl+Z. 3. Not one but two jobs are registered: $ jobs -l [2] + 85513 Stopped . ./v [1] - 85512 Stopped . ./v 4. 'fg' does not work on either of them, just printing the job command name but not resuming the editor. The second job disappears from the table after 'fg'. $ fg %2 . ./v $ fg %2 ksh: fg: no such job $ fg %1 . ./v $ fg %1 . ./v Either way, you're stuck with an unresumable vi that you have to 'kill -9' manually. src/cmd/ksh93/sh/xec.c: sh_exec(): TFORK: - Do not turn off the SH_MONITOR state flag (which tells ksh to keep track of jobs) when running an external command from a profile script or dot script/POSIX function. It's obvious that this results in an inconsistent job control state as ksh will not update it when the external command is suspended. The purpose of this nonsense is surely lost to history as it's been there since 1995 or before. My testing says that removing it doesn't break anything. If that turns out to be wrong, then the breakage will have to be fixed in a correct way instead.	2022-01-28 05:15:42 +00:00
Johnothan King	c0567c5e1d	Fix spurious syntax error when using ${foo[${bar}..${baz}]} (#436 ) Attempting to use array subscript expansion with variables that aren't set currently causes a spurious syntax error (in ksh93u+ and older commits the reproducer crashes): $ ksh -c 'echo ${foo[${bar}..${baz}]}' # Shouldn't print anything ksh: : arithmetic syntax error src/cmd/ksh93/sh/macro.c: - Backport a parser bugfix from ksh93v- 2012-08-24 that avoids setting mp->dotdot until the copyto() function's loop is finished. src/cmd/ksh93/tests/arrays.sh: - Add regression tests for this bug.	2022-01-27 16:08:54 +00:00
Martijn Dekker	fe268fcc91	Cygwin: workaround for ksh to execute #!-less scripts with itself On Cygwin, ksh does not execute scripts without a #! path in a fork of the ksh process as it does on other systems. Reproducer (run from ksh): $ cat test.sh echo "${BASH_VERSION:-not bash}" echo "${.sh.version}" $ chmod +x test.sh $ ./test.sh 4.4.12(3)-release ./test.sh: line 2: ${.sh.version}: bad substitution The script was executed in bash instead of ksh. After this fix, the output on Cygwin is like ksh on other systems: not bash Version AJM 93u+m/1.1.0-alpha+dev 2022-01-26 This also fixes a number of regression test failures, as quite a few tests create and execute temp scripts without a hashbang path. Analysis: On Cygwin, execve(2) happily executes shell scripts without a #! path with /bin/sh (which is bash --posix). However, ksh relies on execve(2) executing binaries or #! only, as it uses an ENOEXEC failure to decide whether to fork and execute a #!-less shell script with a reinitialized copy of itself via exscript(). src/cmd/ksh93/sh/path.c: path_spawn(): - Look at the magic first two bytes of the file; if it is "MZ" (Mark Zbikowski, originator of the .exe format) or "#!", continue as normal, otherwise simulate an ENOEXEC failure from execve(2) which will cause ksh to fall back on #!-less script execution.	2022-01-27 16:08:44 +00:00
Martijn Dekker	172becffea	Some more accumulated minor tweaks and cleanups Notable changes: src/cmd/ksh93/include/fault.h: - Get rid of the superflous sh pointer argument in the sh_pushcontext() and sh_popcontext() macros. (re: `2d3ec8b6`) src/cmd/ksh93/tests/io.sh: - Tweak a process substitution test to allow up to a second for unused process substitution processes to disappear. On the Alpine Linux console (at least the musl libc version), this is needed to avoid a test failure as long as no GUI is active. As soon as you start X11, this phenomenon disappears, even on the console. Very strange, but also very probably not ksh's fault. src/cmd/ksh93/tests/shtests: - Instead of just SIGCONT and SIGPIPE, set all signals to default, just to be sure to avoid spurious test failures due to signals that were ignored on entry. (It made no difference to the aforementioned Alpine Linux test failure, so ignored signals had nothing to do with that -- but still a good idea.) .github/workflows/ci.yml: - On the GitHub CI runs, when testing with SHOPTs disabled, disable SHOPT_SPAWN as well, which tests that everything still works correctly with the regular fork(2) method. COPYRIGHT: - Remove duplicate of BSD license.	2022-01-25 16:13:15 +00:00
Martijn Dekker	41ee12a527	Document history expansion and fix a few loose ends src/cmd/ksh93/sh.1: - Add a new section on history expansion mostly adapted from the "History substitution" section from the tcsh(1) man page. This has the standard BSD license which is already in the COPYRIGHT file. Inapplicable stuff was removed, some new stuff added. src/cmd/ksh93/edit/hexpand.c, src/cmd/ksh93/sh/io.c: - Set '#' as the default history comment character as on bash; no longer disable it by default. - Add the 'a' modifier as a synonym for 'g', as on bash. - Remove pointless static keyword from np variable; since the value from previous calls is never used it can just be local. - Use NV_NOADD flag with nv_open() to avoid pointlessly creating the node if the variable doesn't exist yet. - Fix a bug in history expansion where the 'p' modifier had no effect if the 'histverify' option is on. Reproducer: $ set -H -o histv $ true a b c $ !!:p $ true a b c▁ <= instead of printed, the line is re-edited Expected: $ set -H -o histv $ true a b c $ !!:p true a b c $ ▁ This is fixed by making 'p' remove the HIST_EVENT bit from the returned flag bits in hist_expand(), leaving only the HIST_PRINT flag, then adding special handling for this case to slowread() in io.c (print the line, then instead of executing it, continue and read the next line).	2022-01-25 03:45:47 +00:00
Martijn Dekker	cda8fc916f	Fix a crashing bug in history expansion Reproducer: $ set -o histexpand $ echo foo !#^:h !#^:& /usr/local/bin/ksh: :&: no previous substitution ksh(80822,0x10bc2a5c0) malloc: * error for object 0x10a13bae3: pointer being freed was not allocated ksh(80822,0x10bc2a5c0) malloc: * set a breakpoint in malloc_error_break to debug Abort Analysis: In hist_expand(), the 'cc' variable has two functions: it holds a pointer to a malloc'ed copy of the current line, and is also used as a temporary pointer with functions like strchr(). After that temporary use, it is set to NULL again, because the 'done:' routine checks if it non-NULL to decide whether to free the pointer. But if an error occurs, the function may jump straight to 'done' without first setting cc to NULL if it had been used as a temporary pointer. It then tries to free an unallocated pointer. src/cmd/ksh93/edit/hexpand.c: hist_expand(): - Eliminate this bad practice by using a separate variable for temporary pointer purposes. (I was unable to reproduce the crash in a pty regression test, though it is consistently reproducible in a real interactive session. So I haven't added that test.)	2022-01-24 08:41:27 +00:00
Martijn Dekker	0a1cc391bf	Document SHOPT_ACCT and SHOPT_AUDIT A 2008 blog post by Finnbar P. Murphy is the only documentation on these facilities that is available to date. Thankfully, Finnbarr has graciously granted me permission to use all his ksh93-related blog posts for ksh 93u+m under the same license as ksh. Since SHOPT_ACCT (disabled by default) is essentially an older and more primitive version of SHOPT_AUDIT (enabled by default), we should probably remove the former in a future release. src/cmd/ksh93/README-AUDIT.md: - Added.	2022-01-24 03:10:14 +00:00
Martijn Dekker	cda661d34c	Various cleanups, mostly in regression tests src/cmd/ksh93/data/variables.sh: shtab_variables[]: - Remove unused "CSWIDTH" entry. All use of it (including the matching CSWIDTHNOD macro) was removed in version 2003-04-22. src/cmd/ksh93/tests/variables.sh: - For the tests on the shtab_variables[] variables, read the variable names straight from variables.c instead of synching the list in the test script, which would surely be forgotten. src/cmd/ksh93/tests/*.sh: - Fix a number of mistaken tries to count errors from a subshell. - Fix miscellaneous minor breakage and typos.	2022-01-24 02:58:25 +00:00
Martijn Dekker	8afc4756e8	history expansion: add missing bounds check So far all ksh versions accept event numbers referring to nonexistent history events in history expansion (-H/-o histexpand), e.g. !9999 is accepted even if the history file has no item 9999. These expansions seem to show random content from the history file, sometimes including binary data. Of course an "event not found" error should have been thrown instead. hist_expand() in hexpand.c calls hist_seek() (from history.c) without any bounds checking except verifying the history event number is greater than zero. This commit adds a bounds check to hist_seek() itself as it's called from three other places in history.c, so perhaps this fixes a few other bugs as well. src/cmd/ksh93/edit/history.c: hist_seek(): - Use the hist_min() and hist_max() macros provided in history.h to check bounds. Note that hist_max() yields the number of the command line currently being entered, so the maximum for seeking purposes is actually its result minus 1.	2022-01-21 02:13:53 +00:00
Johnothan King	eaf7662daa	Fix history expansion buffer overflow (#434 ) History expansion currently crashes under ASan due to a buffer overflow. Reproducer: $ set -H $ !!:s/old/new/ Explanation from <https://github.com/att/ast/issues/1369>: > The problem is the code assumes the buffer allocated for a string > stream is zero initialized. But the SFIO code uses malloc() to > allocate the buffer and does not explicitly initialize it with > memset(). That it works at all, even without ASAN enabled, is > purely accidental. It will fail if that malloc() returns a block > that had been previously allocated, used, and freed. Under ASAN > the buffer is initialized (at least on my system) to a sequence > of 0xBE bytes. So the strdup() happily tries to duplicate a > string that is the size of that buffer and fails when it reads > past the end of the buffer looking for the terminating zero byte. src/cmd/ksh93/edit/hexpand.c: - Backport ksh2020 bugfix that avoids assuming the string stream has been initialized to zeros: https://github.com/att/ast/commit/cf16bcca (minus the incorrect change to the static wm variable).	2022-01-21 02:13:08 +00:00
Martijn Dekker	2c4b05b4f8	tie up standards macros loose ends (re: `289dd46c`) src/lib/libast/features/standards: - Do not emit #defines for the typ u_long test which is only used as a heuristic in subsequent tests in this file. (Note that 'set' can set and unset any iffe command-line --option at runtime.) - Remove definition of _ISOC99_SOURCE macro. This is another old GNU thing; feature_test_macros(7) says invoking the compiler with the option -std=c99 has the same effect. But modern GCC has C11 with GNU extensions as the default, which is fine. If a particular standard is desired, pass a -std=... flag in $CC. src/cmd/ksh93/features/rlimits: - Remove overlooked Linux 64 types/functions hackery. After defining standards macros it caused a build failure on at least one version of Void Linux (but not 5.15.14_1). Thanks to @JohnoKing for the report. src/cmd/ksh93/sh/subshell.c, src/lib/libdll/dllnext.c: - Remove now-redundant local definitions of _GNU_SOURCE and __EXTENSIONS__ macros. src/cmd/ksh93/tests/builtins.sh: - Fix broken sed invocation (re: `41829efa`).	2022-01-20 05:50:00 +00:00
Martijn Dekker	41829efa06	Various minor cleanups and fixes The more notable ones are: src/lib/libast/features/standards: - Do not redefine _GNU_SOURCE and _FILE_OFFSET_BITS if already defined from $CCFLAGS. Thanks to @hyanias for the heads-up. (re: `289dd46c`) src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/include/shell.h, src/cmd/ksh93/sh/args.c, src/cmd/ksh93/sh/name.c: - Remove -T test code activation option. It was basically unused. The only thing it did was intentionally introduce a memory leak in table_unset() if the 4th bit in the option argument was set. A search in ast-open-history reveals a few more trivial test uses that were later deleted, but nothing interesting. src/cmd/ksh93/tests/{basic,path}.sh: - Skip a couple of tests on AIX avoid hangs, at least one of which is not ksh's fault. Thanks to @HansH111 for the report. src/cmd/ksh93/tests/builtins.sh: - Change one awk use to a more portable sed invocation to placate systems with ancient awk commands, such as AIX. (re: `de795e1f`)	2022-01-20 00:54:42 +00:00
Martijn Dekker	289dd46c05	build: include standards macros for all AST code (re: `7fb814e1`) Turns out that the standards macros set by features/standards (such as _GNU_SOURCE for Linux or _DARWIN_SOURCE for macOS) were still not included for most C source files! Instead, they were selectively included for some files only, sometimes via FEATURE/standards (the output of features/standards), sometimes via ast_standards.h which is copied from FEATURE/standards. Consequently, there were still inconsistencies in the system header interfaces exposed on Linux, macOS, Solaris, et al. It's no wonder it sometimes took so much hackery to keep everything building. Of course, making this consistent had to break things somewhere. Breakage occurred on 32-bit Linux due to a lot of ugly hackery involving direct use of internal GNU types like off64_t and functions like fseek64(). This is now all removed and they are activated by setting the appropriate feature macro instead, so these types and functions can be used with their standard names (off_t, fseek, etc.) Before committing I've tested these changes on the following i386/x86_64 systems: Linux (glibc 32 and 64 bit, musl libc 64 bit), Solaris (32 and 64 bit), illumos (32 and 64 bit), FreeBSD (64 bit), macOS (64 bit), Cygwin (32 bit), and Haiku (64 bit). (Note: ast_standards.h is copied from FEATURE/standards, whereas ast_common.h is copied from FEATURE/common.) src/lib/libast/include/ast_std.h, src/lib/libast/stdio/stdhdr.h: - Include <ast_standards.h> first. This should cause all the AST and dependent code (such as ksh) to get the standards macros. src/lib/libast/features/standards: - For GNU (glibc), #define _FILE_OFFSET_BITS 64 to get large file support with 64-bit offsets. - Stop GNU and Cygwin <string.h> form defining the GNU version of basename(3); on Cygwin, that declaration conflicts with the AST version (and with POSIX) by using a const char* argument instead of char. It is deactivated by defining the macro 'basename' (as 'basename'); this causes GNU string.h to consider it to be already defined by the standard libgen.h header. All other changed files: - Remove direct use of 64* types and functions and a lot of related hackery.	2022-01-20 00:53:22 +00:00
Johnothan King	fb8e239cb4	Fix build error on AIX 64-bit (#428 ) This commit adds a wrapper for the AIX ar command that uses the -X64 flag to avoid build errors on that platform. Resolves: https://github.com/ksh93/ksh/issues/385	2022-01-17 20:25:06 +00:00
Martijn Dekker	e569f23ef9	bump internal libast version; various minor cleanups These are minor things I accumulated over the last month or so. Notable changes: src/lib/libast/features/api, src/lib/libast/misc/state.c, src/lib/libast/comp/conf.tab, src/cmd/ksh93/include/defs.h: - Bump internal libast version to 20220101L. We've made a few additions to the API, at least pathicase (see `71934570`, `ca3ec200`) and astconf_long (see `c2ac69b2`), so this should have been done already. This also updates '/opt/ast/bin/getconf _AST_VERSION'. - Use AST_VERSION instead of outdated _AST_VERSION. - In state.c, use AST_VERSION instead of hardcoding the version. src/cmd/ksh93/sh/xec.c: - Remove 'restorefd' variable, unused as of 42becab6. - Remove 'cmdrecurse' function and SH_RUNPROG macro; this was once used by a few libcmd commands, but ast-open-archive reveals it's unused as of ast 1999-12-25. src/cmd/ksh93/sh/.c: - Where available, use e_dot instead of "." for consistency; it is defined as an extern so we might as well use it. src/cmd/ksh93/tests/.sh: - When reporting signal names in fails, include the SIG prefix. - Fix a broken process hang test in subshell.sh. src/lib/libast/man/sfdisc.3: - Removed. The interfaces described here never made it out of AT&T; they do not exist in any libast version in ast-open-archive. Resolves: https://github.com/ksh93/ksh/issues/426	2022-01-14 19:55:35 +00:00
Johnothan King	07fc64f52b	Fix use after free bug in discipline functions (#424 ) This fixes one of the ASan failures in the variables.sh regression tests. Explanation from <https://github.com/att/ast/issues/1268>: > The problem is caused by this block of code freeing the Namfun_t* > (via the call to chktfree()): > https://github.com/ksh93/ksh/blob/307bc3ed/src/cmd/ksh93/sh/nvdisc.c#L570-L577 >> 570 else >> 571 { >> 572 struct blocked bp; >> 573 action = vp->disc[type]; >> 574 vp->disc[type] = 0; >> 575 if(!(bp=block_info(np,(struct blocked)0)) \|\| !isblocked(bp,UNASSIGN)) >> 576 chktfree(np,vp); >> 577 } > That invalidates the value stored in vp which is dereferenced here: > https://github.com/ksh93/ksh/blob/307bc3ed/src/cmd/ksh93/sh/nvdisc.c#L411-L421 >> 419 unblock(bp,type); >> 420 if(!vp->disc[type]) >> 421 chktfree(np,vp); ksh2020 commit: https://github.com/att/ast/commit/df1e8165 src/cmd/ksh93/sh/nvdisc.c: - Block nv_setdisc from freeing the memory associated with the vp pointer.	2022-01-14 19:51:24 +00:00
Johnothan King	307bc3edce	time: Fix precision bug in times(3) fallback (#425 ) In the times(3) fallback for the time keyword (which can be enabled in xec.c by undefining _lib_getrusage and timeofday), ksh will print the obtained time incorrectly if TIMEFORMAT is set to use a precision level of three: $ TIMEFORMAT=$'\nreal\t%3lR' $ time sleep .080 real 0m00.008s # Should be '00.080s' This commit corrects that issue by using 10^precision to get the correct fractional scaling. Note that the fallback still doesn't support a true precision level of three (times(3) alone doesn't support it), so this in effect pads a zero to the end of the output when the precision level is three. Additional change to tests/builtins.sh: - While fixing the above issue I found out that ksh93v- broke support for passing microseconds to the sleep builtin in the form of <num>U. I've added a regression test for that bug to ensure it isn't backported to ksh93u+m by accident. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-01-13 12:25:22 +00:00
Johnothan King	40b2c3c3d4	alias: Avoid unnecessary forks (re: `ec888867`) (#417 ) The code used to fork subshells when creating/changing aliases will always fork, even when the alias tree isn't changed: $ echo $(unalias --man 2> /dev/null; echo $$ ${.sh.pid}) 375097 375107 $ alias foo=bar; echo $(alias -p foo; echo $$ ${.sh.pid}) alias foo=bar 375097 375110 This is a bit inefficient, so this commit avoids forking a subshell unless at least one change is made to the alias table. src/cmd/ksh93/bltins/typeset.c: - b_alias(), b_unalias(): Remove sh_subfork() calls. - setall(): When creating an alias (name contains '='), fork a virtual subshell before calling nv_open() to add the node. - unall(): - When unsetting all aliases (-a), fork subshell before dtclear(). - When unsetting one alias, fork subshell before nv_delete(). - Move sh_pushcontext() and sh_popcontext() expansions so that sh_subfork() is not in between them, as that would cause program flow corruption or a crash. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-01-13 01:56:19 +00:00
Martijn Dekker	b509e92241	edit: do not enable multiline mode with no editor active If neither gmacs/emacs nor vi are active, the multiline mode should not be enabled even if the multiline option is on. Doing so can cause inconsistent behaviour, particularly in multibyte locales where, if the shell is compiled with SHOPT_RAWONLY (as is default), the no-editor mode is actually handled by vi.c. Also, the new --histreedit and --histverify options only work in the emacs or vi editors, or in no-editor mode when handled by vi. Which means they cannot ever work if neither emacs or vi were compiled in (i.e. SHOPT_ESH and SHOPT_VSH were both disabled). In that case, there's no point in compiling in those options. Come to think of it, the same applies to the multiline option. All changed files: - Update SHOPT_ESH/SHOPT_VSH preprocessor directives as per above. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/include/shell.h: - Move definitions of history expansion-related options to shell.h, which is where all the other shell options are defined.	2022-01-12 20:39:05 +00:00
Martijn Dekker	a5700d3937	Expose --histreedit and --histverify options (re: `921bbcae`) This adds two long-form shell options that modify history expansion (-H/--histexpand). If --histreedit is on and a history expansion fails, the command line is reloaded into the next prompt's edit buffer, allowing corrections. If --histverify is on, the results of a history expansion are not immediately executed but instead loaded into the next prompt's edit buffer, allowing further changes. SH_HISTREEDIT and SH_HISTVERIFY were already handled all along in slowread() in io.c and via 'reedit' arguments to functions called there, but could not be turned on as they were only ever exposed as shopt options in the removed bash compatibility mode (in theory only, as it failed to compile). I had overlooked them until now. So, since the code is there, let's expose these options through the normal long options interface. They're working fine, and activating them actually makes history expansion tolerable to use. src/cmd/ksh93/data/options.c: - Make these options available as "histreedit" and "histverify". src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - Document the "new" options. src/cmd/ksh93/include/defs.h: - Remove unused SH_HISTAPPEND and SH_HISTORY2 macros which were part of the removed bash compatibility code. Note that ksh does not need a histappend option as it never overwrites the history file (in the bash mode, this shopt option was a no-op).	2022-01-12 20:38:30 +00:00
Martijn Dekker	2d4a787564	Fix comsub hang on subshell fork (re: `090b65e7`) The referenced commit introduced a bug that caused command substitutions to hang, writing infinite zero bytes, when redirecting standard output on a built-in comand that forks the command substitution subshell. The bug was caused by removing the fork when redirecting standard output in a non-permanent manner. However, simply reintroducing the fork causes multiple regressions that we had fixed in the meantime. Thankfully, it looks like this forking workaround is only necessary when redirecting the output of built-ins. It appears that moving workaround from io.c to the built-ins handling code in sh_exec() in xec.c, right before calling sh_redirect(), allows reintroducing the forking workaround for non-permanent redirections without causing other regressions. It would be better if the underlying cause of the hang were fixed so the workaround becomes unnecessary, but I don't think that is going to happen any time soon (AT&T didn't manage, either). src/cmd/ksh93/sh/io.c: sh_redirect(): - Remove forking workaround for redirecting stdout in a comsub. src/cmd/ksh93/sh/xec.c: sh_exec(): TCOM: built-ins handling code: - Reimplement the workaround here. Resolves: https://github.com/ksh93/ksh/issues/416	2022-01-12 20:30:20 +00:00
Martijn Dekker	f711da9081	Make process substitutions work on Haiku On Haiku: # /bin/cat <(echo hi) # no redirection cat: /tmp/ksh.f29pd8f: No such file or directory Whereas this works fine: # /bin/cat < <(echo hi) # with redirection hi # /opt/ast/bin/cat <(echo hi) # no redirection; use built-in hi Haiku does not have /dev/fd, so uses the FIFO (named pipe) fallback mechanism. See also: `c3eac977` Analysis: In the TFORK part of sh_exec(), forked branch (child), the FIFO (sh.fifo) is unlinked immediately after opening it. This is not a problem if the process substitution is used in combination with a redirection, but if not, then the FIFO is passed on to the command as a file name argument. This creates a race condition: ksh was counting on the external 'cat' command opening the FIFO before the child could unlink it. Whether that race is won depends on operating system implementation details. When invoking an external command on Haiku, the race is lost. src/cmd/ksh93/sh/xec.c: sh_exec(): TFORK: child branch: - Delay unlinking the FIFO until after executing the process substitution, when we're about to exit from the child process.	2022-01-12 20:30:02 +00:00
Martijn Dekker	de5bdd12a4	iffe: some more tweaks src/cmd/INIT/iffe.sh: - Remove obsolete $posix_noglob/$posix_glob code to disable and re-enable pathname expansion. Instead, globally disable it using 'set -o noglob'. Except a couple of 'rm' commands, no aspect of this code uses globbing (note: 'case' patterns do not count), nor do any of the shell code feature tests that are eval'ed by iffe. Field splitting is heavily used via unquoted variable expansions; globbing should not interfere. (Concrete example: one of the test notes in src/cmd/ksh93/features/options is "SHOPT_* option probe" and that SHOPT_* is taken as a file name pattern, so could be expanded.) For the rm commands, globbing is turned back on in a subshell. If this breaks something that I've missed, hopefully that will turn up soon. - Remove $show/$SHOW code to massage different versions of 'echo' into acting consistently; instead, use a show_test() function that uses the POSIX-specified printf command. - For debugging level 2 and 3, use much more extensive PS4 prompts taken from modernish (which I wrote), depending on the shell. - Replace the obsolete 'set -' command. In the ancient Bourne shell, this disables the v and x options, so is equivalent to 'set +v +x'. This still works on almost all modern shells, but not yash, and POSIX considers it "unspecified". - Remove a few echo\|sed fallbacks I'd missed (re: 215efa15). src/cmd/ksh93/features/math.sh: - Disable pathname expansion here as well. - Pass down an inherited $IFFEFLAGS to recursive iffe invocations.	2022-01-12 20:29:51 +00:00
Johnothan King	1a9af9db40	Fix vi mode tab completion with spaces (#413 ) Attempting to complete file names in vi mode using tab completion can fail if the last character on the command line is a space. Reproducer (note that this bug doesn't occur in emacs mode): $ set -o vi $ mkdir '/tmp/foo bar' $ test -d /tmp/foo\ <Tab> src/cmd/ksh93/edit/vi.c: - Don't disable tab completion or reset the tab count just because the last character on the command line is a space. This bugfix was backported from ksh93v- 2014-06-06. src/cmd/ksh93/tests/pty.sh: - Add a regression test for the tab completion bug.	2022-01-07 16:18:28 +00:00
Johnothan King	ca5803419b	Fix various typos, man page issues and improve the documentation (#415 ) This commit makes various different improvements to the documentation: - sh.1: Backported (with changes) mandoc warning fixes from ksh2020 for the ksh93(1) man page: <https://github.com/att/ast/pull/1406> - Removed unnecessary spaces at the end of lines to fix a few other mandoc warnings. - Fixed various typos and capitalization errors in the documentation. - ANNOUNCE: Document the addition of the ${.sh.pid} variable (re: `9de65210`). - libast/man/str: Update the man pages for the libast str functions to improve how accurately each function is described. - ksh93/README: Update regression test/compatibility notes to include OpenBSD 7.0, FreeBSD 13.0 and WSL running Ubuntu 20.04. - Change a few places to store the return value from strlen in a size_t variable rather than signed int. - comp/setlocale.c: To avoid confusion of two separate variables named lang, the function local variable has been renamed to langidx.	2022-01-07 16:17:55 +00:00
Johnothan King	d347ec0fc9	Allow ksh to compile on Haiku; implement SIGKILLTHR support (#408 ) This commit implements the build fixes required to get ksh running on Haiku. Note that while ksh does compile, it has a ton of regression test failures on Haiku. src/cmd/ksh93/data/signals.c, src/lib/libast/features/signal.c: - Add support for the SIGKILLTHR signal, which is supported by BeOS and Haiku. - SIGINFO was missing an entry in the libast feature test, so add one (re: `658bba74`). src/cmd/ksh93/RELEASE: - Add an entry noting that ksh now compiles on Haiku, albeit with many regression test failures. src/cmd/ksh93/{include/terminal.h,sh/path.c}: - Silence compiler warnings on Haiku. src/lib/libast/features/mmap: - The mmap feature test freezes on Haiku, so modify the test to fail immediately on that OS. src/lib/libast/misc/signal.c: - Avoid redefining the signal definition on Haiku to fix a compiler error. src/lib/libast/features/nl_types: - For some reason the nl_item typedef on Haiku doesn't work correctly. Work around that by creating the nl_item type in the libast nl_types feature test.	2022-01-07 16:16:42 +00:00
Martijn Dekker	b590a9f155	[shp cleanup 01..20] all the rest (re: `2d3ec8b6`) This combines 20 cleanup commits from the dev branch. All changed files: - Clean up pointer defererences to sh. - Remove shp arguments from functions. Other notable changes: src/cmd/ksh93/include/shell.h, src/cmd/ksh93/sh/init.c: - On second thought, get rid of the function version of sh_getinterp() as libshell ABI compatibility is moot. We've already been breaking that by reordering the sh struct, so there is no way it's going to work without recompiling. src/cmd/ksh93/sh/name.c: - De-obfuscate the relationship between nv_scan() and scanfilter(). The former just calls the latter as a static function, there's no need to do that via a function pointer and void* type conversions. src/cmd/ksh93/bltins/typeset.c, src/cmd/ksh93/sh/name.c, src/cmd/ksh93/sh/nvdisc.c: - 'struct adata' and 'struct tdata', defined as local struct types in these files, need to have their first three fields in common, the first being a pointer to sh. This is because scanfilter() in name.c accesses these fields via a type conversion. So the sh field needed to be removed in all three at the same time. TODO: de-obfuscate: good practice definition via a header file. src/cmd/ksh93/sh/path.c: - Naming consistency: reserve the path_ function name prefix for externs and rename statics with that prefix. - The default path was sometimes referred to as the standard path. To use one term, rename std_path to defpath and onstdpath() to ondefpath(). - De-obfuscate SHOPT_PFSH conditional code by only calling pf_execve() (was path_pfexecve()) if that is compiled in. src/cmd/ksh93/include/streval.h, src/cmd/ksh93/sh/streval.c: - Rename extern strval() to arith_strval() for consistency. src/cmd/ksh93/sh/string.c: - Remove outdated/incorrect isxdigit() fallback; '#ifnded isxdigit' is not a correct test as isxdigit() is specified as a function. Plus, it's part of C89/C90 which we now require. (re: `ac8991e5`) src/cmd/ksh93/sh/suid_exec.c: - Replace an incorrect reference to shgd->current_pid with getpid(); it cannot work as (contrary to its misleading directory placement) suid_exec is an independent libast program with no link to ksh or libshell at all. However, no one noticed because this was in fallback code for ancient systems without setreuid(2). Since that standard function was specified in POSIX Issue 4 Version 2 from 1994, we should remove that fallback code sometime as part of another obsolete code cleanup operation to avoid further bit rot. (re: `843b546c`) src/cmd/ksh93/bltins/print.c: genformat(): - Remove preformat[] which was always empty and had no effect. src/cmd/ksh93/shell.3: - Minor copy-edit. - Remove documentation for nonexistent sh.infile_name. A search through ast-open-archive[*] reveals this never existed at all. - Document sh.savexit (== $?). src/cmd/ksh93/shell.3, src/cmd/ksh93/include/shell.h, src/cmd/ksh93/sh/init.c: - Remove sh.gd/shgd; this is now unused and was never documented or exposed in the shell.h public interface. - sh_sigcheck() was documented in shell.3 as taking no arguments whereas in the actual code it took a shp argument. I decided to go with the documentation. - That leaves sh_parse() as the only documented function that still takes an shp argument. I'm just going to go ahead and remove it for consistency, reverting sh_parse() to its pre-2003 spec. - Remove undocumented/unused sh_bltin_tree() function which simply returned sh.bltin_tree. - Bump SH_VERSION to 20220106.	2022-01-07 16:16:31 +00:00
Martijn Dekker	01da863154	In the original ast code base, src/{cmd/nmake,lib/libast}/Makefile (nmake makefiles) defined this macro: __OBSOLETE__ == $("6 months ago":@F=%(%Y0101)T) This was used to automatically disable code after a period between 6 and 18 months, on 1st Jan of each year, in preprocessor directives like: #if __OBSOLETE__ < 20080101 // obsolete code here #endif However, when compiling without nmake (as we do), this __OBSOLETE__ macro is not defined at all. And undefined macros evaluate to zero in arithmetic comparisons, so all that obsolete code has been getting compiled. Thankfully it doesn't seem to have done any harm, but all that code was supposed to expire between 2008 and 2014. src/lib/libast/disc/sfstrtmp.c: - Removed. Was supposed to be a stub #if __OBSOLETE__ >= 20070101. src/lib/libast/include/ast.h: - Remove unused fmtbasell() macro (/* until 2014-01-01 */). Other changed files: - Remove __OBSOLETE__d code.	2022-01-07 15:57:46 +00:00
Martijn Dekker	aee917f666	tests/builtins.sh: skip cd permission test if root (re: `59bacfd4`)	2022-01-07 15:56:50 +00:00
Martijn Dekker	a1c613c48d	package: more flat view fixes (re: `336e82f9`, `aeda3502`) bin/package, src/cmd/INIT/package.sh: - Automatically update an existing flat view even if 'flat' wasn't given for a make action. This stops a flat view becoming inaccurate if you forget to use 'bin/package flat make' consistently. If the $PACKAGEROOT/lib/package/gen directory exists, an existing flat view is assumed. - Correct symlink fallbacks. We need an absolute path for the symlink target or it's going to be broken. .gitignore: - Update.	2022-01-07 15:56:15 +00:00
Johnothan King	f1627e2a8c	Fix typeset -m crash under ASan and on OpenBSD (#412 ) This fixes the use after free issue that caused typeset -m to crash on older versions of OpenBSD and under ASan. The problem that was causing the failure was that the ap pointer wasn't set to null after the memory associated with it was freed. This commit backports a bugfix from ksh93v- 2013-06-28 that sets ap to null before freeing the associated memory and adds a check that makes sure ap is still a valid pointer before calling array_unscope(). tests/types.sh changes: - Avoid redirecting stderr to /dev/null, as this test shouldn't print anything to stderr. - Apply error message improvement from https://github.com/ksh93/ksh/issues/231#issue-834252084. tests/arrays.sh change: - Apply error message improvement from https://github.com/ksh93/ksh/issues/229#issue-834240645 (re: `7c7fde75`). Resolves: https://github.com/ksh93/ksh/issues/231	2022-01-07 15:54:46 +00:00
Johnothan King	7c7fde75c8	Fix arrays.sh test failure under ASan (#411 ) This backports a ksh2020 fix for an ASan heap-use-after-free error in arrays.sh. The arrays regression tests were failing under ASan because the ap pointer was used after the memory allocated to it was freed by _nv_unset(). ksh2020 commit: `f1e5119e31`	2022-01-07 15:53:10 +00:00
Johnothan King	997f0e9dd8	iffe: Correct backslashes in POSIX command substitutions (re: `aeda3502`) (#414 ) In backtick command substitutions '\\' will only print one backslash, while placing a backslash in front of a different character will print a backslash followed by that character: $ echo `echo '\\\\ \3'` \\ \3 $ echo $(echo '\\\\ \3') \\\\ \3 This difference in behavior was causing iffe to fail on Haiku after the changes in `aeda3502`. This commit applies fixes for backslashes in quoted strings to fix the Haiku build failure and other possible bugs introduced by the referenced commit.	2022-01-07 05:20:42 +00:00
Martijn Dekker	ecce82c3ca	package: report 'failed' if build failed bin/package, src/cmd/INIT/package.sh: - Make the EXIT (0) trap report failure based on $error_status (see `d18469d6`), replacing 'done' with 'failed' if it is nonzero. - Remove extra space before 'at' in that report line. - If we get a signal, we have to set error_status to nonzero manually so that the correct exit message is printed.	2022-01-01 02:43:04 +00:00
Martijn Dekker	aeda350298	more package and iffe tweaks/cleanups I've reconsidered excluding build system internals (and also '.o' files) from the flat view. Really the only thing that should be excluded are .old files. bin/package, src/cmd/INIT/package.sh: - Do not exclude anything except .old files from the flat view. - This would delete bin/package when cleaning up the flat view, so explicitly keep bin/package in the clean routine. - Be much faster about updating an existing flat view by checking if a link already exists before creating one. - Add silent cleanup for dozens of orphaned macOS .dSYM bundles belonging to deleted temporary feature test executables. src/cmd/INIT/{iffe,ignore,silent}.sh, bin/{ignore,silent}: - Remove obsolete Bourne shell fallbacks. - Modernise command substitutions. - Remove unused literal() function. - Update copy() function to use printf. - Distinguish just two shell types now: ksh and POSIX. - compile(): Remove obsolete/incorrect grepping for signal messages. Add a POSIX-compiliant check with 'kill -l' to see if the compiler was terminated by a signal.	2022-01-01 02:28:54 +00:00
Martijn Dekker	d9fc61c022	init.c: upstream init.c.patch for CDE's dtksh This upstreams the patch to init.c that is necessary to build dtksh (graphical extensions for CDE, see <https://cdesktopenv.sf.net/>). It has no effect when building regular ksh. Upstreaming it avoids the need to keep updating it when changes to init.c are made.	2022-01-01 02:28:45 +00:00
Martijn Dekker	336e82f942	package: reintroduce/rewrite flat view (re: 20f8557c) I called the flat view featuritis, but it turns out the dtksh build system depends on it. But it was broken, so let's make a proper version instead. This new approach does not symlink directories before make, but hardlinks files after make. To make performance tolerable, it requires a modern POSIX 'find' utility with '{} +'. bin/package, src/cmd/INIT/package.sh: - We're going to depend on 'test X -ef Y' to check if X is the same file as Y, so add that to the $min_posix checks even though it is not technically POSIX. But it's so close to universally available it doesn't really make a difference. - After 'make', create a flat view by hardlinking arch/$HOSTTYPE files, minus build system internals, onto their corresponding paths in $PACKAGEROOT. Fall back to symlinking if hardlinking fails (this would happen when crossing device boundaries). - Clean up flat view when doing clean/clobber. This is done by using 'find' to loop through the files in arch/ again and removing any root paths that are the same file as their corresponding arch/ path (this is where test X -ef Y comes in). Then delete arch/$HOSTTYPE, then delete empty directories left. - Check for the 'flat' qualifier when doing clean/clobber. If given, do not delete arch/$HOSTTYPE but only clean up the flat view and remove empty directories. src/cmd/INIT/Mamfile: - Do not create the lib/package directory. (re: beb3c64a) .gitignore: - Update.	2022-01-01 02:28:38 +00:00
Martijn Dekker	3fc6cf0e2f	iffe: really abort on output{ fail (re: `e20c0c6b`, `e72543a`, `b6bd981`) After further examining the iffe code I found that fail{ ... }end as well as pass{ ... }end blocks are executed in iffe's current environment, using a simple 'eval', with no safeguards whatsoever! This does of course afford maximum flexibility... for example, a block can 'exit 1' to abort the iffe run and the whole build with it. We can use this to abort properly on fatal compilation errors. src/cmd/INIT/iffe.sh: - Complete the fail{ and pass{ documentation; it should really be known that they run in iffe's current environment. - Make one change: for just the 'eval' command that runs these blocks, redirect standard error back to the saved $stderr file descriptor so these blocks can write error messages using the standard >&2 instead of the undocumented >&$stderr. src/lib/*/features/: - Write error message to standard error and error out properly when an output{ ... }end block fails to compile, instead of writing an #error directive to error out later.	2022-01-01 02:28:27 +00:00
Martijn Dekker	2d3ec8b67a	[shp cleanup 00] Reunify the original sh state struct As observed previously (see `3654ee73`, `7e6bbf85`, `79d19458`), the ksh 93u+ codebase on which we rebased development was in a transition: AT&T evidently wanted to make it possible to have several shell interpreter states in the same process, which in theory would have made it possible to start a complete new shell (not just a subshell) without forking a new process. This required transitioning from accessing the 'sh' state struct directly to accessing it via pointers (usually but not always called 'shp'), introducing a lot of bug-prone passing around of those pointers via function arguments and other state structs. Some of the original 'sh' struct was separated into a 'struct shared' called 'shgd' a.k.a. 'sh.gd' (global data) instead; these were global state variables that were going to be shared between the different main shell environments sharing a process. Yet, for some reason, that struct was allocated dynamically once at init time, requiring yet another pointer to access it. <shrug> None of this ever worked, because that transition was incomplete. It was much further along in the ksh 93v- beta, but I don't think it actually worked there either (not very much really did). So, starting a new shell has always required starting a new process. So, now that it's clear what they were trying to do, should we try to make it work? I'm going to go with a firm "no" on that question. Even non-forking (virtual) subshells, something quite a bit less ambitious, were already an unmitigated nightmare of bugs. In 93u+m we fixed a load of bugs related to those, but I'm sure there are still many left. At the very least there are multiple memory leaks. I think the ambition to go even further and have complete shells running separate programs share a process, particularly given the brittle and buggy state of the existing codebase, is evidence that the AT&T team, in the final years, had well and truly lost the ability to think "wait a minute, aren't we in over our heads here, and why are we doing this again? Is this actually a feasible and useful idea?" In my view, having entirely separate programs share a process is a terrible, horrible, no-good idea that takes us back to the bad old days before Unix, when kernels and CPUs were unable to enforce any memory access restrictions. Programmers are imperfect. If you're going to run a new program, you need the kernel to enforce the separation between programs, or you're just asking for memory corruption and security holes. And that separation is enforced by starting a new program in a new process. That's what processes are for. And if you need that to be radically performance-optimised then you're probably doing it wrong anyway. (By the way, I would still argue the same for subshells, even after we fixed many bugs in virtual subshells. But forking all subshells would in fact cause many scripts to slow down, and the community would surely revolt. <sigh> Maybe I should make it a shell option instead, so scripts can 'set -o subfork' for reliability.) It is also unclear how they were going to make something like 'ulimit' work, which can only work in a separate process. There was no sign of a mechanism to fork a separate program's shell mid-execution like there is for subshells (sh_subfork()). Anyway... I had already changed some code here and there to access the sh state struct directly, but as of this commit I'm beginning to properly undo this exercise in pointlessness. From now on, we're exercising pointerlessness instead. I'll do this in stages to make any problems introduced more traceable. Stage 0 restores the full 'sh' state struct to its former static glory and reverts 'shgd' as a separate entity. src/cmd/ksh93/sh/defs.c, src/cmd/ksh93/include/defs.h, src/cmd/ksh93/include/shell.h src/cmd/ksh93/Mamfile:: - Move 'struct sh_scoped' and 'struct limits' from defs.h to shell.h as the sh struct will need their complete definitions. - Get rid of 'struct shared' (shgd) in defs.h; its members are folded back into their original place, the main Shell_t struct (sh) in shell.h. There are no name conflicts. - Get rid of the _SH_PRIVATE macro in defs.h. The members it defines are now defined normally in the main Shell_t struct (sh) in shell.h. - To make this possible, move <history.h> and "fault.h" includes from defs.h to shell.h and update the Mamfile accordingly. - Turn sh_getinterp() and shgd into macros that resolve to (&sh). This will allow the compiler to optimise out many pointer dereferences already. - Keep extern sh_getinterp() for libshell ABI compatibility. src/cmd/ksh93/sh/init.c: - sh_init(): Do not calloc (sh_newof) the sh or shgd structs. - sh_getinterp(): Keep function for libshell ABI compat.	2022-01-01 02:28:06 +00:00

1 2 3 4 5 ...

1037 Commits