cdesktopenv

Commit Graph

Author	SHA1	Message	Date
Martijn Dekker	441dcc0483	Upgrade licence to EPL 2.0 EPL 1.0 says, in section 7: "The Program (including Contributions) may always be distributed subject to the version of the Agreement under which it was received. In addition, after a new version of the Agreement is published, Contributor may elect to distribute the Program (including its Contributions) under the new version." The Eclipse Foundation also encourage everyone to upgrade: https://www.eclipse.org/legal/epl-2.0/faq.php#h.60mjudroo8e5 https://www.eclipse.org/legal/epl-2.0/faq.php#h.tci84nlsqpgw Unfortunately the new Secondary License option is not available to us as we're not the original copyright holders and don't have the legal power to add one. So, no GPL compatibility. Sorry.	2022-07-28 05:46:08 +02:00
Martijn Dekker	1a0d75d47c	Add ${.sh.ppid} (re: `48f78227`) Since we now have sh.current_ppid, we might as well point a shell variable to it, as the cost is nil. Together, ${.sh.pid} and ${.sh.ppid} (updated when a virtual subshell forks) form a logical counterpart pair to $$ and $PPID (never updated in subshells). This commit also adds a section to the manual page that hopefully does away with the depressingly widespread subshell/child shell confusion once and for all... :P	2022-07-27 22:58:43 +02:00
Martijn Dekker	305073af9d	Remove .sh.dollar (re: `3613da42`, `7b0e0776`) The undocumented .sh.dollar variable was a hack used to change the value of $$ (main shell's PID); any value assigned to .sh.dollar overrides the value of $$. This was used by libcoshell and by the pre-fork(2) code to update the value of $$ in child processes initialised by internally generated shell scripts. Both were removed long ago, so we don't need this.	2022-07-27 22:58:10 +02:00
Martijn Dekker	ca5ea73d8c	Fix assumption that pid_t==int32_t, other wrong typecasts For $PPID and ${.sh.pid}, ksh has simply been assuming that pid_t is a 32-bit integer. In POSIX, the size of pid_t is not specified and operating systems are free to vary it. On any system where sizeof(pid_t) != 4, these variables produce undefined results. (Note that this is not a problem with $$ as it is converted to a string value directly from pid_t in special() in macro.c.) src/cmd/ksh93/features/options: - Add feature test that produces NV_PID, a name-value attribute macro that is set to NV_INTEGER\|NV_SHORT (for int16_t), NV_INTEGER (for int32_t) or NV_INTEGER\|NV_LONG (for Sflong_t a.k.a. intmax_t), depending on the size of pid_t. These are the three integer size attributes supported by nv_getval(). The build will abort on any (hypothetical?) system where pid_t has a size other than these three. In that case, the test and nv_getval() will need to have a new integer size attribute added. src/cmd/ksh93/data/variables.c: - Use NV_PID instead of NV_INTEGER for $PPID and ${.sh.pid}. This fixes the issue. Other changed files: - Fix the typecasts to various fmtbase() calls; the first argument is of type intmax_t (a.k.a. Sflong_t), not long. - When outputting PIDs with sfprintf(), typecast to Sflong_t and use %lld to account for possible systems with very large PIDs. (Though %lld is not in C89/C90, Sfio supports it anyway.)	2022-07-27 22:50:05 +02:00
Martijn Dekker	592ce7415a	test/[/[[: Fix incorrect 0-skipping, float precision (re: `c734568b`) The arguments to the binary numerical comparison operators (-eq, -gt, etc.) in the [[ and test/[ commands are treated as arithmetic expressions, even if $((...)) is not used. But there is some seriously incorrect behaviour: Reproducers (all should output 0/true): $ [[ 0x0A -eq 10 ]]; echo $? 1 $ [[ 1+0x0A -eq 11 ]]; echo $? 0 $ (set --posix; [[ 010 -eq 8 ]]); echo $? 1 $ (set --posix; [[ +010 -eq 8 ]]); echo $? 0 $ [[ 0xA -eq 10 ]]; echo $? 1 $ xA=10; [[ 0xA -eq 10 ]]; echo $? 0 $ xA=WTF; [[ 0xA -eq 10 ]]; echo $? ksh: WTF: parameter not set (POSIX mode enables the recognition of the initial 0 as a prefix indicating an octal number in arithmetic expressions.) The cause is the two 'while' loops in this section in test_binop() in src/cmd/ksh93/bltins/test.c: 502: if(op&TEST_ARITH) 503: { 504: while(left=='0') 505: left++; 506: while(right=='0') 507: right++; 508: lnum = sh_arith(left); 509: rnum = sh_arith(right); 510: } So initial zeros are unconditionally skipped. Ostensibly this is to disable the recognition of the initial zero as an octal prefix as well as 0x as a hexadecimal prefix. This would be okay for enforcing a decimal-only limitation for simple numbers, but to do this for arithmetic expressions is flagrantly incorrect, to say the least. (insert standard rant about AT&T quality standards here) The fix for '[[' is simply to delete the two 'while' loops. But that creates a problem for the deprecated-but-standard 'test'/'[' command, which also uses the test_binop() function. According to POSIX, test/[ only parses simple decimal numbers, so octal, etc. is not a thing. But, after that fix, 'test 08 -eq 10' in POSIX mode yields true, which is unlike every other shell. (Note that this is unlike [[ 08 -eq 10 ]], which yields true on bash because '[[' parses operands as arithmetic expressions.) For test/[ in non-POSIX mode, we don't need to change anything. For POSIX mode, we should only parse literal decimal numbers for these operators in test/[, disallowing unexpanded arithmetic expressions. This makes ksh's POSIX-mode test/[ act like every other shell and like external .../bin/test commands shipped by OSs. src/cmd/ksh93/bltins/text.c: test_binop(): - Correct a type mismatch. The left and right hand numbers should be Sfdouble_t, the maximum double length on the current system, and the type that sh_arith() returns. Instead, long double (typeset -lF) values were reduced to double (typeset -F) before comparing! - For test/[ in POSIX, only accept simple decimal numbers: use strtold() instead of sh_arith(). Do skip initial zeros here as this disables the recognition of the hexadecimal prefix. Throw an error on an invalid decimal number. Floating point constants are still parsed, but that's fine as this does not cause any incompatibility with POSIX. - For code legibility, move handling of TEST_EQ, etc. to within the if(op&TEST_ARITH) block. This also allows lnum and rnum to be local to that block.	2022-07-26 21:33:00 +02:00
Martijn Dekker	8264d2089a	Remove ineffective check for login shell; require -p for suid/sgid In main.c: 158: if(sh.ppid==1) 159: sh.login_sh++; If that was ever valid, it certainly is not now. As far as I know, there is no currently existing system where PID 1 (init or systemd or whatever) is the parent shell of the login shell, even straight after bootup; login shells are invoked via a program like login(1). Plus, there is no guarantee the init process actually has PID 1. This invalidates all use of login_sh that couldn't be replaced by checks for the login_shell option, so this commit does just that. src/cmd/ksh93/include/shell.h: - Remove login_sh flag. src/cmd/ksh93/sh/init.c: - If a login shell was detected, just set the login_shell option. - Remove obsolete check for #! setuid scripts. This was meant to guard against a symlink called '-i' to a setuid script with a hashbang path, which used to give users a root shell. All modern Unixes ignore the setuid bit when they detect a hashbang path. src/cmd/ksh93/SHOPT.sh: - By default, let's require the -p/--privileged invocation option for the setuid/setgid bit on the shell binary to be respected, for all user IDs (>= 0). This is what bash and mksh do, and it seems sensible. (See init.c 1475-1483)	2022-07-21 05:51:07 +02:00
Martijn Dekker	42fc5c4c0d	[v1.0] Remove experimental .getn discipline .getn discipline functions cause .sh.value to have a float type for arithmetic expressions that get the value of foo, avoiding the problem of having to convert between floats and strings (e.g. rounding errors). There is no corresponding .setn discipline. A search in the ast-open-archive repo reveals that the getn discipline was quietly added in version 2009-08-21 93t+, with not even a mention in the RELEASE file. The one available mention on the internet is this old thread: https://www.mail-archive.com/ast-users@research.att.com/msg00601.html Apparently a setn discipline was* planned, but never implemented. getn discipline functions may also crash in several ways. I've been unsuccessful at solving all the crashes, particularly as one of them is intermittent. This should not be in the 1.0 release. Further discussion: https://github.com/ksh93/ksh/issues/435	2022-07-16 08:42:50 +02:00
Martijn Dekker	ffee9100d5	Robustify ${.sh.level} scope switching (re: `69d37d5e`, `e1c41bb2`) Switching the function scope to a parent scope by assigning to .sh.level (SH_LEVELNOD) leaves the shell in an inconsistent state, causing invalid-free and/or use-after-free bugs. The intention of .sh.level was always to temporarily switch scopes inside a DEBUG trap, so this commit minimises the pitfalls and instability by imposing some sensible limitations: 1. .sh.level is now a read-only variable except while executing a DEBUG trap; 2. while it's writeable, attempts to unset .sh.level or to change its attributes are ignored; 3. attempts to set a discipline function for .sh.level are ignored; 4. it is an error to set a level < 0 or > the current scope. Even more crashing bugs are fixed by simplifiying the handling and initialisation of .sh.level and by exempting it completely from virtual subshell scoping (to which it's irrelevant). TODO: one thing remains: scope corruption and use-after-free happen when using the '.' command inside a DEBUG trap with ${.sh.level} changed. Behaviour same as before this commit. To be investigated. All changed files: - Consistently use the int16_t type for level values as that is the type of its non-pointer storage in SH_LEVELNOD. - Update .sh.level by using an update_sh_level() macro that assigns directly to the node value, then restores the scope if needed. - To eliminate implicit typecasts, use the same int16_t type (the type used by short ints such as SH_LEVELNOD) for all variables containing a function and/or dot script level. src/cmd/ksh93/include/variables.h: - Add update_sh_level() macro. src/cmd/ksh93/include/name.h, src/cmd/ksh93/sh/macro.c: - Add a nv_nonptr() macro that checks attributes for a non-pointer value -- currently only signed or unsigned short integer value, accessed via the 's' member of 'union Value' (e.g. np->nvalue.s). - nv_isnull(): To avoid undefined behaviour, check for attributes indicating a non-pointer value before accessing the nvalue.cp pointer (re: `5aba0c72`). - varsub(): In the set/unset check, remove the now-redundant exception for SH_LEVELNOD. src/cmd/ksh93/data/variables.c, src/cmd/ksh93/sh/init.c: - shtab_variables[]: Make .sh.level a read-only short integer. - sh_inittree(): To avoid undefined behaviour, do not assign to the 'union Value' char pointer if the attribute indicates a non- pointer short integer value. Instead, the table value is ignored. src/cmd/ksh93/sh/subshell.c: sh_assignok(): - Never create a subshell scope for SH_LEVELNOD. src/cmd/ksh93/sh/xec.c: - Get rid of 'struct Level' and its maxlevel member. This was only used in put_level() to check for an out of range assignment, but this can be trivially done by checking sh.fn_depth+sh.dot_depth. - This in turn allows further simplification that reduces init for .sh.level to a single nv_disc() call in sh_debug(), so get rid of init_level(). - put_level(): Throw a "level out of range" error if assigned a wrong level. - sh_debug(): - Turn off the NV_RDONLY (read-only) attribute for SH_LEVELNOD while executing the DEBUG trap. - Restore the current scope when trap execution is finished. - sh_funct(): Remove all .sh.level handling. POSIX functions (and dot scripts) already handle it in b_dot_cmd(), so sh_funct(), which is used by both, is the wrong place to do it. - sh_funscope(): Update .sh.level for ksh syntax functions here instead. Also, do not bother to initialise its discipline here, as it can now only be changed in a DEBUG trap. src/cmd/ksh93/bltins/typeset.c: setall(): - When it's not read-only, ignore all attribute changes for .sh.level, as changing the attributes would crash the shell. src/cmd/ksh93/sh/nvdisc.c: nv_setdisc(): - Ignore all attempts to set a discipline function for .sh.level, as doing this would crash the shell. src/cmd/ksh93/bltins/misc.c: b_dot_cmd(): - Bug fix: also update .sh.level when quitting a dot script. src/cmd/ksh93/sh/name.c: - _nv_unset(): - To avoid an inconsistent state, ignore all attempts to unset .sh.level. - To avoid undefined behaviour, do not zero np->nvalue.cp if attributes for np indicate a non-pointer value (the actual bit value of a null pointer is not defined by the standard, so there is no guarantee that zeroing .cp will zero .s). - sh_setscope(): For consistency, always set error_info.id (the command name for error messages) to the new scope's cmdname. Previously this was only done for two calls of this function. - nv_name(): Fix a crashing bug by checking that np->nvname is a non-null pointer before dereferencing it. src/cmd/ksh93/include/nval.h: - The NV_UINT16P macro (which is unsigned NV_INT16P) had a typo in it, which went unnoticed for many years because it's not directly used (though its bit flags are set and used indirectly). Let's fix it anyway and keep it for completeness' sake.	2022-07-13 23:11:18 +02:00
Martijn Dekker	9ce426a8c4	Disable SHOPT_DYNAMIC by default Dynamically loadable built-ins do not work well with a statically linked ksh; they cannot use ksh's statically linked copies of libast and libshell, so they would need to bring their own, but multiple copies of those don't play well together. So dynamically loaded built-ins cannot interface with the shell. Only non-AST, non-SFIO built-ins are possible. Which is something that perhaps five people in the world know how to do as this is not documented anywhere (hint: your built-in needs the BLT_NOSFIO attribute to use stdio without problems). And those five people are also able to compile their own ksh with SHOPT_DYNAMIC reenabled. Plus, the SHOPT_DYNAMIC code causes strange $PATH search regressions on a few systems. The cause of that bug has eluded me so far, but disabling this is effectively a fix on those systems. src/cmd/ksh93/SHOPT.sh: - Turn off SHOPT_DYNAMIC by default. src/cmd/ksh93/data/builtins.c: - Do not compile in irrelevant sh_optbuiltin[] (builtin --man) documentation if SHOPT_DYNAMIC is disabled.	2022-07-09 17:17:52 +02:00
Martijn Dekker	4d50b69cbd	[v1.0] remove alarm builtin It's undocumented, it's broken and can crash the shell, and it's unclear if it can ever be fixed. So with a 1.0 release (hopefully) not very far off, it's time to remove it from the 1.0 branch. Related: https://github.com/ksh93/ksh/issues/422	2022-06-21 05:45:53 +01:00
Martijn Dekker	627058e862	Yet more accumulated tweaks and cleanups Notable changes: src/cmd/ksh93/bltins/trap.c: b_trap(): - Disable the unadvertised 'trap + SIG' feature in POSIX mode; it's not compatible as '+' is a legitimate command name. src/cmd/ksh93/data/builtins.c: - Give "pwd", "alarm" and "times" the BLT_ENV flag for better performance. There is no need for those to be stoppable. src/cmd/ksh93/sh/xec.c: - sh_eval(): Remove a "temporary tksh hack" and associated sh.fn_reset flag. - sh_exec(): - Remove now-used 'unpipe' flag and one remaining dead check for it (re: `a2196f94`, `4b22fd5d`). - Fix unnecessary and confusing reuse of the 'type' variable. src/lib/libast/comp/conf.sh: - trap: Use 'rm -rf' instead of 'rm -f' to remove temp executables; on macOS, they may include *.dSYM directories.	2022-06-18 23:29:49 +01:00
Martijn Dekker	148a8a3f46	Another build system overhaul (re: `35672208`, `580ff616`, `6cc2f6a0`) So far we've been handling AST release build and git commit flags and ksh SHOPT_* compile time options in the generic package build script. That was a hack that was necessary before I had sufficient understanding of the build system. Some of it did not work very well, e.g. the correct git commit did not show up in ${.sh.version} when compiling from a git repo. As of this commit, this is properly included in the mamake dependency tree by handling it from the libast and ksh93 Mamfiles, guaranteeing they are properly up to date. For a release build, the _AST_ksh_release macro is renamed to _AST_release, because some aspects of libast also use this. This commit also adds my first attempt at documenting the (very simple, six-command) mamake language as it is currently implemented -- which is significantly different from Glenn Fowler's original paper. This is mostly based on reading the mamake.c source code. src/cmd/INIT/README-mamake.md: - Added. bin/package, src/cmd/INIT/package.sh: - Delete the hack. /Mamfile: - Remove KSH_RELFLAGS and KSH_SHOPTFLAGS, which supported the hack. - Delete 'meta' commands. They were no-ops; mamake.c ignores them. They also did not add any informative value. src/lib/libast/Mamfile: - Add a 'virtual' target that obtains the current git commit, examines the git branch, and decides whether to auto-set an _AST_git_commit and/or or _AST_release #define to a new releaseflags.h header file. This is overwritten on each run. - Add code to the install target that copies limit.h to include/ast, but only if it doesn't exist or the content of the original changed. This allows '#include <releaseflags.h>' from any program using libast while avoiding needless recompiles. - When there are uncommitted changes, add /MOD (modified) to the commit hash instead of not defining it at all. src/cmd/ksh93/: - Mamfile: Add a shopt.h target that reads SHOPT.sh and converts it into a new shopt.h header file in the object code directory. The shopt.h header only contains SHOPT_* directives that have a value in SHOPT.sh (not the empty/probe ones). They also do not redefine the macros if they already exist, so overriding with something like CCFLAGS+=' -DSHOPT_FOO=1' remains possible. - *.c: Every c file now #includes "shopt.h" first. So SHOPT_ macros are no longer passed via environment/MAM variables. * SHOPT.sh: The AUDITFILE and CMDLIB_DIR macros no longer need an extra backslash-escape for the double quotes in their values. (The old way required this because mamake inserts MAM variables directly into shell scripts as literals without quoting. :-/ ) src/cmd/INIT/mamake.c: - Import the two minor changes between from 93u+ and 93v-: bind() is renamed to bindfile() and there is a tweak to detecting an "improper done statement". - Allow arbitrary whitespace (isspace()) everywhere, instead of spaces only. This obsoletes my earlier indentation workaround from 6cc2f6a0; turns out mamake always supported indentation, but with spaces only. - Do not skip line numbers at the beginning of each line. This undocumented feature is not and (AFAICT) never has been used. - Throw an error on unknown command or rule attribute. Quite an important feature for manual maintenance: catches typos, etc.	2022-06-12 05:47:02 +01:00
Martijn Dekker	3030197b89	Add error message for ambiguous long-form option abbreviation Before: $ set -o hist -ksh: set: hist: bad option(s) After: $ set --hist -ksh: set: hist: ambiguous option In ksh 93u+m, there are three options starting with 'hist', so the abbreviation does not represent a single option. It is useful to communicate this in the error message. In addition, "bad option(s)" was changed to "unknown option", the same message as that given by AST optget(3), for consistency. src/cmd/ksh93/sh/string.c: - Make sh_lookopt() return -1 for an ambiguous option. This is trivial as there is already an 'amb' variable flagging that up. src/cmd/ksh93/sh/args.c: - Use the negative sh_lookopt() return status to decide between "unknown option" and "ambiguous option". src/cmd/ksh93/data/builtins.c: sh_set[]: - Explain the --option and --nooption forms in addition to the -o option and +o option forms. - Document the long options without their 'no' prefixes (e.g. glob instead of noglob) as this simplifies documentation and the relation with the short options makes more sense. Those names are also how they show up in 'set -o' output and it is better for those to match. - Tweaks.	2022-06-10 01:11:46 +01:00
Martijn Dekker	b14e79e9d0	posix: use real pipe(2) instead of socketpair(2) The POSIX standard requires real UNIX pipes as in pipe(2). But on systems supporting it (all modern ones), ksh uses socketpair(2) instead to make it possible for certain commands to peek ahead without consuming input from the pipe, which is not possible with real pipes. See features/poll and sh/io.c. But this can create undesired side effects: applications connected to a pipe may test if they are connected to a pipe, which will fail if they are connected to a socket. Also, on Linux: $ cat /etc/passwd \| head -1 /dev/stdin head: cannot open '/dev/stdin' for reading: No such device or address ...which happens because, unlike most systems, Linux cannot open(2) or openat(2) a socket (a limitation that is allowed by POSIX). Unfortunately at least two things depend on the peekahead capability of the _pipe_socketpair feature. One is the non-blocking behaviour of the -n option of the 'read' built-in: -n Causes at most n bytes to be read instead of a full line, but will return when reading from a slow device as soon as any characters have been read. The other thing that breaks is the <#pattern and <##pattern redirection operators that basically grep standard input, which inherently requires peekahead. Standard UNIX pipes always block on read and it is not possible to peek ahead, so these features inevitably break. Which means we cannot simply use standard pipes without breaking compatibility. But we can at least fix it in the POSIX mode so that cross-platform scripts work more correctly. src/cmd/ksh93/sh/io.c: sh_pipe(): - If _pipe_socketpair is detected at compile time, then use a real pipe via sh_rpipe() if the POSIX mode is active. (If _pipe_socketpair is not detected, a real pipe is always used.) src/cmd/ksh93/data/builtins.c: - sh.1 documents the slow-device behaviour of -n but 'read --man' did not. Add that, making it conditional on _pipe_socketpair. Resolves: https://github.com/ksh93/ksh/issues/327	2022-06-09 16:16:16 +01:00
Martijn Dekker	89cec81b32	Another round of minor tweaks and cleanups Notable changes: - The typeset builtin's usage and error messages for incompatible options used with -f has been corrected to show that -t and -u can be used with -f. - In name.c, get rid of misleaadingly named Null static which is actually the empty string, not the null value. Replace with a new AltEmpty macro that is defined similarly to Empty. This is now also used in nvtype.c (re: de037b6e).	2022-06-09 03:02:06 +01:00
Martijn Dekker	e3aa32a129	Add --functrace shell option (re: `2a835a2d`) A side effect of the bug fixed in `2a835a2d` caused the DEBUG trap action to appear to be inherited by subshells, but in a buggy way that could crash the shell. After the fix, the trap is reset in subshells along with all the others, as it should be. Nonetheless, as that bug was present for years, some have come to rely on it. This commit implements that functionality properly. When the new --functrace option is turned on, DEBUG trap actions are now inherited by subshells as well as ksh function scopes. In addition, since it makes logical sense, the new option also causes the -x/--xtrace option's state to be inherited by ksh function scopes. Note that changes made within the scope do not propagate upwards; this is different from bash. (I've opted against adding a -T short-form equivalent as on bash, because -T was formerly a different option on 93u+ (see 63c55ad7) and on mksh it has yet anohter a different meaning. To minimise confusion, I think it's best to have the long-form name only.) src/cmd/ksh93/include/shell.h, src/cmd/ksh93/data/options.c: - Add new "functrace" (SH_FUNCTRACE) long-form shell option. src/cmd/ksh93/sh/subshell.c: sh_subshell(): - When functrace is on, copy the parent's DEBUG trap action into the virtual subshell scope after resetting the trap actions. src/cmd/ksh93/sh/xec.c: sh_funscope(): - When functrace is on and xtrace is on in the parent scope, turn it on in the child scope. - Same DEBUG trap action handling as in sh_subshell(). Resolves: https://github.com/ksh93/ksh/issues/162	2022-06-04 17:27:27 +01:00
Martijn Dekker	ea300089a1	New feature: 'typeset -g' as in bash 4.2+ typeset -g allows directly manipulating the attributes of variables at the global level from any context. This feature already exists on bash 4.2 and later. mksh (R50+), yash and zsh have this flag as well, but it's slightly different: it ignores the current local scope, but a parent local scope from a calling function may still be used -- whereas on bash, '-g' always refers to the global scope. Since ksh93 uses static scoping (see III.Q28 at <http://kornshell.com/doc/faq.html>), only the bash behaviour makes sense here. Note that the implementation needs to be done both in nv_setlist() (name.c) and in b_typeset() (typeset.c) because assignments are executed before the typeset built-in itself. Hence also the pre-parsing of typeset options in sh_exec(). src/cmd/ksh93/include/nval.h: - Add new NV_GLOBAL bit flag, using a previously unused bit that still falls within the 32-bit integer range. src/cmd/ksh93/sh/xec.c: sh_exec(): - When pre-parsing typeset flags, make -g pass the NV_GLOBAL flag to the nv_setlist() call that processes shell assignments prior to running the command. src/cmd/ksh93/sh/name.c: nv_setlist(): - When the NV_GLOBAL bit flag is passed, save the current variable tree pointer (sh.var_tree) as well as the current namespace (sh.namespace) and temporarily set the former to the global variable tree (sh.var_base) and the latter to NULL. This makes assignments global and ignores namesapces. src/cmd/ksh93/bltins/typeset.c: - b_typeset(): - Use NV_GLOBAL bit flag for -g. - Allow combining -n with -g, permitting 'typeset -gn var' or 'nameref -g var' to create a global nameref from a function. - Do not allow a nonsensical use of -g when using nested typeset to define member variables of 'typeset -T' types. (Type method functions can still use it as normal.) - setall(): - If NV_GLOBAL is passed, use sh.var_base and deactivate sh.namespace as in nv_setlist(). This allows attributes to be set correctly for global variables. src/cmd/ksh93/tests/{functions,namespace}.sh: - Add regression tests based on reproducers for problems found by @hyenias in preliminary versions of this feature. Resolves: https://github.com/ksh93/ksh/issues/479	2022-06-01 21:07:01 +01:00
Martijn Dekker	fd28da31da	Fix another test/[ corner case bug; add --posix test script This fixes another corner case bug in the horror show that is the test/[ comand. Reproducer: $ ksh --posix -c 'test X -a -n' ksh: test: argument expected Every other shell returns 0 (success) as, POSIXly, this is a test for the strings 'X' and '-n' both being non-empty, combined with the binary -a (logical and) operator. Instead, '-n' was taken as a unary primary operator with a missing argument, which is incorrect. POSIX reference: https://pubs.opengroup.org/onlinepubs/9699919799/utilities/test.html > 3 arguments: > * If $2 is a binary primary, perform the binary test of $1 and $3. src/cmd/ksh93/bltins/test.c: - e3(): If the final argument begins with a dash, always treat it as a test for a non-empty string, therefore return true. Do not limit this to "new flags" only. src/cmd/ksh93/tests/posix.sh: - Added. These are tests for every aspect of the POSIX mode.	2022-03-11 21:23:45 +01:00
Martijn Dekker	b398f33c49	Another round of accumulated minor fixes and cleanups Only notable changes listed below. */Mamfile: - Do not bother redirecting standard error for 'cmp -s' to /dev/null. Normally, 'cmp -s' on Linux, macOS, BSD, or Solaris do not not print any message. If it does, something unusual is going on and I would want to see the message. - Since we now require a POSIX shell, we can use '!'. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/sh/init.c: - Remove SH_TYPE_PROFILE symbol, unused after the removal of the SHOPT_PFSH code. (re: eabd6453) src/cmd/ksh93/sh/io.c: - piperead(), slowread(): Replace redundant sffileno() calls by the variables already containing their results. (re: 50d342e4) src/cmd/ksh93/bltins/mkservice.c, rc/lib/libcmd/vmstate.c: - If these aren't compiled, define a stub function to silence the ranlib(1) warning that the .o file does not contain symbols.	2022-03-11 21:20:32 +01:00
Martijn Dekker	11177d448d	Fix crash on cd in subshell with PWD unset (re: `5ee290c`) Reproducer: $ ksh -c 'unset PWD; (cd /); :' Memory fault The shell crashes because b_cd() is testing the value of the PWD variable without checking if there is one. src/cmd/ksh93/sh/path.c: path_pwd(): - Never return an unfreeable pointer to e_dot; always return a freeable pointer. This fixes another corner-case crashing bug. - Make sure the PWD variable gets assigned a value if it doesn't have one, even if it's the "." fallback. However, if the PWD is inaccessible but we did inherit a $PWD value that starts with a /, then use the existing $PWD value as this will help the shell fail gracefully. src/cmd/ksh93/bltins/cd_pwd.c: - b_cd(): When checking if the PWD is valid, use the sh.pwd copy instead of the PWD variable. This fixes the crash above. - b_cd(): Since path_pwd() now always returns a freeable value, free sh.pwd unconditionally before setting the new value. - b_pwd(): Not only check that path_pwd() returns a value starting with a slash, but also verify it with test_inode() and error out if it's wrong. This makes the 'pwd' command useful for checking that the PWD is currently accessible. src/cmd/ksh93/data/msg.c: - Change e_pwd error message for accuracy and clarity.	2022-02-17 19:45:37 +00:00
Martijn Dekker	d55e9686d7	Backport 'read -a' and 'read -p' from ksh 93v-/2020 This backports two minor additions to the 'read' built-in from ksh 93v-: '-a' is now the same as '-A' and '-u p' is the same as '-p'. This is for compatibility with some 93v- or ksh2020 scripts. Note that their change to the '-p' option to support both prompts and reading from the coprocess was not backported because we found it to be broken and unfixable. Discussoin at: https://github.com/ksh93/ksh/issues/463 src/cmd/ksh93/bltins/read.c: b_read(): - Backport as described above. - Rename the misleadingly named 'name' variable to 'prompt'. It points to the prompt string, not to a variable name. src/cmd/ksh93/data/builtins.c: sh_optpwd[]: - Add -a as an alterative to -A. All that is needed is adding '\|a' and optget(3) will automatically convert it to 'A'. - Change -u from a '#' (numeric) to ':' option to support 'p'. Note that b_read() now needs a corresponding strtol() to convert file descriptor strings to numbers where applicable. - Tweaks. src/cmd/ksh93/sh.1: - Update accordingly. - Tidy up the unreadable mess that was the 'read' documentation. The options are now shown in a list.	2022-02-17 19:44:54 +00:00
Martijn Dekker	14a43a0a88	Yet more misc. cleanups; rm SHOPT_PFSH, SHOPT_TYPEDEF Notable changes: - Remove SHOPT_PFSH compile-time option and associated code. This was meant to work with Solaris rights profiles, see: https://docs.oracle.com/cd/E23824_01/html/821-1461/profiles-1.html#REFMAN1profiles-1 But it has been obsolete for years as Solaris stopped using it in its shipped ksh several OS versions ago, preferring a library-based wrapper around ksh and other shells. Nonetheless I experimented with the option on Solaris 11.4. Result: no external command will run; output of unitialised memory in error message. So it's already fallen victim to bit rot. There's nothing interesting here, so just get rid. - Remove SHOPT_TYPEDEF compile-time option (but keep the code!). Turning it off caused the build to fail. It may be possible to fix it, but the type definition code is integral to ksh now (e.g. 'enum' depends on much of it) so it makes no sense to disable it. This was removed in the ksh 93v- beta version as well. - Remove nv_close() calls and remove nv_close() documentation from the nval.3 man page. This function is a dummy, present without any changes since the beginning of the ast-open-archive repo in 1995. The comment was: "Currently this is a dummy, but someday will be needed for reference counting". 27 or more years later, it's time to admit it's never going to happen. (And of course, nv_close() calls were not being used with anything resembling consistency.) - Add a null nv_close() macro to nval.h for compatibility with third party code that follows the old documentation. - Add a few missing regression tests.	2022-02-10 21:04:56 +00:00
Martijn Dekker	493a31053e	Do not export variables with dot names (re: `8e72608c`) Variables with a dot in their name, such as those declared in namespace { ... } blocks, are usually stored in a separate tree with their actual names not containing any dots. But under some circumstances, including at least direct assignment of a non-preexisting dot variable, dot variables are stored in the main sh.var_tree with names actually containing dots. With allexport active, those could end up exported to the environment. This bug was also present in previous release versions of ksh. src/cmd/ksh93/sh/name.c: pushnam(): - Check for a dot in the name before pushing a variable to export.	2022-02-05 15:08:50 +00:00
Johnothan King	8e72608c1c	Export all variables assigned to while allexport is on (#431 ) All variables that are assigned a value should be exported while the allexport shell option is enabled. This works in most cases, but variables assigned to with ${var:=foo} or $((var=123)) aren't exported while allexport is on. src/cmd/ksh93/sh/name.c: - nv_putval(): This is the central assignment function; all forms of variable assignment end up here. So this is the best place to check for SH_ALLEXPORT and turn on the export attribute. - nv_setlist(): Remove allexport handling, now redundant. src/cmd/ksh93/bltins/read.c: sh_readline(): - Remove allexport handling, now redundant. src/cmd/ksh93/sh/main.c: sh_main(): - nv_putval() is used to initialize PS4 and IFS using nv_putval(); this is after an -a/--allexport specified on the ksh command line has been processed, so temporarily turn that off. Co-authored-by: Martijn Dekker <martijn@inlv.org>	2022-02-05 13:52:28 +00:00
Martijn Dekker	9c313c7fe3	Update copyright years in files changed since 1st Jan 2022	2022-01-30 20:49:04 +00:00
Johnothan King	ad9f9ff13e	Accumulated fixes for minor issues (#442 ) This commit applies various accumulated bugfixes: - Applied some fixes for compiler warnings based off of the following pull requests (with whitespace changes excluded to avoid inflating the size of the diff): https://github.com/att/ast/pull/281 https://github.com/att/ast/pull/283 https://github.com/att/ast/pull/303 https://github.com/att/ast/pull/304 - clone_type(): Two separate variables in this function share the same name. A bugfix from ksh93v- 2013-05-24 was backported to avoid conflict issues. - Backported a minor error message improvement from ksh2020 for when the user attempts to run too many coprocesses. - Backported a ksh2020 bugfix for a file descriptor leak: https://github.com/att/ast/commit/58bc8b56 - Backported ksh2020 bugfixes for unused variable and pointless assignment lint warnings: https://github.com/att/ast/commit/47650fe0 https://github.com/att/ast/commit/df209c0d https://github.com/att/ast/commit/5e417b00 - Applied a few minor improvements to libast from graphviz: https://gitlab.com/graphviz/graphviz/-/commit/e7c03541 https://gitlab.com/graphviz/graphviz/-/commit/969a7cde - dtmemory(): Mark unused parameters with NOT_USED(). Based on: https://gitlab.com/graphviz/graphviz/-/commit/6ac3ad99 - Applied a few documentation/comment tweaks for the NEWS file, printf -v and spawnveg. - Added a missing regression test for using the rm builtin's -f option without additional arguments (this was fixed in ksh93u+ 2012-02-14).	2022-01-30 20:42:59 +00:00
Martijn Dekker	304648d0c5	Another round of accumulated tweaks and cleanups Notable changes: src/cmd/ksh93/.c: - Get rid of all the dtuserdata(FOO,&sh,1) calls backported in `cc492752`. These set pointers to sh in Cdt objects. As of `b590a9f1`, the code does not use any pointers to sh, so these are superfluous. src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - As of ksh 93l 2001-06-01, the -h/trackall option has no effect at all, so trim its documentation. src/lib/libast/man/stk.3, src/lib/libast/man/stak.3: - Correct the documentation on what the ST(A)K_SMALL option bit actually does based on a reading of the code. - Document the STK_NULL option bit. README.md, src/cmd/ksh93/README: - Add a note that -fdiagnostics-color=always will break the build. Ref.: https://github.com/ksh93/ksh/issues/379 src/lib/libast/Mamfile: - Remove a 'rm -f astmath' command -- a file that is never created. But on Cygwin this removes astmath.exe, which is* used. As a result, executing it failed on Cygwin, so the system incorrectly detected that Cygwin needs -lm for math functions.	2022-01-28 21:12:31 +00:00
Martijn Dekker	172becffea	Some more accumulated minor tweaks and cleanups Notable changes: src/cmd/ksh93/include/fault.h: - Get rid of the superflous sh pointer argument in the sh_pushcontext() and sh_popcontext() macros. (re: `2d3ec8b6`) src/cmd/ksh93/tests/io.sh: - Tweak a process substitution test to allow up to a second for unused process substitution processes to disappear. On the Alpine Linux console (at least the musl libc version), this is needed to avoid a test failure as long as no GUI is active. As soon as you start X11, this phenomenon disappears, even on the console. Very strange, but also very probably not ksh's fault. src/cmd/ksh93/tests/shtests: - Instead of just SIGCONT and SIGPIPE, set all signals to default, just to be sure to avoid spurious test failures due to signals that were ignored on entry. (It made no difference to the aforementioned Alpine Linux test failure, so ignored signals had nothing to do with that -- but still a good idea.) .github/workflows/ci.yml: - On the GitHub CI runs, when testing with SHOPTs disabled, disable SHOPT_SPAWN as well, which tests that everything still works correctly with the regular fork(2) method. COPYRIGHT: - Remove duplicate of BSD license.	2022-01-25 16:13:15 +00:00
Martijn Dekker	cda661d34c	Various cleanups, mostly in regression tests src/cmd/ksh93/data/variables.sh: shtab_variables[]: - Remove unused "CSWIDTH" entry. All use of it (including the matching CSWIDTHNOD macro) was removed in version 2003-04-22. src/cmd/ksh93/tests/variables.sh: - For the tests on the shtab_variables[] variables, read the variable names straight from variables.c instead of synching the list in the test script, which would surely be forgotten. src/cmd/ksh93/tests/*.sh: - Fix a number of mistaken tries to count errors from a subshell. - Fix miscellaneous minor breakage and typos.	2022-01-24 02:58:25 +00:00
Martijn Dekker	41829efa06	Various minor cleanups and fixes The more notable ones are: src/lib/libast/features/standards: - Do not redefine _GNU_SOURCE and _FILE_OFFSET_BITS if already defined from $CCFLAGS. Thanks to @hyanias for the heads-up. (re: `289dd46c`) src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/include/shell.h, src/cmd/ksh93/sh/args.c, src/cmd/ksh93/sh/name.c: - Remove -T test code activation option. It was basically unused. The only thing it did was intentionally introduce a memory leak in table_unset() if the 4th bit in the option argument was set. A search in ast-open-history reveals a few more trivial test uses that were later deleted, but nothing interesting. src/cmd/ksh93/tests/{basic,path}.sh: - Skip a couple of tests on AIX avoid hangs, at least one of which is not ksh's fault. Thanks to @HansH111 for the report. src/cmd/ksh93/tests/builtins.sh: - Change one awk use to a more portable sed invocation to placate systems with ancient awk commands, such as AIX. (re: `de795e1f`)	2022-01-20 00:54:42 +00:00
Martijn Dekker	b509e92241	edit: do not enable multiline mode with no editor active If neither gmacs/emacs nor vi are active, the multiline mode should not be enabled even if the multiline option is on. Doing so can cause inconsistent behaviour, particularly in multibyte locales where, if the shell is compiled with SHOPT_RAWONLY (as is default), the no-editor mode is actually handled by vi.c. Also, the new --histreedit and --histverify options only work in the emacs or vi editors, or in no-editor mode when handled by vi. Which means they cannot ever work if neither emacs or vi were compiled in (i.e. SHOPT_ESH and SHOPT_VSH were both disabled). In that case, there's no point in compiling in those options. Come to think of it, the same applies to the multiline option. All changed files: - Update SHOPT_ESH/SHOPT_VSH preprocessor directives as per above. src/cmd/ksh93/include/defs.h, src/cmd/ksh93/include/shell.h: - Move definitions of history expansion-related options to shell.h, which is where all the other shell options are defined.	2022-01-12 20:39:05 +00:00
Martijn Dekker	a5700d3937	Expose --histreedit and --histverify options (re: `921bbcae`) This adds two long-form shell options that modify history expansion (-H/--histexpand). If --histreedit is on and a history expansion fails, the command line is reloaded into the next prompt's edit buffer, allowing corrections. If --histverify is on, the results of a history expansion are not immediately executed but instead loaded into the next prompt's edit buffer, allowing further changes. SH_HISTREEDIT and SH_HISTVERIFY were already handled all along in slowread() in io.c and via 'reedit' arguments to functions called there, but could not be turned on as they were only ever exposed as shopt options in the removed bash compatibility mode (in theory only, as it failed to compile). I had overlooked them until now. So, since the code is there, let's expose these options through the normal long options interface. They're working fine, and activating them actually makes history expansion tolerable to use. src/cmd/ksh93/data/options.c: - Make these options available as "histreedit" and "histverify". src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - Document the "new" options. src/cmd/ksh93/include/defs.h: - Remove unused SH_HISTAPPEND and SH_HISTORY2 macros which were part of the removed bash compatibility code. Note that ksh does not need a histappend option as it never overwrites the history file (in the bash mode, this shopt option was a no-op).	2022-01-12 20:38:30 +00:00
Johnothan King	ca5803419b	Fix various typos, man page issues and improve the documentation (#415 ) This commit makes various different improvements to the documentation: - sh.1: Backported (with changes) mandoc warning fixes from ksh2020 for the ksh93(1) man page: <https://github.com/att/ast/pull/1406> - Removed unnecessary spaces at the end of lines to fix a few other mandoc warnings. - Fixed various typos and capitalization errors in the documentation. - ANNOUNCE: Document the addition of the ${.sh.pid} variable (re: `9de65210`). - libast/man/str: Update the man pages for the libast str functions to improve how accurately each function is described. - ksh93/README: Update regression test/compatibility notes to include OpenBSD 7.0, FreeBSD 13.0 and WSL running Ubuntu 20.04. - Change a few places to store the return value from strlen in a size_t variable rather than signed int. - comp/setlocale.c: To avoid confusion of two separate variables named lang, the function local variable has been renamed to langidx.	2022-01-07 16:17:55 +00:00
Johnothan King	d347ec0fc9	Allow ksh to compile on Haiku; implement SIGKILLTHR support (#408 ) This commit implements the build fixes required to get ksh running on Haiku. Note that while ksh does compile, it has a ton of regression test failures on Haiku. src/cmd/ksh93/data/signals.c, src/lib/libast/features/signal.c: - Add support for the SIGKILLTHR signal, which is supported by BeOS and Haiku. - SIGINFO was missing an entry in the libast feature test, so add one (re: `658bba74`). src/cmd/ksh93/RELEASE: - Add an entry noting that ksh now compiles on Haiku, albeit with many regression test failures. src/cmd/ksh93/{include/terminal.h,sh/path.c}: - Silence compiler warnings on Haiku. src/lib/libast/features/mmap: - The mmap feature test freezes on Haiku, so modify the test to fail immediately on that OS. src/lib/libast/misc/signal.c: - Avoid redefining the signal definition on Haiku to fix a compiler error. src/lib/libast/features/nl_types: - For some reason the nl_item typedef on Haiku doesn't work correctly. Work around that by creating the nl_item type in the libast nl_types feature test.	2022-01-07 16:16:42 +00:00
Johnothan King	8f9d1bec97	Add three options to 'ulimit' (#406 ) This patch adds a few extra options to the ulimit command (if the OS supports them). These options are also present in Bash, although in ksh additional long forms of each option are available: ulimit -k/--kqueues This is the maximum number of kqueues. ulimit -P/--npts This is the maximum number of pseudo-terminals. ulimit -R/--rttime This is the time a real-time process can run before blocking, in microseconds. When the limit is exceeded, the process is sent SIGXCPU. Other changes: - bltins/ulimit.c: Change the formatting from sfprintf and increase the size of the tmp buffer to prevent text from being cut off in ulimit -a (this was required to add ulimit -R). - data/limits.c: Add support for using microseconds as a unit.	2021-12-28 22:02:20 +00:00
Johnothan King	0e197eee57	Fix `mkservice` compile errors and add `SHOPT_MKSERVICE` (#401 ) The unused mkservice and eloop builtins are currently not built, and if an attempt to compile them is made the build ends in failure. This commit backports a few build fixes from ksh93v- 2012-08-24 that allow mkservice and eloop to build (plus an additional compiler warning fix not in ksh93v-). I've also added a new SHOPT_MKSERVICE setting (turned off by default) so that mkservice and eloop can be built if the user chooses to include them in their build of ksh.	2021-12-28 17:51:11 +00:00
Johnothan King	24174f0fb7	Backport -P and -t flags for 'type'/'whence' from ksh93v- (#392 ) This commit backports the whence '-t' option from ksh93v-. The '-t' option is useful when one needs to identify the type of a command. The '-t' flag was added by ksh93v- for compatibility with Bash. It should be noted the ksh93v- patch had one bug, which this commit fixes. Path-bound builtins from /opt/ast/bin were classified as files if loaded from /opt/ast/bin in the PATH. Reproducer: $ PATH=/opt/ast/bin whence -t cat file src/cmd/ksh93/bltins/whence.c: - Simplify the bitmask values for the command and whence builtin flags. - Add the -t flag to the whence and type builtins. To prevent bugs, -t will always override -v if both of those flags were passed. src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - Add documentation for the new -t option.	2021-12-27 06:40:02 +00:00
Johnothan King	f7213f03a2	Fix multiple bugs when using 'alias -p' to print aliases (#398 ) This commit was originally intended to fix just one bug with shcomp's handling of 'alias -p', but while fixing that I found a large number of related issues in the alias command's -p, -t and -x options. The current patch provides bugfixes for all of the bugs listed below: 1) Listing aliases in a script with 'alias -p' or 'alias' broke shcomp's bytecode output: https://github.com/ksh93/ksh/issues/87#issuecomment-813819122 2) Listing individual aliases with the -p option doesn't work: $ alias foo=bar bar=foo $ alias foo foo=bar $ alias -p foo # No output 3) Listing specific tracked aliases with -pt does not display them in a reusable format, but rather adds another tracked alias: $ hash -r cat vi $ alias -pt vi # No output $ alias -pt rm $ alias -t cat=/usr/bin/cat rm=/usr/bin/rm vi=/usr/bin/vi 4) Listing all tracked aliases with -pt does not output them in a reusable format (the resulting command printed only creates a normal alias, which is different from a tracked alias): $ hash -r cat $ alias -pt alias cat=/usr/bin/cat # Expected 'alias -t cat' 5) Listing a non-existent alias with -p doesn't cause an error: $ unalias -a $ alias -p notanalias # No output $ echo $? 0 $ alias notanalias notanalias: alias not found $ echo $? 1 $ hash -r $ alias -pt notacommand # No output $ echo $? 0 6) Attempting to list 256 non-existent aliases results in exit status zero: $ unalias -a $ alias $(awk -v ORS= 'BEGIN { for(i=0;i<256;i++) print "x "; }') x: alias not found --cut error message-- $ echo $? 0 Changes: - typeset.c: Avoid printing anything while shcomp is compiling a script. This is needed because the alias command is run by shcomp to prevent parsing issues. - b_alias(): To avoid adding tracked aliases with -pt, set tdata.aflag to '+' so that setall() and other related functions only list tracked aliases. - b_alias(): Set tdata.pflag to 1 so that setall() and other functions recognize -p was passed. - print_value(): Add support for listing specific aliases with 'alias -p'. - setall(): To avoid any issues with zombie tracked aliases (see also the regression tests) ignore tracked alias nodes marked with the NV_NOALIAS attribute. This bit is set for tracked alias nodes by the nv_rehash() function. - setall(): For backward compatibility, continue incrementing the exit status for each invalid alias and tracked alias passed. This was already how alias behaved when listing aliases without -p, so using -p shouldn't cause a change in behavior: $ unalias -a $ alias foo bar foo: alias not found bar: alias not found $ echo $? 2 To fix bug 6, the exit status is set to one if an enforced 8-bit exit status would be zero. - print_namval(): Set the prefix to 'alias -t' so that listing tracked aliases with 'alias -pt' works correctly. - data/msg.c and include/name.h: Add an error message for when 'alias -pt' doesn't find a tracked alias. - tests/alias.sh: Add a ton of regression tests for the bugs fixed in this commit.	2021-12-27 03:49:06 +00:00
Martijn Dekker	a1f5c99204	INIT: remove proto, ratz (re: `46593a89`, `6137b99a`); major cleanup This takes another step towards cleaning up the build system. We now do not even pretend to be theoretically compatible with pre-1989 K&R C compilers or with C++ compilers. In practice, this had already been broken for many years due to bit rot. Commit `46593a89` already removed the license handling enormity that depended on proto, so now we can cleanly remove it altogether. But we do need to leave some backwards compatibility stubs to keep the build system compatible with older AST code; it should remain possible to build older ksh versions with the current build system (the bin/ and src/cmd/INIT/ directories) for testing purposes. So as of now there is no more __MANGLE__d rubbish in your generated header files. This is only about a quarter of a century overdue... This commit also includes a huge amount of code cleanup to remove thousands of unused K&R C fallbacks and other cruft, particularly in libast. This code base should now be a little easier to understand for people who are familiar with a modern(ish) C standard. ratz is now also removed; this was a standalone and simplified 2005 version of gunzip. As of `6137b99a`, none of our code uses it, even theoretically. And the real g(un)zip is now everywhere. src/cmd/INIT/proto.c, src/cmd/INIT/ratz.c: - Removed. COPYRIGHT: - Remove zlib license; this only applied to ratz. bin/package, src/cmd/INIT/package.sh: - Related cleanups. - Unset LC_ALL before invoking a new shell, respecting the user's locale again and avoiding multibyte character corruption on the command line. src/cmd/INIT/proto.sh: - Add stub for backwards compatibility with Mamfiles that depend on proto. It does nothing but pass input without modification and is now installed as the new arch//bin/proto by src/cmd/INIT/Mamfile. src/cmd/INIT/iffe.sh: - Ignore the proto-related -e (--package) and -p (--prototyped) options; keep parsing them for backwards compatibility. - Trim the macros passed to every test to their standard C versions, removing K&R C and C++ versions. These are now considered to be for backwards compatibility only. src/cmd/INIT/iffe.tst: - Remove proto(1) mangling code. By the way, iffe can be regression-tested as follows: $ bin/package use # set up environment in a child shell $ regress src/cmd/INIT/iffe.tst $ exit # leave package environment src/cmd/INIT/make.probe, src/cmd/INIT/probe.win32: - Remove code to handle C++. src/lib/libast/features/common: - As in iffe.sh above, trim macros designed for compatibility with C++ and ancient C compilers to their standard C versions and comment that they are for backwards compatibility with AST code. This is needed to keep all the old ast and ksh code compiling. src/cmd/ksh93/sh/init.c, src/cmd/ksh93/sh/name.c: - Clarify libshell ABI compatibility function versions of macros. A "proto workaround" comment in the original code mislead me into thinking this had something to do with the removed proto(1), but it's unrelated. Call the workaround macro BYPASS_MACRO instead. src/cmd/ksh93/include/defs.h: - sh_sigcheck() macro: allow &sh as an argument: parenthesise shp. src/cmd/ksh93/sh/nvtype.c: - Remove unused nv_mkstruct() function. (re: `d0a5cab1`) /features/: - Remove obsolete iffe 'set prototyped' option. /Mamfile: - Remove all references to the ast/prototyped.h header. - Remove all use of the proto command. Simply copy instead. * 850-ish source files: *** - Remove all '#pragma prototyped' directives. - Remove all C++ compat code conditional upon defined(__cplusplus). - Remove all use of the _ARG_ macro, which on standard C expands to its argument: #define _ARG_(x) x (on K&R C, it expanded to nothing) - Remove all use of _BEGIN_EXTERNS_ and _END_EXTERNS_ macros (empty on standard C; this was for C++ compatibility) - Reduce all #if __STD_C (standard code) #else (K&R code) #endif blocks to the standard code only, without use of the macro. - Same for _STD_ macro which seems to have had the same function. - Change all instances of 'Void_t' to standard 'void'.	2021-12-24 07:05:22 +00:00
Martijn Dekker	e67df29c07	Re-fix defining types conditionally or in subshells (re: `f508660d`) New version. I'm pretty sure the problems that forced me to revert it earlier are fixed. This commit mitigates the effects of the hack explained in the referenced commit so that dummy built-in command nodes added by the parser for declaration/assignment purposes do not leak out into the execution level, except in a relatively harmless corner case. Something like if false; then typeset -T Foo_t=(integer -i bar) fi will no longer leave a broken dummy Foo_t declaration command. The same applies to declaration commands created with enum. The corner case remaining is: $ ksh -c 'false && enum E_t=(a b c); E_t -a x=(b b a c)' ksh: E_t: not found Since the 'enum' command is not executed, this should have thrown a syntax error on the 'E_t -a' declaration: ksh: syntax error at line 1: `(' unexpected This is because the -c script is parsed entirely before being executed, so E_t is recognised as a declaration built-in at parse time. However, the 'not found' error shows that it was successfully eliminated at execution time, so the inconsistent state will no longer persist. This fix now allows another fix to be effective as well: since built-ins do not know about virtual subshells, fork a virtual subshell into a real subshell before adding any built-ins. src/cmd/ksh93/sh/parse.c: - Add a pair of functions, dcl_hactivate() and dcl_dehacktivate(), that (de)activate an internal declaration built-ins tree into which check_typedef() can pre-add dummy type declaration command nodes. A viewpath from the main built-ins tree to this internal tree is added, unifying the two for search purposes and causing new nodes to be added to the internal tree. When parsing is done, we close that viewpath. This hides those pre-added nodes at execution time. Since the parser is sometimes called recursively (e.g. for command substitutions), keep track of this and only activate and deactivate at the first level. (Fixed compared to previous version of this commit: calling dcl_dehacktivate() when the recursion level is already zero is now a harmless no-op. Since this only occurs in error handling conditions, who cares.) - We also need to catch errors. This is done by setting libast's error_info.exit variable to a dcl_exit() function that tidies up and then passes control to the original (usually sh_exit()). (Fixed compared to previous version of this commit: dcl_exit() immediately deactivates the hack, no matter the recursion level, and restores the regular sh_exit(). This is the right thing to do when we're in the process of erroring out.) - sh_cmd(): This is the most central function in the parser. You'd think it was sh_parse(), but $(modern)-form command substitutions use sh_dolparen() instead. Both call sh_cmd(). So let's simply add a dcl_hacktivate() call at the beginning and a dcl_deactivate() call at the end. - assign(): This function calls path_search(), which among many other things executes an FPATH search, which may execute arbitrary code at parse time (!!!). So, regardless of recursion level, forcibly dehacktivate() to avoid those ugly parser side effects returning in that context. src/cmd/ksh93/bltins/enum.c: b_enum(): - Fork a virtual subshell before adding a built-in. src/cmd/ksh93/sh/xec.c: sh_exec(): - Fork a virtual subshell when detecting typeset's -T option. Improves fix to https://github.com/ksh93/ksh/issues/256	2021-12-17 01:28:28 +01:00
Martijn Dekker	7cdb01f625	New selection of default libcmd /opt/ast/bin built-ins Note that this is only about the /opt/ast/bin built-in commands, not about the regular pathless builtins such as printf. To use these, either add /opt/ast/bin to your $PATH or use a command like 'builtin cp'. As usual, --man provides info. Removed as defaults for lack of convincing advantages over the OS's external commands: - chmod, cmp, head, logname, mkdir, sync, uname, wc Remain as useful defaults: - basename, cat, cut, dirname. These are commonly used in performance-sensitive code paths in scripts and having them as built-ins can be good for performance. - getconf: This is the only interface to some libast internals that is available to ksh. It's also has better functionality than most OS-shipped 'getconf' commands, e.g., it can list and query all the configuration values. Added as defaults: - cp, ln, mv: Having these built in can speed up scripts that manage files. Also the AST versions have extended functionality (see cp --man, etc.). - mktemp: External mktemp commands vary too widely and are incompatible, but it's important that scripts can securely make temporary files, so it's good to ship a known interface to this functionality. As a result, the statically linked ksh binary is very slightly smaller than before. Resolves: https://github.com/ksh93/ksh/issues/349	2021-12-16 09:09:37 +01:00
Johnothan King	e54001d58b	Various minor capitalization and typo fixes (#371 ) This commit fixes various minor typos, punctuation errors and corrects the capitalization of many names.	2021-12-13 01:49:42 +01:00
Johnothan King	cd562b16e2	Port more shell lint improvements from illumos and ksh93v- (#374 ) This commit adds onto <https://github.com/ksh93/ksh/pull/353> by porting over two additional improvements to the shell linter: 1) The changes in the aforementioned pull request were merged into illumos-gate with an additional change.[] The illumos revision of the patch improved the warning for (( $foo = $? )) to specify '$foo' causes the warning.[*] Example: $ ksh -n -c '(( $? != $bar ))' ksh: warning: line 1: in '(( $? != $bar ))', using '$' as in '$bar' is slower and can introduce rounding errors While I was porting the illumos patch I did notice one problem. The string it uses from paramsub() skips over the initial '{' in '${var}', resulting in the warning printing '$var}' instead: $ ksh -n -c '(( ${.sh.pid} != $$ ))' ... in '(( ${.sh.pid} != $$ ))', using '$' as in '$.sh.pid}' is slower ... This was fixed by including the missing '{' in the string returned by paramsub for ${var} variables. 2) In ksh93v-, parsing x=$((expr)) with the shell linter will cause ksh to warn the user x=$((expr)) is slower than ((x=expr)). This improvement has been backported with a modified warning: # Result from this commit $ ksh -n -c 'x=$((1 + 2))' ksh: warning: line 1: x=$((1 + 2)) is slower than ((x=1 + 2)) # Result from ksh93v- $ ksh93v -n -c 'x=$((1 + 2))' ksh93v: warning: line 1: ((x=1 + 2)) is more efficient than x=$((1 + 2)) Minor note: the ksh93v- patch had an invalid use of memcmp; this version of the patch uses strncmp instead. References: `be548e87bc` https://code.illumos.org/c/illumos-gate/+/1834/comment/65722363_22fdf8e7/	2021-12-13 01:49:37 +01:00
Martijn Dekker	350e52877b	Revert "[1.0 release prep] Remove tilde expansion discipline" This reverts `c0334e32`, thereby restoring `936a1939`. After the fixes in `0a343244` and `a2bc49be`, the tilde expansion disciplines work nicely, so they can come back to the 1.0 branch.	2021-12-09 07:31:37 +01:00
Martijn Dekker	aa3048880b	cleanup: get rid of KSHELL and _BLD_shell preprocessor macros Once upon a time it might have been possible to build certain parts of ksh, such as the emacs and vi editors and possibly even the name/value library (nval(3)) as independent libraries. But given the depressing amount of bit rot in the code that we inherited, I am certain that disabling either of these macros had been resulting in a broken build for many years before AT&T abandoned this code base. These are certainly not going to be useful now. Meanwhile the KSHELL macro got in the way of me today, because the Mamfile did not define it for all the .c files, but some headers declared some functionality conditionally upon that macro. So including <io.h> in, e.g., nvdisc.c did not declare the same functions as including that header in files with KSHELL defined. This inconsistency is now gone as well, for various files. I'm currently working on making it possible once again to build libshell as a dynamic library; that should be good enough. And that never involved disabling either of these macros.	2021-12-09 06:43:22 +01:00
Johnothan King	beccb93fd4	Fix various compiler warnings and minor issues (#362 ) List of changes: - Fixed some -Wuninitialized warnings and removed some unused variables. - Removed the unused extern for B_login (re: `d8eba9d1`). - The libcmd builtins and the vmalloc memfatal function now handle memory errors with 'ERROR_SYSTEM\|ERROR_PANIC' for consistency with how ksh itself handles out of memory errors. - Added usage of UNREACHABLE() where it was missing from error handling. - Extend many variables from short to int to prevent overflows (most variables involve file descriptors). - Backported a ksh2020 patch to fix unused value Coverity issues (https://github.com/att/ast/pull/740). - Note in src/cmd/ksh93/README that ksh compiles with Cygwin on Windows 10 and Windows 11, albeit with many test failures. - Add comments to detail some sections of code. Extensive list of commits related to this change: `ca2443b5`, `7e7f1372`, `2db9953a`, `7003aba4`, `6f50ff64`, `b1a41311`, `222515bf`, `a0dcdeea`, `0aa9e03f`, `61437b27`, `352e68da`, `88e8fa67`, `bc8b36fa`, `6e515f1d`, `017d088c`, `035a4cb3`, `588a1ff7`, `6d63b57d`, `a2f13c19`, `794d1c86`, `ab98ec65`, `1026006d` - Removed a lot of dead ifdef code. - edit/emacs.c: Hide an assignment to avoid a -Wunused warning. (See also https://github.com/att/ast/pull/753, which removed the assignment because ksh2020 removed the !SHOPT_MULTIBYTE code.) - sh/nvdisc.c: The sh_newof macro cannot return a null pointer because it will instead cause the shell to exit if memory cannot be allocated. That makes the if statement here a no-op, so remove it. - sh/xec.c: Fixed one unused variable warning in sh_funscope(). - sh/xec.c: Remove a fallthrough comment added in commit `ed478ab7` because the TFORK code doesn't fall through (GCC also produces no -Wimplicit-fallthrough warning here). - data/builtins.c: The cd and pwd man pages state that these builtins default to -P if PATH_RESOLVE is 'physical', which isn't accurate: $ /opt/ast/bin/getconf PATH_RESOLVE physical $ mkdir /tmp/dir; ln -s /tmp/dir /tmp/sym $ cd /tmp/sym $ pwd /tmp/sym $ cd -P /tmp/sym $ pwd /tmp/dir The behavior described by these man pages isn't specified in the ksh man page or by POSIX, so to avoid changing these builtin's behavior the inaccurate PATH_RESOLVE information has been removed. - Mamfiles: Preserve multi-line errors by quoting the $x variable. This fix was backported from 93v-. (See also <https://github.com/lkujaw/ast/commit/a7e9cc82>.) - sh/subshell.c: Remove set but not used sp->errcontext variable.	2021-12-09 06:42:59 +01:00
Martijn Dekker	b3050769ea	Fix 'return' emitting signals; allow arbitrary return values When a global EXIT trap is set, and a ksh-style function exits with a status > 256 that could have been the result of a signal, then the shell incorrectly issues that signal to itself. Depending on the signal, this causes ksh to terminate itself ungracefully: $ cat /tmp/exit267 trap 'echo OK' EXIT # This trap triggers the crash function foo { return 267; } foo $ bash /tmp/exit267 OK $ ksh-3aee10d7 /tmp/exit267 OK $ ksh /tmp/exit267 Memory fault(coredump) On most systems, status 267 corresponds to SIGSEGV. The reported memory fault is not real; it results from ksh incorrectly killing itself with that signal. The problem is caused by two factors: 1. As of 93u+ 2012-08-01, ksh explicitly allows 'return' to use an exit status corresponding to a signal (from 257 to end of signal range). The rest of the integer range is trunctated to 8 bits. This is contrary to both 'man ksh' and 'return --man' which both say it's always truncated to 8 bits. Plus, combined with point 2 below, this new behaviour is nonsensical, as 'return' has no business actually generating signals. However, a couple of regression tests now depend on this, as may some scripts. 2. When a ksh-style function does not handle a signal, the signal is passed down to the parent environment and ksh does this by reissuing the signal to its own process after leaving the function scope. However, it does this by checking the exit status, which is very bad practice as there is no guarantee that an exit status corresponding to a signal was in fact produced by a signal, particularly after they changed the behaviour of 'return' per 1 above. This commit fixes both issues. It also takes a proper decision on allowable 'return' exit status arguments. Since 93u+ was released nearly a decade ago and some scripts may now rely on being able to pass certain exit statuses out of the 8-bit range, we should not disallow this now. But neither should we be half-hearted in allowing only some arbitrary selection of 9-bit statuses; 'return' values categorically should have nothing to do with signals, so this is no basis for limiting them. We're now allowing the full unsigned integer range, which is usually 32 bits. This is like zsh, and may create some interesting possibilities for scripts. Just don't forget that $? will still lose all but its 8 least significant bits when leaving the current (sub)shell environment. src/cmd/ksh93/sh/xec.c: sh_funscope(): - Fix passing down unhandled signals from interrupted ksh functions (jumpval==SH_JMPFUN) to the parent environment. Do not pay any attention to the exit status. Instead, use sh.lastsig (a.k.a. shp->lastsig). It is set by sh_fault() in fault.c for just this purpose and contains the last signal handled for the current command. It is reset in sh_exec() before running any new command. So if it contains a signal, that is the one that interrupted the ksh function, so it's the correct one to pass down. (Further evidence: sh_subshell() was already using this in the same way.) src/cmd/ksh93/bltins/cflow.c: b_return(): - Allow any signed int return value when invoked as and behaving like 'return'. - Add warning if a passed value is out of int range. Set the exit status to 128 in that case; int overflow is undefined behaviour in C and we want consistent behaviour across platforms. It should be safe enough to check if the long and int values are equal. - Refactor for clarity. src/cmd/ksh93/sh/subshell.c: sh_subshell(): - If a function returns with a status out of the 8 bit range in a virtual subshell, this status could be passed down to the parent shell in full. However, if the subshell forks, then the kernel will enforce an 8-bit exit status. That is inconsistent. Scripts should not be able to tell the difference between forked and non-forked subshells, so artificially enforce that limit here. Other changed files: - Documentation updates and copy-edits. - Update an AT&T functions.sh regress test to allow arbitrary integer return values for functions. - Add regression tests based in part on @JohnoKing's reproducers. - Rework some vaguely related regression tests to fail gracefully. Thanks to Johnothan King for the report and the testing. Fixes: https://github.com/ksh93/ksh/issues/364	2021-12-09 06:41:39 +01:00
Martijn Dekker	0d2d6bb47b	Documentation tweaks (re: `cd8c48cc`) The information about an out of memory error does not apply to the version of the PR that was eventually committed since it does not use getcwd() which might cause such an error. See https://github.com/ksh93/ksh/pull/358#issuecomment-986294934	2021-12-06 06:58:34 +01:00
Johnothan King	cd8c48cc5a	Add the '-e' flag to the 'cd' builtin (#358 ) This change adds the -e flag to the cd builtin, as specified in <https://www.austingroupbugs.net/view.php?id=253>. The -e flag is used to verify if the the current working directory after 'cd -P' successfully changes the directory, and returns with exit status 1 if the cwd couldn't be determined. Additionally, it causes all other errors to return with exit status >1 (i.e., status 2 unless ENOMEM occurs) if -e and -P are both active. src/cmd/ksh93/bltins/cd_pwd.c: - Add -e option to the cd builtin command. It verifies $PWD by using test_inode() to execute the equivalent of [[ . -ef $PWD ]]. - The check for restricted mode has been moved after optget to allow 'cd -eP' to return with exit status 2 when in restricted mode. To avoid changing the previous behavior of cd when -e isn't passed, extra checks have been added to prevent cd from printing usage information in restricted mode. src/cmd/ksh93/tests/builtins.sh: - Add regression tests for the exit status when using the cd -P flag with and without -e. src/cmd/ksh93/data/builtins.c, src/cmd/ksh93/sh.1: - Document the addition of -e to the cd builtin.	2021-12-06 06:58:11 +01:00
Johnothan King	6904585f49	Port illumos' shell linter improvements (#353 ) This commit ports over two improvements to the shell linter from illumos (original patch written by Andy Fiddaman). Links to the relevant bug reports and the original patch: https://www.illumos.org/issues/13601 https://www.illumos.org/issues/13631 `c7b656fc71` The first improvement is to the lint warning for arithmetic operators in [[ ... ]]. The ksh linter now suggests the correct equivalent operator to use in ((...)). Example: $ ksh -nc '[[ 30 -gt 25 ]]' # Original warning warning: line 1: -gt within [[ ... ]] obsolete, use ((...)) # New warning warning: line 1: [[ ... -gt ... ]] obsolete, use ((... > ...)) The second improvement pertains to variable expansion in arithmetic expressions. The ksh linter now suggests referencing variable names directly: $ ksh -nc 'integer foo=40; (($foo < 50 ))' # Old warning warning: line 1: variable expansion makes arithmetic evaluation less efficient # New warning warning: line 1: in '(($foo < 50))', using '$' is slower and can introduce rounding errors src/cmd/ksh93/{data/lexstates,sh/lex,sh/parse}.c: - Port the improved shell lint warnings from illumos to ksh93u+m. - The original checks for arithmetic operators involved a bunch of if statements with inefficient calls to strcmp(3). These were replaced with a more efficient switch statement that avoids strcmp.	2021-12-05 19:27:16 +01:00

1 2 3 4

155 Commits