man-pages

Commit Graph

Author	SHA1	Message	Date
Alejandro Colomar	5945cd7bd3	seccomp.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-20 14:43:43 +12:00
Alejandro Colomar	292583e25b	seccomp.2: Document why each header is needed Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Cc: Kees Cook <keescook@chromium.org> Cc: Tyler Hicks <tyhicks@canonical.com> Cc: Will Drewry <wad@chromium.org> Cc: Michael Kerrisk <mtk.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-20 14:43:43 +12:00
Michael Kerrisk	911789ee76	seccomp_unotify.2: Add caveats regarding emulation of blocking system calls Reported-by: Sargun Dhillon <sargun@sargun.me> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	1b5592f534	seccomp_unotify.2: Reformat ioctls as subsections rather than hanging list Doing so decreases the degree to which text is indented, and thus avoids short, poorly wrapped lines. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	d1c8db825a	seccomp_unotify.2: Document the SECCOMP_IOCTL_NOTIF_ADDFD ioctl() Starting from some notes by Sargun Dhillon. Reported-by: Sargun Dhillon <sargun@sargun.me> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	c13b1b2bdd	seccomp_unotify.2: EXAMPLES: simplify logic in getTargetPathname() And reword some comments there. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	f8899e1c88	seccomp_unotify.2: EXAMPLES: fix a file descriptor leak Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	8760bd15a1	seccomp_unotify.2: EXAMPLES: some code modularity improvements Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	8bae56c220	seccomp_unotify.2: Minor cleanup fix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	40fdc84999	seccomp_unotify.2: Change name of SECCOMP_IOCTL_NOTIF_ID_VALID function Give this function a shorter, slightly easier to read name. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	b4763b6e61	seccomp_unotify.2: Fixes after review comments from Christian Brauner Reported-by: Christian Brauner <christian.brauner@canonical.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	a46a1879c5	seccomp_unotify.2: A cookie check is also required after reading target's memory Quoting Jann Horn: [[ As discussed at <https://lore.kernel.org/r/CAG48ez0m4Y24ZBZCh+Tf4ORMm9_q4n7VOzpGjwGF7_Fe8EQH=Q@mail.gmail.com>, we need to re-check checkNotificationIdIsValid() after reading remote memory but before using the read value in any way. Otherwise, the syscall could in the meantime get interrupted by a signal handler, the signal handler could return, and then the function that performed the syscall could free() allocations or return (thereby freeing buffers on the stack). In essence, this pread() is (unavoidably) a potential use-after-free read; and to make that not have any security impact, we need to check whether UAF read occurred before using the read value. This should probably be called out elsewhere in the manpage, too... Now, of course, reading is the easy case. The difficult case is if we have to write to the remote process... because then we can't play games like that. If we write data to a freed pointer, we're screwed, that's it. (And for somewhat unrelated bonus fun, consider that /proc/$pid/mem is originally intended for process debugging, including installing breakpoints, and will therefore happily write over "readonly" private mappings, such as typical mappings of executable code.) So, uuuuh... I guess if anyone wants to actually write memory back to the target process, we'd better come up with some dedicated API for that, using an ioctl on the seccomp fd that magically freezes the target process inside the syscall while writing to its memory, or something like that? And until then, the manpage should have a big fat warning that writing to the target's memory is simply not possible (safely). ]] and <https://lore.kernel.org/r/CAG48ez0m4Y24ZBZCh+Tf4ORMm9_q4n7VOzpGjwGF7_Fe8EQH=Q@mail.gmail.com>: [[ The second bit of trouble is that if the supervisor is so oblivious that it doesn't realize that syscalls can be interrupted, it'll run into other problems. Let's say the target process does something like this: int func(void) { char pathbuf[4096]; sprintf(pathbuf, "/tmp/blah.%d", some_number); mount("foo", pathbuf, ...); } and mount() is handled with a notification. If the supervisor just reads the path string and immediately passes it into the real mount() syscall, something like this can happen: target: starts mount() target: receives signal, aborts mount() target: runs signal handler, returns from signal handler target: returns out of func() supervisor: receives notification supervisor: reads path from remote buffer supervisor: calls mount() but because the stack allocation has already been freed by the time the supervisor reads it, the supervisor just reads random garbage, and beautiful fireworks ensue. So the supervisor fundamentally has to be written to expect that at any time, the target can abandon a syscall. And every read of remote memory has to be separated from uses of that remote memory by a notification ID recheck. And at that point, I think it's reasonable to expect the supervisor to also be able to handle that a syscall can be aborted before the notification is delivered. ]] Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	8742c19c9f	seccomp_unotify.2: wfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	589a15959f	seccomp_unotify.2: EXAMPLES: make SECCOMP_IOCTL_NOTIF_ID_VALID function return bool - Rename the function that does the SECCOMP_IOCTL_NOTIF_ID_VALID check. - Make that function return a 'bool' rather than terminating the process. - Use that return value in the calling function. - Rework/improve various related comments. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	6f0ca7da71	seccomp_unotify.2: EXAMPLES: Improve comments describing checkNotificationIdIsValid() Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	8a7703864c	seccomp_unotify.2: EXAMPLES: make getTargetPathname() a bit more generically useful Allow the caller to specify which system call argument should be looked up as a pathname. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	dbcc2ad691	seccomp_unotify.2: SEE ALSO: add pidfd_open(2) and pidfd_getfd(2) pidfd_open(2) and pidfd_getfd(2) presumably have use cases with the user-space notification feature. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	9688ea78cb	seccomp_unotify.2: NOTES: describe an example use-case The container manager use case was the original motivation for this feature. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	b8360d3701	seccomp_unotify.2: Remove FIXME asking about usefulness of POLLOUT/EPOLLOUT According to Tycho Andersen, he had no particular use case in mind when building this detail into the API. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	b183b6503c	seccomp_unotify.2: srcfix: Add a further FIXME relating to SA_RESTART behavior Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	191350e602	seccomp_unotify.2: Various fixes after review comments from Kees Cook Reported-by: Kees Cook <keescook@chromium.org> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	16ba7af469	seccomp_unotify.2: Update a FIXME Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	fd1295e8f1	seccomp_unotify.2: Describe the interaction with SA_RESTART signal handlers And, as noted by Jann Horn, note how the user-space notification mechanism causes a small breakage in the user-space API with respect to nonrestartable system calls. ==== From the email discussion with Jann Horn > >> So, I partially demonstrated what you describe here, for two example > >> system calls (epoll_wait() and pause()). But I could not exactly > >> demonstrate things as I understand you to be describing them. (So, > >> I'm not sure whether I have not understood you correctly, or > >> if things are not exactly as you describe them.) > >> > >> Here's a scenario (A) that I tested: > >> > >> 1. Target installs seccomp filters for a blocking syscall > >> (epoll_wait() or pause(), both of which should never restart, > >> regardless of SA_RESTART) > >> 2. Target installs SIGINT handler with SA_RESTART > >> 3. Supervisor is sleeping (i.e., is not blocked in > >> SECCOMP_IOCTL_NOTIF_RECV operation). > >> 4. Target makes a blocking system call (epoll_wait() or pause()). > >> 5. SIGINT gets delivered to target; handler gets called; > >> *and syscall gets restarted by the kernel* > >> > >> That last should never happen, of course, and is a result of the > >> combination of both the user-notify filter and the SA_RESTART flag. > >> If one or other is not present, then the system call is not > >> restarted. > >> > >> So, as you note below, the UAPI gets broken a little. > >> > >> However, from your description above I had understood that > >> something like the following scenario (B) could occur: > >> > >> 1. Target installs seccomp filters for a blocking syscall > >> (epoll_wait() or pause(), both of which should never restart, > >> regardless of SA_RESTART) > >> 2. Target installs SIGINT handler with SA_RESTART > >> 3. Supervisor performs SECCOMP_IOCTL_NOTIF_RECV operation (which > >> blocks). > >> 4. Target makes a blocking system call (epoll_wait() or pause()). > >> 5. Supervisor gets seccomp user-space notification (i.e., > >> SECCOMP_IOCTL_NOTIF_RECV ioctl() returns > >> 6. SIGINT gets delivered to target; handler gets called; > >> and syscall gets restarted by the kernel > >> 7. Supervisor performs another SECCOMP_IOCTL_NOTIF_RECV operation > >> which gets another notification for the restarted system call. > >> > >> However, I don't observe such behavior. In step 6, the syscall > >> does not get restarted by the kernel, but instead returns -1/EINTR. > >> Perhaps I have misconstructed my experiment in the second case, or > >> perhaps I've misunderstood what you meant, or is it possibly the > >> case that things are not quite as you said? > > Thanks for the code, Jann (including the demo of the CLONE_FILES > technique to pass the notification FD to the supervisor). > > But I think your code just demonstrates what I described in > scenario A. So, it seems that I both understood what you > meant (because my code demonstrates the same thing) and > also misunderstood what you said (because I thought you > were meaning something more like scenario B). Ahh, sorry, I should've read your mail more carefully. Indeed, that testcase only shows scenario A. But the following shows scenario B... [Below, two pieces of code from Jann, with a lot of cosmetic changes by mtk.] ==== [And from a follow-up in the same email thread:] > If userspace relies on non-restarting behavior, it should be using > something like epoll_pwait(). And that stuff only unblocks signals > after we've already past the seccomp checks on entry. Thanks for elaborating that detail, since as soon as you talked about "enlarging a preexisting race" above, I immediately wondered sigsuspend(), pselect(), etc. (Mind you, I still wonder about the effect on system calls that are normally nonrestartable because they have timeouts. My understanding is that the kernel doesn't restart those system calls because it's impossible for the kernel to restart the call with the right timeout value. I wonder what happens when those system calls are restarted in the scenario we're discussing.) Anyway, returning to your point... So, to be clear (and to quickly remind myself in case I one day reread this thread), there is not a problem with sigsuspend(), pselect(), ppoll(), and epoll_pwait() since: * Before the syscall, signals are blocked in the target. * Inside the syscall, signals are still blocked at the time the check is made for seccomp filters. * If a seccomp user-space notification event kicks, the target is put to sleep with the signals still blocked. * The signal will only get delivered after the supervisor either triggers a spoofed success/failure return in the target or the supervisor sends a CONTINUE response to the kernel telling it to execute the target's system call. Either way, there won't be any restarting of the target's system call (and the supervisor thus won't see multiple notifications). ==== Scenario A $ ./seccomp_unotify_restart_scen_A C: installed seccomp: fd 3 C: woke 1 waiters P: child installed seccomp fd 3 C: About to call pause(): Success P: going to send SIGUSR1... C: sigusr1_handler handler invoked P: about to terminate C: got pdeath signal on parent termination C: about to terminate /* Modified version of code from Jann Horn / #define _GNU_SOURCE #include <stdio.h> #include <signal.h> #include <err.h> #include <errno.h> #include <unistd.h> #include <stdlib.h> #include <sched.h> #include <stddef.h> #include <limits.h> #include <sys/mman.h> #include <sys/syscall.h> #include <sys/prctl.h> #include <linux/seccomp.h> #include <linux/filter.h> #include <linux/futex.h> struct { int seccomp_fd; } shared; static void sigusr1_handler(int sig, siginfo_t * info, void uctx) { printf("C: sigusr1_handler handler invoked\n"); } static void sigusr2_handler(int sig, siginfo_t info, void uctx) { printf("C: got pdeath signal on parent termination\n"); printf("C: about to terminate\n"); exit(0); } int main(void) { setbuf(stdout, NULL); / Allocate memory that will be shared by parent and child / shared = mmap(NULL, 0x1000, PROT_READ \| PROT_WRITE, MAP_ANONYMOUS \| MAP_SHARED, -1, 0); if (shared == MAP_FAILED) err(1, "mmap"); shared->seccomp_fd = -1; / glibc's clone() wrapper doesn't support fork()-style usage / / Child process and parent share file descriptor table / pid_t child = syscall(__NR_clone, CLONE_FILES \| SIGCHLD, NULL, NULL, NULL, 0); if (child == -1) err(1, "clone"); / CHILD / if (child == 0) { / don't outlive the parent / prctl(PR_SET_PDEATHSIG, SIGUSR2); if (getppid() == 1) exit(0); / Install seccomp filter / prctl(PR_SET_NO_NEW_PRIVS, 1, 0, 0, 0); struct sock_filter insns[] = { BPF_STMT(BPF_LD \| BPF_W \| BPF_ABS, offsetof(struct seccomp_data, nr)), BPF_JUMP(BPF_JMP \| BPF_JEQ \| BPF_K, __NR_pause, 0, 1), BPF_STMT(BPF_RET \| BPF_K, SECCOMP_RET_USER_NOTIF), BPF_STMT(BPF_RET \| BPF_K, SECCOMP_RET_ALLOW) }; struct sock_fprog prog = { .len = sizeof(insns) / sizeof(insns[0]), .filter = insns }; int seccomp_ret = syscall(__NR_seccomp, SECCOMP_SET_MODE_FILTER, SECCOMP_FILTER_FLAG_NEW_LISTENER, &prog); if (seccomp_ret < 0) err(1, "install"); printf("C: installed seccomp: fd %d\n", seccomp_ret); / Place the notifier FD number into the shared memory / __atomic_store(&shared->seccomp_fd, &seccomp_ret, __ATOMIC_RELEASE); / Wake the parent / int futex_ret = syscall(__NR_futex, &shared->seccomp_fd, FUTEX_WAKE, INT_MAX, NULL, NULL, 0); printf("C: woke %d waiters\n", futex_ret); / Establish SA_RESTART handler for SIGUSR1 / struct sigaction act = { .sa_sigaction = sigusr1_handler, .sa_flags = SA_RESTART \| SA_SIGINFO }; if (sigaction(SIGUSR1, &act, NULL)) err(1, "sigaction"); struct sigaction act2 = { .sa_sigaction = sigusr2_handler, .sa_flags = 0 }; if (sigaction(SIGUSR2, &act2, NULL)) err(1, "sigaction"); / Make a blocking system call / perror("C: About to call pause()"); pause(); perror("C: pause returned"); exit(0); } / PARENT / / Wait for futex wake-up from child / int futex_ret = syscall(__NR_futex, &shared->seccomp_fd, FUTEX_WAIT, -1, NULL, NULL, 0); if (futex_ret == -1 && errno != EAGAIN) err(1, "futex wait"); / Get notification FD from the child / int fd = __atomic_load_n(&shared->seccomp_fd, __ATOMIC_ACQUIRE); printf("\tP: child installed seccomp fd %d\n", fd); sleep(1); printf("\tP: going to send SIGUSR1...\n"); kill(child, SIGUSR1); sleep(1); printf("\tP: about to terminate\n"); exit(0); } ==== Scenario B $ ./seccomp_unotify_restart_scen_B C: installed seccomp: fd 3 C: woke 1 waiters C: About to call pause() P: child installed seccomp fd 3 P: about to SECCOMP_IOCTL_NOTIF_RECV P: got notif: id=17773741941218455591 pid=25052 nr=34 P: about to send SIGUSR1 to child... P: about to SECCOMP_IOCTL_NOTIF_RECV C: sigusr1_handler handler invoked P: got notif: id=17773741941218455592 pid=25052 nr=34 P: about to send SIGUSR1 to child... P: about to SECCOMP_IOCTL_NOTIF_RECV C: sigusr1_handler handler invoked P: got notif: id=17773741941218455593 pid=25052 nr=34 P: about to send SIGUSR1 to child... P: about to SECCOMP_IOCTL_NOTIF_RECV C: sigusr1_handler handler invoked P: got notif: id=17773741941218455594 pid=25052 nr=34 P: about to send SIGUSR1 to child... C: sigusr1_handler handler invoked C: got pdeath signal on parent termination C: about to terminate / Modified version of code from Jann Horn / #define _GNU_SOURCE #include <stdio.h> #include <signal.h> #include <err.h> #include <errno.h> #include <unistd.h> #include <stdlib.h> #include <sched.h> #include <stddef.h> #include <string.h> #include <limits.h> #include <inttypes.h> #include <sys/mman.h> #include <sys/syscall.h> #include <sys/ioctl.h> #include <sys/prctl.h> #include <linux/seccomp.h> #include <linux/filter.h> #include <linux/futex.h> struct { int seccomp_fd; } shared; static void sigusr1_handler(int sig, siginfo_t * info, void uctx) { printf("C: sigusr1_handler handler invoked\n"); } static void sigusr2_handler(int sig, siginfo_t info, void uctx) { printf("C: got pdeath signal on parent termination\n"); printf("C: about to terminate\n"); exit(0); } static size_t max_size(size_t a, size_t b) { return (a > b) ? a : b; } int main(void) { setbuf(stdout, NULL); / Allocate memory that will be shared by parent and child / shared = mmap(NULL, 0x1000, PROT_READ \| PROT_WRITE, MAP_ANONYMOUS \| MAP_SHARED, -1, 0); if (shared == MAP_FAILED) err(1, "mmap"); shared->seccomp_fd = -1; / glibc's clone() wrapper doesn't support fork()-style usage / / Child process and parent share file descriptor table / pid_t child = syscall(__NR_clone, CLONE_FILES \| SIGCHLD, NULL, NULL, NULL, 0); if (child == -1) err(1, "clone"); / CHILD / if (child == 0) { / don't outlive the parent / prctl(PR_SET_PDEATHSIG, SIGUSR2); if (getppid() == 1) exit(0); / Install seccomp filter / prctl(PR_SET_NO_NEW_PRIVS, 1, 0, 0, 0); struct sock_filter insns[] = { BPF_STMT(BPF_LD \| BPF_W \| BPF_ABS, offsetof(struct seccomp_data, nr)), BPF_JUMP(BPF_JMP \| BPF_JEQ \| BPF_K, __NR_pause, 0, 1), BPF_STMT(BPF_RET \| BPF_K, SECCOMP_RET_USER_NOTIF), BPF_STMT(BPF_RET \| BPF_K, SECCOMP_RET_ALLOW) }; struct sock_fprog prog = { .len = sizeof(insns) / sizeof(insns[0]), .filter = insns }; int seccomp_ret = syscall(__NR_seccomp, SECCOMP_SET_MODE_FILTER, SECCOMP_FILTER_FLAG_NEW_LISTENER, &prog); if (seccomp_ret < 0) err(1, "install"); printf("C: installed seccomp: fd %d\n", seccomp_ret); / Place the notifier FD number into the shared memory / __atomic_store(&shared->seccomp_fd, &seccomp_ret, __ATOMIC_RELEASE); / Wake the parent / int futex_ret = syscall(__NR_futex, &shared->seccomp_fd, FUTEX_WAKE, INT_MAX, NULL, NULL, 0); printf("C: woke %d waiters\n", futex_ret); / Establish SA_RESTART handler for SIGUSR1 / struct sigaction act = { .sa_sigaction = sigusr1_handler, .sa_flags = SA_RESTART \| SA_SIGINFO }; if (sigaction(SIGUSR1, &act, NULL)) err(1, "sigaction"); struct sigaction act2 = { .sa_sigaction = sigusr2_handler, .sa_flags = 0 }; if (sigaction(SIGUSR2, &act2, NULL)) err(1, "sigaction"); / Make a blocking system call / printf("C: About to call pause()\n"); pause(); perror("C: pause returned"); exit(0); } / PARENT / / Wait for futex wake-up from child / int futex_ret = syscall(__NR_futex, &shared->seccomp_fd, FUTEX_WAIT, -1, NULL, NULL, 0); if (futex_ret == -1 && errno != EAGAIN) err(1, "futex wait"); / Get notification FD from the child / int fd = __atomic_load_n(&shared->seccomp_fd, __ATOMIC_ACQUIRE); printf("\tP: child installed seccomp fd %d\n", fd); / Discover seccomp buffer sizes and allocate notification buffer / struct seccomp_notif_sizes sizes; if (syscall(__NR_seccomp, SECCOMP_GET_NOTIF_SIZES, 0, &sizes)) err(1, "notif_sizes"); struct seccomp_notif notif = malloc(max_size(sizeof(struct seccomp_notif), sizes.seccomp_notif)); if (!notif) err(1, "malloc"); for (int i = 0; i < 4; i++) { printf("\tP: about to SECCOMP_IOCTL_NOTIF_RECV\n"); memset(notif, '\0', sizes.seccomp_notif); if (ioctl(fd, SECCOMP_IOCTL_NOTIF_RECV, notif)) err(1, "notif_recv"); printf("\tP: got notif: id=%llu pid=%u nr=%d\n", notif->id, notif->pid, notif->data.nr); sleep(1); printf("\tP: about to send SIGUSR1 to child...\n"); kill(child, SIGUSR1); } sleep(1); exit(0); } ==== Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	1661264841	seccomp_unotify.2: EXAMPLE: correct the check for NUL in buffer returned by read() In the usual case, read(fd, buf, PATH_MAX) will return PATH_MAX bytes that include trailing garbage after the pathname. So the right check is to scan from the start of the buffer to see if there's a NUL, and error if there is not. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	d1774d6af8	seccomp_unotify.2: Better handling of invalid target pathname After some discussions with Jann Horn, perhaps a better way of dealing with an invalid target pathname is to trigger an error for the system call. Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	47056412d7	seccomp_unotify.2: EXAMPLE: rename a variable Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	2f37aeb620	seccomp_unotify.2: EXAMPLE: Improve allocation of response buffer From a conversation with Jann Horn: [[ >>>> struct seccomp_notif_resp resp = malloc(sizes.seccomp_notif_resp); >>> >>> This should probably do something like max(sizes.seccomp_notif_resp, >>> sizeof(struct seccomp_notif_resp)) in case the program was built >>> against new UAPI headers that make struct seccomp_notif_resp big, but >>> is running under an old kernel where that struct is still smaller? >> >> I'm confused. Why? I mean, if the running kernel says that it expects >> a buffer of a certain size, and we allocate a buffer of that size, >> what's the problem? > > Because in userspace, we cast the result of malloc() to a "struct > seccomp_notif_resp ". If the kernel tells us that it expects a size > smaller than sizeof(struct seccomp_notif_resp), then we end up with a > pointer to a struct that consists partly of allocated memory, partly > of out-of-bounds memory, which is generally a bad idea - I'm not sure > whether the C standard permits that. And if userspace then e.g. > decides to access some member of that struct that is beyond what the > kernel thinks is the struct size, we get actual OOB memory accesses. Got it. (But gosh, this seems like a fragile API mess.) I added the following to the code: /* When allocating the response buffer, we must allow for the fact that the user-space binary may have been built with user-space headers where 'struct seccomp_notif_resp' is bigger than the response buffer expected by the (older) kernel. Therefore, we allocate a buffer that is the maximum of the two sizes. This ensures that if the supervisor places bytes into the response structure that are past the response size that the kernel expects, then the supervisor is not touching an invalid memory location. / size_t resp_size = sizes.seccomp_notif_resp; if (sizeof(struct seccomp_notif_resp) > resp_size) resp_size = sizeof(struct seccomp_notif_resp); struct seccomp_notif_resp resp = malloc(resp_size); ]] Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	bf892a6527	seccomp_unotify.2: EXAMPLE: ensure path read() by the supervisor is null-terminated From a conversation with Jann Horn: >> We should probably make sure here that the value we read is actually >> NUL-terminated? > > So, I was curious about that point also. But, (why) are we not > guaranteed that it will be NUL-terminated? Because it's random memory filled by another process, which we don't necessarily trust. While seccomp notifiers aren't usable for applying extra security restrictions, the supervisor will still often be more privileged than the supervised process. Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	e4db7ae69d	seccomp_unotify.2: wfix in example program Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	5c12cebdf2	seccomp_unotify.2: Small wording fix Change "read(2) will return 0" to "read(2) may return 0". Quoting Jann Horn: Maybe make that "may return 0" instead of "will return 0" - reading from /proc/$pid/mem can only return 0 in the following cases AFAICS: 1. task->mm was already gone at open() time 2. mm->mm_users has dropped to zero (the mm only has lazytlb users; page tables and VMAs are being blown away or have been blown away) 3. the syscall was called with length 0 When a process has gone away, normally mm->mm_users will drop to zero, but someone else could theoretically still be holding a reference to the mm (e.g. someone else in the middle of accessing /proc/$pid/mem). (Such references should normally not be very long-lived though.) Additionally, in the unlikely case that the OOM killer just chomped through the page tables of the target process, I think the read will return -EIO (same error as if the address was simply unmapped) if the address is within a non-shared mapping. (Maybe that's something procfs could do better...) Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	e06808b4b1	seccomp_unotify.2: Minor wording change + add a FIXME Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	bcfeed7d4e	seccomp_unotify.2: User-space notification can't be used to implement security policy Add some strongly worded text warning the reader about the correct uses of seccomp user-space notification. Reported-by: Jann Horn <jannh@google.com> Cowritten-by: Christian Brauner <christian@brauner.io> Cowritten-by: Kees Cook <keescook@chromium.org> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	03e4237409	seccomp_unotify.2: Fixes after review comments from Christian Brauner Reported-by: Christian Brauner <christian@brauner.io> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	fd376c6b2a	seccomp.2, seccomp_unotify.2: Clarify that there can be only one SECCOMP_FILTER_FLAG_NEW_LISTENER Reported-by: Christian Brauner <christian@brauner.io> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	cd3224b7df	seccomp_unotify.2: Note when FD indicates EOF/(E)POLLHUP in (e)poll/select Verified by experiment. Reported-by: Christian Brauner <christian.brauner@canonical.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	6048506c77	seccomp_unotify.2: Note when notification FD indicates as writable by select/poll/epoll Reported-by: Tycho Andersen <tycho@tycho.pizza> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	ea4d03e6b0	seccomp_unotify.2: Minor fixes Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	a08715b41e	seccomp_unotify.2: Fixes after review comments by Jann Horn Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	d85217eff7	seccomp_unotify.2: Add BUGS section describing SECCOMP_IOCTL_NOTIF_RECV bug Tycho Andersen confirmed that this issue is present. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	72a8602617	seccomp_unotify.2: srcfix: remove bogus FIXME Pathname arguments are limited to PATH_MAX bytes. Reported-by: Tycho Andersen <tycho@tycho.pizza> Reported-by: Jann Horn <jannh@google.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	391194cd52	seccomp_unotify.2: Changes after feed back from Tycho Andersen Reported-by: Tycho Andersen <tycho@tycho.pizza> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	a9a8e35644	seccomp_unotify.2: Document the seccomp user-space notification mechanism The APIs used by this mechanism comprise not only seccomp(2), but also a number of ioctl(2) operations. And any useful example demonstrating these APIs is will necessarily be rather long. Trying to cram all of this into the seccomp(2) page would make that page unmanageably long. Therefore, let's document this mechanism in a separate page. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	0a86ac9c9b	seccomp.2: Note that SECCOMP_RET_USER_NOTIF can be overridden Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	8459e46597	seccomp.2: wfix: mention term "supervisor" in description of SECCOMP_RET_USER_NOTIF Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	1741f7fc2e	seccomp.2: SEE ALSO: add seccomp_unotify(2) Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	2bbe9bd9ae	seccomp.2: Rework SECCOMP_GET_NOTIF_SIZES somewhat The existing text says the structures (plural!) contain a 'struct seccomp_data'. But this is only true for the received notification structure (seccomp_notif). So, reword the sentence to be more general, noting simply that the structures may evolve over time. Add some comments to the structure definition. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	b723c6d8dd	seccomp.2: Add some details for SECCOMP_FILTER_FLAG_NEW_LISTENER Rework the description a little, and note that the close-on-exec flag is set for the returned file descriptor. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:17 +12:00
Michael Kerrisk	d7a3918456	seccomp.2: Minor edits to Tycho's SECCOMP_FILTER_FLAG_NEW_LISTENER patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Tycho Andersen	b9395f4a3e	seccomp.2: Document SECCOMP_FILTER_FLAG_NEW_LISTENER Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Michael Kerrisk	8fa47f3ae4	seccomp.2: Reorder list of SECCOMP_SET_MODE_FILTER flags alphabetically (No content changes.) Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Michael Kerrisk	3bed246e7e	seccomp.2: Some reworking of Tycho's SECCOMP_RET_USER_NOTIF patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Tycho Andersen	c734bbd265	seccomp.2: Document SECCOMP_RET_USER_NOTIF Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Michael Kerrisk	6fc8b8a0a1	seccomp.2: Minor edits to Tycho Andersen's patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Tycho Andersen	9bc48145a6	seccomp.2: Document SECCOMP_GET_NOTIF_SIZES Signed-off-by: Tycho Andersen <tycho@tycho.ws> CC: Kees Cook <keescook@chromium.org> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:40:16 +12:00
Michael Kerrisk	408483bd31	socketcall.2: srcfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	b6687e3971	socketcall.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	1b4d275a0e	sigprocmask.2: Use syscall(SYS_...); for raw system calls Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	aa03a4e732	shmop.2: Remove unused include Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	1cd36d9dea	sgetmask.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	18e21e1e4c	set_tid_address.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	ba4d34a16d	set_thread_area.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	9202a1eb8e	rt_sigqueueinfo.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Alejandro Colomar	5e9623f3b9	open.2: Remove unused <sys/stat.h> I can't see a reason to include it. <fcntl.h> provides O_* constants for 'flags', S_* constants for 'mode', and mode_t. Probably a long time ago, some of those weren't defined in <fcntl.h>, and both headers needed to be included, or maybe it's a historical error. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-06-10 10:37:46 +12:00
Michael Kerrisk	14987c153f	setresuid.2: tfix (Oxford comma) Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-21 20:19:28 +12:00
Michael Kerrisk	e4a403876d	select.2: Strengthen the warning regarding the low value of FD_SETSIZE All modern code should avoid select(2) in favor of poll(2) or epoll(7). For a long history of this problem, see: https://marc.info/?l=bugtraq&m=110660879328901 List: bugtraq Subject: SECURITY.NNOV: Multiple applications fd_set structure bitmap array index overflow From: 3APA3A <3APA3A () security ! nnov ! ru> Date: 2005-01-24 20:30:08 https://sourceware.org/legacy-ml/libc-alpha/2003-05/msg00171.html User-settable FD_SETSIZE and select() From: mtk-lists at gmx dot net To: libc-alpha at sources dot redhat dot com Date: Mon, 19 May 2003 14:49:03 +0200 (MEST) Subject: User-settable FD_SETSIZE and select() https://sourceware.org/bugzilla/show_bug.cgi?id=10352 http://0pointer.net/blog/file-descriptor-limits.html https://twitter.com/pid_eins/status/1394962183033868292 Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-20 11:00:11 +12:00
Michael Kerrisk	2a1ba6ae7f	select.2: Relocate sentence about the fd_set value-result arguments to BUGS Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-20 09:49:09 +12:00
Alejandro Colomar	65dfda3dd1	sched_setattr.2: Use syscall(SYS_...); for system calls without a wrapper Document also why each header is required Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 20:04:30 +12:00
Alejandro Colomar	d4d006687d	s390_sthyi.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 20:04:30 +12:00
Alejandro Colomar	c6450cf82b	s390_sthyi.2: Replace numeric constant by its name (macro) Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Cc: Heiko Carstens <heiko.carstens@de.ibm.com> Cc: Eugene Syromyatnikov <evgsyr@gmail.com> Cc: QingFeng Hao <haoqf@linux.vnet.ibm.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 20:04:30 +12:00
Alejandro Colomar	cca4e32eb3	s390_runtime_instr.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 20:04:30 +12:00
Alejandro Colomar	f908665187	s390_pci_mmio_write.2: Use syscall(SYS_...); for system calls without a wrapper; fix includes too This function doesn't use any flags or special types, so there's no reason to include <asm/unistd.h>; remove it. Add the includes needed for syscall(2) only. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 20:04:30 +12:00
Alejandro Colomar	56cfe81cfb	s390_guarded_storage.2: Use syscall(SYS_...); for system calls without a wrapper Also document why each header is needed. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 20:04:30 +12:00
Alejandro Colomar	cc6f5bf20f	rename.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 16:44:37 +12:00
Michael Kerrisk	090fdddb43	memfd_create.2, mmap.2, shmget.2: Document the EPERM for huge page allocations This error can occur if the caller is does not have CAP_IPC_LOCK and is not a member of the sysctl_hugetlb_shm_group. Reported-by: Yang Xu <xuyang2018.jy@fujitsu.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-17 15:42:04 +12:00
Michael Kerrisk	66c743b191	getdents.2: ffix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	fac7dabcd1	reboot.2: Use syscall(SYS_...); for system calls without a wrapper Explain also why headers are needed. And some ffix. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	71b08c22b5	readlink.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	8f33ee075a	readdir.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	05214ec7ba	quotactl.2: Better detail why <xfs/xqm.h> is included Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	4e8ac36900	process_madvise.2: Use syscall(SYS_...); for system calls without a wrapper. Fix includes too. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	e393b243c0	poll.2: Remove <signal.h> It is only used for providing 'sigset_t'. We're only documenting (with some exceptions) the includes needed for constants and the prototype itself. And 'sigset_t' is better documented in system_data_types(7). Remove that include. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	3e67d1a76b	pivot_root.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	5a24cb274f	pipe.2: wfix For consistency with other pages. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	94df87ef9b	pidfd_send_signal.2: Use syscall(SYS_...); for system calls without a wrapper. Fix includes too Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	46227ba213	pidfd_open.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	13cf4fc78a	pidfd_getfd.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	e691579150	perf_event_open.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	10f4414ccb	openat2.2: Use syscall(SYS_...); for system calls without a wrapper; fix includes too Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	bc2813df5e	alloc_hugepages.2, arch_prctl.2, capget.2, clone.2, delete_module.2, exit_group.2, get_robust_list.2, getunwind.2, init_module.2: Add note about the use of syscall(2) Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	a39bcd0b85	mq_getsetattr.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	57d2facb78	modify_ldt.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	0eefb56c95	mmap2.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:45 +12:00
Alejandro Colomar	01ee7ce9b7	mknod.2: Remove unused includes All of the constants used by mknod() are defined in <sys/stat.h>. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:28:38 +12:00
Alejandro Colomar	c88fc2baad	mincore.2: Remove unused include AFAICS, there's no use for <unistd.h> here. The prototype is declared in <sys/mman.h>, and there are no constants needed. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	39df5bd6bc	membarrier.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	3977e9ff1f	lookup_dcookie.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	aecad91d0b	llseek.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	f1d0eaf52b	link.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	5b013bd50f	keyctl.2: Use syscall(SYS_...); for system calls without a glibc wrapper Remove the libkeyutils prototype from the synopsis, which isn't documented in the rest of the page, and as NOTES says, it's probably better to use the various library functions. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	e59830eda9	kexec_load.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	b5c3fcdb65	kcmp.2: tfix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	2f4306b033	kcmp.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	e3e30ce1bd	ipc.2: Add needed include The constants needed for using this function are defined in <linux/ipc.h>. Add the include, even when those constants are not mentioned in this manual page. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	acb2e04c24	ipc.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	0d961e8818	ioprio_set.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	2f8cb589fb	ioperm.2: Remove obvious comment Of course that is for the glibc wrapper. As all of the other pages that don't explicitly say otherwise. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	8b4c942c50	io_getevents.2: Use syscall(SYS_...); for system calls without a wrapper In this case there's a wrapper provided by libaio, but this page documents the raw syscall. Also remove <linux/time.h> from the includes: 'struct timespec' is already documented in system_data_types(7), where the information is more up to date. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	950d1738ef	io_destroy.2: Use syscall(SYS_...); for system calls without a wrapper In this case there's a wrapper provided by libaio, but this page documents the raw syscall. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	cac89bc794	ioctl_userfaultfd.2: SYNOPSIS: Add <linux/userfaultfd.h> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	70f9a4edb3	ioctl_tty.2: Fix includes <sys/ioctl.h> is needed for the prototype of ioctl(). That header also provides most of the constants used by the function. Only a few of those constants are not provided by that header, and need <termios.h>; clarify which constants do need that include. ...... $ <man2/ioctl_tty.2 \ sed -n '/^.SH DESCRIPTION/,/^.SH/p' \ \|grep -e '^\.B' -e TIOCM \ \|sed 's/^\.B[^ ]* //' \ \|awk '{print $1}' \ \|grep '^[[:upper:]]' \ \|grep -v -e '^CAP' -e '^E' -e '^SIG' -e '^O_' -e '^[TR]XD$' -e '^POLL' \ \|sort \ \|uniq \ \|while read f; do \ find /usr/include/ -type f \ \|xargs grep -l "define\s$f" \ \|grep -q ioctl.*.h \ \|\|echo $f \ \|while read ff; do \ echo "============ $ff"; \ find /usr/include/ -type f \ \|xargs grep -n "define\s$ff"; \ done; \ done; ============ CLOCAL /usr/include/asm-generic/termbits.h:142:#define CLOCAL 0004000 /usr/include/gphoto2/gphoto2-port-portability.h:127:# define CLOCAL 0x00000800 /usr/include/x86_64-linux-gnu/bits/termios-c_cflag.h:34:#define CLOCAL 0004000 ============ TCIFLUSH /usr/include/asm-generic/termbits.h:191:#define TCIFLUSH 0 /usr/include/x86_64-linux-gnu/bits/termios.h:70:#define TCIFLUSH 0 ============ TCIOFF /usr/include/asm-generic/termbits.h:187:#define TCIOFF 2 /usr/include/x86_64-linux-gnu/bits/termios.h:66:#define TCIOFF 2 ============ TCIOFLUSH /usr/include/asm-generic/termbits.h:193:#define TCIOFLUSH 2 /usr/include/x86_64-linux-gnu/bits/termios.h:72:#define TCIOFLUSH 2 ============ TCION /usr/include/asm-generic/termbits.h:188:#define TCION 3 /usr/include/x86_64-linux-gnu/bits/termios.h:67:#define TCION 3 ============ TCOFLUSH /usr/include/asm-generic/termbits.h:192:#define TCOFLUSH 1 /usr/include/x86_64-linux-gnu/bits/termios.h:71:#define TCOFLUSH 1 ============ TCOOFF /usr/include/asm-generic/termbits.h:185:#define TCOOFF 0 /usr/include/x86_64-linux-gnu/bits/termios.h:64:#define TCOOFF 0 ============ TCOON /usr/include/asm-generic/termbits.h:186:#define TCOON 1 /usr/include/x86_64-linux-gnu/bits/termios.h:65:#define TCOON 1 ============ TIOCREMOTE ============ TIOCSTART ============ TIOCSTOP ============ TIOCTTYGSTRUCT ============ TIOCUCNTL Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	4b63cf3ca7	getdents.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	0aa385fe25	futex.2: Use syscall(SYS_...); for system calls without a wrapper At the same time, document only headers that are required for calling the function, or those that are specific to the function: <unistd.h> is required for the syscall() prototype. <sys/syscall.h> is required for the syscall name SYS_xxx. <linux/futex.h> is specific to this syscall. However, uint32_t is generic enough that it shouldn't be documented here. The system_data_types(7) page already documents it, and is more precise about it. The same goes for timespec. As a general rule a man[23] page should document the header that includes the prototype, and all of the headers that define macros that should be used with the call. However, the information about types should be restricted to system_data_types(7) (and that page should probably be improved by adding types), except for types that are very specific to the call. Otherwise, we're duplicating info and it's then harder to maintain, and probably outdated in the future. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 17:18:08 +12:00
Alejandro Colomar	1cf69258ad	execveat.2: Remove unused include This complements commit `e3eba861bd`. Since we don't need syscall(2) anymore, we don't need SYS_* definitions. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 05:06:23 +12:00
Alejandro Colomar	1c55195743	open.2: Fix bug in linkat(2) call example AT_EMPTY_PATH works with empty strings (""), but not with NULL (or at least it's not obvious). The relevant kernel code is the following: linux$ sed -n 189,198p fs/namei.c result->refcnt = 1; /* The empty path is special. / if (unlikely(!len)) { if (empty) empty = 1; if (!(flags & LOOKUP_EMPTY)) { putname(result); return ERR_PTR(-ENOENT); } } Reported-by: Walter Harms <wharms@bfs.de> Cc: Theodore Ts'o <tytso@mit.edu> Cc: Adam Borowski <kilobyte@angband.pl> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-11 04:34:03 +12:00
Alejandro Colomar	099b33fbff	epoll_wait.2: Move subsection to NOTES from BUGS 'C library/kernel differences' was added to BUGS incorrectly. Fix it Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 13:05:42 +12:00
Alejandro Colomar	4cc91169d0	sched_get_priority_max.2, open_memstream.3: tfix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 11:19:29 +12:00
Alejandro Colomar	fe10d82f6e	clone.2: tfix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 11:17:11 +12:00
Akihiro Motoki	19063c3c07	signalfd.2: tfix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Akihiro Motoki	a128697221	semctl.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Akihiro Motoki	4f060b5467	move_pages.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Štěpán Němec	85102346e3	execve.2: tfix Signed-off-by: Štěpán Němec <stepnem@gmail.com> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Vishwajith K	5974cc557f	shmop.2: tfix Signed-off-by: Vishwajith K <vishuvikas1996@gmail.com> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Alejandro Colomar	f729c4f36e	sigwaitinfo.2: tfix Fix wording issue introduced in commit `bf1298c9e5`. Reported-by: Chris Keilbart <keilbartchris@gmail.com> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Jakub Wilk	ba48f20bc4	exit_group.2, getunwind.2: tfix Signed-off-by: Jakub Wilk <jwilk@jwilk.net> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Dmitry V. Levin	8bc6f5fdea	ptrace.2: mention PTRACE_GET_SYSCALL_INFO in RETURN VALUE section Mirror the wording about PTRACE_GET_SYSCALL_INFO return value semantics from "DESCRIPTION" section to "RETURN VALUE" section. Reported-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com> Complements: `fc91449cb` "ptrace.2: Document PTRACE_GET_SYSCALL_INFO" Signed-off-by: Dmitry V. Levin <ldv@altlinux.org> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Dmitry V. Levin	4d95a1eef3	move_pages.2: ffix Signed-off-by: Dmitry V. Levin <ldv@altlinux.org> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Johannes Berg	1874ca39ba	clone.2: tfix Despite my mention of this spawning a hilarious discussion on IRC, this alignment restriction should be 128-bit, not 126-bit. Signed-off-by: Johannes Berg <johannes@sipsolutions.net> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Borislav Petkov	67238a538d	sigaltstack.2: tfix Add a missing "to" in an "in order to" formulation. Signed-off-by: Borislav Petkov <bp@suse.de> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Aurelien Aptel	334ed9799d	flock.2: add CIFS details CIFS flock() locks behave differently than the standard. Give overview of those differences. Here is the rendered text: CIFS details In Linux kernels up to 5.4, flock() is not propagated over SMB. A file with such locks will not appear locked for remote clients. Since Linux 5.5, flock() locks are emulated with SMB byte-range locks on the entire file. Similarly to NFS, this means that fcntl(2) and flock() locks interact with one another. Another important side-effect is that the locks are not advisory anymore: any IO on a locked file will always fail with EACCES when done from a separate file descriptor. This difference originates from the design of locks in the SMB proto- col, which provides mandatory locking semantics. Remote and mandatory locking semantics may vary with SMB protocol, mount options and server type. See mount.cifs(8) for additional infor- mation. Signed-off-by: Aurelien Aptel <aaptel@suse.com> Discussion: linux-man <https://lore.kernel.org/linux-man/20210302154831.17000-1-aaptel@suse.com/> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 09:09:11 +12:00
Alejandro Colomar	dffa597887	Various pages: Remove unused <sys/ipc.h> (and <sys/types.h>) In `b0b19983d9` we removed <sys/types.h>. For the same reasons there, remove now <sys/ipc.h> from many pages. If someone wonders why <sys/ipc.h> was needed, the reason was to get all the definitions of IPC_* constants. However, that header is now included by <sys/msg.h>, so it's not needed anymore to explicitly include it. Quoting POSIX: "In addition, the <sys/msg.h> header shall include the <sys/ipc.h> header." There were some remaining cases where I forgot to remove <sys/types.h>; remove them now too. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 07:47:06 +12:00
Alejandro Colomar	a484b43bde	open_by_handle_at.2: Remove unused <sys/stat.h> AFAICS, all types and constants used by these functions are defined in <fcntl.h>. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 06:10:12 +12:00
Michael Kerrisk	eb2b1b990d	syscalls.2: perfmonctl(2) was removed in Linux 5.10 Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 06:08:38 +12:00
Michael Kerrisk	9c6ca43e91	perfmonctl.2: This system call was removed in Linux 5.10 Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 06:08:38 +12:00
Michael Kerrisk	e3eba861bd	execveat.2: Library support has been added in glibc 2.34 Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-05-10 04:16:42 +12:00
Michael Kerrisk	ba93f72c44	syscalls.2: SEE ALSO: add ausyscall(1) Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-29 21:14:59 +02:00
Michael Kerrisk	2673a70a57	get_mempolicy.2, mq_getsetattr.2, poll.2: ffix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-06 13:44:48 +02:00
Michael Kerrisk	92a4b09356	pipe.2: Rearrange SYNOPSIS so that minority version pipe() is at end A few architectures have a different call signature for pipe(). Since those architectures are the minority, place the prototype at the end of the SYNOPSIS, rather than the start. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-06 13:44:48 +02:00
Michael Kerrisk	3ab99460db	mount.2: ffix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-06 13:44:48 +02:00
Michael Kerrisk	9ae36f1824	userfaultfd.2: tfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-06 13:44:48 +02:00
Michael Kerrisk	1cf1ada55a	ioctl_userfaultfd.2: tfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-06 13:44:48 +02:00
Michael Kerrisk	7a3d084504	ioctl_userfaultfd.2, userfaultfd.2: Minor tweaks to Peter Xu's patches Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-06 10:04:46 +02:00
Peter Xu	f559fa36a6	ioctl_userfaultfd.2: Add write-protect mode docs Userfaultfd write-protect mode is supported starting from Linux 5.7. Acked-by: Mike Rapoport <rppt@linux.vnet.ibm.com> Signed-off-by: Peter Xu <peterx@redhat.com> [alx: ffix + srcfix] Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 22:02:44 +02:00
Peter Xu	fbda69bbdd	ioctl_userfaultfd.2: Add UFFD_FEATURE_THREAD_ID docs UFFD_FEATURE_THREAD_ID is supported in Linux 4.14. Acked-by: Mike Rapoport <rppt@linux.vnet.ibm.com> Signed-off-by: Peter Xu <peterx@redhat.com> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 22:02:05 +02:00
Peter Xu	4b338b38e6	userfaultfd.2: Add write-protect mode Write-protect mode is supported starting from Linux 5.7. Acked-by: Mike Rapoport <rppt@linux.vnet.ibm.com> Signed-off-by: Peter Xu <peterx@redhat.com> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 22:00:21 +02:00
Peter Xu	e70f957d81	userfaultfd.2: Add UFFD_FEATURE_THREAD_ID docs UFFD_FEATURE_THREAD_ID is supported since Linux 4.14. Acked-by: Mike Rapoport <rppt@linux.vnet.ibm.com> Signed-off-by: Peter Xu <peterx@redhat.com> [alx: srcfix] Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 21:48:53 +02:00
Alejandro Colomar	0bdc3187ec	io_cancel.2: Use syscall(SYS_...); for system calls without a wrapper In this case there's a wrapper provided by libaio, but this page documents the raw kernel syscall. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	d9f8238d37	init_module.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	4695076306	delete_module.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Michael Kerrisk	843c01931c	get_robust_list.2: ffix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	eb4a40e023	get_robust_list.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	a2fe65bc36	getunwind.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	73f2d2ba11	exit_group.2: Use syscall(SYS_...); for system calls without a wrapper <linux/unistd.h> is not needed. We need <unistd.h> for syscall(), and <sys/syscall.h> for SYS_exit_group. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	86970bf4d3	execveat.2: Use syscall(SYS_...); for system calls without a wrapper Add <linux/fcntl.h>, which contains AT_* definitions used by execveat(). Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:48 +02:00
Alejandro Colomar	a380538369	clone.2: Use syscall(SYS_...); for system calls without a wrapper The CLONE_* constants seem to be available from either <linux/sched.h> or <sched.h>, and since clone3() already includes <linux/sched.h> for 'struct clone_args', <sched.h> is not really needed, AFAICS; however, to avoid confusion, I also included <sched.h> for clone3() for consistency: clone() is getting CLONE_* from <sched.h>, and it would confuse the reader if clone3() got the same CLONE_* constants from a different header. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:44:42 +02:00
Michael Kerrisk	74ddb3019f	capget.2: Minor tweaks to Alex Colomar's patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:29:05 +02:00
Alejandro Colomar	00e4779ad7	capget.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:26:53 +02:00
Alejandro Colomar	149eb741d7	arch_prctl.2: SYNOPSIS: Remove unused includes AFAICS, there's no reason to include that. All of the macros that this function uses are already defined in the other headers. Cc: glibc <libc-alpha@sourceware.org> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:25:23 +02:00
Alejandro Colomar	a9a96d8a5b	arch_prctl.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:25:14 +02:00
Alejandro Colomar	58d15b72da	alloc_hugepages.2: Use syscall(SYS_...); for system calls without a wrapper The page didn't specify includes, and the syscalls are extinct, so instead of adding incomplete information about includes, just leave it without any includes. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:23:06 +02:00
Alejandro Colomar	4175b08beb	access.2: Use syscall(SYS_...); for system calls without a wrapper Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:17:25 +02:00
Michael Kerrisk	fa1755e4be	ioctl_fideduperange.2: Minor tweaks to Alex Colomar's patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:15:31 +02:00
Alejandro Colomar	0fdaa53806	ioctl_fideduperange.2: Make clear why exactly is each header needed Only the include that provides the prototype doesn't need a comment. Also sort the includes alphabetically. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:13:57 +02:00
Alejandro Colomar	40798f5b45	ioctl_ficlonerange.2: Make clear why is each header exactly needed. Only the one that provides the prototype doesn't need a comment. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:13:00 +02:00
Alejandro Colomar	f72f6f260b	ioctl_fslabel.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:10:31 +02:00
Michael Kerrisk	5aed5c683e	ioctl_getfsmap.2: Minor tweaks to Alex Colomar's patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:08:58 +02:00
Alejandro Colomar	2ef3cedaf4	ioctl_getfsmap.2: Make clear why exactly is each header needed <linux/fs.h> doesn't seem to be needed! Only the include that provides the prototype doesn't need a comment. Also sort the includes alphabetically. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:05:44 +02:00
Alejandro Colomar	9c9e0561d0	ioctl_fslabel.2: Make clear why exactly is each header needed Only the include that provides the prototype doesn't need a comment. Also sort the includes alphabetically. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:05:23 +02:00
Michael Kerrisk	0b1dc48225	ioctl_fat.2: Minor tweaks to Alex Colomar's patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 13:05:23 +02:00
Alejandro Colomar	a71d209b9b	ioctl_fat.2: tfix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:58:29 +02:00
Alejandro Colomar	3a0989120d	ioctl_fat.2: Make clear why is each header exactly needed. Only the one that provides the prototype doesn't need a comment. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:57:23 +02:00
Alejandro Colomar	1852998a68	dup.2: SYNOPSIS: Use consistent comments through pages [mtk: Alex's change switches the comment to the more generally used form "Definition of..."] Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:53:41 +02:00
Alejandro Colomar	d2ac3dcfbe	fanotify_init.2: Add comment: why more than one include is needed Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:50:00 +02:00
Alejandro Colomar	0cb59ab63f	getpriority.2: Remove unused include <sys/time.h> is not needed to get the function declaration nor any constant used by the function. It was only needed (before POSIX.1) to get 'struct timeval', but that information would be more suited for system_data_types(7), and not for this page. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:49:18 +02:00
Alejandro Colomar	43be8898c2	delete_module.2: Add missing include Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:48:35 +02:00
Alejandro Colomar	1e145f150d	getrlimit.2, getrusage.2: Remove unused include <sys/time.h> is not required by any of the function declarations or macro definitions used by these functions. It may be (or maybe not) needed by some type inside the rlimit structure, but that info belongs in system_data_types(7), not here. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:47:01 +02:00
Alejandro Colomar	8c402eb013	add_key.2: Remove unused include <sys/types.h> was only needed for size_t, AFAIK. That is already (and more precisely) documented in system_data_types(7). Let's remove it here, as it's not really needed for calling add_key(). Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:45:48 +02:00
Alejandro Colomar	af2ea7fdbb	fcntl.2: Remove unused include I couldn't find a reason for including <unistd.h>. All the macros used by fcntl() are defined in <fcntl.h>. For comparison, FreeBSD and OpenBSD don't specify <unistd.h> in their manual pages. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:43:49 +02:00
Alejandro Colomar	f6ecadcba1	exit_group.2: Use 'noreturn' in prototypes This function never returns to its caller. Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:43:05 +02:00
Alejandro Colomar	c0e1178b14	futimesat.2: ffix Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-05 12:41:53 +02:00
Michael Kerrisk	8e2cab90e1	set_mempolicy.2: srcfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-04 22:07:31 +02:00
Huang Ying	5858ad9ca8	set_mempolicy.2: Add mode flag MPOL_F_NUMA_BALANCING In Linux kernel 5.12, a new mode flag, MPOL_F_NUMA_BALANCING, is added to set_mempolicy() to optimize the page placement among the NUMA nodes with the NUMA balancing mechanism even if the memory of the applications is bound with MPOL_BIND. This patch updates the man page for the new mode flag. Related kernel commits: bda420b985054a3badafef23807c4b4fa38a3dff [mtk: Minor fixes to commit message] Signed-off-by: "Huang, Ying" <ying.huang@intel.com> Cc: "Michael Kerrisk" <mtk.manpages@gmail.com> [ alx: srcfix ] Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-04 22:04:49 +02:00
Bruce Merry	285a7373e7	mmap.2: Clarify that MAP_POPULATE is best-effort As discussed on linux-mm (https://marc.info/?l=linux-mm&m=161528594100612&w=2), MAP_POPULATE can fail silently if the hugetlb cgroup settings allow huge page reservation but prevents huge pages being allocated. Closes https://bugzilla.kernel.org/show_bug.cgi?id=212153. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-04 22:03:30 +02:00
Michael Kerrisk	1f72eb7511	write.2: wfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-04-02 12:28:00 +02:00
Michael Kerrisk	46b470231f	listen.2: wfix Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-26 12:05:15 +01:00
Michael Kerrisk	737a840daa	_exit.2: Add a little more detail on the raw _exit() system cal Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-25 09:32:30 +01:00
Michael Kerrisk	b96ad91c7c	dup.2: Further clarify the effect of dup2() Add a sentence explaining what dup2() does in terms of file descriptors and open file descriptions. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-23 11:33:40 +01:00
Michael Kerrisk	8b339e35fa	dup.2: Clarify what silent closing means Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-23 11:27:39 +01:00
Michael Kerrisk	5c3611aa44	open.2: Make it clearer that an FD is an index into the process's FD table Sometimes people are confused, thinking a file descriptor is just a number. To help avoid such confusions, add text highlighting that a file descriptor is an index to an entry in the process's FD table. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-23 11:05:24 +01:00
Michael Kerrisk	f5270fe6f8	dup.2: Rewrite the description of dup() somewhat As can be seen by any number of StackOverflow questions, people persistently misunderstand what dup() does, and the existing manual page text, which talks of "copying" a file descriptor doesn't help. Rewrite the text a little to try to prevent some of these misunderstandings, in particular noting at the start that dup() allocates a new file descriptor. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-23 10:38:28 +01:00
Michael Kerrisk	1d767b552e	getent.1, ldd.1, locale.1, localedef.1, memusage.1, memusagestat.1, mtrace.1, _exit.2, _syscall.2, accept.2, access.2, acct.2, add_key.2, adjtimex.2, alloc_hugepages.2, arch_prctl.2, bdflush.2, bind.2, bpf.2, brk.2, cacheflush.2, capget.2, chdir.2, chmod.2, chown.2, chroot.2, clock_getres.2, clock_nanosleep.2, clone.2, close.2, close_range.2, connect.2, copy_file_range.2, create_module.2, delete_module.2, dup.2, epoll_create.2, epoll_ctl.2, epoll_wait.2, eventfd.2, execve.2, execveat.2, fanotify_init.2, fanotify_mark.2, fcntl.2, flock.2, fork.2, fsync.2, futex.2, get_kernel_syms.2, get_mempolicy.2, get_robust_list.2, getcpu.2, getdents.2, getdomainname.2, getgid.2, getgroups.2, gethostname.2, getitimer.2, getpagesize.2, getpeername.2, getpid.2, getpriority.2, getrandom.2, getresuid.2, getrlimit.2, getrusage.2, getsid.2, getsockname.2, getsockopt.2, gettid.2, gettimeofday.2, getuid.2, getunwind.2, getxattr.2, idle.2, init_module.2, inotify_add_watch.2, inotify_rm_watch.2, io_cancel.2, io_destroy.2, io_getevents.2, io_setup.2, io_submit.2, ioctl.2, ioctl_console.2, ioctl_fat.2, ioctl_ficlonerange.2, ioctl_fideduperange.2, ioctl_fslabel.2, ioctl_getfsmap.2, ioctl_ns.2, ioctl_tty.2, ioctl_userfaultfd.2, ioperm.2, iopl.2, ipc.2, kcmp.2, kexec_load.2, keyctl.2, kill.2, link.2, listen.2, listxattr.2, llseek.2, lookup_dcookie.2, lseek.2, madvise.2, mbind.2, membarrier.2, memfd_create.2, migrate_pages.2, mincore.2, mkdir.2, mknod.2, mlock.2, mmap.2, mmap2.2, modify_ldt.2, mount.2, move_pages.2, mprotect.2, mq_getsetattr.2, mremap.2, msgctl.2, msgget.2, msgop.2, msync.2, nanosleep.2, nfsservctl.2, nice.2, open.2, open_by_handle_at.2, openat2.2, pause.2, pciconfig_read.2, perf_event_open.2, perfmonctl.2, personality.2, pidfd_getfd.2, pidfd_open.2, pidfd_send_signal.2, pipe.2, pivot_root.2, pkey_alloc.2, poll.2, posix_fadvise.2, prctl.2, pread.2, process_vm_readv.2, ptrace.2, query_module.2, quotactl.2, read.2, readahead.2, readdir.2, readlink.2, readv.2, reboot.2, recv.2, remap_file_pages.2, removexattr.2, rename.2, request_key.2, restart_syscall.2, rmdir.2, rt_sigqueueinfo.2, s390_guarded_storage.2, s390_pci_mmio_write.2, s390_runtime_instr.2, s390_sthyi.2, sched_get_priority_max.2, sched_rr_get_interval.2, sched_setaffinity.2, sched_setattr.2, sched_setparam.2, sched_setscheduler.2, sched_yield.2, seccomp.2, select.2, select_tut.2, semctl.2, semget.2, semop.2, send.2, sendfile.2, set_thread_area.2, seteuid.2, setfsgid.2, setfsuid.2, setgid.2, setpgid.2, setresuid.2, setreuid.2, setsid.2, setuid.2, setup.2, setxattr.2, sgetmask.2, shmctl.2, shmget.2, shmop.2, shutdown.2, sigaction.2, sigaltstack.2, signal.2, signalfd.2, sigpending.2, sigprocmask.2, sigreturn.2, sigsuspend.2, sigwaitinfo.2, socket.2, socketcall.2, socketpair.2, splice.2, spu_create.2, spu_run.2, stat.2, statfs.2, statx.2, stime.2, subpage_prot.2, swapon.2, symlink.2, sync.2, sync_file_range.2, syscall.2, syscalls.2, sysctl.2, sysfs.2, sysinfo.2, syslog.2, time.2, timer_create.2, timer_delete.2, timer_getoverrun.2, timer_settime.2, timerfd_create.2, times.2, tkill.2, truncate.2, umask.2, umount.2, uname.2, unimplemented.2, unlink.2, unshare.2, uselib.2, userfaultfd.2, ustat.2, utime.2, utimensat.2, vfork.2, vhangup.2, vm86.2, vmsplice.2, wait.2, wait4.2, write.2, CPU_SET.3, __ppc_get_timebase.3, __ppc_set_ppr_med.3, __ppc_yield.3, __setfpucw.3, a64l.3, abort.3, abs.3, acos.3, acosh.3, addseverity.3, adjtime.3, aio_cancel.3, aio_error.3, aio_fsync.3, aio_read.3, aio_return.3, aio_suspend.3, aio_write.3, alloca.3, argz_add.3, asin.3, asinh.3, asprintf.3, assert.3, assert_perror.3, atan.3, atan2.3, atanh.3, atexit.3, atof.3, atoi.3, backtrace.3, basename.3, bcmp.3, bcopy.3, bindresvport.3, bsd_signal.3, bsearch.3, bstring.3, btowc.3, byteorder.3, bzero.3, cabs.3, cacos.3, cacosh.3, canonicalize_file_name.3, carg.3, casin.3, casinh.3, catan.3, catanh.3, catgets.3, catopen.3, cbrt.3, ccos.3, ccosh.3, ceil.3, cexp.3, cexp2.3, cfree.3, cimag.3, circleq.3, clearenv.3, clock.3, clock_getcpuclockid.3, clog.3, clog10.3, clog2.3, closedir.3, cmsg.3, confstr.3, conj.3, copysign.3, cos.3, cosh.3, cpow.3, cproj.3, creal.3, crypt.3, csin.3, csinh.3, csqrt.3, ctan.3, ctanh.3, ctermid.3, ctime.3, daemon.3, des_crypt.3, difftime.3, dirfd.3, div.3, dl_iterate_phdr.3, dladdr.3, dlerror.3, dlinfo.3, dlopen.3, dlsym.3, drand48.3, drand48_r.3, duplocale.3, dysize.3, ecvt.3, ecvt_r.3, encrypt.3, endian.3, envz_add.3, erf.3, erfc.3, err.3, errno.3, error.3, ether_aton.3, euidaccess.3, exec.3, exit.3, exp.3, exp10.3, exp2.3, expm1.3, fabs.3, fclose.3, fcloseall.3, fdim.3, fenv.3, ferror.3, fexecve.3, fflush.3, ffs.3, fgetc.3, fgetgrent.3, fgetpwent.3, fgetwc.3, fgetws.3, fileno.3, finite.3, flockfile.3, floor.3, fma.3, fmax.3, fmemopen.3, fmin.3, fmod.3, fmtmsg.3, fnmatch.3, fopen.3, fopencookie.3, fpathconf.3, fpclassify.3, fpurge.3, fputwc.3, fputws.3, fread.3, frexp.3, fseek.3, fseeko.3, ftime.3, ftok.3, fts.3, ftw.3, futimes.3, fwide.3, gamma.3, gcvt.3, get_nprocs_conf.3, get_phys_pages.3, getaddrinfo.3, getaddrinfo_a.3, getauxval.3, getcontext.3, getcwd.3, getdate.3, getdirentries.3, getdtablesize.3, getentropy.3, getenv.3, getfsent.3, getgrent.3, getgrent_r.3, getgrnam.3, getgrouplist.3, gethostbyname.3, gethostid.3, getifaddrs.3, getipnodebyname.3, getline.3, getloadavg.3, getlogin.3, getmntent.3, getnameinfo.3, getnetent.3, getnetent_r.3, getopt.3, getpass.3, getprotoent.3, getprotoent_r.3, getpt.3, getpw.3, getpwent.3, getpwent_r.3, getpwnam.3, getrpcent.3, getrpcent_r.3, getrpcport.3, gets.3, getservent.3, getservent_r.3, getspnam.3, getsubopt.3, getttyent.3, getumask.3, getusershell.3, getutent.3, getutmp.3, getw.3, getwchar.3, glob.3, gnu_get_libc_version.3, grantpt.3, group_member.3, gsignal.3, hsearch.3, hypot.3, iconv.3, iconv_close.3, iconv_open.3, if_nameindex.3, if_nametoindex.3, ilogb.3, index.3, inet.3, inet_net_pton.3, inet_ntop.3, inet_pton.3, initgroups.3, insque.3, isalpha.3, isatty.3, isfdtype.3, isgreater.3, iswalnum.3, iswalpha.3, iswblank.3, iswcntrl.3, iswctype.3, iswdigit.3, iswgraph.3, iswlower.3, iswprint.3, iswpunct.3, iswspace.3, iswupper.3, iswxdigit.3, j0.3, key_setsecret.3, killpg.3, ldexp.3, lgamma.3, lio_listio.3, list.3, localeconv.3, lockf.3, log.3, log10.3, log1p.3, log2.3, logb.3, login.3, lrint.3, lround.3, lsearch.3, lseek64.3, makecontext.3, makedev.3, mallinfo.3, malloc.3, malloc_get_state.3, malloc_hook.3, malloc_info.3, malloc_stats.3, malloc_trim.3, malloc_usable_size.3, mallopt.3, matherr.3, mblen.3, mbrlen.3, mbrtowc.3, mbsinit.3, mbsnrtowcs.3, mbsrtowcs.3, mbstowcs.3, mbtowc.3, mcheck.3, memccpy.3, memchr.3, memcmp.3, memcpy.3, memfrob.3, memmem.3, memmove.3, mempcpy.3, memset.3, mkdtemp.3, mkfifo.3, mkstemp.3, mktemp.3, modf.3, mpool.3, mq_close.3, mq_getattr.3, mq_notify.3, mq_open.3, mq_receive.3, mq_send.3, mq_unlink.3, mtrace.3, nan.3, newlocale.3, nextafter.3, nextup.3, nl_langinfo.3, ntp_gettime.3, on_exit.3, open_memstream.3, opendir.3, openpty.3, perror.3, popen.3, posix_fallocate.3, posix_madvise.3, posix_memalign.3, posix_openpt.3, posix_spawn.3, pow.3, pow10.3, printf.3, profil.3, psignal.3, pthread_attr_init.3, pthread_attr_setaffinity_np.3, pthread_attr_setdetachstate.3, pthread_attr_setguardsize.3, pthread_attr_setinheritsched.3, pthread_attr_setschedparam.3, pthread_attr_setschedpolicy.3, pthread_attr_setscope.3, pthread_attr_setsigmask_np.3, pthread_attr_setstack.3, pthread_attr_setstackaddr.3, pthread_attr_setstacksize.3, pthread_cancel.3, pthread_cleanup_push.3, pthread_cleanup_push_defer_np.3, pthread_create.3, pthread_detach.3, pthread_equal.3, pthread_exit.3, pthread_getattr_default_np.3, pthread_getattr_np.3, pthread_getcpuclockid.3, pthread_join.3, pthread_kill.3, pthread_kill_other_threads_np.3, pthread_mutex_consistent.3, pthread_mutexattr_getpshared.3, pthread_mutexattr_setrobust.3, pthread_rwlockattr_setkind_np.3, pthread_self.3, pthread_setaffinity_np.3, pthread_setcancelstate.3, pthread_setconcurrency.3, pthread_setname_np.3, pthread_setschedparam.3, pthread_setschedprio.3, pthread_sigmask.3, pthread_sigqueue.3, pthread_spin_init.3, pthread_spin_lock.3, pthread_testcancel.3, pthread_tryjoin_np.3, pthread_yield.3, ptsname.3, putenv.3, putgrent.3, putpwent.3, puts.3, putwchar.3, qecvt.3, qsort.3, raise.3, rand.3, random.3, random_r.3, rcmd.3, re_comp.3, readdir.3, readdir_r.3, realpath.3, regex.3, remainder.3, remove.3, remquo.3, resolver.3, rewinddir.3, rexec.3, rint.3, round.3, rpc.3, rpmatch.3, rtime.3, rtnetlink.3, scalb.3, scalbln.3, scandir.3, scanf.3, sched_getcpu.3, seekdir.3, sem_close.3, sem_destroy.3, sem_getvalue.3, sem_init.3, sem_open.3, sem_post.3, sem_unlink.3, sem_wait.3, setaliasent.3, setbuf.3, setenv.3, setjmp.3, setlocale.3, setlogmask.3, setnetgrent.3, shm_open.3, siginterrupt.3, signbit.3, significand.3, sigpause.3, sigqueue.3, sigset.3, sigsetops.3, sigvec.3, sigwait.3, sin.3, sincos.3, sinh.3, sleep.3, slist.3, sockatmark.3, sqrt.3, stailq.3, statvfs.3, stdarg.3, stdio.3, stdio_ext.3, stpcpy.3, stpncpy.3, strcasecmp.3, strcat.3, strchr.3, strcmp.3, strcoll.3, strcpy.3, strdup.3, strerror.3, strfmon.3, strfromd.3, strfry.3, strftime.3, string.3, strlen.3, strnlen.3, strpbrk.3, strptime.3, strsep.3, strsignal.3, strspn.3, strstr.3, strtod.3, strtoimax.3, strtok.3, strtol.3, strtoul.3, strverscmp.3, strxfrm.3, swab.3, sysconf.3, syslog.3, system.3, sysv_signal.3, tailq.3, tan.3, tanh.3, tcgetpgrp.3, tcgetsid.3, telldir.3, tempnam.3, termios.3, tgamma.3, timegm.3, timeradd.3, tmpfile.3, tmpnam.3, toascii.3, toupper.3, towctrans.3, towlower.3, towupper.3, trunc.3, tsearch.3, ttyname.3, ttyslot.3, tzset.3, ualarm.3, ulimit.3, undocumented.3, ungetwc.3, unlocked_stdio.3, unlockpt.3, updwtmp.3, uselocale.3, usleep.3, wcpcpy.3, wcpncpy.3, wcrtomb.3, wcscasecmp.3, wcscat.3, wcschr.3, wcscmp.3, wcscpy.3, wcscspn.3, wcsdup.3, wcslen.3, wcsncasecmp.3, wcsncat.3, wcsncmp.3, wcsncpy.3, wcsnlen.3, wcsnrtombs.3, wcspbrk.3, wcsrchr.3, wcsrtombs.3, wcsspn.3, wcsstr.3, wcstoimax.3, wcstok.3, wcstombs.3, wcswidth.3, wctob.3, wctomb.3, wctrans.3, wctype.3, wcwidth.3, wmemchr.3, wmemcmp.3, wmemcpy.3, wmemmove.3, wmemset.3, wordexp.3, wprintf.3, xcrypt.3, xdr.3, y0.3, cciss.4, console_codes.4, dsp56k.4, hpsa.4, initrd.4, loop.4, lp.4, msr.4, random.4, rtc.4, smartpqi.4, veth.4, wavelan.4, acct.5, core.5, elf.5, hosts.5, locale.5, proc.5, resolv.conf.5, rpc.5, slabinfo.5, sysfs.5, tmpfs.5, utmp.5, address_families.7, aio.7, attributes.7, bootparam.7, capabilities.7, cgroups.7, complex.7, ddp.7, environ.7, epoll.7, fanotify.7, feature_test_macros.7, hier.7, inode.7, inotify.7, ip.7, ipv6.7, keyrings.7, locale.7, man-pages.7, man.7, math_error.7, mount_namespaces.7, namespaces.7, netdevice.7, netlink.7, numa.7, packet.7, pkeys.7, pthreads.7, queue.7, raw.7, rtnetlink.7, sched.7, session-keyring.7, shm_overview.7, sigevent.7, signal-safety.7, signal.7, sock_diag.7, socket.7, spufs.7, symlink.7, system_data_types.7, tcp.7, time_namespaces.7, udp.7, udplite.7, unicode.7, unix.7, uri.7, user_namespaces.7, vdso.7, vsock.7, x25.7, iconvconfig.8, ld.so.8, ldconfig.8, sln.8, tzselect.8: tstamp Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-22 00:15:34 +01:00
Michael Kerrisk	e3e22b2b2b	close_range.2: Correct the explanation of the EMFILE error close_range() CLOSE_RANGE_USHARE triggers a call to dup_fd() which in turn calls alloc_fdtable(), which checks that sysctl_nr_open has not been exceeded. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-21 16:37:17 +01:00
Michael Kerrisk	368ace8467	close_range.2: Minor wording fix Reported-by: Christian Brauner <christian.brauner@ubuntu.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-21 16:37:17 +01:00
Michael Kerrisk	336bd62ba2	close_range.2: Include a better example program The current example program can't really be used to demonstrate the effect of close_range(). Replace it by a program that does show the effect of this system call. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-21 16:37:17 +01:00
Michael Kerrisk	3bb4fe47a5	close.2: SEE ALSO: add close_range(2) Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-21 16:37:17 +01:00
Michael Kerrisk	8a7d961f0c	close_range.2: Minor tweaks to Stephen Kitt's patch Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-21 16:37:17 +01:00
Stephen Kitt	bd704558d9	close_range.2: New page documenting close_range(2) This documents close_range(2) based on information in 278a5fbaed89dacd04e9d052f4594ffd0e0585de, 60997c3d45d9a67daf01c56d805ae4fec37e0bd8, and 582f1fb6b721facf04848d2ca57f34468da1813e. Reported-by: Christian Brauner <christian.brauner@ubuntu.com> Signed-off-by: Stephen Kitt <steve@sk2.org> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-21 16:37:17 +01:00
Alejandro Colomar	b0b19983d9	Various pages: Remove unused <sys/types.h> The manual pages are already inconsistent in which headers need to be included. Right now, not all of the types used by a function have their required header included in the SYNOPSIS. If we were to add the headers required by all of the types used by functions, the SYNOPSIS would grow too much. Not only it would grow too much, but the information there would be less precise. Having system_data_types(7) document each type with all the information about required includes is much more precise, and the info is centralized so that it's much easier to maintain. So let's document only the include required for the function prototype, and also the ones required for the macros needed to call the function. <sys/types.h> only defines types, not functions or constants, so it doesn't belong to man[23] (function) pages at all. I ignore if some old systems had headers that required you to include <sys/types.h> before them (incomplete headers), but if so, those implementations would be broken, and those headers should probably provide some kind of warning. I hope this is not the case. [mtk: Already in 2001, POSIX.1 removed the requirement to include <sys/types.h> for many APIs, so this patch seems well past due.] Acked-by: Zack Weinberg <zackw@panix.com> Signed-off-by: Alejandro Colomar <alx.manpages@gmail.com> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-15 08:43:12 +01:00
Jens Axboe	c245512ad0	man2/openat2.2: Add RESOLVE_CACHED RESOLVE_CACHED allows an application to attempt a cache-only open of a file. If this isn't possible, the request will fail with -1/EAGAIN and the caller should retry without RESOLVE_CACHED set. This will generally happen from a different context, where a slower open operation can be performed. Signed-off-by: Jens Axboe <axboe@kernel.dk> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-03 23:06:32 +01:00
Michael Kerrisk	b007f233f5	kcmp.2: Since Linux 5.12, kcmp() availability is unconditional kcmp() is no longer dependent on CONFIG_CHECKPOINT_RESTORE. Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-03-03 23:06:16 +01:00
edef	04fd7f3121	futex.2: tfix Signed-off-by: edef <edef@edef.eu> Signed-off-by: Michael Kerrisk <mtk.manpages@gmail.com>	2021-02-20 22:46:59 +01:00

... 2 3 4 5 6 ...

9334 Commits