freebsd

mirror of https://git.FreeBSD.org/src.git synced 2024-12-24 11:29:10 +00:00

Author	SHA1	Message	Date
Marcel Moolenaar	ca125f9c17	Fix a (new) source of instability: When interrupting a kernel context, we don't need to switch stacks (memory nor register). As such, we were also not restoring the register stack pointer (ar.bspstore). This, however, fails to be valid in 1 situation: when we interrupt a register stack switch as is being done in restorectx(). The problem is that restorectx() needs to have ar.bsp == ar.bspstore before it can assign the new value to ar.bspstore. This is achieved by doing a loadrs prior to assigning to ar.bspstore. If we take an interrupt in between the loadrs and the assignment and we don't make sure we restore the ar.bspstore prior to returning from the interrupt, we switch stacks with possibly non-zero dirty registers, which means that the new frame pointer (ar.bsp) will be invalid. So, instead of jumping over the restoration of the register frame pointer and related registers, we conditionalize it based on whether we return to kernel context or user context. A future performance tweak is possible by only restoring ar.bspstore when returning to kernel mode and when the RSE is in enforced lazy mode. One cannot assume ar.bsp == ar.bspstore if the RSE is not in enforced lazy mode anyway. While here (well, not quite) don't unconditionally assign to ar.bspstore in exception_save. Only do that when we actually switch stacks. It can only harm us to do it unconditionally. Approved by: re@ (blanket)	2003-05-23 23:55:31 +00:00
Marcel Moolenaar	42b919d4a6	In swapctx(), put the RSE in enforced lazy mode before we flush the register stack. There's nothing really wrong with flushing before putting the RSE in enforced lazy mode, provided you don't depend on ar.bspstore being equal to ar.bsp when the RSE has been put in enforced lazy more. The small window between the flush and setting the RSE may be sufficient to have the RSE eagerly increase the dirty region (and hence cause ar.bspstore != ar.bsp) or have an interrupt that may even get the laziest RSE to do something. Anyway: we don't depend on ar.bspstore being equal to ar.bsp, so nothing was and is broken. But the code was non-intuitive and easily confuses. This is a source of future bugs. Note: the advantage of not depending on ar.bspstore is that there's some recilience against an interrupted flushrs. Clobbering is limited to stacked register contents only, not to RSE address clobbering. Approved: re@ (blanket)	2003-05-23 23:16:43 +00:00
Marcel Moolenaar	bfaccb767c	o Fix a definite bogon: the dirty bity fault, instruction access failt and data access fault install the PTE in question into the VHPT table. However, a post-increment was missing and we wrote the raw PTE data into the pagesize/access key field. This leaves a corrupt VHPT entry. o While here, remove the explicit cache purge. Insertion into the translation implicitly purges any overlapping entries. o Make sure there's a cycle break between the itc and the rfi. o Whitespace fixes.	2003-05-20 06:57:20 +00:00
Marcel Moolenaar	14d2ae56c7	Rename the "IA64 ITC" counter to "ITC" counter. We don't call the "TSC" counter on i386 "I386 TSC". Approved by: re@ (blanket)	2003-05-20 06:51:20 +00:00
Marcel Moolenaar	9b9ce577d4	Prevent corruption of the VHPT collision chain by protecting it with a mutex. The only volatile chain operations are insertion and deletion but since updating an existing PTE also updates the VHPT entry itself, and we have the VHPT mutex in both other cases, we also lock when we update an existing PTE even though no chain operation is involved. Note that we perform the insertion and deletion careful enough that we don't need to lock traversals. If we need to lock traversals, we also need to lock from the exception handler, which we can't without creating a trapframe. We're now able to withstand a -j8 buildworld. More work is needed to withstand Murphy fields. In other words: we still have a bogon... Approved by: re@ (blanket)	2003-05-20 02:52:41 +00:00
Alexander Kabaev	980ded9a7d	sys/sys/limits.h: - Fix visibilty test for LONG_BIT and WORD_BIT. `#if defined(__FOO_VISIBLE)' is alays wrong because __FOO_VISIBLE is always defined (to 0 for invisibility). sys/<arch>/include/limits.h sys/<arch>/include/_limits.h: - Style fixes. Submitted by: bde Reviewed by: bsdmike Approved by: re (scottl)	2003-05-19 20:29:07 +00:00
Marcel Moolenaar	b8c4149cff	Turn pmap_install_pte() into a critical section. We better not get interrupted while writing into the VHPT table. While here, make sure memory accesses a properly ordered. Tag invalidation must happen first so that the hardware VHPT walker will not be able to match this entry while we're updating it and we have to make sure the new new tag gets written only after the PTE is completely updated. Approved by: re (blanket)	2003-05-19 08:02:36 +00:00
Marcel Moolenaar	a75b99ea2d	Unconditionally set pcb_current_pmap. WIP versions of the code previously committed cleared pcb_current_pmap prior to changing the region registers, but that was removed before committing. Since we don't normally (at all?) pass a NULL pointer, the bug was mostly harmless. Fix it while I'm here... I'm here because we need to have data serialization after writing to the region registers. Not doing so was likely the cause of the hangs we were experiencing. General exceptions in cpu_switch may also be caused by the lack of serialization. Approved by: re (blanket)	2003-05-19 06:05:30 +00:00
Marcel Moolenaar	dc0bde0f18	pmap_install() needs to be atomic WRT to context switching. Protect switching user regions (region 0-4) with schedlock. Avoid unnecessary recursion on schedlock by moving the core functionality to another function (pmap_switch()) where we assert schedlock is held. Turn pmap_install() into a wrapper that grabs schedlock. This minimizes the number of callsites that need to be changed. Since we already have schedlock in cpu_switch() and cpu_throw(), have them call pmap_switch() directly. These were also the only two calls to pmap_install() outside pmap.c, so make pmap_install() static and remove its prototype from pmap.h Approved by: re (blanket)	2003-05-19 04:16:30 +00:00
Marcel Moolenaar	040c5b92bb	Remove unused files. cpu_switch() and cpu_throw(), normally in swtch.s, can be found in machdep.c. Approved: re@	2003-05-17 04:55:04 +00:00
Marcel Moolenaar	f2c49dd248	Revamp of the syscall path, exception and context handling. The prime objectives are: o Implement a syscall path based on the epc inststruction (see sys/ia64/ia64/syscall.s). o Revisit the places were we need to save and restore registers and define those contexts in terms of the register sets (see sys/ia64/include/_regset.h). Secundairy objectives: o Remove the requirement to use contigmalloc for kernel stacks. o Better handling of the high FP registers for SMP systems. o Switch to the new cpu_switch() and cpu_throw() semantics. o Add a good unwinder to reconstruct contexts for the rare cases we need to (see sys/contrib/ia64/libuwx) Many files are affected by this change. Functionally it boils down to: o The EPC syscall doesn't preserve registers it does not need to preserve and places the arguments differently on the stack. This affects libc and truss. o The address of the kernel page directory (kptdir) had to be unstaticized for use by the nested TLB fault handler. The name has been changed to ia64_kptdir to avoid conflicts. The renaming affects libkvm. o The trapframe only contains the special registers and the scratch registers. For syscalls using the EPC syscall path no scratch registers are saved. This affects all places where the trapframe is accessed. Most notably the unaligned access handler, the signal delivery code and the debugger. o Context switching only partly saves the special registers and the preserved registers. This affects cpu_switch() and triggered the move to the new semantics, which additionally affects cpu_throw(). o The high FP registers are either in the PCB or on some CPU. context switching for them is done lazily. This affects trap(). o The mcontext has room for all registers, but not all of them have to be defined in all cases. This mostly affects signal delivery code now. The *context syscalls are as of yet still unimplemented. Many details went into the removal of the requirement to use contigmalloc for kernel stacks. The details are mostly CPU specific and limited to exception_save() and exception_restore(). The few places where we create, destroy or switch stacks were mostly simplified by not having to construct physical addresses and additionally saving the virtual addresses for later use. Besides more efficient context saving and restoring, which of course yields a noticable speedup, this also fixes the dreaded SMP bootup problem as a side-effect. The details of which are still not fully understood. This change includes all the necessary backward compatibility code to have it handle older userland binaries that use the break instruction for syscalls. Support for break-based syscalls has been pessimized in favor of a clean implementation. Due to the overall better performance of the kernel, this will still be notived as an improvement if it's noticed at all. Approved by: re@ (jhb)	2003-05-16 21:26:42 +00:00
Marcel Moolenaar	baf74b8876	o In pmap_install, don't prevent switching the pmap if we're switching to kernel_pmap. The pmap is not special enough. o Clear the active bit on the pmap we're switching out. o Fix some nearby style(9) bugs. Approved by: re@	2003-05-16 07:57:44 +00:00
Marcel Moolenaar	906f065725	Indent a comment. This makes 1.100. Still approved by: re@ (blanket)	2003-05-16 07:05:08 +00:00
Marcel Moolenaar	164d4986fd	Turn pmap_growkernel() into a critical section. While here, initialize kernel_vm_end in pmap_bootstrap. Don't delay the initialization until we need to grow the kernel VM space. This BTW happens twice before we enter either single- or multi-user mode. Don't adjust kernel_vm_end while growing based on whether the KPT contains a non-NULL entry. We trust kernel_vm_end to be correct and we make sure it's still correct after growing. Define virtual_avail and virtual_end in terms of VM_MIN_KERNEL_ADDRESS and VM_MAX_KERNEL_ADDRESS (resp). Don't hardcode region knowledge.	2003-05-16 07:03:15 +00:00
Marcel Moolenaar	8cc31ae5be	Revamp the RID allocation code: o Limit the size of the region ID map to 64KB. This gives a bitmap that is large enough to keep track of 2^19 numbers. The minimal map size is 32KB. The reason we limit the map size is that processor models may have implemented a 24-bit region ID, which would give a 2MB bitmap while the maximum number of allocations is always less than PID_MAX*5, which is less than 2^19. o Allocate all region IDs up-front. The slight downside of reserving more RIDs then a process needs (3 for ia64 native and 1 for ia32) is preferable over the call to pmap_ensure_rid() where RIDs are allocated on demand. On SMP systems this may lead to a race condition. o When allocating a region ID, don't use arc4random(). We're not interested in randomness or uniform distribution across the spectrum. We only need uniqueness. Random numbers may easily collide when the number of allocated RIDs is high, creating a possibly unbounded retry rate.	2003-05-16 06:40:40 +00:00
Marcel Moolenaar	75189cff08	Move the conditional definition of KSTACK_MAX_PAGES up ahead where it's more visible. Approved by: re@ (blanket)	2003-05-16 06:17:34 +00:00
Marcel Moolenaar	794518cd6d	This file creates register sets based on the runtime specification. The advantage of using register sets is that you don't focus on each register seperately, but instead instroduce a level of abstraction. This reduces the chance of errors, and also simplifies the code. The register sers form the basis of everything register. The sets in this file are: struct _special contains all of the control related registers, such as instruction pointer and stack pointer. It also contains interrupt specific registers like the faulting address. The set is roughly split in 3 groups. The first contains the registers that define a context or thread. This is the only group that the kernel needs to switch threads. The second group contains registers needed in addition to the first group needed to switch userland threads. This group contains the thread pointer and the FP control register. The third group contains those registers we need for execption handling and are used on top of the first two groups. struct _callee_saved, struct _callee_saved_fp These sets contain the preserved registers, including the NaT after spilling. The general registers (including branch registers) are seperated from the FP registers for ptrace(2). struct _caller_saved, struct _caller_saved_fp These sets contain the scratch registers based on SDM 2.1, This means that both ar.csd and ar.ccd are included here, even though they contain ia32 segment register descriptions. We keep seperate NaT bits for scratch and preserved registers, because they are never saved/restored at the same time. struct _high_fp The upper 96 FP registers that can be enabled/disabled seperately on the CPU from the lower 32 FP registers. Due to the size of this set, we treat them specially, even though they are defined as scratch registers. CVS ----------------------------------------------------------------------	2003-05-15 08:36:03 +00:00
Marcel Moolenaar	4bae872201	This file contains elementary context related functions used to save and restore "sets" of registers in various places. The restorectx and swapctx functions are used by cpu_switch() and deal with the special registers, as well as the preserved registers. The callee_saved functions are used to save and restore the preserved registers (integer and floating-point). They are useful for signal delivery and ptrace support. The save_high_fp and restore_high_fp functions are used to "load" and "unload" to and from the CPU as part of lazy context switching. The ia32 specific context functions have been kept with the ia32 code. Approved by: re@ (blanket)	2003-05-15 08:08:32 +00:00
Marcel Moolenaar	1d67adffd6	This file contains the code that implements the syscall path based on the epc instruction. The epc instruction, given the permissions of the page in which the epc is located, allows the privilege level to be increased with little or no overhead. The previous privilege level is recorded in the current frame marker and is restored by a regular (function) return. Since the epc instruction has to live in a page with non-standard properties, we hardwire a "gateway" page in the address space. The address of the gateway page is exported to userland in ar.k7. This allows us to rewire the page without breaking the ABI. The syscall stubs in libc are regular function calls that slightly differ from the normal runtime. The difference is mostly to simplify the stubs themselves by by moving some of the logic to the kernel. The libc stubs call into the gateway page (offset 0), from where the kernel trampolines to the code that sets up a minimal trapframe and arranges to execute from the kernel stack. The way back is basicly the same. The kernel returns to the gateway page, whereby privilege is dropped, and jumps back to the syscall stub. Only the special registers are saved in the trapframe. None of the scratch registers are preserved and since the kernel follows the same runtime model, none of the preserved registers are saved. Future enhancements can include the implementation of lightweight syscalls, where kernel functions are performed without setting up a trapframe. Good candidates are the *context syscalls for example. Now that there's a gateway page from which code can be executed in a non-privileged context, we also have the ideal place to put the signal trampolines. By moving the signal trampolines from the user stack to the gateway page, we open up the doors to unexecutable stacks. The gateway page contains signal trampolines for both the "legacy" break-based syscall code and the new and improved epc- based syscall code. Approved: re@ (blanket)	2003-05-15 07:51:22 +00:00
John Baldwin	90af4afacb	- Merge struct procsig with struct sigacts. - Move struct sigacts out of the u-area and malloc() it using the M_SUBPROC malloc bucket. - Add a small sigacts_*() API for managing sigacts structures: sigacts_alloc(), sigacts_free(), sigacts_copy(), sigacts_share(), and sigacts_shared(). - Remove the p_sigignore, p_sigacts, and p_sigcatch macros. - Add a mutex to struct sigacts that protects all the members of the struct. - Add sigacts locking. - Remove Giant from nosys(), kill(), killpg(), and kern_sigaction() now that sigacts is locked. - Several in-kernel functions such as psignal(), tdsignal(), trapsignal(), and thread_stopped() are now MP safe. Reviewed by: arch@ Approved by: re (rwatson)	2003-05-13 20:36:02 +00:00
Alexander Kabaev	0eda4c08a5	Style fixes. Remove DBL_DIG, DBL_MIN, DBL_MAX and their FLT_ counterparts, they were marked for deprecation ever since SUSv1 at least. Only define ULLONG_MIN/MAX and LLONG_MAX if long long type is supported. Restore a lost comment in MI _limits.h file and remove it from sys/limits.h where it does not belong.	2003-05-04 22:13:04 +00:00
Marcel Moolenaar	e126260569	Fix c99 victim: the accepted character '0 most now be types as '0'.	2003-05-03 23:05:16 +00:00
Marcel Moolenaar	aea4a02702	Option KADB does not exist. It came from alpha, where it still exists.	2003-05-02 20:34:15 +00:00
Marcel Moolenaar	367165975d	Kill MID_MACHINE, its a.out specific, the only platform that supports it is i386. All of the other platforms should remove it too. -- peter@	2003-04-30 23:16:33 +00:00
John Baldwin	d90e753aa8	Range check the syscall number before looking it up in the syscallnames[] array. Submitted by: pho	2003-04-30 17:59:27 +00:00
Alexander Kabaev	104a9b7e3e	Deprecate machine/limits.h in favor of new sys/limits.h. Change all in-tree consumers to include <sys/limits.h> Discussed on: standards@ Partially submitted by: Craig Rodrigues <rodrigc@attbi.com>	2003-04-29 13:36:06 +00:00
Marcel Moolenaar	c283dd9dad	Revamp the newbus functions: o do not use the in* and out* functions. These functions are used by legacy drivers and thus must have ia32 compatible behaviour. Hence, they need to have fences. Using these functions for newbus would then pessimize performance. o remove the conditional compilation of PIO and/or MEMIO support. It's a PITA without having any significant benefit. We always support them both. Since there are no I/O ports on ia64 (they are simulated by the chipset by translating memory mapped I/O to predefined uncacheable memory regions) the only difference between PIO and MEMIO is in the address calculation. There should be enough ILP that can be exploited here that making these computations compile-time conditional is not worth it. We now also don't use the read* and write* functions. o Add the missing *_8 variants. They were missing, although not missed. It's for completeness. o Do not add the fences that were present in the low-level support functions here. We're using uncacheable memory, which means that accesses are in program order. Change the barrier implementation to not only do a memory fence, but also an acceptance fence. This should more reliably synchronize drivers with the hardware. The memory fence enforces ordering, but does not imply visibility (ie the access does not necessarily have happened). This is what the acceptance deals with. cpufunc.h cleanup: o Remove the low-level memory mapped I/O support functions. They are not used. Keep the low-level I/O port access functions for legacy drivers and add fences to ensure ia32 compatibility. o Remove the syscons specific functions now that we have moved the proper definitions where they belong. o Replace the ia64_port_address() and ia64_memory_address() functions with macros. There's a bigger change inline functions get inlined when there aren't function callsi and the calculations are simply enough to do it with macros. Replace the one reference to ia64_memory address in mp_machdep.c to use the macro.	2003-04-29 09:50:03 +00:00
John Baldwin	7ff022c485	- Push down Giant into the sysarch() calls that still need Giant. - Standardize on EINVAL rather than EOPNOTSUPP if the sysarch op value is invalid.	2003-04-25 20:04:02 +00:00
John Baldwin	d8ca78d02f	Regen.	2003-04-25 15:59:44 +00:00
John Baldwin	9fb3809a3a	Oops, the thr_* and jail_attach() syscall entries should be NOPROTO rather than STD.	2003-04-25 15:59:18 +00:00
Daniel Eischen	1328e1c4be	Add an argument to get_mcontext() which specified whether the syscall return values should be cleared. The system calls getcontext() and swapcontext() want to return 0 on success but these contexts can be switched to at a later time so the return values need to be cleared in the saved register sets. Other callers of get_mcontext() would normally want the context without clearing the return values. Remove the i386-specific context saving from the KSE code. get_mcontext() is not i386-specific any more. Fix a bad pointer in the alpha get_mcontext() code. The context was being bcopy()'d from &td->tf_frame, but tf_frame is itself a pointer, so the thread was being copied instead. Spotted by jake. Glanced at by: jake Reviewed by: bde (months ago)	2003-04-25 01:50:30 +00:00
John Baldwin	9bc65d35f2	Regen.	2003-04-24 20:50:57 +00:00
John Baldwin	d46b3412dc	Fix the thr_create() entry by adding a trailing \. Also, sync up the MP safe flag for thr_* with the main table.	2003-04-24 20:49:46 +00:00
Alexander Kabaev	6fd839f9c7	Add a new sys/limits.h file which in turn depends on machine/_limits.h to get actual constant values. This is in preparation for machine/limits.h retirement. Discussed on: standards@ Submitted by: Craig Rodrigues <rodrigc@attbi.com> (*) Modified by: kan	2003-04-23 21:41:59 +00:00
John Baldwin	fe8cdcae87	- Replace inline implementations of sigprocmask() with calls to kern_sigprocmask() in the various binary compatibility emulators. - Replace calls to sigsuspend(), sigaltstack(), sigaction(), and sigprocmask() that used the stackgap with calls to the corresponding kern_sig*() functions instead without using the stackgap.	2003-04-22 18:23:49 +00:00
David Xu	5b70587b8a	Remove single threading detecting code, these code really should be replaced by thread_user_enter(), but current we don't want to enable this in trap.	2003-04-22 03:17:41 +00:00
Marcel Moolenaar	148eac48f1	Don't use the tpa instruction to implement pmap_kextract. The tpa instruction requires that a translation is present in the TC. This may trigger a TLB miss and a subsequent call to vm_fault(). This implementation is deliberately non-inline for debugging and profiling purposes. Partial or full inlining should eventually be done. Valuable insights by: jake	2003-04-22 01:48:43 +00:00
Hidetoshi Shimokawa	092cd06fcd	Add FireWire drivers to GENERIC.	2003-04-21 16:44:05 +00:00
John Baldwin	889a6b5845	Use the proc lock to protect p_singlethread and a P_WEXIT test. This fixes a couple of potential KSE panics on non-i386 arch's that weren't holding the proc lock when calling thread_exit().	2003-04-18 20:20:00 +00:00
Marcel Moolenaar	146324b0d2	Add the EHCI host controller.	2003-04-16 01:29:08 +00:00
Maxime Henrion	7a648f56cf	I deserve a big pointy hat for having missed all those references to bus_dmasync_op_t in my last commit.	2003-04-10 23:50:06 +00:00
Maxime Henrion	141bacb048	Change the operation parameter of bus_dmamap_sync() from an enum to an int and redefine the BUS_DMASYNC_* constants as flags. This allows us to specify several operations in one call to bus_dmamap_sync() as in NetBSD.	2003-04-10 23:03:33 +00:00
Mike Barcroft	fd7a8150fb	o In struct prison, add an allprison linked list of prisons (protected by allprison_mtx), a unique prison/jail identifier field, two path fields (pr_path for reporting and pr_root vnode instance) to store the chroot() point of each jail. o Add jail_attach(2) to allow a process to bind to an existing jail. o Add change_root() to perform the chroot operation on a specified vnode. o Generalize change_dir() to accept a vnode, and move namei() calls to callers of change_dir(). o Add a new sysctl (security.jail.list) which is a group of struct xprison instances that represent a snapshot of active jails. Reviewed by: rwatson, tjr	2003-04-09 02:55:18 +00:00
Dag-Erling Smørgrav	fe58453891	Introduce an M_ASSERTPKTHDR() macro which performs the very common task of asserting that an mbuf has a packet header. Use it instead of hand- rolled versions wherever applicable. Submitted by: Hiten Pandya <hiten@unixdaemons.com>	2003-04-08 14:25:47 +00:00
Marcel Moolenaar	302d51fcb2	Remove COMPAT_FREEBSD4. It's impossible because FreeBSD 4 does not run on ia64 at all.	2003-04-08 08:32:00 +00:00
Marcel Moolenaar	ca13bfade5	Remove the 32KB VHPT section from the kernel image. We don't really use it because we allocate a VHPT based on the size of the physical memory and even if the allocated VHPT is 32KB, we don't use the in- image section for it. Since the VHPT must be naturally aligned, we save 48K on average (due to alignment). Consequently, we start off with the VHPT disabled (it is assumed the VHPT is disabled because the EFI loader runs without memory address translation and thus has no need to setup the VHPT). It's probably a good idea to explicitly disable the VHPT if we make the use of the VHPT optional.	2003-04-06 21:31:26 +00:00
Marcel Moolenaar	9fac9065b5	Also set the access bit in the PTE when we get a data dirty bit fault. This avoids an immediate access bit fault when we serviced the dirty bit fault in case the access bit is unset. This typically happens for newly allocated memory that's being zeroed and thus very common.	2003-04-06 05:55:36 +00:00
Marcel Moolenaar	2206cb596f	Include <geom/geom_disk.h> and stop including <sys/disk.h>. The former gives us 'struct disk'.	2003-04-05 21:14:05 +00:00
Dag-Erling Smørgrav	9f45b2da8f	Define ovbcopy() as a macro which expands to the equivalent bcopy() call, to take care of the KAME IPv6 code which needs ovbcopy() because NetBSD's bcopy() doesn't handle overlap like ours. Remove all implementations of ovbcopy(). Previously, bzero was a function pointer on i386, to save a jmp to bzero_vector. Get rid of this microoptimization as it only confuses things, adds machine-dependent code to an MD header, and doesn't really save all that much. This commit does not add my pagezero() / pagecopy() code.	2003-04-04 17:29:55 +00:00
Poul-Henning Kamp	891619a66d	Use bioq_flush() to drain a bio queue with a specific error code. Retain the mistake of not updating the devstat API for now. Spell bioq_disksort() consistently with the remaining bioq_*(). #include <geom/geom_disk.h> where this is more appropriate.	2003-04-01 15:06:26 +00:00
Jeff Roberson	a0704f9de9	- Add thr and umtx system calls.	2003-04-01 01:15:56 +00:00
Jeff Roberson	b8db34d280	- Define a new md function 'casuptr'. This atomically compares and sets a pointer that is in user space. It will be used as the basic primitive for a kernel supported user space lock implementation. - Implement this function in x86's support.s - Provide stubs that return -1 in all other architectures. Implementations will follow along shortly. Reviewed by: jake	2003-04-01 00:18:55 +00:00
Jeff Roberson	a9b34138dc	- Add a placeholder for sigwait	2003-03-31 23:36:40 +00:00
Jeff Roberson	4093529dee	- Move p->p_sigmask to td->td_sigmask. Signal masks will be per thread with a follow on commit to kern_sig.c - signotify() now operates on a thread since unmasked pending signals are stored in the thread. - PS_NEEDSIGCHK moves to TDF_NEEDSIGCHK.	2003-03-31 22:49:17 +00:00
Jeff Roberson	1bf4700bff	- Change trapsignal() to accept a thread and not a proc. - Change all consumers to pass in a thread. Right now this does not cause any functional changes but it will be important later when signals can be delivered to specific threads.	2003-03-31 22:02:38 +00:00
Jeff Roberson	772e5d8d88	- Use sigexit() instead of twiddling the signal mask, catch, ignore, and action bits to allow SIGILL to work as expected. This brings this file in line with other architectures.	2003-03-31 21:40:47 +00:00
David Schultz	8ee63f6eae	Correct LDBL_* constants based on values from i386.	2003-03-27 20:38:22 +00:00
Jake Burkholder	227f9a1c58	- Add vm_paddr_t, a physical address type. This is required for systems where physical addresses larger than virtual addresses, such as i386s with PAE. - Use this to represent physical addresses in the MI vm system and in the i386 pmap code. This also changes the paddr parameter to d_mmap_t. - Fix printf formats to handle physical addresses >4G in the i386 memory detection code, and due to kvtop returning vm_paddr_t instead of u_long. Note that this is a name change only; vm_paddr_t is still the same as vm_offset_t on all currently supported platforms. Sponsored by: DARPA, Network Associates Laboratories Discussed with: re, phk (cdevsw change)	2003-03-25 00:07:06 +00:00
Ruslan Ermilov	ab0f83bd03	Remove bitrot associated with `maxusers'. Submitted by: bde	2003-03-22 14:18:23 +00:00
Maxime Henrion	fd1b2ab0c9	Use atomic operations to increment and decrement the refcount in busdma tags. There are currently no tags shared accross different drivers so this isn't needed at the moment, but it will be required when we'll have a proper newbus method to get the parent busdma tag.	2003-03-20 19:45:26 +00:00
Jake Burkholder	5501d40bb9	Made the prototypes for pmap_kenter and pmap_kremove MD. These functions are machine dependent because they are not required to update the tlb when mappings are added or removed, and doing so is machine dependent. In addition, an implementation may require that pages mapped with pmap_kenter have a backing vm_page_t, which is not necessarily true of all physical pages, and so may choose to pass the vm_page_t to pmap_kenter instead of the physical address in order to make this requirement clear.	2003-03-16 04:16:03 +00:00
Maxime Henrion	40b63da2a9	Bah, get it right this time and add sys/lock.h before sys/mutex.h.	2003-03-14 13:30:31 +00:00
Maxime Henrion	7541142438	Oops, add missing includes. Pass me the pointy hat. Reported by: jake	2003-03-14 00:04:37 +00:00
Maxime Henrion	c0796d1cb4	Grab Giant around calls to contigmalloc() and contigfree() so that drivers converted to be MP safe don't have to deal with it.	2003-03-13 17:18:48 +00:00
Maxime Henrion	ea458bbcdb	Memory allocated with contigmalloc() should be freed with contigfree(), not with free().	2003-03-13 17:10:54 +00:00
Marcel Moolenaar	e9523c31c5	Fix two rounds of breakages and cleanup. Remove the sccdebug sysctl while I'm here and garbage collect dead code (ssc_clone). Define d_maxsize as DFLTPHYS for now because that's what it will be if we don't define it.	2003-03-10 01:58:31 +00:00
Poul-Henning Kamp	60794e0478	Centralize the devstat handling for all GEOM disk device drivers in geom_disk.c. As a side effect this makes a lot of #include <sys/devicestat.h> lines not needed and some biofinish() calls can be reduced to biodone() again.	2003-03-08 08:01:31 +00:00
Marcel Moolenaar	cafd6dbd76	Fix threaded applications on ia64 that are linked dynamicly. We did not save (restore) the global pointer (GP) in the jmpbuf in setjmp (longjmp) because it's not needed in general. GP is considered a scratch register at callsites and hence is always restored after a call (when it's possible that the call resolves to a symbol in a different loadmodule; otherwise GP does not have to be saved and restored at all), including calls to setjmp/longjmp. There's just one problem with this now that we use setjmp/longjmp for context switching: A new context must have GP defined properly for the thread's entry point. This means that we need to put GP in the jmpbuf and consequently that we have to restore is in longjmp. This automaticly requires us to save it as well. When setjmp/longjmp isn't used for context switching, this can be reverted again.	2003-03-05 04:39:24 +00:00
Marcel Moolenaar	a402169a8e	ABI breaker: Move the J_SIGMASK field in the jmpbuf before the J_SIG0 field. While here, rename J_SIG0 to J_SIGSET and remove J_SIG1. The main reason for this change is that the 128-bit sigset_t is now aligned on a 16-byte boundary, which allows us to use 16-byte atomic loads and stores on CPUs that support it. The removal of J_SIG1 is done to avoid confusion: it is never accessed and should not be. Renaming J_SIG0 to J_SIGSET is the icing on the cake that's better done now than later.	2003-03-05 03:30:54 +00:00
John Baldwin	263067951a	Replace calls to WITNESS_SLEEP() and witness_list() with equivalent calls to WITNESS_WARN().	2003-03-04 21:03:05 +00:00
Poul-Henning Kamp	7ac40f5f59	Gigacommit to improve device-driver source compatibility between branches: Initialize struct cdevsw using C99 sparse initializtion and remove all initializations to default values. This patch is automatically generated and has been tested by compiling LINT with all the fields in struct cdevsw in reverse order on alpha, sparc64 and i386. Approved by: re(scottl)	2003-03-03 12:15:54 +00:00
Alan Cox	72c3aad7e8	MFi386 revision 1.88 Remove some long unused declarations.	2003-03-01 10:02:11 +00:00
David Xu	1d56863dd2	Needn't kse.h	2003-02-27 03:16:35 +00:00
Julian Elischer	ac2e415327	Change the process flags P_KSES to be P_THREADED. This is just a cosmetic change but I've been meaning to do it for about a year.	2003-02-27 02:05:19 +00:00
Maxime Henrion	f6c912dd0c	Correctly set BUS_SPACE_MAXSIZE in all the busdma backends. It was bogusly set to 64 * 1024 or 128 * 1024 because it was bogusly reused in the BUS_DMAMAP_NSEGS definition.	2003-02-26 02:16:06 +00:00
Maxime Henrion	07159f9c56	Cleanup of the d_mmap_t interface. - Get rid of the useless atop() / pmap_phys_address() detour. The device mmap handlers must now give back the physical address without atop()'ing it. - Don't borrow the physical address of the mapping in the returned int. Now we properly pass a vm_offset_t * and expect it to be filled by the mmap handler when the mapping was successful. The mmap handler must now return 0 when successful, any other value is considered as an error. Previously, returning -1 was the only way to fail. This change thus accidentally fixes some devices which were bogusly returning errno constants which would have been considered as addresses by the device pager. - Garbage collect the poorly named pmap_phys_address() now that it's no longer used. - Convert all the d_mmap_t consumers to the new API. I'm still not sure wheter we need a __FreeBSD_version bump for this, since and we didn't guarantee API/ABI stability until 5.1-RELEASE. Discussed with: alc, phk, jake Reviewed by: peter Compile-tested on: LINT (i386), GENERIC (alpha and sparc64) Runtime-tested on: i386	2003-02-25 03:21:22 +00:00
Poul-Henning Kamp	263444cfbf	Change the console interface to pass a "struct consdev " instead of a dev_t to the method functions. The dev_t can still be found at struct consdev ->cn_dev. Add a void *cn_arg element to struct consdev which the drivers can use for retrieving their softc.	2003-02-20 20:54:45 +00:00
Warner Losh	a163d034fa	Back out M_* changes, per decision of the TRB. Approved by: trb	2003-02-19 05:47:46 +00:00
Julian Elischer	3931c9a231	Fix missed patch in last commit	2003-02-17 10:21:32 +00:00
Julian Elischer	4a338afd7a	Move a bunch of flags from the KSE to the thread. I was in two minds as to where to put them in the first case.. I should have listenned to the other mind. Submitted by: parts by davidxu@ Reviewed by: jeff@ mini@	2003-02-17 09:55:10 +00:00
Marcel Moolenaar	d1d78df69b	Define _ALIGNBYTES to be 15. This should have been done right away.	2003-02-17 09:53:29 +00:00
Marcel Moolenaar	a2ca37c83e	Print two new processor features: o Spontaneous deferral (A feature required by dutch railways :-) o 16-byte atomic operations (ld, st, cmpxchg)	2003-02-17 08:17:26 +00:00
Jeff Roberson	5215b1872f	- Split the struct kse into struct upcall and struct kse. struct kse will soon be visible only to schedulers. This greatly simplifies much the KSE code. Submitted by: davidxu	2003-02-17 05:14:26 +00:00
Jeff Roberson	e4625663c9	- Move ke_sticks, ke_iticks, ke_uticks, ke_uu, ke_su, and ke_iu back into the proc. These counters are only examined through calcru. Submitted by: davidxu Tested on: x86, alpha, UP/SMP	2003-02-17 02:19:58 +00:00
Poul-Henning Kamp	f341ca9891	Remove #include <sys/dkstat.h>	2003-02-16 14:13:23 +00:00
Marcel Moolenaar	89c1ecfade	Fix misuse of Maxmem in the calculation of the VHPT size. Maxmem is already in pages, so we should not convert from bytes to pages. The result of this bug was bad scaling of the VHPT relative to the available memory. Submitted by: Arun Sharma <arun@sharma-home.net>	2003-02-15 20:58:32 +00:00
David E. O'Brien	36dc5b9427	Fix the style of the SCHED_4BSD commit.	2003-02-13 22:24:44 +00:00
Alan Cox	5b0a1f3af2	MFi386 Remove kptobj. Instead, use VM_ALLOC_NOOBJ.	2003-02-13 07:03:44 +00:00
Mike Barcroft	8cf5ed5125	Implement fpclassify(): o Add a MD header private to libc called _fpmath.h; this header contains bitfield layouts of MD floating-point types. o Add a MI header private to libc called fpmath.h; this header contains bitfield layouts of MI floating-point types. o Add private libc variables to lib/libc/$arch/gen/infinity.c for storing NaN values. o Add __double_t and __float_t to <machine/_types.h>, and provide double_t and float_t typedefs in <math.h>. o Add some C99 manifest constants (FP_ILOGB0, FP_ILOGBNAN, HUGE_VALF, HUGE_VALL, INFINITY, NAN, and return values for fpclassify()) to <math.h> and others (FLT_EVAL_METHOD, DECIMAL_DIG) to <float.h> via <machine/float.h>. o Add C99 macro fpclassify() which calls __fpclassify{d,f,l}() based on the size of its argument. __fpclassifyl() is never called on alpha because (sizeof(long double) == sizeof(double)), which is good since __fpclassifyl() can't deal with such a small `long double'. This was developed by David Schultz and myself with input from bde and fenner. PR: 23103 Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU> (significant portions) Reviewed by: bde, fenner (earlier versions)	2003-02-08 20:37:55 +00:00
Hartmut Brandt	e557905435	Fix a problem in bus_dmamap_load_{mbuf,uio} when the first mbuf or the first uio segment is empty. In this case no dma segment is create by bus_dmamap_load_buffer, but the calling routine clears the first flag. Under certain combinations of addresses of the first and second mbuf/uio buffer this leads to corrupted DMA segment descriptors. This was already fixed by tmm in sparc64/sparc64/iommu.c. PR: kern/47733 Reviewed by: sam Approved by: jake (mentor)	2003-02-04 16:30:27 +00:00
Jake Burkholder	238dd3209a	Split statclock into statclock and profclock, and made the method for driving statclock based on profhz when profiling is enabled MD, since most platforms don't use this anyway. This removes the need for statclock_process, whose only purpose was to subdivide profhz, and gets the profiling clock running outside of sched_lock on platforms that implement suswintr. Also changed the interface for starting and stopping the profiling clock to do just that, instead of changing the rate of statclock, since they can now be separate. Reviewed by: jhb, tmm Tested on: i386, sparc64	2003-02-03 17:53:15 +00:00
Marcel Moolenaar	871a64fdf0	Don't use the 'c' partition for mounting root. A disklabel is very likely not present under the simulator. If multiple partitions are present on the virtual disk, then the 'a' partition would be the most logical choice. Nowadays partitions are GPT based, which would make the assumption of a disklabel even more questionable. Given all the possible scenarios, assuming a raw "device" seems best.	2003-02-03 01:10:01 +00:00
Alfred Perlstein	8deebb0160	Consolidate MIN/MAX macros into one place (param.h). Submitted by: Hiten Pandya <hiten@unixdaemons.com>	2003-02-02 13:17:30 +00:00
Poul-Henning Kamp	ecc575bfaa	We don't need sscopen() and sscclose(). Register sscstrategy directly, instead of using a cdevsw{} for the purpose. Tested by: marcel	2003-02-02 10:22:34 +00:00
Marcel Moolenaar	0d50208281	Export IA32 from opt_ia32.h to assembly so that we can eliminate saving and restoring ia32 specific registers when switching context and ia32 support has not been compiled-in. The primary reason for this change is that one of the ia32 registers (ar.fcr) is wrongly marked as invalid by the simulator. Now that we avoid using the register when possible, usability is improved. The secundary reason is that it saves us 7 loads and stores. Note that the PCB will continue to have room for these registers, irrespective of the IA32 option. There are no benefits that make it worthwhile.	2003-02-02 09:07:15 +00:00
Marcel Moolenaar	dc00c828e3	Remove special casing for running in the simulator from the kernel and instead add platform, firmware and EFI stubs to the loader. The net effect of this change is that besides a special console and disk driver, the kernel has no knowledge of the simulator. This has the following advantages: o Simulator support is much harder to break, o It's easier to make use of more feature complete simulators. This would only need a change in the simulator specific loader, o Running SMP kernels within the simulator. Note that ski at this time does not simulate IPIs, so there's no way to start APs. The platform, firmware and EFI stubs describe the following hardware: o 4 CPU Itanium, o 128 MB RAM within the 4GB address space, o 64 MB RAM above the 4GB address space. NOTE: The stubs in the skiloader describe a machine that should in parts be defined by the simulator. Things like processor interrupt block and AP wakeup vector cannot be choosen at random because they require interpretation by the simulator. Currently the simulator is ignorant of this. This change introduces an unofficial SSC call SSC_SAL_SET_VECTORS which is ignored by the simulator. Tested with: ski (version 0.943 for linux)	2003-02-01 22:50:09 +00:00
Josef Karthauser	c0c9e71f9a	Put replace spaces with tabs in keeping with the rest of the file.	2003-02-01 18:45:18 +00:00
Julian Elischer	6f8132a867	Reversion of commit by Davidxu plus fixes since applied. I'm not convinced there is anything major wrong with the patch but them's the rules.. I am using my "David's mentor" hat to revert this as he's offline for a while.	2003-02-01 12:17:09 +00:00
Poul-Henning Kamp	2c61f6b389	Remove D_CANFREE from sscdisk.c. I belive it got here by copy&paste and I see no signs in the source code that BIO_DELETE was dealt with correctly and can only wonder what kind of trouble this may have caused.	2003-01-30 11:48:50 +00:00
Julian Elischer	0d7cc0749c	Unbreak SMP cases for these architectures. statclock_process() changed arguments. note: it may be worth checking if curkse is needed on these architectures.. (and if so, why?)	2003-01-27 00:00:06 +00:00
David Xu	0dbb100b9b	Move UPCALL related data structure out of kse, introduce a new data structure called kse_upcall to manage UPCALL. All KSE binding and loaning code are gone. A thread owns an upcall can collect all completed syscall contexts in its ksegrp, turn itself into UPCALL mode, and takes those contexts back to userland. Any thread without upcall structure has to export their contexts and exit at user boundary. Any thread running in user mode owns an upcall structure, when it enters kernel, if the kse mailbox's current thread pointer is not NULL, then when the thread is blocked in kernel, a new UPCALL thread is created and the upcall structure is transfered to the new UPCALL thread. if the kse mailbox's current thread pointer is NULL, then when a thread is blocked in kernel, no UPCALL thread will be created. Each upcall always has an owner thread. Userland can remove an upcall by calling kse_exit, when all upcalls in ksegrp are removed, the group is atomatically shutdown. An upcall owner thread also exits when process is in exiting state. when an owner thread exits, the upcall it owns is also removed. KSE is a pure scheduler entity. it represents a virtual cpu. when a thread is running, it always has a KSE associated with it. scheduler is free to assign a KSE to thread according thread priority, if thread priority is changed, KSE can be moved from one thread to another. When a ksegrp is created, there is always N KSEs created in the group. the N is the number of physical cpu in the current system. This makes it is possible that even an userland UTS is single CPU safe, threads in kernel still can execute on different cpu in parallel. Userland calls kse_create to add more upcall structures into ksegrp to increase concurrent in userland itself, kernel is not restricted by number of upcalls userland provides. The code hasn't been tested under SMP by author due to lack of hardware. Reviewed by: julian	2003-01-26 11:41:35 +00:00
Jeff Roberson	c3384118a1	- Introduce the SCHED_ULE and SCHED_4BSD options for compile time selection of the scheduler. - Add SCHED_4BSD as the scheduler for all kernel config files in cvs.	2003-01-26 05:29:12 +00:00
Doug Rabson	388dc84194	Fix pmap_extract so that it doesn't panic if the user types 'cat /proc/pid/map' Submitted by: Arun Sharma <arun.sharma@intel.com>	2003-01-24 09:58:32 +00:00
Alfred Perlstein	44956c9863	Remove M_TRYWAIT/M_WAITOK/M_WAIT. Callers should use 0. Merge M_NOWAIT/M_DONTWAIT into a single flag M_NOWAIT.	2003-01-21 08:56:16 +00:00
Jeff Roberson	04de47b0d3	- Add a VM_WAIT in the appropriate cases where vm_page_alloc() fails and flags indicate that uma_small_alloc should not. This code should be refactored so that there is not so much cross arch duplication. Reviewed by: jake Spotted by: tmm Tested on: alpha, sparc64 Pointy hat to: jeff and everyone who cut and pasted the bad code. :-)	2003-01-21 05:44:52 +00:00
Jake Burkholder	7251b4bf93	Resolve relative relocations in klds before trying to parse the module's metadata. This fixes module dependency resolution by the kernel linker on sparc64, where the relocations for the metadata are different than on other architectures; the relative offset is in the addend of an Elf_Rela record instead of the original value of the location being patched. Also fix printf formats in debug code. Submitted by: Hartmut Brandt <brandt@fokus.gmd.de> PR: 46732 Tested on: alpha (obrien), i386, sparc64	2003-01-21 02:42:44 +00:00
Poul-Henning Kamp	5a3d3419f8	We need neither <sys/diskslice.h> nor <sys/disklabel.h> here.	2003-01-20 11:11:51 +00:00
Maxime Henrion	1d8dc7e4a3	Don't try to free() map in bus_dmamap_destroy() when it's set to &nobounce_dmamap. A similar bug was fixed by wpaul in revision 1.19 of sys/alpha/alpha/busdma_machdep.c.	2003-01-18 18:33:56 +00:00
Matthew Dillon	e3669cee72	Merge all the various copies of vm_fault_quick() into a single portable copy.	2003-01-16 00:02:21 +00:00
Matthew Dillon	f597900329	Merge all the various copies of vmapbuf() and vunmapbuf() into a single portable copy. Note that pmap_extract() must be used instead of pmap_kextract(). This is precursor work to a reorganization of vmapbuf() to close remaining user/kernel races (which can lead to a panic).	2003-01-15 23:54:35 +00:00
Marcel Moolenaar	f40614b8aa	Move ia64_sapics and ia64_sapic_count from interrupt.c to sapic.c and declare them extern in interrupt.c. This eliminates the need for ia64_add_sapic(), which is called from sapic.c. While here, reformat ia64_enable() in interrupt.c to improve indentation and add a sysctl (machdep.apic) to dump the I/O APIC entries currently programmed into all I/O APICs. The latter can help analyze interrupt problems. Note that the sysctl is not intended as a userland (software) interface. It may be changed in the future to include counters so that vmstat -i can make use of it. It may also be removed...	2003-01-06 02:09:08 +00:00
Peter Wemm	839c70593f	Move the itm reload to a single place rather than having two identical copies of the reload. Note that we use the precomputed itm_reload value so that we can avoid a division in the kernel. The ia64 cpu does not have integer divide, so this would have been done by a floating point operation.	2003-01-06 01:53:55 +00:00
Marcel Moolenaar	cbb095815a	Replace the hardcoding of 255 as the clock interrupt vector with CLOCK_VECTOR and define it as 254, not 255. Vector 255 is already in use as the AP wakeup vector on the HP rx2600. This needs to be made more dynamic. The likelyhood of vector 254 being in use is pretty small, but we already have code to assign vectors to IPIs (see sal.c) and it's preobably better to have a centralized "vector manager" that hands out vectors based on some imput (like priority).	2003-01-06 01:39:25 +00:00
Marcel Moolenaar	de09ec92e6	Manually inline handleclock(). There's only a single caller and handleclock itself is trivial. While here, replace (itc_frequency+hz/2)/hz with itm_reload for consistency. There's now a single place where we determine the ITM reload value.	2003-01-06 00:38:35 +00:00
Marcel Moolenaar	0d6d96ea78	Count interrupts as soon as possible. This makes sure interrupts are counted even when there are no handlers.	2003-01-06 00:25:31 +00:00
Marcel Moolenaar	c58d580e70	Don't hardcode the address of the local (S)APIC (aka processor interrupt block). We use the previously hardcoded address as a default only, but will otherwise use whatever ACPI tells us. The address can be found in the MADT table header or in the LAPIC override table entry.	2003-01-05 22:14:30 +00:00
Marcel Moolenaar	ff263ea2dc	Bump the number of interrupts from 65 to 257. This is a waste of space most of the time, but handles machines with lots of I/O (S)APICs. We cannot make this more dynamic without breaking the interface with vmstat. Hence, we need to fix the interface first.	2003-01-05 22:00:19 +00:00
Marcel Moolenaar	671f297efb	Handle 3-digit interrupt numbers (vectors). While here, change the name of unused entries from "intr XXX" to "#XXX". This makes it easier to debug interrupt problems, because vmstat can be hacked more easily to dump all interrupt entries that are in use and not those that have had interrupts.	2003-01-05 21:48:33 +00:00
Marcel Moolenaar	d291811a6a	Make all memory I/O addresses (explicitly) 64-bit. Memory mapped devices aren't necessarily mapped within 4GB. I/O port addresses are offsets into the memory mapped I/O port space, which is not larger than 16MB. No need to convert those to 64 bit types.	2003-01-05 21:40:45 +00:00
Marcel Moolenaar	aa063fddbf	Provide a null-implementation for bus_space_unmap, like i386. bus_space_unmap is required for puc(4).	2003-01-05 21:34:05 +00:00
Marcel Moolenaar	518a75ea1c	Adopt, adapt and improve: o Make the URL of the handbook match reality o Improve some comments (either wording or formatting) o Sync with i386: comment-out DDB, INVARIANTS, INVARIANT_SUPPORT o Add some more SCSI/RAID controllers: ahd, mpt, asr, ciss, dpt, iir, mly, ida o Remove support for the parallel port o Add NICs: em, bge o Remove NICs: ste, tl, tx, vr, wb o Enable USB support again, except of the UHCI host controller. UHCI still hangs the BigSur (=HP i2000) machines, and makes them useless. The OHCI controller works fine. Note that newer ia64 boxes based on the Intel host controllers (UHCI or EHCI) still won't have USB support. We really need to import the EHCI host controller from NetBSD...	2003-01-05 00:04:28 +00:00
Alan Cox	fd421a85e1	Hold the page queues lock around pmap_remove_pte() in pmap_enter(). Submitted by: Arun Sharma <adsharma@unix-os.sc.intel.com>	2003-01-04 06:49:52 +00:00
Marcel Moolenaar	dd778c94bf	Make this build and sync-up: o Add COMPAT_FREEBSD4 o Remove NO_GEOM o Remove commented out options.	2003-01-03 23:10:47 +00:00
Jens Schweikhardt	9d5abbddbf	Correct typos, mostly s/ a / an / where appropriate. Some whitespace cleanup, especially in troff files.	2003-01-01 18:49:04 +00:00
Robert Watson	a95f262c6b	Synchronize to kern/syscalls.master:1.139. Obtained from: TrustedBSD Project	2002-12-29 20:33:26 +00:00
Tim J. Robbins	b30a7779d4	MB_LEN_MAX is not MD, move it to the MI limits.h.	2002-12-22 06:38:45 +00:00
Marcel Moolenaar	d8e7d462eb	More MFp4: DIG64 structures.	2002-12-18 18:52:20 +00:00
Marcel Moolenaar	91c71b46ed	Export the physical address of the RSDP to userland by means of the `machdep.acpi_root' sysctl. This is required on ia64 because the root pointer hardly ever, if at all, lives in the first MB of memory and also because scanning the first MB of memory can cause machine checks. This provides a save and reliable way for ACPI tools to work with the tables if ACPI support is present in the kernel. On ia64 ACPI is non-optional.	2002-12-18 08:47:07 +00:00
Marcel Moolenaar	1aa83d38bc	Check that the dump device is large enough. Otherwise we could end up with a dump offset that's smaller than the start of the dump device and either clobber data in preceding partitions or try to write beyond the end of the medium (unsigned wrap). Implement legacy behaviour to never write to the first 64KB as that is where metadata (ie disklabels) may reside.	2002-12-17 02:51:56 +00:00
Marcel Moolenaar	722d8e49a9	Regen: swapoff	2002-12-16 00:49:36 +00:00
Marcel Moolenaar	3f60ecab4e	Change swapoff from MNOPROTO to UNIMPL. The former doesn't work.	2002-12-16 00:48:52 +00:00
Matthew Dillon	92da00bb24	This is David Schultz's swapoff code which I am finally able to commit. This should be considered highly experimental for the moment. Submitted by: David Schultz <dschultz@uclink.Berkeley.EDU> MFC after: 3 weeks	2002-12-15 19:17:57 +00:00
Alfred Perlstein	d1e405c5ce	SCARGS removal take II.	2002-12-14 01:56:26 +00:00
Alfred Perlstein	bc9e75d7ca	Backout removal SCARGS, the code freeze is only "selectively" over.	2002-12-13 22:41:47 +00:00
Alfred Perlstein	0bbe7292e1	Remove SCARGS. Reviewed by: md5	2002-12-13 22:27:25 +00:00
Julian Elischer	696058c3c5	Unbreak the KSE code. Keep track of zobie threads using the Per-CPU storage during the context switch. Rearrange thread cleanups to avoid problems with Giant. Clean threads when freed or when recycled. Approved by: re (jhb)	2002-12-10 02:33:45 +00:00
Marcel Moolenaar	7b54e1ca53	Use one of the bi_spare entries for the DIG64 HCDP table address. The HCDP table is one (non-proprietary) way for the platform to inform the OS about headless operation. This field would normally hold the address as can be found by scanning the EFI system table, which we also pass to the kernel. The apparent duplication allows us to synthesize a HCDP table in the loader by whatever means we can think of, including relocating the platform table into pre- mapped address space. In short: it gives us more freedom. Approved by: re (blanket)	2002-12-08 20:32:56 +00:00
Marcel Moolenaar	c0fabbfb90	Disable SMP. It reduces the chance that the kernel boots. On top of that, there's some nasty process corruption when running with SMP. Note that this was already in effect for the 5.0-RC1 kernels in the form of a local patch. Approved by: re (blanket)	2002-12-08 20:14:04 +00:00
Alan Cox	8e7ea1fc7d	MFi386 Hold the page queues lock around vm_page_unhold() in vunmapbuf(). Approved by: re (blanket)	2002-12-02 01:12:05 +00:00
Marcel Moolenaar	4ea25f94be	Implement bus_space_subregion(). Identical to i386. Approved by: re (carte blanc)	2002-11-29 20:14:03 +00:00
Marcel Moolenaar	ddc6d86ca3	Better handle sparse physical memory: Don't use the address range as a measure for available memory to scale the VHPT. Instead, use the previously determined Maxmem. Approved by: re (carte blanc)	2002-11-29 20:10:21 +00:00
Marcel Moolenaar	53efb27fc5	MFp4: Add function map_port_space() to map the memory mapped I/O port range as uncacheable virtual memory and call it prior to probing for a console. This removes the dependency on the loader to have done this for us. Note that this change does not include doing the same for APs. Approved by: re (blanket)	2002-11-24 20:15:08 +00:00
Marcel Moolenaar	26cd294128	Fix comparison that caused a 1-off bug. This appeared harmless for the kernel itself, but SAL on Itanium2 machines spontaneously rebooted the machine. Approved by: re (blanket) Submitted by: Arun Sharma <adsharma@unix-os.sc.intel.com>	2002-11-24 20:07:23 +00:00
Maxime Henrion	b19d9defef	Under certain circumstances, we were calling kmem_free() from i386 cpu_thread_exit(). This resulted in a panic with WITNESS since we need to hold Giant to call kmem_free(), and we weren't helding it anymore in cpu_thread_exit(). We now do this from a new MD function, cpu_thread_dtor(), called by thread_dtor(). Approved by: re@ Suggested by: jhb	2002-11-22 23:57:02 +00:00
Alan Cox	779df20df2	MFi386 r1.369 - Clear the PG_WRITEABLE flag in pmap_page_protect() if write access is being removed. Return immediately if write access is being removed and PG_WRITEABLE is already clear.	2002-11-17 21:48:42 +00:00
Daniel Eischen	84b427ce23	Regenerate after adding syscalls.	2002-11-16 23:48:14 +00:00
Daniel Eischen	a4b04278f0	Add *context() syscalls to ia64 32-bit compatability table as requested in kern/syscalls.master.	2002-11-16 15:15:17 +00:00
Daniel Eischen	2be05b70c9	Add getcontext, setcontext, and swapcontext as system calls. Previously these were libc functions but were requested to be made into system calls for atomicity and to coalesce what might be two entrances into the kernel (signal mask setting and floating point trap) into one. A few style nits and comments from bde are also included. Tested on alpha by: gallatin	2002-11-16 06:35:53 +00:00
Peter Wemm	cdf5e9ccb6	Do not assume that time_t is an int. Approved by: re (jhb)	2002-11-15 22:36:57 +00:00
Peter Wemm	70285c3e5f	Test the water. Make time_t long (64 bit) on ia64 since we do not have to worry about ABI vs released systems yet. This is mostly transparent since there is no significant exposure in the syscall interface. The things that go wrong are mostly userland stuff - time(&intvariable). Reviewed by: dfr, marcel Approved by: re (jhb)	2002-11-15 22:35:34 +00:00
Alan Cox	eea85e9bb6	Move pmap_collect() out of the machine-dependent code, rename it to reflect its new location, and add page queue and flag locking. Notes: (1) alpha, i386, and ia64 had identical implementations of pmap_collect() in terms of machine-independent interfaces; (2) sparc64 doesn't require it; (3) powerpc had it as a TODO.	2002-11-13 05:39:58 +00:00
Marcel Moolenaar	7aa65edc75	ia64 ABI breaker: Don't force 16-byte alignment at run-time. Do it at compile-time. This saves us the pointer fiddling by the setjmp functions and reduces complexity. While here, increase the jmp_buf by 16 bytes to an even 512 bytes. Coincidentally, due to the way alignment was handled prior to this change, the jmp_buf has not changed in size, but only in how the space is used. Prior to this change the 16 bytes were reserved for enforcing alignment; now they are reserved by us for future extensions. Therefore, this ABI breaker is relatively save: the failure is always an alignment trap.	2002-11-11 08:11:44 +00:00
Alan Cox	6372d61e3e	- Clear the page's PG_WRITEABLE flag in the i386's pmap_changebit() if we're removing write access from the page's PTEs. - Export pmap_remove_all() on alpha, i386, and ia64. (It's already exported on sparc64.)	2002-11-11 05:17:34 +00:00
Marcel Moolenaar	3c428f6a5a	Comment-out USB support. A kernel doesn't boot with it. Deal with it later.	2002-11-11 01:50:10 +00:00
Dag-Erling Smørgrav	97b67f3141	Print real / avail memory in megabytes rather than kilobytes.	2002-11-09 16:19:14 +00:00
Thomas Moestl	0fca57b8b8	Move the definitions of the hw.physmem, hw.usermem and hw.availpages sysctls to MI code; this reduces code duplication and makes all of them available on sparc64, and the latter two on powerpc. The semantics by the i386 and pc98 hw.availpages is slightly changed: previously, holes between ranges of available pages would be included, while they are excluded now. The new behaviour should be more correct and brings i386 in line with the other architectures. Move physmem to vm/vm_init.c, where this variable is used in MI code.	2002-11-07 23:57:17 +00:00
Juli Mallett	751d91aabd	Remove what was a temporary bogus assignment of bits of siginfo_t, as it does not look like the prerequisites to fill it in properly will be in the tree for the upcoming release, but it's mostly done, so there is no need for these to stay around to remind us.	2002-11-06 14:53:35 +00:00
Marcel Moolenaar	6e296c0d4e	Define UMA_MD_SMALL_ALLOC so that we can allocate memory with region 7 addresses for use by page tables and kernel stacks. Obtained from: peter	2002-11-06 04:47:38 +00:00
Marcel Moolenaar	654684338d	o Remove devices that are commented out. o Enable sc o Remove NO_GEOM. We need GEOM for GPT. o Remove NO_CPU_COPTFLAGS.	2002-11-05 08:35:37 +00:00
Marcel Moolenaar	356492b3e2	Remove mcclock. It's an Alpha left-over.	2002-11-05 08:23:26 +00:00
Robert Watson	fabb3caa9e	Sync to src/sys/kern/syscalls.master	2002-11-02 23:55:30 +00:00
Marcel Moolenaar	5d3590aed7	Don't pass the return address to exception_save in register b0. Use a true scratch register. This change and future re-allocations will eventually result in code that we can unwind to to get the preserved registers of the process. This of course means that we cannot trash them while saving the process context. While re-allocating, remove the register aliases. Abstraction is in this case disadvanteous.	2002-10-30 07:58:11 +00:00
Marcel Moolenaar	22d9ff4691	Rewrite cpu_switch(). The most notable change is the fact that we now have f16-f31 as part of the context. The PCB has been reorganized to better match how we save and restore the (preserved) registers. This commit also moves the context restoriation to its own function (named pcb_restore), as we did with pcb_save. Only minimal effort has been put in writing optimal assembly. The expectation is that there will be more rounds of changes.	2002-10-30 05:55:29 +00:00
Marcel Moolenaar	d506d5dc0e	Remove mf.a from sapic_read() and sapic_write(). We only care about ordering and not acceptance. The removal of mf.a leaves behind the mf that accompanied it.	2002-10-28 01:59:27 +00:00
Marcel Moolenaar	9c156e012e	Remove mf.a (the acceptance form of the memory fence instruction) from all low-level bus space support functions. There's no need to actually force the read/write to be accepted by the platform before we can do anything else. We still have the mf instruction there, which forces ordering. This too is not required given the semantices of the bus space I/O functions, but it's not at all clear to me if there are any poorly written device drivers that depend on the strict ordering by the processor. The motto here is to take small steps...	2002-10-28 01:00:57 +00:00
Marcel Moolenaar	84903f7f4e	Make vmstat -i work: o Properly set the pointer to the counter for each interrupt and update the intrnames table. o Remove Alpha cruft from intrcnt.h. o Create INTRNAME_LEN as the single entity that defines the width of the names in the intrnames table (incl. terminatinf '\0').	2002-10-28 00:50:39 +00:00
Marcel Moolenaar	36cb272078	In ipi_send(), perform a mf instruction prior to initiating the IPI. This guarantees that loads and stores emitted before the fence are made visible before the IPI becomes pended. Remove the mf.a instruction after initiating the IPI. There's no guarantee that the IPI becomes pended prior to subsequent reads or writes. Even if there was a guarantee, it would mostly be without any benefit.	2002-10-27 23:00:46 +00:00
Peter Wemm	3556388d17	Add COMPAT_FREEBSD4 here too. It has COMPAT_43 as well.	2002-10-26 04:58:36 +00:00
Peter Wemm	23eeeff7be	Split 4.x and 5.x signal handling so that we can keep 4.x signal handling clean and functional as 5.x evolves. This allows some of the nasty bandaids in the 5.x codepaths to be unwound. Encapsulate 4.x signal handling under COMPAT_FREEBSD4 (there is an anti-foot-shooting measure in place, 5.x folks need this for a while) and finish encapsulating the older stuff under COMPAT_43. Since the ancient stuff is required on alpha (longjmp(3) passes a 'struct osigcontext ' to the current sigreturn(2), instead of the 'ucontext_t ' that sigreturn is supposed to take), add a compile time check to prevent foot shooting there too. Add uniform COMPAT_43 stubs for ia64/sparc64/powerpc. Tested on: i386, alpha, ia64. Compiled on sparc64 (a few days ago). Approved by: re	2002-10-25 19:10:58 +00:00
Julian Elischer	1434d3fe6f	Extract out KSE specific code from machine specific code so that there is ony one copy of it. Fix that one copy so that KSEs with no mailbox in a KSE program are not a cause of page faults (this can legitmatly happen). Submitted by: (parts) davidxu	2002-10-24 23:09:48 +00:00
John Baldwin	0f8e01f117	Oops, I missed a few changes in 'device acpica' -> 'device acpi' change. Submitted by: Hiten Pandya <hiten@angelica.unixdaemons.com>	2002-10-24 19:17:06 +00:00
John Baldwin	d2ec391b39	Rename 'device acpica' to 'device acpi'. Approved by: msmith, iwasaki	2002-10-24 19:05:04 +00:00
Marcel Moolenaar	b019ad1bc0	In cb_dumphdr() we were calling buf_write() with di->priv as the pointer to a dumperinfo instead of di. A brainfart, surely. This bug went unnoticed for all this time because the pointer is only used by buf_write() when it can write a completely filled buffer to the dump device. This depends on the number of memory chunks that needs to be dumped. This has apparently been low enough that it has never happened up until this point.	2002-10-20 23:39:43 +00:00
Marcel Moolenaar	bfc937c758	Remove the special casing for IP addresses that are within the IVT or the do_syscall() function. We have unwind directives to stop the unwinder.	2002-10-20 06:03:05 +00:00
Marcel Moolenaar	341572f4bd	Define IVT_ENTRY and IVT_END as special versions of ENTRY and END for defining vectors. As a result, each vector will be a global function with unwind directives to notify the unwinder that we're in an interrupt handler. In the debugger this will show up something like: Debugger(0xe000000000a211d8, 0xe000000000748960) at Debugger+0x31 panic(0xe000000000a36858, 0xe0000000021d32d0, 0xe000000000ae42e8, ... trap(0x14, 0x100000, 0xe0000000021d32d0, 0x0, 0xa0000000002095f0, ... ivt_Data_TLB(0x14, 0x100000, 0xe0000000021d32d0) at ivt_Data_TLB+0x1f0	2002-10-20 06:00:09 +00:00
Peter Wemm	8556393bb2	Stake a claim on 418 (__xstat), 419 (__xfstat), 420 (__xlstat)	2002-10-19 22:25:31 +00:00
Peter Wemm	c8447553b5	Grab 416/417 real estate before I get burned while testing again. This is for the not-quite-ready signal/fpu abi stuff. It may not see the light of day, but I'm certainly not going to be able to validate it when getting shot in the foot due to syscall number conflicts.	2002-10-19 22:09:23 +00:00
Robert Watson	bc5245d94c	Add a placeholder for the execve_mac() system call, similar to SELinux's execve_secure() system call, which permits a process to pass in a label for a label change during exec. This permits SELinux to change the label for the resulting exec without a race following a manual label change on the process. Because this interface uses our general purpose MAC label abstraction, we call it execve_mac(), and wrap our port of SELinux's execve_secure() around it with appropriate sid mappings. Obtained from: TrustedBSD Project Sponsored by: DARPA, Network Associates Laboratories	2002-10-19 21:06:57 +00:00
Marcel Moolenaar	c8a4afbc11	Update the unwind information when modules are loaded and unloaded by using the linker hooks. Since these hooks are called for the kernel as well, we don't need to deal with that with a special SYSINIT. The initialization implicitly performed on the first update of the unwind information is made explicit with a SYSINIT. We now don't need the _ia64_unwind_{start\|end} symbols.	2002-10-19 19:30:38 +00:00
Marcel Moolenaar	1aeb23cdfa	Add two hooks to signal module load and module unload to MD code. The primary reason for this is to allow MD code to process machine specific attributes, segments or sections in the ELF file and update machine specific state accordingly. An immediate use of this is in the ia64 port where unwind information is updated to allow debugging and tracing in/across modules. Note that this commit does not add the functionality to the ia64 port. See revision 1.9 of ia64/ia64/elf_machdep.c. Validated on: alpha, i386, ia64	2002-10-19 19:16:03 +00:00
Robert Watson	c7063e5913	Permits UFS ACLs to be used with the GENERIC kernel. Due to recent ACL configuration changes, this shouldn't result in different code paths for file systems not explicitly configured for ACLs by the system administrator. For UFS1, administrators must still recompile their kernel to add support for extended attributes; for UFS2, it's sufficient to enable ACLs using tunefs or at mount-time (tunefs preferred for reliability reasons). UFS2, for a variety of reasons, including performance and reliability, is the preferred file system for use with ACLs. Approved by: re	2002-10-19 16:54:15 +00:00
Marcel Moolenaar	db55d99d36	Make this compile when DDB is not defined by conditionally compiling all references to ksym_start and ksym_end.	2002-10-19 04:14:08 +00:00
Marcel Moolenaar	47f750125b	Fix kernel module loading on ia64. Cross-module function calls were improperly relocated due to faulty logic in lookup_fdesc() in elf_machdep.c. The symbol index (symidx) was bogusly used for load modules other than the one the relocation applied to. This resulted in bogus bindings and consequently runtime failures. The fix is to use the symbol index only for the module being relocated and to use the symbol name for look-ups in the modules in the dependent list. As such, we need a function to return the symbol name given the linker file and symbol index.	2002-10-15 05:40:07 +00:00
Peter Wemm	af3f249f3a	The a.out md_coredump stuff isn't referenced anywhere anymore, and hasn't been filled in for ages.. Nuked.	2002-10-15 00:02:50 +00:00
Marcel Moolenaar	df8b64f634	Allow kernel dumps to be aborted with ctrl-C.	2002-10-14 05:38:22 +00:00
Mike Barcroft	4275e0d98d	Remove the P1003_1B kernel option; it is no longer used.	2002-10-13 16:29:17 +00:00
Mike Barcroft	795b630655	struct ia64_fpreg needs to be available outside of the kernel for struct sigcontext. Pointy hat to: mike	2002-10-13 15:30:17 +00:00
Mike Barcroft	7a77412855	Add standards visibility conditionals. Change any uses of sigset_t to struct __sigset to avoid depending on objects from <sys/signal.h>.	2002-10-13 00:31:46 +00:00
Marcel Moolenaar	163afbe0d5	o Fix typo in previous commit: s/sc-nsect/sc->nsect/ o Fix printf format error for %d format with long argument.	2002-10-12 23:00:40 +00:00
Marcel Moolenaar	3508148f6e	Plug two holes where we returned to userland without restoring the predicate registers. Even though the ITLB and DTLB interrupts happen often enough, this bug didn't do much harm. The reason is that the interrupt handlers only modify p1 and since this is a preserved (callee-saved) register it is hardly used in code generated by the compiler. Compilers use scratch registers by default. Changing the interrupt handlers to use p6 (ie a scratch register) proved that the bug was in fact fatal.	2002-10-12 22:42:48 +00:00
Marcel Moolenaar	02cda72148	Polish previous commit: o Replace KSTACK_PAGES with pages on panic() in pmap_new_thread(), o Fix style bugs in adjacent code, o Use NULL instead of 0 for pointers, o Save the virtual kstack address if we create an alternate kstack because 1) we can derive the physical (RR7) address from it and 2) we need the virtual address for contigfree() in pmap_dispose_thread(). Thus td_altkstack saves td_md.md_kstackvirt.	2002-10-12 21:04:01 +00:00
Marcel Moolenaar	8d7d4e5d71	MFp4: Include machine/vmparam.h to pull in definition of IA64_RR_BASE. Obtained from: peter	2002-10-12 20:54:05 +00:00
Marcel Moolenaar	526a05f186	Remove the dependency on ia64_cpu.h by not defining pmap_kextract() as a trivial function that only calls ia64_tpa() and hence requires the prototype of ia64_tpa(), but by defining pmap_kextract as ia64_tpa. This solves the inclusion ordering issue in ddb/db_watch.c	2002-10-12 20:35:56 +00:00
Peter Wemm	4d5266715e	cut/paste the pmap_new_altkstack stuff from the other platforms. It's no different here. Update the rest of the kstack API's for scottl's changes.	2002-10-11 22:29:22 +00:00
Peter Wemm	0250db7caa	Call uma_zalloc on pvzone with M_NOWAIT, just like i386 and alpha. Otherwise we get hundreds of 'could sleep' during boot.	2002-10-11 21:41:53 +00:00
Mike Barcroft	2b7f24d210	Change iov_base's type from `char ' to the standard` void '. All uses of iov_base which assume its type is `char ' (in order to do pointer arithmetic) have been updated to cast iov_base to `char '.	2002-10-11 14:58:34 +00:00
Peter Wemm	d6b8445ebd	re-regen. Sigh.	2002-10-09 22:40:41 +00:00
Peter Wemm	d4f4eb09d9	Sigh. Fix fat-fingering of diff. I knew this was going to happen.	2002-10-09 22:40:02 +00:00
Peter Wemm	9105d26b73	regenerate. sendfile stuff and other recently picked up stubs.	2002-10-09 22:28:48 +00:00
Peter Wemm	459e3a7a37	Try and deal with the #ifdef COMPAT_FREEBSD4 sendfile stuff. This would have been a lot easier if do_sendfile() was usable externally.	2002-10-09 22:27:24 +00:00
Peter Wemm	4f4c431d9b	Try and patch up some tab-to-space spammage.	2002-10-09 22:14:35 +00:00
Peter Wemm	8e223883f0	Add placeholder stubs for nsendfile, mac_syscall, ksem_close, ksem_post, ksem_wait, ksem_trywait, ksem_init, ksem_open, ksem_unlink, ksem_getvalue, ksem_destroy, __mac_get_pid, __mac_get_link, __mac_set_link, extattr_set_link, extattr_get_link, extattr_delete_link.	2002-10-09 22:10:23 +00:00
Mike Barcroft	ddb056b7d4	Add conditionals to allow va_list to be defined in other headers.	2002-10-06 22:02:06 +00:00
Mike Barcroft	0b058e3cbd	o Add conditionals to allow va_list to be defined in other headers. o Standardize on _MACHINE_STDARG_H_ to allow multiple header includes. o Restrict the definition of va_copy() to C99 environments.	2002-10-06 22:01:07 +00:00
David E. O'Brien	39b9b6a43f	It appears CPU_MAXID should be 1 more than the number of CPU_* defines.	2002-10-06 05:25:10 +00:00
Poul-Henning Kamp	3bd6561289	NB: This commit does NOT make GEOM the default in FreeBSD NB: But it will enable it in all kernels not having options "NO_GEOM" Put the GEOM related options into the intended order. Add "options NO_GEOM" to all kernel configs apart from NOTES. In some order of controlled fashion, the NO_GEOM options will be removed, architecture by architecture in the coming days. There are currently three known issues which may force people to need the NO_GEOM option: boot0cfg/fdisk: Tries to update the MBR while it is being used to control slices. GEOM does not allow this as a direct operation. SCSI floppy drives: Appearantly the scsi-da driver return "EBUSY" if no media is inserted. This is wrong, it should return ENXIO. PC98: It is unclear if GEOM correctly recognizes all variants of PC98 disklabels. (Help Wanted! I have neither docs nor HW) These issues are all being worked. Sponsored by: DARPA & NAI Labs.	2002-10-05 16:35:33 +00:00
Mike Barcroft	e61d3b10d7	Fix namespace issues by using visibility conditionals from <sys/cdefs.h>.	2002-10-05 05:47:56 +00:00
Mike Barcroft	085f8416a2	style(9) <machine/setjmp.h> headers so they look mostly the same.	2002-10-04 22:10:06 +00:00
Sam Leffler	14c17bd293	New bus_dma interfaces for use by crypto device drivers: o bus_dmamap_load_mbuf o bus_dmamap_load_uio Test on i386. Known to compile on alpha and sparc64, but not tested. Otherwise untried.	2002-10-04 20:40:39 +00:00
Peter Wemm	debff18680	Gah, spell extern correctly. Do not trust cut/paste via old mozilla builds.	2002-10-04 01:57:46 +00:00
Peter Wemm	ae8ef28761	List the IO SAPIC delivery mode definitions.	2002-10-04 01:33:30 +00:00
Peter Wemm	74cf93d79d	Declare itc_frequency and itm_reload.	2002-10-04 01:23:58 +00:00
Peter Wemm	120e7b6b43	Do a bit of rude hackery to get clock interrupts on all CPUs. This is partly based on the Alpha system which duplicates the clock to each cpu, instead of doing a clock roundrobin like on i386. This means we get hz * ncpu clocks per second and so we have to seperate clock sampling from actual 'do the work' clock processing. The BSP runs the complete processing, the rest just sample state etc. Using the on-cpu interval timer is not ideal as it will drift. There is more to be done here, we should use an external clock source.	2002-10-04 01:00:35 +00:00
Peter Wemm	da2893e7ed	Update stubs for post-kseIII.	2002-10-04 00:19:58 +00:00
Peter Wemm	1678d0990b	Update for post-kseIII	2002-10-04 00:18:21 +00:00
Archie Cobbs	36a8dac10d	Let kse_wakeup() take a KSE mailbox pointer argument. Reviewed by: julian	2002-10-02 16:48:16 +00:00
Poul-Henning Kamp	fcd054483a	Fix the same misinitialization of pmap_prefault_pageorder as on i386. Suggeste by: jake	2002-10-01 20:56:29 +00:00
Archie Cobbs	89def71cbd	Make the following name changes to KSE related functions, etc., to better represent their purpose and minimize namespace conflicts: kse_fn_t -> kse_func_t struct thread_mailbox -> struct kse_thr_mailbox thread_interrupt() -> kse_thr_interrupt() kse_yield() -> kse_release() kse_new() -> kse_create() Add missing declaration of kse_thr_interrupt() to <sys/kse.h>. Regenerate the various generated syscall files. Minor style fixes. Reviewed by: julian	2002-09-25 18:10:42 +00:00
Alfred Perlstein	4f492bfab5	use __packed.	2002-09-23 18:54:32 +00:00
Peter Wemm	c692fbe091	At great personal risk, add a __packed and __aligned(x) define that expand to __attribute__((packed)) and __attribute__((aligned(x))) respectively. Replace the handful of gcc-ism's that use __attribute__((aligned(16))) etc around the kernel with __aligned(16). There are over 400 __attribute__((packed)) to deal with, that can come later. I just want to use __packed in new code rather than add more gcc-ism's.	2002-09-23 05:55:10 +00:00
Mike Barcroft	c0d4804d67	Be careful not to define GCC-specific optimizations in the non-GCC case.	2002-09-23 04:32:17 +00:00
Poul-Henning Kamp	7812d86f03	(This commit touches about 15 disk device drivers in a very consistent and predictable way, and I apologize if I have gotten it wrong anywhere, getting prior review on a patch like this is not feasible, considering the number of people involved and hardware availability etc.) If struct disklabel is the messenger: kill the messenger. Inside struct disk we had a struct disklabel which disk drivers used to communicate certain metrics to the disklayer above (GEOM or the disk mini-layer). This commit changes this communication to use four explicit fields instead. Amongst the benefits is that the fields do not get overwritten by wrong or bogus on-disk disklabels. Once that is clear, <sys/disk.h> which is included in the drivers no longer need to pull <sys/disklabel.h> and <sys/diskslice.h> in, the few places that needs them, have gotten explicit #includes for them. The disklabel inside struct disk is now only for internal use in the disk mini-layer, so instead of embedding it, we malloc it as we need it. This concludes (modulus any mistakes) the series of disklabel related commits. I belive it all amounts to a NOP for all the rest of you :-) Sponsored by: DARPA & NAI Labs.	2002-09-20 19:36:05 +00:00
Poul-Henning Kamp	6fb3d70418	For reasons now lost in historical fog, the bounds_check_with_label() function were put in i386/i386/machdep.c from where it has been cut and pasted to other architectures with only minor corruption. Disklabel is really a MI format in many ways, at least it certainly is when you operate on struct disklabel. Put bounds_check_with_label() back in subr_disklabel.c where it belongs. Sponsored by: DARPA & NAI Labs.	2002-09-20 17:51:00 +00:00
John Baldwin	c79408a059	Implement db_print_backtrace() if DDB is compiled into the kernel. This MD function is just a wrapper around db_stack_trace_cmd() that prints out a backtrace of curthread. Currently, this function is only implemented on i386 and alpha (and the alpha version isn't quite tested yet, will do that in a bit). Other changes: - For i386, fix a bug in the raw frame address case. The eip we extract from the passed in frame address does not match the frame we received. Thus, instead of printing a bogus frame with the wrong eip, go ahead and advance frame down to the same frame as the eip we are using. - For alpha, attempt to add a way of doing a raw trace for alpha. Instead of passing a frame address in 'addr', pass in a pointer to a structure containing PC and KSP and use those to start the backtrace. The alpha db_print_backtrace() uses asm to read in the current PC and KSP values into such a request. Tested on: i386 Requested by: many	2002-09-19 18:46:29 +00:00
Mike Barcroft	86954511d2	Implement C99's va_copy() macro.	2002-09-18 07:33:16 +00:00
Peter Wemm	66422f5b7a	Initiate deorbit burn for the i386-only a.out related support. Moves are under way to move the remnants of the a.out toolchain to ports. As the comment in src/Makefile said, this stuff is deprecated and one should not expect this to remain beyond 4.0-REL. It has already lasted WAY beyond that. Notable exceptions: gcc - I have not touched the a.out generation stuff there. ldd/ldconfig - still have some code to interface with a.out rtld. old as/ld/etc - I have not removed these yet, pending their move to ports. some includes - necessary for ldd/ldconfig for now. Tested on: i386 (extensively), alpha	2002-09-17 01:49:00 +00:00
Julian Elischer	4f0db5e08c	Allocate KSEs and KSEGRPs separatly and remove them from the proc structure. next step is to allow > 1 to be allocated per process. This would give multi-processor threads. (when the rest of the infrastructure is in place) While doing this I noticed libkvm and sys/kern/kern_proc.c:fill_kinfo_proc are diverging more than they should.. corrective action needed soon.	2002-09-15 23:52:25 +00:00
Jun Kuriyama	b37a9be231	Use "options " rather than "options<tab>".	2002-09-09 02:40:59 +00:00
Juli Mallett	77c70ade18	Fill out two fields (si_pid, si_uid) in the siginfo structure handed back to userland in the signal handler that were not being iflled out before, but should and can be. This part of sendsig could be slightly refactored to use an MI interface, or ideally, sendsig() would have an API change to accept a siginfo_t, which would be filled out by an MI function in the level above sendsig, and said MI function would make a small call into MD code to fill out the MD parts (some of which may be bogus, such as the si_addr stuff in some places). This would eventually make it possible for parts of the kernel sending signals to set up a siginfo with meaningful information. Reviewed by: mux MFC after: 2 weeks	2002-09-07 19:12:53 +00:00
Peter Wemm	1a50106e30	Zap the implementations of the i386-aout specific cpu_coredump function. Most of the non-i386 platforms had rather broken implementations anyway.	2002-09-07 01:26:34 +00:00
Peter Wemm	b11ac5597c	Make this compile	2002-09-03 04:30:00 +00:00
Mike Barcroft	dd1b6791a8	Now that _BSD_CLK_TCK_ and _BSD_CLOCKS_PER_SEC_ are the same on all architectures, move the definition directly into <time.h> and finish the removal of <machine/ansi.h>.	2002-09-03 00:06:58 +00:00
Mike Barcroft	f2893bd9ef	Align _BSD_CLK_TCK_ and _BSD_CLOCKS_PER_SEC_ with most other platforms. This introduces some binary incompatibilities for dynamically linked programs which make use of clock(3) and times(3).	2002-09-02 22:40:12 +00:00
Alan Cox	660ab349e5	o Remove an initialized but unused variable from pmap_remove_all().	2002-09-02 05:13:37 +00:00
Marcel Moolenaar	9010448c55	Sync up: remove device counts.	2002-09-01 22:34:58 +00:00
Jake Burkholder	f36ba45234	Added fields for VM_MIN_ADDRESS, PS_STRINGS and stack protections to sysentvec. Initialized all fields of all sysentvecs, which will allow them to be used instead of constants in more places. Provided stack fixup routines for emulations that previously used the default.	2002-09-01 21:41:24 +00:00
Peter Wemm	31cdffc6d8	Take a shot at fixing up a whole stack of style and other embarresing unforced errors that Bruce identified. I have not yet addressed all of his concerns.	2002-08-31 03:33:32 +00:00
Peter Wemm	0b1482d1fd	Do not use an object for the pte and pv zones on ia64 because it overrides the pmap_allocf() function that we provide above. We still use the limits via other means. Submitted by: jeff	2002-08-31 03:11:20 +00:00
Peter Wemm	447b3772dc	Change hw.physmem and hw.usermem to unsigned long like they used to be in the original hardwired sysctl implementation. The buf size calculator still overflows an integer on machines with large KVA (eg: ia64) where the number of pages does not fit into an int. Use 'long' there. Change Maxmem and physmem and related variables to 'long', mostly for completeness. Machines are not likely to overflow 'int' pages in the near term, but then again, 640K ought to be enough for anybody. This comes for free on 32 bit machines, so why not?	2002-08-30 04:04:37 +00:00
Jake Burkholder	bafbd49201	Renamed poorly named setregs to exec_setregs. Moved its prototype to imgact.h with the other exec support functions.	2002-08-29 06:17:48 +00:00
Jake Burkholder	409017e8e1	Fixed printf format errors.	2002-08-29 05:49:04 +00:00
Alan Cox	6508a194aa	o Retire pmap_pageable(). It's an advisory routine that none of our platforms implements.	2002-08-25 04:20:05 +00:00
Marcel Moolenaar	1b570732c5	s/_BSD_VA_LIST_/__va_list/. The former type doesn't exist anymore.	2002-08-23 22:19:57 +00:00
Mike Barcroft	bd8e0716d9	Move several MI types from <machine/_types.h> to <sys/_types.h>. These types are unlikely to ever become very MD. They include: clockid_t, ct_rune_t, fflags_t, intrmask_t, mbstate_t, off_t, pid_t, rune_t, socklen_t, timer_t, wchar_t, and wint_t. While moving them, make a few adjustments (submitted by bde): o __ct_rune_t needs to be precisely `int', not necessarily __int32_t, since the arg type of the ctype functions is int. o __rune_t, __wchar_t and __wint_t inherit this via a typedef of __ct_rune_t. o Some minor wording changes in the comment blocks for ct_rune_t and mbstate_t. Submitted by: bde (partially)	2002-08-23 16:49:06 +00:00
Maxime Henrion	de794a36b9	Convert NEXUS_ACCESSOR to use the __BUS_ACCESSOR macro instead of reimplementing it. Approved by: peter	2002-08-22 19:47:10 +00:00
Mike Barcroft	abbd890233	o Merge <machine/ansi.h> and <machine/types.h> into a new header called <machine/_types.h>. o <machine/ansi.h> will continue to live so it can define MD clock macros, which are only MD because of gratuitous differences between architectures. o Change all headers to make use of this. This mainly involves changing: #ifdef _BSD_FOO_T_ typedef _BSD_FOO_T_ foo_t; #undef _BSD_FOO_T_ #endif to: #ifndef _FOO_T_DECLARED typedef __foo_t foo_t; #define _FOO_T_DECLARED #endif Concept by: bde Reviewed by: jake, obrien	2002-08-21 16:20:02 +00:00
Robert Watson	0a9b91cde1	Correct one more errant whitespace nit that crept in during changes in the arguments to vn_rdwr(). Hopefully the last.	2002-08-20 02:21:38 +00:00
Peter Wemm	f453022cd9	remove unit counts from atkbdc, pckbd, sc	2002-08-20 00:10:22 +00:00
Robert Watson	2b94185b82	Correct a minor whitespace nit that sneaked in with my previous commit.	2002-08-15 21:41:20 +00:00
Robert Watson	9ca435893b	In order to better support flexible and extensible access control, make a series of modifications to the credential arguments relating to file read and write operations to cliarfy which credential is used for what: - Change fo_read() and fo_write() to accept "active_cred" instead of "cred", and change the semantics of consumers of fo_read() and fo_write() to pass the active credential of the thread requesting an operation rather than the cached file cred. The cached file cred is still available in fo_read() and fo_write() consumers via fp->f_cred. These changes largely in sys_generic.c. For each implementation of fo_read() and fo_write(), update cred usage to reflect this change and maintain current semantics: - badfo_readwrite() unchanged - kqueue_read/write() unchanged pipe_read/write() now authorize MAC using active_cred rather than td->td_ucred - soo_read/write() unchanged - vn_read/write() now authorize MAC using active_cred but VOP_READ/WRITE() with fp->f_cred Modify vn_rdwr() to accept two credential arguments instead of a single credential: active_cred and file_cred. Use active_cred for MAC authorization, and select a credential for use in VOP_READ/WRITE() based on whether file_cred is NULL or not. If file_cred is provided, authorize the VOP using that cred, otherwise the active credential, matching current semantics. Modify current vn_rdwr() consumers to pass a file_cred if used in the context of a struct file, and to always pass active_cred. When vn_rdwr() is used without a file_cred, pass NOCRED. These changes should maintain current semantics for read/write, but avoid a redundant passing of fp->f_cred, as well as making it more clear what the origin of each credential is in file descriptor read/write operations. Follow-up commits will make similar changes to other file descriptor operations, and modify the MAC framework to pass both credentials to MAC policy modules so they can implement either semantic for revocation. Obtained from: TrustedBSD Project Sponsored by: DARPA, NAI Labs	2002-08-15 20:55:08 +00:00

... 3 4 5 6 7 ...

1128 Commits