sysadmin/ipxe - ipxe - codex.r10x.net

mirror of https://github.com/ipxe/ipxe synced 2026-02-28 03:11:18 +03:00

Author	SHA1	Message	Date
Michael Brown	0a8e34657e	[efi] Add image security database GUID definition Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-12 12:09:40 +00:00
Michael Brown	5c135240bc	[efi] Add Microsoft vendor GUID definition Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-12 12:01:37 +00:00
Michael Brown	5154b6fcc5	[efi] Add storage security command protocol header and GUID definition Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 23:24:22 +00:00
Michael Brown	27ec3c76ab	[efi] Update to current EDK2 headers Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 23:24:22 +00:00
Michael Brown	3a2f75b789	[efi] Mark Arm/ProcessorBind.h as a non-imported header Support for ARM32 has been removed from the EDK2 codebase. However, we may as well retain the ability to build iPXE for existing EFI platforms. Add an iPXE include guard to this file so that the EDK2 header import script will no longer attempt to import it from the EDK2 tree. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 16:14:22 +00:00
Michael Brown	f0d978d8b4	[efi] Mark Ip4Config.h as a non-imported header The Ip4Config.h header has been removed from the EDK2 codebase as obsolete. However, we may still encounter it in the wild and so it is useful to retain the GUID and the corresponding protocol name for debug messages. Add an iPXE include guard to this file so that the EDK2 header import script will no longer attempt to import it from the EDK2 tree. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 16:12:26 +00:00
Michael Brown	f9b9ef578a	[efi] Mark UgaDraw.h as a non-imported header The UgaDraw.h header has been removed from the EDK2 codebase as obsolete. However, we may still encounter it in the wild and so it is useful to retain the GUID and the corresponding protocol name for debug messages. Add an iPXE include guard to this file so that the EDK2 header import script will no longer attempt to import it from the EDK2 tree. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 16:09:27 +00:00
Michael Brown	c0ac23fc56	[efi] Switch back to VA_START() etc macros for EFIAPI functions Commit `670810b` ("[efi] Use standard va_args macros instead of VA_START() etc") fixed a 32-bit RISC-V build error, but broke the functionality of the InstallMultipleProtocolInterfaces() and UninstallMultipleProtocolInterfaces() wrapper functions. GCC does not automatically check the ABI of the current function when using the __builtin_va_start() and related macros, and it is therefore necessary for code to use __builtin_ms_va_start() etc from within functions marked as EFIAPI. Since commit 9016f2e ("[efi] Skip including the EDK2 ProcessorBind.h header for 32-bit RISC-V") has now fixed the original 32-bit RISC-V build error, we can switch back to using the EDK2 VA_START() etc macros to obtain the correct behaviour. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 15:41:42 +00:00
Michael Brown	360aa7064f	[efi] Skip including the EDK2 ProcessorBind.h header for 32-bit RISC-V We currently include the EDK2 RiscV64/ProcessorBind.h header when building for 32-bit RISC-V, as a placeholder since there is no support for 32-bit RISC-V in upstream EDK2. This causes errors when attempting to use the EDK2 VA_START() et al macros, since RiscV64/ProcessorBind.h ends up defining UINTN with a size different from the size of a pointer. Fix by falling back to the generic definitions for UINTN etc (as used for EFI_HOSTONLY) whenever we don't have an architecture-specific ProcessorBind.h header available. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 15:41:42 +00:00
Michael Brown	df7f59d47a	[pci] Move ECAM pci_can_probe() definition to correct header file Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-11 13:13:54 +00:00
Michael Brown	b41bda4413	[spcr] Accept alternative type value for a 16550-compatible UART Some systems (observed on an AWS EC2 m7i.metal-24xl instance in eu-south-2) use the newer "16550-compatible with parameters defined in Generic Address Structure" type value. (There does not appear to be any particular reason why the newer value needs to be used: the UART is still a standard 16550 with single-byte registers.) Accept this additional type value for a 16550-compatible UART. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 20:06:53 +00:00
Michael Brown	6d9374e5a3	[acpi] Allow acpi_ioremap() to map a port I/O address Assume that on any platforms where port I/O is used (i.e. x86), a port I/O address may be used directly for the combined MMIO and port I/O accessors without requiring an explicit mapping operation. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 19:33:53 +00:00
Michael Brown	bd3982b630	[ioapi] Allow iounmap() to be called for port I/O addresses Allow code using the combined MMIO and port I/O accessors to safely call iounmap() to unmap the MMIO or port I/O region. In the virtual offset I/O mapping API as used for UEFI, 32-bit BIOS, and 32-bit RISC-V SBI, iounmap() is a no-op anyway. In 64-bit RISC-V SBI, we have no concept of port I/O and so the issue is moot. This leaves only 64-bit BIOS, where it suffices to simply do nothing for any pages outside of the chosen MMIO virtual address range. For symmetry, we implement the equivalent change in the very closely related RISC-V page management code. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 19:33:53 +00:00
Michael Brown	f2ea97102e	[spcr] Use the serial port defined by the ACPI SPCR by default On platforms where we expect ACPI tables to exist, use the serial port defined by the ACPI Serial Port Console Redirection (SPCR) table by default, falling back to the fixed serial port defined at build time. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 16:55:20 +00:00
Michael Brown	595ff24030	[spcr] Add support for the ACPI Serial Port Console Redirection table The BIOS may provide an ACPI Serial Port Console Redirection (SPCR) table to describe the serial port to be used for early boot messages. Add support for parsing the SPCR and instantiating a 16550-based UART. We do not currently attempt to support other types of UART, since iPXE does not yet have drivers for other types. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 14:12:57 +00:00
Michael Brown	8fd5e27727	[acpi] Add acpi_ioremap() to map an ACPI-described address An ACPI Generic Address Structure (GAS) may be used to describe the location of a peripheral such as an early boot console. Add the relevant definitions and provide acpi_ioremap() as a helper function to map a region described using this structure. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 14:10:14 +00:00
Michael Brown	08d4d7fe9d	[uart] Make baud rate a property of the UART Make the current baud rate (if specified) a property of the UART, to allow the default_serial_console() function to specify the default baud rate as well as the default UART device. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-05 12:18:17 +00:00
Michael Brown	a786c8d231	[uart] Support 16550 UARTs accessed via either MMIO or port I/O Use the combined accessors ioread8() and iowrite8() to read and write 16550 UART registers, to allow the decision between using MMIO and port I/O to be made at runtime. Minimise the increase in code size for x86 by ignoring the register shift, since this is essentially used only for non-x86 SoCs. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-04 21:14:41 +00:00
Michael Brown	f7de1b53dc	[ioapi] Provide combined MMIO and port I/O accessors Some devices (such as a 16550 UART) may be accessed via either MMIO or port I/O. This is currently forced to be a compile-time decision. For example: we currently access a 16550 UART via port I/O on x86 and via MMIO on any other platform. PCI UARTs with MMIO BARs do exist but are not currently supported in an x86 build of iPXE. Some AWS EC2 systems (observed on a c6i.metal instance in eu-west-2) provide only a PCI MMIO UART, and it is therefore currently impossible to get serial output from iPXE on these instance types. Add ioread8(), ioread16(), etc accessors that will select between MMIO and port I/O at the point of use. For non-x86 platforms where we currently have no port I/O support, these simply become wrappers around the corresponding readb(), readw(), etc MMIO accessors. On x86, we use the fairly well-known trick of treating any 16-bit address (below 64kB) as a port I/O address. This trick works even in the i386 BIOS build of iPXE (where virtual addresses are offset from physical addresses by a runtime constant), since the first 64kB of the virtual address space will correspond to the iPXE binary itself (along with its uninitialised-data space), and so must be RAM rather than a valid MMIO address range. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-11-04 21:14:41 +00:00
Michael Brown	fde35ff003	[pci] Disable decoding while setting a BAR value Setting the base address for a 64-bit BAR requires two separate 32-bit writes to configuration space, and so will necessarily result in the BAR temporarily holding an invalid partially written address. Some hypervisors (observed on an AWS EC2 c7a.medium instance in eu-west-2) will assume that guests will write BAR values only while decoding is disabled, and may not rebuild MMIO mappings for the guest if the BAR registers are written while decoding is enabled. The effect of this is that MMIO accesses are not routed through to the device even though inspection from within the guest shows that every single PCI configuration register has the correct value. Writes to the device will be ignored, and reads will return the all-ones pattern that typically indicates a nonexistent device. With the ENA network driver now using low latency transmit queues, this results in the transmit descriptors being lost (since the MMIO writes to BAR2 never reach the device), which in turn causes the device to lock up as soon as the transmit doorbell is rung for the first time. Fix by disabling decoding of memory and I/O cycles while setting a BAR address (as we already do while sizing a BAR), so that the invalid partial address can never be decoded and so that hypervisors will rebuild MMIO mappings as expected. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-29 23:30:52 +00:00
Michael Brown	606e87ec7a	[cloud] Display instance type in AWS EC2 Experiments suggest that the instance type is exposed via the SMBIOS product name. Include this information within the default output, since it is often helpful in debugging. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-29 13:26:50 +00:00
Michael Brown	0336e2987c	[ena] Leave queue base address empty when creating a low latency queue The queue base address is meaningless for a low latency queue, since the queue entries are written directly to the on-device memory. Any non-zero queue base address will be safely ignored by the hardware, but leaves open the possibility that future revisions could treat it as an error. Leave this field as zero, to match the behaviour of the Linux driver. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-28 12:27:06 +00:00
Michael Brown	0ddd830693	[riscv] Correct page table stride calculation Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-27 14:22:16 +00:00
Michael Brown	426c721e32	[librm] Correct page table stride calculation Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-27 14:22:16 +00:00
Michael Brown	c8f088d4e1	[cloud] Display build architecture in AWS EC2 On some newer (7th and 8th generation) instance types, the 32-bit build of iPXE cannot access PCI configuration space since the ECAM is placed outside of the 32-bit address space. The visible symptom is that iPXE fails to detect any network devices. The public AMIs are all now built as 64-bit binaries, but there is nothing that prevents the building and importing of a 32-bit AMI. There are still potentially valid use cases for 32-bit AMIs (e.g. if planning to use the AMI only for older instance types), and so we cannot sensibly prevent this error at build time. Display the build architecture as part of the AWS EC2 embedded script, to at least allow for easy identification of this particular failure mode at run time. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-20 12:58:03 +01:00
Michael Brown	416a2143af	[cloud] Remove AWS public image access block only if not already unblocked Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-20 12:58:03 +01:00
Michael Brown	ba1846a0d3	[cloud] Remove AWS public image access block automatically if needed Making images public is blocked by default in new AWS regions. Remove this block automatically whenever creating a public image. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-17 14:22:21 +01:00
Michael Brown	b2e8468219	[ena] Limit receive queue size to work around hardware bugs Commit `a801244` ("[ena] Increase receive ring size to 128 entries") increased the receive ring size to 128 entries (while leaving the fill level at 16), since using a smaller receive ring caused unexplained failures on some instance types. The original hardware bug that resulted in that commit seems to have been fixed: experiments suggest that the original failure (observed on a c6i.large instance in eu-west-2) will no longer reproduce when using a receive ring containing only 16 entries (as was the case prior to that commit). Newer generations of the ENA hardware (observed on an m8i.large instance in eu-south-2) seem to have a new and exciting hardware bug: these instance types appear to use a hash of the received packet header to determine which portion of the (out-of-order) receive ring to use. If that portion of the ring happens to be empty (e.g. because only 32 entries of the 128-entry ring are filled at any one time), then the packet will be silently dropped. Work around this new hardware bug by reducing the receive ring size down to the current fill level of 32 entries. This appears to work on all current instance types (but has not been exhaustively tested). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-17 13:25:05 +01:00
Michael Brown	846c505ae9	[ena] Increase transmit queue size to match receive fill level Avoid running out of transmit descriptors when sending TCP ACKs by increasing the transmit queue size to match the increased receive fill level. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-17 13:25:05 +01:00
Michael Brown	0ae5e25de2	[ena] Add memory barrier after writing to on-device memory Ensure that writes to on-device memory have taken place before writing to the doorbell register. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-17 12:35:23 +01:00
Michael Brown	c296747d0e	[ena] Increase receive fill level Experiments suggest that at least some instance types (observed with c6i.large in eu-west-2) experience high packet drop rates with only 16 receive buffers allocated. Increase the fill level to 32 buffers. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-16 16:36:29 +01:00
Michael Brown	c1badf71ca	[ena] Add support for low latency transmit queues Newer generations of the ENA hardware require the use of low latency transmit queues, where the submission queues and the initial portion of the transmitted packet are written to on-device memory via BAR2 instead of being read from host memory. Detect support for low latency queues and set the placement policy appropriately. We attempt the use of low latency queues only if the device reports that it supports inline headers, 128-byte entries, and two descriptors prior to the inlined header, on the basis that we don't care about using low latency queues on older versions of the hardware since those versions will support normal host memory submission queues anyway. We reuse the redundant memory allocated for the submission queue as the bounce buffer for constructing the descriptors and inlined packet data, since this avoids needing a separate allocation just for the bounce buffer. We construct a metadata submission queue entry prior to the actual submission queue entry, since experimentation suggests that newer generations of the hardware require this to be present even though it conveys no information beyond its own existence. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-16 16:36:29 +01:00
Michael Brown	0d15d7f0a5	[ena] Record supported device features Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-16 16:36:29 +01:00
Michael Brown	e5e371f485	[ena] Cancel uncompleted transmit buffers on close Avoid spurious assertion failures by ensuring that references to uncompleted transmit buffers are not retained after the device has been closed. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-16 16:36:29 +01:00
Michael Brown	dcc5d36ce5	[ena] Map the on-device memory, if present Newer generations of the ENA hardware require the use of low latency transmit queues, where the submission queues and the initial portion of the transmitted packet are written to on-device memory via BAR2 instead of being read from host memory. Prepare for this by mapping the on-device memory BAR. As with the register BAR, we may need to steal a base address from the upstream PCI bridge since the BIOS on some instance types (observed with an m8i.metal-48xl instance in eu-south-2) will fail to assign an address to the device. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-15 15:55:57 +01:00
Michael Brown	510f3e5e17	[ena] Add descriptive messages for any admin queue command failures Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-15 12:00:42 +01:00
Michael Brown	3538e9c39a	[pci] Record prefetchable memory window for PCI bridges Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-14 18:38:08 +01:00
Michael Brown	04a61c413d	[ena] Use pci_bar_set() to place device within bridge memory window Use pci_bar_set() when we need to set a device base address (on instance types such as c6i.metal where the BIOS fails to do so), so that 64-bit BARs will be handled automatically. This particular issue has so far been observed only on 6th generation instances. These use 32-bit BARs, and so the lack of support for handling 64-bit BARs has not caused any observable issue. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-14 15:57:02 +01:00
Michael Brown	94902ae187	[pci] Handle sizing of 64-bit BARs Provide pci_bar_set() to handle setting the base address for a potentially 64-bit BAR, and rewrite pci_bar_size() to correctly handle sizing of 64-bit BARs. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-14 14:43:50 +01:00
Michael Brown	e80818e4f6	[tls] Disable renegotiation unless extended master secret is used RFC 7627 states that renegotiation becomes no longer secure under various circumstances when the non-extended master secret is used. The description of the precise set of circumstances is spread across various points within the document and is not entirely clear. Avoid a superset of the circumstances in which renegotiation apparently becomes insecure by refusing renegotiation completely unless the extended master secret is used. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-12 23:25:09 +01:00
Michael Brown	57504353fe	[tls] Refuse to resume sessions with mismatched master secret methods RFC 7627 section 5.3 states that the client must abort the handshake if the server attempts to resume a session where the master secret calculation method stored in the session does not match the method used for the connection being resumed. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-12 23:25:09 +01:00
Michael Brown	ab64bc5b8d	[tls] Add support for the Extended Master Secret RFC 7627 defines the Extended Master Secret (EMS) as an alternative calculation that uses the digest of all handshake messages rather than just the client and server random bytes. Add support for negotiating the Extended Master Secret extension and performing the relevant calculation of the master secret. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-12 23:25:04 +01:00
Michael Brown	d6656106e9	[tls] Generate master secret only after sending Client Key Exchange The calculation for the extended master secret as defined in RFC 7627 relies upon the digest of all handshake messages up to and including the Client Key Exchange. Facilitate this calculation by generating the master secret only after sending the Client Key Exchange message. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-12 22:20:13 +01:00
Michael Brown	4f44f62402	[gve] Rearm interrupts unconditionally on every poll Experimentation suggests that rearming the interrupt once per observed completion is not sufficient: we still see occasional delays during which the hardware fails to write out completions. As described in commit `d2e1e59` ("[gve] Use dummy interrupt to trigger completion writeback in DQO mode"), there is no documentation around the precise semantics of the interrupt rearming mechanism, and so experimentation is the only available guide. Switch to rearming both TX and RX interrupts unconditionally on every poll, since this produces better experimental results. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-10 13:12:19 +01:00
Michael Brown	f5ca1de738	[gve] Use raw DMA addresses in descriptors in DQO-QPL mode The DQO-QPL operating mode uses registered queue page lists but still requires the raw DMA address (rather than the linear offset within the QPL) to be provided in transmit and receive descriptors. Set the queue page list base device address appropriately. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-10 12:49:26 +01:00
Michael Brown	1cc1f1cd4f	[gve] Report only packet completions for the transmit ring The hardware reports descriptor and packet completions separately for the transmit ring. We currently ignore descriptor completions (since we cannot free up the transmit buffers in the queue page list and advance the consumer counter until the packet has also completed). Now that transmit completions are written out immediately (instead of being delayed until 128 bytes of completions are available), there is no value in retaining the descriptor completions. Omit descriptor completions entirely, and reduce the transmit fill level back down to its original value. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-09 17:29:20 +01:00
Michael Brown	d2e1e591ab	[gve] Use dummy interrupt to trigger completion writeback in DQO mode When operating in the DQO operating mode, the device will defer writing transmit and receive completions until an entire internal cacheline (128 bytes) is full, or until an associated interrupt is asserted. Since each receive descriptor is 32 bytes, this will cause received packets to be effectively delayed until up to three further packets have arrived. When network traffic volumes are very low (such as during DHCP, DNS lookups, or TCP handshakes), this typically induces delays of up to 30 seconds and results in a very poor user experience. Work around this hardware problem in the same way as for the Intel 40GbE and 100GbE NICs: by enabling dummy MSI-X interrupts to trick the hardware into believing that it needs to write out completions to host memory. There is no documentation around the interrupt rearming mechanism. The value written to the interrupt doorbell does not include a consumer counter value, and so must be relying on some undocumented ordering constraints. Comments in the Linux driver source suggest that the authors believe that the device will automatically and atomically mask an MSI-X interrupt at the point of asserting it, that any further interrupts arriving before the doorbell is written will be recorded in the pending bit array, and that writing the doorbell will therefore immediately assert a new interrupt if needed. In the absence of any documentation, choose to rearm the interrupt once per observed completion. This is overkill, but is less impactful than the alternative of rearming the interrupt unconditionally on every poll. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-09 17:12:20 +01:00
Michael Brown	c2d7ddd0c2	[gve] Add missing memory barriers Ensure that the remainder of each completion record is read only after verifying the generation bit (or sequence number). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-09 16:42:20 +01:00
Michael Brown	5438299649	[intelxl] Use default dummy MSI-X target address Use the default dummy MSI-X target address that is now allocated and configured automatically by pci_msix_enable(). Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-09 16:37:14 +01:00
Michael Brown	4224f574da	[pci] Map all MSI-X interrupts to a dummy target address by default Interrupts as such are not used in iPXE, which operates in polling mode. However, some network cards (such as the Intel 40GbE and 100GbE NICs) will defer writing out completions until the point of asserting an MSI-X interrupt. From the point of view of the PCI device, asserting an MSI-X interrupt is just a 32-bit DMA write of an opaque value to an opaque target address. The PCI device has no way to know whether or not the target address corresponds to a real APIC. We can therefore trick the PCI device into believing that it is asserting an MSI-X interrupt, by configuring it to write an opaque 32-bit value to a dummy target address in host memory. This is sufficient to trigger the associated write of the completions to host memory. Allocate a dummy target address when enabling MSI-X on a PCI device, and map all interrupts to this target address by default. Signed-off-by: Michael Brown <mcb30@ipxe.org>	2025-10-09 16:29:29 +01:00
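
The [pci] commits 94902ae187 and fde35ff003 above describe sizing and setting potentially 64-bit BARs, with memory and I/O decoding disabled so that a half-written address is never decoded. The following is a minimal sketch of that sequence against a simulated device. It is not iPXE's implementation: FakeDevice, bar_size(), bar_set() and with_decoding_disabled() are hypothetical names introduced for illustration, and only the register offsets and flag bits are taken from the PCI specification.

```python
# Standard PCI configuration space offsets and bits.
PCI_COMMAND = 0x04
PCI_COMMAND_IO = 0x0001
PCI_COMMAND_MEM = 0x0002
PCI_BAR0 = 0x10
BAR_IO = 0x1          # bit 0 set: I/O BAR
BAR_TYPE_MASK = 0x6   # bits 2:1: memory BAR type
BAR_TYPE_64 = 0x4     # 64-bit memory BAR
MASK64 = (1 << 64) - 1


class FakeDevice:
    """Hypothetical device exposing a single 64-bit memory BAR at BAR0."""

    def __init__(self, size, base):
        self.addr_mask = MASK64 & ~(size - 1)  # low address bits are hardwired to zero
        self.command = PCI_COMMAND_MEM
        self.bar = base & self.addr_mask
        self.written_while_decoding = False    # records the failure mode described above

    def read32(self, offset):
        if offset == PCI_COMMAND:
            return self.command
        value = self.bar | BAR_TYPE_64
        if offset == PCI_BAR0:
            return value & 0xFFFFFFFF
        if offset == PCI_BAR0 + 4:
            return value >> 32
        return 0

    def write32(self, offset, value):
        if offset == PCI_COMMAND:
            self.command = value & 0xFFFF
            return
        if self.command & (PCI_COMMAND_IO | PCI_COMMAND_MEM):
            self.written_while_decoding = True
        if offset == PCI_BAR0:
            self.bar = ((self.bar & ~0xFFFFFFFF) | value) & self.addr_mask
        elif offset == PCI_BAR0 + 4:
            self.bar = ((self.bar & 0xFFFFFFFF) | (value << 32)) & self.addr_mask


def with_decoding_disabled(dev, func):
    """Run func() with memory and I/O decoding turned off, then restore."""
    saved = dev.read32(PCI_COMMAND)
    dev.write32(PCI_COMMAND, saved & ~(PCI_COMMAND_IO | PCI_COMMAND_MEM))
    try:
        return func()
    finally:
        dev.write32(PCI_COMMAND, saved)


def is_64bit(low):
    return not (low & BAR_IO) and (low & BAR_TYPE_MASK) == BAR_TYPE_64


def bar_size(dev, reg):
    """Size a BAR by writing all-ones and reading back the writable bits."""
    def probe():
        low = dev.read32(reg)
        wide = is_64bit(low)
        high = dev.read32(reg + 4) if wide else 0
        dev.write32(reg, 0xFFFFFFFF)
        if wide:
            dev.write32(reg + 4, 0xFFFFFFFF)
        sized = dev.read32(reg)
        if wide:
            sized |= dev.read32(reg + 4) << 32
        # Restore the original address before decoding is re-enabled.
        dev.write32(reg, low)
        if wide:
            dev.write32(reg + 4, high)
        flags = 0x3 if low & BAR_IO else 0xF
        width_mask = MASK64 if wide else 0xFFFFFFFF
        return (~(sized & ~flags) + 1) & width_mask
    return with_decoding_disabled(dev, probe)


def bar_set(dev, reg, addr):
    """Set a BAR base address using one or two 32-bit writes."""
    def program():
        wide = is_64bit(dev.read32(reg))
        dev.write32(reg, addr & 0xFFFFFFFF)
        if wide:
            dev.write32(reg + 4, addr >> 32)
    with_decoding_disabled(dev, program)
```

In this sketch the simulated device flags any BAR write that arrives while decoding is enabled, which is the condition that the hypervisors mentioned in fde35ff003 reportedly do not handle.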


7231 Commits
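
The [tls] commits ab64bc5b8d and d6656106e9 above refer to the Extended Master Secret calculation from RFC 7627, which uses the digest of the handshake messages up to and including the Client Key Exchange rather than just the client and server random bytes. The following is a minimal sketch of the two calculations for TLS 1.2 with a SHA-256 PRF. It is an illustration only and not iPXE's code: the function names are hypothetical, and the handshake messages are passed in as an already-collected list of byte strings.

```python
import hashlib
import hmac


def p_sha256(secret, seed, length):
    """TLS 1.2 P_hash expansion (RFC 5246 section 5) using HMAC-SHA256."""
    out = b""
    a = seed  # A(0)
    while len(out) < length:
        a = hmac.new(secret, a, hashlib.sha256).digest()          # A(i)
        out += hmac.new(secret, a + seed, hashlib.sha256).digest()
    return out[:length]


def prf(secret, label, seed, length):
    """TLS 1.2 PRF: P_hash over the label concatenated with the seed."""
    return p_sha256(secret, label + seed, length)


def master_secret(pre_master, client_random, server_random):
    """Original calculation: depends only on the two random values."""
    return prf(pre_master, b"master secret",
               client_random + server_random, 48)


def extended_master_secret(pre_master, handshake_messages):
    """RFC 7627 calculation: depends on a hash of the handshake so far.

    handshake_messages must cover everything up to and including the
    Client Key Exchange, which is why the secret can only be generated
    after that message has been sent.
    """
    session_hash = hashlib.sha256(b"".join(handshake_messages)).digest()
    return prf(pre_master, b"extended master secret", session_hash, 48)
```

Because the session hash covers the whole handshake, changing any handshake message changes the resulting secret, whereas the original calculation is unaffected by anything other than the two random values.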