| Age | Commit message | Author | 
|---|
|  | vk_shader_decompiler: Use Visit instead of reimplementing it | 
|  | shader/memory: Implement LDG.U8 and unaligned U8 loads | 
|  | ExprCondCode visit implements the generic Visit. Use this instead of
that one.
As an intended side effect this fixes unwritten memory usages in cases
when a negation of a condition code is used. | 
|  | shader/conversion: Implement byte selector in I2F | 
|  | shader/texture: Properly shrink unused entries in size mismatches | 
|  | gl_shader_decompiler: Add missing DeclareImages | 
|  | shader_bytecode: Fix TLD4S encoding | 
|  | vk_scheduler: Delegate commands to a worker thread and state track | 
|  | LDG can load single bytes instead of full integers or packs of integers.
These have the advantage of loading bytes that are not aligned to 4
bytes.
To emulate these, this commit gets the byte being referenced (by doing
"address & 3") and then uses that to extract the byte from the loaded
integer:
result = bitfieldExtract(loaded_integer, (address % 4) * 8, 8) | 
|  | I2F's byte selector is used to choose which byte to convert to float.
E.g. if the input is 0xaabbccdd and the selector is ".B3", it will
convert 0xaa. The default (when it's not shown in nvdisasm) is ".B0"; in
that example the default would convert 0xdd to float. | 
|  | When an image format mismatches, we were inserting zeroes into the texture
itself. This did not handle cases where the mismatch uses fewer
coordinates than the guest shader code. Address that by resizing the
vector. | 
|  |  | 
|  |  | 
|  | common: SPSCQueue: Notify after incrementing queue size. | 
|  |  | 
|  | renderer_opengl: Miscellaneous clean ups | 
|  | Corrections and fixes to TLD4S & bindless samplers failing | 
|  | maxwell_to_vk: Use VK_EXT_index_type_uint8 and misc changes | 
|  | gl_device: Enable compute shaders for Intel Mesa drivers | 
|  | gl_shader_cache: Add missing new-line on emitted GLSL | 
|  | A1B5G5R5 uses A1R5G5B5. This is flipped with image view swizzles;
flushing is still not properly implemented on Vulkan for this particular
format. | 
|  |  | 
|  | Add an extra argument to query device capabilities in the future. The
intention behind this is to use native quads, quad strips, line loops
and polygons if these are released for Vulkan. | 
|  | The OpenGL spec defines GL_CLAMP's formula similarly to CLAMP_TO_EDGE
and CLAMP_TO_BORDER depending on the filter mode used. It doesn't
exactly behave like this, but it's the closest we can get with what
Vulkan offers without emulating it by injecting shader code. | 
|  |  | 
|  | Introduce a worker thread approach for delegating Vulkan work derived
from dxvk's approach. https://github.com/doitsujin/dxvk
Now that the scheduler is what handles all Vulkan work related to
command streaming, store state tracking in itself. This way we can know
when to reupload Vulkan dynamic state to the queue (since this one is
invalidated between command buffers unlike NVN). We can also store the
renderpass state and graphics pipeline bound to avoid redundant binds
and renderpass begins/ends. | 
|  | kernel/svc: Amend function signature of SignalProcessWideKey | 
|  | Added missing include | 
|  |  | 
|  |  | 
|  |  | 
|  |  | 
|  | implemented.
This commit ensures the OGL backend does not execute tessellation shader 
stages as they are currently unimplemented. | 
|  | shader: Implement MEMBAR.GL | 
|  |  | 
|  | * Kernel: Correct behavior of Address Arbiter threads.
This corrects arbitration threads to behave just like in Horizon OS.
They are added into a container and released according to what priority
they had when added. Horizon OS does not reorder them if their priority
changes.
* Kernel: Address Feedback. | 
|  | This function doesn't actually return a result code, so we can amend the
signature of it to match. | 
|  | Previously we naively checked for "Intel" in GL_VENDOR, but this
matches both Intel's proprietary driver and the Mesa driver. Re-enable
compute shaders for Mesa. | 
|  | Add missing new-line. This caused shaders using local memory and shared
memory to inject a preprocessor GLSL line after an expression (resulting
in invalid code).
It looked like this:
shared uint smem[8];#define LOCAL_MEMORY_SIZE 16
It should look like this (addressed by this commit):
shared uint smem[8];
\#define LOCAL_MEMORY_SIZE 16 | 
|  | kernel/svc: Provide implementations for svcDumpInfo/svcDumpInfoNew | 
|  | This commit finishes adding depth mode that was reverted before due to
other unresolved issues. | 
|  | Implement using memoryBarrier in GLSL and OpMemoryBarrier on SPIR-V. | 
|  |  | 
|  |  | 
|  |  | 
|  | Update Sirit and its usage in vk_shader_decompiler. Highlights:
- Implement tessellation shaders
- Implement geometry shaders
- Implement some missing features
- Use native half float instructions when available. | 
|  |  | 
|  |  | 
|  | - Setup more features and requirements.
- Improve logging for missing features.
- Collect telemetry parameters.
- Add queries for more image formats.
- Query push constants limits.
- Optionally enable some extensions. | 
|  | maxwell_3d: Add tessellation state entries |
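
The LDG.U8 and I2F byte-selector entries above both reduce to extracting one byte from a 32-bit word. The following is a minimal C++ sketch of that arithmetic, not the code from those commits: the helper names `BitfieldExtract`, `LoadU8` and `SelectByte` are hypothetical, and the byte numbering assumes the layout described in the messages (byte 0 is the least significant).

```cpp
#include <cstdint>

// Hypothetical helper with the same meaning as GLSL's unsigned
// bitfieldExtract(value, offset, bits).
// Assumes 1 <= bits <= 31 and offset + bits <= 32.
constexpr std::uint32_t BitfieldExtract(std::uint32_t value, std::uint32_t offset,
                                        std::uint32_t bits) {
    return (value >> offset) & ((1u << bits) - 1u);
}

// LDG.U8 emulation as described in the commit message: the low two address
// bits pick which byte of the aligned 32-bit word was requested.
constexpr std::uint32_t LoadU8(std::uint32_t loaded_integer, std::uint64_t address) {
    const auto byte_index = static_cast<std::uint32_t>(address & 3);
    return BitfieldExtract(loaded_integer, byte_index * 8, 8);
}

// I2F byte selector: ".B0" (selector 0) is the lowest byte, ".B3" the highest.
constexpr std::uint32_t SelectByte(std::uint32_t input, std::uint32_t selector) {
    return BitfieldExtract(input, selector * 8, 8);
}
```

With the input from the I2F message, `SelectByte(0xaabbccdd, 3)` yields `0xaa` and `SelectByte(0xaabbccdd, 0)` yields `0xdd`. Since `address & 3` and `address % 4` are equal for unsigned addresses, `LoadU8` matches the `bitfieldExtract(loaded_integer, (address % 4) * 8, 8)` expression quoted in the log.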