fran/mesa - mesa - GNLUG git store

fran/mesa

Author	SHA1	Message	Date
Francisco Jerez	2890c17cb9	intel/fs/xe2: Fix up subdword integer region restriction with strided byte src and packed byte dst. This fixes a corner case of the LNL sub-dword integer restrictions that wasn't being detected by has_subdword_integer_region_restriction(), specifically: > if(Src.Type==Byte && Dst.Type==Byte && Dst.Stride==1 && W!=2) { > // ... > if(Src.Stride == 2) && (Src.UniformStride) && (Dst.SubReg%32 == Src.SubReg/2 ) { Allowed } > // ... > } All the other restrictions that require agreement between the SubReg number of source and destination only affect sources with a stride greater than a dword, which is why has_subdword_integer_region_restriction() was returning false except when "byte_stride(srcs[i]) >= 4" evaluated to true, but as implied by the pseudocode above, in the particular case of a packed byte destination, the restriction applies for source strides as narrow as 2B. The form of the equation that relates the subreg numbers is consistent with the existing calculations in brw_fs_lower_regioning (see required_src_byte_offset()), we just need to enable lowering for this corner case, and change lower_dst_region() to call lower_instruction() recursively, since some of the cases where we break this restriction are copy instructions introduced by brw_fs_lower_regioning() itself trying to lower other instructions with byte destinations. This fixes some Vulkan CTS test-cases that were hitting these restrictions with byte data types. Fixes: `217d412360` ("intel/fs/gfx20+: Implement sub-dword integer regioning restrictions.") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30630> (cherry picked from commit `0ad835a929`)	2024-11-15 12:07:09 +01:00
Sam Lantinga	5eea24a465	util: Fixed crash in HEVC encoding on 32-bit systems This builds on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/25059, and extends that change to all 32-bit systems. This fixes a crash on SteamOS with the following test case: unsigned char data[] = { 0x00, 0x00, 0x00, 0x01, 0x40, 0x01, 0x0c, 0x01, 0xff, 0xff, 0x01, 0x60, 0x00, 0x00, 0x03, 0x00, 0xb0, 0x00, 0x00, 0x03, 0x00, 0x00, 0x03, 0x00, 0x99, 0x2c, 0x0c, 0x01, 0x64, 0x7c, 0x00, 0x7c, 0xd2, 0x56, 0x01, 0x40, 0x00, 0x00, 0x00, 0x01, 0x42, 0x01, 0x01, 0x01, 0x60, 0x00, 0x00, 0x03, 0x00, 0xb0, 0x00, 0x00, 0x03, 0x00, 0x00, 0x03, 0x00, 0x99, 0xa0, 0x02, 0x80, 0x80, 0x32, 0x16, 0x24, 0xbb, 0x90, 0x84, 0x48, 0x9a, 0x83, 0x03, 0x03, 0x02, 0x00, 0xb2, 0x3e, 0x00, 0x3e, 0x69, 0x2b, 0x00, 0x5f, 0x08, 0x04, 0x10, 0x00, 0x00, 0x00, 0x01, 0x44, 0x01, 0xc0, 0x62, 0x0f, 0x02, 0x24 }; vlVaContext context; vlVaBuffer buf; memset(&context, 0, sizeof(context)); memset(&buf, 0, sizeof(buf)); context.packed_header_emulation_bytes = true; buf.data = data; buf.size = sizeof(data); vlVaHandleVAEncPackedHeaderDataBufferTypeHEVC(&context, &buf); Cc: mesa-stable Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31596> (cherry picked from commit `4ed8ef74b4`)	2024-11-15 11:40:27 +01:00
Samuel Pitoiset	f5e91ecc27	radv: fix ignoring src stage mask when dst stage mask is BOTTOM_OF_PIPE Otherwise the driver doesn't synchronize if there are image layout transitions. This fixes rendering issues with displayable DCC (usually black squares in the bottom of screen). This mostly happens when an application uses a lower resolution than the screen supports and fshack (wine/proton) which upscales images uses COMPUTE_SHADER->BOTTOM_OF_PIPE for the barrier after a dispatch. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11547 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11600 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11789 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8705 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/9890 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32115> (cherry picked from commit `c08d2c40ed`)	2024-11-15 11:40:27 +01:00
Danylo Piliaiev	972edad884	nir/nir_opt_offsets: Do not fold load/store with const offset > max When (off_const > max) there is a wrap around uint when calling try_extract_const_addition. Exit early since folding doesn't make sense in this case. Cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32118> (cherry picked from commit `b501cbf153`)	2024-11-15 11:40:27 +01:00
Friedrich Vock	902736f5c1	vulkan/rmv: Correctly set heap size RMV expects the size to be in bits 5-68, not 4-68. Fixes: `845792db` ("vulkan: Add RMV file exporter") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31903> (cherry picked from commit `73d513c5be`)	2024-11-15 11:40:26 +01:00
Lionel Landwerlin	91c8694792	brw: allocate physical register sizes for spilling All of the spilling code should work with physical register units because for example SEND messages will expect a physical register as destination. So always allocate a full physical register for the spilled/unspilled values and adjust the offsets of the registers to physical sizes too. Cc: mesa-stable Fixes: `aa494cba` ("brw: align spilling offsets to physical register sizes") Closes: mesa/mesa#11967 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Found-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32124> (cherry picked from commit `a21cd8c5b6`)	2024-11-15 11:40:26 +01:00
David Rosca	797996b6cd	radv/video: Avoid selecting rc layer over maximum Vulkan spec doesn't say if this is allowed or not, but trying to do this will hang. Fixes: `4a19047d32` ("radv/video: Select temporal layer when encoding each frame") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31418> (cherry picked from commit `d1c1a33b35`)	2024-11-15 11:40:26 +01:00
David Rosca	0ed75e4e00	radv/video: Report correct encodeInputPictureGranularity Only aligned size can be encoded. Fixes: `54d499818c` ("radv/video: add initial support for encoding with h264.") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31418> (cherry picked from commit `e941acfb9d`)	2024-11-15 11:40:26 +01:00
David Rosca	6b1eaa99dd	radv/video: Fix HEVC slice control This needs to use aligned size, otherwise it will output two slices when the size is not 64 aligned. Fixes: `967e4e09de` ("radv/video: add h265 encode support") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31418> (cherry picked from commit `e4ec135d8b`)	2024-11-15 11:40:26 +01:00
David Rosca	e511bb9c19	radv/video: Fix H264 slice control This needs to use aligned size, otherwise it will output two slices when the size is not 16 aligned. Fixes: `54d499818c` ("radv/video: add initial support for encoding with h264.") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31418> (cherry picked from commit `6a121f1507`)	2024-11-15 11:40:26 +01:00
Eric Engestrom	664afa395b	.pick_status.json: Mark `1368ee5e1a` as denominated	2024-11-15 11:40:26 +01:00
Eric Engestrom	421eea5a07	.pick_status.json: Mark `d21f7f75ff` as denominated	2024-11-15 11:39:25 +01:00
Eric Engestrom	3a4d64b6ac	.pick_status.json: Mark `962b996d4c` as denominated	2024-11-15 11:37:54 +01:00
Eric Engestrom	13adaad55e	.pick_status.json: Mark `ca947e1295` as denominated	2024-11-15 11:37:51 +01:00
Eric Engestrom	a9d8ff7f8e	.pick_status.json: Mark `a78c2bf2a4` as denominated	2024-11-15 11:37:35 +01:00
Eric Engestrom	e0c0c625c8	.pick_status.json: Mark `ae85e6920c` as denominated	2024-11-15 11:37:24 +01:00
Eric Engestrom	b54746ef74	.pick_status.json: Update to `4ed8ef74b4`	2024-11-15 11:36:26 +01:00
Jesse Natalie	d253b4a05f	wgl: Add missing idep_mesautilformat Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/30874> (cherry picked from commit `f990322597`)	2024-11-13 17:43:51 +01:00
Eric Engestrom	2f6f108865	ci: raise priority of release manager pipelines KernelCI jobs have priority 44 and are very long-running jobs (and there might be an issue with the KernelCI that makes it create hundreds of jobs, @sergi is looking into that). While bumping to 45+ would be enough to allow Mesa release staging pipelines to run despite the KernelCI, during the CI meeting with @sergi and @mupuf it was determined that the Mesa releases are an important enough operation to warrant being a higher priority than user forks pipelines, so priority 70 was picked (still under the 75 of Marge pipelines). Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32093> (cherry picked from commit `50f9bec3ce`)	2024-11-13 12:41:03 +01:00
Lionel Landwerlin	cc372f4165	anv: update shader descriptor resource limits Some limits got stuck to the old binding table limits. Those don't apply anymore since EXT_descriptor_indexing was implemented. Fixes: `6e230d7607` ("anv: Implement VK_EXT_descriptor_indexing") Fixes: `96c33fb027` ("anv: enable direct descriptors on platforms with extended bindless offset") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31999> (cherry picked from commit `d6acb56f11`)	2024-11-13 12:38:54 +01:00
Tomeu Vizoso	c2c3d6ab61	etnaviv/ml: Fix includes etnaviv_ml.h uses dynarray, but the u_inlines.h header is needed by some of the files that include it. Fixes: `d6473ce28e` ("etnaviv: Use NN cores to accelerate convolutions") Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31842> (cherry picked from commit `70bff0c971`)	2024-11-13 12:38:53 +01:00
M Henning	d3cbd92d72	nvk: Fix invalidation of NVK_CBUF_TYPE_DYNAMIC_UBO Because dyn_start and dyn_end are indices into nvk_root_descriptor_table->dynamic_buffers, we would need to offset cbuf->dynamic_idx by nvk_root_descriptor_table->set_dynamic_buffer_start[cbuf->desc_set] in order to do those comparisons correctly. We could do that, but it's simpler and no less precise to simply re-use the same comparison that we do in the other cases here. This fixes a rendering artifact in Baldur's Gate 3 (Vulkan), which regressed with the commit listed below. Fixes: `091a945b57` ("nvk: Be much more conservative about rebinding cbufs") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32086> (cherry picked from commit `dc12c78235`)	2024-11-13 12:38:53 +01:00
M Henning	fa0b36b903	nvk/cmd_buffer: Pass count to set_root_array Previously, we were passing the end index which was incorrect. Also, improve the macros so that they can take an expression for the count. Fixes: `b2d85ca36f` ("nvk: Use helper macros for accessing root descriptors") Reviewed-by: Faith Ekstrand <faith.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32086> (cherry picked from commit `64f17c1391`)	2024-11-13 12:38:53 +01:00
Jose Maria Casanova Crespo	138bced84a	v3d: Enable Early-Z with discards when depth updates are disabled The Early-Z optimization is disabled when there is a discard instruction in the shader used in the draw call. But if discard is the only reason to disable Early-Z, and at draw call time the depth updates in the draw call are disabled, we can enable Early-Z using a shader variant. If there are occlusion queries active we also need to disable the Early-Z optimization. So this patch enables Early-Z in this scenario. The performance improvement is significant when running gfxbench benchmark showing an average improvement of 11.15% fps_avg helped: gl_gfxbench_aztec_high.trace: 3.13 -> 3.73 (19.13%) fps_avg helped: gl_gfxbench_aztec.trace: 4.82 -> 5.68 (17.88%) fps_avg helped: gl_gfxbench_manhattan31.trace: 5.10 -> 6.00 (17.59%) fps_avg helped: gl_gfxbench_manhattan.trace: 7.24 -> 8.36 (15.52%) fps_avg helped: gl_gfxbench_trex.trace: 19.25 -> 20.17 ( 4.81%) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32028> (cherry picked from commit `5b951bcdd7`)	2024-11-13 12:38:53 +01:00
Karmjit Mahil	11843f8a2e	nir: Fix `no_lower_set` leak on early return Addresses: ``` Indirect leak of 256 byte(s) in 2 object(s) allocated from: #0 0x7faaf53ee0 in __interceptor_malloc ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:145 #1 0x7fa8cfe900 in ralloc_size ../src/util/ralloc.c:118 #2 0x7fa8cfeb20 in rzalloc_size ../src/util/ralloc.c:152 #3 0x7fa8cff004 in rzalloc_array_size ../src/util/ralloc.c:232 #4 0x7fa8d06a84 in _mesa_set_init ../src/util/set.c:133 #5 0x7fa8d06bcc in _mesa_set_create ../src/util/set.c:152 #6 0x7fa8d0939c in _mesa_pointer_set_create ../src/util/set.c:613 #7 0x7fa95e5790 in nir_lower_mediump_vars ../src/compiler/nir/nir_lower_mediump.c:574 #8 0x7fa862c1c8 in tu_spirv_to_nir(tu_device, void, unsigned long, VkPipelineShaderStageCreateInfo const, tu_shader_key const, pipe_shader_type) ../src/freedreno/vulkan/tu_shader.cc:116 #9 0x7fa8646f24 in tu_compile_shaders(tu_device, unsigned long, VkPipelineShaderStageCreateInfo const, nir_shader, tu_shader_key const, tu_pipeline_layout, unsigned char const, tu_shader, char, void, nir_shader, VkPipelineCreationFeedback) ../src/freedreno/vulkan/tu_shader.cc:2741 #10 0x7fa85a16a4 in tu_pipeline_builder_compile_shaders ../src/freedreno/vulkan/tu_pipeline.cc:1887 #11 0x7fa85eb844 in tu_pipeline_builder_build<(chip)7> ../src/freedreno/vulkan/tu_pipeline.cc:3923 #12 0x7fa85e6bd8 in tu_graphics_pipeline_create<(chip)7> ../src/freedreno/vulkan/tu_pipeline.cc:4203 #13 0x7fa85c2588 in VkResult tu_CreateGraphicsPipelines<(chip)7>(VkDevice_T, VkPipelineCache_T, unsigned int, VkGraphicsPipelineCreateInfo const, VkAllocationCallbacks const, VkPipeline_T**) ../src/freedreno/vulkan/tu_pipeline.cc:4234 ``` seen in: dEQP-VK.binding_model.mutable_descriptor.single.switches.uniform_texel_buffer_storage_image.update_write.no_source.no_source.pool_expand_types.pre_update.no_array.vert Fixes: `7e986e5f04` ("nir/lower_mediump_vars: Don't lower mediump shared vars with atomic access.") Signed-off-by: Karmjit Mahil <karmjit.mahil@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32057> (cherry picked from commit `2a7df331af`)	2024-11-13 12:38:53 +01:00
Karmjit Mahil	969b7ba54d	tu: Fix potential alloc of 0 size We can end up calling vk_multialloc_alloc with 0 size when `attachment_count` is 0 and `clearValueCount` is 0. Addressed: ``` Direct leak of 1 byte(s) in 1 object(s) allocated from: #0 0x7faf033ee0 in __interceptor_malloc ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:145 #1 0x7fada5cc10 in vk_default_alloc ../src/vulkan/util/vk_alloc.c:26 #2 0x7fac50b270 in vk_alloc ../src/vulkan/util/vk_alloc.h:48 #3 0x7fac555040 in vk_multialloc_alloc ../src/vulkan/util/vk_alloc.h:234 #4 0x7fac555040 in void tu_CmdBeginRenderPass2<(chip)7>(VkCommandBuffer_T, VkRenderPassBeginInfo const, VkSubpassBeginInfo const*) ../src/freedreno/vulkan/tu_cmd_buffer.cc:4634 #5 0x7fac900760 in vk_common_CmdBeginRenderPass ../src/vulkan/runtime/vk_render_pass.c:261 ``` seen in: dEQP-VK.robustness.robustness2.bind.notemplate.r32i.dontunroll.nonvolatile.uniform_texel_buffer.no_fmt_qual.len_252.samples_1.1d.frag Fixes: `4cfd021e3f` ("turnip: Save the renderpass's clear values in the cmdbuf state.") Signed-off-by: Karmjit Mahil <karmjit.mahil@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32057> (cherry picked from commit `c923eff742`)	2024-11-13 12:38:53 +01:00
Karmjit Mahil	ed2af40089	tu: Fix push_set host memory leak on command buffer reset Addresses: ``` Direct leak of 192 byte(s) in 1 object(s) allocated from: #0 0x7fbe5e4230 in __interceptor_realloc ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:164 #1 0x7fbd008bf4 in vk_default_realloc ../src/vulkan/util/vk_alloc.c:37 #2 0x7fbbabb2fc in vk_realloc ../src/vulkan/util/vk_alloc.h:70 #3 0x7fbbaead38 in tu_push_descriptor_set_update_layout ../src/freedreno/vulkan/tu_cmd_buffer.cc:3173 #4 0x7fbbaeb0b4 in tu_push_descriptor_set ../src/freedreno/vulkan/tu_cmd_buffer.cc:3203 #5 0x7fbbaeb500 in tu_CmdPushDescriptorSet2KHR(VkCommandBuffer_T, VkPushDescriptorSetInfoKHR const) ../src/freedreno/vulkan/tu_cmd_buffer.cc:3235 #6 0x7fbbe35c80 in vk_common_CmdPushDescriptorSetKHR ../src/vulkan/runtime/vk_command_buffer.c:300 ``` seen in: dEQP-VK.binding_model.shader_access.secondary_cmd_buf.bind.with_push.sampler_mutable.tess_eval.multiple_discontiguous_descriptors.1d_array Fixes: `03294e1dd1` ("turnip: Keep a host copy of push descriptor sets.") Signed-off-by: Karmjit Mahil <karmjit.mahil@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32057> (cherry picked from commit `53c2d5e426`)	2024-11-13 12:38:53 +01:00
Job Noorman	c4ca3d53f9	ir3/ra: prevent moving source intervals for shared collects Non-trivial collects (i.e., ones that will introduce moves because the sources don't line-up with the destination) may cause source intervals to get implicitly moved when they are inserted as children of the destination interval. Since we don't support moving intervals in shared RA, this may cause illegal register allocations. Prevent this by creating a new top-level interval for the destination so that the source intervals will be left alone. Signed-off-by: Job Noorman <jnoorman@igalia.com> Fixes: `fa22b0901a` ("ir3/ra: Add specialized shared register RA/spilling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31978> (cherry picked from commit `b36a7ce0f1`)	2024-11-13 12:38:53 +01:00
Matt Turner	aa81c1093e	anv: Align anv_descriptor_pool::host_mem Otherwise anv_descriptor_set is accessed through an unaligned pointer, which is undefined behavior in C. ``` anv_descriptor_set.c:1620:17: runtime error: member access within misaligned address 0x61900002c2b5 for type 'struct anv_descriptor_set', which requires 8 byte alignment 0x61900002c2b5 ``` Fixes: `2570a58bcd` ("anv: Implement descriptor pools") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32070> (cherry picked from commit `a2c4a34303`)	2024-11-13 12:38:53 +01:00
Eric Engestrom	63f2ab67a2	.pick_status.json: Update to `b32d0d4b45`	2024-11-13 12:38:50 +01:00
Eric Engestrom	07cea4ca0a	.pick_status.json: Mark `5cd054ebe5` as denominated	2024-11-11 11:13:21 +01:00
Ian Romanick	5b2b92a5b2	brw/cse: Don't eliminate instructions that write flags With other changes in my tree, I observed this code from dEQP-VK.subgroups.vote.compute.subgroupallequal_float have the second cmp.z removed. undef(8) %69:UD cmp.z.f0.0(8) %69:F, %37:F, %57+0.0<0>:F mov(1) v58+0.0:D, 0d NoMask group0 (+f0.0) mov(1) v58+0.0:D, -1d NoMask group0 cmp.nz.f0.0(8) null:D, v58+0.0<0>:D, 0d ... undef(8) %72:UD cmp.z.f0.0(8) %72:F, %37:F, %57+0.0<0>:F mov(1) v63+0.0:D, 0d NoMask group0 (+f0.0) mov(1) v63+0.0:D, -1d NoMask group0 This was also fixed by running dead-code elimination before CSE. That seems more like avoiding the problem than fixing it, though. I believe this affects shader-db results because leaving the second CMP in the shader can give more opportunities for cmod propagation. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `234c45c929` ("intel/brw: Write a new global CSE pass that works on defs") shader-db: All Intel platforms had similar results. (Lunar Lake shown) total cycles in shared programs: 922097690 -> 922260862 (0.02%) cycles in affected programs: 3178926 -> 3342098 (5.13%) helped: 130 HURT: 88 helped stats (abs) min: 2 max: 2194 x̄: 296.71 x̃: 16 helped stats (rel) min: <.01% max: 16.56% x̄: 1.86% x̃: 0.18% HURT stats (abs) min: 4 max: 11992 x̄: 2292.55 x̃: 47 HURT stats (rel) min: 0.04% max: 57.32% x̄: 11.82% x̃: 0.61% 95% mean confidence interval for cycles value: 320.36 1176.63 95% mean confidence interval for cycles %-change: 1.59% 5.73% Cycles are HURT. LOST: 2 GAINED: 1 fossil-db: Lunar Lake, Meteor Lake, Tiger Lake had similar results. (Lunar Lake shown) Totals: Instrs: 142022960 -> 142022928 (-0.00%); split: -0.00%, +0.00% Cycle count: 21995242782 -> 21995384040 (+0.00%); split: -0.00%, +0.00% Max live registers: 48013385 -> 48013343 (-0.00%) Totals from 507 (0.09% of 551441) affected shaders: Instrs: 886191 -> 886159 (-0.00%); split: -0.01%, +0.01% Cycle count: 69302492 -> 69443750 (+0.20%); split: -0.66%, +0.86% Max live registers: 94413 -> 94371 (-0.04%) DG2 Totals: Instrs: 152856370 -> 152856093 (-0.00%); split: -0.00%, +0.00% Cycle count: 17237159885 -> 17236804052 (-0.00%); split: -0.00%, +0.00% Fill count: 150673 -> 150631 (-0.03%) Max live registers: 31871520 -> 31871476 (-0.00%) Totals from 506 (0.08% of 633197) affected shaders: Instrs: 831795 -> 831518 (-0.03%); split: -0.04%, +0.01% Cycle count: 55578509 -> 55222676 (-0.64%); split: -1.38%, +0.74% Fill count: 2779 -> 2737 (-1.51%) Max live registers: 51383 -> 51339 (-0.09%) Ice Lake and Skylake had similar results. (Ice Lake shown) Totals: Instrs: 152017826 -> 152017793 (-0.00%); split: -0.00%, +0.00% Cycle count: 15180773451 -> 15180761166 (-0.00%); split: -0.00%, +0.00% Fill count: 106610 -> 106614 (+0.00%) Max live registers: 32195006 -> 32194966 (-0.00%) Totals from 411 (0.06% of 637268) affected shaders: Instrs: 705935 -> 705902 (-0.00%); split: -0.01%, +0.01% Cycle count: 47830019 -> 47817734 (-0.03%); split: -0.05%, +0.02% Fill count: 2865 -> 2869 (+0.14%) Max live registers: 42883 -> 42843 (-0.09%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32041> (cherry picked from commit `9aba731d03`)	2024-11-11 11:12:58 +01:00
Ian Romanick	a5c46206e3	brw/copy: Don't copy propagate through smaller entry dest size Copy propagation would incorrectly occur in this code mov(16) v4+2.0:UW, u0<0>:UW NoMask ... mov(8) v6+2.0:UD, v4+2.0:UD NoMask group0 to create mov(16) v4+2.0:UW, u0<0>:UW NoMask ... mov(8) v6+2.0:UD, u0<0>:UD NoMask group0 This has different behavior. I think I just made a mistake when I changed this condition in `e3f502e007`. It seems like this condition could be relaxed to cover cases like (note the change of destination stride) mov(16) v4+2.0<2>:UW, u0<0>:UW NoMask ... mov(8) v6+2.0:UD, v4+2.0:UD NoMask group0 I'm not sure it's worth it. No shader-db or fossil-db changes on any Intel platform. Even the code for the test case mentioned in the original commit did not change. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `e3f502e007` ("intel/fs: Allow copy propagation between MOVs of mixed sizes") Closes: #12116 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32041> (cherry picked from commit `80a5d158ae`)	2024-11-11 11:12:57 +01:00
Karol Herbst	d7db9b470e	nvc0: return NULL instead of asserting in nvc0_resource_from_user_memory Fixes: `212f1ab40e` ("nvc0: support PIPE_CAP_RESOURCE_FROM_USER_MEMORY_COMPUTE_ONLY") Acked-by: David Heidelberg <david@ixit.cz> Acked-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Reviewed-by: Daniel Stone <daniels@collabora.com> Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/27783> (cherry picked from commit `277925471e`)	2024-11-11 11:12:56 +01:00
Karol Herbst	3a7a04559b	nv/codegen: Do not use a zero immediate for tex instructions They aren't always legal for tex instructions, specifically for TXQ when an actual source is needed. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11999 Fixes: `85a31fa1fc` ("nv50/ir/nir: fix txq emission on MS textures") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32043> (cherry picked from commit `47a1565c3d`)	2024-11-11 11:12:55 +01:00
Eric Engestrom	afe62e9644	.pick_status.json: Update to `4d09cd7fa5`	2024-11-11 11:12:52 +01:00
Eric Engestrom	7e011814fe	meson: add dependencies needed by wsi_common_x11.c even on non-drm platforms Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11907 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32012> (cherry picked from commit `06cca41889`)	2024-11-07 18:38:37 +01:00
Benjamin Herrenschmidt	a47d11b736	dril: Fixup order of pixel formats in drilConfigs Having the RGB* formats before the BGR* formats in the table causes problems where under some circumstances, some applications end up with the wrong colors. The repro case for me is: Xvnc + mutter + chromium There was an existing comment in dri_fill_in_modes() which explained the problem. This was lost when dril_target.c was created. Fixes: `ec7afd2c24` ("dril: rework config creation") Fixes: `3de62b2f9a` ("gallium/dril: Compatibility stub for the legacy DRI loader interface") Signed-off-by: Benjamin Herrenschmidt <benh@kernel.crashing.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31950> (cherry picked from commit `e1098310da`)	2024-11-07 18:38:35 +01:00
Christian Gmeiner	d73b751b90	etnaviv: Fix incorrect pipe_nn creation When etna_screen_create(..) is called with gpu != NULL and npu == NULL, screen->pipe_nn is incorrectly set up. This leads to an unintended stream configuration for compute-only contexts, as determined by pipe = (compute_only && screen->pipe_nn) ? screen->pipe_nn : screen->pipe; To address this, extend the gpu != npu condition by adding a check for npu != NULL to ensure pipe_nn is only initialized when both gpu and npu are provided. Fixes: `a4653587cc` ("etnaviv: Add a separate NPU pipe") Signed-off-by: Christian Gmeiner <cgmeiner@igalia.com> Reviewed-by: Philipp Zabel <p.zabel@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/32025> (cherry picked from commit `f4e8849d79`)	2024-11-07 18:38:32 +01:00
Rhys Perry	bb0502e57b	aco: don't byte align global VMEM loads if it might be unsafe Using the byte align path can be unsafe even when 12 byte loads are supported. fossil-db (navi21): Totals from 185 (0.23% of 79395) affected shaders: Instrs: 391501 -> 391575 (+0.02%); split: -0.03%, +0.05% CodeSize: 2147336 -> 2147672 (+0.02%); split: -0.03%, +0.05% Latency: 3762613 -> 3860941 (+2.61%); split: -0.01%, +2.62% InvThroughput: 871429 -> 888013 (+1.90%); split: -0.08%, +1.98% VClause: 9712 -> 10210 (+5.13%) Copies: 53775 -> 53010 (-1.42%); split: -1.46%, +0.04% VALU: 254009 -> 252146 (-0.73%) SALU: 56698 -> 56699 (+0.00%); split: -0.00%, +0.00% VMEM: 18503 -> 19601 (+5.93%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Fixes: `391bf3ea30` ("aco: don't expand smem/mubuf global loads") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31807> (cherry picked from commit `b318fe47e9`)	2024-11-07 18:38:29 +01:00
Lionel Landwerlin	fe1157896f	anv: add texture cache inval after binding pool update Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31928> (cherry picked from commit `f9e76e8ca6`)	2024-11-07 18:38:14 +01:00
Lionel Landwerlin	fbb3dbdf7a	anv: fix event set/reset on blitter engine Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: José Roberto de Souza <jose.souza@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31928> (cherry picked from commit `b3f487bd0d`)	2024-11-07 18:38:07 +01:00
Lionel Landwerlin	6b3af03586	vulkan/runtime: fix allocation failure handling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `93d0c66b27` ("vulkan/pipeline_cache: Add helpers for storing NIR in the cache") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31982> (cherry picked from commit `2cadab5dcf`)	2024-11-07 18:38:06 +01:00
Samuel Pitoiset	d8e49ead8e	radv: cleanup tools related resources when destroying logical device This was missing. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31986> (cherry picked from commit `64774f9c19`)	2024-11-07 18:38:04 +01:00
itycodes	4962a83c21	intel: Fix a typo in intel_device_info.c:has_get_tiling The structs are of equal size and both ioctls were added at the same time, so the functionality is equivalent, but it's nonetheless the incorrect type being passed. Signed-off-by: tranquillitycodes@proton.me Fixes: `762e601f77` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31974> (cherry picked from commit `10c92cbd39`)	2024-11-07 18:38:03 +01:00
Marek Olšák	ee5ae8c717	radeonsi/gfx12: fix AMD_DEBUG=nodcc not working surface->modifier is always 0 here. We should use the parameter instead. Fixes: `3d05d86d88` (radeonsi/gfx12: add DCC) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31910> (cherry picked from commit `5d09374ffe`)	2024-11-07 18:38:02 +01:00
Marek Olšák	c19d4af9d8	radeonsi/gfx11: fix Z corruption for Blender The corruption only happens with non-TC-compatible HTILE, so always use TC-compatible HTILE. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/11891 Cc: mesa-stable Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31910> (cherry picked from commit `047532b1e1`)	2024-11-07 18:38:01 +01:00
Lucas Fryzek	34ed9f05f8	lp: Only close udmabuf handle if it's valid Also change ifdefs from just `HAVE_LIBDRM` to check for both LIBDRM and the UDMABUF header. This prevents unbalanced guards from excluding part of the code when only LIBDRM or only the udmabuf headers are available. Fixes: `4cfaf10c` ("llvmpipe: Only use udmabuf with libdrm") Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31877> (cherry picked from commit `159fb9691d`)	2024-11-07 18:38:00 +01:00
Eric Engestrom	19d06c73f1	.pick_status.json: Update to `fe50011ddb`	2024-11-07 18:37:51 +01:00
Rob Clark	e143fabe22	freedreno/a6xx: Stop exposing MSAA image load/store harder Fixes KHR-GL46.multi_bind.dispatch_bind_image_textures which decides max_image_samples==1 means that MSAA image load/store is supported. Switch the condition to > 0, which matches what zink does. Fixes: `e277b13182` ("freedreno: Stop exposing MSAA image load/store on desktop GL.") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/31857> (cherry picked from commit `f8e7c0e2a2`)	2024-11-02 20:28:35 +01:00
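
The nir/nir_opt_offsets entry in the table above (`972edad884`) describes an unsigned wrap-around when the constant offset already exceeds the maximum, and fixes it with an early exit. The following is a minimal C sketch of that kind of guard, not the actual Mesa code. It assumes the remaining fold budget is computed as `max - off_const`; the function names `unchecked_budget` and `remaining_fold_budget` are hypothetical and do not exist in Mesa.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for an unguarded computation: with unsigned
 * arithmetic, max - off_const wraps around to a huge value whenever
 * off_const > max. */
static uint32_t unchecked_budget(uint32_t off_const, uint32_t max)
{
   return max - off_const;
}

/* Hypothetical guarded variant: report failure instead of wrapping.
 * Returns true and stores the remaining budget only when the existing
 * constant offset still fits under max. */
static bool remaining_fold_budget(uint32_t off_const, uint32_t max,
                                  uint32_t *remaining)
{
   assert(remaining != NULL);

   /* Exit early: nothing can be folded if we are already past max. */
   if (off_const > max)
      return false;

   *remaining = max - off_const;
   return true;
}
```

With `off_const = 5` and `max = 4`, the unguarded helper yields `UINT32_MAX`, which a caller could mistake for a large remaining budget. The guarded helper returns false in that case, so the caller can skip folding.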

763 Commits