fran/mesa - mesa - GNLUG git store

fran/mesa

Author	SHA1	Message	Date
Samuel Pitoiset	d1daf01b3a	radv: fix ignoring graphics shader stages that don't need to be imported If a shader stage is already imported from a library it should be properly ignored. Fixes recent CTS dEQP-VK.pipeline.fast_linked_library.misc.unused_shader_stages*. Fixes: `c8765c5244` ("radv: ignore shader stages that don't need to be imported with GPL") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20899> (cherry picked from commit `b97fee432c`)	2023-01-26 15:40:35 +00:00
Lionel Landwerlin	5a2ca86429	vulkan/wsi/wayland: improve same gpu detection Some compositor like KWin do not return the render node. v2: Make sure we test if only drm_info.hasPrimary is true (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `db42ed1e04` ("vulkan/wsi/wl: correctly find whether the compositor uses the same GPU") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8139 Reviewed-by: Simon Ser <contact@emersion.fr> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20914> (cherry picked from commit `e27d217fb1`)	2023-01-26 15:40:35 +00:00
Francisco Jerez	8f0b387d94	intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation. The ACP entries created by copy propagation to track the implied copies of LOAD_PAYLOAD instructions don't model the behavior of LOAD_PAYLOAD correctly, since (as of `41868bb682`) header moves are implicitly retyped to UD and the destination of non-header copies implicitly uses the same type as the corresponding source, even though the ACP entries created for such copies could incorrectly represent a type conversion, which can lead to mis-optimization of the program. According to Marcin, this fixes the func.mesh.ext.workgroup_id.task.q0 crucible test. Fixes: `41868bb682` ("i965/fs: Rework the fs_visitor LOAD_PAYLOAD instruction") Reported-by: Marcin Ślusarz <marcin.slusarz@intel.com> Tested-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/18980> (cherry picked from commit `7b5e933629`)	2023-01-26 15:40:35 +00:00
Mike Blumenkrantz	67f2d07eff	zink: fix VK_DYNAMIC_STATE_LINE_WIDTH usage add a special tracker here to set the state only when necessary Fixes: `659c39fafb` ("zink: rework primitive rasterization type logic") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20886> (cherry picked from commit `06a125942b`)	2023-01-26 15:40:35 +00:00
Italo Nicola	3a65dc4f7f	panfrost: fix off-by-one when exporting format modifiers `count` should not be incremented before the check, because it causes the modifiers array to be filled starting from position 1 instead of 0. This bug causes one less format modifier to be available than would otherwise be expected, which could then lead to a dmabuf query failing in situations where a supported modifier wouldn't be advertised. It also causes garbage data to be advertised as a modifier in position 0 of the array, although this is not very likely to cause issues. Fixes: `2a1217513` ("panfrost: Implement panfrost_query_dmabuf_modifiers") Cc: mesa-stable Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20879> (cherry picked from commit `6c446377ff`)	2023-01-26 15:40:35 +00:00
Pierre-Eric Pelloux-Prayer	518487158a	radeonsi/gfx11: fix ge_cntl programming gfx11 renamed PRIM_GRP_SIZE to VERTS_PER_SUBGRP but another change was was missed. Update our code based on PAL's UniversalCmdBuffer::CalcGeCntl function (especially useVgtOnchipCntlForTess being false for gfx11). Fixes: `25a66477d0` ("radeonsi/gfx11: register changes") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20728> (cherry picked from commit `f73cdda983`)	2023-01-26 15:40:35 +00:00
Emma Anholt	8d8987577d	zink: Re-emit the SpvBuiltInSampleMask access chain each load. Otherwise, the access chain you emitted last time may not dominate the current use. Fixes the following validation failure in dEQP-GLES31.functional.shaders.sample_variables.sample_mask_in.bits_unique_per_sample.multisample_texture_2: UNASSIGNED-CoreValidation-Shader-InconsistentSpirv(ERROR / SPEC): msgNum: 7060244 - Validation Error: [ UNASSIGNED-CoreValidation-Shader-InconsistentSpirv ] Object 0: handle = 0x55cf3cea2c60, type = VK_OBJECT_TYPE_DEVICE; \| MessageID = 0x6bbb14 \| SPIR-V module not valid: ID '67[%67]' defined in block '23[%23]' does not dominate its use in block '31[%31]' Fixes: `8899f6a198` ("zink: fix gl_SampleMaskIn spirv generation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20756> (cherry picked from commit `4286633eec`)	2023-01-26 15:40:35 +00:00
Emma Anholt	135425a3ae	zink: Fix up mismatches of memory model vs addressing model. MemoryModelVulkan was left out for CSes using it. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20756> (cherry picked from commit `1e4deb3b89`)	2023-01-26 15:40:35 +00:00
Emma Anholt	617fdb8818	zink: Fix validation failure for maxLod < minLod. GL lets you set a silly state, so do something plausible instead of undefined. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20756> (cherry picked from commit `adf81044d4`)	2023-01-26 15:40:35 +00:00
Emma Anholt	4671e270df	zink: Add missing Flat decorations on some inputs. Fixes validation failures: Test case 'dEQP-GLES31.functional.android_extension_pack.shaders.es32.extension_directive.oes_sample_variables'.. MESA: error: Validation Error: [ UNASSIGNED-CoreValidation-Shader-InconsistentSpirv ] Object 0: handle = 0x563a1838b790, type = VK_OBJECT_TYPE_DEVICE; \| MessageID = 0x6bbb14 \| SPIR-V module not valid: [VUID-StandaloneSpirv-Flat-04744] Fragment OpEntryPoint operand 31 with Input interfaces with integer or float type must have a Flat decoration for Entry Point id 4. %gl_SampleId = OpVariable %_ptr_Input_uint Input Test case 'KHR-GL46.shader_ballot_tests.ShaderBallotAvailability'.. MESA: error: Validation Error: [ UNASSIGNED-CoreValidation-Shader-InconsistentSpirv ] Object 0: handle = 0x5558e12f17e0, type = VK_OBJECT_TYPE_DEVICE; \| MessageID = 0x6bbb14 \| SPIR-V module not valid: [VUID-StandaloneSpirv-Flat-04744] Fragment OpEntryPoint operand 28 with Input interfaces with integer or float type must have a Flat decoration for Entry Point id 4. %gl_SubgroupLocalInvocationId = OpVariable %_ptr_Input_uint Input Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20756> (cherry picked from commit `2a33d509ca`)	2023-01-26 15:40:35 +00:00
Marcin Ślusarz	e79f4e0b12	intel/compiler/mesh: handle const data in task & mesh programs Started showing up when nir_opt_large_constants call was moved in `88756cee8d`. Fixes dEQP-VK.mesh_shader.ext.smoke.monolithic.fullscreen_gradient* Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Fixes: `88756cee8d` ("intel/compiler: Run nir_opt_large_constants before scalarizing consts") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20876> (cherry picked from commit `536a2acfc2`)	2023-01-26 15:40:34 +00:00
Lionel Landwerlin	1c243f4f8b	intel/fs: avoid cmod optimization on instruction with different write_mask I've been running into failures with tests like : dEQP-VK.robustness.robustness2.bind.notemplate.rgba32i.unroll.nonvolatile.uniform_buffer_dynamic.no_fmt_qual.len_4.samples_1.1d.frag With the load_global_const_block_intel NIR intrinsic, you can load a vec8/vec16 with a predicate. The predicate is correctly uniformized to feed into the SEND instruction's flag register. The problem is that a series of optimization first remove the find_live_channel and then changes the broadcast into a simple MOV instruction, on the assumption that the first channel is always active if there is not control flow. This is correct. But after that the cmod optimzation will remove this instruction : mov.nz.f0.0(16) null:D, vgrf16+0.0<0>:D NoMask because it seems to be equivalent to : cmp.g.f0.0(16) vgrf16:D, vgrf12:D, 63d In this case vgrf16 is the predicate to the load block SEND instruction. Since the execution mask is different between both, some of the channels of the SEND instruction end up not being loaded or loaded with the wrong predication and we end up with incorrect UBO data. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20852> (cherry picked from commit `a50d2fdb46`)	2023-01-26 15:40:34 +00:00
Mike Blumenkrantz	4502264786	zink: use actual swapchain object for surface comparison the outer swapchain object is persistent, which means checking it will never yield an update after the first check fixes #8122 Fixes: `b2739c9f00` ("zink: set surface->dt when updating swapchain" Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20814> (cherry picked from commit `474ed4b877`)	2023-01-26 15:40:34 +00:00
Erik Faye-Lund	d240b30e35	radeonsi: respect smoothing_enabled When this was last changed, the smoothing_enabled flag seems to have been forgotten about, breaking line-smoothing (and probably also polygon smoothing). Fixes: `4147add280` ("radeonsi: update db_eqaa even if msaa is disabled") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20810> (cherry picked from commit `9f4f131f2e`)	2023-01-26 15:40:34 +00:00
Jonathan Gray	8e2e275985	egl/dri2: avoid undefined unlocks unlocks were incorrectly added to paths using dri2_egl_display() as well as those using dri2_egl_display_lock() pthread_mutex_unlock() when unlocked is documented by posix as being undefined behaviour. On OpenBSD pthread_mutex_unlock() will call abort(3) if this happens. Fixes: `f1efe037df` ("egl/dri2: Add display lock") Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20712> (cherry picked from commit `0594b3c143`)	2023-01-26 15:40:34 +00:00
Marek Olšák	16fc1641cc	glthread: handle GL_*_ARRAY in glEnable/Disable Surprisingly, the GL compatibility profile allows these in both glEnableClientState and glEnable. Fixes: `0b1dd18591` - glthread: track which vertex array attribs are enabled Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20824> (cherry picked from commit `777166cc66`)	2023-01-26 15:40:34 +00:00
Marek Olšák	75a63104b0	mesa: allow GL_UNSIGNED_INT64_ARB as vertex format for ARB_bindless_texture This wasn't implemented, but the spec requires it. Fixes: `1fe7b1f972` - mesa: implement ARB_bindless_texture Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20824> (cherry picked from commit `721526227c`)	2023-01-26 15:40:34 +00:00
Marek Olšák	01185dcd60	util: fix util_is_vbo_upload_ratio_too_large It was wrong. For example, if the draw vertex count was 10 and the upload vertex count was 150, u_vbuf wouldn't unroll the draw and would instead memcpy 150 vertices. This fixes that case. Fixes: `068a3bf0d7` - util: move and adjust the vertex upload heuristic equation from u_vbuf Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20824> (cherry picked from commit `4f6e785876`)	2023-01-26 15:40:34 +00:00
Marek Olšák	6c869d993a	glthread: fix an upload buffer leak Fixes: `befbd54864` - glthread: don't use atomics for refcounting to decrease overhead on AMD Zen Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20804> (cherry picked from commit `4d4995b32b`)	2023-01-26 15:40:34 +00:00
Mike Blumenkrantz	7b38217922	zink: don't use ds3 blend states without color attachments this is illegal and causes validation errors cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20799> (cherry picked from commit `5d44318566`)	2023-01-26 15:40:34 +00:00
Mike Blumenkrantz	95961596d4	zink: delete need_blend_constants this is an artifact of very old code before the dynamic state was set for all graphics pipelines now the checks only cause blend constants to not be updated, which triggers bugs and validation failures cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20799> (cherry picked from commit `b4d18f2ad1`)	2023-01-26 15:40:33 +00:00
Rose Hudson	3b5dfeda81	radeonsi: report 0 block size for Polaris HEVC encoding makes encoded videos resemble the input again :) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7992 Fixes: `c4482a3c1a` ("radeonsi/vcn: enable multi-slice encoding") Reviewed-by: Ruijing Dong <ruijing.dong@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20714> (cherry picked from commit `e8a60633da`)	2023-01-26 15:40:33 +00:00
Tapani Pälli	cedf4e4d46	iris: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format, additionally for Gfx12+ field must be disabled when num multisamples > 1 or forced multisample count > 1. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7892 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `247c06d419`)	2023-01-26 15:40:33 +00:00
Tapani Pälli	74d02d26df	hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `58dd9d5134`)	2023-01-26 15:40:33 +00:00
Tapani Pälli	5d473b4282	anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable Field must be disabled if any render targets have integer format, additionally for Gfx12+ field must be disabled when num multisamples > 1 or forced multisample count > 1. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20671> (cherry picked from commit `9b37ef40f8`)	2023-01-26 15:40:33 +00:00
Danylo Piliaiev	5d747f334c	tu/kgsl: do not use kgsl_command_object::offset offset field in kgsl_command_object is NOT used by KGSL, so we should offset directly to iova. Fixes weird hangs on KGSL. E.g. fixes the hang in: dEQP-VK.memory.pipeline_barrier.transfer_dst_storage_texel_buffer.1024 cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20795> (cherry picked from commit `926f626b95`)	2023-01-26 15:40:33 +00:00
Mike Blumenkrantz	da1ac780bf	zink: preserve present resources during async presentation ensure that these have a lifetime great enough to be presented fixes #7781 cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20793> (cherry picked from commit `020db79340`)	2023-01-26 15:40:33 +00:00
Julia Tatz	8d955313dc	zink: correct sparse bo mem_type_idx placement VK_MEMORY_PROPERTY_DEVICE_LOCAL_BIT = 0x01 has been incidently the correct memory type index, but isn't guaranteed to be, which is why it hasn't caused issues yet Fixes: `f9515d93` ("zink: allocate/place memory using memoryTypeIndex directly") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20264> (cherry picked from commit `c71287e70c`)	2023-01-26 15:40:33 +00:00
Julia Tatz	87146b2158	zink: trival renames heap_idx -> memoryTypeIndex Trival renames to correctly identify vulkan memory type indices aren't the same as zink heaps Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20264> (cherry picked from commit `e20e8f2243`)	2023-01-26 15:40:33 +00:00
Julia Tatz	2ff2ffff18	zink: zink_heap isn't 1-to-1 with memoryTypeIndex Clarify the relationship between zink heaps and vulkan memory type indices, and resolve the issues from mixing the two up. Closes: #7588, #7813 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20264> (cherry picked from commit `f6d3a5755f`)	2023-01-26 15:40:33 +00:00
Mike Blumenkrantz	9fa0dd2f65	zink: handle modifier nplanes queries correctly for planar formats this just returns the number of planes in the base format as a default, which matches the behavior of other drivers cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20753> (cherry picked from commit `6ff334e54a`)	2023-01-26 15:40:32 +00:00
Mike Blumenkrantz	9f0fd0562f	zink: store drm format as internal_format for imported resources internal_format is the "real" format of a resource, and the "real" format of imported resources is the external-facing format, not the pipe format this ensures the correct format is available for internal ops, such as nplanes queries Fixes: `2e2775c11b` ("zink: fix PIPE_RESOURCE_PARAM_NPLANES with format modifier") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20753> (cherry picked from commit `072e29a22e`)	2023-01-26 15:40:32 +00:00
Samuel Pitoiset	a6292889be	radv: fix creating BC image views when the base layer is > 0 When the base array layer of the image view is > 0, addrlib computes the offset (in HwlComputeSubResourceOffsetForSwizzlePattern) which is then added to the base VA in RADV. But if the driver doesn't reset the base array layer, the hw will compute incorrect addressing (ie. base array will be added twice). This also matches AMDVLK. This fixes a VM fault followed by a GPU hang on RDNA2 when trying to join a multiplayer game with medium settings in Halo Infinite. Fixes: `98ba1e0d81` ("radv: Fix mipmap views on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20761> (cherry picked from commit `8d191b2cfb`)	2023-01-26 15:40:32 +00:00
Samuel Pitoiset	af03528185	radv: fix buffer to image copies with BC views on the graphics queue The color surface descriptor needs to be adjusted, otherwise addressing is wrong. Fixes tests performed on the graphics queue from dEQP-VK.api.copy_and_blit..image_to_buffer.2d_images.mip_copies_. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/7900 Fixes: `98ba1e0d81` ("radv: Fix mipmap views on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20761> (cherry picked from commit `18aaa373b7`)	2023-01-26 15:40:32 +00:00
Samuel Pitoiset	2414591e5c	radv: fix setting MAX_MIP for BC views MAX_MIP should always be the number of levels minus one from the hw perspective. This doesn't fix anything known. Fixes: `98ba1e0d81` ("radv: Fix mipmap views on GFX10+") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20761> (cherry picked from commit `aff5fe3f94`)	2023-01-26 15:40:32 +00:00
Pierre-Eric Pelloux-Prayer	c70ffcc8e3	glthread: fix glArrayElement handling This must be marshalled synchronously or the attrib pointers' content might change by the time we use them. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8068 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20748> (cherry picked from commit `ddc721e15c`)	2023-01-26 15:40:32 +00:00
Pierre-Eric Pelloux-Prayer	bc487cecdc	vbo: lower VBO_SAVE_BUFFER_SIZE to avoid large VRAM usage The ideal case for performance is to have a single buffer for all display list. The caveat is that large buffers are less likely to be freed because they're refcounted: it only takes 1 user (diplay list) to keep it in VRAM. This lowers VRAM usage when replaying the trace attached of the trace attached to !6140 from 5.5 GB to about 1.8 GB. Viewperf snx performance isn't affected. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6140 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20748> (cherry picked from commit `0f5c8c3dc3`)	2023-01-26 15:40:32 +00:00
Pierre-Eric Pelloux-Prayer	474f6f60c0	vbo: remove bogus assert grow_vertex_storage may call wrap_filled_vertex, which will trigger the assert incorrectly because the new size will be smaller than 'new_size' but it's correct because 'vertex_store->used' has been reset to 0. Fixes: `a08baaff97` ("vbo/dlist: fix indentation in vbo_save_api.c") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20748> (cherry picked from commit `491f6b138e`)	2023-01-26 15:40:32 +00:00
Lionel Landwerlin	90e1c36baa	nir/lower_io: fix bounds checking for 64bit_bounded_global If the offset is negative like it's the case in dEQP-VK.robustness.robustness2.bind.notemplate.r32i.unroll.volatile.storage_buffer_dynamic.readwrite.no_fmt_qual.len_256.samples_1.1d.comp we end up passing the bounds checking condition because it's using signed integers. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Jason Ekstrand <jason.ekstrand@collabora.com> Cc: mesa-stable Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20762> (cherry picked from commit `ff34e96701`)	2023-01-26 15:40:32 +00:00
Kenneth Graunke	89e679803b	intel/compiler: Drop redundant 32-bit expansion for shared float atomics We already expanded data to 32-bit a few lines earlier, so this is just redundantly doing it a second time. Fixes: `43169dbbe5` ("intel/compiler: Support 16 bit float ops") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20604> (cherry picked from commit `f7b29d7924`)	2023-01-26 15:40:32 +00:00
Francisco Jerez	7fe1d202d5	intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics. This avoids a violation of the Vulkan memory model that was leading to intermittent failures of at least 8k test-cases of the Vulkan CTS (within the group dEQP-VK.memory_model.) on TGL and DG2 platforms. In theory the issue may be reproducible on earlier platforms like IVB and ICL, but the SYNC.ALLWR instruction is not available on those platforms so a different (likely costlier) fix will be needed. The issue occurs within the sequence we emit for a NIR memory barrier with acquire semantics requiring the synchronization of multiple caches, e.g. in pseudocode for a barrier involving the TGM and UGM caches on DG2: x <- load.ugm // Atomic read sequenced-before the barrier y <- fence.ugm z <- fence.tgm wait(y, z) w <- load.tgm // Read sequenced-after the barrier In the example we must provide the guarantee that the memory load for x is completed before the one for w, however this ordering can be reversed with the intervention of a concurrent thread, since the UGM fence will block on the prior UGM load and potentially take a long time, while the TGM fence may complete and invalidate the TGM cache immediately, so a concurrent thread could pollute the TGM cache with stale contents for the w location before* the UGM load has completed, leading to an inversion of the expected memory ordering. v2: Apply the workaround regardless of whether the NIR barrier intrinsic specifies multiple storage classes or a single one, since an acquire barrier is required to order subsequent requests relative to previous atomic requests of unknown storage class not necessarily specified by the memory scope information of the intrinsic. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20690> (cherry picked from commit `4a2e7306dd`)	2023-01-26 15:40:31 +00:00
Paulo Zanoni	9098d83fb3	anv: check the return value of anv_execbuf_add_bo_bitset() Because anv_execbuf_add_bo_bitset() calls anv_execbuf_add_bo(), which can fail if its memory allocations fail. I have seen dEQP tests exercising memory allocation failures during anv_execbuf_add_bo(), but I don't think the path coming from add_bo_biset() was specifically exercised. Anyway, add the error check just in case. v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703> (cherry picked from commit `3d37950fd9`)	2023-01-26 15:40:31 +00:00
Paulo Zanoni	f2aaa18997	anv: don't leave undefined values in exec->syncobj_values In anv_execbuf_add_syncobj(), we try to not create or use exec->syncobj_values if we don't need to. But when we figure we're going to need it (i.e., when timeline_value is not zero), then we create exec->syncobj_values with vk_zalloc, which means every previous value is set to zero, as it should be. This is all correct. The problem starts when we add a 16th element. In this case we double exec->syncobj_array_length and realloc the buffer by using vk_alloc and copying the old array to the new one. After that, we write the timeline_value to the array only if it's not zero, and that's the problem: since we just used vkalloc and memcpy, we don't have any guarantees that the new array will be zero after the 16th element, and if timeline_value is zero we write nothing to that position. Once we start using exec->syncobj_values we have to commit to using it, so the "if (timeline_value)" check near the end of the function has to be changed to "if (exec->syncobj_values)", so we actually set elements after the 16th to zero when they need to be zero. Another approach to fix this would be to memset the new elements once we double syncobj_array_length. In practice, I couldn't find any application or deqp test that used more than 3 elements in exec->syncobj_array_length, and we need more than 16 elements in order to be able to reproduce the bug, so I'm not aware of any real-world bug that goes away with this patch. This issue was found while reading code. If we craft a little Vulkan program that submits a ton of timeline and binary semaphores on vkQueueSubmit, then waits for them, we get the following error without this patch: MESA: error: ../../src/intel/vulkan/anv_batch_chain.c:1910: execbuf2 failed: Invalid argument (VK_ERROR_DEVICE_LOST) v2: Rebase. Cc: mesa-stable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20703> (cherry picked from commit `ad6a036a68`)	2023-01-26 15:40:31 +00:00
Maíra Canal	3f75b640be	v3dv: remove unused clamp_to_transparent_black_border property Commit `e07c5467` ("v3dv/format: use XYZ1 swizzle for three-component formats") removes the only code that handled the clamp_to_transparent_black_border variable. Therefore, the variable can be deleted, as it is not currently being used. Fixes: `e07c5467` ("v3dv/format: use XYZ1 swizzle for three-component formats") Signed-off-by: Maíra Canal <mcanal@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20746> (cherry picked from commit `86c9bdcd9a`)	2023-01-26 15:40:31 +00:00
Lionel Landwerlin	04bc00cf2c	nir/divergence: add missing RT intrinsinc handling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20763> (cherry picked from commit `b82d9b1a3d`)	2023-01-26 15:40:31 +00:00
Alyssa Rosenzweig	9f4d8d7595	mesa: Set info.separate_shader for ARB programs ARB programs are logically separate, and Mesa will happily mix and match them. We need to alert backends of this fact, by setting nir->info.separate_shader. Otherwise, backends may link shaders invalidly. Fixes fp-abs-01 on Bifrost. (We don't use separate_shader for anything on Valhall, so the issue doesn't appear there.) Compare `151aa19c21` ("ttn: Set nir->info.separate_shader"), which fixed a similar issue with TGSI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Adam Jackson <ajax@redhat.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20688> (cherry picked from commit `7e68cf91d7`)	2023-01-26 15:40:31 +00:00
Gert Wollny	e020f913bb	virgl: drop the separable flag for cases that can't be handled The host can't assign more than 32 locations explicitly, and we exhaust this already when we handle patches and generics. So drop the separable flag in cases when we have other IO that uses generated names that will have to be matched by name. v2: skip tests for VS input and FS outputs Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20738> (cherry picked from commit `8084b412ca`)	2023-01-26 15:40:31 +00:00
Erik Faye-Lund	56be7ba217	zink: fix depth-clip disable cap We use EXT_depth_clip_enable for this, not EXT_depth_clip_control, which is what depth_clip_control_missing is a proxy for. Fixes: `721f33cd0f` ("zink: fix return for PIPE_CAP_DEPTH_CLIP_DISABLE") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20740> (cherry picked from commit `c12fed1804`)	2023-01-26 15:40:31 +00:00
Jason Ekstrand	8da3b21d07	gallium,util: Pull u_indices and u_primconvert back into gallium This was moved in !13741 but doing so created a link-time dependency between util and gallium which causes problems for Vulkan drivers. Meanwhile, having mesa/main depend on gallium is fine now that we don't have any classic drivers. It's a bit circular but should be harmless. Fixes: `97ba2f2fd4` ("move util/indices to core util") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/8098 Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20734> (cherry picked from commit `d292cb82b8`)	2023-01-26 15:40:31 +00:00
Rob Clark	c465184d26	freedreno: Fix tracking of enabled SSBOs Clearing all of the modified bits an relying on OR'ing the needed bits back in the loop below doesn't quite work out, Because of early continue if the SSBO has not changed. Fixes: `0ed053f03d` ("freedreno: simplify fd_set_shader_buffers(..)") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/20575> (cherry picked from commit `e41d19a711`)	2023-01-26 15:40:31 +00:00

... 3 4 5 6 7 ...

634 Commits