Comparing 69e596a791..0966afa0b8 - mesa

fran/mesa

Author	SHA1	Message	Date
Rhys Perry	2036a2c5c5	radv: use load_shared2_amd/store_shared2_amd fossil-db (Sienna Cichlid): Totals from 376 (0.23% of 162293) affected shaders: MaxWaves: 9620 -> 9596 (-0.25%); split: +0.08%, -0.33% Instrs: 207533 -> 203901 (-1.75%); split: -1.76%, +0.01% CodeSize: 1130904 -> 1106420 (-2.16%); split: -2.17%, +0.01% VGPRs: 14016 -> 14120 (+0.74%); split: -0.34%, +1.08% Latency: 2143281 -> 2132212 (-0.52%); split: -0.56%, +0.05% InvThroughput: 389116 -> 387990 (-0.29%); split: -0.34%, +0.05% VClause: 4483 -> 4485 (+0.04%); split: -0.11%, +0.16% SClause: 5780 -> 5778 (-0.03%); split: -0.17%, +0.14% Copies: 15319 -> 15331 (+0.08%); split: -0.53%, +0.61% Branches: 5561 -> 5563 (+0.04%) PreSGPRs: 11776 -> 11775 (-0.01%) PreVGPRs: 11393 -> 11497 (+0.91%); split: -0.13%, +1.04% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	67fc0e3655	ac/llvm: implement load_shared2_amd/store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	c883abda76	aco: implement load_shared2_amd/store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	5aa5af7776	aco: handle read2st64/write2st64 in optimizer Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	2135c88d9c	aco: fix signedness of DS_instruction::offset0/1 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	778fc176b1	nir/opt_load_store_vectorize: create load_shared2_amd/store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	dc835626b3	nir/opt_load_store_vectorize: fix broken indentation Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Rhys Perry	8ff122f8b8	nir: add load_shared2_amd and store_shared2_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13778>	2022-04-13 23:08:07 +00:00
Konstantin Seurer	bbdf22ce13	radv: Fix barriers with cp dma We need to wait for cp dma if VK_PIPELINE_STAGE_2_ALL_TRANSFER_BIT or VK_PIPELINE_STAGE_2_ALL_COMMANDS_BIT are set. Closes: #5911 Fixes: `4b9bc4791b` ("radv: only sync CP DMA for transfer operations or bottom pipe") Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15933>	2022-04-13 22:16:43 +00:00
Daniel Schürmann	d703a0e808	aco: remove register hints entirely Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Daniel Schürmann	2fe005a3fe	aco: remove occurences of VCC hint Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Daniel Schürmann	b10c4d7dee	aco: make program->needs_vcc independent of VCC hints Totals from 5 (0.00% of 135048) affected shaders: (GFX9) SGPRs: 208 -> 160 (-23.08%) CodeSize: 2700 -> 2692 (-0.30%) Instrs: 533 -> 531 (-0.38%) Latency: 41688 -> 41680 (-0.02%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Daniel Schürmann	415a3820fc	aco/ra: omit VCC affinity on VOPC_SDWA for GFX9+ VOPC_SDWA can also use arbitrary SGPR pairs on GFX9+. Totals from 5607 (4.16% of 134913) affected shaders: (GFX10.3) CodeSize: 42470760 -> 42452988 (-0.04%) Instrs: 7943174 -> 7942883 (-0.00%) Latency: 102887029 -> 102886305 (-0.00%); split: -0.00%, +0.00% InvThroughput: 20454456 -> 20454338 (-0.00%); split: -0.00%, +0.00% Copies: 376818 -> 376865 (+0.01%); split: -0.00%, +0.01% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Daniel Schürmann	6ebc61d71b	aco/ra: create VCC-affinities during RA instead of using register hints. Totals from 88367 (65.50% of 134913) affected shaders: (GFX10.3) CodeSize: 322492184 -> 322252912 (-0.07%); split: -0.08%, +0.01% Instrs: 60615809 -> 60541260 (-0.12%); split: -0.12%, +0.00% Latency: 557067980 -> 557009210 (-0.01%); split: -0.01%, +0.00% InvThroughput: 109676757 -> 109674804 (-0.00%); split: -0.00%, +0.00% SClause: 1939703 -> 1939924 (+0.01%); split: -0.01%, +0.02% Copies: 4557567 -> `4487530` (-1.54%); split: -1.54%, +0.00% Branches: 1941123 -> 1937453 (-0.19%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Daniel Schürmann	44fb9ba84a	aco/ra: only use VCC if program->needs_vcc == true A future commit will make VCC register assignment independent from register hints. Up to GFX9, VCC can alternatively be used as regular SGPR, so prevent overlap. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15408>	2022-04-13 21:52:43 +00:00
Lionel Landwerlin	08f3950d6b	anv: stop using old entrypoint/struct/enum names for 1.3 v2: More replacements Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15920>	2022-04-13 21:13:56 +00:00
Emma Anholt	5fad6bca72	nir_to_tgsi: Do the required cleanup for nir_opt_find_array_copies(). If we made a copy deref, then we need to do dead-write elimination for the pervious writes or we'll just emit the same copy deref again next time around. And, at the end of the opt loop, we need to lower copy derefs because later passes (locals_to_regs, notably) depend on it. Fixes infinite opt loop on fs-function-inout-array with virgl on NTT. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15899>	2022-04-13 19:44:39 +00:00
Jason Ekstrand	c8df09ebd4	iris: More gracefully fail in resource_from_user_memory rusticl (and clover) would like to get a graceful fail here so they can fall back to a shadow copy instead of us asserting. We also start rejecting arrayed surface because isl doesn't allow selecting a QPitch yet. Even if it did, QPitch is horribly restrictive, even for linear surfaces, that it likely wouldn't be that useful. Fixes: `e81f3edf76` ("iris: Allow userptr on 1D and 2D images") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15903>	2022-04-13 19:18:54 +00:00
Mike Blumenkrantz	8501661332	zink: set optimal tiling on swapchain images this otherwise breaks kopper fixes #6294 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15928>	2022-04-13 19:01:29 +00:00
Louis-Francis Ratté-Boulianne	3017522e74	dzn: Add CI target for vulkan driver A custom branch of `deqp` is used to have proper results when crashing. See: https://github.com/KhronosGroup/VK-GL-CTS/issues/311 A custom branch of `deqp-runner` with Windows support is also used until the changes are merged into the main repository. The `api`, `info`, `draw`, `query-pool` and `memory` test cases are executed for now. Signed-off-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15742>	2022-04-13 18:05:44 +00:00
Louis-Francis Ratté-Boulianne	fb24f34fc3	dzn: Add a debug flag to enable D3D12 debug layer Signed-off-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15742>	2022-04-13 18:05:44 +00:00
Karmjit Mahil	f7ddd584ab	pvr: Implement vkCreateQueryPool() and vkDestroyQueryPool(). Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15880>	2022-04-13 17:58:03 +00:00
Karmjit Mahil	1250e30929	pvr: Add pvrsrvkm visibility test heap. Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15880>	2022-04-13 17:58:03 +00:00
Karmjit Mahil	76ee1671f6	pvr: Add core count info and pvr_device_runtime_info. Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15880>	2022-04-13 17:58:03 +00:00
Jason Ekstrand	93fbaae7d5	v3dv: Add emulated timeline semaphore support This is trivial thanks to the emulated timelines provided in common code. "Real" timeline semaphores which can be shared across processes will require kernel support. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	1cc917bc68	v3dv: Use the core version property helpers vulkaninfo is the same before and after. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	1973b2da9d	v3dv: Use the core version feature helpers vulkaninfo is the same before and after. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	316728a55b	v3dv: Switch to the common submit framework Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	321f0b85f2	v3dv: Always wait on last_job_syncs if job->serialize Even if we're the first job on some queue, there may be no wait semaphores but we still need to ensure things happen in-order. (See the "Implicit Synchronization Guarantees" section of the Vulkan spec.) The client can submit back-to-back command buffers with no semaphores between them and it needs to adt the same as if there were a semaphore. If job->serialize is set because of a barrier or something, we still need to synchronize across HW queues by waiting on last_job_syncs. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	00b84fae2d	v3dv: Add a condition variable for queries In order to properly wait for a query to be complete, we need to first wait for the end query job to flush through on the queue. Since query end is always handled on the CPU, we can do this with a condition variable. The 2s timeout is taken from ANV. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	e5a0e2122f	v3dv: Use util/os_time helpers Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	8bd7bd9577	v3dv: Switch to the common device lost tracking Vulkan requires that, once the device has been lost, you keep returning VK_ERROR_DEVICE_LOST. We've got tracking for this in common code; it just needs to be wired up. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	32527f3ccc	v3dv: Destroy the device mutex on the teardown path Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	30191fd9df	v3dv: Don't use pthread functions on c11 mutexes This only works because c11/threads.h is typedeffing the c11 stuff to ptrheads. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	25441b5e5c	v3dv: Put indirect compute CSD jobs in the job list Instead of having the CPU job execute the CSD job, put both jobs on the list with the CPU job first which modifies the GPU job which gets kicked off next. This gives the queue code more visibility into what types of jobs are actually in the list. In particular, if an indirect compute job is the last job in a batch buffer, it currently appears as if the batch ends with CPU work which isn't true because it kicks off GPU work. In that case, the last job on the list is now a GPU job, which better matches reality. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	0208bb2d58	v3dv: Stop directly setting vk_device::alloc vk_device_init() will do this. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Jason Ekstrand	b284c512e6	vulkan/drm_syncobj: Implement WAIT_PENDING with a sync_file lookup The v3dv kernel driver doesn't support timelines yet but we want threaded submit and that requires WAIT_PENDING. Fortunately, it should never sit in this loop for long in practice. The primary use-case is sorting out dependencies and these checks will always trivially succeed for non-shared semaphores because v3dv only has a single queue. Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15704>	2022-04-13 17:22:14 +00:00
Rhys Perry	7478b00c7c	aco: remove old global access intrinsics No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	61ac5acca3	radv,ac/nir: lower global access to _amd global access intrinsics fossil-db (Sienna Cichlid): Totals from 400 (0.30% of 134621) affected shaders: VGPRs: 18696 -> 18688 (-0.04%) CodeSize: 2031348 -> 1946640 (-4.17%) Instrs: 374703 -> 360226 (-3.86%) Latency: 4200727 -> 4108628 (-2.19%); split: -2.20%, +0.01% InvThroughput: 1059935 -> 1029441 (-2.88%); split: -2.88%, +0.00% VClause: 5777 -> 5771 (-0.10%) SClause: 11890 -> 10891 (-8.40%); split: -8.57%, +0.17% Copies: 34035 -> 33259 (-2.28%); split: -2.98%, +0.70% Branches: 11108 -> 11100 (-0.07%); split: -0.08%, +0.01% PreSGPRs: 15999 -> 15942 (-0.36%); split: -0.44%, +0.08% PreVGPRs: 16994 -> 16970 (-0.14%) fossil-db (Polaris10): Totals from 400 (0.29% of 135668) affected shaders: SGPRs: 23799 -> 22919 (-3.70%); split: -4.30%, +0.61% VGPRs: 18480 -> 18472 (-0.04%) CodeSize: 2090316 -> 2041592 (-2.33%) Instrs: 395461 -> 385747 (-2.46%); split: -2.46%, +0.00% Latency: 5045768 -> 5020196 (-0.51%); split: -0.53%, +0.02% InvThroughput: 2694320 -> 2689886 (-0.16%); split: -0.23%, +0.07% VClause: 5982 -> 5968 (-0.23%) SClause: 12064 -> 10823 (-10.29%); split: -10.33%, +0.04% Copies: 48233 -> 48322 (+0.18%); split: -0.47%, +0.65% PreSGPRs: 16409 -> 16358 (-0.31%); split: -0.39%, +0.08% fossil-db (Pitcairn): Totals from 400 (0.29% of 135668) affected shaders: SGPRs: 22431 -> 22215 (-0.96%); split: -2.60%, +1.64% VGPRs: 18776 -> 18560 (-1.15%); split: -1.21%, +0.06% CodeSize: 2104440 -> 2017708 (-4.12%) MaxWaves: 2363 -> 2367 (+0.17%) Instrs: 413099 -> 397446 (-3.79%) Latency: 5507707 -> 5450251 (-1.04%); split: -1.12%, +0.07% InvThroughput: 2838867 -> 2786903 (-1.83%); split: -1.83%, +0.00% VClause: 10334 -> 10097 (-2.29%) SClause: 12346 -> 11005 (-10.86%); split: -10.89%, +0.02% Copies: 54034 -> 52065 (-3.64%); split: -3.99%, +0.35% PreSGPRs: 17916 -> 17857 (-0.33%); split: -0.40%, +0.07% PreVGPRs: 16917 -> 16893 (-0.14%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	9d1bab3615	aco: increase global_load_params.max_const_offset_plus_one The callback now supports this. This shouldn't have any effect yet except on GFX6 with 12 byte loads. fossil-db (Pitcairn): Totals from 246 (0.18% of 135668) affected shaders: VGPRs: 14684 -> 14768 (+0.57%); split: -0.44%, +1.01% CodeSize: 1765792 -> 1738040 (-1.57%) Instrs: 344605 -> 340055 (-1.32%) Latency: 4892904 -> 4861942 (-0.63%) InvThroughput: 2479599 -> 2446070 (-1.35%) VClause: 8782 -> 8735 (-0.54%) SClause: 9854 -> 9853 (-0.01%) Copies: 47327 -> 45401 (-4.07%); split: -4.08%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	3e9517c757	aco: implement _amd global access intrinsics fossil-db (Sienna Cichlid): Totals from 7 (0.01% of 134621) affected shaders: VGPRs: 760 -> 776 (+2.11%) CodeSize: 222000 -> 222044 (+0.02%); split: -0.01%, +0.03% Instrs: 40959 -> 40987 (+0.07%); split: -0.01%, +0.08% Latency: 874811 -> 886609 (+1.35%); split: -0.00%, +1.35% InvThroughput: 437405 -> 443303 (+1.35%); split: -0.00%, +1.35% VClause: 1242 -> 1240 (-0.16%) SClause: 1050 -> 1049 (-0.10%); split: -0.19%, +0.10% Copies: 4953 -> 4973 (+0.40%); split: -0.04%, +0.44% Branches: 1947 -> 1957 (+0.51%); split: -0.05%, +0.56% PreVGPRs: 741 -> 747 (+0.81%) fossil-db changes seem to be noise. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	eeefe31014	ac/llvm: implement _amd global access intrinsics Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	5c038b3f02	nir: add _amd global access intrinsics These are the same as the normal ones, but they take an unsigned 32-bit offset in BASE and another unsigned 32-bit offset in the last source. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	391bf3ea30	aco: don't expand smem/mubuf global loads For example, dwordx3->dwordx4 or ubyte3->dwordx2. Global loads don't have the bounds checking that buffer loads have that makes this safe. The alignment checks are added to global_load_callback() in case byte_align_loads=false, align=1 and bytes_needed=3. Without them, the callback will create a dword load. fossil-db (Sienna Cichlid): Totals from 267 (0.20% of 134621) affected shaders: CodeSize: 1603352 -> 1606568 (+0.20%) Instrs: 294946 -> 295482 (+0.18%); split: -0.00%, +0.18% Latency: 2997003 -> 2997052 (+0.00%); split: -0.02%, +0.02% InvThroughput: 526645 -> 526659 (+0.00%) SClause: 9179 -> 9185 (+0.07%); split: -0.02%, +0.09% Copies: 25363 -> 25375 (+0.05%); split: -0.08%, +0.13% Branches: 8298 -> 8299 (+0.01%) fossil-db (Polaris10): Totals from 267 (0.20% of 135668) affected shaders: CodeSize: 1636672 -> 1638756 (+0.13%); split: -0.00%, +0.13% Instrs: 308484 -> 308733 (+0.08%); split: -0.01%, +0.09% Latency: 3446045 -> 3446904 (+0.02%); split: -0.00%, +0.03% InvThroughput: 1206722 -> 1206828 (+0.01%); split: -0.00%, +0.01% SClause: 9308 -> 9311 (+0.03%); split: -0.08%, +0.11% Copies: 36933 -> 36921 (-0.03%); split: -0.08%, +0.05% fossil-db (Pitcairn): Totals from 275 (0.20% of 135668) affected shaders: SGPRs: 17616 -> 17520 (-0.54%); split: -0.64%, +0.09% VGPRs: 15428 -> 15540 (+0.73%); split: -0.23%, +0.96% CodeSize: 1885792 -> 1929120 (+2.30%); split: -0.00%, +2.30% MaxWaves: 1284 -> 1285 (+0.08%) Instrs: 368963 -> 376095 (+1.93%); split: -0.00%, +1.94% Latency: 5122922 -> 5168398 (+0.89%); split: -0.01%, +0.90% InvThroughput: 2562866 -> 2604279 (+1.62%) VClause: 9268 -> 9296 (+0.30%); split: -0.13%, +0.43% SClause: 10702 -> 10705 (+0.03%); split: -0.05%, +0.07% Copies: 48620 -> 50629 (+4.13%); split: -0.08%, +4.21% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	6baad09711	aco: use saddr for global access with sgpr address fossil-db (Sienna Cichlid): Totals from 38 (0.03% of 134621) affected shaders: CodeSize: 237196 -> 237060 (-0.06%); split: -0.09%, +0.03% Instrs: 43895 -> 43894 (-0.00%); split: -0.02%, +0.01% Latency: 914633 -> 916263 (+0.18%); split: -0.01%, +0.19% InvThroughput: 468215 -> 468971 (+0.16%); split: -0.02%, +0.18% SClause: 1239 -> 1242 (+0.24%) PreSGPRs: 997 -> 1003 (+0.60%) PreVGPRs: 936 -> 923 (-1.39%); split: -1.50%, +0.11% Regression seems to be RA noise, creating a waitcnt. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	d957730b9b	aco: use vcc for 64-bit vgpr addition fossil-db (Sienna Cichlid): Totals from 229 (0.17% of 134621) affected shaders: CodeSize: 1520192 -> 1517644 (-0.17%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Rhys Perry	0360e12ebf	radv: don't require robust vectorization for nir_var_mem_global Robust vectorization is to prevent vectorization of loads using the near maximum offset with loads of offset 0. Global loads can't read from offset 0 (NULL) anyways, so this isn't necessary. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14124>	2022-04-13 16:23:35 +00:00
Jason Ekstrand	6ca328988f	iris: Don't leak scratch BOs Fixes: `4d219b0eb3` ("iris: implement scratch space!") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15897>	2022-04-13 15:56:50 +00:00
Timur Kristóf	394b93bfeb	radv: Only use TES vertex offset 2 for triangles and quads. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15837>	2022-04-13 15:04:26 +00:00
Timur Kristóf	e02c71d6c5	radv: Fix gs_vgpr_comp_cnt for NGG VS without passthrough mode. When not in passthrough mode, the NGG shader needs to calculate the primitive export value from the input primitive's vertex indices. So, GS vertex offset 2 is needed when NGG has triangles and isn't in passthrough mode. Fixes: `7ad69e2f7e` "radv: stop loading invocation ID for NGG vertex shaders" Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15837>	2022-04-13 15:04:26 +00:00
Timur Kristóf	a7147ef1e8	nir: Handle out of bounds access in nir_vectorize_tess_levels. Replace out of bounds loads with undef. Then, delete instructions with out of bounds access. Fixes: `f5adf27fb9` "nir,radv: add and use nir_vectorize_tess_levels()" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6264 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15775>	2022-04-13 13:25:10 +00:00
Timur Kristóf	b874a2ed61	aco: Fix VOP2 instruction format in visit_tex. There was a v_or_b32 that accidentally used SOP2. It should use VOP2. Issue found by looking at a gfxreconstruct trace posted by a user in this bug: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5838 Cc: mesa-stable Fixes: `93c8ebfa78` "aco: Initial commit of independent AMD compiler" Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15923>	2022-04-13 13:06:49 +00:00
Rohan Garg	581035b3a9	iris: set a default EDSC flag anv sets the default EDSC flag, do the same for iris too Fixes: `5ae278da18` ("iris: use vtbl to avoid multiple symbols, fix state base address") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15905>	2022-04-13 12:36:01 +00:00
Lionel Landwerlin	e11bedb9f5	intel/fs: add a note on possible optimization of root node address Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15910>	2022-04-13 11:24:49 +00:00
Lionel Landwerlin	9c0805ef91	intel/fs: fix metadata preserve on trace_ray intrinsic `c78be5da30` ("intel/fs: lower ray query intrinsics") introduced a helper function using nir_(push\|pop)_if which invalidated dominance & block_index for the replacement of nir_intrinsic_rt_trace_ray. We can still keep dominance/block_index metadata for the lowering of nir_intrinsic_rt_execute_callable though. This change uses 2 different lowering function with correct metadata preservation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c78be5da30` ("intel/fs: lower ray query intrinsics") Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15910>	2022-04-13 11:24:49 +00:00
Mike Blumenkrantz	fcd6b2a47a	zink: avoid creating ssbo variable types with multiple runtime arrays this is illegal affects: KHR-GL46.shader_storage_buffer_object.advanced-unsizedArrayLength-cs-packed-matC cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15894>	2022-04-13 10:50:22 +00:00
Mike Blumenkrantz	ff4dcb76d9	zink: use the calculated last struct member idx for ssbo size in ntv this may or may not be 1 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15894>	2022-04-13 10:50:22 +00:00
Gert Wollny	a30ff90561	virgl: Fix relocating the re-writing the transformation code The transformation must come before the code emission. Fixes: `6a264e7024` Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15919>	2022-04-13 12:02:17 +02:00
Kenneth Graunke	b7111f89e8	iris: Add VF_CACHE_INVALIDATE to IRIS_DOMAIN_OTHER_WRITE flush bits Suggested by Francisco Jerez. Although including VF invalidation in the flush bits is strange, we believe this is the only way to guarantee that stream output has finished. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	a969ad1ddf	iris: Demote DC flush to HDC flush in cache tracker FLUSH_HDC is sufficient to flush things out to L3, so we'd rather use that where possible. It's also emulated via DATA_CACHE_FLUSH on platforms where it isn't supported, so we can use it unconditionally. We still use DATA_CACHE_FLUSH for invalidating the data cache, and to flush the DC-tagged cachelines in L3 to be globally-observable. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	1c8b4940eb	iris: Emit flushes for push constant source buffers Push constant loading is not coherent with L3 according to the document that describes the hardware change for the vertex buffer L3 Bypass Disable field. If we've updated a push constant buffer with say, a blorp_buffer_copy, we may need to flush both the render cache and the tile cache. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	bbd5714a7e	iris: Use cache-tracker for draw count flushing We should be using the cache tracker for this. We can consider this access IRIS_DOMAIN_OTHER_READ now that it's the catch-all non-L3-coherent read-only access domain. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	9c8874b9ab	iris: Add pre-draw flushing for stream output targets When stream output is active, we need to let the cache tracker know about any SO buffers, which we access via IRIS_DOMAIN_OTHER_WRITE. In particular, we may have written to those buffers via another mechanism, such as BLORP buffer copies. In that case, previous writes happened via IRIS_DOMAIN_RENDER_WRITE, in which case we'd need to flush both the render cache and the tile cache to make that data globally- observable before we begin writing via streamout, which is incoherent with the earlier mechanism. Fixes misrendering in Ryujinx. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6085 Fixes: `d8cb76211c` ("iris: Fix MOCS for buffer copies") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	43e3747eea	iris: Extend the cache tracker to handle L3 flushes and invalidates Most clients are L3-coherent these days. However, there are some notable exceptions, such as push constants, stream output, and command streamer memory reads and writes. With the advent of the tile cache, flushing the render or depth caches alone are no longer sufficient for memory to become globally-observable. For those, we need to flush the tile cache as well. However, we'd like to avoid that for L3-coherent clients, as it shouldn't be necessary, and is expensive. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8cd7e94eca	iris: Add a separate PIPE_CONTROL_L3_READ_ONLY_CACHE_INVALIDATE bit This will let us use it without performing a VF cache invalidation, should we want to do that. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	b92cd58508	iris: Add an iris_is_domain_l3_coherent helper. The render, depth, sampler, and data (HDC) caches are all coherent with L3. We consider OTHER_READ and OTHER_WRITE to be non-coherent, as they're kitchen-sink domains which include non-L3-clients. Starting with Tigerlake, the VF cache is coherent with L3 (because we set the L3BypassDisable bit in the vertex/index buffer packets). Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	536eee31d0	iris: Fix UBO cache tracking for the !indirect_ubos_use_sampler case On Tigerlake, we use the data cache for reading indirect UBOs instead of the sampler. But we still use the constant cache for direct UBO access, so unfortunately we may access it through two different domains. To work around this, we add a new domain for pull constants (UBOs), which will be either constant+texture or constant+data. Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	d39bd7ba70	iris: Split out an IRIS_DOMAIN_SAMPLER_READ domain from OTHER_READ The bulk of IRIS_DOMAIN_OTHER_READ domain usage was the 3D sampler, but there were also a few oddball cases like command streamer reads, blitter access, and so on. The sampler is definitely L3 coherent, but some off the more esoteric reads may not be, so I'd like to separate them, so that OTHER_READ can become a non-L3-coherent kitchen-sink domain. The sampler cases only need TEXTURE_CACHE_INVALIDATE, and can skip the CONSTANT_CACHE_INVALIDATE we had on IRIS_DOMAIN_OTHER_READ. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Kenneth Graunke	8e0ff0275d	iris: Use IRIS_DOMAIN_DEPTH_WRITE for read only depth/stencil. We were using IRIS_DOMAIN_OTHER_READ for read-only depth/stencil access in an attempt to avoid unnecessary flushing; IRIS_DOMAIN_DEPTH_WRITE could indicate read-write access. However, IRIS_DOMAIN_OTHER_READ is clearly the wrong domain. Depth and stencil data is read via the depth cache, while IRIS_DOMAIN_OTHER_READ currently corresponds to the sampler cache and constant cache together (although this will change in future patches). It's unclear whether this hack was useful. For now, just drop it and use the correct depth cache domain, even if it's marked as read-write. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15275>	2022-04-13 09:07:35 +00:00
Gert Wollny	6a264e7024	virgl: Apply integer op fix only for ALU ops and clear modifiers For texture fetches and buffer load the fix is not needed, and the override creates faulty TGSI. In addition remove all modifiers from the src in the additional mov instruction. Fixes: `d1c7a7b131` virgl: Add an extra mov for int outputs from constant and immediate inputs v2: Move workaround after the use of virgl_tgsi_rewrite_src_for_input_temp (Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15896>	2022-04-13 08:56:47 +00:00
Gert Wollny	29564031cf	r600: Assign shader type when creating a new CS state Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15898>	2022-04-13 08:48:13 +00:00
Kenneth Graunke	68ef895674	st/mesa: Transcode ASTC to BC7 (BPTC) where possible This patch adds support for transcoding ASTC to BC7 (BPTC) and prefers it over BC3 (DXT5) when hardware supports that format. BC7 is a much newer format (~2009 vs. ~1999) and offers higher quality than the older BC3 format. Furthermore, our encoder seems to be faster. Tapani put together a small benchmark for transcoding a 1024x1024 ASTC texture, and switching from BC3 to BC7 improves performance of that microbenchmark by 25% on my Tigerlake NUC (with hardware ASTC disabled so we can test this path). Presumably, this isn't fundamental to the formats, but rather reflects the speed of our in-tree compressors. So, we should use BC7 where possible. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15875>	2022-04-13 07:58:11 +00:00
Kenneth Graunke	d4521a2515	st/mesa: Make transcode_astc also check for non-SRGB format support This is probably unnecessary in that all drivers which support the sRGB format likely also support the non-sRGB format. But we may as well check both the formats we use, for documentation if nothing else. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15875>	2022-04-13 07:58:11 +00:00
Tomeu Vizoso	7d474c100e	ci: Move most stuff out of root .gitlab-ci.yml This file was getting a bit hard to navigate. Split container, build and test jobs to their own files. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	2a578c6505	ci: Allow local installations to build additional stuff into the rootfs This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	e81693a1b4	ci: Add env var to add packages to install in debian/arm_build image This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	79aef41881	ci: Add env var to add packages to install in rootfs This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	b46000f076	ci: Allow specifying a different kernel in LAVA jobs To make it possible to use a kernel different from that built along with the rootfs. This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Tomeu Vizoso	f7713b0af0	ci: Use CI_PROJECT_NAME instead of hardcoding 'mesa' This can make it more convenient for other projects to reuse these scripts. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15891>	2022-04-13 07:34:36 +00:00
Lionel Landwerlin	3394680368	nir/lower_shader_calls: name resume shaders Helpful when lost in a sea of NIR :) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15887>	2022-04-13 06:59:29 +00:00
Tomeu Vizoso	8506c2b7ee	ci: Disable Google's lab The runner is down and pipelines are being stuck. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15909>	2022-04-13 08:11:07 +02:00
Mike Blumenkrantz	c3ad1331be	zink: rework choose_pdev to (finally) be competent now zink will init using a priority system if multiple devices are available multiple devices will ONLY be available if: * the user does not specify VK_ICD_FILENAMES as they should * the user does not specify LIBGL_ALWAYS_SOFTWARE * multiple drivers exist I've prioritized the virtualized gpu here with the assumption that if such a thing is detected, the environment is most likely virtualized Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Mike Blumenkrantz	0c0ff57c61	aux/trace: clean up some zink+lavapipe tracing awfulness now that it's easier to determine whether zink is being used (mostly), this whole thing can be simplified Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Mike Blumenkrantz	d5ff82df38	zink: ZINK_USE_LAVAPIPE -> LIBGL_ALWAYS_SOFTWARE this is a documented variable, so reuse it Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Mike Blumenkrantz	42ff02de14	egl: don't make LIBGL_ALWAYS_SOFTWARE and MESA_LOADER_DRIVER_OVERRIDE=zink exclusive Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15857>	2022-04-13 00:14:57 +00:00
Indrajit Kumar Das	3abc66dc9f	ac/gpu_info: disallow displayable DCC for Navi12 and Navi14 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15813>	2022-04-12 23:52:24 +00:00
Jason Ekstrand	69b5424ea4	intel/nir: Lower 8 and 16-bit bitwise unops Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Jason Ekstrand	a482877c70	intel/fs: Implement 16-bit [ui]mul_high Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Jason Ekstrand	d0ace28790	nir/lower_int64: Fix [iu]mul_high handling `e551040c60`, which added a new mechanism for 64-bit imul which is more efficient on BDW and later Intel hardware also introduced a bug where we weren't properly walking both X and Y. No idea how testing didn't find this. Fixes: `e551040c60` ("nir/glsl: Add another way of doing lower_imul64 for gen8+" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6306 Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15829>	2022-04-12 23:19:38 +00:00
Mike Blumenkrantz	48ae404b42	kopper: print better error message if loader not detected silently failing on release builds is annoying Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15851>	2022-04-12 21:34:30 +00:00
Erico Nunes	cf1390e1b8	lima: fix vector const src referenced multiple times It can happen that a single vector constant is referenced multiple times by the same node, with different swizzles. This needs to be taken into account by checking and updating the swizzles for all the srcs of a target node when inserting the const node to the same instruction. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15726>	2022-04-12 20:07:32 +00:00
Mike Blumenkrantz	19a22ae110	features: mark off ARB_seamless_cubemap_per_texture for zink forgot to do this with the MR Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15902>	2022-04-12 19:12:57 +00:00
Gert Wollny	c3096e562d	ntt: translate nir_intrinsic_shader_clock Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15889>	2022-04-12 18:47:08 +00:00
Mike Blumenkrantz	dea65ae590	zink: finish up radv piglit baseline updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15900>	2022-04-12 14:00:47 -04:00
Konstantin Seurer	521492e8b1	radv: Refactor ray tracing support checks Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15860>	2022-04-12 16:13:38 +00:00
Konstantin Seurer	a9fce44dd6	radv: Refactor radv_tex_aniso_filter Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15860>	2022-04-12 16:13:38 +00:00
Mike Blumenkrantz	6b65d4234c	radv: set read/write without format flags for supported texel buffers if the storage case is supported, this should be supported too Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15826>	2022-04-12 15:52:03 +00:00
Samuel Pitoiset	2b688942c1	Revert "radv: Disable NGG for GS with suboptimal output vertex count." It breaks too many things and shouldn't have been merged. The fix isn't trivial and it will probably not be backported because it's intrusive. It will be re-applied later when everything will work. This reverts commit `94706601fa`. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15882>	2022-04-12 12:26:32 +00:00
Gert Wollny	e466d73368	r600: make r600_load_ar available to driver code This is needed for the new NIR assembler Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	050e05db22	r600: Set the last bit if an alu group is split by kcache allocation Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	d920200ad6	r600: Force last instruction of group when starting a new CF When emitting the AR forces splitting an ALU group, and at the same time a new CF instruction is started, then the last instrcution in the finished CF block might not have the "last" bit set, which results in an invalid shader that might hang, or crash SB. So when a new CF is started, force the last bit in the last ALU instruction. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	04fd9a6488	r600: don't reschedule INTERP_LOAD_P0 With the NIR code, we have instructions groups that use INTERP_LOAD_P0 that don't fill all slots. Just make sure the backend scheduler doesn't fill in INTERP_LOAD_P0 instructions with a different LDS location. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	3c4644afb0	r600: ignore dest sel for non-write targets when counting registers Since the value is not written, there is no need to allocate a register for it, so don't take it into account. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Gert Wollny	67d145d9ab	r600: Don't limit scheduling of PARAM_SRC values ALU_SRC_PARAM_BASE is an inline constant that defines the address for pulling data from LDS memory for interpolation and not a value from the kcache, so there is no need to take these values into account when allocating kcache load slots. v2: Fix the constant range check to not exclude the translated ranges for kcache banks 2 and 3. v3: limit range check to only include kcache values and and rename relevant function (Emma). Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15714>	2022-04-12 12:10:19 +00:00
Rhys Perry	f6262804af	radv: increase inline push constant limit if we can inline all constants fossil-db (Sienna Cichlid): Totals from 665 (0.49% of 134627) affected shaders: CodeSize: 4519620 -> 4491724 (-0.62%); split: -0.62%, +0.01% Instrs: 842745 -> 837313 (-0.64%); split: -0.66%, +0.01% Latency: 7289925 -> 7279661 (-0.14%); split: -0.30%, +0.16% InvThroughput: 1240770 -> 1240639 (-0.01%); split: -0.01%, +0.00% VClause: 15799 -> 15772 (-0.17%) SClause: 33773 -> 32604 (-3.46%); split: -3.66%, +0.20% Copies: 67695 -> 64992 (-3.99%); split: -4.49%, +0.50% PreSGPRs: 38597 -> 38640 (+0.11%); split: -0.14%, +0.25% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Rhys Perry	773c7cbcbc	radv,aco: implement 64-bit inline push constants fossil-db (Sienna Cichlid): Totals from 21 (0.02% of 134621) affected shaders: CodeSize: 1932 -> 1560 (-19.25%) Instrs: 357 -> 303 (-15.13%) Latency: 6576 -> 5883 (-10.54%) InvThroughput: 26304 -> 23532 (-10.54%) SClause: 42 -> 24 (-42.86%) Copies: 90 -> 105 (+16.67%); split: -10.00%, +26.67% PreSGPRs: 144 -> 201 (+39.58%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Rhys Perry	7f6262bb85	radv: allow holes in inline push constants Use a dword mask instead of a range to track which push constants to inline. fossil-db (Sienna Cichlid): Totals from 5724 (4.25% of 134621) affected shaders: CodeSize: 20894044 -> 20815748 (-0.37%); split: -0.39%, +0.02% Instrs: 4002568 -> 3988385 (-0.35%); split: -0.38%, +0.02% Latency: 29285060 -> 29224414 (-0.21%); split: -0.22%, +0.01% InvThroughput: 5529700 -> 5526893 (-0.05%); split: -0.05%, +0.00% VClause: 78093 -> 78240 (+0.19%); split: -0.23%, +0.41% SClause: 135495 -> 131027 (-3.30%); split: -3.30%, +0.00% Copies: 330856 -> 324552 (-1.91%); split: -2.37%, +0.46% PreSGPRs: 226031 -> 224778 (-0.55%); split: -0.61%, +0.05% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Rhys Perry	72cf6cca91	radv: allow inline push constants in more situations We don't need to disable this path if there are indirect or 8/16/64-bit push constant loads. We can just use the default path for them. fossil-db (Sienna Cichlid): Totals from 21 (0.02% of 134621) affected shaders: CodeSize: 2028 -> 1884 (-7.10%) Instrs: 366 -> 363 (-0.82%); split: -2.46%, +1.64% Latency: 6630 -> 6579 (-0.77%) InvThroughput: 26520 -> 26316 (-0.77%) Copies: 84 -> 102 (+21.43%) PreSGPRs: 141 -> 222 (+57.45%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12145>	2022-04-12 11:44:30 +00:00
Mykhailo Skorokhodov	9c7e750ffe	intel/fs: Enable b2f(inot(a)) and b2i(inot(a)) optimization for Gfx12+ The commit enables the optimization for Intel Gfx12+ graphics. Tigerlake ``` total instructions in shared programs: 1289326 -> 1289015 (-0.02%) instructions in affected programs: 37841 -> 37530 (-0.82%) helped: 78 HURT: 9 helped stats (abs) min: 1 max: 26 x̄: 4.69 x̃: 3 helped stats (rel) min: 0.10% max: 12.50% x̄: 2.07% x̃: 1.21% HURT stats (abs) min: 1 max: 18 x̄: 6.11 x̃: 4 HURT stats (rel) min: 0.16% max: 1.95% x̄: 0.94% x̃: 0.61% 95% mean confidence interval for instructions value: -4.95 -2.20 95% mean confidence interval for instructions %-change: -2.34% -1.18% Instructions are helped. total cycles in shared programs: 105606388 -> 105606442 (<.01%) cycles in affected programs: 620119 -> 620173 (<.01%) helped: 49 HURT: 28 helped stats (abs) min: 2 max: 3618 x̄: 228.63 x̃: 12 helped stats (rel) min: 0.02% max: 23.31% x̄: 4.60% x̃: 1.11% HURT stats (abs) min: 1 max: 2142 x̄: 402.04 x̃: 29 HURT stats (rel) min: 0.01% max: 36.42% x̄: 5.01% x̃: 0.46% 95% mean confidence interval for cycles value: -151.80 153.20 95% mean confidence interval for cycles %-change: -3.00% 0.79% Inconclusive result (value mean confidence interval includes 0). ``` Related-to: `7725d60938` Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14017>	2022-04-12 10:55:05 +00:00
Gert Wollny	d1c7a7b131	virgl: Add an extra mov for int outputs from constant and immediate inputs virglrenderer doesn't properly emit the conversion code when the source is a integer value and the output is also integer. Fixes on NTT: dEQP-GLES31.functional.shaders.sample_variables.sample_mask.inverse_per_* v2: fix typo (Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15836>	2022-04-12 10:44:17 +00:00
Gert Wollny	a083ae818a	virgl: Always make some extra temps available for transformations The host driver will optimize unused variables away, and checking thoroughly whether we may need an extra temp is just uselessly costly. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15836>	2022-04-12 10:44:17 +00:00
Gert Wollny	a4a34cd323	virgl: Propagate precice flag through moves NIR doesn't propagate precise through moves, and with NTT the last output is usually preceded by a move, so that we no longer see that the evaluation of some value is supposed to be exact, and, hence we can't decorate the outputs accordingly. Fixes with NTT: dEQP-GLES31.functional.tessellation.common_edge. triangles_equal_spacing_precise triangles_fractional_odd_spacing_precise triangles_fractional_even_spacing_precise quads_equal_spacing_precise quads_fractional_odd_spacing_precise quads_fractional_even_spacing_precise v2: Don't clear the precise flag when we hit a mov, because we may hit a if/else construct like below and we don't track branches IF X TEMP[0] = OP_PRECICE ... ELSE TEMP[0] = MOV CONST[] ENDIF Thanks Emma for pointing out the problem. v2: allocate precise handling flags to transform_prolog (Emma) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15836>	2022-04-12 10:44:17 +00:00
Juan A. Suarez Romero	0439f0e9fc	ci: add Broadcom CI maintainer Include in the CODEOWNERS file who to ping in case of issues with the Broadcom (V3D/V3DV/VC4) CI. v2: - Add Chema (Chema) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15858>	2022-04-12 10:42:31 +00:00
Juan A. Suarez Romero	18c4ad6e3b	CODEOWNERS: add Broadcom maintainers v2: - Add more maintainers (Iago) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Acked-by: Alejandro Piñeiro <apinheiro@igalia.com> Acked-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15858>	2022-04-12 10:42:31 +00:00
Gert Wollny	c63424b2eb	r600: Only emit the NOP group triggered by dest.rel after a full group In addition really fill all slots, because otherwise the alu-group merger might move a read from the indirectly written register into the 't' slot. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15848>	2022-04-12 10:33:58 +00:00
Icecream95	fc6f141304	drm-shim: Implement a shim function for close Remove the fd from the fd_map, so that if the fd is later reused for another file then mmap won't be intercepted. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12203>	2022-04-12 10:01:39 +00:00
Icecream95	c9eec12be7	drm-shim: Explicitly use off64_t for the offset to drm_shim_mmap drm_shim.c undefines the _FILE_OFFSET_BITS macro, so plain off_t might be 32 bits, while it's 64 bits in device.c. To avoid this mismatch, use off64_t which will always be 64 bits in both source files. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12203>	2022-04-12 10:01:39 +00:00
Icecream95	11ab86d581	drm-shim: Return fake render nodes in /dev/dri first loader_open_render_node returns the first device in /dev/dri that it can use. To make sure the drm-shim device always gets chosen, return the fake entries in readdir before returning the real ones. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12203>	2022-04-12 10:01:39 +00:00
Icecream95	dfd30035b9	drm-shim: Add a function for mmap64 rather than using an alias Fixes build on 32-bit systems. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12203>	2022-04-12 10:01:39 +00:00
Marcin Ślusarz	9b23aaf3cf	nir: remove gl_PrimitiveID output from MS when it's not used in FS Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15340>	2022-04-12 09:35:26 +00:00
Marcin Ślusarz	65600a34c2	anv: initialize 3DMESH_1D.ExtendedParameter0 when ExtendedParameter0Present When IndirectParameterEnable==true it's not actually used by the hardware, but if it's not initialized and INTEL_DEBUG=bat is set, then Valgrind complains. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15850>	2022-04-12 09:10:31 +00:00
Marcin Ślusarz	f844ce66c8	anv: fix push constant lowering for task/mesh Fixes: `a6031cd9bd` ("anv: fix push constant lowering with bindless shaders") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15850>	2022-04-12 09:10:31 +00:00
Timothy Arceri	20ab7046c0	glsl/st: use nir pass to lower indirect rather than GLSL IR Will allow us to drop more GLSL IR code in future once we switch all drivers to NIR. Also stops the need for all drivers to call this pass to remove indirect temps that may have been added during the NIR varying linking lowering/optimisations. This patch fixes some tests on i915, d3d12, lima and vc4. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15871>	2022-04-12 06:51:20 +00:00
Samuel Pitoiset	619e6d44eb	radv: add few helpers to deal with pipeline layout With VK_EXT_graphics_pipeline_library, we will have to support independent sets and also to merge sets from different libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15849>	2022-04-12 06:31:33 +00:00
Samuel Pitoiset	c338bd2957	radv: remove unused radv_pipeline_layout::size field Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15849>	2022-04-12 06:31:33 +00:00
Samuel Pitoiset	dca28a6355	radv: drop the remaining uses of shader modules With VK_EXT_graphics_pipeline_library, shader modules can be NULL and be passed via the pNext of VkPipelineShaderStageCreateInfo. To prepare for this, just store everything we need to radv_pipeline_stage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15847>	2022-04-12 06:13:24 +00:00
Samuel Pitoiset	b48231cb90	radv: store the shader sha1 to radv_pipeline_stage To remove use of shader modules completely in the next commit. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15847>	2022-04-12 06:13:24 +00:00
Samuel Pitoiset	c1b9c1269d	radv: replace convert_rt_stage() by vk_to_mesa_shader_stage() Mesa shader stages are correctly sorted. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15847>	2022-04-12 06:13:24 +00:00
Pavel Ondračka	f1202a92cf	nine: check hardware support before using vertex texture Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Axel Davy <davyaxel0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15864>	2022-04-12 07:38:47 +02:00
Mike Blumenkrantz	d637eee212	zink: create pipeline layout if only bindless descriptor set is used bindless descriptors are descriptors too. cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15853>	2022-04-12 04:49:17 +00:00
Mike Blumenkrantz	23c758807e	zink: handle 0 ubos and 0 ssbos in pipeline layout this is the number of types needed, and it can be zero cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15853>	2022-04-12 04:49:17 +00:00
Mike Blumenkrantz	c7ae22e4b8	zink: prune unused st-injected pointsize exports only the last vertex stage needs to keep these, so prune any that aren't being weirdly passed through Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15853>	2022-04-12 04:49:17 +00:00
Mike Blumenkrantz	cf3f3791e3	zink: try copy region first for non-resolve blits Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15853>	2022-04-12 04:49:17 +00:00
Mike Blumenkrantz	327ca3e5ef	zink: refactor copy_region path in zink_blit to util function no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15853>	2022-04-12 04:49:17 +00:00
Dave Airlie	60c61d7b68	draw: handle tess eval shader when getting num outputs This tripped up some pointsize/prim id interactions with zink. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15872>	2022-04-12 14:21:41 +10:00
Emma Anholt	835704e669	turnip: Move autotune buffers to suballoc. Now the ANGLE trex_200 trace replay does a single BO allocation at startup for autotune results instead of one per frame (~350 for the whole replay). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	7c636acd53	turnip: Get autotune off of ralloc destructors. We've wanted to remove destructors from ralloc's API for a long time (it's an extra storage cost per ralloc for a rarely-used feature), and for the suballoc change we'd need to spend more storage on storing the tu_device pointer per result since destructors don't get anything else but the pointer passed into them. Fixes use-after-frees: ================================================================= ==2383==ERROR: AddressSanitizer: heap-use-after-free on address 0xffff88fe1940 at pc 0xffff934f427c bp 0xfffff5481e90 sp 0xfffff5481ea8 WRITE of size 8 at 0xffff88fe1940 thread T0 #0 0xffff934f4278 in list_del ../src/util/list.h:108 #1 0xffff934f4278 in result_destructor ../src/freedreno/vulkan/tu_autotune.c:237 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #5 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #6 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #7 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 [...] 0xffff88fe1940 is located 80 bytes inside of 112-byte region [0xffff88fe18f0,0xffff88fe1960) freed by thread T0 here: #0 0xffff9c1c90d8 in __interceptor_free ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:127 #1 0xffff934f4368 in history_destructor ../src/freedreno/vulkan/tu_autotune.c:229 #2 0xffff9377793c in unsafe_free ../src/util/ralloc.c:300 #3 0xffff9377793c in ralloc_free ../src/util/ralloc.c:265 #4 0xffff934f5990 in tu_autotune_on_submit ../src/freedreno/vulkan/tu_autotune.c:442 #5 0xffff935cf2ac in tu_queue_submit_locked ../src/freedreno/vulkan/tu_drm.c:997 [...] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	435d4f08b2	turnip: Reduce the pipeline's CS allocation a bit. We don't return unused space to the suballocator, so it's a little useful to limit how much we overallocate to reduce memory footprint. I took a look through the tu_cs_emit_array() calls and accounted for a couple of them in the variant-specific space calculation, then dropped the base allocation by factors of 2 until we started throwing asserts. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	58f6331eec	turnip: Skip telling the kernel the BO list when we don't need any. In fencing, we sometimes do a dummy submit with no nr_cmds. If we don't have commands to execute, we don't need to pin or fence any BOs either. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	dc3203b087	turnip: Sub-allocate pipelines out of a device-global BO pool. Allocating a BO for each pipeline meant that for apps with many pipelines (such as Asphalt9 under ANGLE), we would end up spending too much time in the kernel tracking the BO references. Looking at CS:Source on zink, before we had 85 BOs for the pipelines for a total of 1036 kb, and now we have 7 BOs for a total of 896 kb. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	e0fbdd3eda	turnip: Stop allocating unused pvtmem space in the pipeline CS. The pvtmem was split off to a separate read/write BO. Fixes: `931ad19a18` ("turnip: make cmdstream bo's read-only to GPU") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Emma Anholt	80c44a6626	turnip: Track refcounts on BOs in kgsl as well. I'm going to be using the BO refcount for the pipeline and autotune buffer suballocation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15038>	2022-04-12 01:01:56 +00:00
Francisco Jerez	e858da39e5	intel/perf: Fix OA report accumulation on Gfx12+. The intel_perf_query path used for performance queries on GL was passing a bogus "end" pointer to intel_perf_query_result_accumulate(), causing it to accumulate garbage values. This was causing the values of many performance counters to be corrupted. The "end" pointer was incorrect because the current code was assuming that different OA reports were located TOTAL_QUERY_DATA_SIZE bytes apart, which is a hard-coded preprocessor define. However recent (Gfx12+) hardware generations use a variable query size determined by the query layout. Use the size derived from it instead, and remove the stale define. Fixes: `3c51325025` ("intel/perf: switch query code to use query layout") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15783>	2022-04-12 00:11:47 +00:00
Dave Airlie	6f98dc535a	zink/query: refactor out vk queries and allow sharing them gallium queries have to be mapped onto multiple vulkan queries, this can happen for two reasons. 1. primitives generated and overflow any don't map directly, and multiple vulkan queries are needs per iteration. These are stored inside the "starts" as zink_vk_query ptrs. 2. suspending/resuming queries uses multiple queries, these are the "starts". Every suspend/resume cycle adds a new start. Vulkan also requires that multiple queries of the same time don't execute at once, which affects the overflow any vs xfb normal queries, so vk_query structs are refcounted and can be shared between starts. Due to this when the draw state changes, it's simple to just suspend/resume all queries so the shared vulkan queries get handled properly. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15785>	2022-04-12 00:01:23 +00:00
Pavel Ondračka	ef2b8f56b1	r300: move pointer dereference after a NULL check Vs state can be NULL by the time r300_set_constant_buffer is called. We don't hit this with OpenGL though, so this is why I didn't spot this in my testing, but nine hits this codepath. Restore the original behavior here. Fixes: `882811b1ff` Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15842>	2022-04-11 20:48:11 +00:00
Mike Blumenkrantz	2149f12c1e	zink: add issue notes for remaining radv fails Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15867>	2022-04-11 20:18:09 +00:00
Jocelyn Falempe	744cf02fbc	llvmpipe: remove unused array chans variable doesn't need to be an array. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6204 Reviewed-by: Dave Airlie <airlied@redhat.com> Signed-off-by: Jocelyn Falempe <jfalempe@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15556>	2022-04-11 19:56:31 +00:00
Jocelyn Falempe	fbe392d16b	llvmpipe: fix color rendering on big endian. Some colors were missing/inverted on big endian machine(s390x). because blend_type.length > src_fmt->nr_channels. In my case, blend_type has 4 channels (rgba) but src_fmt has only 3. So the from_lsb was wrong by 1, and channels got messed up. (blue was always 255, green -> red, and blue -> green). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6204 Reviewed-by: Dave Airlie <airlied@redhat.com> Signed-off-by: Jocelyn Falempe <jfalempe@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15556>	2022-04-11 19:56:31 +00:00
Mike Blumenkrantz	960119008b	zink: update radv piglit baseline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15865>	2022-04-11 19:44:44 +00:00
Kenneth Graunke	b05ac36f01	intel/genxml: Add SAMPLER_MODE bits for enabling Small PL on Icelake This enables a lower power mode in the sampler hardware in certain common scenarios. On Tigerlake, SAMPLER_MODE is not programmable by userspace but the kernel already sets this bit for us. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Kenneth Graunke	e3defe7ae7	intel/genxml: Delete SAMPLER_MODE register definition on Gfx12+ While this register still exists, it's no longer a per-context register. Instead, on Gfx12+, SAMPLER_MODE exists per dual-subslice and is accessed as a "multicast" register, where you write control which version is accessed by the "steering control register". At any rate, userspace cannot write it any longer, and so there's not much point to it existing in our genxml (which was missing most of the fields anyway). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Kenneth Graunke	8092704705	intel/genxml: Add new "Low Quality Filter" field on Gfx12+. This allows the sampler to perform faster filtering of 8-bit UNORM textures by filtering them at a different precision. The filtering is intended to still be OpenGL and DirectX spec compliant. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Kenneth Graunke	9a70385e2b	intel/genxml: Add SAMPLER_STATE::Allow Low Quality LOD Calculation field This allows the hardware to perform a faster LOD calculation in many simple cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15628>	2022-04-11 19:17:07 +00:00
Juan A. Suarez Romero	3ac7383843	ci: enable v3dv arm64 jobs This reverts commit `f567a832ee`. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15856>	2022-04-11 15:26:58 +00:00
Mike Blumenkrantz	809dc31223	zink: reorganize radv ci baseline categorize known fails and move remainder to top of file for better visibility Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15855>	2022-04-11 09:15:11 -04:00
Mike Blumenkrantz	5abd95434e	zink: update radv ci baseline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15855>	2022-04-11 09:12:08 -04:00
Boris Brezillon	bc58b34087	dzn: Fix loop condition in dzn_descriptor_set_copy() We need to make sure we still have descriptors to copy in the while() condition. While at it, drop the assert() checking that the number of descriptors already copied is less than the requested number. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15828>	2022-04-11 07:04:24 +00:00
Mike Blumenkrantz	7744a477e8	zink: only do swapchain update during fb setup if swapchain is active otherwise this explodes blitter operations Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15809>	2022-04-11 04:03:33 +00:00
Yonggang Luo	6d263ff5a3	util: Convert util/u_printf.cpp to util/u_printf.c By doing this to remove the need of C++ runtime when not using llvmpipe Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15659>	2022-04-11 03:31:40 +00:00
Yonggang Luo	1dca31cda6	util: Add tests for u_printf.h Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15659>	2022-04-11 03:31:40 +00:00
Mike Blumenkrantz	bcf5d2d8f4	zink: force texture barriers when performing in-renderpass clears according to spec, a barrier is required any time the pixels of the framebuffer are changed. since zink defers clears and runs them at a later time, it must also be responsible for handling the required synchronization for such operations fixes (radv): KHR-GL46.blend_equation_advanced.blend_all* fixes #5572 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15831>	2022-04-11 01:11:07 +00:00
Mike Blumenkrantz	4f6f34456a	zink: rework texture_barrier hook according to spec, for fbfetch this should match the subpass self-dependency of * stage FRAGMENT_STAGE -> FRAGMENT_STAGE * access 0 -> INPUT_ATTACHMENT_READ zs fbfetch doesn't seem to be a thing, so that code is left for historical and/or future purposes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15831>	2022-04-11 01:11:07 +00:00
Mike Blumenkrantz	aced1ac2d3	zink: add a self-dependency for fbfetch renderpasses it's necessary to have a self-dependency here so that texture barriers can be emitted during the renderpass to enable fbfetch Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15831>	2022-04-11 01:11:07 +00:00
Mike Blumenkrantz	42bb760c8d	zink: remove compiled conditional for lavapipe usage this is super annoying since it means that a build of zink cannot be mix-and-matched with an existing build of lavapipe, e.g., for faster bisecting the env var should be sufficient to handle this, and if someone sets it and doesn't have swrast enabled then they can deal with it Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15844>	2022-04-11 00:59:15 +00:00
Dave Airlie	dd078d13cb	zink: fix tessellation shader key matching. I happened to run heaven with tess enabled on zink and was seeing 6fps, and lots of shader recompiles, turns out the tess shader key was never getting matched properly. Fixes: `62b8daa889` ("zink: set shader key size to 0 for non-generated tcs") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15843>	2022-04-11 00:16:26 +00:00
Mike Blumenkrantz	ec05155c30	zink only use zs-specific layout for zs attachments otherwise this is illegal Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15833>	2022-04-11 00:05:02 +00:00
Mike Blumenkrantz	97d5ebc93e	zink: clamp cube size queries to 2 return components more validation spam eliminated Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15833>	2022-04-11 00:05:02 +00:00
Icecream95	ab3ee60282	util/hash_table: Remove Unicode byte order mark The mark can confuse utilities which do not support Unicode, and no other file has the character. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15728>	2022-04-10 23:28:47 +00:00
Mike Blumenkrantz	ee2741e451	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	ee29db0270	mesa/st: handle adding pointsize when gl_Position is never written Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	9c212e117d	nir/lower_point_size_mov: handle case where gl_Position isn't written Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	cfca760d6e	mesa/st: handle copy_deref cases for adding pointsize these may not have been lowered yet Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	a2823747bb	mesa/st: fix pointsize adding check * components is already multiplied by 4 * also verify base maxcomponents for gs fixes #6282 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	6357ce2b8e	mesa: set PointSizeIsOne on context creation forgot to add this when inverting the flag's meaning previously Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15821>	2022-04-10 16:45:15 +00:00
Mike Blumenkrantz	f567a832ee	ci: disable v3dv arm64 jobs these have been broken for almost 48 hours Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15841>	2022-04-10 12:25:39 -04:00
Icecream95	5da8c280b7	panfrost: Remove BO mapping from import BOs will be mapped when needed, so there is no need to mmap BOs when importing them. Fixes crashes when exporting a non-AFBC resource and importing it back in the same context. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15818>	2022-04-10 01:51:04 +00:00
Mihai Preda	279eea5bda	amd/llvm: Transition to LLVM "opaque pointers" For context, see LLVM opaque pointers: https://llvm.org/docs/OpaquePointers.html https://llvm.org/devmtg/2015-10/slides/Blaikie-OpaquePointerTypes.pdf This fixes the deprecation warnings in src/amd/llvm only. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15572>	2022-04-09 05:20:29 +00:00
Emma Anholt	6c0150c389	ci/zink: Mark a new GLX flake that hit an innocent MR. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15834>	2022-04-09 00:30:34 +00:00
Yiwei Zhang	a263da69ec	venus: prepare and feed renderer protocol info into cs Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15495>	2022-04-09 00:19:05 +00:00
Yiwei Zhang	440705d78f	venus: update protocol for mask helper and ignore renderer unknown pNext Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15495>	2022-04-09 00:19:05 +00:00
Yiwei Zhang	0de968f71c	venus: add cs helper stubs to be used by protocol Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15495>	2022-04-09 00:19:05 +00:00
Yiwei Zhang	2223f13b26	venus: store extension mask in renderer info Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15495>	2022-04-09 00:19:05 +00:00
Emma Anholt	be713e15ab	ci/softpipe: Mark some flakes that have appeared across a few MRs. This MR got hit by one, so I checked the logs for the recent NEW flakes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15540>	2022-04-08 23:56:11 +00:00
Emma Anholt	00e9c4f3c9	st/glsl-to-tgsi: Fix handling of csel(bool, vec, vec). We were throwing an assertion failure across shader-db on nv92. I'm guessing this is a regression from !14573. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15540>	2022-04-08 23:56:11 +00:00
Emma Anholt	9b02587c5b	ci/crocus: Disable pixmark-piano trace testing. It hangchecked for me on hsw recently, too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15832>	2022-04-08 23:32:11 +00:00
Emma Anholt	8238658a81	ci/iris: Disable pixmark-piano trace testing. It gave me a spurious hangcheck on https://gitlab.freedesktop.org/mesa/mesa/-/jobs/20946403, and this trace causes trouble for a lot of drivers so just disable it on all iris. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15832>	2022-04-08 23:32:11 +00:00
Emma Anholt	5a1d19e945	tgsi/transform: Drop a stale comment. This method returns void. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15782>	2022-04-08 21:31:44 +00:00
Emma Anholt	ef9f2e8829	tgsi/transform: Make tgsi_transform_shader() manage token allocation. Previously, the caller allocated storage and tgsi_transform_shader() would emit into that, returning how many tokens it emitted. All the callers had to guess at how much storage was necessary, trying not to over-allocate but also getting enough that you wouldn't (effectively) silently run out of space. Instead, make tgsi_transform_shader() do the allocation for you, taking just a hint of how much space you think you need, and internally double size when necessary. Fixes failures on virgl with fp64 since we've added more fp64 virglrenderer workarounds and its old "XXX: is this enough?" allocation wasn't any more. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15782>	2022-04-08 21:31:44 +00:00
Emma Anholt	e15154a735	nir_to_tgsi: Fix the address reg mapping for images and SSBOs to match G-T-T. I missed these in the previous fix to mimic GLSL-to-TGSI address reg behavior, which r600 relies on. Fixes: `4bb9c0a28a` ("nir_to_tgsi: Use the same address reg mappings as GLSL-to-TGSI did.") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15824>	2022-04-08 21:01:42 +00:00
Emma Anholt	664f69a4d5	nir_to_tgsi: Extract const components of atomic counter offsets into Index. virglrenderer maps atomic accesses to atomic counter declarations using the .Index field. We were previously emitting a .Index of 0 for array accesses, so virglrenderer would emit atomicIncrement(first_counter[counter_offset+array_index]). This would mostly work because hardware doesn't care about the bounds of counter declarations, but if the first counter was a non-array, then the [] GLSL emit gets dropped (can't array access a scalar!) and you'd access the non-array first_counter instead. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15824>	2022-04-08 21:01:42 +00:00
Emma Anholt	8dcf7646ce	r600: Add a helper function for rat_index_mode, with documentation and assert. We have some special requirements on the address regs of our incoming TGSI, which nir_to_tgsi violated. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15824>	2022-04-08 21:01:42 +00:00
Emma Anholt	9674d96820	r600: Stop using ArrayID to look up atomic counters. Even if you find a matching array ID, you still need to offset by Index compared to .start, so the ArrayID lookup didn't help. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15824>	2022-04-08 21:01:42 +00:00
Mike Blumenkrantz	0bfb3b0436	zink/kopper: don't use generated include in kopper interface this causes build race conditions Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15830>	2022-04-08 16:33:05 -04:00
Mike Blumenkrantz	b6ddf8f0a4	zink: don't generate VK_ACCESS_SHADER_READ_BIT barrier for vertex inputs this is both redundant and illegal Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15827>	2022-04-08 20:05:24 +00:00
Mike Blumenkrantz	3bd9fcfa3d	zink: don't rely on implicit access for generated barriers compute shaders will get this wrong, so calculate access manually using binding info Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15827>	2022-04-08 20:05:24 +00:00
Mike Blumenkrantz	a72f05ff96	zink: add handling for !sync2 in renderpass dependencies src/dst stage can't be zero without sync2 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15827>	2022-04-08 20:05:24 +00:00
Mike Blumenkrantz	b7e05c56a4	zink: hook up sync2 extension Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15827>	2022-04-08 20:05:24 +00:00
Mike Blumenkrantz	46bbf49607	zink: bitcast InterpolateAtOffset offset to fvec required by spec Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15827>	2022-04-08 20:05:24 +00:00
Mike Blumenkrantz	ac8337041c	zink: clamp out partial texels when creating bufferviews this is an illegal alignment, so clamp the range to the nearest texel offset since the shader should be hitting the robustness case for the partial texel affects: dEQP-GLES31.functional.texture.texture_buffer.modify.mapbuffer_readwrite.range_size_65537 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15827>	2022-04-08 20:05:24 +00:00
Mike Blumenkrantz	b3ee943050	zink: only get swapchain present semaphore on batch flush if not presented otherwise this has already been present-waited and can just be used whenever Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15820>	2022-04-08 19:54:08 +00:00
Mike Blumenkrantz	21496dea9c	zink: only get swapchain present semaphore on batch flush after acquire otherwise this was a no-op flush from the frontend that should be ignored Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15820>	2022-04-08 19:54:08 +00:00
Mike Blumenkrantz	00e846ba88	zink: unset deferred present barrier on flush don't need to flush this every frame Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15820>	2022-04-08 19:54:08 +00:00
Mike Blumenkrantz	f0a3a4d2f1	zink: only trigger deferred present barrier if swapchain has acquired otherwise this was just pointlessly flushed by the frontend and can be ignored Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15820>	2022-04-08 19:54:08 +00:00
Dave Airlie	165b016bbe	radv: use flush vgt streamout like PAL does. This uses WRITE_DATA on the ME engine to reset the register, to match what PAL does on GFX9+. This fixes KHR-GL45.transform_feedback_overflow_query_ARB.multiple-streams-one-buffer-per-stream on zink/radv. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15812>	2022-04-08 18:59:15 +00:00
Vitalii.Lomaka	1407a4db69	intel/batch-decoder: Fix uninitialized scalar variables CID: 1498516 CID: 1498560 Signed-off-by: Vitalii Lomaka <vitalii.lomaka@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15685>	2022-04-08 18:35:34 +00:00
Mike Blumenkrantz	387f6e0173	zink: don't emit SpvCapabilityStorageImageMultisample for fbfetch Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15823>	2022-04-08 18:24:30 +00:00
Mike Blumenkrantz	538472e279	zink: handle multisampled fbfetch this uses a shader key to flag the fbfetch as per-sample it doesn't work, but at least it doesn't error anymore? Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15823>	2022-04-08 18:24:30 +00:00
Mike Blumenkrantz	1dd89cc09c	zink: handle SUBPASS_MS in ntv for completeness but also maybe fbfetch Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15823>	2022-04-08 18:24:30 +00:00
Jordan Justen	4862010141	intel/dev: Add ATS-M pci-ids Ref: Bspec 44477 Ref: https://patchwork.freedesktop.org/series/101907/ Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15814>	2022-04-08 17:48:13 +00:00
Emma Anholt	949bc15ea5	nir_to_tgsi: Fix emitting the sample number for non-array MSAA image access. It's always in .w, rather than being the next component after the x/y/array index. Fixes: `c6d3fd8c21` ("gallium/ntt: Emit sample index when necessary for image load/store.") Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15825>	2022-04-08 17:17:16 +00:00
Emma Anholt	b18374002e	virgl: Disable nir_op_ffloor to avoid sending DFLR to virglrenderer. This means that we send ffract+fsub in place of a normal FLR, but hopefully virglrenderer can be fixed (or doubles support removed). Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15825>	2022-04-08 17:17:16 +00:00
Samuel Pitoiset	a652c1653a	radv: introduce new radv_pipeline_stage structure This is used to store everything for a pipeline stage like the module, the NIR, the shader arguments etc. This is inspired from ANV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15794>	2022-04-08 16:38:30 +00:00
Samuel Pitoiset	86b1a1d5f2	radv: add missing multi inclusion define to radv_shader_args.h Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15794>	2022-04-08 16:38:30 +00:00
Samuel Pitoiset	000e9ac874	radv: rework pipeline and shaders creation feedback Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15794>	2022-04-08 16:38:30 +00:00
Samuel Pitoiset	1387593fbf	radv: re-order shader stages directly in radv_create_shaders() We will have to access pStages::pNext for modules and this will also allow to rework feedback creation. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15794>	2022-04-08 16:38:30 +00:00
Rajnesh Kanwal	748c1fdf1b	pvr: Remove logic to set vk_device::alloc. The field is being already set by vk_device_init(). Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15768>	2022-04-08 15:12:04 +00:00
Iago Toral Quiroga	40f0c616e8	v3dv: fix bogus VkDrmFormatModifierProperties2EXT usage The array is allocated for VkDrmFormatModifierPropertiesEXT, so writring entried with type VkDrmFormatModifierProperties2EXT is bogus. It seems this was a mistake added with a change intended to get rid of VK_OUTARRAY_MAKE, that changed the type of the write by mistake. Fixes: `56a2ccf058` ('v3dv: Stop using VK_OUTARRAY_MAKE()') Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15819>	2022-04-08 14:43:25 +00:00
Tomeu Vizoso	30be6788cc	ci: Disable Link Power Management with RTL8153 There are reliability problems with the RTL8153 ethernet driver under certain network loads, related to incompatibility of the device with Link Power Management. Add usbcore.quirks=0bda:8153:k to the kernel command line to enable the USB_QUIRK_NO_LPM option. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15791>	2022-04-08 14:07:31 +00:00
Tomeu Vizoso	9d5fa59322	Revert "ci/freedreno: Disable a618 jobs" This reverts commit `96e17287b4`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15791>	2022-04-08 14:07:31 +00:00
Juan A. Suarez Romero	36066702ad	v3d: do not leak BO on query begin This happens when we have a sequence of multiple beginQuery / endQuery with the same target and query identifier. As a BO is created on beginQuery but not free on endQuery (we need to wait until either explicitly deleting the query or querying the results), in the above sequence we are basically leaking the BO. This ensures that any BO created before is unreference before creating the new one. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15796>	2022-04-08 13:51:25 +00:00
Erik Faye-Lund	82ca8a707e	wgl: do not disable error-dialogs by default We don't know if an application developer wants error-dialog by default or not, even when using a Mesa OpenGL driver. So let's not assume this is a good thing, and instead make an option for this behavior that we can enable for CI testing. Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15485>	2022-04-08 13:15:12 +00:00
Erik Faye-Lund	ae73a26355	util: limit error-dialogs to win32 These dialogs only exist on Windows, so let's not even expose a util function for this on other platforms. The code is only ever called from Windows specific code anyway. While we're at it, clean up the name a bit as well. Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15485>	2022-04-08 13:15:12 +00:00
Erik Faye-Lund	116553b554	wgl: rename force-msaa env-var This env-var is not specific to the SVGA driver, so we should rename it for consistency. While we're at it, let's document the feature. Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15485>	2022-04-08 13:15:12 +00:00
Timur Kristóf	94706601fa	radv: Disable NGG for GS with suboptimal output vertex count. When GS has a lot of output vertices: In this case, the occupancy of NGG GS is very low because API GS invocations can't even occupy a single Wave32 wave. Therefore the legacy pipeline performs better here. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6260 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15767>	2022-04-08 12:35:05 +00:00
Boris Brezillon	53e83b7031	dzn: Support independent depth/stencil access Needed for VK_KHR_maintenance2. While at it, fix various places where we were only issuing resource state transition on the first sub-resource instead of doing it per-layer/aspect. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:44 +00:00
Boris Brezillon	69e8a6042f	dzn: Fix 2D <-> 3D blits layer_count == 1 doesn't imply image_is_3d. So let's add explicit is_3d checks where appropriate. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:44 +00:00
Boris Brezillon	451a43ae1e	dzn: Lower partial copy of multisample resources to blits Unfortunately that won't work on transfer queues, but we don't have a better option right now. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:44 +00:00
Boris Brezillon	f0667be8b5	vulkan/util: Make STACK_ARRAY() C++-friendly C++ doesn't like the {0} initializer pattern when the first field is not an integer, let's use the {} when __cplusplus is defined. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:44 +00:00
Boris Brezillon	8d30204ca4	dzn: Drop extra blank line in dzn_CmdCopyImage2() Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	653c362ca6	dzn: Check image view usage instead of image usage when creating an image view So we take VkImageViewUsageCreateInfo extension instead of ignoring it. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	7ecc22ccaa	dzn: Force sampleCounts to 1 for bgra4 images Those are not expected to be used as render-target, and Vulkan mandates that such formats get their sampleCounts set to VK_SAMPLE_COUNT_1_BIT. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	84ad923482	dzn: Get rid of dzn_GetPhysicalDeviceProperties() Rely on the vk_common_ wrapper. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	9a804b6390	dzn: Get rid of dzn_GetPhysicalDeviceFeatures() And rely on the vk_common_ wrapper to get it implemented. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	e9c69fe39a	dzn: 3D array images don't exist Let's force maxArrayLayers to one in that case. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	ad0ac592be	dzn: Set bufferFeatures to zero on depth/stencil formats Those are not supposed to advertise buffer features. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	8dfab9b382	dzn: Make sure the properties are all zero when the format is not supported Move one of the is_supported() check before we start filling the structure so we don't end up with a partially filled object when we return VK_ERROR_FORMAT_NOT_SUPPORTED (which deqp doesn't seem to like, so it's probably coming from a spec requirement). Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	dc3dd9690b	dzn: Return a valid imageFormatProperties.maxMipLevels maxMipLevels is encoding the maximum number of MIP levels, but dzn_physical_device_get_max_mip_levels() return the maximum MIP level. Let's rename the function and add one to the returned value to fix the problem. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	cfd3dfc074	dzn: Fix 3D <-> 2D image copies We just need to treat layers as slices when manipulating 3D resources whose slices are coming from/going to 2D array layers. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	94923021d7	dzn: Support 2Darray views on 3D images for color attachments Those are declared as 3D RTVs in D3D12, and layers are treated as slices. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	3684cae58c	dzn: Fix dzn_image_get_rtv_desc() for 3D views VK_REMAINING_ARRAY_LAYERS maps to -1 in the D3D12 world. Let's make sure we set WSize to -1 in that case, because the layer_count calculated by dzn_get_layer_count() won't work for 3D images which never have more than one layer (in case of 3D images, we treat slices as layers). Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	9f5831bbec	dzn: Replace C++ references by pointers Let's keep as much as we can in plain C. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	1401f62069	dzn: Align the default case in dzn_image_view_prepare_dsv_desc() Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	1692d8f0f6	dzn: Don't crash when EndCommandBuffer() returns an error Leftover from a debug session. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	c937152756	dzn: Fix dzn_translate_viewport() when height < 0 Since negative height is not a thing in D3D12, we need to adjust the TopLeftY accordingly. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Boris Brezillon	05b6c1ed84	dzn: Fix pipeline creation when rasterization is disabled We use some of the VkGraphicsPipelineCreateInfo fields that should be ignored when rasterization in disabled, assuming those who be set to NULL by the caller in that case, which is not mandated by the Vulkan specification. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15698>	2022-04-08 11:54:43 +00:00
Corentin Noël	c59fc44114	virgl/ci: Uprev virglrenderer and crosvm Use a patch file for crosvm instead of relying on private repositories for faster uprevs. Use latest virglrenderer to keep the tests in-sync. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15772>	2022-04-08 11:28:27 +00:00
Corentin Noël	9a88231458	ci: Only apply patches with the build-skqp prefix Allows to ship patches for other components too. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15772>	2022-04-08 11:28:27 +00:00
Samuel Pitoiset	8db9b175a5	radv: stop relying on shader modules after SPIRV->NIR Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15766>	2022-04-08 09:50:31 +02:00
Samuel Pitoiset	b2568be1de	radv: stop passing the module to the compiler debug callback After SPIRV->NIR, the driver shouldn't rely on the module. This will still report messages via VK_EXT_debug_report but the object will be NULL. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15766>	2022-04-08 09:50:29 +02:00
Samuel Pitoiset	0835065260	radv: drop the module reference for enable_mrt_output_nan_fixup Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15766>	2022-04-08 09:50:26 +02:00
Samuel Pitoiset	0411bb1297	radv: drop the module reference in radv_can_dump_shader_stats() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15766>	2022-04-08 09:50:25 +02:00
Samuel Pitoiset	a434097453	radv: drop the module reference in radv_can_dump_shader() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15766>	2022-04-08 09:50:22 +02:00
Samuel Pitoiset	e11712a0a3	radv: copy the spirv module for debugging after compilation Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15766>	2022-04-08 09:50:20 +02:00
Samuel Pitoiset	115fd6dd8e	radv: remove more references to the pipeline layout during compilation Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15749>	2022-04-08 07:00:34 +00:00
Samuel Pitoiset	2c97e79473	radv: lower ycbcr textures just before applying the pipeline layout This shouldn't change anything. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15749>	2022-04-08 07:00:34 +00:00
Samuel Pitoiset	ace073eb0b	radv: assert that the arg is declared when used in get_scalar_arg() Help debugging. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15789>	2022-04-08 06:40:52 +00:00
Samuel Pitoiset	a0f3839ce8	radv: add radv_is_vrs_enabled() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15792>	2022-04-08 06:21:22 +00:00
Samuel Pitoiset	59466d40a3	radv: add a new helper to initialize various type of pipelines This is common to graphics, compute and library pipelines. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15792>	2022-04-08 06:21:22 +00:00
Samuel Pitoiset	6117612189	radv: add radv_generate_pipeline_key() for common graphics/compute keys Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15792>	2022-04-08 06:21:22 +00:00
Samuel Pitoiset	e74217d5a7	radv: remove unused parameters in radv_get_{wave,ballot_bit}_size() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15792>	2022-04-08 06:21:22 +00:00
Samuel Pitoiset	465b530a15	radv: use radv_pipeline_has_ds_attachments() more Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15792>	2022-04-08 06:21:22 +00:00
Samuel Pitoiset	ac6dbb8c7b	radv: do not check if VkPipelineRenderingCreateInfo is NULL The driver converts legacy render pass to dynamic rendering, so this structure should always be in pNext. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15792>	2022-04-08 06:21:22 +00:00
Iago Toral Quiroga	cf4b3cb563	broadcom/compiler: prefer reconstruction over TMU spills when possible We have been reconstructing/rematerializing uniforms for a while, but we can do this in more scenarios, namely instructions which result is immutable along the execution of a shader across all channels. By doing this we gain the capacity to eliminate TMU spills which not only are slower, but can also make us drop to a fallback compilation strategy. Shader-db results show a small increase in instruction counts caused by us now being able to choose preferential compiler strategies that are intended to reduce TMU latency. In some cases, we are now also able to avoid dropping thread counts: total instructions in shared programs: 12658092 -> 12659245 (<.01%) instructions in affected programs: 75812 -> 76965 (1.52%) helped: 55 HURT: 107 total threads in shared programs: 416286 -> 416412 (0.03%) threads in affected programs: 126 -> 252 (100.00%) helped: 63 HURT: 0 total uniforms in shared programs: 3716916 -> 3716396 (-0.01%) uniforms in affected programs: 19327 -> 18807 (-2.69%) helped: 94 HURT: 50 total max-temps in shared programs: 2161796 -> 2161578 (-0.01%) max-temps in affected programs: 3961 -> 3743 (-5.50%) helped: 80 HURT: 24 total spills in shared programs: 3274 -> 3266 (-0.24%) spills in affected programs: 98 -> 90 (-8.16%) helped: 6 HURT: 0 total fills in shared programs: 4657 -> 4642 (-0.32%) fills in affected programs: 130 -> 115 (-11.54%) helped: 6 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15710>	2022-04-08 05:37:28 +00:00
Connor Abbott	32af90d96f	freedreno/a6xx: Fix SP_DS_CTRL_REG0 definition Bit 20 isn't actually MERGEDREGS, the mode for the entire geometry pipeline is controlled by SP_VS_CTRL_REG0::MERGEDREGS and it appears to be something preamble-related instead since writing any register in the preamble hangs if it's set. This fixes those hangs on freedreno and turnip since we no longer set it. Fixes: `fccc35c2de` ("ir3: Add preamble optimization pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15801>	2022-04-08 04:40:17 +00:00
Mike Blumenkrantz	80683943d1	mesa/st: simplify st_can_add_pointsize_to_program iterator Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:10 -04:00
Mike Blumenkrantz	16f08ad469	mesa/st: don't precompile the pointsize upload variant anymore this is no longer likely to be used, so precompile the base variant now also delete some now-unused code Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:10 -04:00
Mike Blumenkrantz	4f027fbff2	mesa/st: only flag pointsize constant uploads if they're needed now that shaders are guaranteed to have a pointsize export, the only time the variant using the uploaded constant is needed is when pointsize != 1.0 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:10 -04:00
Mike Blumenkrantz	f964881fcc	mesa/st: only use constant upload pointsize variants if pointsize != 1.0 it's not that common for apps to need varying pointsize, so now that shaders are guaranteed to have the export in the shader, the constant version is almost never used Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:10 -04:00
Mike Blumenkrantz	2b626af110	mesa/st: also add pointsize to fixedfunction vertex shaders as needed Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:10 -04:00
Mike Blumenkrantz	d773055d92	mesa/st: always inject a 1.0 pointsize for vertex stages since 1.0 is used in nearly every case, drivers requiring this exporting can avoid potential shader variants by adding a 1.0 export to the base shader variant and the only using the ubo upload when pointsize is explicitly set for wide point functionality drivers can then be responsible for removing unused pointsize exports as needed (or desired) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:09 -04:00
Mike Blumenkrantz	3aa449ff72	mesa/st: declare added pointsize var as hidden ensure this isn't counted as part of the shader and ignored for e.g., glGetProgramInterfaceiv Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:09 -04:00
Mike Blumenkrantz	310903d096	nir/lower_point_size_mov: fix check for overwriting existing pointsize this should match the comment and allow overwriting injected pointsize variables regardless of whether xfb is flagged Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:09 -04:00
Mike Blumenkrantz	02b573e03e	mesa: add a bool indicating when pointsize == 1.0 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15699>	2022-04-07 21:56:09 -04:00
Emma Anholt	75a4e3f0e8	Revert "ci/freedreno: Reduce concurrency when replaying traces on a630" This reverts commit `d948f32365`. I think that fixing the timeout will have resolved this problem. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15805>	2022-04-08 00:24:20 +00:00
Emma Anholt	d51aea7f57	freedreno: Fix the cpu-prep wait to be "infinite". We don't need to restrict our timeout to 5 seconds, because the kernel's hangcheck will ensure that the wait completes in finite time if the GPU gets wedged. If the GPU is making progress, we don't want to time out early and have pipe_transfer_map() return an error, causing glReadPixels() to throw a confusing GL_OOM even though we're not out memory. The INFINITE arg to this function isn't actually infinite, it's limited to an hour. But an hour of GPU processing to wait on is probably plenty. This 5s timeout has caused problems with the CTS on freedreno at high parallelism, and I suspect is the cause of recent issues in the closed traces replay jobs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15805>	2022-04-08 00:24:20 +00:00
Timothy Arceri	7d216f296a	glsl: fix needs_lowering() call in varying packing pass Here we remove the outer arrays on geom and tess shaders where needed. Without this the pass can sometimes attempt to pack a varying on only one side of the shader interface where it is not actually needed. The result can be mismatching varying types. Fixes: `d6b9202873` ("glsl: disable varying packing when its not safe") Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15761>	2022-04-07 23:57:40 +00:00
Mike Blumenkrantz	b3abd3db33	docs: update features for lavapipe Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15808>	2022-04-07 23:06:05 +00:00
Mike Blumenkrantz	3030e5baaf	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15808>	2022-04-07 23:06:05 +00:00
Mike Blumenkrantz	4bb45bcd16	zink: add error logging for SRGB framebuffer without KHR_swapchain_mutable_format this is going to explode, so at least print an error explaining why Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15808>	2022-04-07 23:06:05 +00:00
Mike Blumenkrantz	bbede22850	lavapipe: KHR_swapchain_mutable_format it Just Works(tm) Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15808>	2022-04-07 23:06:05 +00:00
Mike Blumenkrantz	fde2c1db88	zink: adds refs to user index buffers when tc is not active there are no ref tricks to abuse in this case, so add our own ref fixes #6273 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15807>	2022-04-07 22:10:35 +00:00
Mike Blumenkrantz	9187af41b8	zink: allow lod for RECT sampler types the RECT is a lie, so this is fine Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15804>	2022-04-07 21:37:58 +00:00
Mike Blumenkrantz	6cfcf891c1	nir/lower_tex: avoid adding invalid LOD to RECT textures this is illegal Fixes: `74ec2b12be` ("nir/lower_tex: Rework invalid implicit LOD lowering") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15804>	2022-04-07 21:37:58 +00:00
Mike Blumenkrantz	dff8813246	zink: set nir_shader_compiler_options::has_txs Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15804>	2022-04-07 21:37:58 +00:00
Jason Ekstrand	4cd260590c	nir: Dont set coord_components on txs Fixes: `e1fc23265f` ("nir: Add a pass for lowering CL-style image ops to texture ops") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15758>	2022-04-07 21:13:35 +00:00
Mike Blumenkrantz	d09ee469f0	kopper: add a dmabuf-free image interface for use with sw drivers sw drivers don't support modifiers or dmabufs or any of that, so separate interfaces are needed to avoid advertising extensions that will only lead to crashes Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15802>	2022-04-07 20:08:10 +00:00
Erik Faye-Lund	31824d4213	dzn: add missing space Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	08e4b28a05	dzn: drop unused include Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	4ae6c34a5a	dzn: drop incorrect return statement We're returning nothing here. Let's not do that. Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	600d99650b	dzn: drop unused header Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	76b0023c06	dzn: remove unused variable Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	96bbcb1e48	dzn: fixup indent Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	f3ce7d8561	dzn: add D3D12_IGNORE_SDK_LAYERS define Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	659e26285a	dzn: drop needless includes These include either dzn_nir.h or dzn_internal.h which already includes this header. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	bda5852846	dzn: remove unused struct Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Erik Faye-Lund	f7b0cfe04a	dzn: remove needless using Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15790>	2022-04-07 19:53:03 +00:00
Mike Blumenkrantz	1e3b96913c	zink: handle swapchain readbacks when a present is pending if a present semaphore is pending, don't try to create another one Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15784>	2022-04-07 19:22:14 +00:00
Mike Blumenkrantz	ed07721d09	zink: only apply swapchain behavior in flush_resource for swapchain images these could otherwise be pixmaps or dmabufs Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15784>	2022-04-07 19:22:14 +00:00
Mike Blumenkrantz	127d7aeb6c	zink: handle deferred swapchain resource flushing if the swapchain image hasn't been acquired yet, flag it as a deferred present so it can be picked up on flush Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15784>	2022-04-07 19:22:14 +00:00
Mike Blumenkrantz	3c4be122cc	egl: implement more hooks for swrast these just need to use swapbuffers instead of flush Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15784>	2022-04-07 19:22:14 +00:00
Benjamin Cheng	0666b7fecc	anv: drop from_wsi bit from anv_image It was originally introduced in `ca791f5c` but it was never actually set anywhere. It doesn't serve any purpose other than some sanity checking so let's clean it up for now. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15799>	2022-04-07 18:46:50 +00:00
Ian Romanick	b5fa43952a	intel/fs: Better handle constant sources of FS_OPCODE_PACK_HALF_2x16_SPLIT I noticed that a LOT of fragment shaders in Shadow of the Tomb Raider, for instance, end up with a sequence of NIR like: vec1 32 ssa_2 = load_const (0x00000000 = 0.000000) ... vec1 32 ssa_191 = pack_half_2x16_split ssa_188, ssa_2 vec1 32 ssa_192 = pack_half_2x16_split ssa_189, ssa_2 vec1 32 ssa_193 = pack_half_2x16_split ssa_190, ssa_2 This results in an assembly sequence like: mov(8) g28<1>UD 0x00000000UD mov(8) g21<2>HF g28<8,8,1>F shl(8) g21<1>UD g21<8,8,1>UD 0x00000010UD mov(8) g21<2>HF g25<8,8,1>F mov(8) g19<2>HF g28<8,8,1>F shl(8) g19<1>UD g19<8,8,1>UD 0x00000010UD mov(8) g19<2>HF g23<8,8,1>F mov(8) g20<2>HF g28<8,8,1>F shl(8) g20<1>UD g20<8,8,1>UD 0x00000010UD mov(8) g20<2>HF g24<8,8,1>F After this commit, the generated assembly is: mov(8) g21<1>UD 0x00000000UD mov(8) g21<2>HF g23<8,8,1>F mov(8) g19<1>UD 0x00000000UD mov(8) g19<2>HF g17<8,8,1>F mov(8) g20<1>UD 0x00000000UD mov(8) g20<2>HF g18<8,8,1>F Tiger Lake, Ice Lake, Skylake, and Haswell had similar results. (Ice Lake shown) total instructions in shared programs: 20119086 -> 20119034 (<.01%) instructions in affected programs: 9056 -> 9004 (-0.57%) helped: 8 HURT: 0 helped stats (abs) min: 2 max: 16 x̄: 6.50 x̃: 4 helped stats (rel) min: 0.29% max: 1.75% x̄: 1.00% x̃: 0.98% 95% mean confidence interval for instructions value: -11.01 -1.99 95% mean confidence interval for instructions %-change: -1.56% -0.44% Instructions are helped. total cycles in shared programs: 861019414 -> 861021044 (<.01%) cycles in affected programs: 279862 -> 281492 (0.58%) helped: 4 HURT: 2 helped stats (abs) min: 6 max: 936 x̄: 239.00 x̃: 7 helped stats (rel) min: 0.03% max: 8.13% x̄: 2.09% x̃: 0.09% HURT stats (abs) min: 18 max: 2568 x̄: 1293.00 x̃: 1293 HURT stats (rel) min: 0.36% max: 1.14% x̄: 0.75% x̃: 0.75% 95% mean confidence interval for cycles value: -972.56 1515.89 95% mean confidence interval for cycles %-change: -4.77% 2.49% Inconclusive result (value mean confidence interval includes 0). Broadwell total instructions in shared programs: 17812327 -> 17812263 (<.01%) instructions in affected programs: 9867 -> 9803 (-0.65%) helped: 8 HURT: 0 helped stats (abs) min: 2 max: 28 x̄: 8.00 x̃: 4 helped stats (rel) min: 0.32% max: 1.80% x̄: 1.00% x̃: 0.95% 95% mean confidence interval for instructions value: -15.46 -0.54 95% mean confidence interval for instructions %-change: -1.54% -0.47% Instructions are helped. total cycles in shared programs: 904768620 -> 904773291 (<.01%) cycles in affected programs: 454799 -> 459470 (1.03%) helped: 4 HURT: 4 helped stats (abs) min: 36 max: 586 x̄: 344.50 x̃: 378 helped stats (rel) min: 0.47% max: 4.04% x̄: 2.01% x̃: 1.77% HURT stats (abs) min: 1 max: 5572 x̄: 1512.25 x̃: 238 HURT stats (rel) min: <.01% max: 2.77% x̄: 1.46% x̃: 1.53% 95% mean confidence interval for cycles value: -1122.40 2290.15 95% mean confidence interval for cycles %-change: -2.26% 1.71% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 18581 -> 18579 (-0.01%) spills in affected programs: 323 -> 321 (-0.62%) helped: 1 HURT: 0 total fills in shared programs: 24985 -> 24981 (-0.02%) fills in affected programs: 1348 -> 1344 (-0.30%) helped: 1 HURT: 0 Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 143585431 -> 143513657 (-0.0%) Instructions helped: 14403 Cycles in all programs: 8439312778 -> 8439371578 (+0.0%) Cycles helped: 10570 Cycles hurt: 3290 Gained: 146 Lost: 74 All of the lost and gained fossil-db shaders are SIMD32 fragment shaders. 14,247 of the affected shaders are from Shadow of the Tomb Raider. 154 are from Batman Arkham Origins, and the remaining two are from Octopath Traveler. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15089>	2022-04-07 18:26:23 +00:00
Alyssa Rosenzweig	1fb4427a7a	pan/bi: Imply round mode most of the time Much less noisy, and provides a path to further improvements. There is a slight behaviour change: int-to-float conversions now use RTE instead of RTZ. For 32-bit opcodes, this affects conversions of integers with magnitude greater than 2^23 by at most 1 ulp. As this behaviour is unspecified in GLSL, this change is believed to be acceptable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	a747708b9d	pan/bi: Use should_skip in bi_builder generation To avoid further code duplication. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	de37f75554	pan/bi: Mark some opcodes as default round-to-zero Conversions to integer have different rounding rules. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	24d072fd6a	pan/bi: Don't use funny round modes in tests To prepare for defeaturing round modes, replace uses of round-to-positive with round-to-even in our unit tests. This doesn't meaningfully impact test coverage; there is no way to generate that round mode. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15187>	2022-04-07 18:03:57 +00:00
Alyssa Rosenzweig	b5026c3d7c	panfrost: Use track_image_access on Bifrost Equivalent logic, as previously extracted. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	8689a88d10	panfrost: Add helpers to emit Valhall data structures Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	0d48a57452	panfrost: Adapt viewport/scissor to Valhall Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	e4edb1a53f	panfrost: Hide some Bifrost-specific functions Pertains to data structures removed in Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	58934dd666	panfrost: Add valhall_has_blend_shader field Required in a hot path for silly historical reasons, so add a field to save a pre-computed value thereof. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	fed04de481	panfrost: Add Valhall fields to panfrost_batch There are new data structures that we need to (dirty) track. Add the corresponding fields so we can proceed as with Bifrost dirty tracking. Trivial increase in memory usage, but that should not matter as batches are few. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	270e3e02f2	panfrost: Split out allow_fpk helper For sharing between Bifrost's renderer state descriptor and Valhall's draw call descriptor, which require the same logic in different places. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	ba395fc94c	panfrost: Split out panfrost_get_blend_shaders The same logic is useful on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:38 +00:00
Alyssa Rosenzweig	c408f7551c	panfrost: Specialize vertex state for Valhall Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:37 +00:00
Alyssa Rosenzweig	0795f3d7e4	panfrost: Add a pool to sampler_view So we can allocate temporary sampler views for writeable images on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:37 +00:00
Alyssa Rosenzweig	0299565f72	panfrost: Adapt panfrost_rasterizer for v9 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:37 +00:00
Alyssa Rosenzweig	9a521433ae	panfrost: Don't set a default for blend count Unnecessary. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:37 +00:00
Alyssa Rosenzweig	7d1d7cdf57	panfrost: Don't check alpha test in fs_required on Bifrost+ Alpha testing is only native on Midgard. Saves a few instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15797>	2022-04-07 17:43:37 +00:00
Ian Romanick	c08302670b	intel/compiler: Fix sample_d messages on DG2 DG2 can only do sample_d and sample_d_c on 1D and 2D surfaces. The maximum number of gradient components and coordinate components should be 2. In spite of this limitation, the Bspec lists a mysterious R component before the min_lod, so the maximum coordinate components is 3. Fixes the following Vulkan CTS failures on DG2: dEQP-VK.glsl.texture_functions.texturegradclamp.isampler1d_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.isampler2d_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1d_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1d_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2d_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2d_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler1d_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler2d_fragment The Fixes: tag below is a bit misleading. This commit fixes some test cases similar to ones fixed by the Fixes: commit. I just want to make sure this commit gets applied everywhere that commit was also applied. Fixes: `635ed58e52` ("intel/compiler: Lower txd for 3D samplers on XeHP.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15781>	2022-04-07 17:09:28 +00:00
Boris Brezillon	07af51da9d	dzn: Pass a NULL ralloc context to dxil_create_validator() instance is not allocated with ralloc, we can't pass it as a ralloc context to dxil_create_validator(). Fixes: `09c2016d6b` ("dzn: use dxil_validator") Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15787>	2022-04-07 16:55:02 +00:00
Jason Ekstrand	5d198782a0	vulkan,docs: Add documentation for Vulkan dispatch Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15472>	2022-04-07 16:32:21 +00:00
Jason Ekstrand	dd340ce1a1	vulkan,docs: Document vk_device Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15472>	2022-04-07 16:32:21 +00:00
Jason Ekstrand	6073610d7a	vulkan,docs: Document vk_physical_device Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15472>	2022-04-07 16:32:21 +00:00
Jason Ekstrand	f6d4641433	vulkan,docs: Document vk_instance Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15472>	2022-04-07 16:32:21 +00:00
Jason Ekstrand	f06fa8f7e0	vulkan,docs: Document vk_object_base Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15472>	2022-04-07 16:32:21 +00:00
Jason Ekstrand	0ca8b95824	vulkan: vk_object_base_init/finish have no unused parameters Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15472>	2022-04-07 16:32:21 +00:00
Jason Ekstrand	13fc698cef	anv/formats: Relax usage checks if EXTENDED_USAGE_BIT is set Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14153>	2022-04-07 15:56:33 +00:00
Roman Stratiienko	9b28b76e85	android: Set max platform-sdk-version to 10000 platform-sdk-version::max require updating every time new Android releases. It makes sense to set this value to some unreachable maximum value to avoid such patches. Such approach will allow to use stable mesa3d without any changes with newest Android as well. As was suggested by Yiwei Zhang we can use value 10000 that corresponds to CUR_DEVELOPMENT [1] definition which may be used by AOSP/master in between of Android major releases. [1]: https://developer.android.com/reference/android/os/Build.VERSION_CODES#CUR_DEVELOPMENT Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15747>	2022-04-07 15:33:48 +00:00
Alyssa Rosenzweig	0510e2373e	panfrost: Split out image access tracking This logic is not device-specific. It is pulled from our existing Bifrost image implementation and will be reused for Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:05 +00:00
Alyssa Rosenzweig	5d187e9cad	panfrost: Add helpers to set batch masks This logic is not device specific and will be used for both Bifrost and Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	30c14f54cf	panfrost: Disable PIPE_CAP_PRIMITIVE_RESTART on v9 Valhall removed the ability to set an explicit primitive restart index as required by desktop OpenGL, in favour of fixed primitive restart indices only as required by OpenGL ES. Set the CAPs accordingly so that mesa/st lowers unusual primitive restart indices at draw call time. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	dd735bcb69	panfrost: Make alpha=0 NOP / 1 store Bifrost only These fields were removed in Valhall in favour of a simpler overdraw mechanism. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	bb8a9038ff	panfrost: Move assign_vertex_buffer to pan_helpers Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	e39f9aa883	panfrost: Hide AFBC on Valhall The relevant data structures have been shuffled a bit. We need to wire up AFBC for Valhall; however, that's out of scope for the initial bring up. Just hide it so we can build. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	76e0a7c49e	panfrost: Adapt pan_shader.h for Valhall Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	4d04437a3b	panfrost: Add shader_stage helper For Valhall, which specifies these in the shader program descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	783d27645c	panfrost: Add panfrost_make_resource_table helper For Valhall drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	3baceb0ca4	panfrost: Hide parts of pan_encoder.h for Valhall These pertain to data structures that no longer exist. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	d11351c616	panfrost: Control tiler memory usage Ensure we don't hit OOM when rendering at 8192x8192 on Valhall by disabling the smallest bin size of the hierarchy mask. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	586c3b9e35	panfrost: Handle stencil texturing on Valhall Use a Bifrost compatible path. It's not clear this is optimal but it passes the tests and is no worse than what we do on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Alyssa Rosenzweig	db20152c8a	panfrost: Handle Valhall texturing Surface descriptors have been replaced by plane descriptors, which facilitate the intermediate layout of textures. This allows for more sophisticated handling of texture compressions, of particular to interest to copy_image. However, it requires a considerable amount of new logic to handle. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15795>	2022-04-07 15:11:04 +00:00
Krunal Patel	f21d6e18bc	frontend/va: Create decoder once the max_references is updated Issue: When a video is decoded where the max_references is updated the decoder keeps using same old value. This results into green patches and decoding is not proper. Root Cause: The max_references is updated only once when the instance is created for the first time. Fix: Added a check along with the context->decoder to check if the context->templat.max_references has changed. If yes, then we go ahead and create the decoder again. Reviewed-by: Leo Liu <leo.liu@amd.com> Signed-off-by: Krunal Patel <krunalkumarmukeshkumar.patel@amd.corp-partner.google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15750>	2022-04-07 14:48:37 +00:00
Mike Blumenkrantz	2f8d11463f	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	527deb91ff	zink: export PIPE_CAP_SEAMLESS_CUBE_MAP_PER_TEXTURE Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	dac830b5ad	zink: run the cubemap -> array compiler pass if the shader key is set Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	f97014c3da	zink: handle nonseamless cube sampler binding now when a cube is sampled with a nonseamless sampler, the relevant shader stage is flagged for a variant update, the Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	be6d4b8583	zink: create an array view for all cube samplerviews Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	55f90c5fa4	zink: set nonseamless hint for sampler states Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	5b2d0cca17	zink: handle shader key variants that have nonseamless cubemaps this is just the program-based handling, which means finding the right variant and/or creating new ones based on matching a 32bit mask indicating which textures must be rewritten Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	a06b0e0d21	zink: specify struct member name when copying inline uniforms for gfx variants avoid memory mismatch if inline uniform values aren't first member of struct Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	be1b36d631	zink: support nir_op_imod Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	2d745904ca	zink: add a gently mangled version of the d3d12 cubemap -> array compiler pass this differs in that it doesn't handle images and also doesn't filter based on sampler type, instead using a 32bit mask for determining which samplers to rewrite also zink doesn't use sampler derefs Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Mike Blumenkrantz	dc3f62e0c2	zink: rename a variable no functional changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15536>	2022-04-07 14:36:25 +00:00
Alyssa Rosenzweig	813d355e9e	pan/va: Add LD_TILE.v3.f16 packing test This tests the staging register behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	2b57303eaf	pan/bi: Consider flow control in DCE We don't want to remove instructions like `NOP.wait` on Valhall; this would be tantamount to deleting barriers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	6e69c3369c	pan/bi: Don't lower vertex_id for malloc IDVS Based on hardware behaviour, it appears vertex_id is zero-based with the legacy geometry flow but not with the new malloc IDVS flow. Since the geometry flow is per-shader (not per-machine), there's not a good way to communicate this to NIR. Rather than trying to shoehorn this obscure detail into NIR, just do the lowering ourselves instead of in NIR. It's not much more code anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ccdec68aee	pan/bi: Report whether workgroups can be merged This flag gates a Valhall hardware optimization for compute shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	170d5a012e	pan/bi: Avoid masked writes for now Our swizzle lowering optimizations depend on replication of scalar fp16. This holds on Bifrost (at least for now), but not on Valhall which has proper support for write masks. For now, enforce Bifrost-compatible behaviour as we do not make use of the write masks on Valhall yet. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ba5b63f642	pan/bi: Generate LD_BUFFER on Valhall Replace LOAD.ubo with LD_BUFFER since the .ubo segment doesn't exist on Valhall. We could do this with a lowering pass instead but this is probably fine. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f487c09045	pan/bi: Make psiz variants Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9497a6a3c9	pan/bi: Lower gl_PointSize to FP16 on Valhall It is unclear if FP32 point sizes are supported on Valhall -- I can't get the DDK to use them at any rate. Always lower them to FP16 and store them as FP16 for hardware use. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	8e6f97b5fc	pan/bi: Force psiz to mediump To match driver behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	90d3f55aff	pan/bi: Set table for Valhall LD_ATTR Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	f79e33f82f	pan/bi: Emit Valhall-style varying stores Varying stores was changed in Valhall. Rather than using attribute descriptors like on Bifrost and Midgard, on Valhall we store to memory directly with hardware-allocated buffers. This requires a new implementation of store_output, with special provisions for writing gl_PointSize from a position shader. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	14e7796d4c	pan/bi: Emit Valhall-style varying loads Memory-allocated IDVS requires special varying load instructions that take an offset into the hardware-allocated varying buffer, as opposed to a varying slot. Emit these instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a1d5bf0a7a	pan/bi: Track whether the malloc IDVS flow is used This affects what instructions the fragment shader uses. Will be used for the legacy geometry flow in blit shaders. Whether that is a good idea remains to be seen, admittedly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9758555481	pan/bi: Handle Valhall texturing in helper analysis Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	ae79f6765a	pan/bi: Emit Valhall texture instructions Valhall uses an updated version fo the TEXC path. To avoid disrupting the existing Bifrost code, add a new Valhall-specific texture path that generates the new-style texture instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	9091b6261b	pan/bi: Specialize BLEND emit for Valhall Fewer arguments compared to Bifrost; the corresponding information is encoded in a Valhall-specific blend shader prologue instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a8afe6f7fb	pan/bi: Waits before tilebuffer access on Valhall On Bifrost, this is handled in the scheduler. Until we grow a Valhall scheduler, add a NOP with the appropriate flow control. This is correct but carries a small performance cost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	fe9cf1d0a4	pan/bi: Fix spilling on Valhall We need a slightly different idiom on Valhall, since the segment modifiers no longer exist but we now have an immediate offset. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:45 +00:00
Alyssa Rosenzweig	a2916aa934	pan/bi: Mark LD_TILE as w=format This tracks register usage more precisely for LD_TILE, which is an encoding difference on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:44 +00:00
Alyssa Rosenzweig	b371e509da	panfrost: Add a table for images For the default Valhall ABI. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15793>	2022-04-07 14:20:44 +00:00
Mike Blumenkrantz	736f3fdadd	radv: improve failure logging for amdgpu on init Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15771>	2022-04-07 13:48:34 +00:00
Alyssa Rosenzweig	0864b15047	pan/va: Allow small constants in register pairs They are zero extended 32->64-bit. Allow this. Noticed debugging spilling on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:28:43 -04:00
Alyssa Rosenzweig	862a19aa4b	pan/va: Add flow control lowering pass Something an instruction has two logic flow controls, namely wait + reconverge. These are orthogonal -- we need to insert a NOP to handle this. Add a lowering pass that works out flow control to replace the ad hoc previous va_pack_flow. Fixes dEQP-GLES31.functional.ssbo.layout.single_basic_type.shared.lowp_vec3. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	4f5e0e1874	pan/va: Don't truncate slots Causes BARRIER not to work. Fixes: `f45654af59` ("pan/va: Add packing routines") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	9b727944a0	pan/va: Model image load instructions These use the attribute pipe, the new versions of LD_ATTR_TEX, but reading texture descriptors instead of attribute descriptors unlike their Bifrost predecessors. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	12da32c31f	pan/va: Pack LEA_TEX_IMM Mostly automatic. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	1f4cb6d99f	pan/va: Add indirect LEA_{ATTR, TEX} For parity with Bifrost. We might need these for images. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Alyssa Rosenzweig	c6fdafe5ea	pan/bi: Model Valhall image loads Like LD_ATTR_TEX. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15756>	2022-04-07 09:27:32 -04:00
Lionel Landwerlin	b5031bd6f7	intel/nir: don't report progress on rayqueries if no queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c78be5da30` ("intel/fs: lower ray query intrinsics") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15769>	2022-04-07 08:24:19 +00:00
Danylo Piliaiev	dde1623ed2	turnip: Implement VK_EXT_primitives_generated_query Similar to pipeline statistics but done for a single counter. We use REG_A6XX_RBBM_PRIMCTR_7 to get generated primitives and not PRIMCTR_8 because PRIMCTR_7 counts pre-clipped prims while PRIMCTR_8 counts them after clipping. OpenGL spec for GL_PRIMITIVES_GENERATED says: "Subsequent rendering will increment the counter once for every vertex that is emitted from the geometry shader, or from the vertex shader if no geometry shader is present." Passes tests: dEQP-VK.transform_feedback.primitives_generated_query.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15746>	2022-04-07 08:01:59 +00:00
Samuel Pitoiset	5ac8f10ec3	radv: mark all states declared dynamic at pipeline creation It will be easier to merge dynamic states from libraries this way. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15764>	2022-04-07 06:16:20 +00:00
Samuel Pitoiset	c7ae87d7af	radv: add a new helper to determine if rasterization is enabled Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15764>	2022-04-07 06:16:20 +00:00
Samuel Pitoiset	3724e09609	radv: fix dynamic raster discard with VK_EXT_depth_clip_control Fixes: `43e83949dc` ("radv: implement VK_EXT_depth_clip_control") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15764>	2022-04-07 06:16:20 +00:00
Mike Blumenkrantz	fddb294cdf	zink: update ci list Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	df700317ab	zink: ci fixup Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	ab1941fc0e	zink: handle zombie swapchains inject a dummy, matching image that can be used until the frontend catches up and rescues us from whatever is happening Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	d32a897f4e	driconf: add override for Xwayland zink needs this to avoid deadlocking on startup Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	48d63705d9	zink: export PIPE_CAP_DEVICE_RESET_STATUS_QUERY this can work now Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	7f56fd9655	zink: it's kopperin' time Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	8ade5588e3	zink: add kopper api Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	a2711e47af	zink: check whether clear is enabled before applying in unbind this is implicit, but make it explicit Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	4deba8cc96	zink: move variable decl up in unbind_fb_surface Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	ed343b415e	zink: pass index to unbind_fb_surface Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	513fcc37d9	zink: move drirc handling up this can modify instance creation Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	820077acdb	zink: split surface creation more to allow disabling caching Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	e37f33cc8d	zink: add fail logging for drmPrimeFDToHandle Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	6998dbb778	zink: add VK_KHR_swapchain_mutable_format Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	1a029d38d8	zink: use two submits for every queue submit first one empty for now Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	4772543956	zink: change early returns in zink_blit to gotos Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	e2abf7d28d	zink: move blit src/dst decls up in function no changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	9811c39994	zink: move update_framebuffer_state() higher up in file no changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	7e0ff34010	zink: put screen param into flush queue global data Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	43f197acd7	zink: move flush queue init down a little further no functional changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Adam Jackson	d760a9151b	gallium: Learn about kopper Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Adam Jackson	bab8d97ea9	glx: Learn about kopper Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Adam Jackson	e1e2de800e	egl: Learn about kopper Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Adam Jackson	93aa5399f1	kopper: Define the driver interface The loader extension provides upcalls to get surface state (native resource and size) into the driver. The driver extension is called by a kopper-aware loader in preference to __DRI_SWRAST's createNewDrawable. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Adam Jackson	25c42abac8	meson: Define a HAVE_XXXX macro for every gallium driver we build Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Mike Blumenkrantz	3883ca5ea7	st/manager: update framebuffer size if texture has been resized zink/kopper can and does expect this when resizing swapchains, so don't ignore it Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14541>	2022-04-07 00:17:40 +00:00
Erik Faye-Lund	0998621496	clc/tests: use dxil_validator Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15751>	2022-04-07 00:00:45 +00:00
Erik Faye-Lund	09c2016d6b	dzn: use dxil_validator Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15751>	2022-04-07 00:00:45 +00:00
Erik Faye-Lund	0c5d772b71	microsoft/spirv_to_dxil: use dxil_validator This has one negative side-effect; we're no longer able to print validation errors without dxcompiler.dll. I doubt that's a real problem, but if it is, we should add this ability to dxil_validator instead of having a second implementation here. The reasons I didn't try adding this in the first place is: 1. This code seems a bit janky; it doesn't consult the "known"-variable to figure out if the encoding is OK, and it's lacking a fallback path in that case. 2. It seems unlikely that the compiler varies the encoding of the output in the first place; one of the two code-paths in here is probably untested. 3. Since dxil_validator leaves reporting to the call-site, we'd need to either add and output-encoding to the API (yuck), or re-encode the string to UTF-8 using WinAPI. Right now, it seems questionable if fixing all of the above is worth it. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15751>	2022-04-07 00:00:45 +00:00
Erik Faye-Lund	1e570962ef	d3d12: use dxil_validator Now that we have a shiny, new dxil validator interface, let's start using it in the D3D12 gallium driver. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15751>	2022-04-07 00:00:44 +00:00
Erik Faye-Lund	193cf76c09	microsoft/compiler: add common dxil-validator API This API is only available on Windows, which is the only OS where DXIL validation is a requirement, and where DXIL.dll (and dxcompiler.dll) are available. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15751>	2022-04-07 00:00:44 +00:00
Jason Ekstrand	e81f3edf76	iris: Allow userptr on 1D and 2D images Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15779>	2022-04-06 23:18:21 +00:00
Jason Ekstrand	5fd2f462fb	iris: Allow non-page-aligned userptr Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15779>	2022-04-06 23:18:21 +00:00
Jason Ekstrand	27697ac20c	iris: Take offsets into account when mapping resources Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15779>	2022-04-06 23:18:21 +00:00
Mike Blumenkrantz	f7ade1f188	zink: simplify shader i/o assignment by utilizing a separate slot map for patch variables, the entire i/o assignment mechanism can be simplified to accurately manage i/o for all types of variables and avoid location conflicts affects: KHR-Single-GL46.enhanced_layouts* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15770>	2022-04-06 23:05:36 +00:00
Mike Blumenkrantz	beb7a9cec5	zink: use local variable in consumer shader i/o assign to match producer usage no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15770>	2022-04-06 23:05:36 +00:00
Mike Blumenkrantz	91e0b919f4	zink: use local variable more consistently in producer shader i/o assign no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15770>	2022-04-06 23:05:36 +00:00
Mike Blumenkrantz	7d7d3b5f26	zink: prune shader i/o more aggressively fixes some radv validation spam on: KHR-Single-GL46.enhanced_layouts* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15770>	2022-04-06 23:05:36 +00:00
Mike Blumenkrantz	8269445ce5	zink: run shader optimize loop during initial create this is important for removing dead variables Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15770>	2022-04-06 23:05:36 +00:00
Mike Blumenkrantz	e96342c531	zink: rework missing feature warnings the previous methodology triggered warnings any time a rasterizer state was created with unsupported features without determining whether those features would actually be used a more optimal process is to check for missing features at pipeline creation, as all the necessary info is now available, and spurious warnings can be avoided Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15778>	2022-04-06 22:52:12 +00:00
Mike Blumenkrantz	da80beafb2	zink: fix warning text in missing feature macro Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15778>	2022-04-06 22:52:12 +00:00
Mike Blumenkrantz	a489b1d936	zink: add a param to warn_missing_feature() macro this lets the macro be used more programmatically since the variable is defined externally Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15778>	2022-04-06 22:52:12 +00:00
Mike Blumenkrantz	b6f99cf378	zink: switch warn_missing_feature to mesa_logw Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15778>	2022-04-06 22:52:12 +00:00
Emma Anholt	3cf28d16f6	ci: Uprev deqp-runner and piglit. deqp-runner uprevved to reduce memory usage on HW runners, let us experiment with shader cache on tmpfs, and hopefully provide a tool for virgl to be able to plausibly run piglit under crosvm instead of vtest. piglit uprevved to avoid a flake in softpipe in glx-multithread-texture, and improve performance of the test, too. This also brings in the fbo-blending-format-quirks fix to properly initialize the buffers, fixing some fails/flakes. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15419>	2022-04-06 20:43:53 +00:00
Emma Anholt	cd39523c53	ci/turnip: Drop xfails for create_list_modifiers. These were fixed in `5ce06f8474` ("turnip: Use correct type for OUTARRAY in FormatProperties2"), but they aren't included in the pre-merge CI run. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15419>	2022-04-06 20:43:52 +00:00
Jason Ekstrand	cde1be0b84	iris: Handle range tracking for global bindings The moment something is bound globally, the whole thing is valid. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15777>	2022-04-06 20:16:55 +00:00
Jason Ekstrand	187923c2eb	iris: Account for BO offsets in iris_set_global_binding() Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15777>	2022-04-06 20:16:55 +00:00
Danylo Piliaiev	25202b5861	ci/freedreno: Add fractional test of forced unaligned gmem store Unaligned gmem store is a mostly untested path since most of the times faster path is chosen. We have to force unaligned store to really test it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15773>	2022-04-06 19:53:27 +00:00
Lionel Landwerlin	56ef501e3a	blorp: disable depth bounds Otherwise the driver setting interacts with it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `939ddccb7a` ("anv: Add support for depth bounds testing.") Fixes: `1df871f8ff` ("iris: Add support for depth bounds testing.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15763>	2022-04-06 19:00:50 +00:00
Lionel Landwerlin	3069337144	anv: remove unused 3DSTATE_DEPTH_BOUNDS fields Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15763>	2022-04-06 19:00:50 +00:00
Jason Ekstrand	dbfb0b4989	lavapipe: Go back to manually signaling in lvp_AcquireNextImage2() When porting lavapipe to the common sync framework, I stole the dummy sync signal_for_memory idea from RADV but didn't actually do it correctly. Unless you set wsi_device::signal_semaphore_with_memory and wsi_device::signal_fence_with_memory, it doesn't actually signal anything. If you do set those, it works but also results in dummy syncs being created for present fences which we see as signals. We could choose to just skip those like RADV does but that's too magic. Instead, have our own AcquireNextImage2() again which sets dummy syncs. Fixes: `3b547a9b58` ("lavapipe: Switch to the common sync framework") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15774>	2022-04-06 18:30:51 +00:00
Danylo Piliaiev	a5a97f0b77	turnip: Fix subpassLoad from CUBE input attachments Cube descriptors require a different sampling instruction in shader, however we don't know whether image is a cube or not until the start of a renderpass. We have to patch the descriptor to make it compatible with how it is sampled in shader. For the reference subpassLoad is currently translated into isaml.a Blob v615 also doesn't handle this case correctly. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15734>	2022-04-06 19:42:30 +03:00
Erik Faye-Lund	837f781c9a	d3d12: fix return-code without dxcompiler.dll When we don't have a good dxcompiler.dll that we can load IDxcLibrary from to help with diagnostics, we currently return true for validation even if the validation actually failed. Let's fix that, and also add a debug-message explaining what went wrong for those who are debugging and wondering what's up. Fixes: `2ea15cd661` ("d3d12: introduce d3d12 gallium driver") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15744>	2022-04-06 14:56:39 +00:00
Cristian Ciocaltea	0e14b674f1	ci: Lock Intel GPU frequency for performance tests In order to ensure consistent results when running performance tests, lock the frequency for Intel GPUs to ~70% of the maximum allowed by hardware. This seems to offer a good balance between execution speed and results consistency. An increase of the frequency will also increase the rate of throttling events, with a negative impact on consistency. Such events are logged, as in the following example: GPU throttling detected: act=200 min=850 cur=850 RPn=100 This shows the actual GPU frequency (200 MHz) dropped below the minimum requested (850 MHz). For more details about the various frequency information sources, please see the script header comments in ".gitlab-ci/common/intel-gpu-freq.sh". Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15662>	2022-04-06 13:11:05 +00:00
Cristian Ciocaltea	a3dfbf1ec7	ci: Provide intel-gpu-freq.sh in LAVA and bare-metal rootfs The script will be used for tuning Intel GPU frequency to maximize performance tests execution, while also trying to reduce throttling, which has a negative impact on results consistency. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15662>	2022-04-06 13:11:05 +00:00
Cristian Ciocaltea	5d5af8bffb	ci: Add Intel GPU frequency utility Add script to manage Intel GPU frequencies. It can be used for debugging performance problems or to lock a stable frequency while executing benchmark tests. Typical use cases: - Get all available GPU frequency information $ ./intel-gpu-freq.sh -g all * Hardware capabilities RP0: 1350 MHz RPn: 100 MHz RP1: 400 MHz * Enforcements max: 1350 MHz min: 100 MHz boost: 1350 MHz * Actual act: 100 MHz cur: 400 MHz - Lock frequency to 80% of the maximum allowed by hardware and enable throttling detection $ ./intel-gpu-freq.sh -s 80% -d GPU throttling detected: act=1050 min=1350 cur=1350 RPn=100 GPU throttling detected: act=1100 min=1350 cur=1350 RPn=100 Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15662>	2022-04-06 13:11:05 +00:00
Lionel Landwerlin	88f77aa811	anv: disable preemption on 3DPRIMITIVE on gfx12 To workaround a push constant corruption issue. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5963 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5662 Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15753>	2022-04-06 12:51:15 +00:00
Vadym Shovkoplias	04a6693871	anv: fix EXT_depth_clip_control This fixes arb_clip_control-clip-control and depth_clamp piglit tests on zink. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6186 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15561>	2022-04-06 13:26:52 +03:00
Danylo Piliaiev	6c18602164	turnip: Add "unaligned_store" debug option to better test gmem stores Unaligned store is incredibly rare in CTS, we have to force it to actually test it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Danylo Piliaiev	e255305e84	turnip: Ignore aspectMask for D32S8 framebuffer attachment Vulkan spec says: "When an image view of a depth/stencil image is used as a depth/stencil framebuffer attachment, the aspectMask is ignored and both depth and stencil image subresources are used." Since we use two planes for D32S8 format we have to add a special case for depth in addition to already existing case for stencil. Fixes hang in CTS: dEQP-VK.renderpass.depth_stencil_write_conditions.stencil_kill_write_d32sf_s8ui Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Danylo Piliaiev	72716993b2	turnip: Correctly store separate stencil in gmem store - When resolving d32s8 to s8 we stored stencil with a wrong format. - For unaligned multi-sample store we used wrong gmem offset for stencil. If unaligined store is forced this change fixes a hang in: dEQP-VK.renderpass2.depth_stencil_resolve.image_2d_32_32.samples_2.d32_sfloat_s8_uint_separate_layouts.compatibility_depth_zero_stencil_zero_testing_stencil Fixes: `b157a5d0d6` ("tu: Implement non-aligned multisample GMEM STORE_OP_STORE") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15532>	2022-04-06 08:44:28 +00:00
Samuel Pitoiset	045c96d896	radv: enable VK_KHR_pipeline_library This has been initially implemented for raytracing but VK_EXT_graphics_pipeline_library requires it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15737>	2022-04-06 06:35:56 +00:00
Mike Blumenkrantz	eb8cde0d93	zink: use GENERAL layout for mixed zs fb attachments this interaction requires read-only sampler access from depth component with writes to the stencil component, which can only be done in the GENERAL layout affects: GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_color_and_stencil_blit Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	fb2a08bb01	zink: update samplerview layouts for zs attachments during renderpass prep this interaction might require layout changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	c132a28745	zink: use store op NONE when necessary for depth usage Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	eee8db250d	zink: delete some code in get_layout_for_binding() should be no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	ef99ceaec2	zink: add a ctx param to zink_descriptor_util_image_layout_eval Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	7781a75229	zink: add a renderpass flag for mixed zs layout this indicates that the layout requires reading from the depth aspect and writing to the stencil aspect Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	f24b966d27	zink: further simplify zs case for zink_descriptor_util_image_layout_eval Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	fd9a5fed16	zink: remove commented code probably never actually going to do this since it serves no purpose Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	a57b3bb82a	zink: refactor zink_descriptor_util_image_layout_eval slightly more readable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	72a63649c2	zink: use EXT_image_2d_view_of_3d fixes #4562 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	f05d0f1238	zink: unset resource layout+access when doing storage setup the previous access info is transferred to the staging resource for the copy, and the new image has no access info cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	65394fcaef	zink: prune shader i/o to avoid validation spam when variables are declared but never used, eliminate persisting i/o variables which are never used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	c5f44e51fb	zink: fix max geometry input component advertising forgot to divide by 4 somehow cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	5ca6b28c58	zink: convert all 64bit vertex attribs to 32bit this applies not only to dvec3/dvec4, but to any type of 64bit input since that's what gallium gives us fixes #6211 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	7cf7e113ba	zink: apply fb attachment layout to dummy attachments this doesn't actually matter but it still triggers validation spam Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	e72ca7e7ac	zink: clamp min viewport width to 1 fixes validation spam Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	d4e91d0baa	zink: handle 1bit xor as OpLogicalNotEqual fixes more validation spam Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:49:00 +00:00
Mike Blumenkrantz	d1f52d300d	zink: set Geometry capability for fs if geometry inputs are read super legal. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:48:59 +00:00
Mike Blumenkrantz	3f918bbd23	zink: always set stencil dynamic states before draw these are dynamic states set on the pipeline, so they must always be set on a cmdbuf before draw fixes some validation spam Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:48:59 +00:00
Mike Blumenkrantz	2ccb24757a	zink: merge stencil test case for draw-time dynamic state these cases were identical but using different conditionals Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:48:59 +00:00
Mike Blumenkrantz	b76ad3efa0	zink: only uncommit sparse pages that have been committed avoid spamming drivers unnecessary with submits since this is expensive Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15716>	2022-04-06 04:48:59 +00:00
Mike Blumenkrantz	3dcb80da9d	zink: fix barrier generation for ssbo descriptors ssbo binds are always at least readable, so without deeper shader inspection, never assume that write binds mean no read access this is different than image descriptors which can explicitly get write-only access set cc: mesa-stable fixes #6231 THANKS TO @daniel-schuermann FOR SOLVING THIS WITH HIS INCREDIBLE TALENT AND WIT Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15697>	2022-04-05 22:47:39 -04:00
Renato Pereyra	5ec4995305	Revert "venus: Increase the base sleep of vn_relax" This reverts commit `737937f45e`. Testing has revealed sizable performance drops arising out of this change. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15760>	2022-04-06 01:46:10 +00:00
Jason Ekstrand	cc78a3a820	panvk: Enable VK_EXT_debug_report and VK_EXT_debug_utils They're both implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Jason Ekstrand	29b8097408	anv: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Jason Ekstrand	77ffa61a14	lavapipe: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Jason Ekstrand	bdf52654ac	turnip: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Jason Ekstrand	292ceb297c	v3dv: Enable VK_EXT_debug_utils It's implemented in common code as long as you use vk_command_buffer. Acked-by: Emma Anholt <emma@anholt.net> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15560>	2022-04-06 01:18:23 +00:00
Jason Ekstrand	3b547a9b58	lavapipe: Switch to the common sync framework The common Vulkan sync framework will do most of the queueing for us. It will even sort out timeline semaphore dependencies and ensure everything executes in-order. All we have to do is make sure our vk_sync type implements VK_SYNC_FEATURE_WAIT_PENDING. This lets us get rid of a big pile of code. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15651>	2022-04-06 00:38:22 +00:00
Jason Ekstrand	2f6bca6a74	vulkan: Use timespec_add_nsec in vk_sync_timeline Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15651>	2022-04-06 00:38:22 +00:00
Jason Ekstrand	0ba22b203d	util/timespec: Return overflow from timespec_add_[mn]sec() To avoid altering any currently existing callers, we continue on with the calculation regardless of overflow. This also matches the behavior of GCC's __builtin_add_overflow(). Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15651>	2022-04-06 00:38:22 +00:00
Emma Anholt	591899eedd	gallivm/nir: Add a short circuit uniform-offset mode for load_ssbo/load_shared. dEQP-VK.binding_model.buffer_device_address.set3.depth3.basessbo.convertuvec2.nostore.multi.scalar.vert runtime -24.4002% +/- 1.94375% (n=7). The win (I think) is in LLVM not having to chew through handling the extra loops on every constant-offset SSBO load, not in actual rendering time. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	181f25aff4	gallivm/nir: Add a short circuit uniform-offset mode for load_global. If we know the offset is constant, we don't have ask LLVM to loop over the elements pulling the same value out over and over. This doesn't seem to have produced a win in the testcase I was looking at, but it was an easier entrypoint to figuring out how to do scalar memory access than load_memory, and will probably affect some workload. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	d74606d440	gallivm/nir: Refactor out some repeated code to generate 0 values. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	4fad4c1d79	gallivm/nir: Refactor out some repeated logic for SSBO/shared access. I needed to be able to get these pointers/limits from another location, and missing some of the repeated steps was giving me bugs. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	21b3db7d17	gallivm/nir: Pull some repeated exec_mask computation out of loops. If the exec mask hasn't changed, don't hassle LLVM to set it up Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	9ab4ecb1ae	gallivm/nir: Don't do uniform-and-broadcast access on inactive invocations. In a fragment shader or inside of control flow, invocation 0 might be inactive, and so our use-first-invocation-and-broadcast optimizations would be invalid, and the loop logic of an emit_read_invocation would defeat the point of these optimized paths. For load_kernel_input, I didn't guard the uniform path with the check because some CL tests that are currently passing are doing the load_kernel_input under (presumably) uniform control flow. Instead, I dropped in a once warning for the next person to be debugging CL. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	04cdd41fdc	util/log: Add support for logging once. These are not data-race safe (like many other once patterns in Mesa), so they might not log exactly once, but it should be good enough for not spamming the console. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14999>	2022-04-06 00:04:14 +00:00
Emma Anholt	ad0fcaf1ee	ci/lava: Simplify passthrough of the request to upload results/ to minio. We already have a way to pass env vars around, just use that instead of packing/unpacking it on the kernel command line. Cleans up HW runner job log output some more. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332>	2022-04-05 21:37:46 +00:00
Emma Anholt	34278e8f2e	ci/deqp: Move the set +e just before the deqp-runner invocation. You don't want to proceed to running deqp-runner if you failed at the vtest or runner options environment setup. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332>	2022-04-05 21:37:46 +00:00
Emma Anholt	82cd8614e6	ci/deqp: Add gitlab-ci sections to deqp-runner.sh. This should help highlight the actual test results, as opposed to the setup and teardown. Also tuned the "set -x"es a little bit so we get less surrounding noise in the echo process. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332>	2022-04-05 21:37:46 +00:00
Emma Anholt	03549f3bf3	spirv: Silence "Decoration not allowed on struct members: SpvDecorationRestrict" VK-GL-CTS causes tons of these due to a bug in glslang, to the point where it's hard to find actual issues in test logs. Disable the warning for now, with a link to the issue we're waiting on being resolved. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332>	2022-04-05 21:37:46 +00:00
Emma Anholt	0ee0d54985	util/log: Don't print an extra \n if the format string had one. You're not supposed to include a '\n' in mesa_log*() messages because android logging will log what you provide on its own line anyway, so each mesa_log() should be the body of a log line. But also, getting everyone to consistently not do that is hopeless because we're all so trained by printf(). So, just detect an existing \n and don't add a new one. Cleans up deqp-vk debug output a bunch from turnip. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15332>	2022-04-05 21:37:46 +00:00
Mike Blumenkrantz	e0b431da33	docs: update features for VK_EXT_image_2d_view_of_3d Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15757>	2022-04-05 17:08:02 -04:00
Mike Blumenkrantz	6fd344ff98	anv: expose VK_EXT_image_2d_view_of_3d sampling only available on gen9+ Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15754>	2022-04-05 20:30:31 +00:00
Mike Blumenkrantz	8b5c9e7c81	lavapipe: expose VK_EXT_image_2d_view_of_3d Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15754>	2022-04-05 20:30:31 +00:00
Mike Blumenkrantz	9a6ea51388	vulkan: check 3D image type for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15754>	2022-04-05 20:30:31 +00:00
Connor Abbott	b91b90c256	tu: Expose VK_KHR_maintenance4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	5eb63d825f	tu: Remove tu_pipeline::layout This makes it more obvious that the layout is never used after creating the pipeline, which is required by VK_KHR_maintenance4. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	7455a7a44c	tu: Fill out maxBufferSize It seems this is really a workaround for silly issues in GetBufferMemoryRequirements when you ask for a really large buffer. Just expose the maximum possible size ATM. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Connor Abbott	d1762b7df0	tu: Implement GetDevice*MemoryRequirements() Based mostly on anv, which is a bit more optimized than radv - we at allocate the image on the stack. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15488>	2022-04-05 17:46:35 +00:00
Erik Faye-Lund	c42da6dd60	ci: do not specify c_std and cpp_std for windows-build When parts of the tree needs later c and c++ versions, they should ask for it in the build-system itself, not expect the user to ask for it on the command-line instead. So let's not paper over things by specifying them here. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15706>	2022-04-05 16:58:56 +00:00
Erik Faye-Lund	f607db2689	dozen: require c++20 for designated initializers We do require C++20 still, because designated initializers is part of that standard. This is almost a revert, but conditionally selecting between c++latest or c++20 when available, as that's what we really want. Fixes: `55ca1c8db3` ("vulkan/microsoft: Remove `override_options: ['cpp_std=c++latest']` option for visual studio") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15706>	2022-04-05 16:58:56 +00:00
Erik Faye-Lund	ed399a179e	nir/tests: do not use designated initializers in c++ code Designated initializers require C++20, which is a bit easier said than done to support well across meson versions. Let's avoid using them for now instead. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15706>	2022-04-05 16:58:56 +00:00
Erik Faye-Lund	28dbabec8e	aco: do not use designated initializers Designated initializers are a C++20 feature, but we don't use C++20. In fact, enabling C++20 for ACO triggers new compiler errors due to some equality semantics details. So let's instead stop using designated initializers here. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15706>	2022-04-05 16:58:56 +00:00
Thong Thai	dcd81d2d80	frontends/va: fix decode issues introduced by efc change Fixes: `9602526568` ("frontends/va: add encoder format conversion (EFC) support") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6153 Signed-off-by: Thong Thai <thong.thai@amd.com> Tested-by: Andrew Falcon <bluestang2006@gmail.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15717>	2022-04-05 16:40:08 +00:00
Konstantin Seurer	dacd78fd5a	radv: Remove radv_util.c Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15725>	2022-04-05 15:53:46 +00:00
Mike Blumenkrantz	b591409b6c	vulkan: spec update to 1.3.211 Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15752>	2022-04-05 12:55:21 +00:00
Roman Stratiienko	61f94fff0d	panfrost: Don't crash on panfrost_bo_create() with size==0 invocation 1. Clamp bucket_index from both ends to avoid returning negative index. 2. Return NULL in case BO allocation/fetching failure to prevent invalid bo mapping. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6247 Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15748>	2022-04-05 13:08:51 +03:00
Samuel Pitoiset	576833507b	radv: only declare dynamic states that are used by internal operations Initialize some default static PSO states instead. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15729>	2022-04-05 07:54:52 +00:00
Samuel Pitoiset	edc09beccc	radv: use radv_dynamic_state for saving/restoring meta operations Instead of duplicating every fields. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15729>	2022-04-05 07:54:52 +00:00
Samuel Pitoiset	3fa3d81172	radv: save/restore more dynamic states during internal driver operations This doesn't fix anything known but it could happen in theory. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15729>	2022-04-05 07:54:52 +00:00
Samuel Pitoiset	ebf4f66c6a	radv/ci: update CI lists against CTS 1.3.1.1 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15745>	2022-04-05 09:13:41 +02:00
Igor Torrente	cc8e271813	venus: add VK_EXT_{conditional_rendering,index_type_uint8} extensions Implements all the necessary code in the device initialization and extensions functions. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15652>	2022-04-05 05:10:26 +00:00
Igor Torrente	bfab83ab4b	venus: Update venus-protocol to add two new extensions These are the changes automatically generated from the venus-protocol repository. Update the file to add `VK_EXT_index_type_uint8` and `VK_EXT_conditional_rendering` Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15652>	2022-04-05 05:10:26 +00:00
Yiwei Zhang	801cdd83f1	venus: workaround an ANGLE assumption on FORMAT_IMPLEMENTATION_DEFINED ANGLE expects VK_FORMAT_UNDEFINED to be returned for such AHB prop query. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15721>	2022-04-05 05:03:39 +00:00
Omar Akkila	4208895175	ci: bump VK-GL-CTS to 1.3.1.1 Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15668>	2022-04-04 23:04:33 +00:00
Jason Ekstrand	94ce812497	anv: Advertise two more formats These both require swizzling so border colors won't work. However, they're conveniently in the list of formats for which custom border colors require you to specify a format in the sampler. That list constists of: - VK_FORMAT_B4G4R4A4_UNORM_PACK16 - VK_FORMAT_B5G6R5_UNORM_PACK16 - VK_FORMAT_B5G5R5A1_UNORM_PACK16 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6226 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Jason Ekstrand	e32b9e5c3f	anv: Generalize border color swizzles Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Jason Ekstrand	54509d27d9	anv: Disallow blending on swizzled formats Fixes: `c20f78dc5d` ("anv: Support swizzled formats.") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Jason Ekstrand	257a20f40d	intel/isl: Add a helper for swizzling color values Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15624>	2022-04-04 21:42:23 +00:00
Benjamin Cheng	4489933842	vulkan/queue: Destroy wait temps if they are skipped Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6223 Fixes: `8a11d2a31b` ("vulkan: Add a dummy sync type") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Tested-by: Jakob Bornecrantz <jakob@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15727>	2022-04-04 20:52:46 +00:00
Rhys Perry	5b4e41e4db	aco: don't use v_mad_mix on GFX9 if 16-bit denormals must be preserved This probably effectively disables the v_mad_mix optimization on GFX9. fossil-db (Vega): Totals from 11545 (7.15% of 161366) affected shaders: MaxWaves: 43025 -> 42780 (-0.57%); split: +0.06%, -0.63% Instrs: 18571635 -> 18734201 (+0.88%); split: -0.00%, +0.88% CodeSize: 96483568 -> 96611012 (+0.13%); split: -0.11%, +0.24% SGPRs: 1079056 -> 1077616 (-0.13%); split: -0.14%, +0.01% VGPRs: 819248 -> 821868 (+0.32%); split: -0.04%, +0.36% SpillSGPRs: 13313 -> 12464 (-6.38%) Latency: 293804093 -> 295046122 (+0.42%); split: -0.09%, +0.51% InvThroughput: 110002239 -> 110994978 (+0.90%); split: -0.03%, +0.93% VClause: 342458 -> 342596 (+0.04%); split: -0.12%, +0.16% SClause: 648566 -> 648046 (-0.08%); split: -0.12%, +0.04% Copies: 1728225 -> 1726679 (-0.09%); split: -0.66%, +0.57% Branches: 552973 -> 552963 (-0.00%); split: -0.02%, +0.02% PreSGPRs: 862360 -> 856820 (-0.64%); split: -0.69%, +0.05% PreVGPRs: 773689 -> 776818 (+0.40%); split: -0.02%, +0.42% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6178 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15718>	2022-04-04 19:27:12 +00:00
Francisco Jerez	03cf788891	iris: Replace unconditional QBO flush with iris_dirty_for_history(). We can now use the same cache tracking mechanism for synchronizing QBO writes instead of the unconditional PIPE_CONTROL performed currently, which is unable to invalidate any incoherent caches which may contain stale data for the buffer object. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15738>	2022-04-04 10:32:31 -07:00
Francisco Jerez	6cc09699cd	iris: Remove remaining history flushes. This removes a couple of remaining history flushes which were open-coded instead of using the iris_flush_and_dirty_for_history() helper. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15738>	2022-04-04 10:32:31 -07:00
Francisco Jerez	bbb103be0e	iris: Demote all callers of iris_flush_and_dirty_for_history() to iris_dirty_for_history(). The unconditional flushing performed by iris_flush_and_dirty_for_history() is now redundant with the memory barriers introduced previously in this series, which should be in a better position to determine from which domain the buffer will actually be used in the future, and whether an additional flush or invalidation is required or redundant with other PIPE_CONTROL commands emitted elsewhere. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15738>	2022-04-04 10:32:31 -07:00
Mike Blumenkrantz	bbe15d99e2	aux/trace: dump format in set_shader_images Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15323>	2022-04-04 13:16:16 +00:00
Tomeu Vizoso	d948f32365	ci/freedreno: Reduce concurrency when replaying traces on a630 We are running out of memory when replaying traces sometimes, reduce the number of concurrent retrace processes. Mesa: User error: GL_OUT_OF_MEMORY in glReadPixels warning: GL_OUT_OF_MEMORY while getting snapshot 1074335: warning: failed to get snapshot https://gitlab.freedesktop.org/mesa/mesa/-/jobs/20519522 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15682>	2022-04-04 12:48:40 +00:00
Juan A. Suarez Romero	689f9d2a5b	v3d: fix some leaks in cache Fix a couple of leaks introduced when adding support for on-disk shader cache. Fixes: `4468db20f7` ("v3d: add support for on-disk shader cache") Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15733>	2022-04-04 10:49:59 +00:00
Iago Toral Quiroga	827ef5fba9	v3dv: fix limits for inline uniform blocks We don't support 'Update After Bind', however, the limits for this model also include the ones without it. See the with or without remark in the spec below: "maxPerStageDescriptorUpdateAfterBindInlineUniformBlocks is similar to maxPerStageDescriptorInlineUniformBlocks but counts descriptor bindings from descriptor sets created with or without the VK_DESCRIPTOR_SET_LAYOUT_CREATE_UPDATE_AFTER_BIND_POOL_BIT bit set." Fixes: dEQP-VK.api.info.vulkan1p2_limits_validation.ext_inline_uniform_block Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15732>	2022-04-04 09:28:55 +00:00
Tomeu Vizoso	51ab4ef4be	Revert "ci/panfrost: Disable some jobs due to a lab failure" Machines are back. This reverts commit `b5fd1fddd9`. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15730>	2022-04-04 08:59:38 +02:00
Samuel Pitoiset	c439735a91	radv: save/restore the stencil reference during internal driver operations I think I should improve this to be more robust. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6243 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15700>	2022-04-04 06:31:11 +00:00
Samuel Pitoiset	41ece97afb	radv: fix cleaning the image view for CmdCopyImageToBuffer() Fixes: `f07e67272e` ("radv: fix vk_object_base_init/finish for internal image views") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15707>	2022-04-04 06:12:12 +00:00
Karmjit Mahil	e5439bf4aa	pvr: Add stricter type checking in pvr_csb_pack(). Since the packing functions generated by csbgen use a void pointer for the buffer in which to pack, it's possible to easily write out of bounds. This commits attempts to reduce the chances by having the pack macro check that the pointer passed points to an element sized equally to the state word being packed. Catching these errors earlier. As can be seen in this commit, there already was a case of this: "pds_ctrl". The word size is meant to be 64 bits but the pointer was pointing to a 32 bit field. Although it's fine for the word size to be smaller than the storage pointed to by the pointer, this is not allowed just to be extra careful. Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15687>	2022-04-04 03:40:48 +00:00
Konstantin Seurer	c4650cbdb0	radv: Replace magic constants with enum values Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15722>	2022-04-03 12:43:00 +00:00
Konstantin Seurer	c8fe408fcc	radv: Advertise ray primitive culling Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15722>	2022-04-03 12:43:00 +00:00
Konstantin Seurer	9f55888996	radv: Fully implement ray primitive culling Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15722>	2022-04-03 12:43:00 +00:00
Emma Anholt	e1de9b0de5	turnip: Allow image access on swapped formats. This is apparently something that gamescope would like to have, and the CTS's test coverage is happy with it. Fixes: #6011 (we hope) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	4cd51efedb	turnip: Disable tiling on 1D images. If we know the height is 1, then it would be a waste to align each miplevel to tile height. For non-mipmapped textures, it doesn't save us memory (since you still align to 4 on the last miplevel), but it should be better cache locality by not loading those unused lines. Incidentally, this gets us some more coverage of swap != WZYX cases in CTS tests, which often use optimal tiling without also testing linear. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	71fcb751eb	freedreno/a6xx: Set the color_swap field for storage descriptors. This field does appear to work as expected: with 1D/1DArray turnip storage images switched to be always linear, it fixes the dEQP-VK.image.store tests using a color swapped format (once we allow color swap). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	51b04a7dfb	turnip: Add support for VK_KHR_format_feature_flags2. This reports all of our storage formats as supporting read/write without format, since we don't have any in-shader format conversions. Similarly, shadow comparisons were already supported on all the depth formats. This extension is required for VK 1.3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15293>	2022-04-02 19:55:40 +00:00
Emma Anholt	44aff2beec	nir_to_tgsi: Add support for nir_intrinsic_image_samples. Found in 1 piglit test on r600. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15627>	2022-04-02 15:17:01 +00:00
Erico Nunes	a570e6c243	lima/ci: enable piglit in lima CI There are spare boards for it and the results appear stable. Dividing the load in 2 parallel runs results in a run time of around 10 minutes each. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15598>	2022-04-02 14:40:06 +00:00
Erico Nunes	148f8bc67f	lima/ci: enable CI again Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15598>	2022-04-02 14:40:06 +00:00
Erico Nunes	6f6fd1cf75	lima/ci: update deqp results Unfortunately some regressions sneaked in while CI was down. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15598>	2022-04-02 14:40:06 +00:00
Danylo Piliaiev	5ce06f8474	turnip: Use correct type for OUTARRAY in FormatProperties2 Fixes: `799a9db24c` ("turnip: Stop using VK_OUTARRAY_MAKE()") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15694>	2022-04-02 09:51:45 +00:00
Vinod Koul	28ae397be1	freedreno/registers: update dsi registers to support dsc Display Stream compression (DSC) compresses the display stream in host which is later decoded by panel. This requires addition of 3 new DSI registers to support DSC over DSI. Signed-off-by: Vinod Koul <vkoul@kernel.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14967>	2022-04-01 21:56:40 +00:00
Pavel Ondračka	b672814fe5	r300: don't assume position is always OUT[0] in rc_copy_output Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15677>	2022-04-01 21:18:42 +00:00
Pavel Ondračka	fb233ac411	r300: set PVS_XYZW_VALID_INST properly to last position write This is a potential performance optimization, from the docs: The PVS Instruction which updates the clip coordinatea position for the last time. This value is used to lower the processing priority while trivial clip and back-face culling decisions are made. This field must be set to valid instruction. Right now it is set at the last instruction. However, there is no measurable performance effect in either Unigine Sanctuary, Lightsmark or GLmark. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6045 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip.gawin@zoho.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15677>	2022-04-01 21:18:42 +00:00
Bas Nieuwenhuizen	51757d2249	radv: Add more BVH vertex formats. AFAIU this is needed for DXR 1.1 with vkd3d-proton. Reviewed-by: Samuel Pitoiset<samuel.pitoiset@gmail.com> Reviewed-by: Konstantin Seurer <konstantin.seurer@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15475>	2022-04-01 21:00:20 +00:00
Corentin Noël	23f5e2edbd	nir_to_tgsi: Handle blocks defined as arrays of arrays Make sure to take all the array sizes into account when generating the TGSI. Makes the `piglit.spec@arb_arrays_of_arrays@execution@ubo@fs-const-explicit-binding` test pass Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15713>	2022-04-01 15:41:36 +00:00
Tomeu Vizoso	b5fd1fddd9	ci/panfrost: Disable some jobs due to a lab failure A dispatcher has had a hard disk failure and these devices are offline now. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15711>	2022-04-01 14:37:03 +00:00
Daniel Schürmann	6d383159d4	aco/optimizer: check recursively if we can eliminate s_and exec Totals from 2860 (2.12% of 134913) affected shaders: (GFX10.3) CodeSize: 5990728 -> 5979164 (-0.19%); split: -0.20%, +0.01% Instrs: 1094562 -> 1091653 (-0.27%); split: -0.28%, +0.01% Latency: 8689841 -> 8684523 (-0.06%); split: -0.07%, +0.00% InvThroughput: 1840533 -> 1840527 (-0.00%); split: -0.00%, +0.00% SClause: 51437 -> 51439 (+0.00%) Copies: 82461 -> 82472 (+0.01%) PreSGPRs: 83136 -> 83172 (+0.04%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15675>	2022-04-01 15:35:26 +02:00
Gert Wollny	492401c054	virgl: Don't support QUADS natively Using quads leads to a rather expensive buffer readback when quads are stored in display lists, so avoid them altogether. Closes: #5825 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15683>	2022-04-01 11:02:12 +00:00
Iago Toral Quiroga	597560e27c	broadcom/compiler: always enable per-quad on spill operations This ensures that any channels used for helper invocations are also spilled/filled correctly. Alternatively, we could recursively track all temps that get involved in computing values that are then used in explicit (dfdx,dfdy) or implicit (texture coordinates for mipmap or anisotropic filtering, etc) derivatives, and only enable per-quad on these (or disable spilling of any of these values). Fixes: dEQP-VK.graphicsfuzz.cov-dfdx-dfdy-after-nested-loops Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15705>	2022-04-01 08:53:50 +00:00
Samuel Pitoiset	e6d7a6a3b7	radv: enable VK_EXT_separate_stencil_usage This extension has been promoted to Vulkan 1.2 which means it has been silently enabled when we implemented Vulkan 1.2. Enable it explicitely to make mesamatrix happy and also for consistency. This extension was designed for potential performance improvements of MSAA depth/stencil images but it's currently a no-op in RADV. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15665>	2022-04-01 07:02:28 +00:00
Samuel Pitoiset	7c14671535	radv: use the common vk_framebuffer Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:59 +02:00
Samuel Pitoiset	8396a0d1fd	radv: remove now unused radv_cmd_buffer_{begin,end}_render_pass() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:59 +02:00
Samuel Pitoiset	be28b566b0	radv: convert the meta clear path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:59 +02:00
Samuel Pitoiset	67c2a6adc6	radv: convert the meta blit path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:59 +02:00
Samuel Pitoiset	e6fac090e6	radv: convert the meta resolve HW path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:53 +02:00
Samuel Pitoiset	b18afccb61	radv: convert the meta resolve depth/stencil FS path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:53 +02:00
Samuel Pitoiset	518c6d808e	radv: convert the meta resolve color FS path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:53 +02:00
Samuel Pitoiset	42db590006	radv: convert the meta blit 2d path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:53 +02:00
Samuel Pitoiset	fc3125ed6c	radv: convert the meta fast clear flush path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:52 +02:00
Samuel Pitoiset	70e663aa21	radv: convert the meta depth decompression path to dynamic rendering Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:52 +02:00
Samuel Pitoiset	9309c3d887	radv: rework the workaround that disables DCC for incompatible copies Rely on the image view to avoid using an extra structure for the render pass. This will allow to convert the meta operations to dynamic rendering. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15612>	2022-04-01 08:21:52 +02:00
Yonggang Luo	be1b30393b	util: Getting u_debug.h not depends on pipe/* Move pipe_debug_type into u_debug.h Move pipe_debug_callback into u_debug.h Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Yonggang Luo	b2ece67f11	util: Rename PIPE_DEBUG_TYPE to UTIL_DEBUG_TYPE Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Yonggang Luo	ab225a1e36	util: Rename pipe_debug_type to util_debug_type Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Yonggang Luo	dca7ea4a12	pipe: place `struct util_debug_callback` at the proper place in p_context.h Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Yonggang Luo	2ca6ef22f7	util: Rename pipe_debug_callback to util_debug_callback Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Yonggang Luo	523675e995	util: Rename pipe_debug_message to util_debug_message Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15657>	2022-04-01 01:52:43 +00:00
Mike Blumenkrantz	240cd8088c	lavapipe: set LVP_POISON_MEMORY for ci Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15620>	2022-04-01 01:14:03 +00:00
Mike Blumenkrantz	2d8d7c0638	zink: set LVP_POISON_MEMORY for ci Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15620>	2022-04-01 01:14:03 +00:00
Mike Blumenkrantz	0be484ece4	lavapipe: add an env var to enable poisoning memory allocations it's not random, but it's definitely not zero or any other expected value, which means it's going to break anything that was passing due to lucky uninitialized values Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15620>	2022-04-01 01:14:03 +00:00
Dave Airlie	dcadeb9a47	llvmpipe: fix nr_sampler_view in key creation. This was doing MAX2() but nr_sampler_views hasn't been initialised yet. Fixes: `690cc3bb80` ("llvmpipe: overhaul fs/cs variant keys to be simpler.") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15603>	2022-04-01 09:52:43 +10:00
Dave Airlie	57dd05616f	zink/query: rewrite the query handling code to pass validation. The current code was allowing multiple query pools of the same type to be active on the same commmand buffer which creates validation warnings. Restructure the query handling, so the query pools are allocated on the context on first use, then have queries allocate query_id from those pools. This approach creates a new start dynamic array that contains each time a vulkan query is started for a gallium query, and holds the flags for it rather than storing them in a massive array. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	b96ddc37bc	zink/query: only reset the range of queries in use. This reduces the amount of resetting needed. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	7f3a959d2f	zink/query: refactor get_query_result to map upfront. This maps upfront and processes the results in a clearer way. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	f090cac54a	zink/query: use a single query pool for XFB queries. There is no need to use a separate pool for each AFAIK. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	0d14c491de	zink: refactor out number of vk queries per gallium query helper Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	ce28974d06	zink/query: collapse the xfb_query_pool array into the normal one. This makes things a bit easier to follow. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	7303ba3a67	zink/query: consolidate xfb_buffers into one array. This makes the code more complex than needed, just use on buffers array. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15702>	2022-03-31 23:42:03 +00:00
Dave Airlie	6bbbe15a78	Reinstate: llvmpipe: allow vertex processing and fragment processing in parallel This reinstates patch: `ec8104c6b2` llvmpipe: allow vertex processing and fragment processing in parallel For now unknown reasons bisecting an error had lead to this patch, but the actual bug was in virglrendrerer and the accoring feature was now only enabled with https://gitlab.freedesktop.org/mesa/mesa/-/issues/6130 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6130 Original patch was Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15684>	2022-03-31 23:24:56 +00:00
Georg Lehmann	80a7ed273a	radv: Enable global bo list if 1.2 features are used. These features require the global bo list and the existing code only checked if the extensions which were promoted to 1.2 are enabled. Found by inspection. Cc: mesa-stable Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15701>	2022-03-31 20:35:57 +00:00
Ian Romanick	7fd1955412	nir: intel/compiler: Lower TXD on array surfaces on DG2+ DG2 can only do sample_d and sample_d_c on 1D and 2D surfaces. Cube maps and 3D surfaces were already handled, but 1D array and 2D array surfaces were not. Fixes the following Vulkan CTS failures on DG2: dEQP-VK.glsl.texture_functions.texturegradclamp.isampler1darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.isampler2darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1darray_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler1darray_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2darray_fixed_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.sampler2darray_float_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler1darray_fragment dEQP-VK.glsl.texture_functions.texturegradclamp.usampler2darray_fragment The Fixes: tag below is a bit misleading. This commit adds another lowering, similar to the one in the Fixes: commit, that probably should have been added at the same time. I just want to make sure this commit gets applied everywhere that commit was also applied. Fixes: `635ed58e52` ("intel/compiler: Lower txd for 3D samplers on XeHP.") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15681>	2022-03-31 12:59:18 -07:00
Chad Versace	3f8224baee	intel/tools: Fix build without drivers If Meson was configured with -Dtools=intel but all Intel drivers were disabled, then Meson silently refused to build the tools. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15617>	2022-03-31 18:20:01 +00:00
Rajnesh Kanwal	d5405c1608	vulkan: Move common format function to vulkan/util/vk_format.h Moving duplicate vk_format helper functions to common vulkan/util/vk_format.h and also renaming vk_format_get_component_size_in_bits to match how amd and freedreno name the same function. Not moving this function to common code as freedreno's implementation is a bit different. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15696>	2022-03-31 17:18:22 +00:00
Rajnesh Kanwal	9f6571e80a	amd: Use common u_format.h implementation for vk_format_get_component_bits. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15696>	2022-03-31 17:18:22 +00:00
Karmjit Mahil	0d25db406b	pvr: Fix seg fault in vkAllocateDescriptorSets(). In cases when no immutable samplers were present but sampler descriptor set layout bindings were, a seg fault was being caused by an attempt to get the immutable sampler from within the immutable sampler array while the array was not allocated. This commit also remove the binding type check since only those specific types can have immutable samplers. The check is covered by the descriptor set layout creation and assignment of has_immutable_samplers. This commit also makes the immutable samplers const, since they're meant to be immutable. This commit also adds has_immutable_samplers field to descriptor set layout. Previously to check whether immutable samplers were present or not you'd check for the layout binding's descriptor count to be 0 and the immutable sampler offset to be 0. This doesn't tell you everything. If you have a descriptor of the appropriate type (VK_DESCRIPTOR_TYPE_{COMBINED_,}IMAGE_SAMPLER). So descriptor count of >1. The offset can be 0 if you don't have immutable sampler, or have them at the beginning of the immutable samplers array. So you can't determine if you really have immutable samplers or not. One could attempt to perform a NULL check on the array but this would not work in cases where you have following set layout bindings with immutable samplers as the array stores the whole layout's immutable samplers. Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Reviewed-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15688>	2022-03-31 17:12:21 +00:00
Frank Binns	93ee0c7911	pvr: fix clang unused function warning ../src/imagination/vulkan/pds/pvr_pds.c:230:31: warning: unused function 'pvr_pds_encode_doutv' [-Wunused-function] static ALWAYS_INLINE uint32_t pvr_pds_encode_doutv(uint32_t cc, ^ Signed-off-by: Frank Binns <frank.binns@imgtec.com> Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15689>	2022-03-31 17:05:44 +00:00
Mike Blumenkrantz	33c800bf91	zink: remove radv cwrite driver workaround seems good fixes #6185 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15695>	2022-03-31 15:15:10 +00:00
Bas Nieuwenhuizen	9632e6eefa	radv: Fix vk_queue_to_radv for radv_image_queue_family_mask. radv_image_queue_family_mask was still using queue family index values for the special cases, while being passes a radv_queue_family enum. This adds the bits to the enum and vk_queue_to_radv so we can implement radv_image_queue_family_mask completely in terms of the enum. Fixes: `1ec4e568` ("radv: abstract queue family away from queue family index.") Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15423>	2022-03-31 14:43:10 +00:00
Adam Jackson	d43e6a9a49	dri: Remove the megadriver compat stub This is for compatibility with loaders that don't know about __DRI_DRIVER_GET_EXTENSIONS. xserver has supported that since 1.15.0, which is almost nine years old now. While we're about it, fix a comment in meson.build that used to be about megadrivers to reflect reality. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15649>	2022-03-31 09:58:49 -04:00
Adam Jackson	97554218f4	dri: Remove the globalDriverAPI hacks We now know that we always have access to the appropriate driver extensions, so we can get the right __DriverAPIRec from the vtable. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15649>	2022-03-31 09:58:47 -04:00
Adam Jackson	b6f7a4836a	dri: Fill in the driver extensions for the legacy createNewScreen paths The inner routine is going to want access to the driver extensions, but the legacy paths only give you the loader extensions. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15649>	2022-03-31 09:58:43 -04:00
Adam Jackson	c1dc98ceec	dri: Implement __DRI_DRIVER_VTABLE We're going to switch to using this instead of doing all these weird globalDriverAPI hacks, which are fragile and weird. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15649>	2022-03-31 09:58:40 -04:00
Adam Jackson	9c772de270	dri: Fold away some unused indirection in __DriverAPIRec The context-related API doesn't vary between dri2 and drisw. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15649>	2022-03-31 09:58:32 -04:00
Boris Brezillon	23bd889541	dzn: Properly support static blend constants The current code was assuming blend constants to be passed dynamically, which is wrong. Let's handle both the dynamic and static cases. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15608>	2022-03-31 12:21:35 +00:00
Boris Brezillon	f16a7aa9d6	dzn: Fix alpha blend factor translation When VK_BLEND_FACTOR_xxx_COLOR is passed to an alpha equation what we really want is the alpha component replicated on the RGBA channels, which has dedicated enums in D3D12. Let's make sure we use the right definition depending on the equation we're translating. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15608>	2022-03-31 12:21:35 +00:00
Samuel Pitoiset	738a6760e3	radv: suspend/resume queries during internal driver operations Pipeline statistics and occlusion queries shouldn't be enabled for internal driver operations like clears. Transform feedback queries don't have to be suspended because the driver doesn't use streamout. This fixes a bunch of Zink failures. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15660>	2022-03-31 11:47:22 +00:00
Erik Faye-Lund	83ed40cdcd	vbo/dlist: do not try to pad an empty draw In the case where u_index_generator returns zero new vertices, we never filled tmp_indices before trying to duplicate the last veretx. This causes us to read unitialized memory. This fixes a Valgrind issue triggering in glxgears on Zink: ---8<--- ==296461== Invalid read of size 2 ==296461== at 0x570F335: compile_vertex_list (vbo_save_api.c:733) ==296461== by 0x570FEFB: wrap_buffers (vbo_save_api.c:1021) ==296461== by 0x571050A: upgrade_vertex (vbo_save_api.c:1134) ==296461== by 0x571050A: fixup_vertex (vbo_save_api.c:1251) ==296461== by 0x57114D1: _save_Normal3f (vbo_attrib_tmp.h:315) ==296461== by 0x10B750: ??? (in /usr/bin/glxgears) ==296461== by 0x10A2CC: ??? (in /usr/bin/glxgears) ==296461== by 0x4B3F30F: (below main) (in /usr/lib/libc.so.6) ==296461== Address 0x11ca23de is 2 bytes before a block of size 1,968 alloc'd ==296461== at 0x4845899: malloc (in /usr/lib/valgrind/vgpreload_memcheck-amd64-linux.so) ==296461== by 0x570E647: compile_vertex_list (vbo_save_api.c:604) ==296461== by 0x570FEFB: wrap_buffers (vbo_save_api.c:1021) ==296461== by 0x571050A: upgrade_vertex (vbo_save_api.c:1134) ==296461== by 0x571050A: fixup_vertex (vbo_save_api.c:1251) ==296461== by 0x57114D1: _save_Normal3f (vbo_attrib_tmp.h:315) ==296461== by 0x10B750: ??? (in /usr/bin/glxgears) ==296461== by 0x10A2CC: ??? (in /usr/bin/glxgears) ==296461== by 0x4B3F30F: (below main) (in /usr/lib/libc.so.6) ---8<--- Fixes: `dcbf2423d2` ("vbo/dlist: add vertices to incomplete primitives") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15633>	2022-03-31 13:21:13 +02:00
Samuel Pitoiset	b784910ac7	radv: save/restore the stencil write mask during internal driver operations The slow depth/stencil clear path would overwrite the stencil write mask otherwise. This fixes few Zink failures. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15667>	2022-03-31 10:40:51 +00:00
Pierre-Eric Pelloux-Prayer	d58105fdd4	ac: remove LLVM 4.0 workaround This was added to fix https://bugs.freedesktop.org/show_bug.cgi?id=97988 but has been fixed in LLVM since. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15559>	2022-03-31 10:15:19 +00:00
Pierre-Eric Pelloux-Prayer	7d7c56e61a	radeonsi: drop LLVM global instruction selector I'm not sure if this is really used by anyone? Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15559>	2022-03-31 10:15:19 +00:00
Pierre-Eric Pelloux-Prayer	47152875c7	docs: document useful radeonsi env variables Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15559>	2022-03-31 10:15:19 +00:00
Pierre-Eric Pelloux-Prayer	9c2605ae42	drirc: enable radeonsi_zerovram for Black Geyser Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6180 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15559>	2022-03-31 10:15:19 +00:00
Boris Brezillon	329f16fffc	dzn: Make a bunch of functions private Looks like some functions that should have been marked static when transitioning from C++ methods to plain C where forgotten. Let's fix that now. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15607>	2022-03-31 09:46:36 +00:00
Boris Brezillon	4194c89ed8	dzn: Remove the dzn_cmd_exec_functions file That's a leftover from a previous version of the secondary command buffer implementation. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15607>	2022-03-31 09:46:36 +00:00
Boris Brezillon	b402cff591	dzn: Add Missing return type to dzn_translate_sampler_filter() Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15607>	2022-03-31 09:46:36 +00:00
Yonggang Luo	bf3c772e5e	ci: Improve vs2019 mesa_build.ps1 for remove the need of cmd.exe Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15595>	2022-03-31 09:16:27 +00:00
Yonggang Luo	55ca1c8db3	vulkan/microsoft: Remove `override_options: ['cpp_std=c++latest']` option for visual studio The project no longer uses c++20 features Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15595>	2022-03-31 09:16:27 +00:00
Daniel Schürmann	db8c401f71	aco/ra: fix stride check on subdword parallelcopies for create_vector On GFX6/7, info.rc is in full dwords. Fixes: `9476986e6f` ('aco/ra: special-case get_reg_for_create_vector_copy()') Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15393>	2022-03-31 08:55:07 +00:00
Samuel Pitoiset	d32656bc65	radv: lower has_multiview_view_index in NIR This lowering is done in a new NIR pass where the layer is written before emit_vertex_with_counter for geometry shaders and after the position for other vertex stages. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15456>	2022-03-31 07:34:21 +00:00
Samuel Pitoiset	2b18234e61	radv: drop EXT or KHR suffixes for stuff promoted in Vulkan 1.3 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15663>	2022-03-31 07:13:17 +00:00
Yiwei Zhang	e8a63cf61e	virgl: fake modifier plane count query support dri2_create_image_from_fd might pass host modifier. Before virgl consumes modifier info from the guest side, fake the support so that the image creation can still proceed instead of bailing. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15658>	2022-03-31 00:00:01 +00:00
Jason Ekstrand	51077e821a	vulkan: Allow the driver to manually enable threaded submit This is useful for drivers that wish to be able to block inside their vk_queue::driver_submit hook. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15566>	2022-03-30 23:17:56 +00:00
Jason Ekstrand	08512aea09	vulkan: Replace various uses of device->timeline_mode What we really care about is if we're DEFERRED so we need to do a flush and if there can be any other threads we might race against. We don't really care about the timeline mode itself. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15566>	2022-03-30 23:17:56 +00:00
Jason Ekstrand	8e51778acf	vulkan/queue: Rework vk_queue_submit() Instead of basing everything on the timeline mode, base it on the submit mode of the queue. This makes a lot more sense since it's what we really care about anyway. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15566>	2022-03-30 23:17:56 +00:00
Jason Ekstrand	e0ffdc8ce0	vulkan/queue: Rework submit thread enabling Now that we have a threading mode in the device, we can set that based on the environment variable instead of delaying it to submit time. This allows us to avoid the static variable trickery we use to avoid reading environment variables over and over again. We also move the enabling of the submit thread up a level or two and give it a bit more obvious condition. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15566>	2022-03-30 23:17:56 +00:00
Jason Ekstrand	9ddab162b7	vulkan/queue: Add a submit mode enum This encapsulates all three possible submit modes: immediate, deferred, and threaded. It's more clear than the has_thread boolean combined with device-level checks. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15566>	2022-03-30 23:17:56 +00:00
Emma Anholt	97f17d4b38	glsl: Delete dont_lower_swz path of lower_quadop_vector. This was last used with Mesa classic, in _mesa_ir_link_shader(). Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15623>	2022-03-30 22:26:15 +00:00
Emma Anholt	761eb7e539	glsl: Delete unused EmitNoPow path. This was last used with i915c, now lower_fpow covers this class of lowering. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15623>	2022-03-30 22:26:15 +00:00
Jason Ekstrand	dc2c9bae25	vulkan: Add more VU comments to justify framebuffer asserts Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15674>	2022-03-30 20:43:12 +00:00
Mike Blumenkrantz	38064566c6	zink: update radv ci baseline just rebased cts locally and this is what popped out Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15672>	2022-03-30 17:50:35 +00:00
Alyssa Rosenzweig	0c1fde956b	panfrost: Add Valhall compressed formats We need to map to the interchange format, since there is no longer a pixel format for the memory layout. Use this new format table on v9. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	42b9295fa6	panfrost: Restrict Z/S formats for Valhall Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	ac51142bab	panfrost: Handle Valhall IDVS in job_uses_tiling Valhall-style IDVS uses a distinct job type which has to be handled separately. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	ee9b63a7bb	panfrost: Disable AFBC on Valhall Doesn't work yet. Kick the can down the road; I'd like to get textures and FBOs working at all before worrying about compressing them. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	5b056971a3	pan/bi: Preload r60/r61 for MSAA + blend shader This is the sort of leakiness I hate about blend shaders. MSAA + blend shader is somewhat obscure but gets hit in the CTS. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	913a7ed41a	pan/bi: Use ID accessors for LEA_ATTR This is more portable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	3e08672369	pan/bi: Split out load/store to thread storage We need a slightly different idiom on Valhall, so let's first split the helpers for encapsulation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	5e76467d5d	pan/bi: Use nir_tex_instr_has_implicit_derivative Rather tracking it ourselves. Slightly shorter. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	fc81415f47	pan/bi: Call Valhall backend passes on v9 These are required to lower the IR into something suitable for Valhall packing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	ac5eb4934b	pan/bi: Fix write_mask size We really need to stop tying the IR to Bifrost... Fixes: `3c817ed511` ("pan/bi: Model Valhall texture instructions") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Alyssa Rosenzweig	12edaae64a	pan/bi: Add .shadow modifier to TEX_GATHER Although TEX_GATHER looks like TEX_FETCH, it does support shadow comparators like TEX_SINGLE. Model this in the IR so we can use it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15586>	2022-03-30 17:29:12 +00:00
Rajnesh Kanwal	860e3f6ff9	pvr: Check if the buffer/image was bound before unbinding. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15641>	2022-03-30 14:26:07 +00:00
Rohan Garg	d876abeaa8	anv: Drop dead code in anv_UpdateDescriptorSets Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15666>	2022-03-30 15:19:47 +02:00
Omar Akkila	4e4b411378	ci: cherry-pick deqp fix for zlib dependency Zlib was bumped to 1.2.12 breaking links to the previous 1.2.11. Unfortunately, no tag currently carries the fix so cherry-pick it for now. Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15626>	2022-03-30 08:23:18 +00:00
Omar Akkila	803705425e	ci: uprev vkd3d-proton to v2.6 GitHub has deprecated the git:// protocol and no longer accepts them. One of them vkd3d-proton's dependencies 'dxil-spirv' was still using git:// for its submodules up until v2.6 where it now uses https:// instead. Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15626>	2022-03-30 08:23:18 +00:00
Omar Akkila	f2fa850888	ci: uprev Fossilize Fossilize used the git:// protocol for fetching submodules but as of January 11, 2022, GitHub has tempirarily disabled acceptance of the Git protocol until March 15, 2022 whereby it will be permanantly disabled. This patch uprevs to the Fossilize commit that switches submodule URLs to https:// instead. Otherwise, we would get an error stating that "The unauthenticated git protocol on port 9418 is no longer supported." when trying to clone submodules. See https://github.blog/2021-09-01-improving-git-protocol-security-github for more info. Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15626>	2022-03-30 08:23:18 +00:00
Lionel Landwerlin	684a4ea30c	intel/clc: fix missing pointer write Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `346a7f14fb` ("intel/compiler: Add code for compiling CL-style SPIR-V kernels") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15611>	2022-03-30 07:56:25 +00:00
Samuel Pitoiset	03888bf09c	radv: fix mismatch between radv_GetPhysicalDeviceMemoryProperties*() This fixes a regression with dEQP-VK.api.info.get_physical_device_properties2.memory_properties. This test expects the unused array elements to be untouched. Fixes: `87b65af43e` ("radv: Use common GetPhysicalDeviceMemoryProperties") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15629>	2022-03-30 07:00:57 +00:00
Corentin Noël	f86bc873ff	nir_to_tgsi: Require the block index to always be populated In some cases like when using `NIR_DEBUG=serialize`, impl->num_blocks is 0 which leads to assertions error in the blocklist. Make sure to require the num_blocks to be populated. Fixes: `74c02d99b2` Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15640>	2022-03-30 04:19:14 +00:00
Mike Blumenkrantz	6cfde0abe9	mesa/st: don't add pointsize to ES programs if it already exists an obvious missed case from the previous series, but there's no need to-readd an output if it already exists here Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15590>	2022-03-30 03:49:36 +00:00
Mike Blumenkrantz	2b22485702	mesa/st: rework pointsize constant uploads this now has a flag that is toggled when the last vertex stage changes, and this will trigger the appropriate updates during state validation Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15590>	2022-03-30 03:49:36 +00:00
Mike Blumenkrantz	0108323996	mesa/st: always flag last vertex stage constants for upload on pointsize change if the driver requires pointsize uploads, the pointsize has to actually be uploaded whenever the pointsize changes Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15590>	2022-03-30 03:49:36 +00:00
Mike Blumenkrantz	452d4d00df	mesa/st: rework atom flagging when pointsize changes if the driver requires pointsize uploads, only flag the last vertex stage for updates, not all vertex stages this should be functionally equivalent but without the unnecessary overhead of also scanning the other stages Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15590>	2022-03-30 03:49:36 +00:00
Mike Blumenkrantz	206d2f3127	zink: add driver workaround for broken EXT_depth_clip_control for #6186 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15513>	2022-03-30 03:36:33 +00:00
Mike Blumenkrantz	fe7c3eba33	llvmpipe: handle sampling from 2d views of 3d images this is seldom used but is required by KHR_gl_texture_3D_image cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15583>	2022-03-30 03:18:52 +00:00
Mike Blumenkrantz	d415c28e64	zink: force push descriptors cache update if hashing detects changes this was previously only forced if the program pointer changed, but programs can be freed and reused, and these are definite cases where the last set cannot be reused, so jam it in cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15591>	2022-03-30 03:04:47 +00:00
Mike Blumenkrantz	5461a1cbaa	lavapipe: enforce monotonic timeline incrementing maybe just being overly paranoid, but make sure that the timeline id gets compared while the lock is held in every scenario cc: mesa-stable Reviewed-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15453>	2022-03-30 02:47:45 +00:00
Mike Blumenkrantz	7aed40e4ab	lavapipe: allow timeline progress in GetSemaphoreCounterValue the vulkan spec doesn't explicitly state whether this function progresses a given semaphore's timeline, and apparently there are some cases where it's assumed that progress occurs if this function is called in a loop instead of WaitSemaphores, so check the current fence for completion Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15453>	2022-03-30 02:47:45 +00:00
Mike Blumenkrantz	4eca6e3e5d	lavapipe: fix xfb availability query copying xfb queries have 2 results cc: mesa-stable fixes (zink): spec@arb_query_buffer_object@qbo Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15644>	2022-03-30 02:31:56 +00:00
Dave Airlie	63d0440034	lavapipe: add loop unrolling. zink was giving us rolled spir-v and nothing was unrolling it, start unrolling the NIR in the frontend. unrolling would leave a texture with a constant texture_offset, translate it to a texture index. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15287>	2022-03-30 02:16:18 +00:00
Mike Blumenkrantz	8e2555234b	lavapipe: fix shader indexing of sampler arrays with const array index if it's a constant, then just return that bit, not the full bitmask Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15287>	2022-03-30 02:16:18 +00:00
Yonggang Luo	c91e3c6a42	util: Should not use ASSERTED in util_thread_get_time_nano The parameter is not ASSERTED Fixes: `0f1b3fc17f` ("util: Fixes unused parameter warnings") Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15655>	2022-03-29 23:33:21 +00:00
Mike Blumenkrantz	4525d7ed85	vulkan: update more headers to 1.3.210 Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15645>	2022-03-29 22:43:52 +00:00
Mike Blumenkrantz	e219351457	iris: assert that samplerview base_array_layer is zero for hw < skl this codepath is broken for older hardware Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15610>	2022-03-29 22:12:30 +00:00
Mike Blumenkrantz	4ec71a3429	crocus: assert that 3d samplerview base_array_layer is zero this codepath will not work on crocus Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15610>	2022-03-29 22:12:30 +00:00
Mike Blumenkrantz	65ec846f77	intel/isl: fix 2d view of 3d textures according to KHR_gl_texture_3D_image: If <target> is EGL_GL_TEXTURE_3D_KHR, <buffer> must be the name of a complete, nonzero, GL_TEXTURE_3D (or equivalent in GL extensions) target texture object, cast into the type EGLClientBuffer. <attr_list> should specify the mipmap level (EGL_GL_TEXTURE_LEVEL_KHR) and z-offset (EGL_GL_TEXTURE_ZOFFSET_KHR) which will be used as the EGLImage source; the specified mipmap level must be part of <buffer>, and the specified z-offset must be smaller than the depth of the specified mipmap level. thus a 2d view of a 3d surface is not only legal, it's part of the spec and must be supported when available cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15584>	2022-03-29 21:44:51 +00:00
Pavel Ondračka	806dcf9db7	r300: only output wpos in vertex shaders when needed Right now we always copy pos to wpos regardless of if the fragment shader needs it or not. Instead just do it only for the shaders that really need it. Shader-db is not able to measure preciselly the effect as we build only the non-wpos variants, but given that majority of fragment shaders don't needs the wpos, the real effect should be close: total instructions in shared programs: 105427 -> 104313 (-1.06%) instructions in affected programs: 53098 -> 51984 (-2.10%) total temps in shared programs: 13991 -> 13958 (-0.24%) temps in affected programs: 114 -> 81 (-28.95%) Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15321>	2022-03-29 21:37:24 +00:00
Pavel Ondračka	8c90a5fab7	r300: move r300_init_vs_outputs to r300_translate_vertex_shader Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15321>	2022-03-29 21:37:24 +00:00
Pavel Ondračka	882811b1ff	r300: restructure r300_vertex_shader Make the r300_vertex_shader structure resemble the r300_framgment_shader in order to allow support for shader variants. There is no functional change, but it will be used in the next commit to only output wpos when needed from vertex shaders. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15321>	2022-03-29 21:37:24 +00:00
Pavel Ondračka	d9449c5201	r300: optimize single write scenarios in rc_copy_output Right now we always write to a temp and than emit movs to both POS and WPOS. When the result is trivial, like: MOV temp[0], in[0] MOV output[0], temp[0] MOV output[1], temp[0] the dataflow analysis passes can take care of it, this is however one of the last optimizations we need the pass for. Additionally it fails even for some still trivial cases like: MAD temp[2], const[3], temp[0].wwww, temp[1]; MOV output[0], temp[2]; MOV output[2], temp[2]; This patch will just duplicate the output instuction when there is only a single output write. The long-term plan would be to just trust NIR, do as little in the backend as possible and optimize the remaining backend passes to not need any additional cleanups. total instructions in shared programs: 105862 -> 105427 (-0.41%) instructions in affected programs: 20527 -> 20092 (-2.12%) total temps in shared programs: 14029 -> 13991 (-0.27%) temps in affected programs: 152 -> 114 (-25.00%) Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip.gawin@zoho.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15321>	2022-03-29 21:37:24 +00:00
Pavel Ondračka	5dcef1e7b8	r300: don't move position output to the end when duplicating it for WPOS Instead just emit both outputs as soon as possible. If the last write is inside a loop or a branch, emit it after the ENDLOOP or ENDIF. This saves some temps and also allows us to potentially benefit from R300_PVS_XYZW_VALID_INST as right now the position output write is always penultimate with the WPOS output being the last. total temps in shared programs: 14101 -> 14029 (-0.51%) temps in affected programs: 435 -> 363 (-16.55%) helped: 72 HURT: 0 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip.gawin@zoho.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15321>	2022-03-29 21:37:24 +00:00
Mike Blumenkrantz	8cf10ae144	zink: add in radv passes to baseline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15654>	2022-03-29 21:24:01 +00:00
Mike Blumenkrantz	f3fa085fc6	zink: more radv fails Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15654>	2022-03-29 21:24:01 +00:00
Mike Blumenkrantz	4e6f1c1478	zink: update radv baseline it doesn't look like these ever passed? Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15654>	2022-03-29 21:24:01 +00:00
Kenneth Graunke	8831cb38aa	anv: Stop updating STATE_BASE_ADDRESS on XeHP Now that we're using 3DSTATE_BINDING_TABLE_POOL_ALLOC to set the base address for the binding table pool separately from surface states, we don't actually need to update surface state base address anymore. Instead, we can just set STATE_BASE_ADDRESS once at context creation, and never bother updating it again, saving some heavyweight flushes and freeing us from the need for address offsetting trickery. This patch was originally written by Jason Ekstrand, with fixes from Lionel Landwerlin, but was targeting Icelake. Doing it there requires additional changes (15:5 -> 18:8 binding table pointer formats) which also involve some trade-offs, whereas the XeHP change is purely a win, so we'll do it here first. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15616>	2022-03-29 20:45:59 +00:00
Kenneth Graunke	1967fd3b10	intel/compiler: Call inst->resize_sources before setting the sources You should probably resize the sources array before accessing entries that might be out of bounds. inst->resize_sources() always allocates enough space for at least 3 sources, so this is really only an issue when there are 4+ sources. Fixes: `a920979d4f` ("intel/fs: Use split sends for surface writes on gen9+") Fixes: `4f86a70599` ("intel/fs: Lower DW untyped r/w messages to LSC when available") Fixes: `d372abe397` ("intel/fs: Add surface OWORD BLOCK opcodes") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15632>	2022-03-29 13:06:17 -07:00
Dylan Baker	356f6bb8a5	docs: update calendar and link releases notes for 22.0.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15653>	2022-03-29 13:01:17 -07:00
Dylan Baker	215ae6b270	docs: add sah256 sum for mesa 22.0.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15653>	2022-03-29 13:01:12 -07:00
Dylan Baker	d85cdcad5d	docs: add release notes for 22.0.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15653>	2022-03-29 13:01:10 -07:00
Filip Gawin	5a459b8f6b	r300: fix swizzle handling in transformation of abs Helps with scalar opcodes like pow. (which are propagating first used channel) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15222>	2022-03-29 19:49:17 +00:00
Dylan Baker	85ae6598fa	docs: Add calendar entries for 22.1 release candidates. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15650>	2022-03-29 19:29:48 +00:00
Mike Blumenkrantz	01f3504e2f	zink: remove anv workaround for broken color writes it's healed! Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15564>	2022-03-29 19:17:10 +00:00
Mike Blumenkrantz	408f114830	zink: break out CmdSetColorWriteEnableEXT to util function this will otherwise become a no-op, and it's only intended that this be called if cwrite is actually being used Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15609>	2022-03-29 19:01:01 +00:00
Yonggang Luo	a1814067cd	nir: Move the define of snprintf to header nir.h The define of snprintf in nir_lower_atomics_to_ssbo.c is duplicated, so remove it from this file Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14014>	2022-03-30 00:45:11 +08:00
Yonggang Luo	153cb830c4	vtn: Fixes compiling error for mingw/ucrt by using setjmp/longjmp function instead compiler builtin The compiling error are: ``` [369/1463] Compiling C object src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj FAILED: src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj "cc" "-Isrc/compiler/nir/libnir.a.p" "-Isrc/compiler/nir" "-I../../src/compiler/nir" "-Iinclude" "-I../../include" "-Isrc" "-I../../src" "-Isrc/mapi" "-I../../src/mapi" "-Isrc/mesa" "-I../../src/mesa" "-I../../src/gallium/include" "-Isrc/gallium/auxiliary" "-I../../src/gallium/auxiliary" "-Isrc/compiler" "-I../../src/compiler" "-Isrc/compiler/spirv" "-I../../src/compiler/spirv" "-fvisibility=hidden" "-fcolor-diagnostics" "-D_FILE_OFFSET_BITS=64" "-Wall" "-Winvalid-pch" "-std=c11" "-O2" "-g" "-D__STDC_CONSTANT_MACROS" "-D__STDC_FORMAT_MACROS" "-D__STDC_LIMIT_MACROS" "-DPACKAGE_VERSION=\"22.1.0-devel\"" "-DPACKAGE_BUGREPORT=\"https://gitlab.freedesktop.org/mesa/mesa/-/issues\"" "-DHAVE_WINDOWS_PLATFORM" "-DHAVE_SURFACELESS_PLATFORM" "-DUSE_ELF_TLS" "-DUSE_TLS_BEHIND_FUNCTIONS" "-DENABLE_ST_OMX_BELLAGIO=0" "-DENABLE_ST_OMX_TIZONIA=0" "-DEGL_NO_X11" "-DHAVE___BUILTIN_BSWAP32" "-DHAVE___BUILTIN_BSWAP64" "-DHAVE___BUILTIN_CLZ" "-DHAVE___BUILTIN_CLZLL" "-DHAVE___BUILTIN_CTZ" "-DHAVE___BUILTIN_EXPECT" "-DHAVE___BUILTIN_FFS" "-DHAVE___BUILTIN_FFSLL" "-DHAVE___BUILTIN_POPCOUNT" "-DHAVE___BUILTIN_POPCOUNTLL" "-DHAVE___BUILTIN_UNREACHABLE" "-DHAVE___BUILTIN_TYPES_COMPATIBLE_P" "-DHAVE_FUNC_ATTRIBUTE_CONST" "-DHAVE_FUNC_ATTRIBUTE_FLATTEN" "-DHAVE_FUNC_ATTRIBUTE_MALLOC" "-DHAVE_FUNC_ATTRIBUTE_PURE" "-DHAVE_FUNC_ATTRIBUTE_UNUSED" "-DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT" "-DHAVE_FUNC_ATTRIBUTE_WEAK" "-DHAVE_FUNC_ATTRIBUTE_FORMAT" "-DHAVE_FUNC_ATTRIBUTE_PACKED" "-DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL" "-DHAVE_FUNC_ATTRIBUTE_ALIAS" "-DHAVE_FUNC_ATTRIBUTE_NORETURN" "-DHAVE_FUNC_ATTRIBUTE_VISIBILITY" "-DHAVE_UINT128" "-D_WINDOWS" "-D_WIN32_WINNT=0x0A00" "-DWINVER=0x0A00" "-DPIPE_SUBSYSTEM_WINDOWS_USER" "-D_USE_MATH_DEFINES" "-DUSE_SSE41" "-DUSE_GCC_ATOMIC_BUILTINS" "-DHAS_SCHED_H" "-DHAVE_CET_H" "-DHAVE_STRTOF" "-DHAVE_TIMESPEC_GET" "-DHAVE_STRTOK_R" "-DHAVE_QSORT_S" "-DHAVE_ZLIB" "-DHAVE_ZSTD" "-DHAVE_COMPRESSION" "-DLLVM_AVAILABLE" "-DMESA_LLVM_VERSION_STRING=\"13.0.1\"" "-DLLVM_IS_SHARED=1" "-DDRAW_LLVM_AVAILABLE" "-DMESA_EXECMEM" "-DVK_USE_PLATFORM_WIN32_KHR" "-Werror=implicit-function-declaration" "-Werror=missing-prototypes" "-Werror=return-type" "-Werror=empty-body" "-Werror=incompatible-pointer-types" "-Werror=int-conversion" "-Wimplicit-fallthrough" "-Wno-missing-field-initializers" "-fno-math-errno" "-fno-trapping-math" "-Qunused-arguments" "-fno-common" "-Wno-microsoft-enum-value" "-Werror=format" "-Wformat-security" "-Werror=thread-safety" "-ffunction-sections" "-fdata-sections" "-Werror=pointer-arith" "-Werror=gnu-empty-initializer" "-Wno-override-init" "-Wno-initializer-overrides" -MD -MQ src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj -MF "src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj.d" -o src/compiler/nir/libnir.a.p/.._spirv_gl_spirv.c.obj "-c" ../../src/compiler/spirv/gl_spirv.c ../../src/compiler/spirv/gl_spirv.c:241:19: error: incompatible pointer types passing 'jmp_buf' (aka '_JBTYPE [16]') to parameter of type 'void ' [-Werror,-Wincompatible-pointer-types] if (vtn_setjmp(b->fail_jump)) { ^~~~~~~~~~~~ 1 error generated. [376/1463] Compiling C object src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj FAILED: src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj "cc" "-Isrc/compiler/nir/libnir.a.p" "-Isrc/compiler/nir" "-I../../src/compiler/nir" "-Iinclude" "-I../../include" "-Isrc" "-I../../src" "-Isrc/mapi" "-I../../src/mapi" "-Isrc/mesa" "-I../../src/mesa" "-I../../src/gallium/include" "-Isrc/gallium/auxiliary" "-I../../src/gallium/auxiliary" "-Isrc/compiler" "-I../../src/compiler" "-Isrc/compiler/spirv" "-I../../src/compiler/spirv" "-fvisibility=hidden" "-fcolor-diagnostics" "-D_FILE_OFFSET_BITS=64" "-Wall" "-Winvalid-pch" "-std=c11" "-O2" "-g" "-D__STDC_CONSTANT_MACROS" "-D__STDC_FORMAT_MACROS" "-D__STDC_LIMIT_MACROS" "-DPACKAGE_VERSION=\"22.1.0-devel\"" "-DPACKAGE_BUGREPORT=\"https://gitlab.freedesktop.org/mesa/mesa/-/issues\"" "-DHAVE_WINDOWS_PLATFORM" "-DHAVE_SURFACELESS_PLATFORM" "-DUSE_ELF_TLS" "-DUSE_TLS_BEHIND_FUNCTIONS" "-DENABLE_ST_OMX_BELLAGIO=0" "-DENABLE_ST_OMX_TIZONIA=0" "-DEGL_NO_X11" "-DHAVE___BUILTIN_BSWAP32" "-DHAVE___BUILTIN_BSWAP64" "-DHAVE___BUILTIN_CLZ" "-DHAVE___BUILTIN_CLZLL" "-DHAVE___BUILTIN_CTZ" "-DHAVE___BUILTIN_EXPECT" "-DHAVE___BUILTIN_FFS" "-DHAVE___BUILTIN_FFSLL" "-DHAVE___BUILTIN_POPCOUNT" "-DHAVE___BUILTIN_POPCOUNTLL" "-DHAVE___BUILTIN_UNREACHABLE" "-DHAVE___BUILTIN_TYPES_COMPATIBLE_P" "-DHAVE_FUNC_ATTRIBUTE_CONST" "-DHAVE_FUNC_ATTRIBUTE_FLATTEN" "-DHAVE_FUNC_ATTRIBUTE_MALLOC" "-DHAVE_FUNC_ATTRIBUTE_PURE" "-DHAVE_FUNC_ATTRIBUTE_UNUSED" "-DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT" "-DHAVE_FUNC_ATTRIBUTE_WEAK" "-DHAVE_FUNC_ATTRIBUTE_FORMAT" "-DHAVE_FUNC_ATTRIBUTE_PACKED" "-DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL" "-DHAVE_FUNC_ATTRIBUTE_ALIAS" "-DHAVE_FUNC_ATTRIBUTE_NORETURN" "-DHAVE_FUNC_ATTRIBUTE_VISIBILITY" "-DHAVE_UINT128" "-D_WINDOWS" "-D_WIN32_WINNT=0x0A00" "-DWINVER=0x0A00" "-DPIPE_SUBSYSTEM_WINDOWS_USER" "-D_USE_MATH_DEFINES" "-DUSE_SSE41" "-DUSE_GCC_ATOMIC_BUILTINS" "-DHAS_SCHED_H" "-DHAVE_CET_H" "-DHAVE_STRTOF" "-DHAVE_TIMESPEC_GET" "-DHAVE_STRTOK_R" "-DHAVE_QSORT_S" "-DHAVE_ZLIB" "-DHAVE_ZSTD" "-DHAVE_COMPRESSION" "-DLLVM_AVAILABLE" "-DMESA_LLVM_VERSION_STRING=\"13.0.1\"" "-DLLVM_IS_SHARED=1" "-DDRAW_LLVM_AVAILABLE" "-DMESA_EXECMEM" "-DVK_USE_PLATFORM_WIN32_KHR" "-Werror=implicit-function-declaration" "-Werror=missing-prototypes" "-Werror=return-type" "-Werror=empty-body" "-Werror=incompatible-pointer-types" "-Werror=int-conversion" "-Wimplicit-fallthrough" "-Wno-missing-field-initializers" "-fno-math-errno" "-fno-trapping-math" "-Qunused-arguments" "-fno-common" "-Wno-microsoft-enum-value" "-Werror=format" "-Wformat-security" "-Werror=thread-safety" "-ffunction-sections" "-fdata-sections" "-Werror=pointer-arith" "-Werror=gnu-empty-initializer" "-Wno-override-init" "-Wno-initializer-overrides" -MD -MQ src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj -MF "src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj.d" -o src/compiler/nir/libnir.a.p/.._spirv_spirv_to_nir.c.obj "-c" ../../src/compiler/spirv/spirv_to_nir.c ../../src/compiler/spirv/spirv_to_nir.c:196:16: error: incompatible pointer types passing 'jmp_buf' (aka '_JBTYPE [16]') to parameter of type 'void ' [-Werror,-Wincompatible-pointer-types] vtn_longjmp(b->fail_jump, 1); ^~~~~~~~~~~~ ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14014>	2022-03-30 00:45:08 +08:00
Yonggang Luo	887d74dd2f	meson: Add predefined macro -D__MSVCRT_VERSION__=0x0700 only in mingw environment without _UCRT Fixed the following error: ``` 1 error generated. [18/1468] Compiling C object src/util/libmesa_util.a.p/half_float.c.obj FAILED: src/util/libmesa_util.a.p/half_float.c.obj "cc" "-Isrc/util/libmesa_util.a.p" "-Isrc/util" "-I../../src/util" "-Iinclude" "-I../../include" "-Isrc" "-I../../src" "-Isrc/mapi" "-I../../src/mapi" "-Isrc/mesa" "-I../../src/mesa" "-I../../src/gallium/include" "-Isrc/gallium/auxiliary" "-I../../src/gallium/auxiliary" "-IC:/CI-Tools/msys64/clang64/include" "-fvisibility=hidden" "-fcolor-diagnostics" "-D_FILE_OFFSET_BITS=64" "-Wall" "-Winvalid-pch" "-std=c11" "-O2" "-g" "-D__STDC_CONSTANT_MACROS" "-D__STDC_FORMAT_MACROS" "-D__STDC_LIMIT_MACROS" "-DPACKAGE_VERSION=\"22.1.0-devel\"" "-DPACKAGE_BUGREPORT=\"https://gitlab.freedesktop.org/mesa/mesa/-/issues\"" "-DHAVE_WINDOWS_PLATFORM" "-DHAVE_SURFACELESS_PLATFORM" "-DUSE_ELF_TLS" "-DUSE_TLS_BEHIND_FUNCTIONS" "-DENABLE_ST_OMX_BELLAGIO=0" "-DENABLE_ST_OMX_TIZONIA=0" "-DEGL_NO_X11" "-DHAVE___BUILTIN_BSWAP32" "-DHAVE___BUILTIN_BSWAP64" "-DHAVE___BUILTIN_CLZ" "-DHAVE___BUILTIN_CLZLL" "-DHAVE___BUILTIN_CTZ" "-DHAVE___BUILTIN_EXPECT" "-DHAVE___BUILTIN_FFS" "-DHAVE___BUILTIN_FFSLL" "-DHAVE___BUILTIN_POPCOUNT" "-DHAVE___BUILTIN_POPCOUNTLL" "-DHAVE___BUILTIN_UNREACHABLE" "-DHAVE___BUILTIN_TYPES_COMPATIBLE_P" "-DHAVE_FUNC_ATTRIBUTE_CONST" "-DHAVE_FUNC_ATTRIBUTE_FLATTEN" "-DHAVE_FUNC_ATTRIBUTE_MALLOC" "-DHAVE_FUNC_ATTRIBUTE_PURE" "-DHAVE_FUNC_ATTRIBUTE_UNUSED" "-DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT" "-DHAVE_FUNC_ATTRIBUTE_WEAK" "-DHAVE_FUNC_ATTRIBUTE_FORMAT" "-DHAVE_FUNC_ATTRIBUTE_PACKED" "-DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL" "-DHAVE_FUNC_ATTRIBUTE_ALIAS" "-DHAVE_FUNC_ATTRIBUTE_NORETURN" "-DHAVE_FUNC_ATTRIBUTE_VISIBILITY" "-DHAVE_UINT128" "-D_WINDOWS" "-D_WIN32_WINNT=0x0A00" "-DWINVER=0x0A00" "-DPIPE_SUBSYSTEM_WINDOWS_USER" "-D_USE_MATH_DEFINES" "-D__MSVCRT_VERSION__=0x0700" "-DUSE_SSE41" "-DUSE_GCC_ATOMIC_BUILTINS" "-DHAS_SCHED_H" "-DHAVE_CET_H" "-DHAVE_STRTOF" "-DHAVE_TIMESPEC_GET" "-DHAVE_STRTOK_R" "-DHAVE_QSORT_S" "-DHAVE_ZLIB" "-DHAVE_ZSTD" "-DHAVE_COMPRESSION" "-DLLVM_AVAILABLE" "-DMESA_LLVM_VERSION_STRING=\"13.0.1\"" "-DLLVM_IS_SHARED=1" "-DDRAW_LLVM_AVAILABLE" "-DMESA_EXECMEM" "-DVK_USE_PLATFORM_WIN32_KHR" "-Werror=implicit-function-declaration" "-Werror=missing-prototypes" "-Werror=return-type" "-Werror=empty-body" "-Werror=incompatible-pointer-types" "-Werror=int-conversion" "-Wimplicit-fallthrough" "-Wno-missing-field-initializers" "-fno-math-errno" "-fno-trapping-math" "-Qunused-arguments" "-fno-common" "-Wno-microsoft-enum-value" "-Werror=format" "-Wformat-security" "-Werror=thread-safety" "-ffunction-sections" "-fdata-sections" "-pthread" "-Werror=pointer-arith" "-Werror=gnu-empty-initializer" -MD -MQ src/util/libmesa_util.a.p/half_float.c.obj -MF "src/util/libmesa_util.a.p/half_float.c.obj.d" -o src/util/libmesa_util.a.p/half_float.c.obj "-c" ../../src/util/half_float.c In file included from ../../src/util/half_float.c:30: In file included from ../../src/util/half_float.h:32: In file included from ../../src/util/u_cpu_detect.h:41: In file included from ../../src/util/u_thread.h:35: In file included from ../../include/c11/threads.h:64: ../../include/c11/threads_win32.h:136:5: error: implicit declaration of function 'timespec_get' is invalid in C99 [-Werror,-Wimplicit-function-declaration] timespec_get(&now, TIME_UTC); ^ ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14014>	2022-03-30 00:45:03 +08:00
Mike Blumenkrantz	0bef8bc054	zink: fix error logging for 2d z/s checking this is expected to fail and so is not an error Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15643>	2022-03-29 15:48:27 +00:00
Jason Ekstrand	4a08ee7ecf	spirv/libclc: Add generic versions of arithmetic functions Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15622>	2022-03-29 15:02:07 +00:00
Jason Ekstrand	688d478045	v3dv/queue: Rework multisync_free Thix fixes two bugs. First, we stop leaking in/out fences with multisync. Because the in_syncs and out_syncs parameters to set_multisync were arrays and not pointers to arrays, the caller's in_syncs and out_syncs pointers never got set and remained NULL so multisync_free() always sees to NULL pointers and does nothing, leaking both arrays. Not sure how this isn't showing up in the dEQP leak check tests. Second, the struct drm_v3d_multi_sync was in the scope of the then clause of the `if (device->pdevice->caps.multisync)` so it goes out of scope before the ioctl. This is, effectively, a use-after-free and, depending on stack allocation details, may result in the multisync extension struct getting stompped before the ioctl. Fixes: `ff8586c345` ("v3dv: enable multiple semaphores on cl submission") Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15512>	2022-03-29 14:38:41 +00:00
Mike Blumenkrantz	d001150d0c	doc: update extensions for lavapipe Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15636>	2022-03-29 12:41:51 +00:00
Mike Blumenkrantz	aed49c89a5	lavapipe: display EXT_graphics_pipeline_library Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15636>	2022-03-29 12:41:51 +00:00
Mike Blumenkrantz	d4d5a7abba	lavapipe: implement EXT_graphics_pipeline_library Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15636>	2022-03-29 12:41:51 +00:00
Mike Blumenkrantz	22fd70ca81	lavapipe: support KHR_pipeline_library this is basically a no-op since raytracing isn't supported Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15636>	2022-03-29 12:41:51 +00:00
Mike Blumenkrantz	0918095b9a	lavapipe: EXT_primitives_generated_query Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15636>	2022-03-29 12:41:51 +00:00
Mike Blumenkrantz	8c3b4dd996	vulkan: update spec to 1.3.210 Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15635>	2022-03-29 07:59:23 -04:00
Iago Toral Quiroga	7f6ecb8667	v3dv: add reference counting for descriptor set layouts The spec states that descriptor set layouts can be destroyed almost at any time: "VkDescriptorSetLayout objects may be accessed by commands that operate on descriptor sets allocated using that layout, and those descriptor sets must not be updated with vkUpdateDescriptorSets after the descriptor set layout has been destroyed. Otherwise, descriptor set layouts can be destroyed any time they are not in use by an API command." Based on a similar fix for RADV. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5893 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15634>	2022-03-29 11:28:39 +00:00
Iago Toral Quiroga	ca861bd6f4	v3dv: drop unnecessary memset We are already zeroing when we allocate the descriptor set layout memory with vk_object_zalloc. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15634>	2022-03-29 11:28:39 +00:00
Iago Toral Quiroga	591eed30b2	v3dv: fix sampler array addressing in v3dv_descriptor_set_layout Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15634>	2022-03-29 11:28:39 +00:00
Samuel Pitoiset	00cc52b282	radv: remove now unused radv_nir_compiler_options::layout Descriptors are lowered in NIR and the layout is no longer used in the backend compilers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15630>	2022-03-29 10:39:07 +00:00
Kenneth Graunke	50467aa558	iris: Properly tell the decoder about inherited binder addresses iris may rely on 3DSTATE_BINDING_TABLE_POOL_ALLOC's address being inherited from a previous batch. So, we need to tell the decoder about the address in case it sees binding tables to decode before it encounters such a command in the batch. We used to update surface_base, now we update bt_pool_base instead. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15625>	2022-03-29 02:35:54 -07:00
Kenneth Graunke	9bc97e4fc1	intel/decoder: Fix decoder handling of binding table pool alloc on XeHP 3DSTATE_BINDING_TABLE_POOL_ALLOC no longer has a "Binding Table Pool Enable" bit. It is always enabled. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15625>	2022-03-29 02:35:54 -07:00
Mihai Preda	08f74e7185	radeonsi: merge the copy_image shader generators Use a unified si_create_copy_image_cs() to generate both the 1D and 2D copy shaders; the shaders themselves remain separate. Also: - add a helper deref_ssa() for nir deref - add a helper set_work_size() for setting workgroup and grid sizes - pass si_context (instead of pipe_context) to the copy_image generator Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15268>	2022-03-29 09:06:57 +00:00
Mihai Preda	c0ef40bbce	radeonsi: convert copy_image_1d_array shader to NIR Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15268>	2022-03-29 09:06:57 +00:00
Mihai Preda	18722af9d2	radeonsi: convert copy_image shader to NIR Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15268>	2022-03-29 09:06:57 +00:00
Vinson Lee	79ba1962ac	pvr: Remove duplicate variable queue_create. Fix defect reported by Coverity Scan. Evaluation order violation (EVALUATION_ORDER) write_write_typo: In queue_create = queue_create = &pCreateInfo->pQueueCreateInfos[0], queue_create is written twice with the same value. Fixes: `8991e64641` ("pvr: Add a Vulkan driver for Imagination Technologies PowerVR Rogue GPUs") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15604>	2022-03-29 08:56:27 +00:00
Danylo Piliaiev	10734fb748	turnip: enable has_ccu_flush_bug workaround for a660 It seems that a660 has the same bug. Without the workaround there are a lot of flakes with depth-stencil tests, e.g. in: dEQP-VK.pipeline.extended_dynamic_state.* dEQP-VK.renderpass.depth_stencil_write_conditions.* dEQP-VK.pipeline.stencil.format.d24_unorm_s8_uint.states.* Or guaranteed failures like of: dEQP-VK.pipeline.render_to_image.core.2d.huge.width.r8g8b8a8_unorm_d32_sfloat_s8_uint Enabling the workaround fixes all of them. cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15548>	2022-03-29 08:34:18 +00:00
Tales Lelo da Aparecida	fe66cff411	zink: validate and log errors on vulkan calls This commit also replaces debug_printf with mesa_loge Signed-off-by: Tales Lelo da Aparecida <tales.aparecida@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15601>	2022-03-28 21:46:59 -03:00
Mihai Preda	1f92c30170	radeonsi/tests: update baseline and flakes on vega20 The piglit suite was run on vega20, 10 times, on a 'debugoptimized' mesa build at commit `582e7f15` . The inconsistent failures were added to the flakes file. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15487>	2022-03-28 22:48:16 +00:00
Mihai Preda	a6d7c5aa31	radeonsi/tests: add flakes option to radeonsi-run-tests.py Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15487>	2022-03-28 22:48:16 +00:00
Mihai Preda	88c14ed1da	radeonsi/tests: fix file left open in radeonsi-run-tests.py Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15487>	2022-03-28 22:48:16 +00:00
Mike Blumenkrantz	e54fb81151	zink: run piglit's gpu profile increase coverage Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15606>	2022-03-28 22:35:32 +00:00
Konstantin Seurer	a8c7e8fef8	venus: Use trivial common entrypoints Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15613>	2022-03-28 22:15:45 +00:00
Konstantin Seurer	87b65af43e	radv: Use common GetPhysicalDeviceMemoryProperties Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15613>	2022-03-28 22:15:45 +00:00
Mike Blumenkrantz	3b39e4d913	lavapipe: run optimize loop before krangling pipeline layout in scenarios like: vec1 32 ssa_151 = deref_var &shadow_map (uniform sampler2D) vec1 32 ssa_152 = deref_var &shadow_map (uniform sampler2D) vec2 32 ssa_153 = vec2 ssa_151, ssa_152 vec1 32 ssa_154 = deref_var &param@4 (function_temp uvec2) intrinsic store_deref (ssa_154, ssa_153) (wrmask=xy /3/, access=0) vec1 32 ssa_160 = deref_var &param@4 (function_temp uvec2) vec2 32 ssa_164 = intrinsic load_deref (ssa_160) (access=0) vec1 32 ssa_167 = mov ssa_164.x vec1 32 ssa_168 = deref_cast (texture2D )ssa_167 (uniform texture2D) / ptr_stride=0, align_mul=0, align_offset=0 / vec1 32 ssa_169 = mov ssa_164.y vec1 32 ssa_170 = deref_cast (sampler )ssa_169 (uniform sampler) /* ptr_stride=0, align_mul=0, align_offset=0 */ vec1 32 ssa_172 = (float32)tex ssa_168 (texture_deref), ssa_170 (sampler_deref), ssa_171 (coord), ssa_166 (comparator) the real variable is stored to a function_temp and then loaded back again, which means it isn't a direct deref and lower_vri_instr_tex_deref() will crash because the variable can't be found BUT running only the passes needed to eliminate derefs will break other tests, so just run the whole optimize loop again here to avoid such issues for #5945 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15511>	2022-03-28 22:01:49 +00:00
Mike Blumenkrantz	6158e9448a	zink: add a couple flakes unsure what caused these but they started a week ago and were first seen in https://gitlab.freedesktop.org/mesa/mesa/-/jobs/19950494 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15621>	2022-03-28 21:50:04 +00:00
Yiwei Zhang	6d1cc0e47d	venus: let vn_android use vn_BindImageMemory2 and directly use reqs v2: remove an unused variable Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15619>	2022-03-28 21:39:05 +00:00
Timur Kristóf	2132d4278d	radv: Use correct buffer offset for conditional rendering. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15615>	2022-03-28 20:32:34 +00:00
Georg Lehmann	141ca78634	radv, aco: Packed iadd_sat/uadd_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	6b662a4f0c	radv: Lower 64bit iadd_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	50f585254c	aco: Implement scalar iadd_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	989f2b1785	aco: Implement 64bit uadd_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	16be909936	nir: Add an option to lower 64bit iadd_sat. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	922916bf64	nir: Move lower_usub_sat64 to nir_lower_int64_options. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15421>	2022-03-28 20:02:52 +00:00
Georg Lehmann	01fad6aa72	aco: Remove 0 data components from image stores. Image stores always write a full texel. The hardware writes zero for components not included in dmask. Totals from 387 (0.29% of 134913) affected shaders: (GFX10.3) VGPRs: 17216 -> 17136 (-0.46%) CodeSize: 1987652 -> 1981504 (-0.31%) MaxWaves: 9054 -> 9058 (+0.04%) Instrs: 361883 -> 361115 (-0.21%); split: -0.21%, +0.00% Latency: 5383187 -> 5381227 (-0.04%); split: -0.04%, +0.00% InvThroughput: 1373830 -> 1372097 (-0.13%) VClause: 9031 -> 9038 (+0.08%); split: -0.04%, +0.12% Copies: 25923 -> 25153 (-2.97%); split: -2.98%, +0.01% PreVGPRs: 14643 -> 14555 (-0.60%) Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14986>	2022-03-28 19:43:18 +00:00
Alejandro Piñeiro	81039feda4	broadcom: update language on V3D_DEBUG options Some typos, and bad grammar. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15593>	2022-03-28 19:21:48 +00:00
Alejandro Piñeiro	2f33c52daf	docs: document v3d/v3dv envvars As we are here we also update VC4_DEBUG option, in order to rely on VC4_DEBUG=help Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15593>	2022-03-28 19:21:48 +00:00
Emma Anholt	b8324a7387	r600: Implement memoryBarrier() in the non-SFN path. Previously we were just doing a group barrier for both membar and barrier. This sometimes worked out, because atomics and reads waited for ack already, but writes were not waiting for ack. Use the need_wait_ack pattern that scratch writes used, with a little refactoring for reusability. The refactor also incidentally fixes the atomics waiting for outstanding acks to be > 1 instead of > 0. Cc: mesa-stable Fixes: #6028 Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	991a95a352	r600: Disable SB when INTERP_SAMPLE is used. Avoids an assertion failure in the SB scheduler. Cc: mesa-stable Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	f315ea9eff	r600: Disable SB in the presence of indirection on temp arrays. Prevents several regressions when NIR-to-TGSI is enabled where it was allocating arrays on top of each other. Fixes vec3 fails on RV770, dEQP-GLES3.functional.shaders.metamorphic.bubblesort_flag.variant_1 and 2 in general, and fixes another piglit but breaks two others. Still, this seems to be a win. Cc: mesa-stable Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	955883cf0a	r600: Add a workaround and explanation for shadowcubearray TG4. With the NIR-to-TGSI transition, we had fewer other immediates and would end up dereffing past the end of the literals array. Cc: mesa-stable Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	ef151d24ab	r600: Fix ordering of SSBO loads versus texturing. The two types of instructions get added to the same CF list, but not the same instr list within the CF list. So, if you SSBO fetched your texcoord, the emission of the SSBO fetch would come after the texcoord fetch. Avoids regressions when NIR-to-TGSI starts optimizing more. Cc: mesa-stable Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	6d6043e628	r600: Drop unused debug options from the fork off of radeonsi. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	357c9d0603	r600: Drop unused sbcl debug option. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	626ab112f6	r600: Add shader-compiler debug knobs to the shader cache key. Otherwise, you'll get cached results from the previous debug knob state. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Emma Anholt	9acb622c44	ci/r600: Check in some expectation files for rv770 and Turks. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14429>	2022-03-28 10:54:46 -07:00
Pavel Ondračka	f82a0d10cd	r300: respect output_semantic_index when writing colors Right now we don't explicitly check it and we expect that the output_semantic_index array is always ordered. Unfortunately, this is not the case since `74c02d99b2` Fixes corruption in Amnesia: the Dark Descent. cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6179 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15523>	2022-03-28 17:46:32 +00:00
Emma Anholt	28d6a5af25	r600: Add shader precompile and shader-db support. This should reduce draw-time jank, and was useful for me in evaluating NIR-to-TGSI's impact for r600. Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14427>	2022-03-28 17:39:02 +00:00
Emma Anholt	f07b8a0cac	r600: Update the PS state when MSAA-ness changes, too. Avoids a regression when enabling shader precompilation, where the precompile would happen with MSAA disabled (so no sample mask export) but we'd never catch up to the shader being rendered with MSAA. Doesn't fix any current testcases, though. Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14427>	2022-03-28 17:39:02 +00:00
Emma Anholt	e0429d9fef	r600: Update the PS state before checking for cb_misc update. The update_ps_state updates ps_shader->current->ps_color_export_mask, so we could miss statechanges. Cc: mesa-stable Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14427>	2022-03-28 17:39:02 +00:00
Emma Anholt	a9a5b83b0a	r600: Drop nr_ps_max_color_exports Prior to `b652180107` ("r600g: don't snoop context state while building shaders"), it fed into the shader key, but nothing does any more. Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14427>	2022-03-28 17:39:02 +00:00
Connor Abbott	0b0b9274b6	freedreno/ci: Fix skip comment This test was never supposed to be skipped, and the referenced commit just exposed a bug in turnip fixed by the previous commit. It was hanging due to a CTS bug making the submit take way too long, which will be fixed once the CTS change lands. Also, add it to the a630 skips. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15563>	2022-03-28 17:16:54 +00:00
Connor Abbott	9d081d7561	tu: Correctly handle VK_IMAGE_CREATE_EXTENDED_USAGE_BIT In this case we should relax checks based on the format, since the user will be responsible for them when creating an image view. This gets dEQP-VK.image.sample_texture._bit_compressed_format_ not skipping again after VK-GL-CTS 736eec57dc0c ("Fix checkSupport in compressed texture sampling tests"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15563>	2022-03-28 17:16:54 +00:00
Samuel Pitoiset	1b2ccea63f	radv: advertise VK_EXT_depth_clip_control Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6070 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15192>	2022-03-28 16:50:12 +00:00
Samuel Pitoiset	43e83949dc	radv: implement VK_EXT_depth_clip_control Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15192>	2022-03-28 16:50:12 +00:00
Danylo Piliaiev	37939e9c54	turnip: Fix the lack of WFM before indirect draws We have to add WFM to pending bits when we are flushing into CP for indirect draw to know when they should apply WFM workaround. Fixes CTS tests: dEQP-VK.draw.renderpass.indirect_draw._data_from_compute.indirect_draw_count Fixes: `abf0ae014a` ("tu: Properly handle waiting on an earlier pipeline stage") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15577>	2022-03-28 16:09:07 +00:00
Gert Wollny	1994f1404e	virgl: re-enable PIPE_CAP_TGSI_TEXCOORD with new host versions Also upreaf the virglrenderer version used in the CI. v2: Update checksums of trace result images (0 pixels were different) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15446>	2022-03-28 15:21:58 +00:00
Samuel Pitoiset	4cfb5332d6	radv: lower adjusting gl_FragCoord.z for VRS in NIR fossils-db (Sienna Cichlid): Totals from 4432 (3.29% of 134913) affected shaders: VGPRs: 231232 -> 231880 (+0.28%) CodeSize: 24738224 -> 24718008 (-0.08%); split: -0.08%, +0.00% MaxWaves: 93120 -> 93000 (-0.13%) Instrs: 4540970 -> 4541062 (+0.00%); split: -0.01%, +0.01% Latency: 49658353 -> 49641444 (-0.03%); split: -0.05%, +0.01% InvThroughput: 9604328 -> 9603041 (-0.01%); split: -0.02%, +0.01% VClause: 66497 -> 66498 (+0.00%) SClause: 209530 -> 209532 (+0.00%); split: -0.01%, +0.01% Copies: 276135 -> 276249 (+0.04%); split: -0.14%, +0.18% PreSGPRs: 189409 -> 189415 (+0.00%) PreVGPRs: 207368 -> 207458 (+0.04%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15450>	2022-03-28 16:17:37 +02:00
Samuel Pitoiset	a42b6a4d39	radv: lower load_sample_mask_in in NIR No fossils-db changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15450>	2022-03-28 16:17:34 +02:00
Mike Blumenkrantz	e23f881311	radv: fix CmdSetColorWriteEnableEXT(attachmentCount==MAX_RTS) cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15508>	2022-03-28 12:59:28 +00:00
Samuel Pitoiset	8c51874af4	radv,aco: lower color exports in NIR fossils-db (Sienna Cichlid): Totals from 27108 (20.09% of 134913) affected shaders: VGPRs: 1260608 -> 1261424 (+0.06%); split: -0.00%, +0.07% CodeSize: 112795868 -> 112785892 (-0.01%); split: -0.05%, +0.04% MaxWaves: 628608 -> 628448 (-0.03%); split: +0.00%, -0.03% Instrs: 20750003 -> 20749314 (-0.00%); split: -0.01%, +0.00% Latency: 288088081 -> 288015865 (-0.03%); split: -0.06%, +0.04% InvThroughput: 53944847 -> 53961693 (+0.03%); split: -0.01%, +0.04% VClause: 396463 -> 396467 (+0.00%); split: -0.02%, +0.02% SClause: 842088 -> 842150 (+0.01%); split: -0.03%, +0.04% Copies: 1244982 -> 1259026 (+1.13%); split: -0.01%, +1.14% PreSGPRs: 1251949 -> 1251909 (-0.00%) PreVGPRs: 1099647 -> 1100879 (+0.11%); split: -0.03%, +0.14% fossils-db (Polaris10): Totals from 23928 (17.60% of 135960) affected shaders: SGPRs: 1751792 -> 1751024 (-0.04%); split: -0.05%, +0.01% VGPRs: 1098964 -> 1098556 (-0.04%); split: -0.13%, +0.09% CodeSize: 99893472 -> 99837940 (-0.06%); split: -0.06%, +0.00% MaxWaves: 138322 -> 138306 (-0.01%); split: +0.03%, -0.04% Instrs: 19213995 -> 19211980 (-0.01%); split: -0.02%, +0.01% Latency: 273026926 -> 273109402 (+0.03%); split: -0.01%, +0.04% InvThroughput: 111160907 -> 111195187 (+0.03%); split: -0.04%, +0.07% VClause: 343058 -> 343097 (+0.01%); split: -0.02%, +0.03% SClause: 802756 -> 802884 (+0.02%); split: -0.04%, +0.06% Copies: 1729387 -> 1739208 (+0.57%); split: -0.04%, +0.61% PreSGPRs: 1090264 -> 1090303 (+0.00%); split: -0.00%, +0.01% PreVGPRs: 959490 -> 960600 (+0.12%); split: -0.04%, +0.15% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15263>	2022-03-28 11:47:35 +00:00
Jakob Bornecrantz	9e31991c6e	vulkan-device-select: Don't leak xcb_query_extension_reply_t Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15600>	2022-03-28 11:06:11 +00:00
Iago Toral Quiroga	ce849032a4	broadcom/compiler: allow ldunifa with indirect uniform loads We handle uniforms by copying them into the uniform stream to be consumed with ldunif when they have a constant offset. Otherwise we fallback to general TMU access, which has more latency. However, just like we did for UBOs and read-only SSBOs, we can also try to use the unifa mechanism to handle indirect accesses in certain cases instead of the TMU fallback. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15575>	2022-03-28 10:44:13 +00:00
Iago Toral Quiroga	ea3223e7a4	v3dv: implement VK_EXT_inline_uniform_block Inline uniform blocks store their contents in pool memory rather than a separate buffer, and are intended to provide a way in which some platforms may provide more efficient access to the uniform data, similar to push constants but with more flexible size constraints. We implement these in a similar way as push constants: for constant access we copy the data in the uniform stream (using the new QUNIFORM_UNIFORM_UBO_*) enums to identify the inline buffer from which we need to copy and for indirect access we fallback to regular UBO access. Because at NIR level there is no distinction between inline and regular UBOs and the compiler isn't aware of Vulkan descriptor sets, we use the UBO index on UBO load intrinsics to identify inline UBOs, just like we do for push constants. Particularly, we reserve indices 1..MAX_INLINE_UNIFORM_BUFFERS for this, however, unlike push constants, inline buffers are accessed through descriptor sets, and therefore we need to make sure they are located in the first slots of the UBO descriptor map. This means we store them in the first MAX_INLINE_UNIFORM_BUFFERS slots of the map, with regular UBOs always coming after these slots. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15575>	2022-03-28 10:44:13 +00:00
Georg Lehmann	37c0f68500	radv: Add more RT pipeline stubs. Entry points have to be provided even if the features are not supported. Helps Doom Eternal. Fixes: `f1095260a4` ("radv: Experimentally enable RT extensions.") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15573>	2022-03-28 10:25:42 +00:00
Georg Lehmann	b8c8e3d975	radv: Add a vkCmdBuildAccelerationStructuresIndirectKHR stub. Since this entry point is provided by VK_KHR_acceleration_structure, radv has to implement it even if it doesn't support the indirect build feature. Helps Doom Eternal. Fixes: `82de184c3a` ("radv: Enable VK_KHR_acceleration_structure with RADV_PERFTEST=rt.") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15573>	2022-03-28 10:25:42 +00:00
Rhys Perry	1ead285d92	aco: fix RA validation of 16-bit fma_mix operands Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15562>	2022-03-28 11:05:25 +01:00
Pierre-Eric Pelloux-Prayer	2bc933f7d5	glsl/nir/linker: fix shader_storage_blocks_write_access shader_storage_blocks_write_access was computed using the buffer indices in the program but ShaderStorageBlocksWriteAccess is used with the shader buffers. So if a VS had 3 SSBOs and a FS had 4, the mask for VS was 0x3 (correct) but the mask for the FS was 0x78 instead of 0x15. Fix this by substracting the index of the first shader buffer in the program's buffers. Fixes: `79127f8d5b` ("glsl: set ShaderStorageBlocksWriteAccess in the nir linker") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6184 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15552>	2022-03-28 11:06:31 +02:00
Pierre-Eric Pelloux-Prayer	61ee560bc5	glsl/nir/linker: update shader_storage_blocks_write_access for SPIR-V Most of the code inside the "!prog->data->spirv" blocks shouldn't be executed for SPIR-V except the part updating the writable mask. See https://gitlab.freedesktop.org/mesa/mesa/-/issues/6184 Cc: mesa-stable Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15552>	2022-03-28 10:37:45 +02:00
Daniel Schürmann	007cb02db9	aco: use branch definition as scratch register for SSA lowering Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15505>	2022-03-28 07:36:46 +00:00
Mike Blumenkrantz	ae710f3329	zink: use z24_in_z32f support and radv ci updates This uses the new transfer helper codepath in zink and fixes a bunch of fail on radv. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15516>	2022-03-26 01:22:16 +00:00
Dave Airlie	24a6693ece	u_transfer_helper: add a new option for handling z24 stored in z32 It might be possible to combine this with the other merge to avoid the overheads of making a temp copy. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15516>	2022-03-26 01:22:15 +00:00
Dave Airlie	90a6947632	u_transfer: refactor out code to check interleave/deinterleave path. The checks were reproduced making adding another one not so fun. rework the deinterleave path code to match the interleave path code. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15516>	2022-03-26 01:22:15 +00:00
Dave Airlie	783cab811d	util/format: add new z24/s8 packing helper to pack z32/s8. If zink runs on top of a vulkan impl with no 24-bit float support it needs support to pack into 24-bit for GL. To avoid having to make a temp copy, add a new helper to convert and pack. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15516>	2022-03-26 01:22:15 +00:00
Kenneth Graunke	823745dc27	intel/compiler: Use nir_opt_uniform_atomics() In general, an atomic intrinsic may perform separate atomics for every enabled SIMD channel, as each channel may operate on different memory. However, an extremely common case is for all channels to access the same memory location. In this case, we can simply perform a reduction/scan across the subgroup, and perform one atomic for the whole subgroup, rather than one per channel. For example, if an intrinsic says to take the minimum value of the existing memory and the value in each channel, we can do a thread-local minimum of all enabled channels, then do a single atomic to take the minimum of that and the existing memory. Our hardware doesn't optimize the case where multiple channels ask for atomics on the same memory location; it assumes the compiler will do so. nir_opt_uniform_atomics() uses divergence analysis to detect this case, adds the necessary subgroup operations, and moves the atomic inside a conditional that disables all but a single invocation. It even detects cases where the shader code already performs this kind of optimization, and avoids doing it a second time. This may not be the optimal solution for us. In the backend, we could detect this case and emit send(1) instructions with NoMask, rather than generating if...send(16)...endif, and a lot of unnecessary ALU ops. But it's simple to do, reuses the same path as ACO, and still provides most of the benefit by cutting up to 16x atomics down to a single atomic, which is more merciful to the memory bus. Improves performance of Shadow of the Tomb Raider by 5.5% on XeHP. Improves performance of a customer-internal benchmark on XeHP at 3840x2160 and low settings by approximately 30%. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15484>	2022-03-26 00:28:19 +00:00
Kenneth Graunke	49ef23f4a6	intel/compiler: Convert to LCSSA and use divergence analysis. We'll use this more shortly. For now, enable it to separately in case anything bisects to this. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15484>	2022-03-26 00:28:19 +00:00
Kenneth Graunke	b3942beecf	intel/compiler: Set divergence analysis options Although we don't use divergence analysis yet, we've had several work-in-progress series that make use of it. We may as well set our options so that those series can assume they're in place. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15484>	2022-03-26 00:28:19 +00:00
Kenneth Graunke	6fa66ac228	intel/compiler: Implement nir_intrinsic_last_invocation We haven't exposed this intrinsic as it doesn't directly correspond to anything in SPIR-V. However, it's used internally by some NIR passes, namely nir_opt_uniform_atomics(). We reuse most of the infrastructure in brw_find_live_channel, but with LZD/ADD instead of FBL. A new SHADER_OPCODE_FIND_LAST_LIVE_CHANNEL is like SHADER_OPCODE_FIND_LIVE_CHANNEL but from the other side. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15484>	2022-03-26 00:28:19 +00:00
Kenneth Graunke	af529b545a	nir: Teach nir_divergence_analysis about Intel-specific intrinsics - load_reloc_const is just an immediate constant load, it's convergent. - nir_intrinsic_load_global_const_block_intel should be convergent, it says the address must be uniform, and we uniformize the predicate - Lowered image intrinsics: image_deref_load_param_intel just reads information about an image, as long as the image variable is convergent it should be too. load_raw_intel...if the address we come up with is convergent, it ought to be as well. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15484>	2022-03-26 00:28:19 +00:00
Mike Blumenkrantz	3e9bd67f23	zink: add another radv flake literally no idea Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15589>	2022-03-26 00:06:47 +00:00
Caio Oliveira	c32d386ce2	intel/compiler: Inline TUE map computation into TUE Input lowering Refactor since the TUE compute function is simpler now and the comments make sense being near the lowering. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15022>	2022-03-25 23:29:19 +00:00
Caio Oliveira	c36ae42e4c	intel/compiler: Use nir_var_mem_task_payload Instead of reusing the in/out slot mechanism, use a separated NIR variable mode. This will make easier later to implement staging the output in shared memory (and storing all at the end to the URB). Note to get 64-bit type support we currently rely on the brw_nir_lower_mem_access_bit_sizes() pass. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15022>	2022-03-25 23:29:19 +00:00
Daniel Schürmann	2d1e6b756e	aco: remove 'high' parameter from can_use_opsel() No fossil-db changes. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15551>	2022-03-25 22:02:50 +00:00
Daniel Schürmann	b98a9dcc36	aco/optimizer: fix call to can_use_opsel() in apply_insert() The definition index is -1. Fixes: `54292e99c7` ('aco: optimize 32-bit extracts and inserts using SDWA ') Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15551>	2022-03-25 22:02:50 +00:00
Adam Jackson	8006179cfd	wsi/x11: xcb_wait_for_special_event failure is an error The only ways that function can return NULL are: - the xcb connection was closed - the window for the swapchain was destroyed - the special event listener was unregistered from another thread - malloc failure All of these are permanent errors, the swapchain is no longer in a usable state, so we should treat this as VK_ERROR_SURFACE_LOST_KHR. Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15558>	2022-03-25 19:31:13 +00:00
Alyssa Rosenzweig	f31208f778	pan/va: Lower BLEND to call blend shaders Do this as late as possible. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	cb76cc1f1d	pan/va: Add packing unit tests Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	18bf478f1e	pan/va: Add shader-db support Reports the common subset from Bifrost, as well as Mali offline compiler style normalized cycle counts. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	8bc268f2d5	pan/va: Implement the cycle model Will feed into shader-db reporting, and maybe other things eventually. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	8a258a685c	pan/va: Test instruction selection lowerings Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	1745c89312	pan/va: Lower branch offsets Logic is lifted from bi_layout.c, adapted to work on instructions (not clauses) and for Valhall's off-by-one semantic which is annoyingly different than Bifrost. (But the same as Midgard -- Bifrost was annoyingly different than Midgard!) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	9a9b20e652	pan/va: Add instruction selection lowering pass Valhall removes certain instructions from Bifrost, requiring a canonical lowering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b796d32564	pan/va: Add constant lowering pass Valhall has a lookup table for common constants. Add a pass to take advantage of it, lowering away immediate indices. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b8f912e547	pan/va: Validate FAU before packing These are pre-conditions required for packing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	fd1906afea	pan/va: Add FAU validation Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	676d9c9441	pan/va: Add unit tests for ADD_IMM optimizations Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	13d7ca1300	pan/va: Optimize add with imm to ADD_IMM Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	f45654af59	pan/va: Add packing routines Mostly manual since Valhall is regular. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	edf284215d	pan/va: Add helpers for swapping bitwise sources Annoyingly different from Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	619566dea1	pan/va: Generate header containing enums We already collect enums in the ISA description XML. Export them for use in the compiler backend, particularly the packing code. Usually we'd use Mako for templating. In this case, the script is so trivial a template engine didn't seem worth it. (The obvious version with Mako was about 10 lines longer than just prints and f-strings used here.) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	7ad98ae96e	pan/va: Build opcode info structures Filled out the new structures from XML. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	40ed485e32	pan/va: Permit encoding more flags Missed the first time around. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	76487c7eb4	pan/va: Unify flow control Group together dependency waits and flow control into a single enum. This simplifies the code, clarifies some detail, and ensures consistency moving forward. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	cf6d1a81f6	pan/va: Add Bifrost-style LD_VAR instructions For use in the legacy non-MALLOC_IDVS flow. Especially useful in blit shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	295b802f64	pan/va: Add LD_VAR_BUF instructions Like LD_VAR_BUF_IMM but indirect. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	e8590e0d04	pan/va: Add ST_TILE instruction Encoded like LD_TILE, required for some MSAA blend shaders. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	fa841273d4	pan/bi: Rename I->action to I->flow For consistency with the Valhall ISA. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	f5585700be	pan/bi: Model LD_VAR_BUF instructions These are indirect versions of LD_VAR_BUF_IMM, taking their index in bytes. Used for indirect varying loads (the NIR lowering is inefficient). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	97a13d6424	pan/bi: Augment ST_TILE with register format To model its Valhall incarnation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	c7f6b973b2	pan/bi: Check return addresses in blend shaders Required on Valhall, where jumping to 0x0 doesn't automatically terminate the program. Luckily the check is free there too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	1b7d7ebbab	pan/bi: Allow branch_offset on BLEND Required to model BLEND accurately on Valhall, where it encodes a special relative branch... Midgard style! Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	cfde0275e4	pan/bi: Model Valhall-style A(CMP)XCHG Handled consistently with computational atomics. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	90867e8204	pan/bi: Add ATOM_RETURN pseudo-instruction Allows modeling Valhall's atomics better. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	7983a0d0dc	pan/bi: Rename PATOM_C to ATOM This is basically what's native on Valhall. Use the Valhall naming for the pseudo-instruction on Bifrost for consistency. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	b70a7c97bb	pan/bi: Gate late DCE/CSE on "optimize" Otherwise we can end up with unlowered ATOM.i32 on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Alyssa Rosenzweig	3485b8dc78	pan/bi: Use consistent modifier lists in packing If there are modifiers only used by pseudo instructions, not the real instructions, bi_packer can get out-of-sync with bi_opcodes, causing hard-to-debug issues. Do the stupid-simple thing to ensure this doesn't happen. This may be a temporary issue, depending whether ISA.xml and the IR get split out for better Valhall support. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15223>	2022-03-25 19:00:13 +00:00
Emma Anholt	b1fa2068b8	nouveau/nir: Enable nir_opt_move/sink. NIR load_consts/inputs tend to happen together at the top of the program. In the TGSI backend the loads got emitted at use time, while the NIR backend was emitting the loads at load intrinsic time. By sinking the intrinsics, we can greatly reduce register pressure. nv92 NIR results: total local in shared programs: 2024 -> 2020 (-0.20%) local in affected programs: 4 -> 0 total gpr in shared programs: 790424 -> 735455 (-6.95%) gpr in affected programs: 215968 -> 160999 (-25.45%) total instructions in shared programs: 6058339 -> 6051208 (-0.12%) instructions in affected programs: 410795 -> 403664 (-1.74%) total bytes in shared programs: 41820104 -> 41660304 (-0.38%) bytes in affected programs: 7147296 -> 6987496 (-2.24%) Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15542>	2022-03-25 18:34:39 +00:00
Rajnesh Kanwal	df1c7ca0e5	pvr: Use vk_common_GetDeviceQueue API. Removes pvr_GetDeviceQueue implementation. As we are now using the common vk_queue structure as a base for pvr_queue, we can use vk_common_GetDeviceQueue implementation instead. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15574>	2022-03-25 18:07:08 +00:00
Boris Brezillon	a3bdad314e	dzn: Compile-test the driver Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14766>	2022-03-25 16:21:45 +00:00
Erik Faye-Lund	a012b21964	microsoft: Initial vulkan-on-12 driver This is Dozen, the Vulkan on DirectX 12 driver. Not to be confused with DirectEggs. This is an early prototype, and not meant to be upstreamed as-is. Co-Authored-by: Boris Brezillon <boris.brezillon@collabora.com> Co-Authored-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Co-Authored-by: Louis-Francis Ratté-Boulianne <lfrb@collabora.com> Co-Authored-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14766>	2022-03-25 16:21:45 +00:00
Enrico Galli	6635d011cb	microsoft/spirv_to_dxil: Add missing ralloc_free Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14766>	2022-03-25 16:21:45 +00:00
Boris Brezillon	7e0ab29cfd	vulkan/util: Make STACK_ARRAY() work for arrays of pointers And add an explicit cast on the pointer returned by malloc() to make C++ happy. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14766>	2022-03-25 16:21:45 +00:00
Boris Brezillon	bb1fb07ecd	vulkan/image: Make MSVC C++ compiler happy Fix 'error C4576: a parenthesized type followed by an initializer list is a non-standard explicit type conversion syntax' errors by declaring an actual variable and returning it in vk_image_view_subresource_range(). All those MSVC/c++ related-constraints are quite annoying to be honest, but it looks like the D3D12 headers have been updated to plain C recently, which will allow us to write the driver in C, and hopefully get all this sort of issues behind us. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14766>	2022-03-25 16:21:45 +00:00
Mike Blumenkrantz	0312ca0175	zink: add anv cts skips from waiver Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15578>	2022-03-25 10:42:52 -04:00
Mike Blumenkrantz	9cbe5ac82a	zink: update radv fails this should be the exact current baseline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15576>	2022-03-25 08:58:15 -04:00
Rajnesh Kanwal	eacf944e52	pvr: Implement vkCreateSampler and vkDestroySampler APIs. Implements vkCreateSampler and vkDestroySampler APIs. Also fixes maxSamplerLodBias value from 15.0f to 16.0f as it's the max supported by our hardware. Also changing maxSamplerAnisotropy from 16.0f to 1.0f to temporarily disable anisotropy as we are missing software support for it. Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15557>	2022-03-25 12:12:08 +00:00
Boris Brezillon	b000abbe7f	vulkan/util: Get rid of VK_OUTARRAY_MAKE() Get rid of VK_OUTARRAY_MAKE() so people don't get tempted to use it and produce code that doesn't compile with MSVC, which doesn't support typeof(). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Boris Brezillon	5f309da5e4	vulkan/wsi: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Boris Brezillon	52a2aa44f3	vulkan/device_select: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Boris Brezillon	13efbdf830	venus: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Boris Brezillon	face6f6ddc	panvk: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Boris Brezillon	49c8b93288	anv: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:03 +00:00
Boris Brezillon	81ba3ab5d9	pvr: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:02 +00:00
Boris Brezillon	799a9db24c	turnip: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:02 +00:00
Boris Brezillon	56a2ccf058	v3dv: Stop using VK_OUTARRAY_MAKE() We're trying to replace VK_OUTARRAY_MAKE() by VK_OUTARRAY_MAKE_TYPED() so people don't get tempted to use it and make things incompatible with MSVC (which doesn't support typeof()). Suggested-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15522>	2022-03-25 11:00:02 +00:00
Caio Oliveira	f82731d0d7	intel/fs: Fix IsHelperInvocation for the case no discard/demote are used Use emit_predicate_on_sample_mask() helper that does check where to get the correct mask depending on whether discard/demote was used or not. Fixes: `45f5db5a84` ("intel/fs: Implement "demote to helper invocation"") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15400>	2022-03-25 08:20:27 +00:00
Caio Oliveira	bb311c22df	intel/fs: Initialize the sample mask in flags register when using demote Without this change, a check for "is helper invocation" could read uninitialized values. Fixes: `45f5db5a84` ("intel/fs: Implement "demote to helper invocation"") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15400>	2022-03-25 08:20:27 +00:00
Christian Gmeiner	2648ccb341	nir: Use const for nir_shader_get_entrypoint(..) nir_shader_get_entrypoint(..) should not modify the passed nir_shader object. Enforce this by marking shader paramenter as const. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15362>	2022-03-25 07:25:37 +00:00
Mike Blumenkrantz	eaf3c72a83	mesa/st: add special casing for pointsize constant updating during validate the previous method of using affected_states to trigger constant updates was ineffectual in the scenario where a ubo pointsize was needed on the first time a non-precompiled shader was used after being the not-last vertex stage: * have vs+gs -> gs precompiles with pointsize lowering -> gs constants get updated * remove gs -> vs was precompiled without pointsize lowering -> vs constants broken now just do a quick check as in st_atom_shader.c and set the flag manually to ensure the update is done correctly every time cc: mesa-stable fixes #6207 fixes (radv): KHR-GL46.texture_cube_map_array.image_op_fragment_sh KHR-GL46.texture_cube_map_array.sampling KHR-GL46.texture_cube_map_array.texture_size_fragment_sh KHR-GL46.constant_expressions* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15570>	2022-03-25 04:03:23 +00:00
Mike Blumenkrantz	6fb0bf51a3	zink: update anv icl ci list praise be to dynamic state fixes! Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15569>	2022-03-25 03:31:42 +00:00
Mike Blumenkrantz	6980eb4be2	zink: flush clears before toggling color write ensure these sync up onto the expected buffers Fixes: `3892c13381` ("zink: add an alternate path for EXT_color_write_enable usage") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15568>	2022-03-25 03:09:26 +00:00
Mike Blumenkrantz	4c6931fca9	zink: fix up color_write_enable workaround this needs to only swizzle to dummy surfaces if it's the workaround, not just if color_write_enable is active Fixes: `3892c13381` ("zink: add an alternate path for EXT_color_write_enable usage") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15568>	2022-03-25 03:09:26 +00:00
Rob Clark	c0f52f08a1	freedreno/ci: Update a306 expectations These have started to flakey UnexpectedPass somewhere along the way. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	03a70f554a	pipe-loader: Try loading freedreno for virtgpu device Freedreno will check if the virtgpu supports the pass-thru context, and if not will bail, falling back to virgl. TODO this requires that virgl is also enabled in the mesa build, even if it is not needed.. maybe there is a better way to handle this? Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	802f4da5ee	freedreno/drm: Add virtio backend Add a new backend to enable using native driver in a VM guest, via a new virtgpu context type which (indirectly) makes host kernel interface available in guest and handles the details of mapping buffers to guest, etc. Note that fence-fd's are currently a bit awkward, in that they get signaled by the guest kernel driver (drm/virtio) once virglrenderer in the host has processed the execbuf, not when host kernel has signaled the submit fence. For passing buffers to the host (virtio-wl) the egl context in virglrenderer is used to create a fence on the host side. But use of out-fence-fd's in guest could have slightly unexpected results. For this reason we limit all submitqueues to default priority (so they cannot be preepmted by host egl context). AFAICT virgl and venus have a similar problem, which will eventually be solveable once we have RESOURCE_CREATE_SYNC. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2200d674e4	freedreno/drm: Reorder device destroy Call backend specific cleanup fxn earlier. This is needed if the backend has things like bo's to delete, otherwise the handle_table will already be destroyed causing problems in bo_del() Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	ea339137b0	freedreno/drm: Extract out "softpin" submit/ringbuffer base class We are going to want basically the identical thing, other than flush_submit_list, for virtio backend. Now that we've moved various other dependencies into the base classes, extract out an abstract base class for submit/ringbuffer. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	72a427244f	freedreno/drm: Move ring_pool slab parent to base Prep to move most of sp submit/ringbuffer to something that can be re-used by virtio backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	877f9049c3	freedreno/drm: Move bo idx to base The virtio backend will want this too, and it will make it easier to share most of the submit/ringbuffer implementation with the virtio backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2ac9b23f78	freedreno/drm: Move submit_queue to base The virtio backend will want this too. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	88a10c6216	freedreno/drm: Avoid CPU_PREP ioctl if bo is idle With userspace fences, if we know definitely that the buffer is idle (which implies that it is not shared with other processes, etc), then skip the ioctl. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	9bcc983256	freedreno/drm: Add fd_bo_upload() There are some buffers that we mmap just to write to them a single time. Add the possibility of the drm backend to provide an alternate upload path to avoid these mmap's. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	115518ec35	freedreno/drm: Add FD_BO_SHARED hint With the virtio backend we will need to pass an extra flag when allocating buffers that will be shared cross-device (such as with virtio-wl for passing between host and guest) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	f846181fe5	freedreno/drm: Add FD_BO_NOMAP hint Add a hint for buffers that we won't need to mmap. With the virtio backend, virglrenderer needs to create a dmabuf fd for mapping into the host, which we want to avoid when possible. Low hanging fruit is to use this hint for anything tiled/ubwc. There are probably more bo's that can be flagged as such. TODO add fd_bo_upload() for memcpy to bo.. this would be useful for uploads, for example, shaders which we just write once and never touch again.. for virtio this could be implemented with a TRANSFER_TO_HOST ioctl. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	598405c91f	freedreno/drm: Rework bo creation path Decoupling handle and fd_bo creation simplifies things for "normal" drm drivers, avoiding duplication for the create vs import paths. But this is awkward for the virtio backend when wants to do multiple things in the same guest<->host round trip. So instead, split the paths in the interface backend and move the code sharing for the two different paths into the msm backend itself. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	9ea36968d3	freedreno/drm: Add fd_device_open() helper Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Rob Clark	2bc815878c	freedreno/drm: Split msm backend into subdir Let's keep things a bit better organized when we add a new backend. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14900>	2022-03-25 02:03:30 +00:00
Mike Blumenkrantz	5a58493baa	zink: update radv ci bunch of these I just removed by mistake Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15567>	2022-03-24 21:05:35 -04:00
Yiwei Zhang	e542981727	venus: update protocol to remove redundant decoders Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15537>	2022-03-25 00:33:56 +00:00
Renato Pereyra	737937f45e	venus: Increase the base sleep of vn_relax Based on profiling, these 10us sleeps are behaving closer to ~60us and causing higher-than-necessary CPU overhead. 120us seems like a good balance between latency management and overhead. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15565>	2022-03-25 00:16:39 +00:00
Jason Ekstrand	8068c68b1f	lavapipe: Delete render passes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15431>	2022-03-24 18:37:14 -05:00
Yonggang Luo	a315039917	c11: Fixes unused parameter warnings Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15553>	2022-03-24 21:56:48 +00:00
Yonggang Luo	0f1b3fc17f	util: Fixes unused parameter warnings The compiler warning: ``` ../src/mesa/util/u_thread.h: In function 'util_thread_get_time_nano': ../src/mesa/util/u_thread.h:239:34: warning: unused parameter 'thread' [-Wunused-parameter] 239 \| util_thread_get_time_nano(thrd_t thread) \| ~~~~~~~^~~~~~ ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15553>	2022-03-24 21:56:48 +00:00
Dave Airlie	81afd1e118	zink: update resource layout in copy_scanout This transitions the resource to TRANSFER_SRC_OPTIMAL, but never updates the res->layout field, so subsequent transitions are wrong and throw validation errors. UNASSIGNED-CoreValidation-DrawState-InvalidImageLayout(ERROR / SPEC): msgNum: 1303270965 - Validation Error: [ UNASSIGNED-CoreValidation-DrawState-InvalidImageLayout ] Object 0: handle = 0x6660f60, type = VK_OBJECT_TYPE_COMMAND_BUFFER; \| MessageID = 0x4dae5635 \| vkQueueSubmit(): pSubmits[0].pCommandBuffers[0] command buffer VkCommandBuffer 0x6660f60[] expects VkImage 0x2f000000002f[] (subresource: aspectMask 0x1 array layer 0, mip level 0) to be in layout VK_IMAGE_LAYOUT_COLOR_ATTACHMENT_OPTIMAL--instead, current layout is VK_IMAGE_LAYOUT_TRANSFER_SRC_OPTIMAL. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15545>	2022-03-24 21:42:43 +00:00
Jason Ekstrand	6646624fb9	lavapipe: Use vk_image_subresource_layer/level_count Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15535>	2022-03-24 21:21:10 +00:00
Jason Ekstrand	8c1bd1f9a2	lavapipe: Use vk_image_view Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15535>	2022-03-24 21:21:10 +00:00
Jason Ekstrand	e500faebc2	vulkan: Add a vk_image_view_subresource_range helper Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15535>	2022-03-24 21:21:10 +00:00
Boris Brezillon	5b12a498f5	vulkan/runtime: Add vk_cmd_queue.h to idep_vulkan_runtime_headers If we don't do that, meson might start compiling source files including vk_command_buffer.h which in turn includes vk_cmd_queue.h before this file is even generated, and we end up with errors like that https://gitlab.freedesktop.org/mesa/mesa/-/jobs/20157936#L1119. Fixes: `6bd8a3c7e4` ("vulkan/runtime: Add a vk_cmd_queue object to vk_command_buffer") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15546>	2022-03-24 19:43:02 +00:00
Igor Torrente	5d33068cd9	venus: add VK_EXT_extended_dynamic_state2 extension Implements all the necessary code in the device initialization and extension functions. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15428>	2022-03-24 17:35:43 +00:00
Matt Coster	c80fa97d18	pvr: ci: Initial freedesktop CI integration This patch adds the PowerVR driver to the following shared builds: * debian-arm64 * debian-clang * debian-release * debian-vulkan * fedora-release It also adds the associated "tools" to these builds: * debian-arm64-build-test * debian-release (already uses -Dtools=all) * fedora-release And removes them from these builds by expanding -Dtools=all: * debian-gallium Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15284>	2022-03-24 11:39:40 +00:00
Matt Coster	97c4ce44e9	pvr: Gate offline compiler build behind -Dtools=imagination This matches the behavior of e.g. panfrost's bifrost_compiler target. Signed-off-by: Matt Coster <matt.coster@imgtec.com> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15284>	2022-03-24 11:39:39 +00:00
Erik Faye-Lund	3b6ceea3e6	pvr: do not use fallthrough for unreachable code unreachable() doesn't lead to executing the code that follows it, neither in debug nor release builds. So falling through doesn't make any sense. This fixes a compile-error on clang. Let's move the default-block to the end to make it clearer that there's no intended fallthrough. Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15531>	2022-03-24 11:25:45 +00:00
Erik Faye-Lund	54c7a245ca	pvr: do not use fallthrough for unreachable code unreachable() doesn't lead to executing the code that follows it, neither in debug nor release builds. So falling through doesn't make any sense. This fixes a compile-error on clang. Let's move the default-block to the end to make it clearer that there's no intended fallthrough. Reviewed-by: Frank Binns <frank.binns@imgtec.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15531>	2022-03-24 11:25:45 +00:00
Lionel Landwerlin	8cdd5647c6	anv: don't store sample location sample count This information should match the current pipeline sample count. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Lionel Landwerlin	6f5f817c0f	anv: fix dynamic sample locations on Gen7/7.5 3DSTATE_MULTISAMPLE should be baked into the pipeline if not dynamic. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `27ee40f4c9` ("anv: Add support for sample locations") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Lionel Landwerlin	8ad78671b3	anv: use local dynamic pointer more Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Lionel Landwerlin	1d250b7b95	anv: fix color write enable interaction with color mask Color writes & color masks occupy the same fields in the BLEND_STATE structure. So we need to store color mask (which are not dynamic) on the pipeline to merge that information with color writes. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b15bfe92f7` ("anv: implement VK_EXT_color_write_enable") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6111 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Lionel Landwerlin	a4f502de32	anv: fix VK_DYNAMIC_STATE_COLOR_WRITE_ENABLE_EXT state First, there is a problem if you do the following vkCmdSetColorWriteEnableEXT(attachmentCount = 8) vkCmdBindPipeline(GFX, with attachmentCount = 4) vkCmdDraw() vkCmdBindPipeline(GFX, with attachmentCount = 8) vkCmdDraw() Because in the dynamic state emission code we rely on the first pipeline to figure the number of BLEND_STATE entries to prepare. This is wrong, we should fill all entries so that the dynamic state works regardless of the number of attachments in the pipeline. With regard to the dynamic values, we should retain enable/disable values that do not concern the current pipeline. Second, 3DSTATE_WM was not always reemitted when the pipeline changed. But since it is not emitted as part of the pipeline, this results in inconsistent state being programmed. Third, we end up disabling the fragment stage completely in some cases. And that is programming the pipeline inconsistently and triggering a hang on TGL. v2: Fix comment (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b15bfe92f7` ("anv: implement VK_EXT_color_write_enable") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Lionel Landwerlin	f348103fce	anv: fix dynamic state emission The problem is that we missed looking at pipeline changes. Pipelines hold bits of dynamic states and when it changes we might need to reemit a packet. v2: fix comment (Tapani) Add missing anv_cmd_buffer_needs_dynamic_state() use (Tapani) Cc: mesa-stable Fixes: `505d176a8e` ("anv: disable baked in pipeline bits from dynamic emission path") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Lionel Landwerlin	1cd7d6ce37	anv: allow baking of 3DSTATE_DEPTH_BOUNDS in pipeline batch If it's not dynamic. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15310>	2022-03-24 10:49:07 +00:00
Erik Faye-Lund	85c59cafd8	docs: add a minimal docs page for radv RADV is the last driver in Mesa that doesn't have a webpage somewhere that we can link to from mesa3d.org. So let's make a minimal docs article for it where we link to relevant information elsewhere. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15494>	2022-03-24 10:08:45 +00:00
Corentin Noël	2aa8e56e52	virgl: Update virgl_protocol and use the provided constants Allow for better readability of the code. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14473>	2022-03-24 10:50:21 +01:00
Boris Brezillon	04d812b2d0	Revert "ci: Disable windows-vs2019" This reverts commit `04b80489d5`. Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15518>	2022-03-24 09:11:13 +00:00
Boris Brezillon	aa0c543533	lavapipe: Don't use VK_OUTARRAY_MAKE()/vk_outarray_append() MSVC doesn't support typeof(). If we want to keep compiling radv on windows we need to use the typed variants of those macros. Fixes: `dc8fdab71e` ("lavapipe: Use VK_OUTARRAY for GetPhysicalDeviceQueueFamilyProperties[2]") Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15518>	2022-03-24 09:11:13 +00:00
Boris Brezillon	eb1d009a3e	radv: Don't use VK_OUTARRAY_MAKE()/vk_outarray_append() MSVC doesn't support typeof(). If we want to keep compiling radv on windows we need to use the typed variants of those macros. Fixes: `10d69d5f0b` ("radv: fix returning empty drmFormatModifierTilingFeatures") Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15518>	2022-03-24 09:11:13 +00:00
Boris Brezillon	2daae1fab4	amd: Fix ac_gpu_info.c compilation on windows Fixes: `75a783ea73` ("ac: Query the amdgpu MEC firmware version.") Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15518>	2022-03-24 09:11:13 +00:00
Boris Brezillon	de039537da	aco: Fix an MSVC warning 'warning C4804: '<<': unsafe use of type 'bool' in operation' Fixes: `9934c86761` ("aco: use v_fma_mix to combine mul/add/fma input conversions") Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15518>	2022-03-24 09:11:13 +00:00
Cristian Ciocaltea	f392960331	virgl/ci: Add support for dEQP GL vtest-ing Provide the 'virpipe-on-gl' job for running dEQP GL tests on virpipe/vtest. The rationale is to help reducing the regressions in Virglrenderer CI due to sharing the Mesa CI infrastructure. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Acked-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15490>	2022-03-24 08:39:06 +00:00
Tomeu Vizoso	9f43dac0ca	ci/freedreno: Increase console timeout for perf jobs Piglit is very sparse in its status output and downloads of big traces can take a while. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15527>	2022-03-24 05:33:54 +00:00
Tomeu Vizoso	22d45f99ff	ci/iris: Increase console timeout for perf jobs Piglit is very sparse in its status output and downloads of big traces can take a while. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15527>	2022-03-24 05:33:54 +00:00
Lionel Landwerlin	a55cc061fd	iris: don't synchronize BO for batch decoding We don't need to go to the kernel to synchronize the BO we want to decode with INTEL_DEBUG=bat, mostly because we'll decode what was written by the driver in the batch. This also works around an issue in the simulation environment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9ac81f1890` ("iris: decoder fixes") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15517>	2022-03-24 04:59:44 +00:00
Jason Ekstrand	729f95a110	i915: Use the sin/cos lowering in nir_opt_algebraic.py It's nearly identical. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15539>	2022-03-24 00:59:21 +00:00
Tomeu Vizoso	d0e99e566f	ci/freedreno: Update checksum for GolfWithYourFriends trace The MR below changed the rendering slightly and the checksum isn't valid any more: "ir3, turnip, freedreno: Shader preambles" https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15526>	2022-03-24 00:04:20 +00:00
Mike Blumenkrantz	1e45254b97	zink: lower txp for array textures this is also illegal according to spirv spec fixes #6177 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15510>	2022-03-23 23:18:48 +00:00
Mike Blumenkrantz	7143a5a147	zink: lower txp for cube and ms textures this is illegal according to spirv spec Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15510>	2022-03-23 23:18:48 +00:00
Mike Blumenkrantz	4e35ed8c67	nir/lower_tex: add txp lowering option for arrays this is illegal in vulkan, so zink needs to be able to lower these Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15510>	2022-03-23 23:18:47 +00:00
Yonggang Luo	821c141f9d	gallium: Remove unused macro PIPE_ARCH_SSSE3 After removal inline function `_mm_shuffle_epi8`, the macro `PIPE_ARCH_SSSE3` have no need anymore, remove it too. Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14077>	2022-03-23 21:44:04 +00:00
Yonggang Luo	55a5635fb2	llvmpipe: Revise u_sse.h to remove unused _mm_shuffle_epi8 inline function error log: ``` [2/43] Compiling C object src/gallium/drivers/llvmpipe/libllvmpipe.a.p/lp_setup_tri.c.obj FAILED: src/gallium/drivers/llvmpipe/libllvmpipe.a.p/lp_setup_tri.c.obj "cc" "-Isrc/gallium/drivers/llvmpipe/libllvmpipe.a.p" "-Isrc/gallium/drivers/llvmpipe" "-I../../src/gallium/drivers/llvmpipe" "-I../../src/gallium/include" "-Isrc/gallium/auxiliary" "-I../../src/gallium/auxiliary" "-Iinclude" "-I../../include" "-Isrc" "-I../../src" "-Isrc/compiler/nir" "-I../../src/compiler/nir" "-Isrc/util" "-I../../src/util" "-IC:/CI-Tools/msys64/clang64/include" "-fvisibility=hidden" "-fcolor-diagnostics" "-Wall" "-Winvalid-pch" "-std=c11" "-O0" "-g" "-DPACKAGE_VERSION=\"22.0.0-devel\"" "-DPACKAGE_BUGREPORT=\"https://gitlab.freedesktop.org/mesa/mesa/-/issues\"" "-DHAVE_WINDOWS_PLATFORM" "-DHAVE_SURFACELESS_PLATFORM" "-DUSE_ELF_TLS" "-DUSE_TLS_BEHIND_FUNCTIONS" "-DENABLE_ST_OMX_BELLAGIO=0" "-DENABLE_ST_OMX_TIZONIA=0" "-DEGL_NO_X11" "-DDEBUG" "-DHAVE___BUILTIN_BSWAP32" "-DHAVE___BUILTIN_BSWAP64" "-DHAVE___BUILTIN_CLZ" "-DHAVE___BUILTIN_CLZLL" "-DHAVE___BUILTIN_CTZ" "-DHAVE___BUILTIN_EXPECT" "-DHAVE___BUILTIN_FFS" "-DHAVE___BUILTIN_FFSLL" "-DHAVE___BUILTIN_POPCOUNT" "-DHAVE___BUILTIN_POPCOUNTLL" "-DHAVE___BUILTIN_UNREACHABLE" "-DHAVE___BUILTIN_TYPES_COMPATIBLE_P" "-DHAVE_FUNC_ATTRIBUTE_CONST" "-DHAVE_FUNC_ATTRIBUTE_FLATTEN" "-DHAVE_FUNC_ATTRIBUTE_MALLOC" "-DHAVE_FUNC_ATTRIBUTE_PURE" "-DHAVE_FUNC_ATTRIBUTE_UNUSED" "-DHAVE_FUNC_ATTRIBUTE_WARN_UNUSED_RESULT" "-DHAVE_FUNC_ATTRIBUTE_WEAK" "-DHAVE_FUNC_ATTRIBUTE_FORMAT" "-DHAVE_FUNC_ATTRIBUTE_PACKED" "-DHAVE_FUNC_ATTRIBUTE_RETURNS_NONNULL" "-DHAVE_FUNC_ATTRIBUTE_ALIAS" "-DHAVE_FUNC_ATTRIBUTE_NORETURN" "-DHAVE_FUNC_ATTRIBUTE_VISIBILITY" "-DHAVE_UINT128" "-D_WINDOWS" "-D_WIN32_WINNT=0x0A00" "-DWINVER=0x0A00" "-DPIPE_SUBSYSTEM_WINDOWS_USER" "-D_USE_MATH_DEFINES" "-DUSE_SSE41" "-DUSE_GCC_ATOMIC_BUILTINS" "-DHAS_SCHED_H" "-DHAVE_CET_H" "-DHAVE_STRTOF" "-DHAVE_STRTOK_R" "-DHAVE_QSORT_S" "-DHAVE_ZLIB" "-DHAVE_ZSTD" "-DHAVE_COMPRESSION" "-DLLVM_AVAILABLE" "-DMESA_LLVM_VERSION_STRING=\"13.0.0\"" "-DLLVM_IS_SHARED=1" "-DDRAW_LLVM_AVAILABLE" "-DMESA_EXECMEM" "-DVK_USE_PLATFORM_WIN32_KHR" "-Werror=implicit-function-declaration" "-Werror=missing-prototypes" "-Werror=return-type" "-Werror=empty-body" "-Werror=incompatible-pointer-types" "-Werror=int-conversion" "-Wimplicit-fallthrough" "-Wno-missing-field-initializers" "-fno-math-errno" "-fno-trapping-math" "-Qunused-arguments" "-fno-common" "-Wno-microsoft-enum-value" "-Werror=format" "-Wformat-security" "-Werror=thread-safety" "-ffunction-sections" "-fdata-sections" "-pthread" "-D_FILE_OFFSET_BITS=64" "-D__STDC_CONSTANT_MACROS" "-D__STDC_FORMAT_MACROS" "-D__STDC_LIMIT_MACROS" "-Werror=pointer-arith" "-Werror=gnu-empty-initializer" -MD -MQ src/gallium/drivers/llvmpipe/libllvmpipe.a.p/lp_setup_tri.c.obj -MF "src/gallium/drivers/llvmpipe/libllvmpipe.a.p/lp_setup_tri.c.obj.d" -o src/gallium/drivers/llvmpipe/libllvmpipe.a.p/lp_setup_tri.c.obj "-c" ../../src/gallium/drivers/llvmpipe/lp_setup_tri.c In file included from ../../src/gallium/drivers/llvmpipe/lp_setup_tri.c:37: In file included from ../../src/gallium/drivers/llvmpipe/lp_setup_context.h:38: In file included from ../../src/gallium/drivers/llvmpipe/lp_setup.h:31: In file included from ../../src/gallium/drivers/llvmpipe/lp_jit.h:40: In file included from ../../src/gallium/auxiliary/gallivm/lp_bld_limits.h:37: In file included from ../../src/util/u_cpu_detect.h:41: In file included from ../../src/util/u_thread.h:35: In file included from ../../include/c11/threads.h:64: In file included from ../../include/c11/threads_win32.h:58: In file included from C:/CI-Tools/msys64/clang64/x86_64-w64-mingw32/include/windows.h:69: In file included from C:/CI-Tools/msys64/clang64/x86_64-w64-mingw32/include/windef.h:9: In file included from C:/CI-Tools/msys64/clang64/x86_64-w64-mingw32/include/minwindef.h:163: In file included from C:/CI-Tools/msys64/clang64/x86_64-w64-mingw32/include/winnt.h:1555: In file included from C:/CI-Tools/msys64/clang64/lib/clang/13.0.0/include/x86intrin.h:15: In file included from C:/CI-Tools/msys64/clang64/lib/clang/13.0.0/include/immintrin.h:37: C:/CI-Tools/msys64/clang64/lib/clang/13.0.0/include/tmmintrin.h:582:1: error: redefinition of '_mm_shuffle_epi8' _mm_shuffle_epi8(__m128i __a, __m128i __b) ^ ../../src/gallium/auxiliary/util/u_sse.h:159:1: note: previous definition is here _mm_shuffle_epi8(__m128i a, __m128i mask) ^ 1 error generated. ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14077>	2022-03-23 21:44:04 +00:00
Georg Lehmann	81b2008af9	nir/legalize_16bit_sampler_srcs: Don't guess source type. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	b5fe1187ec	nir/fold_16bit_sampler_conversions: Fix src type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	88ec73e5e8	nir/fold_16bit_sampler_conversions: Fix dest type mismatches. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5996 Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Georg Lehmann	798e47be51	nir/fold_16bit_sampler_conversions: Don't fold dest upcasts. This is not a valid optimization. Fixes: `fb29cef8dd` ("nir: add many passes that lower and optimize 16-bit input/outputs and samplers") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14895>	2022-03-23 20:55:39 +00:00
Cristian Ciocaltea	18111c3787	Revert "ci: Convert generate-env.sh to a POSIX compliant script" This reverts commit `9904ea2c76` since it is not able to properly escape all special characters (i.e. [']). The POSIX conversion is not needed anymore, as we have made 'bash' available in LAVA rootfs. Reported-by: Daniel Stone <daniels@collabora.com> Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15524>	2022-03-23 20:31:06 +00:00
Cristian Ciocaltea	315842e27c	ci: Make bash available in LAVA rootfs Ensure 'bash' shell interpreter is available in LAVA rootfs since it is going to be a dependency requirement from several scripts, e.g. generate-env.sh. Suggested-by: Daniel Stone <daniels@collabora.com> Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15524>	2022-03-23 20:31:06 +00:00
Kai Wasserbäch	a5884df949	fix(clover): FTBFS: Added missing include for ConstantInt for LLVM 15 With LLVM 15 the include of llvm/IR/Constants.h is required for ConstantInt. This commit fixes an FTBFS. Cc: mesa-stable Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15232>	2022-03-23 20:08:13 +00:00
Erik Faye-Lund	36d38b4386	docs: fixup breakage in release-calendar Seems the branch was accidentally changed to no longer match the release. Let's fix that. Fixes: `9ba636cdc7` ("docs: update calendar and link releases notes for 21.3.8") Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15528>	2022-03-23 20:01:44 +00:00
Mark Janes	85e314db5d	Revert "intel/fs: handle interpolation modes for at_sample and at_offset too" This reverts commit `5afbb0e730`. Closes: #6198 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15534>	2022-03-23 11:03:47 -07:00
Erik Faye-Lund	80a2974826	pvr: zero-initialize variable Because of incomplete codepaths further down in this function, we end up trying to use this variable without initialization. Let's initialize it to zero, just to keep the compiler happy. This avoids a compile-time warning, which would get treated as an error on CI. Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15525>	2022-03-23 16:49:14 +00:00
Erik Faye-Lund	fd7203ce80	pvr: use a helper to translate stencil-ops This gives us a single place to handle the conversion, and fixes a compilation warning by adding an explicit cast. Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15525>	2022-03-23 16:49:14 +00:00
Erik Faye-Lund	cc171840ff	pvr: use a helper to translate compare-ops This gives us a single place to handle the conversion, and fixes a compilation warning by adding an explicit cast. Reviewed-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15525>	2022-03-23 16:49:14 +00:00
Erik Faye-Lund	f7e92c8869	microsoft/compiler: remove phi-value limit There's no guarantee that we don't have more than 128 PHI values either, so let's stop asuming so. We do this by changing dxil_phi_set_incoming to dxil_phi_add_incoming, which lets us add more incoming phi-values to the current one instead of setting a new set of them. This also lets us reduce these stack-arrays a bit, down to something much more reasonable. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15519>	2022-03-23 16:33:21 +00:00
Erik Faye-Lund	d752c4bc7c	microsoft/compiler: ralloc incoming phi-values Reserving 127 incoming values for every phi instruction is neither robust nor memory efficient. Let's ralloc this array instead when filling it. This way, we only pay for what we use here. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15519>	2022-03-23 16:33:21 +00:00
Alyssa Rosenzweig	d8b5d45dc1	pan/va: Add atomic instructions Equivalent to their Bifrost counterparts, with a much more sensible encoding. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15515>	2022-03-23 15:50:41 +00:00
Alyssa Rosenzweig	0ac9841809	pan/va: Allow omitting staging registers It's not usually valid, but sr_count == 0 is encodable and used for the non-RETURN variant of ATOM1. Allow dis/assembling this syntax. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15515>	2022-03-23 15:50:41 +00:00
Alyssa Rosenzweig	e6ca668d45	pan/va: Allow forcing staging flags to read-write Required for the correct encoding of atomics. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15515>	2022-03-23 15:50:41 +00:00
Daniel Schürmann	832d67e99d	nir: rename nir_src_is_dynamically_uniform to nir_src_is_always_uniform As this function doesn't check for any control-flow dependence, it only returns true for statically (or globally) uniform values. The same holds true for is_binding_dynamically_uniform() in nir_opt_gcm(). Rename to better reflect that property. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14994>	2022-03-23 14:02:08 +00:00
Erik Faye-Lund	a9eddf9edc	pvr: fixup typos when allocating object These objects should have been allocated with VK_SYSTEM_ALLOCATION_SCOPE_OBJECT as the last argument, not the specific object type. Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15521>	2022-03-23 14:17:34 +01:00
Erik Faye-Lund	abf995b1da	pvr: use zloadformat instead of zstoreformat Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15521>	2022-03-23 14:17:34 +01:00
Thomas H.P. Andersen	8138086889	pvr: fix overlapping comparison The comparison here will always be true. This changes it so height is only set if the type is not a 1D. Fixes a warning with clang Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Frank Binns <frank.binns@imgtec.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15514>	2022-03-23 13:04:38 +00:00
Lionel Landwerlin	df059c6781	intel/clc: deal with SPIRV-Tools linker new behavior We're now required to provide all modules to link at the same SPIRV version. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	21aa1d3de1	intel/clc: fixup shared memory offsets We're running the io lowering twice so need to reset some fields so the offset don't go over what is really needed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	de9c2312ea	intel/clc: compile fix Fixes: `c15bf88f01` ("intel: Add a little OpenCL C compiler binary") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	a7f264f33a	intel/clc: add option to printout kernel prog_data Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	451f907d16	intel/kernel: enable linkage cap Linkage should have happened before this in intel_clc. This just silence a parser warning. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	bb4ff3e6e2	intel/kernel: enable groups caps This is roughly the same as SpvCapabilityGroupNonUniform (subgroup_basic). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	218db59b25	intel/dev: default to B stepping on DG2 for offline compiler Most people won't have A0 stepping now. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Jason Ekstrand	4b2c78c08a	spirv: Implement the function portion of the Linkage capability Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Jason Ekstrand	80a076382d	nir: Allow nir_var_mem_global variables Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15486>	2022-03-23 10:24:31 +00:00
Lionel Landwerlin	dc8c77cc8f	anv: implement EXT_tooling_info This is required by 1.3. Fixes CTS with newer loader : dEQP-VK.api.tooling_info.validate_getter Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `df8ac77af8` ("anv: Advertise Vulkan 1.3") Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15491>	2022-03-23 09:51:57 +00:00
Lionel Landwerlin	3889dda1f1	vulkan: move EXT_tooling_info implementation to runtime Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15491>	2022-03-23 09:51:57 +00:00
Erik Faye-Lund	d5ed8d4126	gallium: rename image atomic inc-wrap cap This cap is no longer TGSI specific, so let's rename it to reflect reality. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15448>	2022-03-23 08:54:06 +00:00
Erik Faye-Lund	880d848b7d	gallium: rename image atomic float-add cap This cap is no longer TGSI specific, so let's rename it to reflect reality. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15448>	2022-03-23 08:54:06 +00:00
Erik Faye-Lund	ab26020017	gallium: rename window-space position cap This cap is no longer TGSI specific, so let's rename it to reflect reality. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15448>	2022-03-23 08:54:06 +00:00
Mike Blumenkrantz	36373e8e1e	draw: fix nonzero stream primitives generated queries the fastpath here can only be taken if there is exactly one stream active, as this will otherwise break nonzero stream primitives generated queries in truth, this num_vertex_streams thing should be a bitmask so that the case of num_streams=1,stream_id!=0 could also be fastpathed, but the complexity probably isn't worth it given the infrequency of use cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15506>	2022-03-23 04:08:14 +00:00
Mike Blumenkrantz	32f117f5f8	draw: fix gs vertex stream counting this can't be determined from pipe_shader_state::stream_output, as this only contains xfb info, which is not the same as the vertex stream info, and may break primitives generated queries cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15506>	2022-03-23 04:08:14 +00:00
Brian Paul	03d342e4b2	vulkan/wsi/x11: add null pointer check for the has_dri3_v1_2 test This fixes a crash (ver_reply is NULL) when DISPLAY points to a remote display. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6040 Signed-off-by: Brian Paul <brianp@vmware.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15469>	2022-03-23 03:19:40 +00:00
Kai Wasserbäch	948ad5ac23	fix(FTBFS): clover: work around removal of PointerType::getElementType() `PointerType::getElementType()` was deprected and is gone now [0]. The temporary workaround is using `Type::getPointerElementType()`, longterm this needs to use [1]. This commit fixes an FTBFS. [0] <`d593cf7945`> [1] <https://llvm.org/docs/OpaquePointers.html> Closes: #6042 Cc: mesa-stable Signed-off-by: Kai Wasserbäch <kai@dev.carbon-project.org> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15091>	2022-03-23 00:50:27 +00:00
Dave Airlie	3bbd404457	gallivm/sample: detect if rho is inf or nan and flush to zero. When using cubemaps and the u/v values are 0, then this point can be arrived at with rho = nan, and if rho is NaN, then lod calculations end up at the max lod, whereas the spec suggests they should end up at the most negative lod. This fixes dEQP-VK.glsl.texture_functions.query.texturequerylod.samplercube_float_zero_uv_width_fragment Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15335>	2022-03-23 10:21:12 +10:00
Chia-I Wu	ce84b1c30f	venus: update venus-protocol headers This requires vn_extension_get_spec_version to be updated as well. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15498>	2022-03-22 21:10:56 +00:00
Chia-I Wu	5d188274ee	venus: add vn_extension_get_spec_version It is a wraper for vn_info_extension_get Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15498>	2022-03-22 21:10:56 +00:00
wingdeans	0819869ca8	r600: Fix small leak in SfnLog stderr_streambuf used to not get destroyed with SfnLog Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15412>	2022-03-22 20:24:21 +00:00
Mike Blumenkrantz	23f38553c8	zink: add RADV to list of broken drivers for EXT_color_write_enable ref #6185 Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15509>	2022-03-22 20:08:35 +00:00
Alyssa Rosenzweig	d2fb6879a2	panfrost: Process scissor state earlier Otherwise, if batch->scissor_culls_everything is set for a single draw, every draw after it in the batch will be skipped because the new scissor/viewport state will never be processed. Process scissor state early in draw_vbo to fix this interaction. We do need to be careful: setting something on the batch can only happen when we've decided on a batch. If we have to select a fresh batch due to too many draws, that must happen first. This is pretty clear in the code but worth noting for the diff. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Reviewed-by: Icecream95 <ixn@disroot.org> Fixes: `79356b2e` ("panfrost: Skip rasterizer discard draws without side effects") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5839 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6136 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15365>	2022-03-22 19:44:40 +00:00
Iván Briano	5afbb0e730	intel/fs: handle interpolation modes for at_sample and at_offset too Cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15424>	2022-03-22 19:05:05 +00:00
Cristian Ciocaltea	c2d29e940d	virgl/ci: Add jobs for running trace tests on LAVA Provide new jobs virgl-lava-traces and virgl-lava-traces-performance to run piglit trace tests on Intel based LAVA runners. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	8726ea34d3	ci: Dynamically adjust LIBGL_ALWAYS_SOFTWARE for crosvm For an increased flexibility in operation, do not set 'CROSVM_LIBGL_ALWAYS_SOFTWARE=true' when not using llvmpipe Gallium driver. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	356c9c25c3	ci: Allow specifying any shell command via HWCI_TEST_SCRIPT Interpret the value of HWCI_TEST_SCRIPT environment variable as a shell command. This allows, for example, to provide additional environment variables: HWCI_TEST_SCRIPT="VAR1=VAL1 VAR2=VAL2 /path/to/script" Additionally, add the missing execute permission flags to gtest-runner.sh script. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	fb2ffc2f67	ci: Provide consistent results location in LAVA There is an out-of-sync approach regarding the location of the results folder: some scripts refer to it via $CI_PROJECT_DIR/results, while others just assume it is located in the current working directory. Usually $PWD points to $CI_PROJECT_DIR, but in some cases this is not the case, hence let's ensure the 'results' folder can always be found in the current working directory. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	3e01cea1bd	ci: Remove obsolete CROSVM_TEST_SCRIPT env var This was used in the past for passing the path to a script to be executed inside a crosvm instance. Currently, setting this variable has no effect, hence remove it from generate-env.sh. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	040fb521b3	ci: Add PIGLIT_REPLAY_LOOP_TIMES to generate-env.sh PIGLIT_REPLAY_LOOP_TIMES is currently used to override the default configuration when running virgl trace performance tests in LAVA. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	f4cea722a6	ci: Use script relative paths in crosvm-runner For an increased portability, do not rely on 'CI_PROJECT_DIR' to reference script relative resources and, instead, compute their paths based on the crosvm-runner.sh invocation file path as indicated by $0. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	fdc55725a0	ci: Make kernel image available in LAVA for KVM use cases In order to run a VM (e.g. crosvm) through HWCI_TEST_SCRIPT on a LAVA target, it's necessary to download a kernel image on the target device. When HWCI_KVM is set to 'true', we can safely assume HWCI_TEST_SCRIPT contains a command or the path to a script which expects the kernel image to be available under /lava-files/${KERNEL_IMAGE_NAME}. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	86ce26f1fc	ci: Load KVM kernel module for LAVA runners If 'HWCI_KVM' enviroment variable is set, load the KVM kernel module specific to the detected CPU virtualisation extensions: vmx for Intel VT and svm for AMD-V. As an additional optimization, handle HWCI_KERNEL_MODULES probing in the main shell process instead of creating an unnecessary subshell. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	03bceca92f	ci: Enable KVM_AMD and KVM_INTEL kernel modules Build and deploy KVM kernel modules in rootfs image to be used for running crossvm in LAVA environment. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	295243751f	ci: Add crosvm runtime dependencies for LAVA Provide the required packages in the rootfs image in order to allow running crosvm inside LAVA environment. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	c30c7ba834	ci: Build crosvm for LAVA runners This is the first step to add support for running crosvm inside LAVA. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	e6252dec9e	ci: Set CI_JOB_JWT_FILE to a fixed path outside /tmp Having CI_JOB_JWT_FILE pointing to path on /tmp makes it difficult to be managed in VM contexts (e.g. crosvm) because the /tmp mountpoint usually refers to a local filesystem rather than the host one. Additionally, there is another restriction for 'piglit-traces-test' job to have the file available on the root filesystem. To avoid amending all the jobs that might be affected, let's just set the variable to a fixed path '/minio_jwt'. Note we also need to do this in the 'variables:' section instead of 'before_script:' in order to be able to reference it in CI job variables, e.g. PIGLIT_REPLAY_EXTRA_ARGS Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	9904ea2c76	ci: Convert generate-env.sh to a POSIX compliant script This shell script will be used in environments (e.g. LAVA) where bash is not available, hence let's make sure it is POSIX compliant in order to be able to execute on any modern shell interpreter. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Cristian Ciocaltea	e7ab2ba94e	ci: Avoid altering EXTRA_CARGO_ARGS environment variable Use a dedicated DEQP_RUNNER_CARGO_ARGS variable instead of EXTRA_CARGO_ARGS in build-deqp-runner.sh to pass custom arguments when invoking 'cargo install'. This is to avoid modifications of EXTRA_CARGO_ARGS which might have negative side-effects in the scripts which rely on this variable and import build-deqp-runner.sh instead of executing it in a subshell. Fixes: `8729c6e981` ("ci: Support building and installing deqp-runner from source") Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15208>	2022-03-22 17:01:09 +00:00
Rhys Perry	543a5487cf	radv,aco: lower image descriptor loads in NIR fossil-db (Sienna Cichlid): Totals from 2926 (1.80% of 162293) affected shaders: Instrs: 2315110 -> 2306644 (-0.37%); split: -0.37%, +0.00% CodeSize: 12581592 -> 12546588 (-0.28%); split: -0.28%, +0.00% VGPRs: 130216 -> 130208 (-0.01%) SpillSGPRs: 477 -> 474 (-0.63%); split: -5.03%, +4.40% Latency: 29686188 -> 29678804 (-0.02%); split: -0.05%, +0.02% InvThroughput: 6926545 -> 6926286 (-0.00%); split: -0.02%, +0.02% SClause: 73761 -> 72996 (-1.04%); split: -1.16%, +0.12% Copies: 144068 -> 137279 (-4.71%); split: -4.78%, +0.07% Branches: 47466 -> 47483 (+0.04%); split: -0.01%, +0.04% PreSGPRs: 118042 -> 117377 (-0.56%); split: -1.34%, +0.77% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	15640e58d9	radv,aco: lower texture descriptor loads in NIR fossil-db (Sienna Cichlid): Totals from 39445 (24.30% of 162293) affected shaders: MaxWaves: 875988 -> 875972 (-0.00%) Instrs: 35372561 -> 35234909 (-0.39%); split: -0.41%, +0.03% CodeSize: 190237480 -> 189379240 (-0.45%); split: -0.47%, +0.02% VGPRs: 1889856 -> 1889928 (+0.00%); split: -0.00%, +0.01% SpillSGPRs: 10764 -> 10857 (+0.86%); split: -2.04%, +2.91% SpillVGPRs: 1891 -> 1907 (+0.85%); split: -0.32%, +1.16% Scratch: 260096 -> 261120 (+0.39%) Latency: 477701150 -> 477578466 (-0.03%); split: -0.06%, +0.03% InvThroughput: 87819847 -> 87830346 (+0.01%); split: -0.03%, +0.04% VClause: 673353 -> 673829 (+0.07%); split: -0.04%, +0.11% SClause: 1385396 -> 1366478 (-1.37%); split: -1.65%, +0.29% Copies: 2327965 -> 2229134 (-4.25%); split: -4.58%, +0.34% Branches: 906707 -> 906434 (-0.03%); split: -0.13%, +0.10% PreSGPRs: 1874153 -> 1862698 (-0.61%); split: -1.34%, +0.73% PreVGPRs: 1691382 -> 1691383 (+0.00%); split: -0.00%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	52f850238a	radv,aco: lower buffer descriptor loads in NIR fossil-db (Sienna Cichlid): Totals from 75420 (46.47% of 162293) affected shaders: MaxWaves: 1878200 -> 1879228 (+0.05%); split: +0.06%, -0.00% Instrs: 54021103 -> 54141370 (+0.22%); split: -0.04%, +0.26% CodeSize: 287813520 -> 288293352 (+0.17%); split: -0.04%, +0.21% VGPRs: 3267576 -> 3266296 (-0.04%); split: -0.04%, +0.00% SpillSGPRs: 10445 -> 10904 (+4.39%); split: -0.31%, +4.70% SpillVGPRs: 1818 -> 1811 (-0.39%); split: -1.05%, +0.66% Scratch: 955392 -> 954368 (-0.11%) Latency: 563477854 -> 562131282 (-0.24%); split: -0.31%, +0.08% InvThroughput: 111860104 -> 111553968 (-0.27%); split: -0.30%, +0.02% VClause: 958432 -> 961415 (+0.31%); split: -0.34%, +0.65% SClause: 1917415 -> 1926952 (+0.50%); split: -0.69%, +1.19% Copies: 3812945 -> 3916758 (+2.72%); split: -0.27%, +2.99% Branches: 1611235 -> 1612022 (+0.05%); split: -0.04%, +0.08% PreSGPRs: 3095505 -> 3126580 (+1.00%); split: -0.06%, +1.07% PreVGPRs: 2773011 -> 2773013 (+0.00%) Most regressions seem to be because ACO's convert_pointer_to_64_bit() can't be CSE'd with radv_nir_apply_pipeline_layout()'s convert_pointer_to_64_bit(). This should be improved by later commits. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	22050fcf9d	radv,aco: lower vulkan_resource_index in NIR fossil-db (Sienna Cichlid): Totals from 31338 (19.31% of 162293) affected shaders: MaxWaves: 758634 -> 758616 (-0.00%) Instrs: 26398289 -> 26378282 (-0.08%); split: -0.09%, +0.01% CodeSize: 141048208 -> 140971060 (-0.05%); split: -0.07%, +0.01% VGPRs: 1373656 -> 1373736 (+0.01%) SpillSGPRs: 9944 -> 9924 (-0.20%); split: -0.24%, +0.04% SpillVGPRs: 1892 -> 1898 (+0.32%); split: -0.95%, +1.27% Latency: 308570144 -> 308528462 (-0.01%); split: -0.03%, +0.02% InvThroughput: 57698072 -> 57684901 (-0.02%); split: -0.07%, +0.04% VClause: 440357 -> 440602 (+0.06%); split: -0.02%, +0.08% SClause: 974724 -> 973315 (-0.14%); split: -0.18%, +0.04% Copies: 1944925 -> 1945103 (+0.01%); split: -0.11%, +0.12% Branches: 799444 -> 799461 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 1619860 -> 1619233 (-0.04%); split: -0.05%, +0.02% PreVGPRs: 1252813 -> 1252863 (+0.00%); split: -0.00%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	3d52d3cd9b	ac/llvm: implement nir_tex_src_{texture,sampler}_handle nir_tex_src_{texture,sampler}_handle is either the actual descriptor as a vec4/vec8 or a pointer passed to load_sampler_desc. The sample index isn't adjusted using FMASK when these are used. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	53ccb2a9a7	ac/llvm: remove deref chasing for tg4 integer workaround Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	dc4df0ce8b	ac/llvm: implement nir_intrinsic_bindless_image_sparse_load Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	238915fdd1	ac/llvm: remove deref requirement for image fmask loads Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	e82aba88dc	nir: allow bindless image/texture/sampler handles to be vectors Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	e6672f6fd1	radv: move radv_declare_shader_args() out of shader_variant_compile() Declaring them earlier will allow us to access them in NIR. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	a56bb2d545	ac/llvm: implement implement load_{scalar,vector}_arg_amd and load_smem_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	53996cf979	aco: implement load_{scalar,vector}_arg_amd and load_smem_amd Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Rhys Perry	ff52a724d4	nir: add load_{scalar,vector}_arg_amd and load_smem_amd intrinsics load_smem_gcn is similar to load_global/load_global_constant, but it's guaranteed to use SMEM and it's much easier to utilize the format's 32-bit offset source. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12773>	2022-03-22 16:33:27 +00:00
Frank Binns	8991e64641	pvr: Add a Vulkan driver for Imagination Technologies PowerVR Rogue GPUs Co-authored-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Co-authored-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Co-authored-by: Simon Perretta <simon.perretta@imgtec.com> Co-authored-by: Alexander Wasey <Alexander.Wasey@imgtec.com> Signed-off-by: Frank Binns <frank.binns@imgtec.com> Signed-off-by: Rajnesh Kanwal <rajnesh.kanwal@imgtec.com> Signed-off-by: Karmjit Mahil <Karmjit.Mahil@imgtec.com> Signed-off-by: Simon Perretta <simon.perretta@imgtec.com> Signed-off-by: Alexander Wasey <Alexander.Wasey@imgtec.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15243>	2022-03-22 15:04:55 +00:00
Danylo Piliaiev	5d151ddfba	turnip: Disallow non-linear tiling when casting R8G8 to other fmts R8G8 have a different block width/height and height alignment from other formats that would normally be compatible (like R16), and so if we are trying to, for example, sample R16 as R8G8 we need to demote to linear. Follows the fix in Freedreno: `b97e3bb2e1` Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15465>	2022-03-22 13:47:21 +00:00
Danylo Piliaiev	a70b197741	turnip: Force linear mode for non-ubwc R8G8 formats Non-UBWC tiled R8G8 is probably buggy since media formats are always either linear or UBWC. There is no simple test to reproduce the bug. However it was observed in the wild leading to an unrecoverable hang on a650/a660. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5926 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15465>	2022-03-22 13:47:21 +00:00
shansheng.wang	1bd8ef0c89	frontends/va: fix coredump as creating surface with VAConfigAttrib As creating surface with VAConfigAttrib, checking if modifier from attrib list is null Signed-off-by: shanshengwang <shansheng.wang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15483>	2022-03-22 13:07:56 +00:00
Mike Blumenkrantz	f9390dac23	zink: use the current compute shader, not the base one this is the current variant and the one that should be used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15501>	2022-03-22 12:54:43 +00:00
Mike Blumenkrantz	d8d7ba8b5d	zink: create compute pipeline after updating shader variants it turns out shader variants don't work if you generate them after you determine which variant to use Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15501>	2022-03-22 12:54:43 +00:00
Ganesh Belgur Ramachandra	582e7f1599	radeonsi: NIR equivalent of si_create_clear_buffer_rmw_cs() Replaced the existing internal TGSI compute shader, which clears a read-modify-write buffer, with its NIR equivalent. The disassembly shader generated by the new NIR variant is identical to the previous implementation. These changes remove the additional conversion step from TGSI to NIR for the shader at runtime. Tested on a Navi 23 card. Reviewed-by: Mihai Preda <mhpreda@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15356>	2022-03-22 12:35:41 +00:00
Mihai Preda	ff2b2bc568	amd/ac_gpu_info: fix warning on fread unused result fixes this warning: ignoring return value of 'fread' declared with attribute 'warn_unused_result' [-Wunused-result] Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15502>	2022-03-22 11:50:37 +00:00
Kenneth Graunke	49dd707ca2	intel: Add INTEL_DEBUG=noccs alias for INTEL_DEBUG=norbc When CCS compression first came out on Skylake, we referred to it as "renderbuffer compression", or RBC for short. However, that name has long since fallen out of favor, and we refer to it as CCS nearly everywhere. This patch renames DEBUG_NO_RBC to DEBUG_NO_CCS inside the codebase for clarity, and adds INTEL_DEBUG=noccs. The legacy INTEL_DEBUG=norbc name continues to work, because it's one line of code and having both names makes our lives easier in the interim. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15447>	2022-03-22 06:23:10 +00:00
Kenneth Graunke	85d30846db	nir: Print divergence status of SSA values if analysis was ever run. After running divergence analysis, we include "div" or "con" for each SSA def's divergence/convergence status: vec1 32 div ssa_35 = fddy ssa_34 vec1 32 con ssa_36 = fddy ssa_6.x We omit this before the first time divergence analysis has been run. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15445>	2022-03-21 22:46:34 -07:00
Mike Blumenkrantz	09a37ad046	zink: ci updates these should be properly disabled now Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15500>	2022-03-21 23:13:21 -04:00
Mike Blumenkrantz	2139ef8951	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 23:00:56 -04:00
Mike Blumenkrantz	5b77a17472	zink: use the right query type for primitives generated this should've always been clipping invocations, but I got scared because then tests with rasterization_discard=1 fail and I didn't handle that instead Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 23:00:40 -04:00
Mike Blumenkrantz	9153fb7fbe	zink: use EXT_color_write_enable to mask out primgen+rasterizer_discard output by disabling color and depth write, the side effects of force-disabling discard can be mitigated fixes: KHR-GL46.tessellation_shader.single.isolines_tessellation KHR-GL46.tessellation_shader.tessellation_control_to_tessellation_evaluation.data_pass_through KHR-GL46.tessellation_shader.tessellation_invariance.invariance_rule3 KHR-GL46.tessellation_shader.tessellation_shader_point_mode.points_verification KHR-GL46.tessellation_shader.tessellation_shader_quads_tessellation.degenerate_case KHR-GL46.tessellation_shader.tessellation_shader_quads_tessellation.inner_tessellation_level_rounding KHR-GL46.tessellation_shader.tessellation_shader_tessellation.gl_InvocationID_PatchVerticesIn_PrimitiveID KHR-GL46.tessellation_shader.vertex.vertex_spacing Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 23:00:28 -04:00
Mike Blumenkrantz	3892c13381	zink: add an alternate path for EXT_color_write_enable usage for drivers where this is broken/missing, the same effect can be achieved by feeding the renderpass a framebuffer with null/dummy attachments Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 22:58:12 -04:00
Mike Blumenkrantz	447099629e	zink: use EXT_color_write_enable when possible this should only verify that drivers aren't completely broken by enabling it and have no other changes Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 22:58:12 -04:00
Mike Blumenkrantz	ecca2564c1	zink: disable color_write_enable on ANV this breaks the driver Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 22:58:12 -04:00
Mike Blumenkrantz	49a20e0981	zink: start a unified driver workarounds struct eventually anything that checks driverID should be moved here for ease of reference/updating Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 22:58:11 -04:00
Mike Blumenkrantz	0a80f94de0	zink: force disable rasterization discard if primgen query is active this query requires rasterization to pass the clipping invocations stage, which means discard is impossible Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 22:58:11 -04:00
Mike Blumenkrantz	795925e0ff	zink: hook up EXT_color_write_enable no functional changes Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15392>	2022-03-21 22:58:11 -04:00
Qiang Yu	9401990e6f	nir/linker: set varying from uniform as flat Flat varying can save some rasterization compute cost and register needed by shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15341>	2022-03-22 01:33:23 +00:00
Qiang Yu	aaf951c47f	lima: enable nir lower_varying_from_uniform Mali GPU pass varying by memory, so enable this optimization. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15341>	2022-03-22 01:33:23 +00:00
Qiang Yu	2617e6c028	nir/linker: disable varying from uniform lowering by default This fixes performance regression for Specviewperf/Energy on AMD GPU. Other GPUs passing varying by memory may choose to re-enable it as need. Fixes: `2604625043` ("nir/linker: support uniform when optimizing varying") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15341>	2022-03-22 01:33:23 +00:00
Alyssa Rosenzweig	b219e9a96e	asahi: Port driver to macOS 12.x ABI There's lots of reshuffling required. Nothing "interesting", though. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:30 +00:00
Alyssa Rosenzweig	1c6e77bd04	asahi: Don't clobber clear colours Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:30 +00:00
Alyssa Rosenzweig	b3be6d5263	asahi: Handle flushes of depth-only rendering Not totally right but this should be a start. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:30 +00:00
Alyssa Rosenzweig	a0197c16bc	asahi: Wire in u_transfer_helper We need it for emulating packed depth/stencil as separate depth/stencil resources, populating separate_stencil for us as required. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:30 +00:00
Alyssa Rosenzweig	3cd2833549	asahi: Generate IOGPU attachments dynamically Take a pipe_framebuffer_state and go from there. We need some care to handle separate stencil, but the logic is largely routine. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:30 +00:00
Alyssa Rosenzweig	5d58d5caaf	asahi: Add separate_stencil, internal_format fields Currently unused, but will be set when u_transfer_helper is integrated for emulating packed depth/stencil. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:30 +00:00
Alyssa Rosenzweig	2c23bb49ef	asahi: Add size field to slices Needed to size attachments for the kernel... for some reason. We already compute this; just save it. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:29 +00:00
Alyssa Rosenzweig	f5ae88d36f	asahi: Identify IOGPU_MISC data structure This will be elaborated upon soon. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:29 +00:00
Alyssa Rosenzweig	d5ee1eacf1	asahi: Add stencil buffer attachment type Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:29 +00:00
Alyssa Rosenzweig	50f9b4ceba	asahi: Identify IOGPU Internal Pipelines structure Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:29 +00:00
Alyssa Rosenzweig	eb9da583d7	asahi: Identify aux framebuffer data structure Total guess at the name. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:29 +00:00
Alyssa Rosenzweig	535f1c1166	asahi: Identify IOGPU Clear Z/S structure Not sure on the details yet but identify and dump the data structure to start. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15482>	2022-03-22 00:19:29 +00:00
Yonggang Luo	d9c3601e29	util: trim trailing space for files src/util/*/ Using the following bash script doing that ``` cd src/util find . -type f -print0 \| xargs -0 -n1 sed -i 's/[ \t]*$//' ``` Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15093>	2022-03-21 17:57:15 +00:00
Nanley Chery	da82358a52	ci/anv: Changes from enabling 8/16bpp CCS more - Fixes in dEQP-VK.dynamic_rendering.suballocation.multisample_resolve.* - Fails in dEQP-VK.drm_format_modifiers.export_import.* Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15420>	2022-03-21 17:36:10 +00:00
Nanley Chery	e2f0c859c2	Revert "anv: Disable CCS_E for some 8/16bpp copies on TGL+" This reverts commit `d68b2db89c`. With this change, no regressions have been observed with the dEQP-VK.synchronization* test group. There are regressions with dEQP-VK.drm_format_modifiers.export_import.*, but those have been root-caused to be test issues (see 3575). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6125 Fixes: `57445adc89` ("anv: Re-enable CCS_E on TGL+") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15420>	2022-03-21 17:36:10 +00:00
Dylan Baker	fe65c5671b	gallium/opencl: set OCL_ICD_FILENAMES with devenv So that `meson devenv` also sets up OpenCL. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15471>	2022-03-21 16:58:14 +00:00
Pavel Ondračka	19db6b760a	r300: set PVS_LAST_VTX_SRC_INST properly to last input read From docs: The PVS Instruction which uses the Input Vertex Memory for the last time. This value is used to free up the Input Vertex Slots ASAP. This field must be set to a valid instruction. Right now it is set to the last instruction. When the last read is inside a loop, set it on the outhermost ENDLOOP. This could in theory help performance, but none of my usual benchmarks including GLmark, Unigine Sanctuary or Lightsmark show any measurable performance difference. Suggested in: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6045 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15252>	2022-03-21 16:37:11 +00:00
Karol Herbst	43c3f4386b	nir: fix nir_sweep for printf I hit a memory corruption trying to implement printf for Rusticl Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15474>	2022-03-21 13:23:04 +00:00
Lionel Landwerlin	44555e7f2c	ci: enable intel-clc on some platforms We'll have to figure out the cross compiling strategy, in particular for Android. But as it stands we can't have the target & host llvm packages installed at the same time so we can't really compile it. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	beadd0cb24	ci: enable llvm on debian-release build Needed for intel-clc. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	57dd9c66bd	ci: add clang/spirv-tools/llvm-spirv packages to fedora container Needed for intel-clc. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	9ca29c687b	intel/clc: disable tool prior to Gfx12.5 platforms This tool is currently only aimed at Gfx version 12.5+ with COMPUTE_WALKER. We could make it work on earlier platforms but they require pushing gl_SubgroupInvocation and the CLC code is missing the back-end compiler set-up bits for that. v2: Commit description by Jason Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	c735c4ca85	intel/clc: specify supported extensions Having everything ever known to man is confusing our SPIRV parser. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	a29b1d5716	intel/clc: allow producing SPIRV files Useful to debug the parser. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	77e929a527	intel/clc: allow multiple CL files to be compiled together v2: use util_dynarray_append() (Jason) identation fixes (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	c15bf88f01	intel: Add a little OpenCL C compiler binary v2: Fix up indentation (Marcin) s/gen/gfx/ (Marcin) Deal with fd closing in success/fail cases (Marin) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	b1e7ce84cc	meson: try to find clang-cpp before going through each module Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	ec6e247a40	intel/fs: handle inline data on OpenCL style kernels This is for Gfx12.5 with the COMPUTE_WALKER::Inline Data payload. We do this in a similar way to the compute kernels. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	4d8e788663	intel/kernel: Implement some Intel built-in functions v2: Document mangled function names (Marcin) Fixup progress & metadata (Marcin) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	346a7f14fb	intel/compiler: Add code for compiling CL-style SPIR-V kernels v2: simplify INTEL_DEBUG expressions (Marcin) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	8c11912582	intel/debug: Dump KERNEL source when INTEL_DEBUG=cs Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	d1bddfba6b	intel/nir: Add optimizations to help OpenCL-style kernels Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Lionel Landwerlin	4ec5da7270	intel/nir/fs: replace COMPUTE \|\| KERNEL by gl_shader_stage_is_compute() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Jason Ekstrand	e9ff6f4f06	nir/print: Add support for generic pointers The way they're handled is that deref->modes is treated as a bitfield of possible modes. Variables are required to have a specific mode and derefs with deref_type_var are as well. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13171>	2022-03-21 11:26:44 +00:00
Yonggang Luo	24bc6c51e1	glx/egl: improve dri null screen related error messages. Convert from `failed to create dri screen` to more exact error message for easier debugging Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15480>	2022-03-21 09:31:31 +00:00
Pierre-Eric Pelloux-Prayer	5b14a0e390	crocus: replace opencoded slab_zalloc Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	b83bbd6f7f	d3d12: replace opencoded slab_zalloc Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	42fe3c5815	etnaviv: replace opencoded slab_zalloc Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	f8cbbaff5f	freedreno: replace opencoded slab_zalloc Reviewed-by: Rob Clark <robdclark@chromium.org> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	06b72f1e1a	lima: replace opencoded slab_zalloc Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	38aad273aa	iris: replace opencoded slab_zalloc Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	4e29299e2b	v3d: replace opencoded slab_zalloc Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	af7117df71	vc4: replace opencoded slab_zalloc Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	ff1744ee27	virgl: replace opencoded slab_zalloc Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	6fb52ba30e	zink: replace opencoded slab_zalloc Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	3dc6236b3c	r600: replace opencoded slab_zalloc Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:58 +01:00
Pierre-Eric Pelloux-Prayer	4b559f791c	radeonsi: replace opencoded slab_zalloc Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15277>	2022-03-21 09:47:57 +01:00
Mike Blumenkrantz	bd06680b61	lavapipe: KHR_synchronization2 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15413>	2022-03-21 04:55:37 +00:00
Mike Blumenkrantz	6d62c671ab	lavapipe: add QueueSubmit2 implementation this converts the existing code to use VkSubmitInfo2 structs and adds a new QueueSubmit wrapper which generates these structs from VkSubmitInfo Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15413>	2022-03-21 04:55:37 +00:00
Mike Blumenkrantz	00a6676506	lavapipe: add sync2 cmdbuf method implementations nothing special here Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15413>	2022-03-21 04:55:37 +00:00
Mike Blumenkrantz	256e4d7949	lavapipe: fix typo in set_event execution Fixes: `eb7eccc76f` ("lavapipe: Use generated command queue code") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15413>	2022-03-21 04:55:37 +00:00
Mike Blumenkrantz	01d597a3a5	docs: update lavapipe features and relnotes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15463>	2022-03-21 04:39:56 +00:00
Mike Blumenkrantz	a364f8c532	lavapipe 1.3 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15463>	2022-03-21 04:39:56 +00:00
Mike Blumenkrantz	0b8ecb5406	lavapipe: add a GetPhysicalDeviceToolPropertiesEXT stub when no tools are loaded, this will otherwise crash Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15463>	2022-03-21 04:39:56 +00:00
Mike Blumenkrantz	741be76963	lavapipe: EXT_subgroup_size_control Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15354>	2022-03-21 04:09:48 +00:00
Mike Blumenkrantz	f150bff04f	llvmpipe: fix variable naming insanity in cs generator in the top part of this function, the x/y/z size variables are used to represent the loop iterator limits in the bottom part, they change to represent the loop iterator values my brain. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15354>	2022-03-21 04:09:48 +00:00
Mike Blumenkrantz	102f6ebc57	llvmpipe: fix subgroup id construction the coroutine idx is based on the number of x loops, but the subgroup id is based on the coroutine's total loops Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15354>	2022-03-21 04:09:48 +00:00
Mike Blumenkrantz	f33399aa9e	llvmpipe: fix gl_NumSubgroups this is (x * y * z) / subgroup_size, not num_x_loops Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15354>	2022-03-21 04:09:48 +00:00
Mike Blumenkrantz	7f1050f207	lavapipe: EXT_inline_uniform_block Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	1ba1ee9e7c	lavapipe: implement EXT_inline_uniform_block this is a lot of machinery to propagate the block sizes down from the descriptor layout to the pipeline layout to the rendering_state block data is appended to ubo0 immediately following push constant data (if it exists), which requires that a new buffer be created and filled any time either type of data changes shader handling is done by propagating the offset of each block relative to the start of its descriptor set, then accumulating the sizes of every uniform block in each preceding descriptor set into the offset, then adding on the push constant size, and finally adding that on to the existing load_ubo deref offset update-after-bind is no longer an issue since each instance of pc+block data is its own immutable buffer that can never be modified Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	249fe9673a	lavapipe: remove unused struct member this was used at some point I think? Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	5bbb39a652	lavapipe: use stream uploader for push constant upload now instead of having static per-stage buffer regions and letting llvmpipe do the upload, lavapipe creates a new pipe_resource and chucks it away with take_ownership=true to allow it to be destroyed once it's no longer in use this also alters ubo0 mechanics such that the buffer is now sized exactly to the size of the push constants in the pipeline and push constants are only updated when the appropriate shader stage is flagged Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	c264b1b6ab	lavapipe: save pipeline stages that push constants are active on Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	526e898dfc	lavapipe: add a stream uploader to rendering_state and queue objects not currently used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	de01a47b1c	lavapipe: zalloc pipeline layout structs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	08732eca5d	lavapipe: don't emit compute states during draw there's a separate function for this Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15457>	2022-03-21 03:58:14 +00:00
Mike Blumenkrantz	40607f5088	lavapipe: KHR_shader_terminate_invocation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15402>	2022-03-21 03:20:33 +00:00
Mike Blumenkrantz	68fd0668eb	lavapipe: extend demote->discard pass to handle terminate Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15402>	2022-03-21 03:20:33 +00:00
Mike Blumenkrantz	82d3e7515e	lavapipe: EXT_shader_demote_to_helper_invocation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15402>	2022-03-21 03:20:33 +00:00
Mike Blumenkrantz	3e28ce374e	lavapipe: run some shader passes for demote handling pipe internals already support discard, so as long as everything is rewritten to use discard, we don't need any further changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15402>	2022-03-21 03:20:33 +00:00
Mike Blumenkrantz	cdcfcb7916	nir/lower_is_helper_invocation: create load_helper_invocation instr with bitsize=1 the specification stipulates that this is a bool value, so don't load it as an int or else nir_validate explodes Fixes: `f17b41ab4f` ("nir: add lowering pass for helperInvocationEXT()") Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15402>	2022-03-21 03:20:33 +00:00
Mike Blumenkrantz	fe33835091	zink: handle conversion for vertices statistics query with LINE_LOOP draws the converted number of vertices is 2x that of the "real" number of vertices, so divide the results by 2 this works nicely since zink performs cpu readback for all queries, and shader aggregation should be simple in the future as well fixes: KHR-GL46.pipeline_statistics_query_tests_ARB.functional_primitives_vertices_submitted_and_clipping_input_output_primitives Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15401>	2022-03-21 02:55:37 +00:00
Mike Blumenkrantz	d25437d54f	zink: store vertices statistics query to context Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15401>	2022-03-21 02:55:37 +00:00
M Henning	c36f3f2db8	nouveau: Fix out-of-bounds access in AlgebraicOpt for cases where we calculate the absolute value of the result of a unary operation We can remove this problematic check since it's redundant with the one several lines down. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15476>	2022-03-20 05:35:06 +00:00
M Henning	d12e16fc3b	nouveau: Handle unaligned tlsBase during spills Without this, 128-bit or 64-bit register spills can generate unaligned loads and stores if tlsBase is unaligned. Fixes glsl-1.50/execution/variable-indexing/gs-input-array-vec3-index-rd with NV50_PROG_USE_NIR=1 on kepler Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13693>	2022-03-20 05:24:51 +00:00
Kenneth Graunke	4de13d53fe	iris: Fix MOCS for copy regions These were, unfortunately, backwards. The source is the texture. The destination is the render target. Fixes: `d8cb76211c` ("iris: Fix MOCS for buffer copies") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15473>	2022-03-19 01:56:35 -07:00
Andrey Konovalov	ed2f496ce4	ir3: set local_size for shaders of MESA_SHADER_KERNEL type ir3_compile_shader_nir() should set local_size[] and local_size_variable fields not only for compute shaders, but for the OpenCL kernels too. v2: use gl_shader_stage_is_compute() instead of explicit comparison with MESA_SHADER_[COMPUTE,KERNEL]. Signed-off-by: Andrey Konovalov <andrey.konovalov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14863>	2022-03-18 23:20:25 +00:00
Emma Anholt	44c52e94e9	ci/traces: Make sure we have no pre-existing traces-db before starting. bare-metal can reboot boards into an existing rootfs on intermittent device failure, but traces-db doesn't do any sanity-checking of the local downloads of traces and would proceed to just trying to replay them. Nuke any existing trace db so that it re-downloads every time, same as LAVA or docker container tests do. Fixes: #5585 Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15440>	2022-03-18 22:54:41 +00:00
Michel Zou	49e2d39c66	lavapipe: fix i686 mingw build Fixes: `987e8a5a` Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15464>	2022-03-18 22:42:51 +00:00
Mike Blumenkrantz	47209e80db	lavapipe: zalloc lvp_image_view structs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15462>	2022-03-18 22:27:25 +00:00
Mike Blumenkrantz	4a13b11f4a	lavapipe: break out resolves into separate functions Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15462>	2022-03-18 22:27:25 +00:00
Mike Blumenkrantz	441c553ef7	lavapipe: store number of immutable samplers to pipeline layout Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15462>	2022-03-18 22:27:25 +00:00
Jason Ekstrand	7030d14e0d	spirv: Properly mangle generic pointers Fixes: `a8e53a772f` ("spirv: Add generic pointer support") Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15470>	2022-03-18 21:52:05 +00:00
Eric Engestrom	9ba636cdc7	docs: update calendar and link releases notes for 21.3.8 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15466>	2022-03-18 21:07:14 +00:00
Eric Engestrom	daf6ee08d6	docs: add release notes for 21.3.8 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15466>	2022-03-18 21:07:14 +00:00
Emma Anholt	f831ba238f	ci/turnip: Increase the hangcheck timer to 2 seconds. We get a lot of useful coverage from running graphicsfuzz with spilling enabled, but it's also pretty slow and can cause intermittent hangcheck failures. I thought I'd categorized them when merging !14839 (device loss on reset), but it looks like not all of them and we're now more likely to have flakes take out the whole test run when a single flake makes the rest of the caselist a flake. This is a little unfortunate in that it means our test environment is not the same as a stock system you would want to run deqp on to submit conformance, but I think it's an improvement in the test maintenance work vs needing to fix things up later. We have some other tests besides turnip that can trigger hangchecks which we might also like this increase for (some disabled traces, for example). However, freedreno GL has a 5-second timeout waiting for idle when mapping, and a couple of 2-second timeouts in a row can result in spurious failures in other tests! Fixes: #6163 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15435>	2022-03-18 19:07:24 +00:00
Alyssa Rosenzweig	0cbe4dd4c4	pan/bi: Use bi_dontcare for ZS_EMIT This is more portable and avoids special casing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	1b934d5962	pan/bi: Emit arch-specific code for bi_dontcare We use bi_dontcare() to specify any encoding where we don't care about the value, with a preference for power-efficient encodings. On Bifrost, a (possibly nonexistant) FAU read is the best encoding. On Valhall, that encoding doesn't exist so just use a zero. That should be good enough in practice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	222d17fc67	pan/bi: Model Valhall action on bi_instr Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	38625af010	pan/bi: Add Valhall-specific zero builder When emitting code during or after register allocation, we need to be able to emit constants without running the constant->{LUT, move, uniform} pass running after. In particular, we need to access the constant 0 to implement spill code. Add a Valhall-specific zero for this purpose. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	666b714a37	pan/bi: Don't analyze helper reqs in !frag shaders Waste of time, and possibly invalid too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	a16163a9fd	pan/bi: Print Valhall-specific FAU indices We'll emit these shortly, prepare the printer. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:06 +00:00
Alyssa Rosenzweig	32ca920023	pan/bi: Use vertex/instance ID helpers Enables portability to Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	1e37113ede	pan/bi: Add helpers to get vertex/instance ID These are preloaded in different places across Bifrost and Valhall. Abstract that away so code using the builder isn't littered with "is Valhall?" checks. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	76a09b8cd3	pan/va: Fix ST_CVT definitions They are basicallly just STORE with an extra source and the memory access modifier in a different place. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	9b7a45e3dc	pan/va: Align error messages in disassembler tests Makes it easier to spot the difference, less eye scanning. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	3a4b864197	pan/va: Add missing .auto32 register format Clipped to .auto for consistency with Bifrost (and the existing IR). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	263c5ef194	pan/va: Add LEA_ATTR_IMM instruction Encoded like LEA_TEX_IMM. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	459c6ac23f	pan/va: Model LEA_TEX_IMM more accurately The unknown field is a descriptor type, which we model as an opcode2 since it's a fixed constant. This allows us to disambiguate LEA_TEX_IMM from LEA_ATTR_IMM. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	17caccd15d	pan/va: Correct definition of ZS_EMIT It's a message instruction, not an ALU one... duh. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	508335c927	panfrost: Add Tiler Job to v9 XML Legacy tiling job, semantics are the same as on Midgard. Useful for blits and transform feedback. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	e635dc9ea5	panfrost: Refactor XML to permit non-IDVS jobs Tiler jobs look similar, but don't have the Allocations fields. Refactor to make this possible to express. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	53f1fa9219	panfrost: Fix definition of DCD on v9 The position and varying shader environment descriptors are additional sections of the job, rather than part of the (fragment only) DCD. This distinction matters for non-IDVS jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	6d51c1b898	panfrost: Fix primitive restart with 32-bit indices There's an overflow here if index_size = 4. Caught when bringing up Valhall, not sure why this was never caught before. Yikes. Fixes: `7a6a5f3fe1` ("panfrost: Handle explicit primitive restart") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	02f519601a	panfrost: Correct ASTC decode mode XML The narrow/wide bit was backwards. Fixes: `bfba7533c7` ("panfrost: Add Valhall Plane Descriptor XML") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	fd87135852	pan/decode: Unify tiler job handling Instead of adding a third Valhall path, let's use GenXML to unify our paths. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Alyssa Rosenzweig	a9ca751a8f	pan/decode: Handle blend arrays on Valhall Required for MRT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15461>	2022-03-18 18:52:05 +00:00
Connor Abbott	fc381fa1e3	tu: Actually expose VK_EXT_texel_buffer_alignment Oops... Fixes: `3d04c435` ("tu: Trivially implement VK_EXT_texel_buffer_alignment") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15451>	2022-03-18 18:30:20 +00:00
Omar Akkila	f18429340e	lavapipe: Lift fence check into dedicated function Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15443>	2022-03-18 18:15:39 +00:00
Georg Lehmann	4f6c7a6025	radv: Don't hash ycbcr sampler base object. Stops gamescope from recompiling pipelines on every start. Cc: mesa-stable Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15454>	2022-03-18 17:56:15 +00:00
Jason Ekstrand	012bfde7f3	panvk: Hook up emulated secondary command buffers Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Boris Brezillon	18fced0226	panvk: Refcount the descriptor set and pipeline layouts Lifetime of descriptor sets and pipeline layouts are odd. Let's refcount them so we don't end up with use-after-free patterns. That means we can't use custom allocators for those objects. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Jason Ekstrand	df92f56d8d	vulkan/runtime: Add emulated secondary command buffer support Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Boris Brezillon	25542f12d7	vulkan/cmd_queue: Fix the allocation scope VK_SYSTEM_ALLOCATION_SCOPE_COMMAND is used for transient allocations that are not expected to live outside the vkXxx(). Use VK_SYSTEM_ALLOCATION_SCOPE_OBJECT for cmd_entry allocations. v2 (Jason Ekstrand): - Also fix the manually typed entrypoints in vk_cmd_enqueue.c Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Boris Brezillon	1437ee749b	vulkan/cmd_queue: Track allocation errors in vk_cmd_queue Needed to report allocation failures when vkEndCommandBuffer() is called. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Jason Ekstrand	6cb95877b5	vulkan/cmd_queue: Auto-generate more vk_cmd_enqueue_unless_primary_Cmd* Instead of one MANUAL_COMMANDS, we now have two deny-lists: MANUAL_COMMANDS and NO_ENQUEUE_COMMANDS. The former is for things which have a manually typed implementation in vk_cmd_enqueue.c and the later is for things we want to ignore entirely. This lets us auto-generate vk_cmd_enqueue_unless_primary_Cmd* entrypoints for the manually typed vk_cmd_enqueue_Cmd* entrypoints. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Jason Ekstrand	3cffffc441	vulkan/cmd_queue: Generate enqueue_if_not_primary entrypoints These check the command buffer level and enqueue the command if it's not a primary but uses vk_device::command_dispatch_table for primaries. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Jason Ekstrand	8f29c833da	vulkan/cmd_queue: Add a vk_cmd_queue_execute() helper Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14406>	2022-03-18 17:29:16 +00:00
Mike Blumenkrantz	e0910f5ef8	Revert "features: fix some vk extension listings" This reverts commit `a3e9388953`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15460>	2022-03-18 17:24:23 +00:00
Jason Ekstrand	68fe847a26	lavapipe: Drop GetPhysicalDeviceQueueFamilyProperties Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:15 -05:00
Jason Ekstrand	dc8fdab71e	lavapipe: Use VK_OUTARRAY for GetPhysicalDeviceQueueFamilyProperties[2] This fixes bugs with lavapipe's hand-rolled pCount handling. The driver is supposed to set pCount to the number of queues actually written in the case where it's initialized to a value that's too large. It's also supposed to handle pCount being too small. Fixes: `b38879f8c5` ("vallium: initial import of the vulkan frontend") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:15 -05:00
Jason Ekstrand	91cb714dc1	panvk: Drop GetPhysicalDeviceQueueFamilyProperties Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:15 -05:00
Jason Ekstrand	19f56e3fc4	v3dv: Drop GetPhysicalDeviceQueueFamilyProperties Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:14 -05:00
Jason Ekstrand	2a779f98dc	turnip: Drop tu_legacy.c The remaining three helpers all have helpers in the common code. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 11:19:08 -05:00
Jason Ekstrand	205bf5d9cb	radv: Drop GetPhysicalDeviceQueueFamilyProperties Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 10:22:59 -05:00
Jason Ekstrand	8d7cbe026e	anv: Drop GetPhysicalDeviceQueueFamilyProperties Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 10:10:37 -05:00
Jason Ekstrand	25664c6194	vulkan: Add a 2 wrapper for vkGetPhysicalDeviceQueueFamilyProperties Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15459>	2022-03-18 10:10:33 -05:00
Mike Blumenkrantz	a3e9388953	features: fix some vk extension listings memory model is 1.3 and descriptor indexing isn't required by anything Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15458>	2022-03-18 14:49:40 +00:00
Jason Ekstrand	cdaa3a899c	anv: Use layerCount for clears and transitions in BeginRendering The Vulkan spec was recently clerified to say that transitions only happen to the bound layers: "Automatic layout transitions apply to the entire image subresource attached to the framebuffer. If multiview is not enabled and the attachment is a view of a 1D or 2D image, the automatic layout transitions apply to the number of layers specified by VkFramebufferCreateInfo::layers. If multiview is enabled and the attachment is a view of a 1D or 2D image, the automatic layout transitions apply to the layers corresponding to views which are used by some subpass in the render pass, even if that subpass does not reference the given attachment." This is in the context of render passes but it applies to dynamic rendering because the implicit layout transition stuff is a Mesa pseudo- extension and inherits those rules. For clears, the Vulkan spec says: "renderArea is the render area that is affected by the render pass instance. The effects of attachment load, store and multisample resolve operations are restricted to the pixels whose x and y coordinates fall within the render area on all attachments. The render area extends to all layers of framebuffer." Again, this is in the context of render passes but the same principals apply to dynamic rendering where the layerCount and renderArea are specified as part of the vkCmdBeginRendering() call. Fixes: `3501a3f9ed` ("anv: Convert to 100% dynamic rendering") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15441>	2022-03-18 09:27:15 -05:00
Iago Toral Quiroga	4f284254e4	v3dv: support importing external semaphores This was waiting for multisync support in our kernel interface so we can wait on the actual imported payload of a semaphore rather than the last job we submitted. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	fa1b10f36d	v3dv: lock around noop job submits Any thread we create may end up creating/submitting at least a noop job, which is a shared object. Before multisync, this was an issue only for the creation of the job itself, but with multisync we can also modify parameters of the noop job every time it is used (for signaling and serialization configuration). This change adds a noop mutex that all threads (main, wait and master) take before submitting a noop job to ensure concurrent access is not an issue. Fixes flakyness observed with multisync with the following test: dEQP-VK.api.command_buffers.secondary_execute_twice Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	daa865fb2c	v3dv: fix semaphore wait from CPU job If a CPU job comes first in a command buffer with a semaphore wait operation we need to wait on the CPU for the semaphore to be signaled before we process the job. We have been doing this with a WaitForIdle operation, but that only works if the semaphore has been submitted for signaling from the same instance of the driver. If we have an imported payload from another instance in our semaphore however, waitForIdle may return too early since the submission to signal the semaphore may have been submitted by a different instance of the driver as well, and our wait for idle checks only know about this instance submissions. To fix this, we always submit a noop job from our instance that waits on the semaphores on the GPU and follow up with WaitForIdle to wait for that to complete. Fixes test failures and/or assert crashes in: dEQP-VK.synchronization.cross_instance.* (when enabling support for semaphore imports) Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	3b8ab8a9ce	v3dv: don't signal semaphores/fences from a wait thread When we have a wait thread we can't ensure that the last job in the last command buffer will be the one to signal semaphores because in this case there is no gurantee that jobs from command buffers in the batch will be submitted to the GPU in order, as those put in a wait thread will be submitted later when the event wait operation is completed. Instead, we need to wait for all outstanding wait threads to complete and only then we should signal any semaphores or fences. This also fixes a bug where the wait for events was the last job in the command buffer. In this case, once the event wait is completed we have no additional jobs to submit and thus would never try to signal semaphores or fences. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	03840bfcd1	v3dv: fix temporary imports of semaphores and fences with multisync This is preparatory work to expose support for importing semaphores, which was waiting on kernel multisync support. When we implemented user-space multisync support we didn't handle temporary fence/semaphore payload imports at all, so we fix that here. Also, we add a has_temp boolean flag to identify the case where we have a temporary payload in a fence/sempahore instead of just checking if temp_sync is not 0. This is necessary to support semaphore imports (for which we are not exposing support yet) because these need to drop the temporary payload when they are used as wait semaphores in a submit, but we can't destroy the underlying temp_sync at that point because it needs to survive at least until the submit is finished, so instead we use a flag to tell if we have an active temporary payload or not, and we simply destroy any temp_sync on a semaphore destroy or any new import on the same semaphore. We only strictly need this flag for semaphores because fences drop the temporary payload when they are reset, which happens in the CPU and can only be done if the GPU is not using the fence, but we add the same flag for the fence for consistency. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	5a11a2fb6c	v3dv: don't expose image load/store features for linear images Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	0590ce1362	v3dv: return early on image to buffer blit copies if image is linear This path uses a shader blit to implement the copy which is only supported for tiled images (except 1D). While blit_shader() already checks for this, this path does a lot of heavy lifting to prepare for the blit_shader call so we rather avoid that if possible when we know blit_shader won't be able to implement the blit. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Iago Toral Quiroga	397f4963ed	v3dv: TFU destination must be UIF We had some code that considered the possibility that the destination might be linear when configuring TFU jobs, but we never actually allow for this to happen since we avoid hitting these paths in that case, as the TFU always produces UIF results. Instead, add an assert when producing the TFU packet to ensure we are expecting a UIF result. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15342>	2022-03-18 13:17:58 +00:00
Mike Blumenkrantz	9e20d68785	zink: add some nice docs for batch usage and tracking Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15429>	2022-03-18 12:42:31 +00:00
Mike Blumenkrantz	3472fed4da	zink: set vbo resource usage on bind this is how other descriptor binds work, and it makes repeated draws with the same vbo binds faster since this was a redundant operation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15429>	2022-03-18 12:42:31 +00:00
Mike Blumenkrantz	8294d45424	zink: only update usage on buffer rebind if rebinds occurred this is a harmless case, but it's still wrong cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15429>	2022-03-18 12:42:31 +00:00
Mike Blumenkrantz	7da211e24f	zink: force-add usage when adding last-ref tracking this fixes desync+crash when: 1. usage is added for bs A 2. tracking is added for bs B 3. tracking is removed for bs B 4. context is destroyed 5. usage A is now dangling and will crash if accessed as seen in glmark2 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15429>	2022-03-18 12:42:31 +00:00
Alejandro Piñeiro	e3d905ec39	v3dv/pipeline: use new helper vk_shader_module_to_nir In addition to use the helper, we also remove some of the lowering we had at preprocess_nir, as they are called now by the helper. As we are here we also move the call to nir_lower_sysvals_to_varyings, that for some reason we were calling it before preprocess_nir. It is worth to note that with this change we lose the ability to debug the NIR just after spirv_to_nir using V3D_DEBUG, as now this is done on vk_spirv_to_nir, and as mentioned that includes several lowerings now. The workaround to that is to use NIR_DEBUG. We also needed to change how to check the entrypoint on the broadcom compiler, checking just if it is an entrypoint, instead of assuming that the name will be "main". v2: tweak comment, squash v3dv and compiler change (Iago) Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15449>	2022-03-18 11:05:11 +00:00
Lionel Landwerlin	8b71118aa0	anv: flush tile cache with query copy command This fixes the test_resolve_non_issued_query_data vkd3d-proton test. This change is required on TGL+ (maybe ICL?) because on all platforms 3D pipeline writes are not coherent with CS. On previous platform we fixed this by flushing the render cache to make sure data is visble to CS before it writes to memory. But on more recently platforms, flushing the render cache leaves the data in the tile cache which is still not coherent with CS, so we need to flush that one too. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14552>	2022-03-18 10:02:33 +00:00
Lionel Landwerlin	4e30da7874	anv: emit timestamp & availability using the same part of CS We've run into issues before where PIPE_CONTROL races MI_STORE_* commands. So make sure we emit the availability using the same type of CS so that memory writes are properly ordered. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14552>	2022-03-18 10:02:33 +00:00
Juan A. Suarez Romero	730a294b90	v3dv: implement VK_EXT_line_rasterization Allow to choose the line rasterization algorithm. It supports rectangular and Bresenham-style line rasterization. v2 (Iago): - Update documentation. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15407>	2022-03-18 09:38:38 +00:00
Juan A. Suarez Romero	22759e9174	v3dv: add subpixel precision definition Move number of bits for subpixel precision in rasterizer to a define. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15407>	2022-03-18 09:38:38 +00:00
Juan A. Suarez Romero	b53dda6da8	broadcom: add line rasterization mode to packet definition Add the supported line rasterization modes as enums in the XML packet definition. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15407>	2022-03-18 09:38:38 +00:00
Juan A. Suarez Romero	102ae4bdc8	broadcom: add on-disk cache debug option Add support for`V3D_DEBUG=cache`, which prints on-disk cache events. v2: - Use same debug format for v3d and v3dv (Alejandro) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15380>	2022-03-18 08:58:01 +00:00
Juan A. Suarez Romero	4468db20f7	v3d: add support for on-disk shader cache It stores the compiled shaders on disk so further executions will load them from disk instead of compiling, improving the overall performance. This is noticeable in games where they suddenly get stuck for a while because they start to compile one or more shaders. v2: - Remove comment (Alejandro) - Use malloc() + helper to simplify code (Alejandro) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15380>	2022-03-18 08:58:01 +00:00
Jason Ekstrand	a929bafc77	panvk: Only implement Get*MemoryRequirements2 The runtime code will provide the 1.0 entrypoints for us. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	bc8b30ba55	panvk: Drop QueueBindSparse Now that we've switched to the common sync/submit framework, this is implemented in runtime/vk_queue.c. We don't need to provide the stub. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	2fc2ec17db	panvk: Drop BindImage/BufferMemory We already provide the 2 versions and the Vulkan runtime will map the 1.0 entrypoints to the 2 versions for us. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	f9b773a417	panvk: Implement VK_KHR_copy_commands2 This is just 2 versions of all the copy/blit entrypoings. The common Vulkan runtime code will implement the 1.0 versions in terms of the 2 versions. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	b573b22628	panvk: Implement VK_KHR_synchronization2 It's easier to switch to sync2 before CmdPipelineBarrier gets any more complicated. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	39c395d1d2	panvk: Move core properties into their respective core structs Currently, we only support a few features from 1.1. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	ff30dd11a7	panvk: Re-arrange GetPhysicalDeviceProperties2 Put the 1.0 properties and limits first. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	34139d9f51	panvk: Add a 1.3 features struct The only thing that gets pulled into this is private data. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Jason Ekstrand	dd03dba7fd	panvk: Re-arrange GetPhysicalDeviceFeatures2 Put the 1.0 features at top followed by 1.1 and then 1.2. For filling out the actual 1.1 and 1.2 structs, use the helpers. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15436>	2022-03-18 08:15:50 +00:00
Iago Toral Quiroga	5c1302f47c	v3dv: expose VK_EXT_image_drm_format_modifier This has been implemented for a while but we could not expose it on Vulkan 1.0 because the extension declares a dependency on VK_KHR_sampler_ycbcr_conversion, which we don't implement, and CTS would complain. On Vulkan 1.1 however, VK_KHR_sampler_ycbcr_conversion was promoted to core as an optional feature, and this is enough for the the dependency to be satisfied, even if the feature is not supported, meaning that we can now expose the extension. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15426>	2022-03-18 06:42:06 +00:00
Dave Airlie	cdd1b86591	lavapipe: add EXT_texel_buffer_alignment support. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15442>	2022-03-18 04:45:32 +00:00
Mike Blumenkrantz	efa724133f	zink: flag sample locations for re-set on batch flush this needs to be re-set any time the cmdbuf changes cc: mesa-stable Tested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15397>	2022-03-18 04:31:49 +00:00
Ian Romanick	19330eeb1d	intel/fs: Force destination types on DP4A instructions Most of the time, this doesn't matter. On the versions with _sat, if the destination type is incorrect, the clamping will not happen correctly. Fixes the following CTS tests: dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.all_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.limits_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_packed_uu_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_ss_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_su_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_us_v4i8_out32 dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.small_uu_v4i8_out32 v2: Update anv-tgl-fails.txt. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Fixes: `0f809dbf40` ("intel/compiler: Basic support for DP4A instruction") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15417>	2022-03-17 22:39:04 +00:00
Felix DeGrood	3bd9b25060	intel: change INTEL_MEASURE output to microseconds Change time event durations from ns -> us. Microseconds are easier to work with. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15348>	2022-03-17 22:14:42 +00:00
Felix DeGrood	2e6d14cc7b	intel: increase INTEL_MEASURE batch/buffer sizes Increase default batch_size and buffer_size from 16 -> 64. These are sized to be big enough to service most games. As games have become more demanding, larger sizes become necessary. Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15348>	2022-03-17 22:14:42 +00:00
Felix DeGrood	e0c9032db8	anv: add indirect draw to INTEL_MEASURE Reviewed-by: Mark Janes <markjanes@swizzler.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15348>	2022-03-17 22:14:42 +00:00
Dave Airlie	f452317849	clover/nir: respect lower to scalar options. This just calls the lower alu to scalar pass like mesa/st Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15433>	2022-03-17 22:00:49 +00:00
Dave Airlie	f34260bbf7	vulkan: update vk video headers for new vulkan headers. These got out of sync update to the latest video headers. Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15434>	2022-03-17 21:14:28 +00:00
Connor Abbott	3d04c43576	tu: Trivially implement VK_EXT_texel_buffer_alignment The previous alignment of 64 bytes, which we got from the blob, indicates that single-texel alignment isn't supported. So just do a trivial no-op implementation that returns the same alignment as before. This matches what newer blobs that expose this extension do. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15427>	2022-03-17 20:45:19 +00:00
Rhys Perry	177b54ebe9	aco/tests: add v_fma_mix tests Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	1092f37805	aco: use v_fma_mix to combine mul/add/fma output conversions fossil-db (Sienna Cichlid): Totals from 42 (0.03% of 134913) affected shaders: CodeSize: 596904 -> 596332 (-0.10%); split: -0.10%, +0.00% Instrs: 110194 -> 109902 (-0.26%) Latency: 1205239 -> 1204915 (-0.03%); split: -0.03%, +0.00% InvThroughput: 189697 -> 189375 (-0.17%) VClause: 1365 -> 1366 (+0.07%) Copies: 5429 -> 5414 (-0.28%); split: -0.33%, +0.06% Branches: 4034 -> 4026 (-0.20%) fossil-db (Navi): Totals from 42 (0.03% of 134913) affected shaders: CodeSize: 596044 -> 595488 (-0.09%); split: -0.10%, +0.00% Instrs: 110845 -> 110540 (-0.28%) Latency: 1206131 -> 1205747 (-0.03%) InvThroughput: 190178 -> 189809 (-0.19%) VClause: 1372 -> 1370 (-0.15%); split: -0.29%, +0.15% Copies: 5671 -> 5641 (-0.53%); split: -0.56%, +0.04% Branches: 4033 -> 4025 (-0.20%) fossil-db (Vega): Totals from 42 (0.03% of 135048) affected shaders: CodeSize: 605824 -> 605352 (-0.08%); split: -0.08%, +0.00% Instrs: 115975 -> 115706 (-0.23%) Latency: 1399845 -> 1398912 (-0.07%) InvThroughput: 489901 -> 489442 (-0.09%) VClause: 1314 -> 1311 (-0.23%); split: -0.38%, +0.15% Copies: 9673 -> 9666 (-0.07%); split: -0.12%, +0.05% Branches: 4025 -> 4024 (-0.02%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	21304b772c	aco: apply clamp to v_fma_mix fossil-db (Sienna Cichlid): Totals from 2536 (1.88% of 134913) affected shaders: CodeSize: 17314568 -> `17282960` (-0.18%) Instrs: 3191438 -> 3187487 (-0.12%) Latency: 59465090 -> 59407885 (-0.10%) InvThroughput: 10271466 -> 10260512 (-0.11%) fossil-db (Navi): Totals from 2512 (1.86% of 134913) affected shaders: CodeSize: 17194700 -> 17173396 (-0.12%) Instrs: 3215093 -> 3212430 (-0.08%) Latency: 60174315 -> 60142593 (-0.05%) InvThroughput: 9491103 -> 9483979 (-0.08%) fossil-db (Vega): Totals from 2512 (1.86% of 135048) affected shaders: CodeSize: 17186776 -> 17165472 (-0.12%) Instrs: 3311166 -> 3308503 (-0.08%) Latency: 65737409 -> 65716096 (-0.03%) InvThroughput: 21735857 -> 21719792 (-0.07%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	35196b6d89	aco: combine add/mul as v_fma_mix into fma fossil-db (Sienna Cichlid): Totals from 7345 (5.44% of 134913) affected shaders: CodeSize: 73840060 -> 73768936 (-0.10%); split: -0.10%, +0.00% Instrs: 13701603 -> 13684183 (-0.13%); split: -0.13%, +0.00% Latency: 185389373 -> 185306538 (-0.04%); split: -0.04%, +0.00% InvThroughput: 33785020 -> 33757593 (-0.08%); split: -0.08%, +0.00% VClause: 237337 -> 237338 (+0.00%) SClause: 485728 -> 485720 (-0.00%) Copies: 935900 -> 935279 (-0.07%); split: -0.07%, +0.00% Branches: 480721 -> 480722 (+0.00%) fossil-db (Navi): Totals from 10649 (7.89% of 134913) affected shaders: VGPRs: 756624 -> 756516 (-0.01%); split: -0.02%, +0.01% CodeSize: 92156580 -> 91707900 (-0.49%); split: -0.49%, +0.00% MaxWaves: 159402 -> 159476 (+0.05%); split: +0.07%, -0.02% Instrs: 17155827 -> 17070449 (-0.50%); split: -0.50%, +0.00% Latency: 246296456 -> 245487120 (-0.33%); split: -0.33%, +0.00% InvThroughput: 41438159 -> 41117424 (-0.77%); split: -0.77%, +0.00% VClause: 323790 -> 323867 (+0.02%); split: -0.00%, +0.03% SClause: 612077 -> 612034 (-0.01%); split: -0.01%, +0.00% Copies: 1103012 -> 1102775 (-0.02%); split: -0.03%, +0.01% Branches: 555893 -> 555896 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 824372 -> 824378 (+0.00%) PreVGPRs: 740390 -> 740363 (-0.00%); split: -0.01%, +0.01% fossil-db (Vega): Totals from 10950 (8.11% of 135048) affected shaders: SGPRs: 1034528 -> 1034560 (+0.00%) VGPRs: 794092 -> 794104 (+0.00%); split: -0.01%, +0.01% CodeSize: 94409768 -> 93955568 (-0.48%); split: -0.48%, +0.00% MaxWaves: 38950 -> 38939 (-0.03%); split: +0.00%, -0.03% Instrs: 18162637 -> 18070934 (-0.50%); split: -0.51%, +0.00% Latency: 291718455 -> 290772451 (-0.32%); split: -0.32%, +0.00% InvThroughput: 109114674 -> 108489767 (-0.57%); split: -0.57%, +0.00% VClause: 334498 -> 334579 (+0.02%); split: -0.01%, +0.03% SClause: 628871 -> 628825 (-0.01%); split: -0.01%, +0.00% Copies: 1674477 -> 1674850 (+0.02%); split: -0.02%, +0.04% PreSGPRs: 834800 -> 834802 (+0.00%) PreVGPRs: 750460 -> 750415 (-0.01%); split: -0.01%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	9934c86761	aco: use v_fma_mix to combine mul/add/fma input conversions fossil-db (Sienna Cichlid): Totals from 11558 (8.57% of 134913) affected shaders: VGPRs: 829392 -> 825200 (-0.51%); split: -0.52%, +0.02% SpillSGPRs: 7845 -> 8399 (+7.06%) CodeSize: 101822704 -> 101677172 (-0.14%); split: -0.25%, +0.11% MaxWaves: 172216 -> 173182 (+0.56%); split: +0.59%, -0.03% Instrs: 19061343 -> 18883450 (-0.93%); split: -0.93%, +0.00% Latency: 256011590 -> 255177378 (-0.33%); split: -0.39%, +0.06% InvThroughput: 46104438 -> 45604059 (-1.09%); split: -1.12%, +0.04% VClause: 352211 -> 351948 (-0.07%); split: -0.21%, +0.13% SClause: 676506 -> 676961 (+0.07%); split: -0.04%, +0.11% Copies: 1246571 -> 1237745 (-0.71%); split: -0.97%, +0.26% Branches: 626229 -> 626241 (+0.00%); split: -0.02%, +0.03% PreSGPRs: 882176 -> 888853 (+0.76%); split: -0.00%, +0.76% PreVGPRs: 796705 -> 792304 (-0.55%); split: -0.56%, +0.00% fossil-db (Navi): Totals from 11558 (8.57% of 134913) affected shaders: VGPRs: 803900 -> 798660 (-0.65%); split: -0.73%, +0.08% SpillSGPRs: 7894 -> 8492 (+7.58%); split: -0.10%, +7.68% CodeSize: 96892596 -> 97134716 (+0.25%); split: -0.05%, +0.29% MaxWaves: 181454 -> 183014 (+0.86%); split: +0.94%, -0.08% Instrs: 18186813 -> 18093994 (-0.51%); split: -0.56%, +0.05% Latency: 253385909 -> 253325528 (-0.02%); split: -0.15%, +0.12% InvThroughput: 43315355 -> 42805541 (-1.18%); split: -1.33%, +0.15% VClause: 338755 -> 338535 (-0.06%); split: -0.16%, +0.10% SClause: 656561 -> 656829 (+0.04%); split: -0.07%, +0.11% Copies: 1162235 -> 1153558 (-0.75%); split: -1.07%, +0.32% Branches: 588536 -> 588542 (+0.00%); split: -0.03%, +0.03% PreSGPRs: 854849 -> 861640 (+0.79%); split: -0.00%, +0.80% PreVGPRs: 783401 -> 779031 (-0.56%); split: -0.56%, +0.00% fossil-db (Vega): Totals from 11516 (8.53% of 135048) affected shaders: SGPRs: 1072128 -> 1076288 (+0.39%); split: -0.01%, +0.40% VGPRs: 821312 -> 818124 (-0.39%); split: -0.43%, +0.04% SpillSGPRs: 11952 -> 12677 (+6.07%) CodeSize: 96378496 -> 96707596 (+0.34%); split: -0.04%, +0.38% MaxWaves: 42614 -> 42883 (+0.63%); split: +0.68%, -0.04% Instrs: 18672844 -> 18600274 (-0.39%); split: -0.44%, +0.05% Latency: 296658786 -> 296338296 (-0.11%); split: -0.21%, +0.10% InvThroughput: 111665547 -> 111283559 (-0.34%); split: -0.40%, +0.06% VClause: 343001 -> 342826 (-0.05%); split: -0.14%, +0.09% SClause: 646684 -> 646657 (-0.00%); split: -0.05%, +0.04% Copies: 1715316 -> 1712895 (-0.14%); split: -0.53%, +0.39% PreSGPRs: 850737 -> 856543 (+0.68%); split: -0.04%, +0.72% PreVGPRs: 775293 -> 772215 (-0.40%); split: -0.41%, +0.02% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	eeef1bbe65	aco: refactor selection of mad/fma In the future, whether we need to use fma will depend on which multiplication is chosen. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	e12bee3cb7	aco: improve support for v_fma_mix Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Rhys Perry	79c8740c6e	aco: fix fp16 opcode definitions The v_fma_mix optimizations assume v_cvt_f16_f32 and v_mul_f16 use a v2b definition. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14769>	2022-03-17 19:04:17 +00:00
Dylan Baker	6d4eb4a72e	mesa/main: replace use of simple_list with util/list Because really, do we want simple_list? Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14983>	2022-03-17 18:25:55 +00:00
Dylan Baker	4b10a4aaaa	util/list.h: Add docstrings for list_add and list_addtail Which have easily confused parameters: the first argument is the item to be added, the second is the list to add to; but this could easily be the other way around. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14983>	2022-03-17 18:25:55 +00:00
Alyssa Rosenzweig	b0faf422b7	pan/va: Use XML for special FAU page 0 Now all special FAU handling is unified, which makes both assembler and disassembler considerably nicer. This adds some more special FAU indices from page 0 that were previously missing, allowing them to be assembled and disasembled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	31a171d92d	pan/va: Use boring names for FAU special pages 1/3 There's no magic underlying interpretation, be.. uniform. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	76159ee379	pan/va: Remove immediate modes from XML/asm Now replaced by inference in the assembler, as they should be. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	81498f1538	pan/va: Use 64-bit special FAU for pages 1 and 3 This aligns with how the hardware actually sees special FAU. Also fix the names while we're at it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	139867cb43	pan/va: Rename imm_mode -> fau_page In accordance with new information on the hardware. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	3bd1401075	pan/va: Handle uniforms from page 1 Like Bifrost, Valhall can access 2x as many fast acess uniforms as previously thought. However, on Valhall this requires using the pagination mechanism. Support this in the dis/assembler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	cf43a1cc58	pan/va: Rewrite FAU handling in dis/assembler FAU pages do not need to be specified explicitly in the assembly. Rather, they should be inferred by the assembler by the instructions used. Rewrite the code handling this in alignment with new information about the hardware. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	95b7908d2d	pan/va: Fix BLEND instruction There's only one staging register, the other register is just offset due to the Msg64 source. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	c7e8e8b319	pan/va: Handle 64-bit sources in message instrs These take up two slots, reading an aligned register pair, even though they are in a 32-bit instruction. Required to correctly model BLEND. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	9878469833	pan/va: Add start property to source The bit position of sources is more complicated than (8 * index). Make it a part of the Valhall reflection information. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Alyssa Rosenzweig	5a759140b0	pan/va: Fix typo in BLEND text Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15364>	2022-03-17 18:06:17 +00:00
Tomeu Vizoso	96e17287b4	ci/freedreno: Disable a618 jobs Some of these machines are experiencing networking problems currently. Disable for now so people aren't blocked. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15430>	2022-03-17 17:43:06 +00:00
Erik Faye-Lund	115298b71e	gallium: rename ballot cap This cap is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	b3ce733da9	gallium: rename clock cap This cap is no longer TGSI specific, so let's rename it to reflect reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	7984c5884c	gallium: rename group-vote cap This cap is no longer TGSI specific, so let's rename it to reflect reality. Because the name got a bit vague when removing the TGSI-bits, let's add some more details to the name. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	a6d7ead686	gallium: rename texture query samples cap This isn't specific to TGSI, so let's update the name to reflect reality. Because the name of the opcode was TGSI specific, let's pick a new one, based on the naming of the PIPE_CAP_TEXTURE_QUERY_LOD cap. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	930b38e7cd	gallium: rename read-outputs cap This cap is no longer TGSI-specific, so let's update the name to reflect reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	2dff9bea4f	gallium: rename array-components cap This cap is no longer TGSI specific, so let's update the name to reflect reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	350329feb1	gallium: rename sysval caps These aren't spiecic to TGSI any more, so let's rename them to reflect reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	df40de91d9	gallium: rename fine derivative cap This is no longer TGSI specific, so let's rename it to reflect the reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	2a8e11e101	gallium: rename pixel-coord caps These aren't specific to TGSI, so let's rename them to reflect the reality. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:42 +00:00
Erik Faye-Lund	89797fac56	gallium: rename layer-viewport caps Similar to the previous commits, these aren't TGSI specific, so let's drop TGSI from their name. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:41 +00:00
Erik Faye-Lund	8ac7dc9cf6	gallium: rename vs instance id cap This cap is no longer specific to TGSI, so let's rename it and update the documentation to reflect that. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:41 +00:00
Erik Faye-Lund	f8809fbdb8	gallium: rename pack half-float cap This cap no longer has anything to do with TGSI, as the lowering happens on GLSL IR, and applies just as much to NIR drivers. So let's rename this cap and update the docs to reflect the current situation. Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15316>	2022-03-17 16:44:41 +00:00
Jason Ekstrand	0f048c5782	panvk: Convert to the common sync/submit framework Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15296>	2022-03-17 16:22:10 +00:00
Lionel Landwerlin	d68b9f0e6b	anv: zero-out anv_batch_bo anv_batch_bo has a length field that we use to flush cachelines. Not having that field initialized properly leads us to access out of bound memory. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15425>	2022-03-17 15:56:14 +00:00
Lionel Landwerlin	78acae3865	anv: fix variable shadowing Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `83fee30e85` ("anv: allow multiple command buffers in anv_queue_submit") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15425>	2022-03-17 15:56:14 +00:00
Samuel Pitoiset	174d086e8c	radv: enable radv_disable_aniso_single_level for DXVK/vkd3d It seems the default D3D behavior and it's complicated to emulate this in DXVK/vkd3d. Enable it by default to prevent rendering issues in other games not listed here. Cc: 22.0 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15368>	2022-03-17 15:10:06 +00:00
Samuel Pitoiset	590eb9d640	radv: do not compute the cache UUID for LLVM if it's not used If the LLVM version (even minor) isn't the same on the OS that precompiles shaders vs the OS that runs them, the cache UUID would be different, even if only ACO is used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15406>	2022-03-17 14:49:35 +00:00
Sagar Ghuge	2e336c602d	intel/fs: Add Wa_14014435656 For any fence greater than local scope, always set flush type to at least invalidate so that fence goes on properly. v2: Fixup condition to trigger workaround (Lionel) v3: Simplify workaround (Curro) v4: Don't drop the existing WA (Curro) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: 22.0 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Sagar Ghuge	6031ad4bf6	intel/fs: Add Wa_22013689345 v2: Use a simpler framework (Lionel) v3: Rebase, add task/mesh (Lionel) v4: Fixup fence exec size (SIMDX -> SIMD1) v5: Fix invalidate_analysis, add finishme comment (Curro) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: 22.0 <mesa-stable> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Anuj Phogat	5cc4075f95	anv, iris: Add Wa_16011411144 for DG2 v2: Use CS_STALL instead of FLUSH_ENABLE in Iris (Lionel) Add missing CS_STALL after SO_BUFFER change in Anv (Lionel) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Cc: 22.0 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14947>	2022-03-17 14:18:02 +00:00
Juan A. Suarez Romero	c432bfe74b	broadcom/ci: Update flake list Some of the tests marked as flake didn't show up as flakes for a long time (more than 3 months). So likely they are already fixed. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15411>	2022-03-17 13:56:41 +00:00
Connor Abbott	072fdcabcd	tu: Enable UniformBufferUpdateAfterBind UBOs are now read at run-time via the preamble so this can be enabled. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	9932ca8a3f	ir3, turnip: Use ldc.k to push UBOs This reuses the same UBO analysis to do the pushing in the shader preamble via the ldc.k instruction instead of in the driver via CP_LOAD_STATE6. The const_data UBO is exempted as it uses a different codepath that isn't as critical. Don't do this on gallium because there are some regressions. Aztec Ruins in particular regresses a bit, and nothing I've benchmarked benefits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	221a912b8c	ir3: Refactor ir3_compiler_create() to take an options struct This will let us add more options without creating too much churn. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	acba08b58f	ir3: Implement and document ldc.k Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	fccc35c2de	ir3: Add preamble optimization pass Now that everything is plumbed through, we can tie it together. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	986f7adfee	ir3: Don't include preamble instructions in stats Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	42e21c751b	ir3: Insert frag coord code after preamble To match the pre-preamble behavior, and so that we can better schedule it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	b6fe69d855	ir3: Support prefetching with preambles Since the NIR pass runs very late, it needs to be aware of preambles, and when creating the instruction we need to move it to the start block so that RA doesn't overwrite it in the preamble. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	00d7ad334a	ir3/legalize: Handle inserting (ei) with preamble Make sure that shaders with a preamble are still considered early-release so that we don't regress them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	ccc64b7e00	ir3: Plumb through store_uniform_ir3 intrinsic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	944f4e6f8a	ir3: Better assemble/disassemble stc Add in the type, even though it turns out to not be that useful. Add in support for assembling it. Add some notes based on computerator experiments. And add support for the indirect a1.x mode that's needed for storing c64.x and later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	3244e659e0	ir3: Implement basic shader preamble intrinsics These will be used to implement the ir3-specific shader preamble lowering in NIR. shps is conceptually similar to getone (although it technically can't be duplicated) and shpe is similar to other barriers, since it has to happen after any stores to the constant file in the preamble. Add NIR intrinsics and plumbs them through ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	7ad57d9af1	ir3: Don't count reserved user consts in ubo_state::size Previously we included the reserved user consts (for Vulkan push constants) as part of the pushed UBO contents, but that led to a problem because when calculating the worst-case space for UBOs we didn't factor in the reserved user consts. We'll have the same problem when doing the same thing in the preamble optimization pass. Stop including the reserved size in ubo_state::size, and have ir3_setup_consts() add it in instead, so we won't forget to add it anywhere. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	e274354204	ir3: Fix scan.macro valid flags Right now we don't support any. We could probably support const, but that's not worth it because we could optimize a reduce of a const better anyway. Fixes: `1a78604d20` ("ir3: Add support for subgroup arithmetic") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	3b96ad70ee	nir: Add a preamble optimization pass This pass tries to move computations that are uniform for the entire draw to the preamble. There's also an API for backends to insert their own instructions into the preamble, for porting existing UBO pushing passes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	31221ee556	nir: Add a "deep" instruction clone For the shader preamble, we need to add support for cloning one instruction at a time into the preamble, but we also need to rewrite sources. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Connor Abbott	d1b017d479	nir: Add preamble functions These are functions that run before the entrypoint at least once per draw and write their results via store_preamble, and then are loaded in the rest of the shader via load_preamble. We will add users in the following commits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13148>	2022-03-17 12:15:45 +00:00
Juan A. Suarez Romero	dfb6438392	v3dv: change MESA_GLSL_CACHE envvar reference This was renamed to MESA_SHADER_CACHE. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15390>	2022-03-17 11:16:45 +01:00
Juan A. Suarez Romero	1350e7b607	radv: change MESA_GLSL_CACHE envvar reference This was renamed to MESA_SHADER_CACHE. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15390>	2022-03-17 11:16:45 +01:00
Juan A. Suarez Romero	fb856c9501	ci: use MESA_SHADER_CACHE envvar This was renamed from MESA_GLSL_CACHE. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15390>	2022-03-17 11:15:53 +01:00
Juan A. Suarez Romero	54d0a2cfad	util/disk_cache: rename MESA_GLSL_CACHE envvar Rename MESA_GLSL_CACHE to MESA_SHADER_CACHE, as the on-disk cache can store not only GLSL but also SPIR-V shaders. v2: - Keep old envvar as deprecated (Mike) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15390>	2022-03-17 11:15:53 +01:00
Igor Torrente	6d7f04e5de	venus: add VK_EXT_calibrated_timestamps extension Implements all the necessary code in the device initialization and extension functions. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15389>	2022-03-17 00:02:21 +00:00
Igor Torrente	88638ceb2d	venus: move vkGetCalibratedTimestamps to vn_protocol_driver_device.h Update venus-protocol files to move vkGetCalibratedTimestamps function from vn_protocol_driver_transport.h to vn_protocol_driver_device.h. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15389>	2022-03-17 00:02:21 +00:00
Emma Anholt	da834a12cf	vulkan: Make sure we've loaded our connectors when querying plane props. If you hadn't already called wsi_GetPhysicalDeviceDisplayProperties2KHR or wsi_GetDrmDisplayEXT before calling GetPhysicalDeviceDisplayPlaneProperties2KHR, then the connectors list wouldn't be populated and you'd get no plane properties. Fixes failure of dEQP-VK.wsi.display.get_display_plane_capabilities when run on its own. Fixes: #4575 Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15353>	2022-03-16 21:43:46 +00:00
Gurchetan Singh	83de19c900	zink: emulate some more memory If ZINK_HEAP_DEVICE_LOCAL_VISIBLE isn't available natively, then fallback to device local memory without asserting. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15202>	2022-03-16 21:03:16 +00:00
Emma Anholt	9e9a366cad	ci/turnip: Drop alpha_to-coverage flake note on a618. It's only ever been seen on a630. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	2f25d16653	turnip: Use the DRM or KGSL GPU reset status ioctls to report device loss. ANGLE-on-venus-on-turnip and zink-on-turnip want real data here for EGL's reset tests. This required moving the remaining GPU-reset-causing tests from flakes or xfails to skips. Otherwise, the rest of the caselist associated with them ends up being marked as fails as well. The alternative would be to put these tests in their own test groups with tests_per_group = 1, but that didn't seem worth the effort. Or, we could finally do something with https://gitlab.freedesktop.org/anholt/deqp-runner/-/issues/14. Fixes: #5955 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	add2121969	ci/freedreno: Remove some xfails for tests that now skip. The last CTS uprev correctly turned them into NotSupported. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	ffa9438183	ci/freedreno: Drop the skips of spirv_ids_abuse in pre-merge. The crash was fixed in `62a7acee93`, and runtime of the tests locally is 5-17s each with a hot shader cache, 11-25s without. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	46b30a5589	ci/lvp: Stop skipping spv-stable-maze-flatten-copy-composite The runtime was fixed in VK-GL-CTS 1751124d2870840bada579c236a66d38b48c039b, now it's just 1 second. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Emma Anholt	5dd74533b2	ci: Drop skips of spv-stable-pillars-volatile-nontemporal-store The runtime was fixed in VK-GL-CTS 7cc65f6c02276767407233e74c7174d88dab7919. Now it's .6s on lvp, 20ms on turnip a618. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14839>	2022-03-16 19:28:04 +00:00
Jason Ekstrand	893fa30afe	anv: Include scissors in viewport calculations It's tricky to always get the render area to the viewport code. In particular, it's not provided to secondary command buffers as part of the inheritance info so we have to bend over backwards and look for a framebuffer. With VK_KHR_dynamic_rendering, there is no framebuffer and this approach won't work and we'll need something better if we want competent guardbands in secondary command buffers. The good news is that any client that's sloppily rendering and trusting the clipper to keep things inside the render area will set a scissor and that's something they have to set inside the secondary. We can dig through the scissor state and also include the corresponding scissor (if any) and use that for our render area. This should give us the same secondary command buffer performance with VK_KHR_dynamic_rendering. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	b4e38e174f	anv: Move viewport/scissor emit to genX_cmd_buffer.c There's never been a particularly good reason to stick these in gfx7/8. We mostly did it to deduplicate the binary a bit but this shouldn't emit all that much code. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	2c04373c45	anv: Calculate the real guardband based on render area Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	12d815bcac	intel/guardband: Take min/max instead of total size Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	d89d6c7a4d	docs: Add high-level documentation for Vulkan render passes Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	7521b5db18	docs: Add the start of Vulkan runtime docs Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:45 -05:00
Jason Ekstrand	3501a3f9ed	anv: Convert to 100% dynamic rendering Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 13:13:36 -05:00
Jason Ekstrand	8112e6d601	anv: Drop pipeline pass/subpass in favor of rendering_info This is about the only "small" change we can make in the process of converting from render-pass-based to dynamic-rendering-based. Make everything in pipeline creation work in terms of dynamic rendering and create the dynamic rendering structs from the render pass as-needed. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:16 -05:00
Jason Ekstrand	ee9c068043	anv/pipeline: Stop pretending we're the validator This was ill-conceived at best. Yes, it checks for a few error conditions but it doesn't check much and what checks it has are very far away from the code that relies on those invariants. If we care about these invariants, we should add asserts near the code that makes those assumptions rather than pretending to be the validation layers. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:16 -05:00
Jason Ekstrand	2da152b5e6	anv: Stop treating color input attachments specially Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	1ad0f1b004	anv/pass: Make unused color attachments VK_ATTACHMENT_UNUSED Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	9bbecbed7a	anv: Better null surface state size for dynamic rendering Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	fff3f8bfe5	anv: Fix handling of null depth/stencil attachments with dynamic rendering Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	030b231ba9	vulkan/framebuffer: Add a flags field Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	c3d8ca9300	vulkan/render_pass: Add an optimization for UNDEFINED+LOAD_OP_CLEAR Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	17395d395a	vulkan/render_pass: Support fragment shading rate Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	6b61953684	vulkan/render_pass: Provide self-dependeny information Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	ca5ad9cbee	vulkan: Add helpers for getting rendering info from a renderpass These helpers are used by vkCreateGraphicsPipelines to get the VkPipelineRenderingCreateInfo and in vkCmdBeginCommandBuffer to get the VkCommandBufferInheritanceRenderingInfo. This is required because the Vulkan runtime code can't yet hook and modify calls made to driver- provided functions. Instead, we just provide a helper to be used in leu of vk_find_struct_const(). The structs themselves are stored in the render pass so we can pass back a pointer and there's no need to construct one on the stack or stuff it in the pipeline. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	1d726940d2	vulkan: Add a common CmdBegin/EndRederPass implementation This implements vkCmdBeginRenderPass, vkCmdEndRenderPass, and vkCmdNextSubpass in terms of the new vkCmdBegin/EndRendering included in VK_KHR_dynamic_rendering and Vulkan 1.3. All subpass dependencies and implicit layout transitions are turned into actual barriers. It does require VK_KHR_synchronization2 because it always uses the 64-bit version of the pipeline stage and access bitfields. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	874aeb8743	vulkan: Add a common vk_render_pass struct This basically contains everything in pCreateInfo plus one or two extra bits that might be useful. Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	83101429bf	anv: Convert to vk_framebuffer Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	d84e6b8f22	vulkan: Add a common vk_framebuffer struct Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14961>	2022-03-16 12:51:15 -05:00
Jason Ekstrand	acbb0d86f7	panvk: Implement VK_EXT_vertex_attribute_divisor Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:58:55 -05:00
Boris Brezillon	58587c32cb	panvk: Implement indexed rendering Since we can do 8-bit index buffers, also advertise VK_EXT_type_uint8. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:58:46 -05:00
Boris Brezillon	a08b695386	panvk: Fix per-instance attribute handling We were assuming per-vertex attributes so far. Let's extend the logic to support per-instance attributes with or without custom instance divisors. Now that we've got it all hooked up, we can enable VK_EXT_vertex_attribute_divisor. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:57:51 -05:00
Boris Brezillon	417cf3d35e	panvk: No-op zero-vertex draws Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15295>	2022-03-16 09:57:51 -05:00
Gert Wollny	3244100557	Revert "virgl: Enable PIPE_CAP_TGSI_TEXCOORD when the host supports it" This reverts commit `2fbb4e85f7`. With this CAP enabled the host doesn't correctly handle the passing the invariant flag between stages, and using surfaceless in the client seems to trigger this error Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15409>	2022-03-16 14:33:54 +01:00
Samuel Pitoiset	60517948af	radv: fix missing destruction of the inotify thread The notifier state must be destroyed when a device is destroyed. Oops. This fixes crashes at launch with The Witcher 3. Fixes: `c50557d961` ("radv: allow applications to dynamically change RADV_FORCE_VRS") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15405>	2022-03-16 11:57:41 +00:00
Vadym Shovkoplias	550f48a826	anv: implement EXT_depth_clip_control A new extension allowing the application to use the OpenGL depth range in NDC, i.e. with depth in range [-1, 1], as opposed to Vulkan’s default of [0, 1]. v2: - call gfx8_cmd_buffer_emit_viewport on ANV_CMD_DIRTY_PIPELINE (Jason) - remove redundant !! operator since negativeOneToOne must be true or false (Tapani) - coding style changes (Lionel) Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6070 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15304>	2022-03-16 11:22:24 +00:00
Charlie Turner	61ad60dc00	ci, radv: Update flake expectations dEQP-VK.api.object_management.multithreaded_shared_resources.image_2d is flakey crashing on RENOIR and VEGA10. Thanks Rhys Perry for pointing out the flake. Acked-by: Martin Roukala <martin.roukala@mupuf.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15386>	2022-03-16 09:45:42 +00:00
Charlie Turner	a815d4001a	ci, valve: Bump the trigger container This contains some fixes in the executor client related to multipart HTTP uploads. It will hopefully improve the stability of jobs! Acked-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15386>	2022-03-16 09:45:39 +00:00
Charlie Turner	729537438e	ci, valve: Show real kernel addresses in KFENCE reports. Acked-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15386>	2022-03-16 09:45:31 +00:00
Samuel Pitoiset	737c86da62	radv: only clear VRAM for app and descriptor BOs when set via drirc We don't have to clear other internal BOs when it's set for eg. vkd3d. This should reduce the number of SDMA clears. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15156>	2022-03-16 07:19:15 +00:00
Samuel Pitoiset	45b909a5a0	radv/winsys: remove old comment about zerovram RADV requires Linux 4.15+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15156>	2022-03-16 07:19:15 +00:00
Mike Blumenkrantz	acb300a3f0	lavapipe: EXT_image_robustness Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15388>	2022-03-16 04:58:41 +00:00
Mike Blumenkrantz	6345575f8a	gallivm: fix oob image detection for cube/1dArray/2dArray/cubeArray these all need to check for z coord oob to avoid crashing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15388>	2022-03-16 04:58:41 +00:00
Mike Blumenkrantz	b7fbaf924d	lavapipe: EXT_pipeline_creation_cache_control again, technically passing is still passing Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15379>	2022-03-16 04:46:06 +00:00
Mike Blumenkrantz	9bce878490	lavapipe: EXT_pipeline_creation_feedback cts passes with mostly quality warnings, but it does pass Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15379>	2022-03-16 04:46:06 +00:00
Mike Blumenkrantz	dffe8141bd	lavapipe: KHR_zero_initialize_workgroup_memory Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15403>	2022-03-16 04:05:14 +00:00
Mike Blumenkrantz	e106c1294b	llvmpipe: add handling for zeroing cs shared memory since this is just allocated by the cpu, it needs to be zeroed if the shader expects that behavior Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15403>	2022-03-16 04:05:14 +00:00
Mike Blumenkrantz	f72d5a930b	lavapipe: KHR_format_feature_flags2 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15395>	2022-03-16 02:12:05 +00:00
Mike Blumenkrantz	90e091b072	lavapipe: use VkFormatFeatureFlags2 in format detection Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15395>	2022-03-16 02:12:05 +00:00
Mike Blumenkrantz	65bf1cbc26	gallium: add flag to draw info to indicate converted draws this draw mode in particular requires driver-specific conversions for queries (e.g., number of vertices), so pass that info through the only limitation is that it doesn't work for dlists, but I have yet to see a real use case of a statistics query being used with dlists Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15326>	2022-03-16 01:45:48 +00:00
Jason Ekstrand	864f3c0ee0	panvk: Fix SSBO buffer offsets Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15398>	2022-03-16 01:27:28 +00:00
Jason Ekstrand	6214cce382	panvk: Require 16B alignment for UBOs This is required by MALI_UNIFORM_BUFFER. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15398>	2022-03-16 01:27:28 +00:00
Lionel Landwerlin	a54f5e8e00	anv: silence compiler warnings Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6146 Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15387>	2022-03-16 01:02:05 +00:00
Daniel Stone	2221e3d487	ci: Add new Panfrost G52 skip This started failing for some reason, has been seen in https://gitlab.freedesktop.org/mesa/mesa/-/jobs/19776551 and others. Signed-off-by: Daniel Stone <daniels@collabora.com> Suggested-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15396>	2022-03-16 00:41:46 +00:00
Stefan Dirsch	c287ed4f39	meson: restore private requires to libdrm in dri.pc file Due to a typo the private requires to libdrm were lost in dri.pc. Fixed another typo: Infastructure --> Infrastructure Fixes: `3ae3569d82` ("meson: restore dri.pc file") Signed-off-by: Stefan Dirsch <sndirsch@suse.com> Tested-by: Stefan Dirsch <sndirsch@suse.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15375>	2022-03-15 23:59:46 +00:00
Emma Anholt	3b90d3997a	turnip: use vk_shader_module_to_nir(). Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Jason Ekstrand	5a0e081e00	panvk: Use vk_shader_module_to_nir() Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Jason Ekstrand	0c871d89ae	panvk: Use vk_shader_module Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Jason Ekstrand	0b4a80b4c4	anv: Use vk_shader_module_to_nir() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Jason Ekstrand	21b405fbbc	vulkan: Add a vk_shader_module_to_nir() helper This encapsulates all the little bits needed to turn a shader module into some mostly reasonable NIR. It handles inlining functions, lowering variable initializers, handling per-member structs and other trickiness that is needed for consuming the output of spirv_to_nir. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15305>	2022-03-15 23:13:16 +00:00
Mike Blumenkrantz	40fcd8ef83	lavapipe: enable KHR_memory_model support lavapipe's memory is always coherent, so this is already supported Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15377>	2022-03-15 22:17:43 +00:00
Mike Blumenkrantz	13d900de0d	llvmpipe: set nir_shader_compiler_options::use_scoped_barrier required for vk memory model Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15377>	2022-03-15 22:17:43 +00:00
Mike Blumenkrantz	e3e3186855	lavapipe: strip unneeded scoped barriers most of these do nothing and can't be emitted without breaking shaders Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15377>	2022-03-15 22:17:43 +00:00
Connor Abbott	a83ea0253f	ir3: Use isam for bindless readonly ssbo loads Since this isn't hooked up in gallium, only do it for bindless for now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	625ebb977f	ir3: Actually use wrmask in emit_sam I noticed that isam emitted for SSBO loads was writing all 4 components, which this fixes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	5f020bcc8d	ir3/lower_spill: Fix corner case with oob offsets If the base register is killed, it may be reused as the destination of a ldp. In that case we should just skip resetting it afterwards. Fixes regressions in dEQP-VK.ssbo.layout.random.scalar.38 later. Fixes: `9912c61362` ("ir3/spill: Support larger spill slot offset") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	6304c7cb82	ir3/parser: Don't use right recursion This fixes memory exhaustion errors when doing shader replacement with very large shaders. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	f9d9c0172a	tu: Add an extra storage descriptor for isam Based on a workaround the blob does. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	1ec3d39407	tu: Handle UBO/SSBO descriptors with different sizes We reuse the otherwise-unused offset channel to represent the array stride, so that reindexing works properly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Connor Abbott	5ba3ea1eb3	tu: Rewrite dynamic descriptor handling We need to prepare for storage buffers having different sizes from uniform buffers. This switches dynamic_offset_offset to have units of bytes, the same as offset, and as a nice bonus we can more easily combine the dynamic and non-dynamic paths in various different places. This also entails rewriting the code that patches dynamic descriptors, since we can no longer assume a linear mapping between indices in dynamicOffsets and descriptor locations which the previous approach heavily relied on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15288>	2022-03-15 21:36:38 +00:00
Mike Blumenkrantz	6f7f6df287	zink: export indirect io pipe caps Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15327>	2022-03-15 21:25:05 +00:00
Mike Blumenkrantz	02569428a8	zink: fix unreachable() location in ntv streamout info super annoying to debug otherwise Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15327>	2022-03-15 21:25:05 +00:00
Mike Blumenkrantz	b53ee02192	zink: add DOUBLE glsl type for streamout export not used yet but someday Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15327>	2022-03-15 21:25:05 +00:00
Mike Blumenkrantz	68267aeab8	zink: add nir_var_function_temp support to ntv I said I'd never do this, but here we are Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15327>	2022-03-15 21:25:05 +00:00
Emma Anholt	eb9b092001	turnip: Enable VK_EXT_display_control using the common code. It's all implemented now, so we can turn it back on. Passes 15/16 tests when X11 isn't running, and 1/16 when it is, with no failures in either mode. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15351>	2022-03-15 20:08:58 +00:00
Samuel Pitoiset	10d69d5f0b	radv: fix returning empty drmFormatModifierTilingFeatures From the Vulkan spec: "drmFormatModifierTilingFeatures is a bitmask of VkFormatFeatureFlagBits that are supported by any image created with format and drmFormatModifier. The returned drmFormatModifierTilingFeatures must contain at least one bit." This fixes recent CTS dEQP-VK.drm_format_modifiers.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15383>	2022-03-15 19:43:48 +00:00
Samuel Pitoiset	dc247e5d43	radv: remove VK_AMD_shader_info support This extension is quite old and useless now. VK_KHR_pipeline_executable_properties should be used instead. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15299>	2022-03-15 19:23:53 +00:00
Bas Nieuwenhuizen	a0ccc46969	radv: Expose VK_VALVE_descriptor_set_host_mapping for vkd3d only. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15363>	2022-03-15 18:52:41 +00:00
Hans-Kristian Arntzen	86a7b5e276	radv: Implement VK_VALVE_descriptor_set_host_mapping. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15363>	2022-03-15 18:52:41 +00:00
Bas Nieuwenhuizen	6c0bc7eb07	vk: Update xml and headers to 1.3.207. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15363>	2022-03-15 18:52:41 +00:00
Dave Airlie	c6ac3e017d	llvmpipe/fs: add missing depth_clamp key printing Helps debugging shaders better. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15385>	2022-03-15 18:35:20 +00:00
Mike Blumenkrantz	1b2c4d7196	lavapipe: KHR_shader_integer_dot_product Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15384>	2022-03-15 18:07:47 +00:00
Mike Blumenkrantz	4cf9e24039	gallivm: implement nir_op_pack_32_4x8_split just reusing existing helpers and llvm can optimize it for us Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15384>	2022-03-15 18:07:47 +00:00
Mike Blumenkrantz	a33a07bd06	lavapipe: maintenance4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15378>	2022-03-14 21:30:13 -04:00
Mike Blumenkrantz	b2f69a8bb8	lavapipe: set maxBufferSize for maintenance4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15378>	2022-03-14 21:30:13 -04:00
Mike Blumenkrantz	987e8a5a0c	lavapipe: implement vkGetDevice*MemoryRequirements for maint4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15378>	2022-03-14 21:30:13 -04:00
Mike Blumenkrantz	49cac7b33d	lavapipe: ref/unref pipeline layouts for pipeline creation required by maintenance4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15378>	2022-03-14 21:30:13 -04:00
Mike Blumenkrantz	2f9976debc	lavapipe: always clone shader nir for shader states these become owned and freed by llvmpipe, so ensure that freeing them there won't cause crashes cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15281>	2022-03-15 00:58:22 +00:00
Jason Ekstrand	2170c3ac63	panvk: Use the correct integer border colors Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	8dd917b9f0	panvk: Rework texture, sampler, and image binding index calculation This adds a new get_resource_deref_binding helper which decodes a resource deref into set, binding, and index. To make texture instructions nicer, the index can optionally be split into immediate and SSA parts. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	17e79b044e	panvk: Skip ZS setup if there is no depth/stencil attachment Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	4f843db0a1	panvk: Make panvk_image_view derive from vk_image_view Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Jason Ekstrand	1865b7a93e	panvk: Make panvk_image derive from vk_image Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15382>	2022-03-14 23:04:09 +00:00
Eric Engestrom	5dbbc0f0a8	Revert "glx: Fix build errors with --enable-mangling (v2)" This reverts commit `a27f2d991b`. As of `a0829cf23b` ("GL: drop symbols mangling support"), this extra complexity isn't needed anymore. Signed-off-by: Eric Engestrom <eric.engestrom@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/2298>	2022-03-14 22:14:25 +00:00
Igor Torrente	f30334b6c4	Venus: add VN_CMD_ENQUEUE to vn_cmd_encode_memory_barriers Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15344>	2022-03-14 21:47:19 +00:00
Igor Torrente	a65d2ef1c1	Venus: Adjust VN_CMD_ENQUEUE to set VN_COMMAND_BUFFER_STATE_INVALID This improves the issue of a return inside the macro. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15344>	2022-03-14 21:47:19 +00:00
Igor Torrente	6cdbc0299a	Venus: Add VN_CMD_ENQUEUE macro with vkCmd* common code Several `vn_Cmd` share the same code to enqueue the command to the command stream. This adds a macro with this common code. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15344>	2022-03-14 21:47:19 +00:00
Samuel Pitoiset	1dfee91fdf	radv: export the pipeline hash via VK_KHR_pipeline_executable_properties This will help to match RGP<->Fossilize pipelines. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15279>	2022-03-14 20:20:40 +00:00
Rhys Perry	c4cf92cad7	radv,aco,ac/llvm: fix indirect dispatches on the compute queue on GFX7-10 Since neither PKT3_LOAD_SH_REG_INDEX nor PKT3_COPY_DATA work with compute queues on GFX7-10, we have to load the dispatch size from memory in the shader. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15064>	2022-03-14 19:54:36 +00:00
Rhys Perry	973967c49d	aco: split and recombine unaligned sgpr inputs An example is the num_work_groups argument. Fixes invalid assembly with func.compute.num-workgroups.basic.q0 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15064>	2022-03-14 19:54:36 +00:00
Juan A. Suarez Romero	000b935c50	v3dv/ci: add test to skip list Add test that it is a timeout in the CI, but otherwise it passes. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15374>	2022-03-14 18:55:13 +00:00
Boris Brezillon	ff8aa15fa0	panvk: Add support for texel buffers Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15334>	2022-03-14 18:06:01 +00:00
Boris Brezillon	9dc8382de8	panvk: Add a dummy sampler for NIR tex operations that don't take one In the NIR domain, some texture operations don't require a sampler, but Bifrost/Midgard always want one. Let's add a dummy sampler to handle that case. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15334>	2022-03-14 18:06:01 +00:00
Jason Ekstrand	a35e721162	panvk: Stop advertising Vulkan 1.1 We're nowhere close to even having Vulkan 1.0 working yet, there's no reason to get too excited about 1.1. It just means piles more test crashes for features we're claiming to support but don't. If we want to enable more tests, we can turn on the extensions for those features once we actually have them working. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15334>	2022-03-14 18:06:01 +00:00
Erik Faye-Lund	dd9b8881e0	Revert "ci: downgrade sphinx to v3.x" The readthedocs theme now supports Sphinx 4.x, so there's no longer any reason to stick with 3.x. This reverts commit `a545b6eda0`. Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15212>	2022-03-14 17:21:57 +00:00
Icecream95	d5870c45ae	panfrost: Optimise recalculation of max sampler view Previously we always searched through 128 sampler views for set sampler views, now we never look above the maximum updated view. Fixes: `304851422a` ("panfrost: Fix set_sampler_views for big GL") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15366>	2022-03-14 16:33:40 +00:00
Icecream95	3e405afeb9	panfrost: Don't initialise the trampolines array PIPE_MAX_SHADER_SAMPLER_VIEWS is 128, so we just end up initialising a kilobyte of memory for no reason, when usually only a couple of sampler views are used. Fixes: `53ef20f08d` ("panfrost: Handle NULL sampler views") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15366>	2022-03-14 16:33:40 +00:00
Samuel Pitoiset	b003a101ee	radv: stop zeroing radv_sample_locations_state in barriers This is useless because all fields should be correctly filled if the pNext struct is found. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15164>	2022-03-14 12:45:29 +00:00
Samuel Pitoiset	612a12a42c	radv: move waiting for events to CmdWaitEvents2KHR() CmdPipelineBarrier doesn't have events. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15164>	2022-03-14 12:45:29 +00:00
Samuel Pitoiset	c6d776f092	radv: remove unnecessary check in FreeCommandBuffers() cmd_buffer->pool should never be NULL. Even if AllocateCommandBuffers() fails, the successfully created cmdbuffers would have it set correctly. From the Vulkan spec: "VUID-vkFreeCommandBuffers-pCommandBuffers-parent Each element of pCommandBuffers that is a valid handle must have been created, allocated, or retrieved from commandPool." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15164>	2022-03-14 12:45:29 +00:00
Samuel Pitoiset	01ec899083	radv: remove unnecessary NULL check in TrimCommandPool() This function seems rarely used or maybe never but I noticed this. From the Vulkan spec: "VUID-vkTrimCommandPool-commandPool-parameter commandPool must be a valid VkCommandPool handle". Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15164>	2022-03-14 12:45:29 +00:00
Samuel Pitoiset	269b1232ee	radv: remove useless check in radv_cmd_buffer_upload_data() ptr shouldn't be NULL. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15164>	2022-03-14 12:45:29 +00:00
Samuel Pitoiset	0eaf9dbce3	radv: fix compatibility with VK_IMAGE_CREATE_EXTENDED_USAGE_BIT Some formats can be accepted if a compatible format is also supported and VK_IMAGE_CREATE_EXTENDED_USAGE_BIT used. Fixes new CTS dEQP-VK.image.extended_usage_bit_compatibility.*. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6046 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15063>	2022-03-14 11:16:33 +00:00
Samuel Pitoiset	42f84a5886	radv: update inputs_read when lowering the view index Otherwise inputs_read doesn't contain the information. This shouldn't fix anything in practice because radv_shader_info gathers this from the variable. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15337>	2022-03-14 10:26:41 +00:00
Ernst Sjöstrand	e5f3689cff	intel/compiler: Fix non-trivial designated initializer Not supported by GCC 7. src/compiler/nir/nir_builder_opcodes.h:14156:118: sorry, unimplemented: non-trivial designated initializers not supported src/intel/compiler/brw_mesh.cpp:515:7: note: in expansion of macro ‘nir_store_per_primitive_output’ Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Fixes: `bc4f8c073a` ("intel/compiler: inject MUE initialization") Signed-off-by: Ernst Sjöstrand <ernstp@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15360>	2022-03-14 09:56:04 +00:00
Samuel Pitoiset	d7514c5f04	radv: stop waiting for DMA to be idle for all transfer operations Only copy operations actually use CP DMA. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15194>	2022-03-14 09:36:28 +00:00
Gert Wollny	1b354ab913	Revert "llvmpipe: allow vertex processing and fragment processing in parallel" This reverts commit `ec8104c6b2`. llvmpipe: allow vertex processing and fragment processing in parallel The commit breaks the the virglrenderer vtest environment used in the virglrednerer CI and running wayland in virtualized environments. Related: #6130 Related: #6110 Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Acked-by: Dave Airlie <airlied@redhat.com> Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15338>	2022-03-14 09:22:22 +00:00
Samuel Pitoiset	5f3d3be24a	radv: fix indirect dispatches on the compute queue on GFX10.3+ For weird reasons, the COPY_DATA packet doesn't seem to copy anything while on the compute queue. Instead, use PKT3_LOAD_SH_REG_INDEX which seems to work as expected. Note that LOAD_SH_REG_INDEX on the compute queue is only supported by the CP on GFX10.3, so we need to implement a different solution (load from the indirect BO in the shader) for older generations. This should fix the Control RT GPU hang. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15053>	2022-03-14 08:54:23 +00:00
Samuel Pitoiset	53ccfbb996	amd: add PKT3_LOAD_SH_REG_INDEX It seems only available on GFX8+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15053>	2022-03-14 08:54:23 +00:00
Daniel Schürmann	70aea6b41a	aco/ra: refactor collect_vars() to return a sorted vector The vector of IDs is sorted with decreasing sizes, and by increasing assigned registers. This decouples register assingment from ssa IDs. Totals from 12694 (9.41% of 134913) affected shaders: (GFX10.3) VGPRs: 757864 -> 757848 (-0.00%); split: -0.00%, +0.00% CodeSize: 72350540 -> 72348688 (-0.00%); split: -0.02%, +0.02% MaxWaves: 237018 -> 237020 (+0.00%); split: +0.00%, -0.00% Instrs: 13545494 -> 13544699 (-0.01%); split: -0.03%, +0.02% Latency: 148539203 -> 148533292 (-0.00%); split: -0.01%, +0.00% InvThroughput: 30319086 -> 30320382 (+0.00%); split: -0.01%, +0.01% VClause: 326875 -> 327028 (+0.05%); split: -0.05%, +0.09% SClause: 479833 -> 479837 (+0.00%); split: -0.00%, +0.00% Copies: 862152 -> 860914 (-0.14%); split: -0.43%, +0.28% Branches: 317775 -> 317777 (+0.00%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11526>	2022-03-14 08:32:10 +00:00
Daniel Schürmann	61c36b6dc0	aco/ra: refactor find_vars() to return a vector instead of std::set<> No fossil-db changes. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11526>	2022-03-14 08:32:10 +00:00
Daniel Schürmann	9476986e6f	aco/ra: special-case get_reg_for_create_vector_copy() This function implements separate handling for p_create_vector during get_regs_for_copies(). This simplifies some code and lets more precisely select swap instructions if possible. Totals from 876 (0.65% of 134913) affected shaders: (GFX10.3) VGPRs: 53312 -> 53336 (+0.05%) CodeSize: 3792936 -> `3788160` (-0.13%); split: -0.15%, +0.03% MaxWaves: 16084 -> 16078 (-0.04%) Instrs: 707449 -> 706385 (-0.15%); split: -0.19%, +0.04% Latency: 6288293 -> 6286677 (-0.03%); split: -0.03%, +0.01% InvThroughput: 4264450 -> 4263671 (-0.02%); split: -0.02%, +0.00% VClause: 18655 -> 18679 (+0.13%); split: -0.20%, +0.33% Copies: 55397 -> 54353 (-1.88%); split: -2.45%, +0.57% Branches: 12426 -> 12415 (-0.09%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11526>	2022-03-14 08:32:10 +00:00
Daniel Schürmann	9181e8ceba	aco/ra: count constant moves in get_reg_create_vector() Also implements a correct_pos_mask to keep track which operands are already at the right target position. To mitigate some regressions, call get_reg_impl() less often. Totals from 1229 (0.91% of 134913) affected shaders: (GFX10.3) VGPRs: 60216 -> 59848 (-0.61%) CodeSize: 3716496 -> 3711268 (-0.14%); split: -0.19%, +0.05% MaxWaves: 27952 -> 28004 (+0.19%) Instrs: 685983 -> 685035 (-0.14%); split: -0.20%, +0.06% Latency: 6727587 -> 6725340 (-0.03%); split: -0.06%, +0.02% InvThroughput: 9289043 -> 9289866 (+0.01%); split: -0.02%, +0.03% VClause: 17730 -> 17740 (+0.06%); split: -0.25%, +0.30% Copies: 54352 -> 53420 (-1.71%); split: -2.46%, +0.75% Branches: 12122 -> 12121 (-0.01%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11526>	2022-03-14 08:32:10 +00:00
Samuel Pitoiset	7ad1eb4e8c	radv: rework the CS regalloc hang workaround Move it to the pipeline creation to reduce computations in the hot path. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15162>	2022-03-14 08:05:32 +01:00
Samuel Pitoiset	d532da6e96	radv: fix the CS regalloc hang workaround on GFX6 and few GFX7 chips RadeonSI uses a different terminology and info->blocks is actually the number of threads, not the number of blocks (ie. info->grid). Found by inspection. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15162>	2022-03-14 08:05:28 +01:00
Mike Blumenkrantz	952679b944	zink: use dynamic rasterizer_discard state when possible Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15328>	2022-03-14 04:08:29 +00:00
Mike Blumenkrantz	ae4bdea73e	zink: move dynamic state2 pipeline state to substruct in pipeline state Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15328>	2022-03-14 04:08:29 +00:00
Mike Blumenkrantz	d1f12b1924	zink: assert that the dynamic state array size is big enough Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15328>	2022-03-14 04:08:29 +00:00
Dave Airlie	6cc4cdaa6f	radv/winsys: add support for queues without user fences. The video queues don't have user fence support. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15257>	2022-03-14 10:41:04 +10:00
Dave Airlie	c6755e85e3	radv/winsys: add a ring level detection for ib bo usage. The video rings can't use IB bos v2: add use_ib flag to avoid a bunch of checks. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15257>	2022-03-14 10:41:01 +10:00
Dave Airlie	31b82afe6e	radv/winsys: add nop packets for uvd and vcn dec. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15257>	2022-03-14 10:40:52 +10:00
Dave Airlie	5819b4c1d3	radv/winsys: complete ring/ip translations. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15257>	2022-03-14 10:40:23 +10:00
Mihai Preda	816ad5ede8	radeonsi/tests: update piglit baseline on vega20 current as of commit: `6731460194` Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15273>	2022-03-13 20:11:47 +00:00
Konstantin Seurer	16cb957e8b	radv: Enable KHR_ray_query Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14565>	2022-03-13 12:02:05 +01:00
Konstantin Seurer	78c84f11a5	radv: Lower ray queries Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14565>	2022-03-13 12:02:05 +01:00
Konstantin Seurer	357bd1829f	nir,spirv: Preserve ray_query_value Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14565>	2022-03-13 12:02:05 +01:00
Konstantin Seurer	6d2e95db7b	radv: Move common code to seperate file Signed-off-by: Konstantin Seurer <konstantin.seurer@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14565>	2022-03-13 12:02:05 +01:00
Dave Airlie	177805cc03	radv: try and fix internal transfer queue mapping The WSI code wants to remain generic and try and use vulkan APIs, even though these queues aren't exposed through the API. Add the transfer queue to the end of the queues. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Tested-by: Mike Lothian <mike@fireburn.co.uk> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15357>	2022-03-13 02:37:19 +00:00
Mike Blumenkrantz	57ddfcaa2d	zink: anv ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15361>	2022-03-13 00:16:56 +00:00
Alyssa Rosenzweig	8c6cb15200	panfrost: Handle txs of cube arrays We need to divide the array length by 6 to match what OpenGL expects. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15254>	2022-03-12 17:34:01 +00:00
Alyssa Rosenzweig	53f1e57ee7	pan/bi: Handle non-2D arrays Handle arrays generically by using the last component of the coordinate source as the array index. That works for both 2D arrays and cube arrays, fixing cube arrays. Cube arrays were already handled correctly in core Panfrost code. This code path is not tested in dEQP-GLES31 without exposing OES_cube_map_array, which depends on OES_geometry_shader, which we don't have. Yet we do expose PIPE_CAP_CUBE_ARRAY, so ARB_cube_map_array is exposed. Disabling PIPE_CAP_CUBE_ARRAY would be an easy band-aid fix, but it's easy enough to handle correctly. dEQP-GLES31 passes with a hack enabling OES_cube_map_array [without geometry shaders]. Also fixes 1D arrays on Bifrost for the same reasons. Fixes: `70d6c5675d` ("pan/bi: Emit TEXC with builder") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15254>	2022-03-12 17:34:01 +00:00
Alyssa Rosenzweig	1f97819fbe	panfrost: Emulate GL_CLAMP on Bifrost Hardware support was removed with Midgard. Use mesa/st to emulate GL_CLAMP with nir_lower_tex automatically (the Zink lowering), and disable GL_MIRROR_CLAMP which isn't lowered correctly. Fixes texwrap Piglit tests on G52. Fixes: `f9ceab7b23` ("panfrost: Fix CLAMP wrap mode") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15253>	2022-03-12 17:16:00 +00:00
Rob Clark	05d6877235	freedreno/ir3: Don't try re-swapping cat3 srcs This can lead us to endless loops of "progress".. Note fixes commit commit really just exposed an existing problem. Fixes: `9c9e8c3349` ("nir: Reorder ffma and fsub combining") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6133 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15336>	2022-03-12 16:42:00 +00:00
Rob Clark	fa59556e1a	freedreno/ir3: Remove unused define Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15336>	2022-03-12 16:42:00 +00:00
Rob Clark	5f6dce05c3	mesa: Easier shader capture for android I've typed this patch a few times already.. lets just add some debug code which can be easily switched on so I don't have to type it again. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15336>	2022-03-12 16:41:59 +00:00
Dave Airlie	cd00247db5	crocus: force ignore_sample_mask_out on gen4/5 for precompile on gen4/5 this won't ever be anything else, avoids some recompiles. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15315>	2022-03-12 08:12:41 +00:00
Dave Airlie	7edda218fd	intel: add some missing debug recompile info. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15315>	2022-03-12 08:12:41 +00:00
Jason Ekstrand	65db6b0e7c	bifrost: Constant fold after lower_explicit_io nir_lower_explicit_io generates mul+add chains even for constants. One round of constant folding should get rid of these. This fixes all of the dEQP-VK.glsl.conversions.* tests on panvk. GoGoGoGo'd-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15349>	2022-03-12 03:51:54 +00:00
Alyssa Rosenzweig	e0b6493c49	mesa: Remove unused framebuffer validation Probably unused since we deleted classic. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15255>	2022-03-11 22:02:23 -05:00
Jason Ekstrand	1aa120b10f	bifrost: Handle nir_op_frexp* and nir_op_ldexp Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	d2a09f3dd3	bifrost: Implement fine and coarse derivatives We leave the undecorated ops as fine so we don't disturb panfrost. For coarse derivatives, we use a lane ID of 0 for the first lane and 1 or 2 for the second depending on axis. This ensures that coarse derivatives are quad-uniform. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	83010c57a6	bifrost: Simplify derivatives a bit Instead of two magic ternary operations, define a new `axis` temporary which is 1 for X and 2 for Y. Then define everything else in terms of this variable. In particular, the mask operation we do on LANE_ID is a mask so it makes more sense to use ~axis than 1/2 but in the other order. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jason Ekstrand	c043e93ca5	bifrost: Lower usub_borrow Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15352>	2022-03-12 02:27:02 +00:00
Jonathan Gray	e50eb1ce7a	util: fix msvc build Fix msvc build regression after `0536b69133` reported by Prodea Alexandru-Liviu. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6137 Fixes: `0536b69133` ("util: fix build with clang 10 on mips64") Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15355>	2022-03-12 11:38:27 +11:00
Jason Ekstrand	d65dbe8018	anv: Allow MSAA resolve with different numbers of planes The Vulkan spec for VK_KHR_depth_stencil_resolve allows a format mismatch between the primary attachment and the resolve attachment within certain limits. In particular, VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03181 If pDepthStencilResolveAttachment is not NULL and does not have the value VK_ATTACHMENT_UNUSED and VkFormat of pDepthStencilResolveAttachment has a depth component, then the VkFormat of pDepthStencilAttachment must have a depth component with the same number of bits and numerical type VUID-VkSubpassDescriptionDepthStencilResolve-pDepthStencilResolveAttachment-03182 If pDepthStencilResolveAttachment is not NULL and does not have the value VK_ATTACHMENT_UNUSED, and VkFormat of pDepthStencilResolveAttachment has a stencil component, then the VkFormat of pDepthStencilAttachment must have a stencil component with the same number of bits and numerical type So you can resolve from a depth/stencil format to a depth-only or stencil-only format so long as the number of bits matches. Unfortunately, this has never been tested because the CTS tests which purport to test this are broken and actually test with a destination combined depth/stencil format. Fixes: `5e4f9ea363` ("anv: Implement VK_KHR_depth_stencil_resolve") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15333>	2022-03-11 22:25:42 +00:00
Mike Blumenkrantz	ffd67b39e7	zink: remove flake this had already been resolved by the time the flake was added Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15350>	2022-03-11 14:46:41 -05:00
Jason Ekstrand	95a44a5b09	lavapipe: Use the auto-generated vk_enqueue_BeginRendering Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15325>	2022-03-11 11:40:41 -06:00
Jason Ekstrand	f76621f719	vulkan/cmd_queue: Properly support non-array pointer members Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5440 Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15325>	2022-03-11 11:36:53 -06:00
Thong Thai	027f1302fc	radeonsi: add option to disable EFC Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	23e5b910c5	radeon: add EFC support to only VCN2.0 devices Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5228 Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	9fa6ab962a	frontends/va: zero-copy efc Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	9602526568	frontends/va: add encoder format conversion (EFC) support Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Thong Thai	73153746d5	gallium: add parameters for encoder format conversion (EFC) support Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15196>	2022-03-11 14:10:08 +00:00
Tapani Pälli	adea096029	ci: update various ci result files Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12936>	2022-03-11 09:58:28 +00:00
Tapani Pälli	2a14baab85	mesa: check for valid internalformat with glTex[Sub]Image This changes our error handling to be compatible with upcoming specification change that unifies glTex[Sub]Image error handling between OpenGL ES 2.0 vs ES 3.0+ specifications, see: https://gitlab.khronos.org/opengl/API/-/issues/147 OpenGL ES 2.0.25 spec states: "Specifying a value for internalformat that is not one of the above values generates the error INVALID_VALUE. If internalformat does not match format, the error INVALID_OPERATION is generated." This fixes following new tests: KHR-GLES31.core.compressed_format.* KHR-GLES32.core.compressed_format.* v2: GL_INVALID_OPERATION -> GL_INVALID_VALUE in extension checks, remove (now overlapping) extension checks from _mesa_gles_error_check_format_and_type (Eric Anholt) v3: take GLES version in to account in internalformat checks Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12936>	2022-03-11 09:58:28 +00:00
Lionel Landwerlin	6cea8a43fa	anv: silence compiler warning Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	90000aea9b	anv: make a couple of descriptor function private Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	e12698724e	anv: rename host only descriptor internal flag We add an assert to verify that those are not bound. v2: Drop != 0 (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	87f59b18cf	anv: don't lazy allocate surface states in descriptor sets In `4001d9ce1a` we started lazily allocating surface states in the descriptor sets rather than upfront in the descriptor pool. This was to workaround vkd3d-proton allocating more than we could handle at the HW level. The issue introduced in that change is that we didn't protect the descriptor pool free list as well as the anv_state_stream which are now potentially used from different threads through the descriptor set write functions. This reverts the lazy allocation part of that change. Host only descriptor sets changes remain. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4001d9ce1a` ("anv: Handle VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_VALVE for descriptor sets") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Lionel Landwerlin	71cd6a7b84	anv: fix acceleration structure descriptor copies We're not supposed to have a VkWriteDescriptorSetAccelerationStructureKHR when doing a copy. We should instead get the acceleration structure object from the source descriptor. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15241>	2022-03-11 08:47:15 +00:00
Pierre-Eric Pelloux-Prayer	968d68125c	radeonsi: don't clear framebuffer.state before dcc decomp This causes inconsistencies between sctx->framebuffer.state and other sctx->framebuffer properties (like compressed_cb_mask). The point of this code was to fix an issue with vi_separate_dcc_stop_query, which was removed by `804e292440` we can safely drop it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6099 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15261>	2022-03-11 08:31:36 +00:00
Kenneth Graunke	01442cf4d4	iris: Restore flagging of dirty bindings in binder_realloc When I switched iris over to use 3DSTATE_BINDING_TABLE_POOL_ALLOC, I stopped flagging things dirty when allocating a new binder, because the contents of the binding table were still valid, thanks to us not having to subtract Surface State Base Address anymore. This unfortunately missed the point that the old binding table is in the old buffer, which is no longer what the binder pool base address points to. So we'd either need to copy it over, or just flag it dirty and re-emit it on the next draw. Fixes misrendering in Ryujinx. Fixes: `8b9045e7a4` ("intel: Use 3DSTATE_BINDING_TABLE_POOL_ALLOC exclusively on Gfx11+") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15314>	2022-03-11 07:59:18 +00:00
Samuel Pitoiset	b366fef091	radv: optimize the number of loaded components for VS inputs in NIR fossils-db (Sienna Cichlid): Totals from 3691 (2.74% of 134913) affected shaders: VGPRs: 121368 -> 121584 (+0.18%); split: -0.36%, +0.54% CodeSize: 7597912 -> 7561140 (-0.48%); split: -0.66%, +0.18% MaxWaves: 104706 -> 104772 (+0.06%) Instrs: 1441229 -> 1437652 (-0.25%); split: -0.53%, +0.28% Latency: 5500766 -> 5482101 (-0.34%); split: -0.45%, +0.11% InvThroughput: 804401 -> 797178 (-0.90%); split: -1.09%, +0.20% VClause: 25185 -> 25143 (-0.17%); split: -0.50%, +0.33% SClause: 27486 -> 27445 (-0.15%); split: -0.57%, +0.42% Copies: 143816 -> 147900 (+2.84%); split: -0.54%, +3.38% PreSGPRs: 109584 -> 110396 (+0.74%); split: -0.04%, +0.79% PreVGPRs: 95541 -> 94583 (-1.00%); split: -1.12%, +0.12% fossils-db (Polaris10): Totals from 1773 (1.30% of 135960) affected shaders: SGPRs: 80848 -> 80864 (+0.02%); split: -0.14%, +0.16% VGPRs: 56424 -> 55600 (-1.46%); split: -1.47%, +0.01% CodeSize: 1732588 -> 1696840 (-2.06%); split: -2.07%, +0.01% MaxWaves: 12103 -> 12106 (+0.02%) Instrs: 347684 -> 341597 (-1.75%); split: -1.76%, +0.01% Latency: 2542840 -> 2523946 (-0.74%); split: -0.95%, +0.21% InvThroughput: 924601 -> 905102 (-2.11%); split: -2.13%, +0.02% VClause: 9565 -> 9545 (-0.21%); split: -0.51%, +0.30% SClause: 10587 -> 10333 (-2.40%); split: -2.82%, +0.43% Copies: 19321 -> 20307 (+5.10%); split: -0.78%, +5.88% PreSGPRs: 30879 -> 30875 (-0.01%); split: -0.20%, +0.18% PreVGPRs: 41211 -> 41270 (+0.14%); split: -0.73%, +0.87% Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15317>	2022-03-11 07:40:10 +00:00
Dave Airlie	1ec4e568de	radv: abstract queue family away from queue family index. If we introduce another queue type (video decode) we can have a disconnect between the RADV_QUEUE_ enum and the API queue_family_index. currently the driver has GENERAL, COMPUTE, TRANSFER which would end up at QFI 0, 1, <nothing> since we don't create transfer. Now if I add VDEC we get GENERAL, COMPUTE, TRANSFER, VDEC at QFI 0, 1, <nothing>, 2 or if you do nocompute GENERAL, COMPUTE, TRANSFER, VDEC at QFI 0, <nothing>, <nothing>, 1 This means we have to add a remapping table between the API qfi and the internal qf. This patches tries to do that, in theory right now it just adds overhead, but I'd like to exercise these paths. v2: add radv_queue_ring abstraction, and pass physical device in, as it makes adding uvd later easier. v3: rename, and drop one direction as unneeded now, drop queue_family_index from cmd_buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13687>	2022-03-11 04:38:55 +00:00
Mike Blumenkrantz	afb4cced5c	lavapipe: more descriptor validation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14656>	2022-03-11 04:26:28 +00:00
Mike Blumenkrantz	0c130d64d3	lavapipe: validate per-stage descriptor limits when creating pipeline layouts this is super annoying to track down later, so just crash early if it's seen Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14656>	2022-03-11 04:26:28 +00:00
Mike Blumenkrantz	21abb01fb9	lavapipe: make device limits a physical device struct it's useful to have this info around and a bit simpler to gather info on init Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14656>	2022-03-11 04:26:28 +00:00
Mike Blumenkrantz	5ab0e3f0bb	anv: fix some dynamic rasterization discard cases in pipeline construction cc: mesa-stable Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15280>	2022-03-11 04:02:02 +00:00
Mike Blumenkrantz	1e3e7b3a4d	anv: fix CmdSetColorWriteEnableEXT for maximum rts Fixes: `b15bfe92f7` ("anv: implement VK_EXT_color_write_enable") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15280>	2022-03-11 04:02:02 +00:00
Mike Blumenkrantz	52f6978484	anv: fix xfb usage with rasterizer discard in the initial implementation, a stream like: * CmdBeginTransformFeedbackEXT * CmdSetRasterizerDiscardEnableEXT * CmdDraw * CmdEndTransformFeedbackEXT * CmdBeginTransformFeedbackEXT * CmdDraw * CmdEndTransformFeedbackEXT would never enable transform feedback, as it only checked for the change in rasterizer_discard state Fixes: `4d531c67df` ("anv: support rasterizer discard dynamic state") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15269>	2022-03-11 03:37:17 +00:00
Dave Airlie	e8c3be0eb8	crocus: don't map scanout buffers as write-back This essentially ports `6440523077` Author: Keith Packard <keithp@keithp.com> Date: Fri Aug 6 16:11:18 2021 -0700 iris: Map scanout buffers WC instead of WB [v2] to crocus. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15313>	2022-03-11 03:26:41 +00:00
Mike Blumenkrantz	42e78ba125	llvmpipe: fix occlusion queries with early depth test for genuine early depth tests, the samplecount must be updated after depth test but before samplemask is applied for inferred-early or regular depth tests, the samplemask can be applied before the depth test Fixes: `d9276ae965` ("llvmpipe: handle gl_SampleMask writing.") fixes: dEQP-VK.fragment_operations.early_fragment.sample_count_early_fragment_tests_depth_samples_4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15319>	2022-03-11 00:45:05 +00:00
Jason Ekstrand	f7175bf416	lavapipe: Use the common vk_enqueue_CmdBindDescriptorSets Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	ac58e93633	lavapipe: Reference count pipeline layouts Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	48a10c5dd3	lavapipe: Allocate descriptor set layouts with DEVICE scope Because they can come and go at any time, we can't use OBJECT scope because that might confuse the client allocator. Instead, use DEVICE scope and always allocate off the device allocator. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	94ea3b9c03	vulkan/cmd_queue: Add a common vk_cmd_enqueue_CmdBindDescriptorSets In order for this to work, the driver must reference-count pipeline layouts so we can take a reference while the command is in the queue. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Jason Ekstrand	c1070556a0	vulkan/cmd_queue: Add a driver_free_cb hook If a driver sets driver_data but not driver_free_cb, driver_data will get freed along with the command. If a driver sets driver_free_cb, driver_data will not get automatically freed but the callback will get called before the rest of the data structure is freed. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15329>	2022-03-10 21:08:36 +00:00
Mike Blumenkrantz	2106c3bab6	lavapipe: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15322>	2022-03-10 20:40:56 +00:00
Mike Blumenkrantz	cf5c32a4b2	lavapipe: run nir_opt_copy_prop_vars during optimization loop this enables better elimination of operations fixes: dEQP-VK.graphicsfuzz.spv-stable-mergesort-flatten-selection-dead-continues fixes #5458 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15322>	2022-03-10 20:40:56 +00:00
Mike Blumenkrantz	c94c8a7029	lavapipe: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15320>	2022-03-10 20:24:01 +00:00
Mike Blumenkrantz	6a4c7ef728	lavapipe: skip format checks for EXTENDED_USAGE we can effectively skip any kind of checks here and just assume that one of two scenarios is in effect: * the user is about to attempt some incredibly illegal behavior that VVL will catch * the user is about to attempt a pro gamer move and we'll be fine in either case, it's EXTENDED_USAGE, so hopefully we're about to make a texture view from a compatible and supported format cc: mesa-stable fixes: dEQP-VK.image.extended_usage_bit_compatibility.image_format_properties* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15320>	2022-03-10 20:24:01 +00:00
Mike Blumenkrantz	c40dc39b5a	lavapipe: use the correct value for dynamic render resolve attachment indexing subpass->color_count is (obviously) not set yet, so this would just clobber the color attachments any time resolves were used Fixes: `8a6160a354` ("lavapipe: VK_KHR_dynamic_rendering") fixes: dEQP-VK.draw.dynamic_rendering.multiple_interpolation.structured.with_sample_decoration.4_samples Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15330>	2022-03-10 19:48:59 +00:00
Dave Airlie	938488f439	lavapipe: remove broken workaround for zink depth texturing. Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15297>	2022-03-11 04:58:34 +10:00
Dave Airlie	30cb63bead	zink: workaround depth texture mode alpha. Since spir-v only has single channel depth sampling, it breaks with the old school GL_ALPHA depth mode swizzle, so just detect that case and smash all the channels. Cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15297>	2022-03-11 04:58:01 +10:00
Connor Abbott	cdee38a57b	tu: Expose subgroup arithmetic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	1a78604d20	ir3: Add support for subgroup arithmetic Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	a433db60c1	ir3: Track physical edges when inserting (ss) for shared regs Normally this wouldn't matter, but it will matter for the upcoming scan macro because the running tally is communicated through a shared register across a physical edge. It may also matter if a live-range split occurs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	410e746198	util/bitset: Fix off-by-one in __bitset_set_range Fixes: `b3b03e33c9` ("util/bitset: add BITSET_SET_RANGE(..)") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	72b32d83fb	ir3/spill: Mark reload destination as early-clobber Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	2ff5826f09	ir3/ra: Add IR3_REG_EARLY_CLOBBER We'll need this to model the subgroup reduction macros. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	34803d15ab	ir3/ra: Add proper support for multiple destinations We weren't considering the other destinations when allocating a destination, so we could allocate overlapping destinations. This wasn't done before because we never had a need for it, but the subgroup reduction macros will need it. The trickiest part of this is that we have to rewrite the compress_regs_left fallback, because we may have to move around the other already-allocated destinations. We now have a list of destinations to (re)allocate in addition to the popped live intervals. For the rest of the destination handling, we can just bail out if the proposed spot for something overlaps another destination, but for the fallback we have to handle all the cases gracefully. I also added support for odd combinations of multiple destinations where some of them are tied, which we'll use in the next commit to handle early-clobber destinations and which will actually be used because one of the destinations of the subgroup reduction macro will be early-clobber. The result is that the order of intervals to allocate is now a lot more complicated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	ab0ed4ff3f	ir3/ra: Sanitize parallel copy flags better For pcopies we only care about the register's type, i.e. whether its a half-register and whether it's an array (plus its size). Copying over other flags like IR3_REG_RELATIV just leads to sadness and validator assertions. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	0135660dfc	ir3/ra: Fix ra_foreach_dst_n Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	077d07a983	ir3/ra: Fix tied destination handling with multiple destinations Before, we were careful to 1. Get the source physreg. 2. Allocate the destination. 3. Insert a copy with the source being the physreg from step 1. and this guaranteed that if the tied source were moved in step 2 we'd still insert a copy from the correct place. However this won't work with multiple destinations because an earlier destination could've already moved the tied source around. Instead flip steps 2 and 3 (we'll insert the copy before we allocate the interval, but that's ok) and run the first two steps in a separate loop before any destinations are allocated. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	9cc42242d5	ir3/sched: Support multiple destinations Note: this is a behavior change for arrays, because it will count the entire array instead of just the components written in the register pressure calculation. However this is more accurate since this matches how RA works. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	a6be8fd0ea	ir3/dce: Support multiple destinations Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Connor Abbott	89217f636c	ir3/cp_postsched: Support multiple destinations Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14107>	2022-03-10 17:15:29 +00:00
Jason Ekstrand	cc4f0e804e	vulkan,lavapipe: Move some enqueue helpers to common code Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Boris Brezillon	b98cfe9cb4	lavapipe: Re-use auto-generated vk_cmd_enqueue entrypoints Re-use auto-generated vk_cmd_enqueue entrypoints instead of generating our own version doing the same thing. In order to effectively do this, we also add an allow-list of which entrypoints lavapipe actually handles to avoid issues where the autogenerated one stomps a vkCmdFoo2 wrapper. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Jason Ekstrand	66cb64c8ad	lavapipe: Reset the free_cmd_buffers list in TrimCommandPool We delete all the command buffers but they're still in the list so future allocations may try to re-use them post-free and another trim will re-delete them. Fixes: `b38879f8c5` ("vallium: initial import of the vulkan frontend") Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Jason Ekstrand	719b949575	vulkan/cmd_queue: Generate enqueue entrypoints Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Boris Brezillon	47fc0b39c7	vulkan/cmd_queue: Properly deconstify array of pointers When manipulating an array of pointers, what we want to desconstify is the array, not the entry type. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Jason Ekstrand	0d5dc6ef6a	vulkan/cmd_queue: Stop generating enqueue helpers for INTEL perf queries They don't return void and they're not used by anyone except the Intel drivers so there's no point in supporting them. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Jason Ekstrand	cf8cf8a827	vulkan/cmd_queue: Re-flow MANUAL_COMMANDS This just makes it all a bit easier to read. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Boris Brezillon	290e33ab20	vulkan/cmd_queue: Remove duplicate entries in MANUAL_COMMANDS Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Louis-Francis Ratté-Boulianne	6bd8a3c7e4	vulkan/runtime: Add a vk_cmd_queue object to vk_command_buffer This is paving the road for generic secondary command buffer support, where commands are simply recorded in a software queue and replayed on the primary command buffer when vkCmdExecuteCommands() is called. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Louis-Francis Ratté-Boulianne	ad4d2da90a	vulkan/cmd_queue: Add an initializer for the vk_cmd_queue object Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Boris Brezillon	dd0f6cb45b	vulkan/cmd_queue: Constify vk_cmd_queue.alloc The implementation shouldn't modify the allocator. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15311>	2022-03-10 15:52:10 +00:00
Mike Blumenkrantz	a3d096f4ba	lavapipe: add the full list of cts fails easier to keep track this way Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15294>	2022-03-10 10:03:16 -05:00
Akihiko Odaki	b70f14188d	virgl: Check texture multisample compatibility v2: Support VIRGL_FORMAT_NONE (Gert Wollny) Signed-off-by: Akihiko Odaki <akihiko.odaki@gmail.com> Suggested-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15115>	2022-03-10 10:34:12 +00:00
Akihiko Odaki	571c5e8fdc	virgl/ci: Uprev virglrenderer Suggested-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Akihiko Odaki <akihiko.odaki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15115>	2022-03-10 10:34:12 +00:00
Danylo Piliaiev	c4703cd846	tu: Implement VK_EXT_depth_clip_control Since negativeOneToOne is a static property of the pipeline and viewport state could be dynamic, we have to defer viewport state emission until negativeOneToOne value is known. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6070 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14363>	2022-03-10 11:08:50 +02:00
Iago Toral Quiroga	49b5431197	broadcom/compiler: remove unused functions Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15302>	2022-03-10 07:25:37 +00:00
Dylan Baker	45770ac286	docs: add release notes for 22.0.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15312>	2022-03-09 22:47:56 +00:00
Dylan Baker	8474817253	docs: Add calendar entries for 22.0 release. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15312>	2022-03-09 22:47:56 +00:00
Dylan Baker	b7e1df14f0	docs: update calendar and link releases notes for 22.0.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15312>	2022-03-09 22:47:56 +00:00
Timur Kristóf	75a783ea73	ac: Query the amdgpu MEC firmware version. MEC (Micro Engine Compute) is the firmware which is responsible for the compute-only queues on AMD GPUs. It is present on GFX7 and newer. This patch will query the version of this firmware and print it among the others. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15283>	2022-03-09 21:31:48 +00:00
Rob Clark	f4ec900953	mesa: Fix discard_framebuffer for fbo vs winsys GL is annoying when it comes to having different enums for winsys vs fbo. Note that the issue this closes was only accidentially exposed by a change the resulted in sysmem vs GMEM path taken. Fixes: `db2ae51121` ("mesa: Skip partial InvalidateFramebuffer of packed depth/stencil.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6103 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15308>	2022-03-09 20:40:53 +00:00
Emma Anholt	d5d8519cb5	docs/ci: Add docs for using a POE switch to control boards, like nouveau. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	e8da28d5e8	docs/ci: Update some bare-metal CI docs. We haven't been using initramfs in a long time, don't point people that direction. Do point people at existing instances of these CI variants, though. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	5497d60639	ci/nouveau: Add a manual run for the Jetson Nano (GM20B). The test suite is full of flakes around transform feedback, atomics, and tess. But, I hope it can be useful for regression testing core Mesa reworks. This required updating the kernel to 5.16.12 to get a more stable boot process. That kernel rebuild caused an update of the container with piglit which that was missed in a previous MR, so we got new xfails in x86 swrast. Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> (nouveau) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	1b374f8c91	ci/nouveau: Add nouveau support to the rootfs. This required updating the kernel to 5.16.12 to get a more stable boot process. That kernel rebuild caused an update of the container with piglit which that was missed in a previous MR, so we got new xfails in x86 swrast. Also, including modules on arm64 exposed a bug in v3d's poe-powered.sh rsyncing of modules. Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	a9e67738d6	ci: Stop xz-compressing firmware for ramdisks. This ends up breaking nouveau because the renames break symlinks in the firmware directory structure. We don't need it any more since we stopped doing ramdisks. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	9b918c4df2	ci/bare-metal: Increase maximum retry count for POE boots. The manual jetson CI job I'm introducing has serious boot reliability trouble, but also we've seen frequent intermittent failures on bcm where at least 2 boots don't seem to be enough (#6041). Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Emma Anholt	45b7648cb1	ci/bare-metal: Drop the BM_POE_USERNAME/PASSWORD env var checks. They're unused since the transition to SNMP in the rpi test farm. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15201>	2022-03-09 19:47:04 +00:00
Mike Blumenkrantz	c24bca2d3a	zink: lower dmod on AMD hardware this hardware won't return the correct value from dmod instructions, so lower it to ensure that cts passes nobody else will ever hit this, so perf isn't an issue and regular fmod can be left alone fixes (amd): KHR-GL46.gpu_shader_fp64.builtin.mod_d* Fixes: `5fae35fb17` ('zink: fix 64bit float shader ops ') Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15306>	2022-03-09 19:13:02 +00:00
Mike Blumenkrantz	1845957a31	zink: add another radv fail it looks like this one was erroneously excluded Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15307>	2022-03-09 14:00:06 -05:00
Mike Blumenkrantz	e70b6be117	zink: update radv fails Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15307>	2022-03-09 14:00:06 -05:00
Chia-I Wu	889d050739	venus: add VK_EXT_vertex_attribute_divisor Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	4752429e36	venus: add VK_EXT_shader_stencil_export Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	1ecd481bd7	venus: add VK_EXT_robustness2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	25795308ef	venus: add VK_EXT_depth_clip_enable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	bebe5e3925	venus: add VK_EXT_conservative_rasterization Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	f0e0daf46b	venus: add VK_EXT_shader_demote_to_helper_invocation Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Chia-I Wu	99473f610a	venus: update venus-protocol headers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15265>	2022-03-09 17:24:49 +00:00
Marcin Ślusarz	823cffbe1c	anv: include Primitive Header in mesh shader per-primitive output Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	f410c1142f	anv: set number of viewports in clip state (mesh) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	81df66bfff	intel/compiler: mark some variables as per-primitive in FS if they come from MS Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	8c16ce53a9	intel/compiler: handle ViewportIndex, PrimitiveID and Layer in MUE setup Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	bc4f8c073a	intel/compiler: inject MUE initialization Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Marcin Ślusarz	333a490e32	intel/compiler: shift mesh urb read/write window when offset is too large Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15303>	2022-03-09 16:52:59 +00:00
Samuel Pitoiset	6c1c9067d9	aco: always emit vk_cvt_pkrtz_f16_f32 for nir_op_pack_half_2x16_split From the VK_KHR_shader_float_controls extension: "5) Do any of the “Pack” GLSL.std.450 instructions count as conversion instructions and have the rounding mode applied?" "RESOLVED: No, only instructions listed in “section 3.32.11. Conversion Instructions” of the SPIR-V specification count as conversion instructions." This is also the same logic as the LLVM backend. No fossils-db changes on Sienna Cichlid. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15301>	2022-03-09 16:24:20 +00:00
Erik Faye-Lund	fa41bd0687	docs: improve language in zink article Turns out, this was not proper use of language! Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15300>	2022-03-09 16:19:16 +00:00
Erik Faye-Lund	e666134975	docs: fixup zink gl 4.3 requirements The multiViewport feature isn't required for GL 4.3, it's required for GL 4.1. Technically speaking, we could have just dropped it because we already list the maxViewports requirement. But it seems better to be very clear here to me. Fixes: `29f8f21bff` ("docs: document zink GL 4.3 requirements") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15300>	2022-03-09 16:19:16 +00:00
Iago Toral Quiroga	44feff93c2	broadcom/compiler: don't always assign r5 if available Instead, only favor assigning r5 if we have first decided to assign an accumulator. This helps with assining r5 to short lived uniforms, favoring accumulator rotation to facilitate QPU merges. total instructions in shared programs: 12656164 -> 12628339 (-0.22%) instructions in affected programs: 5368373 -> 5340548 (-0.52%) helped: 17420 HURT: 9996 total uniforms in shared programs: 3704776 -> 3704863 (<.01%) uniforms in affected programs: 12247 -> 12334 (0.71%) helped: 23 HURT: 78 total max-temps in shared programs: 2153505 -> 2152684 (-0.04%) max-temps in affected programs: 26468 -> 25647 (-3.10%) helped: 569 HURT: 328 total fills in shared programs: 4656 -> 4657 (0.02%) fills in affected programs: 43 -> 44 (2.33%) helped: 0 HURT: 1 total sfu-stalls in shared programs: 34728 -> 34403 (-0.94%) sfu-stalls in affected programs: 3411 -> 3086 (-9.53%) helped: 842 HURT: 534 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	77f58b46d9	broadcom/compiler: add comment on why we don't use r5 with ldunifa Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	5b140428b0	broadcom/compiler: adjust register threshold for 2-thread compiles We have twice the registers in this case so it makes sense to double this as well. While this causes slight regressions in shader-db stats (due to additional register pressure), it helps us hide latency of memory reads better on 2-thread compiles, where the thread switch mechanism will be less effective. This shows a ~3% performance improvement on the UE4 SunTemple demo. total instructions in shared programs: 12642413 -> 12656164 (0.11%) instructions in affected programs: 2272652 -> 2286403 (0.61%) helped: 2924 HURT: 3389 total uniforms in shared programs: 3703861 -> 3704776 (0.02%) uniforms in affected programs: 213729 -> 214644 (0.43%) helped: 823 HURT: 1272 total max-temps in shared programs: 2150686 -> 2153505 (0.13%) max-temps in affected programs: 191332 -> 194151 (1.47%) helped: 1900 HURT: 1891 total spills in shared programs: 3255 -> 3274 (0.58%) spills in affected programs: 166 -> 185 (11.45%) helped: 3 HURT: 6 total fills in shared programs: 4630 -> 4656 (0.56%) fills in affected programs: 367 -> 393 (7.08%) helped: 7 HURT: 15 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	a35b47a0b1	broadcom/compiler: add a strategy to disable scheduling of general TMU reads This can add quite a bit of register pressure so it makes sense to disable it to prevent us from dropping to 2 threads or increase spills: total instructions in shared programs: 12672813 -> 12642413 (-0.24%) instructions in affected programs: 256721 -> 226321 (-11.84%) helped: 719 HURT: 77 total threads in shared programs: 415534 -> 416322 (0.19%) threads in affected programs: 788 -> 1576 (100.00%) helped: 394 HURT: 0 total uniforms in shared programs: 3711370 -> 3703861 (-0.20%) uniforms in affected programs: 28859 -> 21350 (-26.02%) helped: 204 HURT: 455 total max-temps in shared programs: 2159439 -> 2150686 (-0.41%) max-temps in affected programs: 32945 -> 24192 (-26.57%) helped: 585 HURT: 47 total spills in shared programs: 5966 -> 3255 (-45.44%) spills in affected programs: 2933 -> 222 (-92.43%) helped: 192 HURT: 4 total fills in shared programs: 9328 -> 4630 (-50.36%) fills in affected programs: 5184 -> 486 (-90.62%) helped: 196 HURT: 0 Compared to the stats before adding scheduling of non-filtered memory reads we see we that we have now gotten back all that was lost and then some: total instructions in shared programs: 12663186 -> 12642413 (-0.16%) instructions in affected programs: 2051803 -> 2031030 (-1.01%) helped: 4885 HURT: 3338 total threads in shared programs: 415870 -> 416322 (0.11%) threads in affected programs: 896 -> 1348 (50.45%) helped: 300 HURT: 74 total uniforms in shared programs: 3711629 -> 3703861 (-0.21%) uniforms in affected programs: 158766 -> 150998 (-4.89%) helped: 1973 HURT: 499 total max-temps in shared programs: 2138857 -> 2150686 (0.55%) max-temps in affected programs: 177920 -> 189749 (6.65%) helped: 2666 HURT: 2035 total spills in shared programs: 3860 -> 3255 (-15.67%) spills in affected programs: 2653 -> 2048 (-22.80%) helped: 77 HURT: 21 total fills in shared programs: 5573 -> 4630 (-16.92%) fills in affected programs: 3839 -> 2896 (-24.56%) helped: 81 HURT: 15 total sfu-stalls in shared programs: 39583 -> 38154 (-3.61%) sfu-stalls in affected programs: 8993 -> 7564 (-15.89%) helped: 1808 HURT: 1038 total nops in shared programs: 324894 -> 323685 (-0.37%) nops in affected programs: 30362 -> 29153 (-3.98%) helped: 2513 HURT: 2077 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	f783bd0d2a	broadcom/compiler: define v3d-specific delays for NIR instructions We do a few changes over NIR's defaults: 1. Lower delay for texture reads. Empirically, we don't observe any benefits with delays over 50 and since this delay value is still used by the scheduler in the "favor register pressure" case it is benefitial to avoid overestimating it too much. 2. Adjust delay for non-filtered TMU reads to the delay selected for texture reads. 3. In our case, UBO reads from dynamically uniform addresses don't use the TMU and have a latency of 1 instruction in the best case scenario or 4 at worse, so we go with 1 so we don't try to move this early. This helps us get back some of what we lost when updating the default scheduler configuration to add a delay for non-filtered memory reads: total instructions in shared programs: 13126587 -> 12671765 (-3.46%) instructions in affected programs: 3764097 -> 3309275 (-12.08%) helped: 14664 HURT: 4244 total threads in shared programs: 407208 -> 415522 (2.04%) threads in affected programs: 8716 -> 17030 (95.39%) helped: 4224 HURT: 67 total uniforms in shared programs: 3812698 -> 3711224 (-2.66%) uniforms in affected programs: 335170 -> 233696 (-30.28%) helped: 2816 HURT: 3551 total max-temps in shared programs: 2318430 -> 2159345 (-6.86%) max-temps in affected programs: 539991 -> 380906 (-29.46%) helped: 13173 HURT: 1440 total spills in shared programs: 49086 -> 5966 (-87.85%) spills in affected programs: 48306 -> 5186 (-89.26%) helped: 1655 HURT: 28 total fills in shared programs: 55810 -> 9328 (-83.29%) fills in affected programs: 54821 -> 8339 (-84.79%) helped: 1659 HURT: 22 LOST: 0 GAINED: 3 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	fed51585c4	nir/schedule: allow drivers to decide about instruction latency On V3D reading UBOs from uniform addresses uses a more efficient mechanism with lower latency. On other platforms there may be simular scenarios. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	e7a4e97076	nir/schedule: use larger delay for non-filtered memory reads This has been pending for a long time. It is not very consistent to add a significant delay for textures and not do it for UBOs, etc The reason we have not been doing this so far is the accumulated effect on register pressure for V3D as shown by shader-db results below, but from the point of view of a generic scheduler it makes sense to do this. Later patches will address V3D specific issues with register pressure derived from this by letting the driver control its instruction delay settings. total instructions in shared programs: 12662138 -> 13126587 (3.67%) instructions in affected programs: 1813091 -> 2277540 (25.62%) helped: 2410 HURT: 10499 total threads in shared programs: 415858 -> 407208 (-2.08%) threads in affected programs: 17348 -> 8698 (-49.86%) helped: 8 HURT: 4333 total uniforms in shared programs: 3711483 -> 3812698 (2.73%) uniforms in affected programs: 128012 -> 229227 (79.07%) helped: 3474 HURT: 2143 total max-temps in shared programs: 2138763 -> 2318430 (8.40%) max-temps in affected programs: 318780 -> 498447 (56.36%) helped: 588 HURT: 11997 total spills in shared programs: 3860 -> 49086 (1171.66%) spills in affected programs: 709 -> 45935 (6378.84%) helped: 23 HURT: 1595 total fills in shared programs: 5573 -> 55810 (901.44%) fills in affected programs: 1067 -> 51304 (4708.25%) helped: 23 HURT: 1595 LOST: 3 GAINED: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	3bd041e2fb	nir/schedule: handle nir_intrinsic_group_memory_barrier Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	46e330c07e	nir/schedule: fix handling of generic memory barrier We can get a generic nir_intrinsic_memory_barrier to represent a barrier involving multiple semantics (instead of getting individual specific barriers for each semantic). This means that we need to consider these as potentially affecting shared memory access as well. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:04 +00:00
Iago Toral Quiroga	9ef499b315	broadcom/compiler: stop moving UBO loads before NIR scheduling This doesn't have any significant impact shader-db stats and would reduce our capacity to hide latency from the loads, so it is probably undesirable: total instructions in shared programs: 12663189 -> 12663186 (<.01%) instructions in affected programs: 4222 -> 4219 (-0.07%) helped: 9 HURT: 4 total uniforms in shared programs: 3711624 -> 3711629 (<.01%) uniforms in affected programs: 186 -> 191 (2.69%) helped: 0 HURT: 2 total max-temps in shared programs: 2138822 -> 2138857 (<.01%) max-temps in affected programs: 569 -> 604 (6.15%) helped: 1 HURT: 9 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15276>	2022-03-09 15:53:03 +00:00
Michel Zou	bf7777a5d4	lavapipe: set non-zero device/driver uuid Closes #5875 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15230>	2022-03-09 15:10:45 +00:00
Danylo Piliaiev	2e878293f4	turnip: Make autotuner work with reusable command buffers To achieve it each command buffer now has its own GPU memory. However the BOs usage by autotuner is not optimal, the ideal pattern would be to use some memory pool to suballocate small GPU memory chunks, since most command buffers have only a few renderpasses. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5990 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14996>	2022-03-09 12:56:31 +00:00
Gert Wollny	d9e400b9b6	virgl: Add a few more formats to the format table These formats are used by the piglit arb_texture_buffer_object-formats fs arb Adding them here keeps the piglit from crashing, but most of the related tests don't pass. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13645>	2022-03-09 09:59:53 +00:00
Kenneth Graunke	8b9045e7a4	intel: Use 3DSTATE_BINDING_TABLE_POOL_ALLOC exclusively on Gfx11+ On Icelake and later, we can use a new 3DSTATE_BINDING_TABLE_POOL_ALLOC command to update the location of the binder (buffer containing binding table entries), rather than having to move Surface State Base Address via a STATE_BASE_ADDRESS command. This has less stalling and also means our surface addresses can remain relative to a fixed 4GB address range, meaning we don't have to re-stream them any time the binder changes. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	e3a0e97300	intel: Limit Wa_1607854226 to Gfx12.0 only This workaround is needed on all Gfx12.0 parts, but doesn't appear to be necessary on XeHP. The other drivers do not appear to be applying this workaround on those parts. As further evidence, we accidentally added the 3DSTATE_BINDING_TABLE_POOL_ALLOC commands after switching back to GPGPU mode, which would be an incorrect way to implement the workaround, and things seem to be working. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	ab47cad4fb	iris: Rename surface_base_address to binder_address in a few places On Gfx11+, we're going to stop changing Surface State Base Address and instead start changing the Binding Table Pool address instead. So, rename a few things to track the last binder address, which is what we're actually changing, regardless of how we program it. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Kenneth Graunke	db34c71513	iris: Use more efficient binding table pointer formats on Icelake+. Skylake and older use a 15:5 binding table pointer format, which means our binder can be at most 64kB in size. Each binding table within the binder must be aligned to 32B. XeHP uses a new 20:5 binding table format, which allows us to increase the binder size to 1MB while retaining the nice 32B alignment. Larger binders mean fewer stalls as we update the base address for the binder. Icelake and Tigerlake can either use the 15:5 format or an 18:8 format. 18:8 mode requires the base of each binding table to be aligned to 256B instead of 32B, but it gives us a maximum binder size of 512kB. We can store 64 binding table entries in a 256B chunk (256B / 4B = 64), but only 8 entries in a 32B chunk (32B / 4B = 8). Assuming that most binding tables have fewer than 64 entries, this means that with the 18:8 format, we're likely to be able to fit 2048 (512KB / 256B) tables into a a buffer before needing to allocate a new one and stall. Technically, the old format could also store 2048 binding tables per buffer as well (64KB / 32B = 2048). However, tables that needed more than 8 entries would need multiple 32B chunks. A single table would take multiple aligned chunks, while with the larger 256B format, it could fit in a single one. This cuts binder resets by 6.3% on a Shadow of Mordor benchmark trace. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Jason Ekstrand	a83c91a261	blorp: Add a binding_table_offset_to_pointer helper On Gen11+, we have a feature that requires us to shift binding table offsets by 3. This adds a helper which gives the driver a hook to do this if it so chooses. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14507>	2022-03-09 09:18:59 +00:00
Pierre-Eric Pelloux-Prayer	3c3a8f853d	gallium/tc: zero alloc transfers Otherwise this causes trouble with unitialized memory, eg with: struct si_transfer { struct threaded_transfer b; struct si_resource *staging; }; 'staging' will not be initialized and this causes #6109. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6109 Cc: mesa-stable Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15298>	2022-03-09 08:48:59 +00:00
Pierre-Eric Pelloux-Prayer	caeec6262d	util/slab: add slab_zalloc A a variant that clears the allocated object to 0. Cc: mesa-stable Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15298>	2022-03-09 08:48:59 +00:00
Danylo Piliaiev	2cd30266f1	tu: Refactor VS DECODE/DEST to be emitted in two pkt4 Refactor to emit VFD_DECODE and VFD_DEST_CNTL in two packets regardless of attribute count. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14584>	2022-03-09 08:21:40 +00:00
Mike Blumenkrantz	fa323cb93a	mesa/st: make export_point_size shader key clobber existing psiz this is necessary to upload the API value using the uniform constant Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	9ae0cdc453	mesa/st: check max output components for adding pointsize during precompile Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	070a7b506d	mesa/st: count FF shaders as needing psiz export for precompile this is consistent with logic for regular compiles Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	1183988d74	mesa/st: precompile with API pointsize only if the shader doesn't have pointsize this is a more accurate hint, and maintains the existing behavior given subsequent changes to this area Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	96098d7c25	mesa/st: simplify pointsize precompile conditional Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	882abacde3	mesa/st: simplify pointsize shader update conditional ES contexts have no API toggle for this, so it will never be flagged Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	b73663c51e	mesa: always set PointSizeEnabled for API_OPENGLES2 this is implicit, so make it explicit Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	9d98bc238e	mesa/st: only add pointsize output if it doesn't exceed max component limit fixes (zink-nvidia): dEQP-GLES31.functional.geometry_shading.basic* Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	0d80aed363	nir/gather_info: check copy_deref instrs for writing outputs this is a valid way to write an output even though it usually gets rewritten to some other instruction later on Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	fdee7240ff	mesa/st: conditionally add pointsize outputs to ES tess/geom shaders if the driver requires this value to be set, add it if the shader doesn't use the ext to allow exporting it Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	597bd11b7b	mesa/st: add a gl_program struct flag to skip psiz exports for xfb if this output did not exist in the original shader, then it must not be exported in xfb Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Mike Blumenkrantz	53cbba83eb	glsl: store OES/EXT point_size extension enablement to shader struct Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15228>	2022-03-09 05:10:21 +00:00
Boris Brezillon	02906baba7	panvk: Implement vkCmdDispatch() Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	1056b3e39e	panvk: Add support for storage image Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	eca0a0e29e	panvk: Move dummy attribute buffer emission out of emit_{attribute,varying}_bufs So we can easily add entries after the standard varyings/attributes (like image descriptors in the attribute array). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	0e7e1b64fc	panvk: Add support for storage/uniform buffers with dynamic offsets The idea of storing offsets in a separate UBO and lowering accesses to UBOs/SSBOs with a dynamic offset was not great. Let's apply the offset at UBO/SSBO emission time instead. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	13378e4129	panvk: Support creation of compute pipelines Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	b05ffc9fec	panvk: Add support for storage buffers Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Boris Brezillon	9a20467cf2	panvk: Add support for push constants Push constants are stored in a separate UBO. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15248>	2022-03-09 04:50:41 +00:00
Mike Blumenkrantz	3634063b08	zink: handle spirv xfb insanity this comes in two flavors: * streamout of array<struct/block> * partial streamout of array/struct/block for the former: * arrays of structs can just be blasted out in the initial var declaration (easy) * arrays of blocks must be output to separate xfb buffers for each array block, which requires skipping initial xfb blast-off and instead propagating the values using tmp variables at a later point for the latter: * the optimal way to do this is to unwrap the struct first to figure out what's being emitted, at which point the value can be extracted and exported fixes the rest of spec@arb_gl_spirv@execution@xfb ...except spec@arb_gl_spirv@execution@xfb@vs_block_array, which I'm suspecting is broken due to vtn bugs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	dfbb7c1f73	zink: store shader to ntv_context it's insane the gymnastics that have to be done because this wasn't stored Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	76fe6f7235	zink: handle remaining xfb corner cases during analysis this now handles inlining of stupid types (dvec3, dmatX) and complex types (goku) as seen in cts fixes: KHR-Single-GL46.enhanced_layouts.xfb_explicit_location KHR-Single-GL46.enhanced_layouts.xfb_struct_explicit_location Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	3f7da0c584	zink: fix xfb analysis variable finding for arrays this fixes clipdistance exports cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	4ed7329236	zink: correctly set xfb packed output offsets cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	432700fc61	zink: store the correct number of components for xfb packing outputs cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	a5c7d34fdf	zink: use 64bit mask for xfb analysis I don't know how this worked before since all the values are oob? cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15224>	2022-03-09 04:39:52 +00:00
Mike Blumenkrantz	a0feafbcc8	ci: more stoney flakes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	ddfa9e7cc9	ci: add another stoney flake from #6109 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	6ab720f1f4	aux/cso: stop tracing during cso_unbind() this unnecessarily bloats lavapipe traces Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	0e51c47816	aux/trace: dump more rasterizer state members Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	ec45b7ed32	aux/trace: dump clear_texture colors Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	296e26eec8	aux/trace: dump clear colors as uints dumping as float is nice if the clear color is a float, but if it isn't then the value is useless Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	8142fc5a45	aux/trace: rzalloc the context struct this has problems if pointers are garbage cc: mesa-stable Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	f1cdaf36df	aux/trace: more screen methods Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14854>	2022-03-09 03:54:37 +00:00
Mike Blumenkrantz	204ea77b06	lavapipe: fix pipeline creation for blend and zs states these values are read based on the specified subpass containing the required attachments, not on the overall renderpass cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15282>	2022-03-09 01:13:42 +00:00
Mike Blumenkrantz	7114490115	lavapipe: update multisample state after blend state null blend pipeline state will zero the blend struct, which would cause values set here to be overwritten cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15282>	2022-03-09 01:13:42 +00:00
Rob Clark	711f0d1df4	turnip: Don't call getenv() directly I noticed it was using getenv directly when I tried to use 'setprop mesa.tu.debug ..' on android. Use os_get_option() instead so we get sysprop fallback on android. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15289>	2022-03-09 00:22:36 +00:00
Gert Wollny	c9d99b7eec	virgl: Fix texture transfers by using a staging resource This commit fixes the following flaws in the implementation: * when a resource was re-allocated, the guest side storage was also allocated * when a source needs a readback before being written to, then the call would go through vws->transfer_get, thereby bypassing the staging resource, and this would fail on the host, because no the allocated IOV was too small (just one byte) * if the texture write would need neither flush nor readback, the old code path would be used expecting that guest side backing stogage for the texture. v2: - actually do a readback to the stageing resource when it is required - fix typo (Lepton) v3: Don't use stageing transfers if the host can't read back the data by rendering to an FBO or calling getTexImage, because in this case we rely on the IOV to hold the date. v4: Also don't use staging transfers if the format is no readback format. Otherwise we have to deal with the resolve blit, and this is currently not working correctly. v5: add a new flag that indicates whether non-renderable textures can be read back (either via glGetTexImage or GBM) v6: Restrict the use of staging texture transfers to textures that can be read back, and on GLES also if the they are bound to scanout and the host uses minigbm to allocate such textures. For that replace the flag indicating the capability to read back non-renderable textures with a cap that indicates whether scanout textures can be read back. v7: update virglrenderer version in the CI v8: update use of stageing (Chia-I) v9: remove superflous check and assignment (Chia-I) v10: disable stageing textures for arrays with stencil format. This is a workaround for failures of the CI. Fixes: `cdc480585c` virgl/drm: New optimization for uploading textures Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14495>	2022-03-08 23:39:27 +01:00
Mike Blumenkrantz	165a880f1a	llvmpipe: clamp surface clear geometry avoid oob writes to avoid crashing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14655>	2022-03-08 22:03:28 +00:00
Mike Blumenkrantz	2d1b506acf	lavapipe: clamp clear attachments rects there is at least one unnamed game which has problems with this, so try to avoid crashing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14655>	2022-03-08 22:03:28 +00:00
Mike Blumenkrantz	4c76a19ca3	llvmpipe: fix debug print iterating in set_framebuffer_state this would potentially access garbage memory by checking the existing state using the incoming state's iterator values cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14857>	2022-03-08 21:42:02 +00:00
Mike Blumenkrantz	3ef093f697	zink: ci updates cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15274>	2022-03-08 21:30:29 +00:00
Mike Blumenkrantz	5fae35fb17	zink: fix 64bit float shader ops this was being set from back before zink actually supported 64bit natively and only 32bit was functional, but it breaks 64bit support cc: mesa-stable fixes (lavapipe): KHR-GL46.gpu_shader_fp64.builtin.mod_dvec2 KHR-GL46.gpu_shader_fp64.builtin.mod_dvec3 KHR-GL46.gpu_shader_fp64.builtin.mod_dvec4 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15274>	2022-03-08 21:30:29 +00:00
Mike Blumenkrantz	9579df6a7f	zink: run nir_lower_phis_to_scalar in optimization loop fixes (lavapipe): dEQP-GLES3.functional.shaders.switch.switch_in_do_while_loop_dynamic* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15274>	2022-03-08 21:30:29 +00:00
Samuel Pitoiset	342e6f8332	radv,aco,llvm: lower post shuffle vertex in NIR fossils-db (Sienna Cichlid): Totals from 774 (0.57% of 134913) affected shaders: VGPRs: 26496 -> 26312 (-0.69%) CodeSize: 1825936 -> 1828812 (+0.16%); split: -0.04%, +0.20% MaxWaves: 22046 -> 22062 (+0.07%) Instrs: 347634 -> 347975 (+0.10%); split: -0.05%, +0.15% Latency: 1363949 -> 1356426 (-0.55%); split: -0.59%, +0.04% InvThroughput: 221529 -> 221380 (-0.07%); split: -0.10%, +0.04% VClause: 5682 -> 5676 (-0.11%); split: -1.46%, +1.36% SClause: 7485 -> 7411 (-0.99%); split: -1.48%, +0.49% Copies: 30481 -> 30420 (-0.20%); split: -0.51%, +0.31% PreVGPRs: 19717 -> 19656 (-0.31%) fossil-db (Polaris10): Totals from 896 (0.66% of 135960) affected shaders: SGPRs: 49824 -> 49648 (-0.35%); split: -0.39%, +0.03% VGPRs: 31040 -> 29948 (-3.52%); split: -3.62%, +0.10% CodeSize: 875960 -> 875920 (-0.00%); split: -0.06%, +0.05% MaxWaves: 6380 -> 6429 (+0.77%) Instrs: 171522 -> 171482 (-0.02%); split: -0.07%, +0.05% Latency: 1356082 -> 1334386 (-1.60%); split: -1.61%, +0.01% InvThroughput: 553389 -> 552957 (-0.08%); split: -0.08%, +0.00% VClause: 4317 -> 4244 (-1.69%); split: -2.41%, +0.72% SClause: 6157 -> 6139 (-0.29%); split: -0.45%, +0.16% Copies: 9340 -> 9235 (-1.12%); split: -1.24%, +0.12% PreVGPRs: 22366 -> 22116 (-1.12%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15113>	2022-03-08 19:18:01 +00:00
Timur Kristóf	4b99b528f5	nir: Introduce workgroup_index and ability to lower workgroup_id to it. The workgroup_index is intended for situations when a 3 dimensional workgroup_id is not available on the HW, but a 1 dimensional index is. In this case, we can use lower the 3D ID to use this. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Timur Kristóf	6a4c01f3ef	nir: Extract lower_id_to_index into a separate function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Timur Kristóf	64acec0ef9	nir: Fix lowering terminology of compute system values: "from"->"to". This is to match other NIR terminology. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15103>	2022-03-08 17:36:31 +00:00
Jason Ekstrand	541f08cd4c	panvk: Non-destructively stub GetRenderAreaGranularity Don't crash. Just print a warning and return 1x1. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15285>	2022-03-08 17:03:47 +00:00
Jason Ekstrand	afe2ef9afc	panvk: Advertise zero sparse format properties This is the correct implementation when you don't support sparse and fixes piles of CTS crashes. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15285>	2022-03-08 17:03:47 +00:00
Jason Ekstrand	8feed1f114	panvk: Advertise VK_KHR_get_physical_device_properties2 All the entrypooints are already implemented and a bunch of Vulkan CTS tests assume this extension exists. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15285>	2022-03-08 17:03:47 +00:00
Rob Clark	3937c47c48	gallium/dri: Add missing in_fence_fd initialization Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6108 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15272>	2022-03-08 16:38:00 +00:00
Yogesh Mohan Marimuthu	3332fcd9c0	vulkan/device_select: add has_vulkan11 flag with has_pci_bus flag In EnumeratePhysicalDevices(), pci bus info is available only in vulkan version >= 1.1. hence adding has_vulkan11 flag in places where has_pci_bus is used in EnumeratePhysicalDevices() code flow. Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14535>	2022-03-08 14:23:41 +00:00
Yogesh Mohan Marimuthu	1cc549949d	vulkan/device_select: for vulkan 1.0 use vid/did for boot_vga In device select layer EnumeratePhysicalDevices() function pci bus information is available only in case of vulkan >= 1.1. Hence use vid/did to match boot_vga device in case of vulkan 1.0. Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14535>	2022-03-08 14:23:41 +00:00
Timur Kristóf	5b9bf3434f	nir: Fix handling of NV_mesh_shader PRIMITIVE_INDICES output. PRIMITIVE_INDICES is a flat array in NV_mesh_shader, not a proper arrayed output, as opposed to D3D-style mesh shaders where it's addressed by the primitive index. Prevent assigning several slots to primitive indices, to avoid issues. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15160>	2022-03-08 13:44:10 +00:00
Rhys Perry	d068eb53e8	aco/insert_exec_mask: optimize top-level transition to exact before demote fossil-db (Sienna Cichlid): Totals from 5767 (3.55% of 162293) affected shaders: Instrs: 3264949 -> 3257527 (-0.23%); split: -0.23%, +0.00% CodeSize: 17835692 -> 17806004 (-0.17%); split: -0.17%, +0.00% Latency: 45990060 -> 45987924 (-0.00%); split: -0.00%, +0.00% InvThroughput: 7643850 -> 7643835 (-0.00%); split: -0.00%, +0.00% Copies: 193641 -> 186219 (-3.83%); split: -3.84%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15244>	2022-03-08 12:49:59 +00:00
Rhys Perry	42a5be975a	aco/insert_exec_mask: use get_exec_op No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15244>	2022-03-08 12:49:59 +00:00
Rhys Perry	aa55ecc296	aco/insert_exec_mask: fix top-level to-exact with non-global exact mask After transitioning to exact after a discard, the exec stack might be: [exact\|global, wqm, exact] Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15244>	2022-03-08 12:49:59 +00:00
Cristian Ciocaltea	ded789fcaf	radeonsi/ci: Mark a bunch of flaky tests on stoney radeonsi-stoney-gl:amd64 job fails due to random crashes of some 'dEQP-GLES3.functional.buffer.map.write.explicit_flush.*' tests. Fix the pipeline by adding them to 'radeonsi-stoney-flakes.txt'. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15238>	2022-03-08 12:52:54 +02:00
Cristian Ciocaltea	090f6f1b33	ci/zink: Report flake test Mark 'KHR-GL46.shader_image_load_store.advanced-sso-subroutine' as flake since it failed several times while attempting to merge this MR. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15238>	2022-03-08 12:52:54 +02:00
Cristian Ciocaltea	e8882f0ef8	ci: Improve interrupt signal handling in crosvm-runner.sh Run crosvm as a background process in order to allow intercepting interrupt signals (INT, TERM) and properly release/cleanup any allocated resources. This is particularly helpful when one or more crosvm tasks hang, which will eventually prevent subsequent instances to be started - currently we can handle up to 128 concurrent crosvm instances per runner. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15238>	2022-03-08 12:52:54 +02:00
Cristian Ciocaltea	1f0a0839eb	ci: Increase limit of concurrent crosvm instances per runner Ensure we can handle up to 128 concurrent crosvm instances per runner with the current CID generator. This is a safety margin for the new 64-core runners. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15238>	2022-03-08 12:52:54 +02:00
Dave Airlie	8346983775	gallivm/nir: extract a valid texture index according to exec_mask. When using indirect textures, some lanes may not be active, particularly in a loop, so as with some other areas, extracting the correct lane is needed here. This extracts the last valid one. KHR-GL45.texture_barrier.* on zink. Fixes: `e168d148d7` ("gallivm/nir: handle non-uniform texture offsets") Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15259>	2022-03-08 08:42:06 +00:00
Samuel Pitoiset	e449acac9d	radv/ci: remove unused files These files are no longer used. Fixes: `cc327a0fe4` ("amd, ci: Remove unused runners.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Charlie Turner <cturner@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15264>	2022-03-08 08:25:34 +01:00
Ilia Mirkin	4e45847d0a	freedreno: add a420 deqp-runner files This doesn't actually get run in CI, but this helps track outstanding issues / expectations. This is from a run on my IFC6540 with A420. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	08db7f456a	freedreno/a4xx: expose shaders and images, as well as ES 3.1 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	0277f491ae	freedreno/ir3: disable conversion folding on a4xx Experiments suggest that e.g. add.u r0.y, hr0.x, hr0.y will result in the summed value in both the high and low words of r0.y. This only happens with odd registers, not even ones (r0.x works fine). Seen in the bit_count lowering (which turns out to be unnecessary, but this is still a larger problem). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	4684425150	freedreno/ir3: no need to count bits 16b at a time for a4xx This also works out nicely since a4xx has some sort of problem with the 16b-based lowering. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	f628fd30de	freedreno/a4xx: improve condition for disabling early z This helps some subtests in the early-z piglit test, but leaves one occlusion-based test still failing. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Ilia Mirkin	d11543ec52	freedreno/a4xx: extend astc and tg4 workarounds to compute shaders Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15251>	2022-03-08 01:23:05 +00:00
Mike Blumenkrantz	4a03619d81	Revert "lavapipe: accurately set image/ssbo access based on shader usage" This reverts commit `821a49981f`. still flaky Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15271>	2022-03-07 18:00:42 -05:00
Matt Turner	8860ff3310	intel/perf: Destination array calculation into function Cuts 119 KiB from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 917511 0 0 917511 e0007 meson-generated_.._intel_perf_metrics.c.o (before) 796986 0 0 796986 c293a meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14130948 365708 210004 14706660 e067e4 iris_dri.so (before) 14009332 365708 210004 14585044 de8cd4 iris_dri.so (after) text data bss dec hex filename 8124225 214264 22820 8361309 7f955d libvulkan_intel.so (before) 8002609 214264 22820 8239693 7dba4d libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	d80d3c6760	intel/perf: Fix mistake in description string Along with fixing the grammar, this allows it to be deduplicated since the properly worded description exists in later generations' XMLs. Cuts 96 B from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 917613 0 0 917613 e006d meson-generated_.._intel_perf_metrics.c.o (before) 917511 0 0 917511 e0007 meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14131044 365708 210004 14706756 e06844 iris_dri.so (before) 14130948 365708 210004 14706660 e067e4 iris_dri.so (after) text data bss dec hex filename 8124321 214264 22820 8361405 7f95bd libvulkan_intel.so (before) 8124225 214264 22820 8361309 7f955d libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	7024b8e0eb	intel/perf: Mark intel_perf_counter_* enums as PACKED Reduces their sizes from 4 bytes to 1. Cuts 6 KiB from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 924401 0 0 924401 e1af1 meson-generated_.._intel_perf_metrics.c.o (before) 917613 0 0 917613 e006d meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14137732 365708 210004 14713444 e08264 iris_dri.so (before) 14131044 365708 210004 14706756 e06844 iris_dri.so (after) text data bss dec hex filename 8131009 214264 22820 8368093 7fafdd libvulkan_intel.so (before) 8124321 214264 22820 8361405 7f95bd libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	6c0246dcf4	intel/perf: Store indices to strings rather than pointers The compiler does a good job of deduplicating strings already, but we can eliminate the pointers to each string by combining the strings into a single char array and storing only an index into that array. The longest of the char arrays is the descriptions array, which is a little over 45 KiB, so still under MSVC's 64 KiB string literal limit [0]. Because the string length is under 64 KiB we can use uint16_t as the index type, which roughly doubles our savings as compared to an int. This cuts 77 KiB from iris_dri.so (0.5%) and libvulkan_intel.so (0.9%). text data bss dec hex filename 926811 25920 0 952731 e899b meson-generated_.._intel_perf_metrics.c.o (before) 924401 0 0 924401 e1af1 meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14190852 391628 210004 14792484 e1b724 iris_dri.so (before) 14137732 365708 210004 14713444 e08264 iris_dri.so (after) text data bss dec hex filename 8184097 240184 22820 8447101 80e47d libvulkan_intel.so (before) 8131009 214264 22820 8368093 7fafdd libvulkan_intel.so (after) relinfo: iris_dri.so (before): 17765 relocations, 17545 relative (98%), 452 PLT entries, 1 for local syms (0%), 0 users iris_dri.so (after) : 15605 relocations, 15385 relative (98%), 452 PLT entries, 1 for local syms (0%), 0 users libvulkan_intel.so (before): 10720 relocations, 6989 relative (65%), 355 PLT entries, 1 for local syms (0%), 0 users libvulkan_intel.so (after) : 8560 relocations, 4829 relative (56%), 355 PLT entries, 1 for local syms (0%), 0 users [0] https://docs.microsoft.com/en-us/cpp/cpp/string-and-character-literals-cpp?view=msvc-170&viewFallbackFrom=vs-2019 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	df5e743c80	intel/perf: Use slimmer intel_perf_query_counter_data struct intel_perf_query_counter contains fields for things we can't or don't want to store in our static data (like runtime-determined max values) or oa_read_counter function pointers which are dependent on the GPU gen and would make deduplication very ineffective. Cuts 16 KiB from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 926811 43200 0 970011 ecd1b meson-generated_.._intel_perf_metrics.c.o (before) 926811 25920 0 952731 e899b meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14190852 408908 210004 14809764 e1faa4 iris_dri.so (before) 14190852 391628 210004 14792484 e1b724 iris_dri.so (after) text data bss dec hex filename 8184097 257464 22820 8464381 8127fd libvulkan_intel.so (before) 8184097 240184 22820 8447101 80e47d libvulkan_intel.so (after) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	bbbbb0325b	intel/perf: Use a function to initialize perf counters And specifically mark it with ATTRIBUTE_NOINLINE. Otherwise it will be inlined and actually slightly increase code size. Cuts 505 KiB from iris_dri.so and libvulkan_intel.so. text data bss dec hex filename 1538720 0 0 1538720 177aa0 meson-generated_.._intel_perf_metrics.c.o (before) 926811 43200 0 970011 ecd1b meson-generated_.._intel_perf_metrics.c.o (after) text data bss dec hex filename 14751700 365708 210004 15327412 e9e0b4 iris_dri.so (before) 14190852 408908 210004 14809764 e1faa4 iris_dri.so (after) text data bss dec hex filename 8744913 214264 22820 8981997 890ded libvulkan_intel.so (before) 8184097 257464 22820 8464381 8127fd libvulkan_intel.so (after) Relocations increase because the counter initializations are moved from code (in .text) to pointers (in .text) to .rodata, which require relocations. relinfo: iris_dri.so (before): 15605 relocations, 15385 relative (98%), 452 PLT entries, 1 for local syms (0%), 0 users iris_dri.so (after) : 17765 relocations, 17545 relative (98%), 452 PLT entries, 1 for local syms (0%), 0 users libvulkan_intel.so (before): 8560 relocations, 4829 relative (56%), 355 PLT entries, 1 for local syms (0%), 0 users libvulkan_intel.so (after) : 10720 relocations, 6989 relative (65%), 355 PLT entries, 1 for local syms (0%), 0 users Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	5e6c7a572e	intel/perf: Deduplicate perf counters No changes in resulting code (yes, seriously!). GCC constant propagates the static const arrays into the code, yielding bit for bit identical results. This will however enable further cleanups. Before this patch, we emit 11916 different initializations of intel_perf_query_counter. With this patch we emit an array of 539 and initialize the intel_perf_query_counters in terms of those. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Matt Turner	3172b5bbb8	intel/perf: Don't print leading space from desc_units() Just an annoyance I noticed when I needed to generate the description string in two different places. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Emma Anholt	12e065ddec	intel/perf: Move some static blocks of C code out of the python script. Now my editor can help me format code as I type. Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15237>	2022-03-07 21:09:54 +00:00
Chia-I Wu	c795ae8b88	venus: fix properties of unsupported external fences/semaphores compatibleHandleTypes should be cleared. Fixed dEQP-VK.api.external.semaphore.sync_fd.info_timeline. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15266>	2022-03-07 20:43:50 +00:00
Ian Romanick	63d399b3fb	iris/ci: Mark amd_performance_monitor tests as flakes. On one attempt to merge !15210 failed because spec@amd_performance_monitor@measure,UnexpectedPass. In that same report, the other two tests were listed under "some flakes." Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Nanley Chery <nanley.g.chery@intel.com> Acked-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15220>	2022-03-07 10:54:04 -08:00
Danylo Piliaiev	e2fc99b188	turnip: Add "rast_order" debug option to force rast order access Enables rasterization order attachment access for all pipelines, see VK_ARM_rasterization_order_attachment_access for details. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15262>	2022-03-07 17:07:18 +00:00
Pierre-Eric Pelloux-Prayer	52ceb9dcb6	gallium/tc: warn if an app is incompatible with cpu_storage Instead of silently ignoring unmap calls. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15074>	2022-03-07 14:51:16 +01:00
Pierre-Eric Pelloux-Prayer	a5a8e19741	radeonsi: enable tc cpu_storage by default Enable for all applications for all small buffers. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15074>	2022-03-07 14:51:15 +01:00
Pierre-Eric Pelloux-Prayer	cd0ef9b3f4	gallium/u_threaded: late alloc cpu_storage Instead of allocating cpu_storage in threaded_resource_init, defer the allocation to first use (in tc_buffer_map). This avoids needless memory allocation if tc_buffer_disable_cpu_storage is called before tc_buffer_map. map_buffer_alignment is stored and serves as a "can cpu_storage be used" flag. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15074>	2022-03-07 14:51:15 +01:00
Pierre-Eric Pelloux-Prayer	7070178dfd	radeonsi: use 1 shader compilation thread if NIR_PRINT is used This avoids getting multiple shaders NIR code being printed at the same time making the log unusable. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15074>	2022-03-07 14:51:15 +01:00
Georg Lehmann	6731460194	nir: Fix source type for fragment_fetch_amd. Like txf_ms, these take integers not floats. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15242>	2022-03-07 12:21:12 +00:00
Pierre-Eric Pelloux-Prayer	50be692253	radeonsi/tests: always add the --gpu argument To get the same argument in multi and single gpu situation. Fixes: `21b0153833` ("radeonsi/tests: print PCI-id of GPU device under test") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15240>	2022-03-07 11:53:14 +01:00
Pierre-Eric Pelloux-Prayer	9c49550163	radeonsi: change rounding mode to round to even Use ROUND_TO_EVEN instead of TRUNCATE; this matches what pal and radv do. This fixes the spec@ext_framebuffer_multisample@turn-on-off tests. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Mihai Preda <mhpreda@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15240>	2022-03-07 11:53:14 +01:00
José Expósito	4c3f8d3af8	egl/wayland: fix crash in dri2_initialize_wayland_swrast When "dri2_wl_formats_init" fails in "dri2_initialize_wayland_swrast", the "dri2_display_destroy" function is called for clean up. However, the "dri2_egl_display" was not associated with the display in its "DriverData" field yet. The following cast in "dri2_display_destroy": struct dri2_egl_display *dri2_dpy = dri2_egl_display(disp); Expands to: _EGL_DRIVER_TYPECAST(drvname ## _display, _EGLDisplay, obj->DriverData) Crashing. Signed-off-by: José Expósito <jose.exposito89@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13972>	2022-03-07 08:26:01 +00:00
José Expósito	43258dc802	egl/wayland: fix crash in dri2_initialize_wayland_drm When "dri2_wl_formats_init" fails in "dri2_initialize_wayland_drm", the "dri2_display_destroy" function is called for clean up. However, the "dri2_egl_display" was not associated with the display in its "DriverData" field yet. The following cast in "dri2_display_destroy": struct dri2_egl_display *dri2_dpy = dri2_egl_display(disp); Expands to: _EGL_DRIVER_TYPECAST(drvname ## _display, _EGLDisplay, obj->DriverData) Crashing. Addresses-Coverity-ID: 1494541 ("Resource leak") Signed-off-by: José Expósito <jose.exposito89@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13972>	2022-03-07 08:26:01 +00:00
Mike Blumenkrantz	a0bfd65d0f	zink: hide descriptor debug behind #ifdef I've gotten a feel for this, but it's annoying to always see it spamming away Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	e0030bc39f	zink: invalidate non-punted recycled descriptor sets that are not valid these sets may contain refs from the descriptors which need to be removed to avoid invalid memory access if the ref is leaked cc: mesa-stable Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	d63f3c31b7	zink: stop leaking descriptor sets when migrating a recycled set here, the set was previously invalid and in the recycled table, meaning it can be reused directly so long as it's first invalidated the previous code would instead pop a different set off the allocation array, leaking this one cc: mesa-stable Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	9a91a520de	zink: mark fbfetch push sets as non-cached these can't be cached, so ensure the value isn't uninitialized Test case 'KHR-GL46.blend_equation_advanced.blend_all.GL_HARDLIGHT_KHR_all_qualifier'.. ==1193311== Conditional jump or move depends on uninitialised value(s) ==1193311== at 0x634EF05: update_push_ubo_descriptors (zink_descriptors.c:1230) ==1193311== by 0x634FCC5: zink_descriptors_update (zink_descriptors.c:1412) ==1193311== by 0x63A5EA1: void zink_draw<(zink_multidraw)1, (zink_dynamic_state)3, true, false>(pipe_context, pipe_draw_info const, unsigned int, pipe_draw_indirect_info const, pipe_draw_start_count_bias const, unsigned int, pipe_vertex_state, unsigned int) (zink_draw.cpp:788) ==1193311== by 0x635A0AF: void zink_draw_vbo<(zink_multidraw)1, (zink_dynamic_state)3, true>(pipe_context, pipe_draw_info const, unsigned int, pipe_draw_indirect_info const, pipe_draw_start_count_bias const*, unsigned int) (zink_draw.cpp:907) ==1193311== by 0x6174397: tc_call_draw_single (u_threaded_context.c:3150) ==1193311== by 0x616C403: tc_batch_execute (u_threaded_context.c:211) ==1193311== by 0x616CA44: _tc_sync (u_threaded_context.c:362) ==1193311== by 0x61725E9: tc_texture_map (u_threaded_context.c:2274) ==1193311== by 0x5A9DAE9: pipe_texture_map_3d (u_inlines.h:572) ==1193311== by 0x5A9EB80: st_ReadPixels (st_cb_readpixels.c:530) ==1193311== by 0x5A0647A: read_pixels (readpix.c:1178) ==1193311== by 0x5A0647A: _mesa_ReadnPixelsARB (readpix.c:1195) ==1193311== by 0x5A06517: _mesa_ReadPixels (readpix.c:1210) cc: mesa-stable Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	ab3725f533	zink: fix descriptor cache pointer array allocation this got mixed up during some refactor and started indexing based on the number of bindings instead of the number of descriptors, which means that array descriptor bindings would have overlapping array memory cc: mesa-stable Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	c5f585f45a	zink: wait on program cache fences before destroying programs if these still have outstanding cache jobs, deleting the object now will cause a crash maybe fixes some cts flakiness? cc: mesa-stable Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	382798ddbd	zink: use a fence for pipeline cache update jobs otherwise there's nothing to wait on cc: mesa-stable Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	cbd9a91f64	zink: add function for refcounting zink_program structs step one in maybe cleaning up and deduplicating some program code Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	9a6c58b2f7	zink: always update shader variants when rebinding a gfx program the variant data may have changed in the meanwhile, so do a pass to ensure that the correct variant is used cc: mesa-stable fixes caselist: dEQP-GLES31.functional.shaders.sample_variables.sample_mask.discard_half_per_two_samples.multisample_texture_4 dEQP-GLES31.functional.shaders.sample_variables.sample_mask.discard_half_per_two_samples.singlesample_rbo Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15226>	2022-03-07 04:07:58 +00:00
Mike Blumenkrantz	821a49981f	lavapipe: accurately set image/ssbo access based on shader usage Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15233>	2022-03-07 03:56:46 +00:00
Mike Blumenkrantz	bfae16ca34	lavapipe: scan shaders for image/ssbo access and generate per-stage masks Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15233>	2022-03-07 03:56:46 +00:00
Mike Blumenkrantz	fcf58e75d0	lavapipe: heap-allocate rendering_state struct this thing is like 28k now, which is just way too big to have on the stack Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15233>	2022-03-07 03:56:46 +00:00
Mike Blumenkrantz	c82dcdf598	gallivm: avoid division by zero when computing cube face this is illegal and produces NaNs which blow up the sample instr cc: mesa-stable fixes (llvmpipe and zink): KHR-GL45.incomplete_texture_access.sampler dEQP-GLES31.functional.program_uniform.by_pointer.render.array_in_struct.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_pointer.render.array_in_struct.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_pointer.render.array_in_struct.sampler2D_samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_pointer.render.basic.samplerCube_both dEQP-GLES31.functional.program_uniform.by_pointer.render.basic.samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_pointer.render.basic.samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_pointer.render.basic_struct.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_pointer.render.basic_struct.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_pointer.render.nested_structs_arrays.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_pointer.render.nested_structs_arrays.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_pointer.render.nested_structs_arrays.sampler2D_samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_pointer.render.struct_in_array.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_pointer.render.struct_in_array.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_pointer.render.struct_in_array.sampler2D_samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_value.render.array_in_struct.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_value.render.array_in_struct.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_value.render.array_in_struct.sampler2D_samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_value.render.basic.samplerCube_both dEQP-GLES31.functional.program_uniform.by_value.render.basic.samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_value.render.basic.samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_value.render.basic_struct.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_value.render.basic_struct.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_value.render.nested_structs_arrays.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_value.render.nested_structs_arrays.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_value.render.nested_structs_arrays.sampler2D_samplerCube_vertex dEQP-GLES31.functional.program_uniform.by_value.render.struct_in_array.sampler2D_samplerCube_both dEQP-GLES31.functional.program_uniform.by_value.render.struct_in_array.sampler2D_samplerCube_fragment dEQP-GLES31.functional.program_uniform.by_value.render.struct_in_array.sampler2D_samplerCube_vertex Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15246>	2022-03-06 02:44:17 +00:00
Mike Blumenkrantz	cf9454bb2a	gallivm: fix debug prints for halfs Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15246>	2022-03-06 02:44:17 +00:00
Icecream95	ba18799ca1	pan/bi: Don't assign slots for the blend second source Another instruction might write to the second source, and then an INSTR_INVALID_ENC fault will be raised because the tuple will write to and read from the register at the same time. Fixes: `795638767d` ("pan/bi: Use fused dual source blending") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	66a604efb5	pan/bi: Skip psuedo sources in ISA.xml The second staging register source for the +BLEND instruction should not be packed nor disassembled, so skip it when include_pseudo is not set. Fixes: `795638767d` ("pan/bi: Use fused dual source blending") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	9d4441c71a	panfrost: Fix ubo_mask calculation BITSET_MASK returns ~0 when given an input of zero, when we need it to return 0 instead. Fixes shaders with only sysvals but no UBOs when push constants are disabled. This breaks when 31 or 32 UBOs are used, but PAN_MAX_CONST_BUFFERS is currently set to 16. Fixes: `c246af0dd8` ("panfrost: Only upload UBOs when needed") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	0b232b8659	panfrost: Improve comment for emit_fragment_job Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	24101d944b	pan/bi: Add documentation for bifrost_nir_lower_store_component Taken from the commit that introduced the function, `95458c4033` ("pan/bi: Lower stores with component != 0"). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	42caddcf6b	pan/bi: Make disassembler build reproducibly Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:55:00 -05:00
Icecream95	d6c431c2e3	panfrost: Re-emit descriptors after resource shadowing This could be made slightly more efficient by only setting the dirty state that is needed, but eventually you reach a point where it's cheaper to re-emit everything than work out what can or can't be kept. Fixes rendering issues in Duckstation. Fixes: `cd2c1ef9da` ("panfrost: Dirty track textures/samplers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:54:58 -05:00
Icecream95	b164ee0d7b	panfrost: Set dirty state in set_shader_buffers Otherwise the pointer (which is uploaded as a sysval) won't be updated when a new SSBO is bound. Fixes: `c34b760b9f` ("panfrost: Dirty track constant buffers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	cb8c47b15e	pan/bi: Check dependencies of both destinations of instructions TEXC can have two destinations; the value for neither of them can be used in the same bundle, so extend the code to check for this to iterate over both destinations. Fixes artefacts in the game "LIMBO". Fixes: `a303076c1a` ("pan/bi: Add bi_instr_schedulable predicate") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	9e714f7455	pan/bi: Add interference between destinations Trying to write to overlapping register ranges from a single instruction is undefined behaviour, so add interference between the nodes to avoid this. Hit in a dual-texture instruction in LIMBO. Fixes: `9146bafbb4` ("pan/bi: Add dual texture fusing pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	198cb4a77a	panfrost: Disable point size upper limit clamping The hardware already clamps this, there is no need to do it in the shader. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	66684339d5	panfrost: Update point size limits to match hardware behaviour Found while reverse-engineering the tiler heap format. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:50:09 -05:00
Icecream95	d54efebf04	panfrost: Set PIPE_CAP_QUADS_FOLLOW_PROVOKING_VERTEX_CONVENTION Fixes arb-provoking-vertex-render Piglit test. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:47:24 -05:00
Icecream95	948300da27	pan/mdg: Use util_logbase2 instead of C99 log2 log2 operates on double, we only need the integer util/ function. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15250>	2022-03-05 14:27:44 -05:00
Ilia Mirkin	e42a8a5b92	a4xx: add emission of compute state, and compute dispatch Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	63bba1dc6c	a4xx: add logic to emit image/ssbo state Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	aac7028b58	freedreno/ir3: support a4xx compute differences Mainly the workgroup id comes injected via consts by the hardware (or CP), and we must make room for it, otherwise the driver won't know where to put it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Ilia Mirkin	6fb5e64ead	freedreno/ir3: support a4xx in load/store buffer/image emission Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14794>	2022-03-05 03:21:05 -05:00
Rob Clark	e9cd4fba6f	freedreno/perfetto+fdperf: Set SYSPROF param No need to check error return and deal with older kernels. Older kernels won't have this param but their default behavior allows for systemwide perfcntr collection. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15236>	2022-03-04 16:06:34 -08:00
Rob Clark	af4b7f74b2	freedreno/drm: Add SYSPROF param Add new param for putting kernel in system-profiling mode and add corresponding fd_pipe_set_param() mechanism. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15236>	2022-03-04 16:06:34 -08:00
Rob Clark	f925794b16	freedreno: Update uapi header Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15236>	2022-03-04 16:05:10 -08:00
Rob Clark	d2e498b6a5	egl+libsync: Add helper to complain about invalid fence fd's Debugging fd lifetime issues can be hard. Add a helper for debug builds to print out an error if an fd is not a fence fd, and sprinkle it around Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15094>	2022-03-04 22:16:20 +00:00
Rob Clark	1e25f3b282	android: Push in-fence-fd down to driver Rather than immediately stall on the CPU in SwapBuffers() if the in-fence for the dequeued buffer is not yet signaled, push it down to the driver. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6048 Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15094>	2022-03-04 22:16:20 +00:00
Rob Clark	dfac374220	gallium/dri: Extend image extension to support in-fence Extend dri so that an in-fence-fd can be plumbed through to driver. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15094>	2022-03-04 22:16:20 +00:00
Samuel Pitoiset	af2951dde8	radv/ci: update list of expected failures Add dEQP-VK.glsl.builtin.precision_double.determinant.compute.mat3 which fails on all generations. It looks like CTS should relax tolerance slightly. Co-authored-by: Charlie Turner <cturner@igalia.com> Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15234>	2022-03-04 18:43:18 +01:00
Samuel Pitoiset	51c6fdf708	radv/ci: skip dEQP-VK.renderpass2.depth_stencil_resolve.*_samplemask They randomly hang on Navi10 and randomly fail on Sienna Cichlid. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15234>	2022-03-04 18:43:16 +01:00
Juan A. Suarez Romero	7ffee7f1ab	v3d: rebind sampler view if resource changed the BO When discarding the whole resource to create a new one, if this resource is used by a sampler view, a rebind must be done to use the new resource. But this must be done when setting the sampler views, because we don't have access to those samplers before. v2: - Pack shader state on setting sampler views (Iago) - Use a serial ID to know when to rebind sampler views (Juan) v3: - Move check to caller (Iago) - Keep rebind sampler view on BO change (Iago) v4: - Rename "serial_bo" to "serial_id" (Iago) - Add comments (Iago) Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6027 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15171>	2022-03-04 17:20:28 +00:00
Alyssa Rosenzweig	7bda838c56	panfrost: Push twice as many uniforms The limit for Bifrost is twice as high as previously thought -- the limit is 64 slots of FAU, not 64 words. Each slot is 2 words. We can push twice as much, saving a considerable number of cycles in some cases. total instructions in shared programs: 2454260 -> 2431502 (-0.93%) instructions in affected programs: 845176 -> 822418 (-2.69%) helped: 3376 HURT: 304 helped stats (abs) min: 1.0 max: 60.0 x̄: 7.92 x̃: 6 helped stats (rel) min: 0.13% max: 45.45% x̄: 4.60% x̃: 4.11% HURT stats (abs) min: 1.0 max: 60.0 x̄: 13.06 x̃: 8 HURT stats (rel) min: 0.16% max: 35.09% x̄: 7.58% x̃: 6.52% 95% mean confidence interval for instructions value: -6.50 -5.87 95% mean confidence interval for instructions %-change: -3.75% -3.43% Instructions are helped. total tuples in shared programs: 1963383 -> 1951560 (-0.60%) tuples in affected programs: 638622 -> 626799 (-1.85%) helped: 2959 HURT: 573 helped stats (abs) min: 1.0 max: 54.0 x̄: 5.61 x̃: 4 helped stats (rel) min: 0.15% max: 28.57% x̄: 3.61% x̃: 3.12% HURT stats (abs) min: 1.0 max: 50.0 x̄: 8.35 x̃: 6 HURT stats (rel) min: 0.25% max: 27.34% x̄: 6.24% x̃: 4.92% 95% mean confidence interval for tuples value: -3.61 -3.08 95% mean confidence interval for tuples %-change: -2.18% -1.85% Tuples are helped. total clauses in shared programs: 387817 -> 365111 (-5.85%) clauses in affected programs: 135527 -> 112821 (-16.75%) helped: 3489 HURT: 25 helped stats (abs) min: 1.0 max: 43.0 x̄: 6.52 x̃: 5 helped stats (rel) min: 0.82% max: 58.33% x̄: 17.48% x̃: 15.87% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.56 x̃: 1 HURT stats (rel) min: 2.94% max: 11.11% x̄: 6.87% x̃: 6.67% 95% mean confidence interval for clauses value: -6.67 -6.26 95% mean confidence interval for clauses %-change: -17.65% -16.96% Clauses are helped. total cycles in shared programs: 201842.21 -> 168754.04 (-16.39%) cycles in affected programs: 84035.50 -> 50947.33 (-39.37%) helped: 3547 HURT: 136 helped stats (abs) min: 0.041665999999999315 max: 54.0 x̄: 9.33 x̃: 8 helped stats (rel) min: 0.17% max: 80.77% x̄: 36.10% x̃: 36.84% HURT stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.18% max: 12.24% x̄: 1.18% x̃: 0.61% 95% mean confidence interval for cycles value: -9.26 -8.71 95% mean confidence interval for cycles %-change: -35.34% -34.11% Cycles are helped. total arith in shared programs: 74918.46 -> 75022.62 (0.14%) arith in affected programs: 22471.04 -> 22575.21 (0.46%) helped: 1571 HURT: 1492 helped stats (abs) min: 0.041665999999999315 max: 1.125 x̄: 0.17 x̃: 0 helped stats (rel) min: 0.17% max: 40.00% x̄: 2.50% x̃: 1.96% HURT stats (abs) min: 0.041665999999999315 max: 2.375 x̄: 0.25 x̃: 0 HURT stats (rel) min: 0.16% max: 100.00% x̄: 5.35% x̃: 2.37% 95% mean confidence interval for arith value: 0.02 0.05 95% mean confidence interval for arith %-change: 1.08% 1.56% Arith are HURT. total ldst in shared programs: 174812 -> 137889 (-21.12%) ldst in affected programs: 81319 -> 44396 (-45.41%) helped: 3722 HURT: 0 helped stats (abs) min: 1.0 max: 62.0 x̄: 9.92 x̃: 8 helped stats (rel) min: 1.82% max: 100.00% x̄: 47.18% x̃: 43.75% 95% mean confidence interval for ldst value: -10.20 -9.64 95% mean confidence interval for ldst %-change: -47.97% -46.39% Ldst are helped. total quadwords in shared programs: 1757124 -> 1714130 (-2.45%) quadwords in affected programs: 584065 -> 541071 (-7.36%) helped: 3474 HURT: 173 helped stats (abs) min: 1.0 max: 90.0 x̄: 12.66 x̃: 9 helped stats (rel) min: 0.26% max: 34.18% x̄: 8.78% x̃: 8.33% HURT stats (abs) min: 1.0 max: 26.0 x̄: 5.76 x̃: 4 HURT stats (rel) min: 0.45% max: 20.66% x̄: 4.48% x̃: 2.63% 95% mean confidence interval for quadwords value: -12.21 -11.37 95% mean confidence interval for quadwords %-change: -8.36% -7.95% Quadwords are helped. total threads in shared programs: 52898 -> 53142 (0.46%) threads in affected programs: 262 -> 506 (93.13%) helped: 250 HURT: 6 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.92 0.99 95% mean confidence interval for threads %-change: 93.69% 99.28% Threads are helped. total spills in shared programs: 161 -> 107 (-33.54%) spills in affected programs: 54 -> 0 helped: 27 HURT: 0 total fills in shared programs: 1386 -> 796 (-42.57%) fills in affected programs: 590 -> 0 helped: 27 HURT: 0 Fixes: `d4dccea0ba` ("panfrost: Add UBO push data structure") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15239>	2022-03-04 15:22:04 +00:00
Alyssa Rosenzweig	e7cfe18099	pan/bi: Run CSE after lowering FAU Lowering FAU can add moves from uniforms. If a uniform is moved out to a register mulitple times in a basic block, these moves can be CSE'd, saving instructions at the cost of register pressure. 854 shaders in my shader-db are helped on cycle count (average 2.94% reduction in cycles). Only 9 shaders have hurt thread count, and there is no change in spills or fills. Overall, this seems to be a win. Prevents instruction count regressions from the next commit. total instructions in shared programs: 2454423 -> 2444690 (-0.40%) instructions in affected programs: 386274 -> 376541 (-2.52%) helped: 2105 HURT: 0 helped stats (abs) min: 1.0 max: 116.0 x̄: 4.62 x̃: 2 helped stats (rel) min: 0.04% max: 27.27% x̄: 3.64% x̃: 1.92% 95% mean confidence interval for instructions value: -4.91 -4.33 95% mean confidence interval for instructions %-change: -3.83% -3.45% Instructions are helped. total tuples in shared programs: 1963534 -> 1957106 (-0.33%) tuples in affected programs: 233562 -> 227134 (-2.75%) helped: 1491 HURT: 117 helped stats (abs) min: 1.0 max: 63.0 x̄: 4.44 x̃: 2 helped stats (rel) min: 0.04% max: 24.53% x̄: 4.39% x̃: 2.59% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.61 x̃: 1 HURT stats (rel) min: 0.18% max: 8.33% x̄: 1.44% x̃: 1.05% 95% mean confidence interval for tuples value: -4.28 -3.71 95% mean confidence interval for tuples %-change: -4.20% -3.73% Tuples are helped. total clauses in shared programs: 387848 -> 387079 (-0.20%) clauses in affected programs: 13718 -> 12949 (-5.61%) helped: 583 HURT: 60 helped stats (abs) min: 1.0 max: 16.0 x̄: 1.42 x̃: 1 helped stats (rel) min: 1.11% max: 25.00% x̄: 8.28% x̃: 6.67% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.86% max: 20.00% x̄: 4.58% x̃: 4.00% 95% mean confidence interval for clauses value: -1.29 -1.10 95% mean confidence interval for clauses %-change: -7.57% -6.58% Clauses are helped. total cycles in shared programs: 201866.21 -> 201682.92 (-0.09%) cycles in affected programs: 6241.79 -> 6058.50 (-2.94%) helped: 952 HURT: 98 helped stats (abs) min: 0.04166399999999726 max: 2.625 x̄: 0.20 x̃: 0 helped stats (rel) min: 0.12% max: 26.00% x̄: 4.05% x̃: 2.38% HURT stats (abs) min: 0.041665999999999315 max: 0.16666700000000034 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.18% max: 8.70% x̄: 1.60% x̃: 1.43% 95% mean confidence interval for cycles value: -0.19 -0.16 95% mean confidence interval for cycles %-change: -3.80% -3.24% Cycles are helped. total arith in shared programs: 74924.00 -> 74660.12 (-0.35%) arith in affected programs: 9303.67 -> 9039.79 (-2.84%) helped: 1513 HURT: 118 helped stats (abs) min: 0.04166399999999726 max: 2.625 x̄: 0.18 x̃: 0 helped stats (rel) min: 0.07% max: 33.33% x̄: 4.68% x̃: 2.67% HURT stats (abs) min: 0.041665999999999315 max: 0.16666800000000137 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.18% max: 8.70% x̄: 1.55% x̃: 1.37% 95% mean confidence interval for arith value: -0.17 -0.15 95% mean confidence interval for arith %-change: -4.48% -3.98% Arith are helped. total quadwords in shared programs: 1757254 -> 1751978 (-0.30%) quadwords in affected programs: 197399 -> 192123 (-2.67%) helped: 1464 HURT: 110 helped stats (abs) min: 1.0 max: 51.0 x̄: 3.73 x̃: 2 helped stats (rel) min: 0.04% max: 21.95% x̄: 4.16% x̃: 2.52% HURT stats (abs) min: 1.0 max: 7.0 x̄: 1.71 x̃: 1 HURT stats (rel) min: 0.21% max: 13.04% x̄: 1.65% x̃: 0.93% 95% mean confidence interval for quadwords value: -3.58 -3.13 95% mean confidence interval for quadwords %-change: -3.97% -3.53% Quadwords are helped. total threads in shared programs: 52899 -> 52890 (-0.02%) threads in affected programs: 18 -> 9 (-50.00%) helped: 0 HURT: 9 HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: -1.00 -1.00 95% mean confidence interval for threads %-change: -50.00% -50.00% Threads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15239>	2022-03-04 15:22:04 +00:00
Henry Goffin	c8f644ec44	frontends/va: ignore incoming frame_num from VA picture parameters The Gallium pipe video "frame_num" variable is internally used as a counter of elapsed reference frames since the last IDR. The incoming frame_num field from VA picture parameters is not equivalent; the VA value may wrap to zero prematurely, as it is a 16-bit struct field with a documented max value of 2^(log2_max_frame_num_minus4 + 4)-1. This change improves "infinite GOP" single-client live streaming, where it is reasonable for the server to desire an endless series of P-frames without IDR. Without this change, it is difficult/impossible for an application to encode a P- or B-frame after the VA frame_num field wraps around to zero, depending on the backend encoder implementation. This change has no effect on existing applications that always signal an IDR frame and reset the VA frame_num to zero before it wraps around. For example, the FFmpeg vaapi encoder ignores the VA documentation and sends an un-wrapped VA frame_num, which results in identical computation of the internal frame_num (as long as each GOP is less than 65536 frames). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5768 Reviewed-by: Thong Thai <thong.thai@amd.com> patch revision 3: correctly avoid incrementing frame_num when the encoded frame is not a reference, per h264 spec and ffmpeg behavior Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14332>	2022-03-04 14:17:20 +00:00
Rhys Perry	d28b6b6856	aco: rework removal of jumps over branches Only allow this in situations where we know it's safe. In particular, this stops removal of unconditional branches like with block_kind_continue_or_break. Fixes dEQP-VK.graphicsfuzz.fragcoord-control-flow hang. fossil-db (Sienna Cichlid): Totals from 34 (0.02% of 162293) affected shaders: Instrs: 84115 -> 84178 (+0.07%); split: -0.00%, +0.08% CodeSize: 463372 -> 463624 (+0.05%); split: -0.00%, +0.06% Latency: 3467316 -> 3467652 (+0.01%) InvThroughput: 3085493 -> 3085578 (+0.00%) Branches: 3221 -> 3284 (+1.96%); split: -0.03%, +1.99% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `f030b75b7d` ("aco: relax condition to remove branches in case of few instructions") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15214>	2022-03-04 12:32:36 +00:00
Samuel Pitoiset	059f870d74	ac/nir: implement nir_op_pack_{uint,sint}_2x16 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15231>	2022-03-04 08:06:56 +00:00
Samuel Pitoiset	9b113f1b6c	aco: implement nir_op_pack_{uint,sint}_2x16 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15231>	2022-03-04 08:06:56 +00:00
Samuel Pitoiset	6532307555	nir: introduce nir_pack_{sint,uint}_2x16 instructions These instructions have AMD hardware equivalent and they will be used to lower fragment shader outputs in NIR. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15231>	2022-03-04 08:06:56 +00:00
Xiaohui Gu	4d81c60e11	iris: Mark a dirty update when vs_needs_sgvs_element value changed Add vs_needs_sgvs_element value check when updating vertex element dirty state in iris_update_compiled_vs to solve render error of Android game "Genshin Impact". Signed-off-by: Xiaohui Gu <xiaohui.gu@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15142>	2022-03-04 05:41:38 +00:00
Yiwei Zhang	aaa25cda0b	venus: add VK_EXT_image_robustness support Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15205>	2022-03-04 01:04:13 +00:00
Yiwei Zhang	ba212bf888	venus: add VK_EXT_provoking_vertex support Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15205>	2022-03-04 01:04:13 +00:00
Yiwei Zhang	33ba61b059	venus: add VK_EXT_line_rasterization support Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15205>	2022-03-04 01:04:13 +00:00
Yiwei Zhang	58182eb096	venus: update to latest venus protocol Added the below extension support: - VK_EXT_line_rasterization - VK_EXT_provoking_vertex Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15205>	2022-03-04 01:04:13 +00:00
Yiwei Zhang	20efd9eff3	venus: group extensions promoted to 1.3 Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15205>	2022-03-04 01:04:13 +00:00
Yiwei Zhang	fe3815b7fa	venus: clean up physical device features and properties Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15205>	2022-03-04 01:04:13 +00:00
Daniel Schürmann	ca4595e01a	nir/opt_shrink_vectors: update docstring in order to reflect the various recent improvements. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468>	2022-03-04 00:18:58 +00:00
Daniel Schürmann	405829cd85	nir/opt_shrink_vectors: remove duplicate components from vecN vecN instructions which are only used by other ALU will now get duplicate channels removed. i915g: total instructions in shared programs: 396309 -> 396294 (<.01%) instructions in affected programs: 186 -> 171 (-8.06%) r300: total instructions in shared programs: 1165059 -> 1164354 (-0.06%) instructions in affected programs: 35884 -> 35179 (-1.96%) total temps in shared programs: 165497 -> 165326 (-0.10%) temps in affected programs: 2990 -> 2819 (-5.72%) softpipe: total instructions in shared programs: 2860028 -> 2859084 (-0.03%) instructions in affected programs: 55539 -> 54595 (-1.70%) total temps in shared programs: 516939 -> 516546 (-0.08%) temps in affected programs: 6623 -> 6230 (-5.93%) Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468>	2022-03-04 00:18:58 +00:00
Daniel Schürmann	e5963478c2	nir/opt_shrink_vectors: shrink load_const properly This patch enables removal of arbitrary channels in load_const instructions, if they are either unused or duplicates of other channels and only used by ALU. Totals from 692 (0.51% of 134913) affected shaders: (GFX10.3) VGPRs: 21832 -> 21544 (-1.32%) CodeSize: 1322016 -> 1313080 (-0.68%); split: -0.68%, +0.01% Instrs: 243635 -> 242231 (-0.58%); split: -0.58%, +0.00% Latency: 1856138 -> 1857237 (+0.06%); split: -0.09%, +0.15% InvThroughput: 424298 -> 421671 (-0.62%); split: -0.62%, +0.01% VClause: 4580 -> 4583 (+0.07%); split: -0.02%, +0.09% SClause: 14336 -> 14354 (+0.13%); split: -0.04%, +0.17% Copies: 8897 -> 8859 (-0.43%); split: -0.45%, +0.02% PreSGPRs: 20439 -> 20437 (-0.01%) PreVGPRs: 16011 -> 15907 (-0.65%); split: -0.97%, +0.32% i915g: total instructions in shared programs: 396471 -> 396309 (-0.04%) instructions in affected programs: 6408 -> 6246 (-2.53%) total const in shared programs: 56458 -> 56422 (-0.06%) const in affected programs: 407 -> 371 (-8.85%) LOST: shaders/closed/steam/trine-2/fp-3.shader_test FS r300: total instructions in shared programs: 1164421 -> 1165059 (0.05%) instructions in affected programs: 143981 -> 144619 (0.44%) total temps in shared programs: 165488 -> 165497 (<.01%) temps in affected programs: 318 -> 327 (2.83%) total consts in shared programs: 922140 -> 921952 (-0.02%) consts in affected programs: 12438 -> 12250 (-1.51%) softpipe: total instructions in shared programs: 2859978 -> 2860028 (<.01%) instructions in affected programs: 183355 -> 183405 (0.03%) total temps in shared programs: 517071 -> 516939 (-0.03%) temps in affected programs: 1416 -> 1284 (-9.32%) total imm in shared programs: 103601 -> 102767 (-0.81%) imm in affected programs: 3928 -> 3094 (-21.23%) Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468>	2022-03-04 00:18:58 +00:00
Dave Airlie	a10b5d7086	crocus: change the line width workaround for gfx4/5 This fixes piglit line-flat-clip-color and the hud fps counter. Fixes: `6b7a68b7c2` ("crocus: add missing line smooth bits.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15229>	2022-03-04 00:06:28 +00:00
Chia-I Wu	bbbbf39559	venus: abort when stuck This gives MESA-VIRTIO: debug: stuck in ring seqno wait with iter at 4096 MESA-VIRTIO: debug: stuck in ring seqno wait with iter at 8192 MESA-VIRTIO: debug: stuck in ring seqno wait with iter at 12288 MESA-VIRTIO: debug: stuck in ring seqno wait with iter at 16384 MESA-VIRTIO: debug: aborting Aborted which should be more friendly than printing the messages forever. On my i7-7820HQ, this aborts after roughly 4+8+16+32=60 seconds Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15200>	2022-03-03 21:48:13 +00:00
Daniel Schürmann	ccf4bcd162	aco/ra: don't immediately assign a register for p_branch These get now assigned after handling phis. Totals from 564 (0.42% of 134913) affected shaders: (GFX10.3) CodeSize: 5519744 -> 5515308 (-0.08%) Instrs: 1063045 -> 1061936 (-0.10%) Latency: 11880452 -> 11875904 (-0.04%) InvThroughput: 2259933 -> 2259581 (-0.02%); split: -0.02%, +0.00% Copies: 86908 -> 85799 (-1.28%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	c3070773f8	aco/tests: add test for branch definition RA Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	32d0bae8ec	aco: fix branch definition validation Like how they have to be register allocated differently, branch definitions at merge block predecessors need to be validated differently. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	bed5a31005	aco: add validate_instr_defs() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	d5349a99c2	aco/ra: fix register allocation of branch definitions fossil-db (Sienna Cichlid): Totals from 704 (0.52% of 134913) affected shaders: CodeSize: 7177288 -> 7182072 (+0.07%); split: -0.00%, +0.07% Instrs: 1371781 -> 1372977 (+0.09%); split: -0.00%, +0.09% Latency: 17993572 -> 18001344 (+0.04%); split: -0.00%, +0.04% InvThroughput: 4198996 -> 4199569 (+0.01%); split: -0.00%, +0.01% Copies: 122456 -> 123516 (+0.87%); split: -0.01%, +0.88% Branches: 43815 -> 43818 (+0.01%); split: -0.02%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	608d48b787	aco/ra: add get_reg_phi() helper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Rhys Perry	ceca5e68c4	aco: remove vcc hint from branch definitions This doesn't seem to have much benefit anymore. fossil-db (Sienna Cichlid): Totals from 198 (0.15% of 134913) affected shaders: CodeSize: 2610536 -> 2610872 (+0.01%); split: -0.01%, +0.02% Instrs: 479001 -> 479085 (+0.02%); split: -0.01%, +0.03% Latency: 7310684 -> 7300735 (-0.14%); split: -0.16%, +0.02% InvThroughput: 2439084 -> 2437446 (-0.07%); split: -0.07%, +0.00% SClause: 14760 -> 14722 (-0.26%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13432>	2022-03-03 20:21:08 +00:00
Pavel Ondračka	558f632967	r300: schedule TEX instructions before OUT instructions NIR-to-TGSI produces partial output writes contrary to the old paths that always wrote the full outputs. Therefore if there is now a partial output write ready to be scheduled and nothing else besides a tex is ready, we would schedule the output write first. This was not a problem before as usually at last some component of the full output write depended on the tex result. This is not optimal from the performance point of view and resulted in ~20% slowdown in the Unigine demos. The docs say: The first OUTPUT instruction will reserve space in the output register fifo. This space is limited, therefore issuing an OUTPUT earlier than necessary may cause threads to stall earlier than necessary. You should not set an ALU instruction as type OUTPUT unless it is actually writing to an output register, or it is the last instruction of the program. Fix it by explicitly prefering a TEX before OUT and restore the performance: 9.66 -> 12.12 fps (as compared to 11.83 with the old glsl-to-TGSI path) in Unigine Sanctuary. No change in Lightsmark or GLmark. This is also a win from the intructions point of view as we are usually able to schedule the partial output writes in a single pair at the end. total instructions in shared programs: 106009 -> 105891 (-0.11%) instructions in affected programs: 10153 -> 10035 (-1.16%) helped: 118 HURT: 0 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5840 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15165>	2022-03-03 20:05:32 +00:00
Pavel Ondračka	aff1a85c09	r300: remove some dead logic in tex pair scheduling The max_score == -1 condition is already before so this will never trigger. Its unclear what was the intention anyway. Now we emit either: - if we have accumulated enough tex intructions for a full block - if we have nothing else to emit - or if we can emit all remaining tex instructions already. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15165>	2022-03-03 20:05:32 +00:00
Igor Torrente	688b23885b	Venus: Add `vn_physical_device_{features, properties}` for better organization New extensions properties/feature are being put in the `vn_physical_device` which is not ideal from an organization point of view. Here the `vn_physical_device_{features,properties}` are two new struct to help the `vn_physical_device` organzation. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15170>	2022-03-03 19:43:52 +00:00
Ilia Mirkin	539fae796a	freedreno/a4xx: fix integer tg4 Something is slightly off in the integer values returned. It passes many tests without the fixup, but the dEQP-GLES31 tests complain. The blob ends up doing 3x gathers, and selects between them based on getinfo results. Since we already have a per-sampler key with some spare bits, just stick the bit-size info in there. And we can derive signedness from the associated type info. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	96211adf77	freedreno/a4xx: add swizzles to shader keys for tg4 workaround Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	68a2d25d0d	freedreno/a4xx: move tex_type to header This will be used in several places. Factor it out for common use. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	8ed07c0da9	nir: remove bogus logic to allow cube + offset to work This was done for an a4xx hack which is now removed. No API allows cube texturing to have offsets. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Ilia Mirkin	37306ba3f1	freedreno/ir3: remove bogus tg4 -> tex lowering pass It can't be done. This just provides bad results. The blob had a comparable approach where they fixed up coordinates, but that also can't work with a separate texture definition with nearest filtering. By then, might as well provide a unswizzled variant instead, and using native functionality. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14670>	2022-03-03 18:26:43 +00:00
Alex Xu (Hello71)	80bf9c7b97	r300/compiler/tests: print regoff_t as size_t fixes compilation on musl Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13949>	2022-03-03 17:48:17 +00:00
Samuel Pitoiset	516aee64cc	radv,aco: do not lower nir_op_pack_{unorm,snorm}_2x16 v_cvt_pknorm_{u16,i16}_f32 can be emitted instead, it's supported on all generations. No fossils-db changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15215>	2022-03-03 14:54:12 +01:00
Michel Zou	f1f1b3d7f8	vulkan/wsi: drop unused wsi_create_win32_image fixes: `ed391d2a` Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15088>	2022-03-03 06:13:07 +00:00
Andrii Simiklit	ddf2778269	glsl: add member's location layout qualifier rules for `arrayed` in/out blocks From Section 4.4.1 (Input Layout Qualifiers) of the GLSL 4.50 spec: "For some blocks declared as arrays, the location can only be applied at the block level: When a block is declared as an array where additional locations are needed for each member for each block array element, it is a compile-time error to specify locations on the block members. That is, when locations would be under specified by applying them on block members, they are not allowed on block members. For arrayed interfaces (those generally having an extra level of arrayness due to interface expansion), the outer array is stripped before applying this rule" From Section 1.2.1 (Changes from Revision 6 of GLSL Version) of the GLSL 4.50 spec: "Private Bug 15678: Don’t allow location = on block members where the block needs an array of locations" From Section 4.4.1 (Input Layout Qualifiers) of the GLSL ES 3.20 spec "If an input is declared as an array of blocks, excluding per-vertex-arrays as required for tessellation, it is an error to declare a member of the block with a location qualifier" From Section 1.1.3 (Changes from GLSL ES 3.2 revision 3) of the GLSL ES 3.20 spec: "Arrayed blocks cannot have layout location qualifiers on members" Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Signed-off-by: Andrii Simiklit <andrii.simiklit@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11522>	2022-03-03 05:42:45 +00:00
Mike Blumenkrantz	0313110c92	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Mike Blumenkrantz	712ce86bd1	zink: split primitives generated queries if xfb/gs states change if one of these states change then it affects which result needs to be used for that query, so split it up over multiple query ids to make sure the correct result is obtained fixes (lavapipe): GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_pause_resume GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_states Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Mike Blumenkrantz	0cb3ae949c	zink: split out query suspending into util function no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Mike Blumenkrantz	5aecec48ee	zink: update query states before starting renderpass during draw this gives some leeway for doing transfer ops without crashing the renderpass Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15227>	2022-03-03 05:21:40 +00:00
Ilia Mirkin	965ab44c50	nvc0: disable EXT_texture_sRGB_RG8 Looks like the green component doesn't get srgb-decoding, and no obvious way to force it. It works fine on nv50 though. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15211>	2022-03-03 04:37:12 +00:00
Ilia Mirkin	897a7fbbf1	mesa: enable GL_EXT_texture_sRGB_RG8 on desktop Looks like an extension number was assigned in late 2020. This makes it possible to hook up this format to teximage-colors without teaching it about ES. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15211>	2022-03-03 04:37:12 +00:00
Mike Blumenkrantz	af5f49f663	zink: remove loop from generated tcs this is already using per-vertex io, no need to add conditionals to verify Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15225>	2022-03-03 02:58:43 +00:00
Rob Clark	7e63fa2bb1	freedreno/registers: Add a couple regs we need for kernel Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15221>	2022-03-03 02:19:47 +00:00
Dave Airlie	34379a937f	gallivm/llvmpipe: add support for NIR to the linear/aos paths. When the AOS/linear code was added it only worked with TGSI which meant nothing in mesa upstream was really using it. This adds support to analyse NIR shaders, and adds aos support to the backend. AOS support is limited to mov,vec,fmul,tex sampling in order to accelerate mostly compositing operations. I've tested weston uses the fast path. gnome-shell can't use it yet as we can't optimise the depth test paths. Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15140>	2022-03-03 01:39:39 +00:00
Dave Airlie	6efd489ac9	gallivm/nir: split load_const out into backend helper. This just makes adding aos support easier. Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15140>	2022-03-03 01:39:39 +00:00
Dave Airlie	65c7ca617f	llvmpipe/linear: fix disk caching. Acked-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15140>	2022-03-03 01:39:39 +00:00
Mike Blumenkrantz	503362f008	zink: switch to u_foreach_bit for ntv image access decorations no functional changes Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15217>	2022-03-03 01:28:13 +00:00
Mike Blumenkrantz	fcdcfd9967	zink: emit Aliased decorations for any image that isn't explicitly marked restrict these might be aliased fixes: arb_shader_image_load_store-restrict fixes #6090 Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15217>	2022-03-03 01:28:13 +00:00
Mike Blumenkrantz	351378ae80	zink: remove a bunch of flakes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15219>	2022-03-03 01:14:20 +00:00
Dave Airlie	e1e9a44a69	lavapipe: always set read/write on ssbo/images. This fixes a regressions with overlap in llvmpipe, this is pessimistic we should write code to make it work properly. Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15219>	2022-03-03 01:14:20 +00:00
Alyssa Rosenzweig	b48236ea3e	pan/bi: Add arithmetic flag to RSHIFT ops Models ops like ARSHIFT_OR.i32 on Valhall without adding piles of new instructions to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	0b0e74ae82	pan/bi: Extend LD_TILE with a register format Required for Valhall. NIR has the information anyway, pass it along. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	74107abfc6	pan/bi: Add BRANCHZI instruction Technically this is just JUMP on Valhall, but the semantic is an indirect branch based on comparing with zero. It can also be used as a conservative branch (like BRANCHC), but this isn't modeled. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	3dc2095b07	pan/bi: Model LD_BUFFER instructions We'll use these to read from UBOs on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:44 +00:00
Alyssa Rosenzweig	5796777889	pan/bi: Model offset for LOAD/STORE Needed to model the immediate offset on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	039bb4e68c	pan/bi: Model pos/vary segments in STORE instructions For Bifrost, we model load/store segments, for example for thread local storage. We need something similar on Valhall -- access modifiers. There are four access modifiers on Valhall, controlling memory subsystem optimizations for the access: none: Nothing may be assumed. Corresponds to "global". istream: Internally streaming within the GPU. Corresponds to "pos", as it's used for position stores. estream: Externally streaming outside the GPU. Corresponds to "vary", as it's used for varying stores. force: Force access in discarded threads. Corresponds to "tl", as it's required for correct behaviour of helper invocations that use the stack. If these access modifiers end up being useful outside these fixed purposes, we may need to rework this part of the IR. For now, this should suffice. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	aaa39f0e60	pan/bi: Model LEA_BUF_IMM in the IR Required for varying stores in malloced IDVS jobs on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	eba9ef4c25	pan/bi: Add LD_VAR_BUF_IMM.f16/f32 instructions For use on Valhall with memory-allocated IDVS jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	48a398bf5b	pan/bi: Generalize I->table for Valhall Can be reused for resource tables in a natural way. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	20891e75c2	pan/bi: Extend BLEND to take a register format Needed on Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	3c817ed511	pan/bi: Model Valhall texture instructions These act like a TEXC+immediate. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	234d3efb9b	pan/va: Add memory access modifier to LOADs Might be required for correct spilling in some circumstances. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	79aa4af078	pan/va: Remap "store segment" to "memory access" For now, the difference does not matter. However it's better to model the actual hardware behaviour, rather than isomorphic driver behaviour, when we can do so. So fix the names. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	254a641290	pan/va: Fix LEA_BUF_IMM definition Technically the table is folded, too; the 0xD refers to table 61. But this instruction is more general than previously thought. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	7c798fbb9f	pan/va: Fix definitions of LD_VAR_BUF_IMM So close! However, LD_VAR_IMM is something else -- Bifrost-style varying interpolation, without a hardware buffer. For ES3, we'll need to support both. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	c62836661e	pan/va: Add TEX_GATHER instruction Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	65cb3af38a	pan/va: Add TEX_DUAL instruction Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	47b70ca584	pan/va: Add modifiers required for gathers Mostly isomorphic to Bifrost-style gathers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Alyssa Rosenzweig	431e7e54a6	pan/va: Handle force_enum differing from name Needed for secondary register width, for dual texturing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15216>	2022-03-03 00:41:43 +00:00
Ian Romanick	0a6c6dcb00	i915g: Emit better code for SEQ(x, 0) and SNE(x, 0) total instructions in shared programs: 789000 -> 788481 (-0.07%) instructions in affected programs: 16179 -> 15660 (-3.21%) helped: 157 HURT: 0 helped stats (abs) min: 3 max: 12 x̄: 3.31 x̃: 3 helped stats (rel) min: 1.56% max: 14.29% x̄: 4.24% x̃: 2.56% 95% mean confidence interval for instructions value: -3.51 -3.10 95% mean confidence interval for instructions %-change: -4.70% -3.78% Instructions are helped. LOST: 0 GAINED: 3 v2: Drop setting src1 to zero. Suggested by Emma. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15210>	2022-03-03 00:07:58 +00:00
Ian Romanick	374da6fc41	i915g: Handle constants composed exclusively of 0 or ±1 specially This can avoid some cases where a constant has to be loaded into a temporary register. v2: Update i915-g33-fails.txt. total instructions in shared programs: 788625 -> 782376 (-0.79%) instructions in affected programs: 166269 -> 160020 (-3.76%) helped: 1578 HURT: 0 helped stats (abs) min: 3 max: 21 x̄: 3.96 x̃: 3 helped stats (rel) min: 1.56% max: 33.33% x̄: 4.82% x̃: 3.45% 95% mean confidence interval for instructions value: -4.06 -3.86 95% mean confidence interval for instructions %-change: -5.00% -4.64% Instructions are helped. LOST: 0 GAINED: 35 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15210>	2022-03-03 00:07:58 +00:00
Ian Romanick	06eb9fb125	nir/algebraic: Optimize some cases of (sXX(a, b) != 0.0) I noticed the SGE case while looking at the output of shaders/closed/steam/trine-2/fp-3.shader_test on i915g. These are especially bad on i915 that needs two instructions to implement SNE. An alternative would be to duplicate the sne(sXX(a, b), 0.0) rules in an algebraic pass that occurs after bool_to_float. Doing the work earlier seems preferable. i915 total instructions in shared programs: 788274 -> 788223 (<.01%) instructions in affected programs: 666 -> 615 (-7.66%) helped: 5 HURT: 0 helped stats (abs) min: 9 max: 12 x̄: 10.20 x̃: 9 helped stats (rel) min: 5.00% max: 11.11% x̄: 8.12% x̃: 8.16% 95% mean confidence interval for instructions value: -12.24 -8.16 95% mean confidence interval for instructions %-change: -10.81% -5.43% Instructions are helped. LOST: 0 GAINED: 2 The two gained shaders are assembly fragment programs in Euro Truck Simulator 2. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15210>	2022-03-03 00:07:58 +00:00
Ian Romanick	7d055c93e0	i915g/ci: update piglit fails I believe these were fixed by !14573. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15210>	2022-03-03 00:07:58 +00:00
Emma Anholt	d506d910e4	nir: Switch to using nir_vec_scalars() for things that used nir_channel(). This should reduce follow-on optimization work to copy-propagate and dead-code away the movs generated in construction of vectors. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14865>	2022-03-02 22:28:58 +00:00
Emma Anholt	16c064dfaf	nir: Add a helper for setting up a nir_ssa_scalar struct. Trivial, but will help users avoid some struct constructions that can be awkward in C. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14865>	2022-03-02 22:28:58 +00:00
Emma Anholt	d95f9d189a	nir: Introduce a nir_vec_scalars() helper using nir_ssa_scalar. Many users of nir_vec() do so by nir_channel()-ing a new ssa defs as movs from other vectors to put the new vector together, which then just have to get copy-propagated into the ALU srcs and DCEed away the temporary movs. If they instead take nir_ssa_scalar, we can avoid that extra work. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14865>	2022-03-02 22:28:58 +00:00
Dave Airlie	48b3ef625e	vulkan/wsi: handle queue families properly for non-concurrent sharing mode. "queueFamilyIndexCount is the number of queue families having access to the image(s) of the swapchain when imageSharingMode is VK_SHARING_MODE_CONCURRENT. pQueueFamilyIndices is a pointer to an array of queue family indices having access to the images(s) of the swapchain when imageSharingMode is VK_SHARING_MODE_CONCURRENT." If the type isn't concurrent, don't attempt to access the arrays. dEQP-VK.wsi.xlib.swapchain.create.exclusive_nonzero_queues on lavapipe. Fixes: `5b13d74583` ("vulkan/wsi/drm: Break create_native_image in pieces") Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15101>	2022-03-02 20:47:16 +00:00
Emma Anholt	221ce1b35a	ci/freedreno: Consolidate some information about an a630 flake. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15197>	2022-03-02 19:53:26 +00:00
Emma Anholt	a64408dcd5	ir3: Don't assert on not finding the VS output for an FS input. It should return undefined data, not terminate the program. Fixes some piglit tests poking at the UB handling. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15197>	2022-03-02 19:53:26 +00:00
Rhys Perry	feb7e30e2d	radv: include disable_aniso_single_level and adjust_frag_coord_z in key Fixes potential pipeline caching bug. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15175>	2022-03-02 19:05:28 +00:00
Emma Anholt	2de15273d5	freedreno: Improve robustness behavior for VBs with offset > size. We were just emitting the bad reloc (either an assert fail on a debug build or for a release build likely a GPU hang from the resulting fault). Given that the GLES 3.2 spec's robust context requirement says we should return undefined data but not terminate for element indices outside of the VB, ignoring the offset in this case seems like a better behavior to have in all cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15198>	2022-03-02 17:31:49 +00:00
Emma Anholt	92945825f5	freedreno: Fix start_slot handling in set_vertex_buffers. mesa/st only ever provides a 0 for this value, but let's be ready if it ever does something else. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15198>	2022-03-02 17:31:49 +00:00
Emma Anholt	35ddb65ea6	freedreno: Use the resource size rather than BO size for VFD_FETCH[].SIZE. We should be using the API size to clamp, rather than what we allocated the BO into. Fixes: #13 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15198>	2022-03-02 17:31:49 +00:00
Erik Faye-Lund	b21e7e1ef7	docs: match build-flags markup with meson docs In meson.rst, we document build-flags with double backticks, which puts it inside a code-tag in the rendered HTML instead of a cite-tag like we currently do. Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15213>	2022-03-02 14:36:42 +00:00
Erik Faye-Lund	23d3fb6da2	docs: fix a broken link Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15213>	2022-03-02 14:36:42 +00:00
Lionel Landwerlin	96c8880900	intel/fs: fix total_scratch computation We only have a single prog_data::total_scratch for all shader variants (SIMD 8, 16, 32). Therefore we should always max the total_scratch on top of existing variant. We probably haven't run into that issue before because we compile by increasing SIMD size and higher SIMD size is more likely to spill. But for bindless shaders with return shaders, if the last return part doesn't spill, we completely ignore the previous parts' scratch computation. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15193>	2022-03-02 13:13:03 +00:00
Juan A. Suarez Romero	5b43075888	v3d: enable texture filtering anisotropic Seems we already had implemented this feature (see commit `521e1d0275` "broadcom/vc5: Add support for anisotropic filtering"), but we didn't enable the proper capability. Also update the maximum level of anistropy supported. Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4201 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15180>	2022-03-02 12:55:17 +00:00
Caio Oliveira	dc77542ed2	intel/compiler: Use pass helper in brw_nir_adjust_offset_for_arrayed_indices Also change the code to preserve certain metadata: control flow is not changed so both block indices and dominance information is preserved. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15206>	2022-03-02 10:46:23 +00:00
Iago Toral Quiroga	f761f8fd9e	broadcom/compiler: simplify node/temp translation during register allocation Now that we don't sort our nodes we can arrange them so we can easily translate between nodes and temps without a mapping table, just applying an offset. To do this we have a single array of nodes where twe put first the nodes for accumulators and then the nodes for temps. With this setup we can ensure that for any given temp T, its node is always T + ACC_COUNT. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	871b0a7f6a	broadcom/compiler: don't sort nodes for register allocation Nodes are allocated in order to registers so initially sorting was used to ensure that nodes with smaller life ranges would be assigned first and therefore be more likely to get accumulators. However, since `d81a6e5f1d` now we don't rely on order to make decisions about accumulators and instead we make policy decisions based on actual liveness, so sorting is no longer strictly relevant to this decision. Furthermore, we are not re-sorting nodes after each spill either, since that would probably require that we rebuild the interference graph after each spill (the graph identifies nodes by their index). Shader-db results show a significant improvement in instruction counts, due to more optimal accumulator assignments. The reason for this is that we use a round-robin policy for choosing the next accumulator to assign. The idea behind this is preventing nearby temps to be assigned to the same accumulator so that QPU scheduling is more flexible, but if we sort our nodes, we are basically not assigning temps in program order any more and the round-robin policy becomes less effective: total instructions in shared programs: 13000420 -> 12663189 (-2.59%) instructions in affected programs: 11791267 -> 11454036 (-2.86%) helped: 62890 HURT: 19987 total threads in shared programs: 415874 -> 415870 (<.01%) threads in affected programs: 20 -> 16 (-20.00%) helped: 2 HURT: 4 total uniforms in shared programs: 3711652 -> 3711624 (<.01%) uniforms in affected programs: 43430 -> 43402 (-0.06%) helped: 134 HURT: 173 total max-temps in shared programs: 2144876 -> 2138822 (-0.28%) max-temps in affected programs: 123334 -> 117280 (-4.91%) helped: 4112 HURT: 1195 total spills in shared programs: 3870 -> 3860 (-0.26%) spills in affected programs: 1013 -> 1003 (-0.99%) helped: 14 HURT: 12 total fills in shared programs: 5560 -> 5573 (0.23%) fills in affected programs: 1765 -> 1778 (0.74%) helped: 14 HURT: 17 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	4483cd24af	broadcom/compiler: sink uniform loads total instructions in shared programs: 13014428 -> 13000420 (-0.11%) instructions in affected programs: 743624 -> 729616 (-1.88%) helped: 1392 HURT: 611 total threads in shared programs: 415858 -> 415874 (<.01%) threads in affected programs: 16 -> 32 (100.00%) helped: 8 HURT: 0 total uniforms in shared programs: 3720410 -> 3711652 (-0.24%) uniforms in affected programs: 113442 -> 104684 (-7.72%) helped: 635 HURT: 29 total max-temps in shared programs: 2154268 -> 2144876 (-0.44%) max-temps in affected programs: 61279 -> 51887 (-15.33%) helped: 1124 HURT: 187 total spills in shared programs: 4002 -> 3870 (-3.30%) spills in affected programs: 265 -> 133 (-49.81%) helped: 6 HURT: 0 total fills in shared programs: 5788 -> 5560 (-3.94%) fills in affected programs: 603 -> 375 (-37.81%) helped: 6 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	e228642cf5	broadcom/compiler: move constants before their first user For us they are basically uniforms too so we want to make their lifespans short to facilitate allocating them to accumulators. total instructions in shared programs: 13043585 -> 13015385 (-0.22%) instructions in affected programs: 8326040 -> 8297840 (-0.34%) helped: 24939 HURT: 19894 total threads in shared programs: 415860 -> 415858 (<.01%) threads in affected programs: 4 -> 2 (-50.00%) helped: 0 HURT: 1 total uniforms in shared programs: 3721953 -> 3720451 (-0.04%) uniforms in affected programs: 96134 -> 94632 (-1.56%) helped: 744 HURT: 435 total max-temps in shared programs: 2173431 -> 2154260 (-0.88%) max-temps in affected programs: 264598 -> 245427 (-7.25%) helped: 10858 HURT: 841 total spills in shared programs: 4005 -> 4010 (0.12%) spills in affected programs: 700 -> 705 (0.71%) helped: 5 HURT: 10 total fills in shared programs: 5801 -> 5817 (0.28%) fills in affected programs: 1346 -> 1362 (1.19%) helped: 6 HURT: 11 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	a1998a9f43	broadcom/compiler: disallow TMU spills if max tmu spills is 0 If we are compiling with a strategy that does not allow TMU spills we should not allow spilling anything that is not a uniform. Otherwise the RA cost/benefit algorithm may choose to spill a temp that is not uniform and that will cause us to immediately fail the strategy and fallback to the next one, even if we could've instead chosen to spill more uniforms to compile the program successfully with that strategy. Some relevant shader-db stats: total instructions in shared programs: 13040711 -> 13043585 (0.02%) instructions in affected programs: 234238 -> 237112 (1.23%) helped: 73 HURT: 172 total threads in shared programs: 415664 -> 415860 (0.05%) threads in affected programs: 196 -> 392 (100.00%) helped: 98 HURT: 0 total uniforms in shared programs: 3717266 -> 3721953 (0.13%) uniforms in affected programs: 12831 -> 17518 (36.53%) helped: 6 HURT: 100 total max-temps in shared programs: 2174177 -> 2173431 (-0.03%) max-temps in affected programs: 4597 -> 3851 (-16.23%) helped: 79 HURT: 21 total spills in shared programs: 4010 -> 4005 (-0.12%) spills in affected programs: 55 -> 50 (-9.09%) helped: 5 HURT: 0 total fills in shared programs: 5820 -> 5801 (-0.33%) fills in affected programs: 186 -> 167 (-10.22%) helped: 5 HURT: 0 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Iago Toral Quiroga	cbb4d0dded	broadcom/compiler: increase cost of TMU spills to 10 Our cost was 5 which matches the number of instructions we have to add for a TMU spill (a fill is 4 instructions). Uniform spills on the other hand add an extra instruction for each fill and remove one instruction for the spill itself. These have a cost of 1. Therefore, if we have a single spill+fill, we end up with +9 instructions if it is a TMU spill and +0 instructions with a uniform spill, so making the former only 5 times more costly is probably not a good idea, and this is without even considering the added latency of the TMU accesses. Relevant shader-db changes show this causes as a marginal instruction count increase in a few shaders but better thread counts and lower TMU spilling overall: total instructions in shared programs: 13037315 -> 13040711 (0.03%) instructions in affected programs: 370106 -> 373502 (0.92%) helped: 187 HURT: 321 total threads in shared programs: 415090 -> 415664 (0.14%) threads in affected programs: 574 -> 1148 (100.00%) helped: 287 HURT: 0 total uniforms in shared programs: 3706674 -> 3717266 (0.29%) uniforms in affected programs: 63075 -> 73667 (16.79%) helped: 40 HURT: 395 total max-temps in shared programs: 2176080 -> 2174177 (-0.09%) max-temps in affected programs: 15838 -> 13935 (-12.02%) helped: 316 HURT: 34 total spills in shared programs: 4247 -> 4010 (-5.58%) spills in affected programs: 2599 -> 2362 (-9.12%) helped: 107 HURT: 14 total fills in shared programs: 6121 -> 5820 (-4.92%) fills in affected programs: 3622 -> 3321 (-8.31%) helped: 108 HURT: 13 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15168>	2022-03-02 08:09:11 +00:00
Marek Olšák	a02dd17cb3	radeonsi: fix an assertion failure with register shadowing The problem is that dirty_states must be 0 for any state that is NULL in "queued". This code was flagging dirty_states for such states because it was only looking at "emitted". It should have been looking at "queued". Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	0f96948dfa	radeonsi: fix register shadowing after the pm4 state size was decreased Fixes: `946bd90a09` "radeonsi: decrease the size of si_pm4_state::pm4 except for cs_preamble_state" Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	66e20d2bf7	ac: add an environment variable that parses IBs in files Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	3394f0ae14	ac: define PKT3_ATOMIC_MEM Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	ff9e4409c1	ac: parse SET_SH_REG_INDEX packet Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	0cae7a59c0	ac/llvm: update LLVM processor names for gfx10.3 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15209>	2022-03-01 22:30:24 +00:00
Marek Olšák	87d83f4103	ci: add point coord failures to d3d12 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:56 +00:00
Marek Olšák	5ca7c20cf7	st/mesa: do nir_lower_io() for inputs & outputs with transform feedback info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:56 +00:00
Marek Olšák	ee4c5b1699	gallium/aux: add helper nir_gather_stream_output_info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	2a708efec3	gallium/util: add util_dump_stream_output_info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	1dcd1eac6a	nir: pass nir_shader into nir_recompute_io_bases instead of func_impl Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	606811bded	nir: add nir_print_xfb_info Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	ad68a1ee5a	nir: add nir_gather_xfb_info_from_intrinsics for lowered IO Drivers will use this. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	d4c051b047	nir: add nir_lower_io_passes() with new transform feedback moved from radeonsi without the vectorization, which won't be needed for now. We will lower IO in st/mesa instead of radeonsi to get the transform feedback info into store instructions. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	3528dcdfa1	nir: add nir_io_semantics::no_varying, no_sysval_output, and helpers This is for drivers that have separate store instructions for varyings, system value outputs (such as clip distances), and transform feedback. The flags tell the driver not to store the output to those locations. This will be used by radeonsi initially, and then maybe by a new linker. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	548b2d47b2	nir: scalarize transform feedback info in nir_lower_io_to_scalar Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	cc5505088b	nir: add shader_info::xfb_strides NIR now fully contains pipe_stream_output_info in shader_info and IO intrinsics if lower_io_variables is true. radeonsi will not use pipe_stream_output_info after this, and other drivers are free to follow that. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	4636fa7f38	nir: add transform feedback info into nir_intrinsic_store_output This will allow compaction of transform feedback varyings because they are no longer tied to varying slots with this information. It will also make transform feedback info available to all NIR passes after IO is lowered. It's meant to replace pipe_stream_output_info. Other intrinsics are not used with transform feedback. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	2c6e41bfe1	nir: fix nir_io_semantics::gs_streams in nir_lower_io_to_scalar gs_streams is relative to the component. Also clear the high bits. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	73ef225fc2	nir: validate write_mask for all intrinsics that have it Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14388>	2022-03-01 21:59:55 +00:00
Marek Olšák	dd733fa52e	radeonsi: fix broken VK-GL buffer interop Fixes: `ad9b5ac0a1` - radeonsi: more fixes for si_buffer_from_winsys_buffer for GL-VK interop Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6063 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15124>	2022-03-01 20:51:04 +00:00
Nanley Chery	c1a7d520f3	anv: Disable aux if the explicit modifier lacks it For dmabuf imports, configure the primary surface without support for compression if the modifier doesn't specify it. This helps to create VkImages with memory requirements that are compatible with the buffers apps provide. Suggested-by: Philip Langdale <philipl@overt.org> Cc: 22.0 <mesa-stable> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5940 Tested-by: Philip Langdale <philipl@overt.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15181>	2022-03-01 20:05:50 +00:00
Nanley Chery	cada519482	anv: Refactor anv_image_init_from_create_info Use a variable to store the anv_image_create_info struct. We'll modify it for a bug fix in the next patch. Cc: 22.0 <mesa-stable> Tested-by: Philip Langdale <philipl@overt.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15181>	2022-03-01 20:05:50 +00:00
Nanley Chery	8d2b7e558b	anv: Change a parameter of the implicit layout fn Replace the create_info parameter with isl_extra_usage_flags to more closely match the parameters of explicit layout function. Tested-by: Philip Langdale <philipl@overt.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15181>	2022-03-01 20:05:50 +00:00
Alyssa Rosenzweig	c3eee6327c	pan/va: Add missing copyright notice Minor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	eda00fd39d	pan/bi: Extract INSTRUCTION_CASE macro Useful across multiple optimization tests. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	ffde1f359b	pan/bi: Adapt bi_lower_branch for Valhall Disable the Bifrost optimization; it's not portable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	f3937d9874	pan/bi: Trade off registers/threads on Valhall It's only v6 that's missing this feature. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:23 +00:00
Alyssa Rosenzweig	7637502c8d	pan/bi: Add BI_SUBGROUP_SUBGROUP16 option Valhall uses 16-wide warps. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	ec9c1f8fa6	pan/bi: Wire Valhall disassembler into compiler Useful when we grow Valhall support (soon!) Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	31e991d801	pan/bi: Support standalone Valhall disassembly $ bifrost_compiler disasm --gpu=G78 foo.bin Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	600f689a98	pan/bi: Allow CSE of preloaded registers Needed to CSE `LEA_VARY` in varying shaders on Valhall. No shader-db changes on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	3154df232b	pan/bi: Use a progress loop for constant folding Needed to fold the dependent patterns produced by texture instructions during NIR->Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	e5582710f3	pan/bi: Mark NOP as having no destinations More accurate and more convenient. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	2604c65174	panfrost: Unify barrier+helper handling These are unified in the hardware, so let's unify them in pan_shader_info. Hoisting this logic to pan_shader.c avoids the need to duplicate this logic for Midgard/Bifrost (RSD packing) and Valhall (SPD packing). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	30d0c2e390	panfrost: Set texel_interleave on Valhall Instead of specifying the tiling on the texture descriptor, Valhall specifies it on the plane descriptor. There is a new flag on the texture descriptor specifying only whether the planes are interleaved or not. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	407bda4d8c	panfrost: Adapt estimate_texture_payload_size to Valhall The plane descriptor is larger than earlier surface descriptors, so we need to be somewhat careful here. This removes a memory micro-optimization in the interest of simplifying the code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	469a36d071	panfrost: Don't emit compression tags on Valhall Unnecessary. To avoid even more #if/#endif soup, merge the v4, v5-v8, and v9 paths together -- by returning 0 as the compression tag on v4 or v9. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	087b63cb07	panfrost: Allow uploading fragment SPDs SPDs don't have the state dependence that fragment RSDs do. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	e42b0c68f4	panfrost: Don't pack blend constants with blend shaders It's probably harmless, but it is logically meaningless. The DDK doesn't do it, I don't see a reason for us to, either. In theory this should be a small overhead win. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	111f5af303	panfrost: Generalize some is_bifrost users Valhall would want these too. Regretting the is_bifrost check at all.. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	36a2b8d039	panfrost: Add PAN_MESA_DEBUG=dump option To dump all graphics memory via the new pandecode_dump_mappings function(), since for Valhall I have to do this often enough to warrant a dynamic flag. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	28743a5556	panfrost: Rename prepare_rsd->prepare_shader This hook will be repurposed on Valhall to prepare the Shader Program Descriptor, which takes the role of the RSD. Rename to avoid confusion. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	631c01fc42	panfrost: Add an enum for Valhall resource tables Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	f3c971e0fe	panfrost: Make Divisor E an integer on v9 For consistency with previous architecture's XML files. Logically this is an 1-bit unsigned integer, not a boolean. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	b19afaf307	panfrost: Clarify contains descriptor? bit Influences cache prefetching. I don't see a good reason to put anything other than descriptors inside shader resources, meaning always setting this bit is appropriate (at least for GLES). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	1df6b0d7e2	panfrost: Remove Invalidate Cache from Valhall job header Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	217e038289	panfrost: Add Tile Render Order enum to fragment jobs Not sure what this is needed for yet. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Alyssa Rosenzweig	52ccd21e6b	panfrost: Extend SPD size There is software-defined state at the end we don't need. Model in the XML for correct behaviour. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15204>	2022-03-01 19:43:22 +00:00
Thong Thai	0136545d16	radeonsi: add check for graphics to si_try_normal_clear Cc: mesa-stable Signed-off-by: Thong Thai <thong.thai@amd.com> Acked-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15177>	2022-03-01 19:05:06 +00:00
Lionel Landwerlin	214092da87	anv: fix fast clear type value with external images Disable fast clear if not supported by the external modifier. v2: Set fast_clear value to NONE in case of import/export from/to external v3: Move logic next to existing acquire/release checks (Nanley) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6056 Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15096>	2022-03-01 17:37:13 +00:00
Oleksandr Gabrylchuk	02fab4cf9e	venus: Implement guest vram blob type. Add support of GUEST_VRAM type of blob. These are dedicated heap memory allocations required for vk support on hypervisors that don't support runtime injections of host memory into guest physical address space. The flow of usage: 1) Host VM reserves dedicated heap memory 2) Device get info about memory reservations and report it to guest using mmio registers 3) Guest virtio-gpu driver on starts checks mmio registers for physical address and length of reserved region. Then it reserves it in guest. 4) On each call of vkAllocateMemory() guest driver gets chunk of required memory and send it to host using sg list. It uses one sg entry for 1 blob call. Heap is managed on guest using drm memory manager (drm_mm). Signed-off-by: Oleksandr.Gabrylchuk <Oleksandr.Gabrylchuk@opensynergy.com> Signed-off-by: Andrii Pauk <Andrii.Pauk@opensynergy.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14536>	2022-03-01 17:25:56 +00:00
Marek Olšák	fd3451babd	amd: update addrlib Reviewed-by: Yifan Zhang <yifan1.zhang@amd.com> Tested-by: Yifan Zhang <yifan1.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15155>	2022-03-01 17:03:00 +00:00
Marek Olšák	f8cf5ea982	amd: add support for gfx1036 and gfx1037 chips Both are identified as GFX1036 for simplicity. Reviewed-by: Yifan Zhang <yifan1.zhang@amd.com> Tested-by: Yifan Zhang <yifan1.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15155>	2022-03-01 17:03:00 +00:00
Marek Olšák	48046d5bd8	ac: set correct cache size per TCC for Yellow Carp Reviewed-by: Yifan Zhang <yifan1.zhang@amd.com> Tested-by: Yifan Zhang <yifan1.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15155>	2022-03-01 17:03:00 +00:00
Samuel Pitoiset	4380916b76	radv: disable DCC for Fable Anniversary, Dragons Dogma, GTA IV and more Also Starcraft 2 and The Force Unleashed II. These games are known to be affected by the feedback loop issue. We will fix this properly soon but as a hotfix disabling DCC should be enough. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4424 Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15203>	2022-03-01 16:33:18 +00:00
Vadym Shovkoplias	dc921f7377	iris: Do not apply SCANOUT allocation flags for SHARED-only requests It provides similar solution as in [1]. This was workaround for the users of gbm_bo_create_with_modifiers(), which were unable to specify the buffer usage (GPU / GPU+DISPLAY). But after the commit [2] this become possible. And forcing usage to GBM_BO_USE_SCANOUT migrated directly into gbm_bo_create_with_modifiers [3], allowing us to remove such workarounds from the drivers. [1]: `ef3b31c9` ("v3d: Don't force SCANOUT for PIPE_BIND_SHARED requests") [2]: `268e12c6` ("gbm: add gbm_{bo,surface}_create_with_modifiers2") [3]: `ad50b47a` ("gbm: assume USE_SCANOUT in create_with_modifiers") Suggested-by: Roman Stratiienko <roman.o.stratiienko@globallogic.com> Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5642 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14264>	2022-03-01 16:04:44 +00:00
Timur Kristóf	93087f71e6	ac/nir: Extract final mesh shader output counts to a separate function. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Timur Kristóf	11957d3863	aco: Remove superfluous code for mesh shader workgroup ID. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Timur Kristóf	2d5aae032b	ac/nir: Properly invalidate mesh shader metadata. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Timur Kristóf	3a3bd9cff1	ac/nir: Fix workgroup ID in mesh shader waves other than the first. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Timur Kristóf	57775dd76a	ac/nir: Store mesh shader API and HW workgroup size in lowering state. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Timur Kristóf	d0f45c7c49	ac/nir: Reuse existing nir_builder for emit_ms_finale. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Timur Kristóf	74f1e7965e	ac/nir: Use vertex count minus 1 to determine max index in mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15199>	2022-03-01 15:37:12 +00:00
Charlie Turner	16b417b8d6	ci, valve: Add the dEQP runners for Valve CI v2. - Build the runner image as part of the CI for the boot2container project, rather than as a manually step using the build instructions in valve-trigger.dockerfile. - Depend on a non-default kernel build hosted in the valve-infra package repository. This does reduce the current caching feature of local artifacts, but makes it easier to chop and change kernels on a per-project or even per-test basis. v3. - Depend on a kernel built and stored in the valve-infra generic package repo. - Build the runner container using ci-templates as part of the CI in valve-infra. - Now that the runner container is built in the valve-infra CI, I dropped the source import of client.py and message.py. They are built in the runner container. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14660>	2022-03-01 13:04:14 +00:00
Charlie Turner	f0aee991bf	amd, ci: Categorize the sections of the CI file. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14660>	2022-03-01 13:04:14 +00:00
Charlie Turner	58186df32c	amd, ci: Drop log level in SPIRV -> NIR code generator. See `786fa3435c` for the rationale of this variable, but the point is to avoid many error reports for conformance conformance issues within the VK-CTS shaders. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14660>	2022-03-01 13:04:14 +00:00
Charlie Turner	cc327a0fe4	amd, ci: Remove unused runners. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14660>	2022-03-01 13:04:14 +00:00
Charlie Turner	befe3a9e48	ci, valve: Add support scripts for the Valve bare-metal farm. - Add the scripts to the prepared Mesa artifacts for use in later runner stages. - Add a template generator (generate_b2c.py) which reads and validates (very lightly for now) the Gitlab job environment and then spits out a YAML file describing the necessary test workload to be sent to a Valve CI gateway. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14660>	2022-03-01 13:04:14 +00:00
Samuel Pitoiset	1e010348ee	radv: remove color exports in presence of holes If there is holes, eg. if only MRT0 and MRT2 are exported, we have to set MRT1 to SPI_SHADER_32_R to avoid a GPU hang but the export can still be removed from the fragment shader. fossils-db (Sienna Cichlid): Totals from 565 (0.42% of 134913) affected shaders: VGPRs: 13328 -> 11456 (-14.05%) CodeSize: 613232 -> 548224 (-10.60%); split: -11.13%, +0.53% LDS: 284672 -> 296960 (+4.32%) MaxWaves: 17624 -> 17684 (+0.34%) Instrs: 113056 -> 100445 (-11.15%); split: -11.68%, +0.53% Latency: 684327 -> 639348 (-6.57%); split: -7.17%, +0.60% InvThroughput: 122877 -> 104382 (-15.05%); split: -15.18%, +0.13% VClause: 2601 -> 2323 (-10.69%); split: -10.77%, +0.08% SClause: 5629 -> 5443 (-3.30%); split: -3.91%, +0.60% Copies: 9393 -> 8720 (-7.16%); split: -8.22%, +1.05% PreSGPRs: 14623 -> 13666 (-6.54%); split: -6.76%, +0.22% PreVGPRs: 9847 -> 8503 (-13.65%) fossils-db (Polaris10): Totals from 565 (0.42% of 135960) affected shaders: SGPRs: 28064 -> 27104 (-3.42%) VGPRs: 12516 -> 10544 (-15.76%); split: -15.79%, +0.03% CodeSize: 516920 -> 456536 (-11.68%); split: -11.68%, +0.00% MaxWaves: 4369 -> 4418 (+1.12%) Instrs: 97771 -> 85903 (-12.14%); split: -12.14%, +0.00% Latency: 767482 -> 708545 (-7.68%); split: -7.97%, +0.29% InvThroughput: 280017 -> 235744 (-15.81%) VClause: 2270 -> 2090 (-7.93%); split: -8.50%, +0.57% SClause: 5185 -> 5012 (-3.34%); split: -3.86%, +0.52% Copies: 8328 -> 7555 (-9.28%); split: -9.35%, +0.07% Branches: 1143 -> 1113 (-2.62%) PreSGPRs: 13816 -> 12725 (-7.90%); split: -7.92%, +0.02% PreVGPRs: 9707 -> 8270 (-14.80%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15108>	2022-03-01 12:28:47 +01:00
Rhys Perry	f800af2231	ac/nir: remove TCS nir_var_shader_out memory barrier nir_var_shader_out writes are only used for later TES invocations, so I don't think there's any need for the TCS workgroup to wait for them. fossil-db (Sienna Cichlid): Totals from 1691 (1.04% of 162293) affected shaders: Instrs: 710699 -> 709008 (-0.24%) CodeSize: 3830168 -> 3823404 (-0.18%) Latency: 3396997 -> 3007934 (-11.45%) InvThroughput: 1212094 -> 1082823 (-10.67%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15195>	2022-03-01 11:02:43 +00:00
Caio Oliveira	7460199a2f	intel/compiler: Lower Task/Mesh I/O before SIMD specific lowering These are the same for all variants, so just lower it before cloning the nir_shader for each of them. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15019>	2022-03-01 07:35:13 +00:00
Danylo Piliaiev	549e861dc1	turnip: Implement VK_EXT_physical_device_drm Copied from ANV and V3DV. v1. Fix a build error for clang "unannotated fall-through between switch labels" ( Hyunjun Ko <zzoon.ko@igalia.com> ) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6011 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14971>	2022-03-01 07:10:40 +00:00
Pierre-Eric Pelloux-Prayer	bb6ba8f21f	radeonsi/drirc: use force_gl_vendor for Maya Otherwise OpenCL initialization fails with "unknown vendor id 0". Reviewed-by: Qiang Yu <yuq825@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15151>	2022-03-01 07:43:26 +01:00
Ilia Mirkin	d3196bac51	nouveau: add dEQP/GLCTS run failure info for GF108/GT215 I happened to have these plugged in. Ran them against mesa 21.3 and recent VK-GL-CTS tree (shortly after vulkan-cts-1.2.8). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14797>	2022-02-28 22:36:31 -05:00
Nanley Chery	dc05615ec1	Revert "anv: Require the local heap for CCS on XeHP" This reverts commit `382f6ccda8`. The spec requires that all color images created with the same tiling (and a few other properties) support the same memoryTypeBits. So this wasn't a valid change. It also wasn't necessary - we already have a mechanism in anv_BindImageMemory2 for disabling compression if the BO doesn't support it. With this, XeHP passes the tests in dEQP-VK.memory.requirements.*tiling_optimal Fixes: `382f6ccd` ("anv: Require the local heap for CCS on XeHP") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15068>	2022-03-01 00:02:51 +00:00
Nanley Chery	203c8be09f	anv: Add a perf warning in anv_BindImageMemory2 It reports: "BO lacks implicit CCS. Disabling the CCS aux usage." Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15068>	2022-03-01 00:02:51 +00:00
Nanley Chery	74e446b45b	anv: Fall back to HiZ when disabling CCS on HiZ+CCS When an image configured for HIZ_CCS/HIZ_CCS_WT is bound to a BO lacking implicit CCS, we disable any compression it may have had. Such images are still compatible with ISL_AUX_USAGE_HIZ however. Fall back to that aux usage to retain the performance benefit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15068>	2022-03-01 00:02:51 +00:00
Nanley Chery	ffbde42b93	anv: Don't disable HiZ/MCS in anv_BindImageMemory2 When an image is bound to a BO lacking implicit CCS, we disable any compression it may have had. This is unnecessary in the cases where the compression type doesn't depend on the BO having implicit CCS support. Avoid this disabling for ISL_AUX_USAGE_MCS and ISL_AUX_USAGE_HIZ. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15068>	2022-03-01 00:02:51 +00:00
Connor Abbott	ed9a0d48a9	ir3: Use isam for bindless images In the bindless case, we don't have to keep any shadow descriptors and can just reuse the IBO descriptor as a texture descriptor. Now that we're emitting the swizzle we can just flip this on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	06485f7d3d	tu: Call nir_opt_access This adds some small optimizations, and enables lowering to isam in more cases where the app didn't specify readonly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	58d72f45e5	ir3/nir: Fix 1d array readonly images ncoords includes the array index, and the NIR source has the array index as its last component, so we have to insert the extra y coordinate in the middle in this case. Fixes: `0bb0cac` ("freedreno/ir3: handle image buffer") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	21ac044c3e	ir3: Don't always set bindless_tex with readonly images Fixes: `274f381` ("ir3: Plumb through bindless support") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	bb1e0eba08	freedreno/fdl: Set swizzle on storage descriptor It appears to be unused by ldib/stib, but it will let us use isam on IBO descriptors for bindless images. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	00be8c4619	freedreno: Replace A6XX_IBO with A6XX_TEX_CONST Since these were reverse-engineered, it's become clear that IBO descriptors are just a subset of texture descriptors, and bindless reads of readonly images actually use isam on the IBO descriptor, further confirming that the two are always compatible, even if not all of the texture fields exist for IBOs. It's pointless to have a separate type for IBOs, and just leads to things getting out-of-sync unnecessarily which has already happened. Just remove it and use TEX_CONST insted. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Connor Abbott	e1c4c2ac60	ir3: Use CAN_REORDER instead of NON_WRITEABLE CAN_REORDER takes volatile into account, and is closer to what we actually require to use texture instructions, which is that we can arbitrarily reorder loads. Fixes: `aa93896` ("freedreno/ir3: adjust condition for when to use ldib") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15114>	2022-02-28 23:33:22 +00:00
Danylo Piliaiev	95fabff8de	turnip: Set drmFormatModifierTilingFeatures From Vulkan spec for VkDrmFormatModifierProperties2EXT: "drmFormatModifierTilingFeatures is a bitmask of VkFormatFeatureFlagBits that are supported by any image created with format and drmFormatModifier." "The returned drmFormatModifierTilingFeatures must contain at least one bit." "Therefore, if the returned drmFormatModifier is DRM_FORMAT_MOD_LINEAR, then drmFormatModifierPlaneCount must equal the format planecount, and drmFormatModifierTilingFeatures must be identical to the VkFormatProperties2::linearTilingFeatures returned in the same pNext chain." Relevant tests: dEQP-VK.drm_format_modifiers.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15032>	2022-02-28 22:53:40 +00:00
Mike Blumenkrantz	473f488639	zink: add layer asserts for 3d imageview creation make sure there's no other mishaps here in the future Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15172>	2022-02-28 17:42:58 +00:00
Mike Blumenkrantz	8e67928862	zink: more accurately clamp 3d fb surfaces to corresponding 2d target if more than 1 layer is being bound, this is an array, otherwise it's just regular 2d Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15172>	2022-02-28 17:42:58 +00:00
Mike Blumenkrantz	59b0105e65	zink: clamp 3d/array shader images to lower dimensionality using layer counts this creates the view type expected by the shader instead of doing weird stuff like trying to create a 3D imageview with layers > 1 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15172>	2022-02-28 17:42:58 +00:00
Mike Blumenkrantz	26d05e5a38	zink: directly create surfaces for shader images avoid the implicit clamping of fb surfaces in zink_create_surface() in order to provide more granularity no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15172>	2022-02-28 17:42:58 +00:00
Mike Blumenkrantz	69ec429c00	zink: restrict clear flushing on sampler/image bind to compute binds this is otherwise going to be handled on the next renderpass start Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15172>	2022-02-28 17:42:58 +00:00
Mike Blumenkrantz	b7b494299d	zink: use VK_EXT_depth_clip_control when available this saves a few ALUs in vertex stages Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15174>	2022-02-28 17:21:29 +00:00
Mike Blumenkrantz	95708c13ee	glx/drisw: handle GL_RESET_NOTIFICATION_STRATEGY fixes (llvmpipe): KHR-NoContext.gl45.robustness.lose_context_on_reset Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15061>	2022-02-28 16:10:00 +00:00
Mike Blumenkrantz	fba13486df	zink: update psiz handling to fix xfb output now when gl_PointSize and gl_PointSizeMESA are both present, the former will be used for xfb with a new location and the latter will be exported by the shader fixes (zink): GTF-GL46.gtf30.GL3Tests.transform_feedback.transform_feedback_builtins Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15184>	2022-02-28 15:42:20 +00:00
Mike Blumenkrantz	b28cff9f4a	nir/lower_psiz_mov: stop clobbering existing exports for this pass to work with xfb, the original value in the shader must be preserved when xfb is active, and the driver must export only the newly created output Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15184>	2022-02-28 15:42:19 +00:00
Mike Blumenkrantz	3267417c22	nir/lower_psiz: create the store instruction more accurately creating this at the start of the shader means it will get optimized out when the pass is used to overwrite existing psiz values, and creating it at the end means it will get optimized out in geometry shaders, so instead just walk the instructions and create another store right after the existing one Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15184>	2022-02-28 15:42:19 +00:00
Jonathan Gray	6250a3bc18	util: use correct type in sysctl argument Fixes build on OpenBSD/macppc powerpc error: incompatible pointer types passing 'int ' to parameter of type 'size_t ' (aka 'unsigned long *') [-Werror,-Wincompatible-pointer-types] Fixes: `01bd21eef8` ("gallium: Import Dennis Smit cpu detection code.") Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6511>	2022-02-28 14:28:23 +00:00
Jonathan Gray	0536b69133	util: fix build with clang 10 on mips64 On mips64, the compiler does not allow use of non-zero argument with __builtin_frame_address(). However, the returned frame address is only used when PIPE_ARCH_X86 is defined. The compile error can be avoided by making #ifdef PIPE_ARCH_X86 cover the getting of frame address too. The argument checking of __builtin_frame_address() has been present as a debug assert in clang 8. In clang 10, there is a proper runtime check for the argument. This is why the build has not failed before. Fixes: `dc94a0506f` ("gallium: Do not add -Wframe-address option for gcc <= 4.4.") from Visa Hankala Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6511>	2022-02-28 14:28:23 +00:00
Jonathan Gray	f12c107b03	util/u_atomic: fix build on clang archs without 64-bit atomics Make this build on clang architectures that don't have 64-bit atomic instructions. Clang doesn't allow redeclaration (and therefore redefinition) of the __sync_* builtins. Use #pragma redefine_extname to work around that restriction. Clang also turns __sync_add_and_fetch into __sync_fetch_and_add (and __sync_sub_and_fetch into __sync_fetch_and_sub) in certain cases, so provide these functions as well. Fixes: `a6a38a038b` ("util/u_atomic: provide 64bit atomics where they're missing") patch from Mark Kettenis Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6511>	2022-02-28 14:28:23 +00:00
Daniel Stone	d07df90bf4	Revert "CI: Disable Panfrost T720 jobs" This reverts commit `35209b94a6`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15191>	2022-02-28 14:08:34 +00:00
Daniel Stone	bd55458304	Revert "CI: Disable panfrost-t760" This reverts commit `b9b444e0b8`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15191>	2022-02-28 14:08:34 +00:00
Adrián Larumbe	6d0824abcc	panfrost: fix segfault in pandecode The structure wrapped around the rb tree node was being freed, but not the node itself, which caused a segmentation fault when accessing its parent node. Add rb tree node remove call to fix it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15188>	2022-02-28 12:40:32 +00:00
Daniel Stone	af0f9a31b3	CI: Disable Panfrost T720 jobs Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15189>	2022-02-28 07:08:38 +00:00
Daniel Stone	114e48e923	CI: Disable panfrost-t760 The DUTs are extremely tempremental for some reason. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15186>	2022-02-27 20:26:17 +00:00
Erik Faye-Lund	f2dee2ea55	docs: update irc channel According to the virglrenderer README.rst[1] file, #virgil3d has moved from FreeNode to OFTC. Let's update our link as well. While we're at it, use a proper link as well. [1]: https://gitlab.freedesktop.org/virgl/virglrenderer/-/blob/master/README.rst Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	5558f7de59	docs: mark virgl gles2 renderer as done Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	5cce0d1d9d	docs: update virgl description Since this text was written, VirGL has become a shipping, production quality solution. It's no longer a research project. Let's update the text to reflect that. While we're at it, let's drop the project from the page title, as this is no longer the docs for the entire project. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	ca4ad9c2cb	docs: link to gitlab instead of cgit While we're at it, let's update the releasing article as well. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	32de503b02	docs: master -> main Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	76ee3eeae9	docs: Virgl -> VirGL The name used for this project is usually stylized as VirGL instead of "Virgil" or "Virgil 3D" these days. Let's be consistent. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	4ebd3b041b	docs: qemu -> QEMU This is the official syling of the name, let's use that instead of lower-case for consistensy. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:41 +00:00
Erik Faye-Lund	5494b68694	docs: add missing get This sentence doesn't make sense without a 'get' or something similar here. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:40 +00:00
Erik Faye-Lund	2015d458b6	docs: remove a few repeated words It doesn't make sense to repeat these, let's fix that. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:40 +00:00
Erik Faye-Lund	6897266ce0	docs: import virgl docs The docuentation has been imported verbatim from https://virgil3d.github.io/. Acked-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13836>	2022-02-27 11:09:40 +00:00
Mike Blumenkrantz	e1964e1dde	zink: don't free non-fbfetch dsl structs when switching to fbfetch this triggers invalid access when recycling in-flight non-fbfetch sets cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	03a80490a4	zink: free push descriptor pools on deinit these are owned by the context, so destroy them when the context requests destruction cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	698ae34844	zink: fix cached descriptor set invalidation for array bindings need to iterate over the descriptors in the binding to invalidate the whole thing here ================================================================= ==546534==ERROR: AddressSanitizer: heap-use-after-free on address 0x61a0000ae6c0 at pc 0x7fe20e26fd9d bp 0x7ffd92be6bc0 sp 0x7ffd92be6bb8 READ of size 8 at 0x61a0000ae6c0 thread T0 #0 0x7fe20e26fd9c in zink_descriptor_set_refs_clear ../src/gallium/drivers/zink/zink_descriptors.c:950 #1 0x7fe20e401304 in zink_destroy_surface ../src/gallium/drivers/zink/zink_surface.c:340 #2 0x7fe20e21311b in zink_surface_reference ../src/gallium/drivers/zink/zink_surface.h:106 #3 0x7fe20e21a5b9 in zink_sampler_view_destroy ../src/gallium/drivers/zink/zink_context.c:835 #4 0x7fe20c41d35f in tc_sampler_view_destroy ../src/gallium/auxiliary/util/u_threaded_context.c:1848 #5 0x7fe20e210ff7 in pipe_sampler_view_reference ../src/gallium/auxiliary/util/u_inlines.h:216 #6 0x7fe20e22d592 in zink_set_sampler_views ../src/gallium/drivers/zink/zink_context.c:1532 #7 0x7fe20c41a3d8 in tc_call_set_sampler_views ../src/gallium/auxiliary/util/u_threaded_context.c:1393 #8 0x7fe20c411706 in tc_batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:211 #9 0x7fe20c4124ba in _tc_sync ../src/gallium/auxiliary/util/u_threaded_context.c:362 #10 0x7fe20c42b728 in tc_destroy ../src/gallium/auxiliary/util/u_threaded_context.c:4250 #11 0x7fe20b65176a in st_destroy_context_priv ../src/mesa/state_tracker/st_context.c:387 #12 0x7fe20b65669f in st_destroy_context ../src/mesa/state_tracker/st_context.c:1009 #13 0x7fe20b7055ab in st_context_destroy ../src/mesa/state_tracker/st_manager.c:944 #14 0x7fe20a9c75bd in dri_destroy_context ../src/gallium/frontends/dri/dri_context.c:256 #15 0x7fe20a9d4bef in driDestroyContext ../src/gallium/frontends/dri/dri_util.c:534 #16 0x7fe22361f25c in drisw_destroy_context ../src/glx/drisw_glx.c:429 #17 0x7fe223625d95 in glXDestroyContext ../src/glx/glxcmds.c:523 #18 0x7fe22636aaeb in glXDestroyContext /home/zmike/src/libglvnd-v1.3.2/src/GLX/libglx.c:332 #19 0x7fe2269d9e7d in glXDestroyContext /home/zmike/src/libglvnd-v1.3.2/src/GL/g_libglglxwrapper.c:384 #20 0x41b88a in tcu::lnx::x11::glx::GlxRenderContext::~GlxRenderContext() /home/zmike/src/VK-GL-CTS/framework/platform/lnx/X11/tcuLnxX11GlxPlatform.cpp:734 #21 0x41b8e9 in tcu::lnx::x11::glx::GlxRenderContext::~GlxRenderContext() /home/zmike/src/VK-GL-CTS/framework/platform/lnx/X11/tcuLnxX11GlxPlatform.cpp:735 #22 0x2323aa7 in deqp::gles31::Context::destroyRenderContext() /home/zmike/src/VK-GL-CTS/modules/gles31/tes31Context.cpp:77 #23 0x2323969 in deqp::gles31::Context::~Context() /home/zmike/src/VK-GL-CTS/modules/gles31/tes31Context.cpp:55 #24 0x232278e in deqp::gles31::TestPackage::deinit() /home/zmike/src/VK-GL-CTS/modules/gles31/tes31TestPackage.cpp:102 #25 0x2c866c2 in tcu::DefaultHierarchyInflater::leaveTestPackage(tcu::TestPackage) /home/zmike/src/VK-GL-CTS/framework/common/tcuTestHierarchyIterator.cpp:75 #26 0x2c87058 in tcu::TestHierarchyIterator::next() /home/zmike/src/VK-GL-CTS/framework/common/tcuTestHierarchyIterator.cpp:252 #27 0x2c365da in tcu::TestSessionExecutor::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:122 #28 0x2c00b0c in tcu::App::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuApp.cpp:221 #29 0x4141b7 in main /home/zmike/src/VK-GL-CTS/framework/platform/tcuMain.cpp:58 #30 0x7fe2263e155f in __libc_start_call_main (/lib64/libc.so.6+0x2d55f) #31 0x7fe2263e160b in __libc_start_main_impl (/lib64/libc.so.6+0x2d60b) #32 0x413fa4 in _start (/home/zmike/src/VK-GL-CTS/build/external/openglcts/modules/glcts+0x413fa4) 0x61a0000ae6c0 is located 64 bytes inside of 1328-byte region [0x61a0000ae680,0x61a0000aebb0) freed by thread T0 here: #0 0x7fe226cb6627 in free (/usr/lib64/libasan.so.6+0xae627) #1 0x7fe20aab1751 in unsafe_free ../src/util/ralloc.c:302 #2 0x7fe20aab16c8 in unsafe_free ../src/util/ralloc.c:295 #3 0x7fe20aab13c3 in ralloc_free ../src/util/ralloc.c:265 #4 0x7fe20e269234 in descriptor_pool_free ../src/gallium/drivers/zink/zink_descriptors.c:286 #5 0x7fe20e26937d in descriptor_pool_delete ../src/gallium/drivers/zink/zink_descriptors.c:296 #6 0x7fe20e26ff53 in zink_descriptor_pool_reference ../src/gallium/drivers/zink/zink_descriptors.c:967 #7 0x7fe20e270db2 in zink_descriptor_program_deinit ../src/gallium/drivers/zink/zink_descriptors.c:1071 #8 0x7fe20e3b6536 in zink_destroy_gfx_program ../src/gallium/drivers/zink/zink_program.c:695 #9 0x7fe20e1eaaf9 in zink_gfx_program_reference ../src/gallium/drivers/zink/zink_program.h:242 #10 0x7fe20e20d386 in zink_shader_free ../src/gallium/drivers/zink/zink_compiler.c:2099 #11 0x7fe20e3b9f0b in zink_delete_shader_state ../src/gallium/drivers/zink/zink_program.c:1074 #12 0x7fe20c3e29ad in util_shader_reference ../src/gallium/auxiliary/util/u_live_shader_cache.c:188 #13 0x7fe20e3ba11e in zink_delete_cached_shader_state ../src/gallium/drivers/zink/zink_program.c:1093 #14 0x7fe20c41709e in tc_call_delete_fs_state ../src/gallium/auxiliary/util/u_threaded_context.c:998 #15 0x7fe20c411706 in tc_batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:211 #16 0x7fe20c4124ba in _tc_sync ../src/gallium/auxiliary/util/u_threaded_context.c:362 #17 0x7fe20c423683 in tc_flush ../src/gallium/auxiliary/util/u_threaded_context.c:3003 #18 0x7fe20b62d996 in st_flush ../src/mesa/state_tracker/st_cb_flush.c:60 #19 0x7fe20b62dbe3 in st_glFlush ../src/mesa/state_tracker/st_cb_flush.c:94 #20 0x7fe20ae4bded in _mesa_make_current ../src/mesa/main/context.c:1493 #21 0x7fe20ae49702 in _mesa_free_context_data ../src/mesa/main/context.c:1187 #22 0x7fe20b65668b in st_destroy_context ../src/mesa/state_tracker/st_context.c:1005 #23 0x7fe20b7055ab in st_context_destroy ../src/mesa/state_tracker/st_manager.c:944 #24 0x7fe20a9c75bd in dri_destroy_context ../src/gallium/frontends/dri/dri_context.c:256 #25 0x7fe20a9d4bef in driDestroyContext ../src/gallium/frontends/dri/dri_util.c:534 #26 0x7fe22361f25c in drisw_destroy_context ../src/glx/drisw_glx.c:429 #27 0x7fe223625d95 in glXDestroyContext ../src/glx/glxcmds.c:523 #28 0x7fe22636aaeb in glXDestroyContext /home/zmike/src/libglvnd-v1.3.2/src/GLX/libglx.c:332 #29 0x7fe2269d9e7d in glXDestroyContext /home/zmike/src/libglvnd-v1.3.2/src/GL/g_libglglxwrapper.c:384 previously allocated by thread T0 here: #0 0x7fe226cb691f in __interceptor_malloc (/usr/lib64/libasan.so.6+0xae91f) #1 0x7fe20aab0c81 in ralloc_size ../src/util/ralloc.c:120 #2 0x7fe20aab0e33 in rzalloc_size ../src/util/ralloc.c:153 #3 0x7fe20aab12c8 in rzalloc_array_size ../src/util/ralloc.c:233 #4 0x7fe20e26c76d in allocate_desc_set ../src/gallium/drivers/zink/zink_descriptors.c:657 #5 0x7fe20e26e9cb in zink_descriptor_set_get ../src/gallium/drivers/zink/zink_descriptors.c:840 #6 0x7fe20e2747aa in zink_descriptors_update ../src/gallium/drivers/zink/zink_descriptors.c:1424 #7 0x7fe20e36fc48 in void zink_draw<(zink_multidraw)1, (zink_dynamic_state)2, true, false>(pipe_context, pipe_draw_info const, unsigned int, pipe_draw_indirect_info const, pipe_draw_start_count_bias const, unsigned int, pipe_vertex_state, unsigned int) ../src/gallium/drivers/zink/zink_draw.cpp:788 #8 0x7fe20e29166d in zink_draw_vbo<(zink_multidraw)1, (zink_dynamic_state)2, true> ../src/gallium/drivers/zink/zink_draw.cpp:907 #9 0x7fe20c424982 in tc_call_draw_single ../src/gallium/auxiliary/util/u_threaded_context.c:3155 #10 0x7fe20c411706 in tc_batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:211 #11 0x7fe20c4124ba in _tc_sync ../src/gallium/auxiliary/util/u_threaded_context.c:362 #12 0x7fe20c41f7a9 in tc_texture_map ../src/gallium/auxiliary/util/u_threaded_context.c:2279 #13 0x7fe20b630757 in pipe_texture_map_3d ../src/gallium/auxiliary/util/u_inlines.h:572 #14 0x7fe20b6341f6 in st_ReadPixels ../src/mesa/state_tracker/st_cb_readpixels.c:546 #15 0x7fe20b42fea7 in read_pixels ../src/mesa/main/readpix.c:1178 #16 0x7fe20b42fea7 in _mesa_ReadnPixelsARB ../src/mesa/main/readpix.c:1195 #17 0x7fe20b42ffc0 in _mesa_ReadPixels ../src/mesa/main/readpix.c:1210 #18 0x2a6d094 in glu::readPixels(glu::RenderContext const&, int, int, tcu::PixelBufferAccess const&) /home/zmike/src/VK-GL-CTS/framework/opengl/gluPixelTransfer.cpp:61 #19 0x29eaa06 in deqp::gls::ShaderExecUtil::FragmentOutExecutor::execute(int, void const* const, void const) /home/zmike/src/VK-GL-CTS/modules/glshared/glsShaderExecUtil.cpp:677 #20 0x25a600b in iterate /home/zmike/src/VK-GL-CTS/modules/gles31/functional/es31fOpaqueTypeIndexingTests.cpp:585 #21 0x2322b53 in deqp::gles31::TestCaseWrapper<deqp::gles31::TestPackage>::iterate(tcu::TestCase) /home/zmike/src/VK-GL-CTS/modules/gles31/tes31TestCaseWrapper.hpp:86 #22 0x2c376fd in tcu::TestSessionExecutor::iterateTestCase(tcu::TestCase*) /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:302 #23 0x2c366e3 in tcu::TestSessionExecutor::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:139 #24 0x2c00b0c in tcu::App::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuApp.cpp:221 #25 0x4141b7 in main /home/zmike/src/VK-GL-CTS/framework/platform/tcuMain.cpp:58 #26 0x7fe2263e155f in __libc_start_call_main (/lib64/libc.so.6+0x2d55f) cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	62b8daa889	zink: set shader key size to 0 for non-generated tcs Test case 'dEQP-GLES31.functional.shaders.builtin_functions.common.modf.vec2_mediump_tess_control'.. ================================================================= ==539161==ERROR: AddressSanitizer: unknown-crash on address 0x60400008cfef at pc 0x7fffdb47b2d6 bp 0x7fffffffa490 sp 0x7fffffffa488 READ of size 4 at 0x60400008cfef thread T0 #0 0x7fffdb47b2d5 in XXH_read32 ../src/util/xxhash.h:531 #1 0x7fffdb47bfbf in XXH_readLE32 ../src/util/xxhash.h:608 #2 0x7fffdb47bfbf in XXH_readLE32_align ../src/util/xxhash.h:620 #3 0x7fffdb47bfbf in XXH32_endian_align ../src/util/xxhash.h:797 #4 0x7fffdb47bfbf in XXH32 ../src/util/xxhash.h:831 #5 0x7fffdb480b49 in _mesa_hash_data ../src/util/hash_table.c:631 #6 0x7fffded8c10a in shader_module_hash ../src/gallium/drivers/zink/zink_program.c:82 #7 0x7fffded8cad8 in get_shader_module_for_stage ../src/gallium/drivers/zink/zink_program.c:144 #8 0x7fffded8cf64 in update_gfx_shader_modules ../src/gallium/drivers/zink/zink_program.c:182 #9 0x7fffded8dcc2 in zink_update_gfx_program ../src/gallium/drivers/zink/zink_program.c:257 #10 0x7fffdec63463 in update_gfx_program ../src/gallium/drivers/zink/zink_draw.cpp:223 #11 0x7fffded7aab9 in update_gfx_pipeline<true> ../src/gallium/drivers/zink/zink_draw.cpp:445 #12 0x7fffded4a88b in void zink_draw<(zink_multidraw)1, (zink_dynamic_state)2, true, false>(pipe_context, pipe_draw_info const, unsigned int, pipe_draw_indirect_info const, pipe_draw_start_count_bias const, unsigned int, pipe_vertex_state, unsigned int) ../src/gallium/drivers/zink/zink_draw.cpp:777 #13 0x7fffdec6c5b2 in zink_draw_vbo<(zink_multidraw)1, (zink_dynamic_state)2, true> ../src/gallium/drivers/zink/zink_draw.cpp:907 #14 0x7fffdcdff982 in tc_call_draw_single ../src/gallium/auxiliary/util/u_threaded_context.c:3155 #15 0x7fffdcdec706 in tc_batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:211 #16 0x7fffdcded4ba in _tc_sync ../src/gallium/auxiliary/util/u_threaded_context.c:362 #17 0x7fffdcdfa492 in tc_buffer_map ../src/gallium/auxiliary/util/u_threaded_context.c:2251 #18 0x7fffdb7f2439 in pipe_buffer_map_range ../src/gallium/auxiliary/util/u_inlines.h:393 #19 0x7fffdb7f56c2 in _mesa_bufferobj_map_range ../src/mesa/main/bufferobj.c:488 #20 0x7fffdb803300 in map_buffer_range ../src/mesa/main/bufferobj.c:3734 #21 0x7fffdb8036e7 in _mesa_MapBufferRange ../src/mesa/main/bufferobj.c:3817 #22 0x29ecb02 in deqp::gls::ShaderExecUtil::BufferIoExecutor::readOutputBuffer(void const, int) /home/zmike/src/VK-GL-CTS/modules/glshared/glsShaderExecUtil.cpp:1069 #23 0x29ee499 in deqp::gls::ShaderExecUtil::TessControlExecutor::execute(int, void const const, void const) /home/zmike/src/VK-GL-CTS/modules/glshared/glsShaderExecUtil.cpp:1390 #24 0x246264c in deqp::gles31::Functional::CommonFunctionCase::iterate() /home/zmike/src/VK-GL-CTS/modules/gles31/functional/es31fShaderCommonFunctionTests.cpp:400 #25 0x2322b53 in deqp::gles31::TestCaseWrapper<deqp::gles31::TestPackage>::iterate(tcu::TestCase) /home/zmike/src/VK-GL-CTS/modules/gles31/tes31TestCaseWrapper.hpp:86 #26 0x2c376fd in tcu::TestSessionExecutor::iterateTestCase(tcu::TestCase) /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:302 #27 0x2c366e3 in tcu::TestSessionExecutor::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:139 #28 0x2c00b0c in tcu::App::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuApp.cpp:221 #29 0x4141b7 in main /home/zmike/src/VK-GL-CTS/framework/platform/tcuMain.cpp:58 #30 0x7ffff6dbc55f in __libc_start_call_main (/lib64/libc.so.6+0x2d55f) #31 0x7ffff6dbc60b in __libc_start_main_impl (/lib64/libc.so.6+0x2d60b) #32 0x413fa4 in _start (/home/zmike/src/VK-GL-CTS/build/external/openglcts/modules/glcts+0x413fa4) 0x60400008cff1 is located 0 bytes to the right of 33-byte region [0x60400008cfd0,0x60400008cff1) allocated by thread T0 here: #0 0x7ffff769191f in __interceptor_malloc (/usr/lib64/libasan.so.6+0xae91f) #1 0x7fffded8c608 in get_shader_module_for_stage ../src/gallium/drivers/zink/zink_program.c:115 #2 0x7fffded8cf64 in update_gfx_shader_modules ../src/gallium/drivers/zink/zink_program.c:182 #3 0x7fffded8dcc2 in zink_update_gfx_program ../src/gallium/drivers/zink/zink_program.c:257 #4 0x7fffdec63463 in update_gfx_program ../src/gallium/drivers/zink/zink_draw.cpp:223 #5 0x7fffded7aab9 in update_gfx_pipeline<true> ../src/gallium/drivers/zink/zink_draw.cpp:445 #6 0x7fffded4a88b in void zink_draw<(zink_multidraw)1, (zink_dynamic_state)2, true, false>(pipe_context, pipe_draw_info const, unsigned int, pipe_draw_indirect_info const, pipe_draw_start_count_bias const, unsigned int, pipe_vertex_state, unsigned int) ../src/gallium/drivers/zink/zink_draw.cpp:777 #7 0x7fffdec6c5b2 in zink_draw_vbo<(zink_multidraw)1, (zink_dynamic_state)2, true> ../src/gallium/drivers/zink/zink_draw.cpp:907 #8 0x7fffdcdff982 in tc_call_draw_single ../src/gallium/auxiliary/util/u_threaded_context.c:3155 #9 0x7fffdcdec706 in tc_batch_execute ../src/gallium/auxiliary/util/u_threaded_context.c:211 #10 0x7fffdcded4ba in _tc_sync ../src/gallium/auxiliary/util/u_threaded_context.c:362 #11 0x7fffdcdfa492 in tc_buffer_map ../src/gallium/auxiliary/util/u_threaded_context.c:2251 #12 0x7fffdb7f2439 in pipe_buffer_map_range ../src/gallium/auxiliary/util/u_inlines.h:393 #13 0x7fffdb7f56c2 in _mesa_bufferobj_map_range ../src/mesa/main/bufferobj.c:488 #14 0x7fffdb803300 in map_buffer_range ../src/mesa/main/bufferobj.c:3734 #15 0x7fffdb8036e7 in _mesa_MapBufferRange ../src/mesa/main/bufferobj.c:3817 #16 0x29ecb02 in deqp::gls::ShaderExecUtil::BufferIoExecutor::readOutputBuffer(void* const, int) /home/zmike/src/VK-GL-CTS/modules/glshared/glsShaderExecUtil.cpp:1069 #17 0x29ee499 in deqp::gls::ShaderExecUtil::TessControlExecutor::execute(int, void const const, void const) /home/zmike/src/VK-GL-CTS/modules/glshared/glsShaderExecUtil.cpp:1390 #18 0x246264c in deqp::gles31::Functional::CommonFunctionCase::iterate() /home/zmike/src/VK-GL-CTS/modules/gles31/functional/es31fShaderCommonFunctionTests.cpp:400 #19 0x2322b53 in deqp::gles31::TestCaseWrapper<deqp::gles31::TestPackage>::iterate(tcu::TestCase) /home/zmike/src/VK-GL-CTS/modules/gles31/tes31TestCaseWrapper.hpp:86 #20 0x2c376fd in tcu::TestSessionExecutor::iterateTestCase(tcu::TestCase*) /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:302 #21 0x2c366e3 in tcu::TestSessionExecutor::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuTestSessionExecutor.cpp:139 #22 0x2c00b0c in tcu::App::iterate() /home/zmike/src/VK-GL-CTS/framework/common/tcuApp.cpp:221 #23 0x4141b7 in main /home/zmike/src/VK-GL-CTS/framework/platform/tcuMain.cpp:58 #24 0x7ffff6dbc55f in __libc_start_call_main (/lib64/libc.so.6+0x2d55f) cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	861fc10bfc	zink: skip extra descriptor lookups for images during barrier updates Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	cd7ea80e70	zink: add layout to sampler descriptor hash this can have more than one value, so avoid stale cache entries Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	7431c30999	zink: fix typo for image descriptor rebinds Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Mike Blumenkrantz	a179977b8e	zink: update descriptor refs after starting renderpass this ensures that swapchain images will have been acquired before potentially accessing swapchain images bound as descriptors fixes caselist like: dEQP-GLES31.functional.fbo.color.texcubearray.r8ui dEQP-GLES31.functional.primitive_bounding_box.blit_fbo.blit_default_to_fbo Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15173>	2022-02-26 15:26:08 +00:00
Jonathan Gray	f0398180a5	radv: use MAJOR_IN_SYSMACROS for sysmacros.h include fixes build on OpenBSD ../src/amd/vulkan/radv_device.c:35:10: fatal error: 'sys/sysmacros.h' file not found Fixes: `7aaa54feb5` ("radv: implement VK_EXT_physical_device_drm") Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13448>	2022-02-26 01:00:29 +00:00
Jonathan Gray	afece589dc	util: fix util_cpu_detect_once() build on OpenBSD Correct type for sysctl argument to fix the build. ../src/util/u_cpu_detect.c:631:29: error: incompatible pointer types passing 'int ' to parameter of type 'size_t ' (aka 'unsigned long *') [-Werror,-Wincompatible-pointer-types] sysctl(mib, 2, &ncpu, &len, NULL, 0); ^~~~ Fixes: `5623c75e40` ("util: Fix setting nr_cpus on some BSD variants") Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13448>	2022-02-26 01:00:29 +00:00
Jonathan Gray	623ff4ec42	util: fix u_print.cpp build on OpenBSD move include so va_list will be picked up via stdarg.h In file included from ../src/util/u_printf.cpp:24: ../src/util/u_printf.h:43:41: error: unknown type name 'va_list'; did you mean '__va_list'? size_t u_printf_length(const char *fmt, va_list untouched_args); ^~~~~~~ __va_list /usr/include/machine/_types.h:126:27: note: '__va_list' declared here typedef __builtin_va_list __va_list; ^ and add includes to u_printf.h as suggested by Ilia Mirkin stdarg.h for va_list and stddef.h for size_t Signed-off-by: Jonathan Gray <jsg@jsg.id.au> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13448>	2022-02-26 01:00:29 +00:00
Jonathan Gray	7d609431d4	util: unbreak non-linux mips64 build Put linux specific path inside an ifdef. Unbreaks mips64 build on OpenBSD and likely other systems without Elf64_auxv_t. Fixes: `88b234d7a7` ("gallivm: add basic mips64 support and set mcpu to mips64r5 on ls3a4000") Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15166>	2022-02-26 10:47:43 +11:00
Marcin Ślusarz	e5c39bc427	intel/compiler: optimize flat inputs mask calculation Don't bother looking at urb if variable is not flat. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15169>	2022-02-25 22:34:22 +00:00
Marcin Ślusarz	e2cb562dd1	intel/compiler: ignore per-primitive attrs when calculating flat input mask If we say that per-primitive attributes are flat (which is communicated by 3DSTATE_SBE.ConstantInterpolationEnable), GPU freaks out and applies it to other (non-flat) attributes. Fixes: `be89ea3231` ("intel/compiler: Handle per-primitive inputs in FS") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15169>	2022-02-25 22:34:22 +00:00
Alyssa Rosenzweig	216da26b3f	pan/va: Add TEX_FETCH assembler case Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:03 +00:00
Alyssa Rosenzweig	794836daf0	pan/va: Handle sr_write_count in the disassembler Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	eee6dad0c9	pan/va: Fix definitions of TEX_SINGLE and TEX_FETCH Fix the definitions of the basic texturing instructions. In particular, a register format and a write mask were previously missing, as well as incorrect handling of staging registers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	a58807fa95	pan/va: Don't use staging index as a sideband It would cause us to get incorrect disassembly when the syntax is flipped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	49a4cc6af8	pan/va: Handle extended staging counts in assembler Needed for texturing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	142ba9fea6	pan/va: Allow forcing enums for 1-bit modifiers Ocassionally the 0 value has a meaningful value that's not meaningfully default, so we want an enum to encode both possible states. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	20fce28dfd	pan/va: Add MUX.v2i16 and MUX.v4i8 opcodes Basically identical to MUX.i32, slight differences in opcode and swizzling only. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Alyssa Rosenzweig	97f8fad37b	pan/va: Remove incorrect TEX test cases Not close enough to salvage; TEX is going to be redefined. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15182>	2022-02-25 21:53:02 +00:00
Emma Anholt	b1f349dff4	nir: Allow the _replicates opcodes to have num_components != 4. This required relaxing a core NIR assertion which I don't think is doing any important validation. The shader-db effects here are small, but they're important for avoiding a regression when we start doing per-component DCE in opt_shrink_vectors (https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12468) softpipe shader-db: total instructions in shared programs: 2859777 -> 2859454 (-0.01%) instructions in affected programs: 18881 -> 18558 (-1.71%) total temps in shared programs: 293994 -> 293914 (-0.03%) temps in affected programs: 418 -> 338 (-19.14%) i915g: total instructions in shared programs: 407562 -> 407544 (<.01%) instructions in affected programs: 570 -> 552 (-3.16%) r300: total instructions in shared programs: 1414450 -> 1414459 (<.01%) instructions in affected programs: 44494 -> 44503 (0.02%) total vinst in shared programs: 473782 -> 473727 (-0.01%) vinst in affected programs: 1102 -> 1047 (-4.99%) total sinst in shared programs: 231224 -> 231216 (<.01%) sinst in affected programs: 432 -> 424 (-1.85%) total temps in shared programs: 197605 -> 197607 (<.01%) temps in affected programs: 103 -> 105 (1.94%) crocus hsw: total instructions in shared programs: 8158185 -> 8158134 (<.01%) instructions in affected programs: 10927 -> 10876 (-0.47%) Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15178>	2022-02-25 12:31:48 -08:00
Daniel Schürmann	f030b75b7d	aco: relax condition to remove branches in case of few instructions This patch relaxes the conditions under which we remove branch instructions. Totals from 27246 (20.20% of 134913) affected shaders: (GFX10.3) CodeSize: 193413312 -> 192924928 (-0.25%) Instrs: 36146788 -> 36024692 (-0.34%) Latency: 528374112 -> 528469044 (+0.02%); split: -0.01%, +0.02% InvThroughput: 106198759 -> 106216583 (+0.02%); split: -0.00%, +0.02% Branches: 1040640 -> 918543 (-11.73%) Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8647>	2022-02-25 15:38:08 +00:00
Samuel Pitoiset	53ca85ac2a	radv,drirc: move RADV workarounds to 00-radv-defaults.conf Because we have to maintain two different packages of Mesa, one specific to RADV and another one for RadeonSI and such, it's a bit annoying to have to synchronize the drirc entries. Currently, only our Mesa package installs 00-mesa-defaults.conf which means we have to backport the drirc RADV changes. This splits 00-mesa-defaults.conf in two to move the drirc RADV entries to src/amd/vulkan/00-radv-defaults.conf. Meson will install the file only if RADV is built. There is still a caveat for common drirc workarounds like for WSI but they are rare enough and we could still duplicate them if needed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15152>	2022-02-25 15:05:56 +01:00
Timur Kristóf	1ca6b2f216	aco: Support memory modes properly with load/store_buffer_amd. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15161>	2022-02-25 14:08:39 +01:00
Timur Kristóf	ba4b48e787	aco: Support task_payload with barriers, refactor allowed storage class. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15161>	2022-02-25 14:08:36 +01:00
Timur Kristóf	cd0dd5d6b7	aco: Add storage class for Task Shader payload. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15161>	2022-02-25 13:20:08 +01:00
Timur Kristóf	962b2fe214	spirv: Use task_payload mode for generic task outputs and mesh inputs. This new mode will be only used for the actual payload variables and not the number of launched mesh shader workgroups, which will still be treated as an output. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14930>	2022-02-25 06:52:07 +00:00
Timur Kristóf	f629fbd778	nir: Add new variable mode for task/mesh payload. Task shader outputs work differently than other shaders, so they need special consideration. Essentially, they have two kinds of outputs: 1. Number of mesh shader workgroups to launch. Will be still represented by a shader output. 2. Optional payload of up to (at least) 16K bytes. These payload variables behave similarly to shared memory, but the spec doesn't actually define them as shared memory (also, they may be implemented differently by each backend), so we need to add a new NIR variable mode for them. These payload variables can't be represented by shader outputs because the 16K bytes don't fit the 32x vec4 model that NIR uses for its output variables. This patch adds a new NIR variable mode: nir_var_mem_task_payload and corresponding explicit I/O intrinsics, as well as support for this new mode in nir_lower_io. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14930>	2022-02-25 06:52:07 +00:00
Timur Kristóf	d2d6eca081	radv: Refactor mesh shader draws and add num_workgroups. Several of the new draw packets need this argument including all of the taskmesh commands, so it's best to always declare it. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	bf519a7d47	ac/nir: Refactor mesh shader output code to smaller functions. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	a84789f795	ac/nir: Make sure to exclude special outputs from arrayed output masks. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	3956c03b05	ac/nir: Sanitize mesh shader primitive indices using umin. This makes our implementation friendlier to potentially buggy shaders, meaning that it will less likely to hang the GPU. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	0746b98f4a	ac/nir: Properly handle when mesh API workgroup size is smaller than HW. The problem is that the real workgroup launched on NGG HW can be larger than the size specified by the API, and the extra waves need to keep up with barriers in the API waves. There are 2 different cases: 1. The whole API workgroup fits in a single wave. We can shrink the barriers to subgroup scope and don't need to insert any extra ones. 2. The API workgroup occupies multiple waves, but not all. In this case, we emit code that consumes every barrier on the extra waves. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	d88516a23f	ac/nir: Move LDS area for primitive count to the beginning. This makes it impossible for out of bounds vertex and primitive attribute stores and indices stores to overwrite this. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	9cc9cf77a8	aco: Fix multiview view index for mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	082b691141	aco: Fix workgroup_id.y and .z for NV_mesh_shader. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	10ebfb3bf2	aco: Allow 1-byte loads and stores with load/store_buffer_amd Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Timur Kristóf	1ee3d49e3e	radv: Better exclude special MS outputs from driver location assignment. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15034>	2022-02-25 06:31:33 +00:00
Guilherme Gallo	d1c6185b5a	ci: skqp: Add Vulkan support for a630_skqp job This commit adds support for Vulkan backend on a630_skqp job. = Needed changes - Needed to install libvulkan-dev package on system - Refactored the way the available skqp reports are printed tested in development builds with skia tools Piglit expectations had to be updated in various drivers due to !14750 not having bumped the tags when it tried to uprev. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14686>	2022-02-25 05:50:06 +00:00
Guilherme Gallo	8bfef8bf6b	ci: skqp: Build skqp from android-cts-10.0_r11 tag with Clang The Android CTS 10 version is relative old when compared with skia main branch, which was being used before. Some modifications in the skqp build/runner scripts were needed to make it run on CI. - skqp versions from android-cts have already all assets inside platform_tools folder. - along with the assets, are the render and unit files which are expected to pass in the Android CTS execution. - removed custom test files from the a630 folder, to make it comply with the CTS expectations. - include new patches to remove Python2 dependencies and avoid the installation of it in rootfs. - strip binariesthe built binaries `skqp` and `list_gpu_unit_tests`, as `is_debug = false` gn argument did not work, maybe it is not well tested in development builds with skia tools - use Clang instead of GCC. The GCC support is not so graceful as it is in the skia main branch, some NEON instructions needs to be turned off in the GCC compilation, causing different tests result. This change does not imply a bigger rootfs, since the built skqp binary uses GCC libc++ and other library runtimes. So clang is just a build dependency. = Changes in skqp results = Some errors were found for GL backend and unit tests. GLES and VK tests are green. All the failed tests were classified as expected to fail in the render and unit tests list. ``` gl_blur2rectsnonninepatch gl_bug339297_as_clip gl_bug6083 gl_dashtextcaps ``` ``` SRGBReadWritePixels (../../tests/SRGBReadWritePixelsTest.cpp:214 Could not create sRGB surface context. [OpenGL]) ``` Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14686>	2022-02-25 05:50:06 +00:00
Mike Blumenkrantz	5ffed2a299	features: VK_EXT_depth_clip_control for lavapipe Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15126>	2022-02-25 05:27:27 +00:00
Mike Blumenkrantz	07c0801e60	lavapipe: EXT_depth_clip_control Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15126>	2022-02-25 05:27:27 +00:00
Emma Anholt	f458c7f200	ci/zink: Add testing of dEQP GLES3.1/3.2. I think this has been kind of just an oversight. Increases runtime by a minute, to 5:30. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15159>	2022-02-24 16:16:57 -08:00
Emma Anholt	b4132bd026	ci/zink: Move testing to shared 64-core runners at Google. Now the main deqp and piglit run takes about 4:30 of runner time in a single job. Added a couple of flakes that hit this MR, but which I think predate it (probably due to not having #zink-ci until recently). Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15159>	2022-02-24 16:15:56 -08:00
Erik Faye-Lund	834db3aa8d	docs: remove incorrect drivers from extension This extension isn't wired up in Gallium, so there's just no way a Gallium driver like Panfrost exposes it. While there were support in i965 for this for the cancelled Broxton GPU, thre's no such support in the Iris driver. And since Broxton has been cancelled, it's unlikely to be wired up any time soon. Fixes: `da23a31726` ("docs/features: Update ASTC entries for Panfrost") Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15145>	2022-02-24 23:25:56 +00:00
Alyssa Rosenzweig	988d5aae74	panfrost: Flush resources when shadowing When we shadow a resource, the backing BO is changed; as such, existing references to the resource become invalid. So batches accessing the resource need to be flushed (or otherwise have their references invalidated). The wrong behaviour change (not flushing) was introduced when we started tracking resources instead of BOs. The issue manifested as a severe performance regression in glmark2's -bbuffer test, particular the subdata subtest. The issue is magnified on slow CPUs; without the fix, the test becomes completely CPU bound Relevant glmark2 -bbuffer test from 43fps to 84fps. Apparently, this causes functional issues too -- this performance-minded change also fixes a few piglits. Fixes: `cecb889481` ("panfrost: Do tracking of resources, not BOs") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Chris Healy <cphealy@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13502>	2022-02-24 23:11:20 +00:00
Alyssa Rosenzweig	5536852d60	panfrost: Handle NULL samplers Fixes a NULL dereference in Piglit fp-fragment-position, getting the test to pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13203>	2022-02-24 22:48:30 +00:00
Alyssa Rosenzweig	53ef20f08d	panfrost: Handle NULL sampler views Fixes a NULL dereference in Piglit fp-fragment-position. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13203>	2022-02-24 22:48:30 +00:00
Alyssa Rosenzweig	304851422a	panfrost: Fix set_sampler_views for big GL Roughly use the freedreno logic to handle all the extra things that will come up in our Piglit sooner than later. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13203>	2022-02-24 22:48:30 +00:00
Alyssa Rosenzweig	4b2769493e	panfrost/ci: Update xfails list These tests seem to be passing now. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13203>	2022-02-24 22:48:30 +00:00
Kenneth Graunke	97f18d2929	blorp: Add blorp_measure hooks to the blitter codepaths I had missed these when hooking up the original support. Fixes: `31eeb72e45` ("blorp: Add support for blorp_copy via XY_BLOCK_COPY_BLT") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15157>	2022-02-24 21:42:16 +00:00
Kenneth Graunke	e6b7e74308	iris: Set MI_FLUSH_DW::PostSyncOperation correctly The MI_FLUSH_DW post-sync operation uses the same encoding as the PIPE_CONTROL one so we can use the same helper. Write PS Depth Count is not supported, of course, as the blitter has no depth pipeline. This means that we can write the timestamp register from the blitter. Fixes: `604d97671b` ("iris: Add support for flushing the blitter (hackily)") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15157>	2022-02-24 21:42:16 +00:00
Pavel Ondračka	c393753daa	r300: add predicate instructions to statistics of vertex shaders All of IF, ELSE, ENDIF, BREAK and CONTINUE were already translated to the predication instructions in rc_vert_fc so all the flow control we count at the moment is just BGNLOOP and ENDLOOP. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15077>	2022-02-24 21:31:03 +00:00
Pavel Ondračka	8eb9bffdfc	r300: report number of loops in shader statistics Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15077>	2022-02-24 21:31:03 +00:00
Pavel Ondračka	517b37a08c	r300: use %u specifiers when printing unsigned stats values Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6019 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15077>	2022-02-24 21:31:03 +00:00
Pavel Ondračka	e7978412c3	r300: only print shader statistics when compilation succeeds This allows to disregard the huge shaders that won't run anyway and hopefully make catching shader regressions that result in a compile failure easier. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15077>	2022-02-24 21:31:03 +00:00
Mike Blumenkrantz	b124f83bc2	zink: add a flake channel Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15129>	2022-02-24 20:13:53 +00:00
Alyssa Rosenzweig	cd2a4cc47c	pan/bi: Unit test message preloading optimization To make sure it is applied in the cases we expect it to be, to avoid code generation regressions. Functional regressions are expected to be caught by integration-testing, so that is not focused on here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 14:13:21 -05:00
Alyssa Rosenzweig	eb1479bda2	pan/bi: Support message preloading Preload LD_VAR_IMM or VAR_TEX instructions in the first block of fragment shaders on v7. Preloaded messages write to fixed registers; when replacing instructions we insert moves from the registers at the start of the program and hope coalescing goes to town. (Admittedly we don't do any coalescing yet...) The extra moves hurts instruction count in some cases; the win for cycle count should cancel this out. When we get smarter copy prop or RA, those moves should go away anyway. This optimization may hurt register pressure by extending the lifetime of up to eight registers written in the first block. This is expected to be acceptable: on a large shader-db, there are no additional spills/fills, and only two shaders are hurt on thread count. This optimization only applies to v7, as the hardware was not introduced on v6 and was removed for Valhall. total instructions in shared programs: 2451624 -> 2454286 (0.11%) instructions in affected programs: 909046 -> 911708 (0.29%) helped: 4719 HURT: 3341 helped stats (abs) min: 1.0 max: 10.0 x̄: 1.49 x̃: 1 helped stats (rel) min: 0.08% max: 33.33% x̄: 6.79% x̃: 3.92% HURT stats (abs) min: 1.0 max: 50.0 x̄: 2.90 x̃: 2 HURT stats (rel) min: 0.12% max: 66.67% x̄: 6.39% x̃: 3.45% 95% mean confidence interval for instructions value: 0.27 0.39 95% mean confidence interval for instructions %-change: -1.55% -1.11% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total tuples in shared programs: 1969529 -> 1963429 (-0.31%) tuples in affected programs: 601327 -> 595227 (-1.01%) helped: 5907 HURT: 1297 helped stats (abs) min: 1.0 max: 8.0 x̄: 1.41 x̃: 1 helped stats (rel) min: 0.07% max: 33.33% x̄: 7.25% x̃: 5.26% HURT stats (abs) min: 1.0 max: 40.0 x̄: 1.73 x̃: 1 HURT stats (rel) min: 0.16% max: 31.75% x̄: 3.38% x̃: 2.02% 95% mean confidence interval for tuples value: -0.88 -0.81 95% mean confidence interval for tuples %-change: -5.52% -5.15% Tuples are helped. total clauses in shared programs: 401689 -> 387830 (-3.45%) clauses in affected programs: 136944 -> 123085 (-10.12%) helped: 8427 HURT: 4 helped stats (abs) min: 1.0 max: 4.0 x̄: 1.65 x̃: 2 helped stats (rel) min: 0.49% max: 50.00% x̄: 19.88% x̃: 18.18% HURT stats (abs) min: 1.0 max: 4.0 x̄: 2.50 x̃: 2 HURT stats (rel) min: 1.96% max: 19.05% x̄: 14.18% x̃: 17.86% 95% mean confidence interval for clauses value: -1.66 -1.63 95% mean confidence interval for clauses %-change: -20.15% -19.58% Clauses are helped. total cycles in shared programs: 202735.83 -> 201862.21 (-0.43%) cycles in affected programs: 16295.46 -> 15421.83 (-5.36%) helped: 3349 HURT: 1962 helped stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.32 x̃: 0 helped stats (rel) min: 0.24% max: 100.00% x̄: 40.77% x̃: 33.33% HURT stats (abs) min: 0.041665999999999315 max: 1.5833329999999997 x̄: 0.10 x̃: 0 HURT stats (rel) min: 0.09% max: 31.40% x̄: 2.95% x̃: 1.94% 95% mean confidence interval for cycles value: -0.17 -0.16 95% mean confidence interval for cycles %-change: -25.48% -23.76% Cycles are helped. total arith in shared programs: 74665.50 -> 74920.00 (0.34%) arith in affected programs: 16059.92 -> 16314.42 (1.58%) helped: 860 HURT: 3409 helped stats (abs) min: 0.041665999999999315 max: 0.25 x̄: 0.06 x̃: 0 helped stats (rel) min: 0.24% max: 37.50% x̄: 4.73% x̃: 2.56% HURT stats (abs) min: 0.041665999999999315 max: 1.5833329999999997 x̄: 0.09 x̃: 0 HURT stats (rel) min: 0.09% max: 100.00% x̄: 8.99% x̃: 4.21% 95% mean confidence interval for arith value: 0.06 0.06 95% mean confidence interval for arith %-change: 5.83% 6.62% Arith are HURT. total texture in shared programs: 13083.50 -> 11877 (-9.22%) texture in affected programs: 1663 -> 456.50 (-72.55%) helped: 2377 HURT: 3 helped stats (abs) min: 0.5 max: 1.0 x̄: 0.51 x̃: 0 helped stats (rel) min: 6.25% max: 100.00% x̄: 87.12% x̃: 100.00% HURT stats (abs) min: 0.5 max: 0.5 x̄: 0.50 x̃: 0 HURT stats (rel) min: 0.00% max: 25.00% x̄: 16.67% x̃: 25.00% 95% mean confidence interval for texture value: -0.51 -0.50 95% mean confidence interval for texture %-change: -87.98% -86.00% Texture are helped. total vary in shared programs: 10220.62 -> 4183.88 (-59.06%) vary in affected programs: 10126.50 -> 4089.75 (-59.61%) helped: 8538 HURT: 0 helped stats (abs) min: 0.125 max: 1.0 x̄: 0.71 x̃: 0 helped stats (rel) min: 7.14% max: 100.00% x̄: 74.74% x̃: 87.50% 95% mean confidence interval for vary value: -0.71 -0.70 95% mean confidence interval for vary %-change: -75.32% -74.16% Vary are helped. total quadwords in shared programs: 1766717 -> 1757161 (-0.54%) quadwords in affected programs: 553801 -> 544245 (-1.73%) helped: 6760 HURT: 711 helped stats (abs) min: 1.0 max: 11.0 x̄: 1.58 x̃: 1 helped stats (rel) min: 0.09% max: 29.41% x̄: 5.31% x̃: 4.84% HURT stats (abs) min: 1.0 max: 33.0 x̄: 1.54 x̃: 1 HURT stats (rel) min: 0.10% max: 31.13% x̄: 2.53% x̃: 1.61% 95% mean confidence interval for quadwords value: -1.31 -1.25 95% mean confidence interval for quadwords %-change: -4.67% -4.46% Quadwords are helped. total threads in shared programs: 52899 -> 52897 (<.01%) threads in affected programs: 4 -> 2 (-50.00%) helped: 0 HURT: 2 total preloads in shared programs: 0 -> 116492 preloads in affected programs: 0 -> 116492 helped: 0 HURT: 8604 HURT stats (abs) min: 2.0 max: 24.0 x̄: 13.54 x̃: 14 HURT stats (rel) min: 0.00% max: 0.00% x̄: 0.00% x̃: 0.00% 95% mean confidence interval for preloads value: 13.45 13.63 95% mean confidence interval for preloads %-change: 0.00% 0.00% Preloads are HURT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 14:09:14 -05:00
Alyssa Rosenzweig	c8437cd415	pan/bi: Account for message preloading in shaderdb If a message-passing instruction like LD_VAR is preloaded, it will no longer be counted in the shader cycle counts. Add a special message preload counter that approximates the cost of preloading, so this information doesn't get a lost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	19541dc8c8	pan/bi: Add bi_before_nonempty_block helper To be used in the message preloading pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	6618697e0e	panfrost: Pack message preloads from compiler Include full message preload descriptors in the RSD on v7, and do the obvious packing for fragment shader message preloads. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	bd06a26662	panfrost: Add an unpacked message preload struct The compiler will soon produce preloaded messages, but it should not pack them itself, as this would require depending on GenXML or handcoding bitfields / bit packs in the compiler. Instead, add a struct encoding the unpacked form of the message, used as ABI between the compiler and the common driver. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:51:04 -05:00
Alyssa Rosenzweig	2d0c4973dc	panfrost: Remove Message Preload Descriptor from v6.xml It is an anachronism, as this descriptor was added in v7 and, seemingly, removed immediately after. Good work. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9438>	2022-02-24 12:50:58 -05:00
Igor Torrente	b130f8f4cf	venus: add macros to help with future extensions Currently we have to add almost the same code to the `vn_physical_device_init_{features, properties}` to add the extension to the `physical_dev->{features, properties}` list. These macros improves the code reusage. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15059>	2022-02-24 15:55:57 +00:00
Alyssa Rosenzweig	43bbe367ea	panfrost/ci: Move T860 flake to skip Actually an xfail but occassionally passes and gives us no new information, only noise. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-and-acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15154>	2022-02-24 14:51:31 +00:00
Alyssa Rosenzweig	5c07f7c427	panfrost/ci: Move T720 flakes to skips Doesn't seem like these will be resolved anytime soon.. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-and-acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15154>	2022-02-24 14:51:31 +00:00
Tomeu Vizoso	eecc62ccbd	Revert "ci: Disable jobs to the Collabora lab" This reverts commit `f692bda484`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15153>	2022-02-24 13:44:09 +01:00
Iago Toral Quiroga	cf99584f51	broadcom/compiler: move uniforms right before their first use after scheduling On V3D the quality of the code we generate is significantly affected by how we decide to assign accumulators during register allocation, which is determined by liveness, favoring short-lived temps. There are many shaders that end up doing a whole lot of uniform loads first, and using them later, which is very inconvenient for our register allocation process because this increases uniform liveness and causes us to use accumulators less efficientely, leading to significant churn. To fix this, we move uniforms right before their first use in the same block, but we need to do this after NIR scheduling, which means we are doing it in non-SSA form, since the scheduler has a tendency to undo this optimization and it is not easy to modify it to avoid it, since it works in more abstract terms, using instruction dependencies, estimated register pressure and instruction delay information to do its work, which are very different concepts. total instructions in shared programs: 13316738 -> 13033613 (-2.13%) instructions in affected programs: 10389172 -> 10106047 (-2.73%) helped: 55442 HURT: 16144 total threads in shared programs: 413722 -> 415048 (0.32%) threads in affected programs: 1428 -> 2754 (92.86%) helped: 680 HURT: 17 total loops in shared programs: 1716 -> 1690 (-1.52%) loops in affected programs: 26 -> 0 helped: 26 HURT: 0 total uniforms in shared programs: 3704313 -> 3705181 (0.02%) uniforms in affected programs: 687730 -> 688598 (0.13%) helped: 2920 HURT: 7384 total max-temps in shared programs: 2364785 -> 2175190 (-8.02%) max-temps in affected programs: 1215387 -> 1025792 (-15.60%) helped: 49667 HURT: 1556 total spills in shared programs: 4241 -> 4248 (0.17%) spills in affected programs: 642 -> 649 (1.09%) helped: 11 HURT: 19 total fills in shared programs: 6115 -> 6125 (0.16%) fills in affected programs: 1276 -> 1286 (0.78%) helped: 11 HURT: 21 total sfu-stalls in shared programs: 34381 -> 36578 (6.39%) sfu-stalls in affected programs: 16055 -> 18252 (13.68%) helped: 3647 HURT: 5206 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15056>	2022-02-24 11:36:00 +00:00
Iago Toral Quiroga	f1d20ec67c	nir/nir_opt_move: handle non-SSA defs We just skip register defs and avoid moving register reads across them. This allows us to run this pass in non-SSA form. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15056>	2022-02-24 11:36:00 +00:00
Iago Toral Quiroga	fe2249eac5	nir: add a nir_instr_def_is_register helper This returns true if the instruction has a dest that is not an SSA value. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15056>	2022-02-24 11:36:00 +00:00
Iago Toral Quiroga	0a04468704	nir/nir_opt_move: allow to move uniform loads Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15056>	2022-02-24 11:36:00 +00:00
Tomeu Vizoso	f692bda484	ci: Disable jobs to the Collabora lab In anticipation of infrastructure work. This commit is to be reverted later in the day. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15150>	2022-02-24 07:34:14 +01:00
Tomeu Vizoso	c0695bb473	ci: Allow disabling the whole of the Collabora farm Add a global-level variable that allows disabling all jobs that would have gone to the Collabora lab, to be used in case of outages. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15150>	2022-02-24 07:33:45 +01:00
Emma Anholt	a5fa7e04d7	ci/lvp: Update the asan fails list. Many tests had been fixed but weren't being run due to test reshuffles from uprevs. Add some explanations for what remains. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15133>	2022-02-24 02:09:02 +00:00
Alyssa Rosenzweig	6b2eda6b72	pan/bi: Reorder pushed uniforms to avoid moves On Bifrost and Valhall, push uniforms are loaded into Fast Access Uniform Random Access Memory (FAU-RAM). FAU-RAM is organized as an array of 64-bit slots. A given tuple (Bifrost) or instruction (Valhall) may access at most a single 64-bit slot. If an instruction requires uniforms from multiple 64-bit slots, a uniform-to-register move must be inserted to avoid the hazard. However, if an instruction requires a pair of 32-bit uniforms from the same 64-bit slot, no move is required. To reduce the number of moves we emit, this commit adds an optimization pass that reorders pushed uniforms, trying to group uniforms used by the same instruction. The pass works by creating a graph of pushed uniforms, where edges denote the "both 32-bit uniforms required by the same instruction" relationship. We perform depth-first search on this graph to find the connected components, where each connected component is a cluster of uniforms that are used together. We then select pairs of uniforms from each connected component. The remaining unpaired uniforms (from components of odd sizes) are paired together arbitrarily. In principle, we should weight the graph by number of occurences and choose pairs that maximize the total selected edge weight. This is left for future work, as it is nontrivial -- selecting these edges optimally appears to be NP-hard at first blush. Implementation note: As position and varying shaders share FAU on Bifrost, extra care is taken with a `push_offset` shader stage info parameter that ensures varying shaders do not reorder uniforms selected by the previous position shader. total instructions in shared programs: 2503343 -> 2451758 (-2.06%) instructions in affected programs: 1553309 -> 1501724 (-3.32%) helped: 14256 HURT: 8 helped stats (abs) min: 1.0 max: 80.0 x̄: 3.62 x̃: 3 helped stats (rel) min: 0.06% max: 36.36% x̄: 7.31% x̃: 6.67% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.38 x̃: 1 HURT stats (rel) min: 1.30% max: 12.50% x̄: 4.99% x̃: 3.85% 95% mean confidence interval for instructions value: -3.66 -3.58 95% mean confidence interval for instructions %-change: -7.41% -7.20% Instructions are helped. total tuples in shared programs: 2008399 -> 1969627 (-1.93%) tuples in affected programs: 1146344 -> 1107572 (-3.38%) helped: 12867 HURT: 147 helped stats (abs) min: 1.0 max: 61.0 x̄: 3.03 x̃: 2 helped stats (rel) min: 0.17% max: 42.86% x̄: 6.79% x̃: 4.65% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.20 x̃: 1 HURT stats (rel) min: 0.29% max: 20.00% x̄: 2.12% x̃: 1.19% 95% mean confidence interval for tuples value: -3.03 -2.93 95% mean confidence interval for tuples %-change: -6.82% -6.57% Tuples are helped. total clauses in shared programs: 408005 -> 401708 (-1.54%) clauses in affected programs: 90760 -> 84463 (-6.94%) helped: 6006 HURT: 164 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.08 x̃: 1 helped stats (rel) min: 0.45% max: 33.33% x̄: 12.44% x̃: 14.29% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 1.64% max: 25.00% x̄: 9.81% x̃: 5.26% 95% mean confidence interval for clauses value: -1.03 -1.01 95% mean confidence interval for clauses %-change: -12.03% -11.66% Clauses are helped. total cycles in shared programs: 203308.37 -> 202737.83 (-0.28%) cycles in affected programs: 19264.71 -> 18694.17 (-2.96%) helped: 3024 HURT: 41 helped stats (abs) min: 0.041665999999999315 max: 2.5416680000000014 x̄: 0.19 x̃: 0 helped stats (rel) min: 0.17% max: 33.33% x̄: 3.83% x̃: 2.83% HURT stats (abs) min: 0.041665999999999315 max: 0.125 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.30% max: 5.88% x̄: 1.41% x̃: 0.93% 95% mean confidence interval for cycles value: -0.19 -0.18 95% mean confidence interval for cycles %-change: -3.89% -3.64% Cycles are helped. total arith in shared programs: 76265.67 -> 74669.25 (-2.09%) arith in affected programs: 45001.50 -> 43405.08 (-3.55%) helped: 12945 HURT: 97 helped stats (abs) min: 0.041665999999999315 max: 2.5416680000000014 x̄: 0.12 x̃: 0 helped stats (rel) min: 0.17% max: 50.00% x̄: 8.06% x̃: 4.88% HURT stats (abs) min: 0.041665999999999315 max: 0.125 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.21% max: 33.33% x̄: 2.16% x̃: 0.96% 95% mean confidence interval for arith value: -0.12 -0.12 95% mean confidence interval for arith %-change: -8.16% -7.81% Arith are helped. total quadwords in shared programs: 1796563 -> 1766803 (-1.66%) quadwords in affected programs: 948830 -> 919070 (-3.14%) helped: 12078 HURT: 219 helped stats (abs) min: 1.0 max: 42.0 x̄: 2.49 x̃: 2 helped stats (rel) min: 0.10% max: 33.33% x̄: 5.57% x̃: 5.26% HURT stats (abs) min: 1.0 max: 4.0 x̄: 1.21 x̃: 1 HURT stats (rel) min: 0.33% max: 6.67% x̄: 2.00% x̃: 1.14% 95% mean confidence interval for quadwords value: -2.46 -2.38 95% mean confidence interval for quadwords %-change: -5.52% -5.36% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14163>	2022-02-24 01:35:33 +00:00
Timothy Arceri	6eec8fcbfa	glsl/nir: free GLSL IR right after we convert to NIR Gives us memory back faster which is useful for pathalogical CTS tests. The GLSL IR was previously used after converting to NIR for things like building the GL resource list but we have had a NIR version for this for some time and I don't believe there are any other use cases left for keeping the old IR hanging around this long. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15127>	2022-02-24 01:10:49 +00:00
Emma Anholt	0fda2ac4f0	ci/virgl: Drop the bvec4_from_mat4x2_vs xfail. The fix has landed in VK-GL-CTS 1.3.1.0, we were just not noticing it because this is also in the flakes list. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14962>	2022-02-23 23:09:20 +00:00
Emma Anholt	9e710af830	ci/softpipe: Move most of testing to shared 64-core runners at Google. The single job takes about 3:30 of runner time. I don't have a good explanation for the crash->fail test changes. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14962>	2022-02-23 23:09:20 +00:00
Emma Anholt	73b37f9ff0	ci/lavapipe: Test 1/3 of lavapipe on the shared 64-core google runners. Now we can get through 1/3 of the testsuite in about 3:30, while previously we did 1/10th. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14962>	2022-02-23 23:09:20 +00:00
Emma Anholt	0f64f4bdb5	ci/llvmpipe: Move most of testing to shared 64-core runners at Google. These runners are configured to have a single job take up the whole runner, which means we get to use threads to our hearts content. The pile of cores means we don't need to spawn separate jobs to try to load-balance across fdo's shared runner capacity. Having dedicated runners means we won't get our MRs blocked as much waiting on non-Mesa testing happening on fd.o. We manage to complete all of this llvmpipe testing in about 6:15. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14962>	2022-02-23 23:09:20 +00:00
Emma Anholt	6859b614a2	ci: Stash the ldd and ccache stats output under collapsed sections. You rarely need to look at these, they're just nice to have sometimes. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14962>	2022-02-23 23:09:20 +00:00
Samuel Pitoiset	a2c1fa9137	radv: initialize extra state for internal pipelines at one place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14650>	2022-02-23 22:29:55 +00:00
Samuel Pitoiset	959e8586aa	radv: remove useless radv_blend_state::single_cb_enable field This was only used for meta operations. DCC/FMASK/FCE pipelines only declare one color attachment and the color writemask of the second color attachment is 0 for the HW CB resolve. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14650>	2022-02-23 22:29:55 +00:00
Samuel Pitoiset	8347d3dfd7	radv: initialize VGT_GS_OUT_PRIM_TYPE earlier Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14650>	2022-02-23 22:29:55 +00:00
Samuel Pitoiset	9fb0831ca1	radv: initialize more depth/stencil states earlier Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14650>	2022-02-23 22:29:55 +00:00
Dmitry Baryshkov	b4bef890ee	freedreno/regs: remove 5nm DSI PHY regs 5nm PHY is a variation of 7nm PHY, they use the same register definitions. To remove duplication, drop 5nm defs. Cc: Robert Foss <robert.foss@linaro.org> Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15051>	2022-02-23 21:25:22 +00:00
Eric Engestrom	c9e6d3ba73	docs: update calendar and link releases notes for 21.3.7 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15149>	2022-02-23 21:20:34 +00:00
Eric Engestrom	9bb16991b8	docs: add release notes for 21.3.7 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15149>	2022-02-23 21:20:34 +00:00
Dave Airlie	b77ef4dd60	draw/so: don't use pre clip pos if we have a tes either. This check for geom shader needed to be expanded for tess support. dEQP-VK.transform_feedback.simple.depth_clip_control_tese with lvp Fixes: `dacf8f5f5c` ("draw: hook up final bits of tessellation") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15128>	2022-02-23 20:56:42 +00:00
Alyssa Rosenzweig	31b7ebcbc7	pan/mdg: Fix overflow in intra-bundle interference There are up to 4 instructions in the latter stage (if a branch is included), not 3. Bump the limit to fix memory corruption. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15147>	2022-02-23 20:42:33 +00:00
Jordan Justen	0fffaa9fca	anv: Align state pools to 2MiB on XeHP Suggested-by: Jason Ekstrand <jason.ekstrand@collabora.com> Fixes: `c17e2216dd` ("anv: Align buffer VMA to 2MiB for XeHP") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15054>	2022-02-23 20:15:24 +00:00
Jordan Justen	5a28d2482f	anv: Align GENERAL_STATE_POOL_MIN_ADDRESS to 2MiB Fixes: `c17e2216dd` ("anv: Align buffer VMA to 2MiB for XeHP") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15054>	2022-02-23 20:15:24 +00:00
Alyssa Rosenzweig	d986731da9	iris,crocus,i915g: Don't stub flush_frontbuffer This callback is only intended for software rasterizers, layered drivers, and other special drivers that go through the software winsys path. Remove the unimplemented stubs from the Intel drivers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Dave Airlie <airlied@redhat.com> [crocus] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15118>	2022-02-23 19:49:54 +00:00
Alyssa Rosenzweig	51689a2b80	panfrost: Simplify panfrost_resource_get_handle Unify the exit paths to clean up the logic. There are logically three modes we support (KMS without renderonly, KMS with renderonly, and FD); these each correspond to a leg of a small if statement. Outside of the small if's, everything else should be identical. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Simon Ser <contact@emersion.fr> Reviewed-by: James Jones <jajones@nvidia.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15120>	2022-02-23 18:31:55 +00:00
Alyssa Rosenzweig	b5734cc1c4	panfrost: Fix FD resource_get_handle When handle->type is WINSYS_HANDLE_TYPE_FD, the caller wants a file descriptor for the BO backing the resource. We previously had two paths for this: 1. If rsrc->scanout is available, we prime the GEM handle from the KMS device (rsrc->scanout->handle) to a file descriptor via the KMS device. 2. If rsrc->scanout is not available, we prime the GEM handle from the GPU (bo->gem_handle) to a file descriptor via the GPU device. In both cases, the caller passes in a resource (with BO) and expects out a file descriptor. There are no direct GEM handles in the function signature; the caller doesn't care which GEM handle we prime to get the file descriptor. In principle, both paths produce the same file descriptor for the same BO, since both GEM handles represent the same underlying resource (viewed from different devices). On grounds of redundancy alone, it makes sense to remove the rsrc->scanout path. Why have a path that only works sometimes, when we have another path that works always? In fact, the issues with the rsrc->scanout path are deeper. rsrc->scanout is populated by renderonly_create_gpu_import_for_resource, which does the following: 1. Get a file descriptor for the resource by resource_get_handle with WINSYS_HANDLE_TYPE_FD 2. Prime the file descriptor to a GEM handle via the KMS device. Here comes strike number 2: in order to get a file descriptor via the KMS device, we had to /already/ get a file descriptor via the GPU device. If we go down the KMS device path, we effectively round trip: GPU handle -> fd -> KMS handle -> fd There is no good reason to do this; if everything works, the fd is the same in each case. If everything works. If. The lifetimes of the GPU handle and the KMS handle are not necessarily bound. In principle, a resource can be created with scanout (constructing a KMS handle). Then the KMS view can be destroyed (invalidating the GEM handle for the KMS device), even though the underlying resource is still valid. Notice the GPU handle is still valid; its lifetime is tied to the resource itself. Then a caller can ask for the FD for the resource; as the resource is still valid, this is sensible. Under the scanout path, we try to get the FD by priming the GEM handle on the KMS device... but that GEM handle is no longer valid, causing the PRIME ioctl to fail with ENOENT. On the other hand, if we primed the GPU GEM handle, everything works as expected. These edge cases are not theoretical; recent versions of Xwayland trigger this ENOENT, causing issue #5758 on all Panfrost devices. As far as I can tell, no other kmsro driver has this 'special' kmsro path; the only part of resource_get_handle that needs special handling for kmsro is getting a KMS handle. Let's remove the broken, useless path, fix Xwayland, bring us in line with other drivers, and delete some code. Thank you for coming to my ted talk. Closes: #5758 Fixes: `7da251fc72` ("panfrost: Check in sources for command stream") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-and-tested-by: Jan Palus <jpalus@fastmail.com> Reviewed-by: Simon Ser <contact@emersion.fr> Reviewed-by: James Jones <jajones@nvidia.com> Acked-by: Daniel Stone <daniels@collabora.com> Tested-by: Dan Johansen <strit@manjaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15120>	2022-02-23 18:31:55 +00:00
Dmitry Baryshkov	22efeec399	freedreno/registers: add new register for 7nm DSI PHY v4.3 (sm8450) Signed-off-by: Dmitry Baryshkov <dmitry.baryshkov@linaro.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15052>	2022-02-23 17:28:17 +00:00
Alyssa Rosenzweig	04b80489d5	ci: Disable windows-vs2019 Currently down. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15148>	2022-02-23 15:12:41 +00:00
Rhys Perry	ded9cb904f	anv: Enable nir_opt_access This commit will enable pass for searching readonly / writeonly access when it's missing. We don't support shaderStorageImageReadWithoutFormat and the optimization pass causes those shaders to take the write-only path which does support formatless. Following games are affected with positive result: - Wolfenstein: Youngblood - Wolfenstein II: The New Colossus https://gitlab.freedesktop.org/mesa/mesa/-/issues/3138 - Rage 2 https://gitlab.freedesktop.org/mesa/mesa/-/issues/5791 - The Surge 2 https://gitlab.freedesktop.org/mesa/mesa/-/issues/5805 - Metro Exodus https://gitlab.freedesktop.org/mesa/mesa/-/issues/4703 - DOOM Eternal https://gitlab.freedesktop.org/mesa/mesa/-/issues/4273 Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3138,https://gitlab.freedesktop.org/mesa/mesa/-/issues/5791,https://gitlab.freedesktop.org/mesa/mesa/-/issues/4273 Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15082>	2022-02-23 13:11:12 +00:00
Alyssa Rosenzweig	abb7f04674	panfrost: Inline pan_emit_sfbd_tiler Easier to read, the common code was already common. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	910d4f8245	panfrost: Remove pan_emit_fbd thunking Use a common interface. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	8dc7757754	panfrost: Remove unrelated comment Not sure what this was supposed to describe, but it's not the code here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	099d61c95d	panfrost: Use txl instead of tex in the blitter We always blit from a particular level, so it's a waste to compute the LOD. This corresponds to a simple texture instruction with implement 0 LOD, which is the optimal texturing path on Bifrost -- it maps to TEXS_2D but does not require helper invocations. Functional change on Bifrost: Blit shaders no longer set .computed_lod or shader_contains_barrier. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	5b1a00c565	panfrost: Inline pan_blit_emit_dcd Easier to follow the logic without having a million arguments passed around. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	c9784c9512	panfrost: Decouple tiler job and DCD emit We can share the "emit quad" logic, even though the DCDs differ. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	a13d87c484	panfrost: Annotate slow clears as such We should realistically be using the clear shaders from PanVK once they're moved to common. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	1eb3dbafdb	panfrost: Set defaults for deprecated DCD fields There are always set to true. Don't pollute the driver code with them, make their existence a local detail to pre-Valhall XML and that's it. Functional change: "four components per vertex" is now set on vertex job DCDs. This should be a no-op. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	bd3d7e33b6	panfrost: Use pan_shader_prepare_rsd in blitter This reduces code duplication and will ease Valhall porting. Functional changes on v7: * Shader contains barrier is now set (perf loss, fixed later in series) * Shader register allocation is now set (perf win) * Point sprite inverted, no-op for blit shaders Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Alyssa Rosenzweig	6fc81f163e	pan/mdg: Fix partial execution mode names cont -> skip, last -> kill, and fix the special case handling. It's just an enum. Makes the disassembly easier to read and closer to Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15123>	2022-02-23 12:56:30 +00:00
Danylo Piliaiev	7e703e4428	turnip: Always use GMEM for feedback loops in autotuner For ordinary feedback loops GMEM is a lot faster than sysmem since we don't set SINGLE_PRIM mode. For feedback loops with ordered rasterization GMEM should also be faster. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	ebc23ac963	turnip: Implement VK_ARM_rasterization_order_attachment_access Trivially implemented by using A6XX_GRAS_SC_CNTL_SINGLE_PRIM_MODE. This extension is useful for emulators e.g. AetherSX2 PS2 emulator and could drastically improve performance when blending is emulated. Relevant tests: dEQP-VK.rasterization.rasterization_order_attachment_access.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	d6c89e1e4a	turnip: Merge LRZ and DEPTH_PLANE draw states They were emitted at the same time. Frees 1 draw state for us to use. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Danylo Piliaiev	dab34bd5c8	turnip: Use LATE_Z when there might be depth/stencil feedback loop Otherwise a shader invocation would read the value which should have been set AFTER this shader invocation. Fixes tests: dEQP-VK.rasterization.rasterization_order_attachment_access.depth.samples_1.multi_draw_barriers dEQP-VK.rasterization.rasterization_order_attachment_access.stencil.samples_1.multi_draw_barriers Fixes: `71595a189a` ("tu: Fix feedback loops in sysmem mode") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15106>	2022-02-23 11:31:59 +00:00
Paulo Zanoni	d10fd5b7c9	iris: fix register spilling on compute shaders on XeHP XeHP scratch space is handled differently. Commit `ae18e1e707` implemented support for it, but handled it differently between render and compute shaders: it calculates scratch_addr differently and doesn't pin the buffer on compute. Make it work on compute shaders by calling pin_scratch_space() from iris_compute_walker(), which fixes both the address and the pinning. This commit can be verified by the two-year-old-but-still-unreviewed Piglit MR 234. You can also verify this by running a very simple compute shader with INTEL_DEBUG=spill_fs. References: https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/234 Fixes: `ae18e1e707` ("iris: Add support for scratch on XeHP") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15070>	2022-02-22 22:16:57 +00:00
Kenneth Graunke	c46d3acf0e	anv: Raise vertex input bindings and attributes limits slightly This raises our vertex input bindings limit from 28 to 31, and our vertex input attribute limit from 28 to 29. We could theoretically go higher, but it will take additional work. The 3DSTATE_VERTEX_BUFFERS and 3DSTATE_VERTEX_ELEMENTS limits are 33 vertex buffers, and 34 vertex elements. But we need up to two vertex elements for system values (FirstVertex, BaseVertex, BaseInstance, DrawID), and we currently use two vertex bindings for those. There is another hidden limit: our compiler backend only supports the push model for VS inputs currently. 3DSTATE_VS only allows URB Read Lengths between [0, 15], which is measured in pairs of inputs, which means we can theoretically push no more than 32 vertex elements. This is no artifical limit either, as a vec4 element takes up 4 registers in the payload, and 32 * 4 = 128, the entire size of our register file. Plus, the VS Thread payload needs at least g0 and g1 for other things, so we can really only push 31. We can theoretically support one additional binding, by combining our two SGV bindings into a single upload. In order to support additional vertex elements, we would need to add support to the backend compiler for the pull model for VS inputs. References: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5917 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14991>	2022-02-22 21:31:06 +00:00
Mike Blumenkrantz	dabba7d726	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15067>	2022-02-22 21:16:55 +00:00
Mike Blumenkrantz	3029000389	zink: remove zink_descriptor_util_init_null_set() no longer used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15067>	2022-02-22 21:16:55 +00:00
Mike Blumenkrantz	7266182be0	zink: allow null descriptor set layouts I got confused while writing this somehow because of the null descriptor feature, which enables drivers to consume a null descriptor, which has no relation to a descriptor layout containing no descriptors failing to accurately use zero descriptors can put layouts over the maximum per-stage limits, which causes tests to crash fixes (lavapipe): KHR-GL46.shading_language_420pack.binding_uniform_block_array KHR-GL46.multi_bind.dispatch_bind_buffers_base Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15067>	2022-02-22 21:16:55 +00:00
Timur Kristóf	3759a16d8a	ac/nir/ngg: Fix mixed up primitive ID after culling. When NGG culling is enabled, make sure that the correct primitive ID is exported by each lane. Fixes: `e97f0463a8` "ac/nir: Implement NGG deferred attribute culling in NIR." Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6050 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15055>	2022-02-22 18:15:24 +00:00
Mike Blumenkrantz	c063d8ff64	zink: prune ci lists I don't know why I thought running GL3.2 and GL4.6 was a good idea, but it wasn't Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15065>	2022-02-22 18:02:00 +00:00
Emma Anholt	59bc17d57a	turnip: Request no implicit sync when we have no implicit-sync WSI BOs. I chose to implement this as a global flag in the device, because otherwise we would end up with extra draw overhead trying to avoid it in the implicit-sync WSI case, and you're probably going to end up needing implicit sync anyway because you used one of the BOs in any of the submitted cmdbufs. To do better than this, we would probably want a skip-implicit-sync flag on the BOs in the BO list, rather than global on the submit. Reports about venus on turnip say that this flag reduces worst-case QueueSubmit time in a game workload from ~10ms to ~4ms. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14838>	2022-02-22 17:36:05 +00:00
Samuel Pitoiset	83ee08f6d1	radv: fix build on BSD Just disable inotify for BDS systems. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6060 Fixes: `c50557d961` ("radv: allow applications to dynamically change RADV_FORCE_VRS") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15105>	2022-02-22 17:16:21 +00:00
Alyssa Rosenzweig	2e86767370	pan/bi: Add BIFROST_MESA_DEBUG=nosb option To disable the new scoreboarding optimizations when debugging. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	c81c022e66	pan/bi: Implement basic scoreboarding pass Extend our existing bi_scoreboard infrastructure with a simple data flow analysis pass that calculates which dependency slots need waiting. We still lack a heuristic for selecting dependency slots. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	8f25d88d90	pan/bi: Print scoreboarding state Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	6ad9a7f650	pan/bi: Add scoreboard state to IR To a limited degree, scoreboarding must be global, so add the data structures for tracking this to the IR. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	91c02893d8	pan/bi: Clean up nits in liveness analysis Fix minor silly things. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	734a8bdc5d	pan/bi: Use bi_exit_block The "generic" one is a vestige of Midgard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	75406a561f	pan/bi: Add bi_{start, exit}_block helpers Useful for data flow analysis. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	e5423bb129	pan/bi: Do not cull post-RA staging writes Bifrost post-RA dead code elimination can cull the destinations of regular ALU instructions, by weakening from a register write to a temporary write. However, there is no way to suppress staging writes, so culling the destinations will result in invalid code generation. Fixes a regression in dEQP-GLES3.functional.shaders.switch.switch_in_for_loop_static_vertex with scoreboarding. The root cause there is the backend dead code elimination not being sufficiently aggressive in the presence of control flow. Usually this does not matter, since the backend optimizations are intended to be local with global optimizations happening in NIR. Unfortunately, our implementation of IDVS hits this hard. That will need to be optimized (probably by specializing IDVS shaders in NIR instead of the backend). In the mean time, let's fix the actual bug affecting scoreboarding. No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	87d46f40c8	pan/bi: Cull DTSEL_IMM dests in post-RA DCE They are useless (given the semantics of DTSEL_IMM) and complicate scoreboarding. Just remove them in the pass that removes all the other silly register destinations. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Alyssa Rosenzweig	956b969616	pan/bi: Clarify requirement for barriers Barriers need to wait on all outstanding messages. This is more of an API requirement than a hardware requirement, but it's still an invariant the scoreboarding pass must respect. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14298>	2022-02-22 16:57:30 +00:00
Erik Faye-Lund	9a14ddc22d	docs: add license to the redirects script I always intended this to be covered by the MIT license like with the rest of my contributions, but somehow forgot to add it. Let's add that license to make things clear. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14751>	2022-02-22 16:15:47 +00:00
Adam Jackson	2eb644e470	mesa: Enable GL_NV_pack_subimage This just legalizes a few of the pixelstore pack parameters in GLES2 that are already legal in desktop and GLES3. glamor takes advantage of this in the GetImage and software-fallback paths. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14977>	2022-02-22 10:45:28 -05:00
Alyssa Rosenzweig	606ac8d61e	pan/bi: Enable nir_opt_shrink_vectors total instructions in shared programs: 1939513 -> 1935815 (-0.19%) instructions in affected programs: 809066 -> 805368 (-0.46%) helped: 3195 HURT: 865 helped stats (abs) min: 1.0 max: 15.0 x̄: 1.99 x̃: 1 helped stats (rel) min: 0.10% max: 25.00% x̄: 2.26% x̃: 1.28% HURT stats (abs) min: 1.0 max: 22.0 x̄: 3.09 x̃: 2 HURT stats (rel) min: 0.10% max: 83.33% x̄: 2.67% x̃: 1.39% 95% mean confidence interval for instructions value: -1.00 -0.82 95% mean confidence interval for instructions %-change: -1.34% -1.08% Instructions are helped. total tuples in shared programs: 1523194 -> 1521789 (-0.09%) tuples in affected programs: 745526 -> 744121 (-0.19%) helped: 2947 HURT: 1844 helped stats (abs) min: 1.0 max: 18.0 x̄: 2.06 x̃: 1 helped stats (rel) min: 0.15% max: 25.00% x̄: 2.65% x̃: 1.59% HURT stats (abs) min: 1.0 max: 29.0 x̄: 2.54 x̃: 1 HURT stats (rel) min: 0.09% max: 40.00% x̄: 2.32% x̃: 1.52% 95% mean confidence interval for tuples value: -0.39 -0.20 95% mean confidence interval for tuples %-change: -0.85% -0.62% Tuples are helped. total clauses in shared programs: 329158 -> 325350 (-1.16%) clauses in affected programs: 111654 -> 107846 (-3.41%) helped: 2787 HURT: 498 helped stats (abs) min: 1.0 max: 17.0 x̄: 1.57 x̃: 1 helped stats (rel) min: 0.76% max: 40.00% x̄: 6.92% x̃: 5.26% HURT stats (abs) min: 1.0 max: 3.0 x̄: 1.14 x̃: 1 HURT stats (rel) min: 0.87% max: 50.00% x̄: 4.73% x̃: 3.77% 95% mean confidence interval for clauses value: -1.21 -1.10 95% mean confidence interval for clauses %-change: -5.39% -4.93% Clauses are helped. total cycles in shared programs: 172084.50 -> 166827.62 (-3.05%) cycles in affected programs: 74698.83 -> 69441.96 (-7.04%) helped: 3706 HURT: 568 helped stats (abs) min: 0.041665999999999315 max: 19.0 x̄: 1.44 x̃: 1 helped stats (rel) min: 0.24% max: 75.00% x̄: 9.48% x̃: 6.90% HURT stats (abs) min: 0.041665999999999315 max: 1.0 x̄: 0.15 x̃: 0 HURT stats (rel) min: 0.25% max: 50.00% x̄: 2.21% x̃: 1.42% 95% mean confidence interval for cycles value: -1.28 -1.18 95% mean confidence interval for cycles %-change: -8.18% -7.67% Cycles are helped. total arith in shared programs: 57145.04 -> 57211.37 (0.12%) arith in affected programs: 27595.12 -> 27661.46 (0.24%) helped: 1933 HURT: 2259 helped stats (abs) min: 0.041665999999999315 max: 0.75 x̄: 0.09 x̃: 0 helped stats (rel) min: 0.16% max: 33.33% x̄: 2.74% x̃: 1.52% HURT stats (abs) min: 0.04166399999999726 max: 1.3333329999999997 x̄: 0.11 x̃: 0 HURT stats (rel) min: 0.10% max: 100.00% x̄: 2.79% x̃: 1.62% 95% mean confidence interval for arith value: 0.01 0.02 95% mean confidence interval for arith %-change: 0.07% 0.40% Arith are HURT. total texture in shared programs: 12857 -> 12857 (0.00%) texture in affected programs: 0 -> 0 helped: 0 HURT: 0 total vary in shared programs: 11157.75 -> 10222 (-8.39%) vary in affected programs: 5643 -> 4707.25 (-16.58%) helped: 3196 HURT: 0 helped stats (abs) min: 0.125 max: 1.875 x̄: 0.29 x̃: 0 helped stats (rel) min: 2.78% max: 75.00% x̄: 18.49% x̃: 15.00% 95% mean confidence interval for vary value: -0.30 -0.29 95% mean confidence interval for vary %-change: -18.88% -18.11% Vary are helped. total ldst in shared programs: 146420 -> 140270 (-4.20%) ldst in affected programs: 66027 -> 59877 (-9.31%) helped: 2942 HURT: 10 helped stats (abs) min: 1.0 max: 19.0 x̄: 2.09 x̃: 2 helped stats (rel) min: 0.90% max: 100.00% x̄: 16.81% x̃: 8.33% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 2.22% max: 50.00% x̄: 13.03% x̃: 3.33% 95% mean confidence interval for ldst value: -2.15 -2.02 95% mean confidence interval for ldst %-change: -17.53% -15.89% Ldst are helped. total quadwords in shared programs: 1398329 -> 1392117 (-0.44%) quadwords in affected programs: 704641 -> 698429 (-0.88%) helped: 3677 HURT: 1299 helped stats (abs) min: 1.0 max: 26.0 x̄: 2.51 x̃: 1 helped stats (rel) min: 0.10% max: 26.92% x̄: 2.64% x̃: 1.89% HURT stats (abs) min: 1.0 max: 20.0 x̄: 2.31 x̃: 1 HURT stats (rel) min: 0.11% max: 44.44% x̄: 2.34% x̃: 1.55% 95% mean confidence interval for quadwords value: -1.34 -1.16 95% mean confidence interval for quadwords %-change: -1.44% -1.25% Quadwords are helped. total threads in shared programs: 35234 -> 35311 (0.22%) threads in affected programs: 119 -> 196 (64.71%) helped: 91 HURT: 14 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.60 0.87 95% mean confidence interval for threads %-change: 70.08% 89.92% Threads are helped. total loops in shared programs: 125 -> 125 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 149 -> 144 (-3.36%) spills in affected programs: 22 -> 17 (-22.73%) helped: 1 HURT: 0 total fills in shared programs: 966 -> 956 (-1.04%) fills in affected programs: 44 -> 34 (-22.73%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	e0e63c2a8e	pan/bi: Specialize IDVS in NIR It's a bit more code, but it's needed to chew through control flow since we don't have a backend version of dead_cf. Results are really good, meaning I really screwed this up the first time around (hence the cc mesa-stable). total instructions in shared programs: 1963576 -> 1939513 (-1.23%) instructions in affected programs: 671053 -> 646990 (-3.59%) helped: 4436 HURT: 729 helped stats (abs) min: 1.0 max: 43.0 x̄: 5.75 x̃: 6 helped stats (rel) min: 0.21% max: 100.00% x̄: 6.47% x̃: 5.17% HURT stats (abs) min: 1.0 max: 22.0 x̄: 2.01 x̃: 1 HURT stats (rel) min: 0.50% max: 50.00% x̄: 10.45% x̃: 9.09% 95% mean confidence interval for instructions value: -4.77 -4.55 95% mean confidence interval for instructions %-change: -4.36% -3.80% Instructions are helped. total tuples in shared programs: 1533335 -> 1523194 (-0.66%) tuples in affected programs: 483167 -> 473026 (-2.10%) helped: 3414 HURT: 1288 helped stats (abs) min: 1.0 max: 20.0 x̄: 3.73 x̃: 2 helped stats (rel) min: 0.27% max: 100.00% x̄: 4.87% x̃: 3.03% HURT stats (abs) min: 1.0 max: 19.0 x̄: 2.02 x̃: 1 HURT stats (rel) min: 0.24% max: 38.10% x̄: 8.10% x̃: 5.88% 95% mean confidence interval for tuples value: -2.28 -2.03 95% mean confidence interval for tuples %-change: -1.62% -1.02% Tuples are helped. total clauses in shared programs: 351432 -> 329158 (-6.34%) clauses in affected programs: 142237 -> 119963 (-15.66%) helped: 5328 HURT: 3 helped stats (abs) min: 1.0 max: 43.0 x̄: 4.18 x̃: 4 helped stats (rel) min: 0.74% max: 100.00% x̄: 19.44% x̃: 17.24% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 9.09% max: 12.50% x̄: 10.90% x̃: 11.11% 95% mean confidence interval for clauses value: -4.25 -4.11 95% mean confidence interval for clauses %-change: -19.72% -19.12% Clauses are helped. total cycles in shared programs: 202830.92 -> 172084.50 (-15.16%) cycles in affected programs: 117078.42 -> 86332 (-26.26%) helped: 5450 HURT: 1 helped stats (abs) min: 0.083333 max: 49.0 x̄: 5.64 x̃: 5 helped stats (rel) min: 1.42% max: 100.00% x̄: 27.94% x̃: 25.64% HURT stats (abs) min: 0.25 max: 0.25 x̄: 0.25 x̃: 0 HURT stats (rel) min: 2.46% max: 2.46% x̄: 2.46% x̃: 2.46% 95% mean confidence interval for cycles value: -5.74 -5.54 95% mean confidence interval for cycles %-change: -28.30% -27.58% Cycles are helped. total arith in shared programs: 57274.29 -> 57145.04 (-0.23%) arith in affected programs: 16418.33 -> 16289.08 (-0.79%) helped: 2442 HURT: 1784 helped stats (abs) min: 0.041665999999999315 max: 0.75 x̄: 0.14 x̃: 0 helped stats (rel) min: 0.23% max: 100.00% x̄: 5.51% x̃: 2.87% HURT stats (abs) min: 0.041665999999999315 max: 0.9166670000000003 x̄: 0.12 x̃: 0 HURT stats (rel) min: 0.00% max: 100.00% x̄: 25.13% x̃: 9.09% 95% mean confidence interval for arith value: -0.04 -0.03 95% mean confidence interval for arith %-change: 6.61% 8.24% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total texture in shared programs: 12857 -> 12857 (0.00%) texture in affected programs: 0 -> 0 helped: 0 HURT: 0 total vary in shared programs: 11157.75 -> 11157.75 (0.00%) vary in affected programs: 0 -> 0 helped: 0 HURT: 0 total ldst in shared programs: 177208 -> 146420 (-17.37%) ldst in affected programs: 117098 -> 86310 (-26.29%) helped: 5447 HURT: 0 helped stats (abs) min: 1.0 max: 49.0 x̄: 5.65 x̃: 5 helped stats (rel) min: 1.92% max: 100.00% x̄: 27.91% x̃: 25.64% 95% mean confidence interval for ldst value: -5.75 -5.55 95% mean confidence interval for ldst %-change: -28.27% -27.56% Ldst are helped. total quadwords in shared programs: 1436507 -> 1398329 (-2.66%) quadwords in affected programs: 515101 -> 476923 (-7.41%) helped: 5150 HURT: 111 helped stats (abs) min: 1.0 max: 39.0 x̄: 7.46 x̃: 6 helped stats (rel) min: 0.17% max: 100.00% x̄: 10.02% x̃: 8.24% HURT stats (abs) min: 1.0 max: 9.0 x̄: 2.01 x̃: 1 HURT stats (rel) min: 0.43% max: 21.62% x̄: 3.57% x̃: 1.94% 95% mean confidence interval for quadwords value: -7.41 -7.11 95% mean confidence interval for quadwords %-change: -9.98% -9.49% Quadwords are helped. total threads in shared programs: 35025 -> 35228 (0.58%) threads in affected programs: 218 -> 421 (93.12%) helped: 208 HURT: 5 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 50.00% max: 50.00% x̄: 50.00% x̃: 50.00% 95% mean confidence interval for threads value: 0.91 0.99 95% mean confidence interval for threads %-change: 93.40% 99.55% Threads are helped. total loops in shared programs: 128 -> 125 (-2.34%) loops in affected programs: 3 -> 0 helped: 3 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 100.00% max: 100.00% x̄: 100.00% x̃: 100.00% total spills in shared programs: 158 -> 149 (-5.70%) spills in affected programs: 15 -> 6 (-60.00%) helped: 9 HURT: 0 total fills in shared programs: 1133 -> 966 (-14.74%) fills in affected programs: 197 -> 30 (-84.77%) helped: 9 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Alyssa Rosenzweig	3c1021cd1e	panvk: Use more reliable assert for UBO pushing The important thing isn't the number of words pushed, it's that there are no UBOs required for us to upload. Check that instead. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15090>	2022-02-22 15:21:09 +00:00
Georg Lehmann	d223b7f096	radv, aco: Add u_foreach_bit to .clang-format. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15083>	2022-02-22 14:57:29 +00:00
Xaver Hugl	a6be12fdad	gbm: improve documentation about the lifetime of resources Signed-off-by: Xaver Hugl <xaver.hugl@gmail.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10906>	2022-02-22 14:42:52 +01:00
Marek Olšák	62074cb4ac	ac: update shadowed registers based on PAL Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	e74929bfef	radeonsi: move Arcturus code outside the gfx9 branch preparation for a future commit Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	c740fd18ba	ac/llvm: replace structured by vindex != NULL in ac_build_buffer_store_common "raw" (IDXEN=0) and "structured" (IDXEN=1) do bounds checking differently. From `si_make_buffer_descriptor`: * - For VMEM and inst.IDXEN == 0 or STRIDE == 0, it's in byte units. * - For VMEM and inst.IDXEN == 1 and STRIDE != 0, it's in units of STRIDE. so there is a difference between setting vindex = i32_0 and vindex = NULL. Instead of having the `structured` flag, we can just check if vindex is NULL. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	1038382baf	ac/llvm: replace structured by vindex != NULL in ac_build_tbuffer_store "raw" (IDXEN=0) and "structured" (IDXEN=1) do bounds checking differently. From `si_make_buffer_descriptor`: * - For VMEM and inst.IDXEN == 0 or STRIDE == 0, it's in byte units. * - For VMEM and inst.IDXEN == 1 and STRIDE != 0, it's in units of STRIDE. so there is a difference between setting vindex = i32_0 and vindex = NULL. Instead of having the `structured` flag, we can just check if vindex is NULL. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	c8e2c6faf6	radeonsi: use SET_SH_REG_INDEX with index=3 for registers containing CU_EN This matches PAL and RADV behavior. It's for preemption. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	79a7ab642a	ac/surface: add more elements to meta equations because HTILE can use them according to gfx10SwizzlePattern.h Fixes: `9fabbf2150` - ac/surface: copy the HTILE equations to the surface Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	9a28f79f7b	ac/surface/tests: fix missing NUM_PKRS extraction in test_modifier Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	12e00be09b	radeonsi: apply the LLVM discard bug workaround to LLVM 13 only It was fixed in LLVM 14. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	21f169b2fb	ac,radeonsi: rework and optimize how TMPRING_SIZE is set Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Yogesh Mohan Marimuthu	3c7df183a3	radeonsi: prepare clamp, alpha test before mrtz prepare Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Yogesh Mohan Marimuthu	c268dafae5	radeonsi: move clamp, alpha test from si_export_mrt_color() to new function Signed-off-by: Yogesh Mohan Marimuthu <yogesh.mohanmarimuthu@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	1485f683e3	radeonsi: fix the unaligned clear_buffer fallback with TC Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	4e49a05e37	radeonsi: increase the tesselation factor ring size based on PAL Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	37c26a72a4	radeonsi: remove bit gaps in SI_RESOURCE_FLAG_* Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	f865631b1b	radeonsi: replace SI_RESOURCE_FLAG_UNMAPPABLE with PIPE_RESOURCE_FLAG_UNMAPPABLE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	790362c10e	radeonsi: don't map buffers that VK made unmappable Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	ad9b5ac0a1	radeonsi: more fixes for si_buffer_from_winsys_buffer for GL-VK interop Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
jiadozhu	c9097d8939	radeonsi: fix crash in flush_resource when used with buffers glWaitSemaphoreEXT triggers si_flush_resource callback on pipe buffer resources, which may cause segmentation fault. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	f968cb921d	radeonsi: reduce the max TBO/SSBO binding size to 512 MB to help 32-bit builds Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	1585b4584d	radeonsi: document an unexpected behavior of PS_DONE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	cc5fd00c5c	radeonsi: change ACCUM_ISOLINE to 12 based on PAL Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	9c5b4fa097	radeonsi: program SQ_THREAD_TRACE_CTRL.AUTO_FLUSH_MODE on gfx10.3 discovered internally Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	18a1af4929	radeonsi: always set FLUSH_ON_BINNING_TRANSITION The hardware does the right thing automatically. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	a87ab82f25	radeonsi: add assertions to check if buffer_map/texture_map calls are valid Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	f23e07525f	winsys/amdgpu: fix a warning of defining radeon_screen_create_t twice Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	bf7d25cd1b	ac/llvm: remove unused function dpp_row_sl Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	cfaaa0892f	ac/surface: don't set the display flag for 1D textures Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	2f2fca24d2	ac/gpu_info: print units for some radeon_info fields Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	53f683ff67	ac: add a gfx9 workaround for high priority compute Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	197467c238	amd: add a workaround for an SQ perf counter bug Cc: mesa-stable@lists.freedesktop.org Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	95af3cc2f8	amd: remove the _UMD suffix from register definitions It was mistakenly added to indicate it's for a User-Mode Driver, but all defined registers in Mesa are. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:04 +00:00
Marek Olšák	707a94f3c5	winsys/radeon: fix a hang due to introducing spi_cu_en Fixes: `5406ad93` "radeonsi: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5989 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15098>	2022-02-22 11:41:03 +00:00
Iago Toral Quiroga	c4a78a2d2a	broadcom/compiler: fix register class patching for postponed spills If we have a postponed spill, the temp we create at ip is no longer the spilled temp and therefore is affected by the thrsw injection. Fixes corruption in the additive blending animation demo from Three.js. Fixes: `f3c3228522` ('broadcom/compiler: do not rebuild the interference graph after each spill') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15112>	2022-02-22 11:17:10 +00:00
Tapani Pälli	09b86b4061	iris: setup internal_format for memory object resources We need to setup internal_format for resource in case main surface was not configured (iris_resource_configure_main) which is the case with vertex buffer objects, otherwise transfer helper will make wrong decisions when copying such a resource. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14925>	2022-02-22 10:56:21 +00:00
Erik Faye-Lund	5c5243adb7	vulkan/wsi: use buffer-image code-path on Windows This means we no longer map optimal-tiling images in order to display them on screen, but instead copy via a buffer, which is guaranteed to linearize. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Erik Faye-Lund	2e8c119531	vulkan/wsi: add transition to/from transfer-src state Without these barriers, we'll try to blit from the image when it's in the incorrect layout, which is illegal. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Erik Faye-Lund	25a37fabb7	vulkan/wsi: untangle buffer-images from prime Not all Vulkan implementations allows rendering to linear images, so in order to support scanning out from these on Windows we might have to copy through a buffer like we do in the PRIME path. To avoid reimplementing the same, let's instead generalize the code a bit so it doesn't have to specfy any PRIME-specific details. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Boris Brezillon	32c2db1b25	vulkan/wsi: Don't open-code vk_format_get_blocksize() We already have a standard helper to retrieve the texel block size from a VkFormat, let's use it instead of adding a new helper. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Suggested-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Boris Brezillon	7c1c6606d0	vulkan/wsi: Use ALIGN_POT() instead of open-coding it align_u32() and ALIGN_POT() are doing the same thing. Replace align_u32() calls by ALIGN_POT() ones and get rid of align_u32(). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Suggested-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Erik Faye-Lund	83938a1bb3	vulkan/wsi: pass win32-swapchain directly Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12210>	2022-02-22 10:04:34 +00:00
Marcin Ślusarz	fddab17e54	anv: cleanup begin_subpass & end_subpass Share values of some expressions instead of duplicating the same logic. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15079>	2022-02-22 09:09:05 +00:00
Marcin Ślusarz	f91bfc80ba	intel/compiler: remove redundant code from fs_visitor::run_* Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15079>	2022-02-22 09:09:05 +00:00
Gert Wollny	2fbb4e85f7	virgl: Enable PIPE_CAP_TGSI_TEXCOORD when the host supports it Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15025>	2022-02-22 08:46:54 +00:00
Juan A. Suarez Romero	db8a202b41	vc4: remove redundant initialization These assignments are not required. Partially fixes https://gitlab.freedesktop.org/mesa/mesa/-/issues/3966 Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15080>	2022-02-22 08:29:51 +00:00
Samuel Pitoiset	369b8cffea	radv,aco,llvm: lower adjusting vertex alpha in NIR Instead of duplicating the same lowering in both compiler backends. This pass will be used to do more VS input lowering. fossils-db (Polaris10): Totals from 48 (0.04% of 135960) affected shaders: VGPRs: 1692 -> 1684 (-0.47%) CodeSize: 54016 -> 53964 (-0.10%); split: -0.11%, +0.01% MaxWaves: 339 -> 341 (+0.59%) Instrs: 11260 -> 11247 (-0.12%); split: -0.13%, +0.02% Latency: 88165 -> 88113 (-0.06%); split: -0.07%, +0.01% InvThroughput: 36153 -> 36093 (-0.17%) Copies: 583 -> 568 (-2.57%) fossils-db (Pitcairn): Totals from 43 (0.03% of 135960) affected shaders: VGPRs: 1548 -> 1552 (+0.26%) CodeSize: 47900 -> 47820 (-0.17%) Instrs: 10751 -> 10731 (-0.19%) Latency: 83029 -> 82873 (-0.19%) VClause: 168 -> 164 (-2.38%) SClause: 393 -> 391 (-0.51%) Copies: 705 -> 685 (-2.84%) Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15076>	2022-02-22 08:08:42 +00:00
Qiang Yu	100c80392a	util/util_vertex_state_cache: remove error check when deinit Application may exit without freeing created display list, this may leave the cache not empty. This is triggered by Abaqus which just close X11 display without calling any of GLX cleanup functions like glXDestroyContext. But GLX hook to X11 display close function to free GLX screen resource. So display list as a context resource has not been freed, but cache as a screen resource is freed. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14926>	2022-02-22 07:10:40 +00:00
Qiang Yu	f1650bd39b	driconf: add Abaqus configs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14926>	2022-02-22 07:10:40 +00:00
Qiang Yu	1dac5454ea	glx: keep native window glx drawable by driconf option DRI3 window back buffer is a client resource, so it's destroyed when context switch drawable for native window. But some application like Abaqus may leave a dirty back buffer and reuse it when switch back. So add a driconf option for these kind of app to keep the entire GLX drawable for native window. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14926>	2022-02-22 07:10:40 +00:00
Qiang Yu	4a420c50f2	glx: merge drawable release to the same function Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14926>	2022-02-22 07:10:40 +00:00
Qiang Yu	bf09c08e31	glx: fix pbuffer refcount init glXMakeCurrent* may miss release pbuffer if pbuffer is created with refcount=0. This won't happen when pbuffer had different GLX id and X pixmap id. cc: mesa-stable Fixes: `bc8a51a79a` ("glx: no need to create extra pixmap for pbuffer") Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14926>	2022-02-22 07:10:40 +00:00
Iago Toral Quiroga	a4b164b57b	broadcom/compiler: only patch temps that existed before the current spill When we spill we add new temps. We should be careful not to access liveness for these until we have re-computed it after all spills and fill for that the spilled temp have been processed so as to avoid out-of-bounds accesses to the c->temp_start and c->temp_end arrays. This fixes a crash in a Three.js demo when we try to patch register classes after a TMU spill that was caused because we would incorrectly try to patch the same temps we had just added for the spill itself, which is not only unnecessary but also incorrect since we these temps would not have liveness information available yet and thus would cause out of bounds accesses. Fixes: `f3c3228522` ('broadcom/compiler: do not rebuild the interference graph after each spill') Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15107>	2022-02-22 06:41:51 +00:00
Marek Olšák	7ec8a3205e	gallivm: fix build with LLVM 15 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15095>	2022-02-22 01:28:11 +00:00
Marek Olšák	7893192613	ci: bump piglit version Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14750>	2022-02-21 21:42:04 +00:00
Marek Olšák	eb4bd06ef4	gallium: add PIPE_RESOURCE_FLAG_UNMAPPABLE for shared unmappable buffers We need to handle this in u_threaded_context for GL-VK interop. Drivers should set this when importing buffers if needed. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14750>	2022-02-21 21:42:04 +00:00
Jiadong Zhu	10e4bad803	st/mesa: set GL_DYNAMIC_STORAGE_BIT for GL-VK interop buffers GL_DYNAMIC_STORAGE_BIT needs to be set to make glBufferSubData work. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14750>	2022-02-21 21:42:04 +00:00
Erik Faye-Lund	2cd0779f83	nir/spirv: guard macros in case of redefinition On some systems, these macros are already defined, and being defined slightly differently causes them to emit redefinition-warnings. Let's wrap them in ifdefs to avoid the warnings. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15084>	2022-02-21 19:47:17 +00:00
Alyssa Rosenzweig	694fe73976	asahi: Fix use-after-free in shader key We need to take ownership of shader keys before we can insert them into the hash table. Caught by ASan. ==6343==ERROR: AddressSanitizer: stack-buffer-overflow on address 0x00016bc51410 at pc 0x00010498d6cc bp 0x00016bc50240 sp 0x00016bc4f9d0 READ of size 592 at 0x00016bc51410 thread T0 #0 0x10498d6c8 in MemcmpInterceptorCommon(void, int ()(void const, void const, unsigned long), void const, void const, unsigned long)+0x208 (libclang_rt.asan_osx_dynamic.dylib:arm64e+0x196c8) #1 0x10498da08 in wrap_memcmp+0x98 (libclang_rt.asan_osx_dynamic.dylib:arm64e+0x19a08) #2 0x10b7f3f18 in asahi_shader_key_equal agx_state.c:867 #3 0x10a482e7c in hash_table_search hash_table.c:325 #4 0x10b7f4e94 in agx_update_shader agx_state.c:899 #5 0x10b7f0dc4 in agx_draw_vbo agx_state.c:1590 #6 0x10a7c28c4 in u_vbuf_draw_vbo u_vbuf.c:1498 #7 0x10a5db03c in cso_multi_draw cso_context.c:1639 #8 0x10aed03d0 in _mesa_validated_drawrangeelements draw.c:1812 #9 0x10aed08d4 in _mesa_DrawElements draw.c:1945 Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15109>	2022-02-21 11:53:32 -05:00
Michel Dänzer	ed246aa29b	ci: Remove unused is-for-marge YAML anchor Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14225>	2022-02-21 14:41:01 +00:00
Michel Dänzer	40addce70e	ci: Use $CI_PIPELINE_SOURCE Instead of $CI_MERGE_REQUEST_SOURCE_BRANCH_NAME. Shorter and clearer. Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14225>	2022-02-21 14:41:01 +00:00
Michel Dänzer	c642eed38c	ci: Use $CI_COMMIT_BRANCH This was recently added to indicate pipelines for branches. v2: * Modify .gitlab-ci/test-source-dep.yml as well. Acked-by: Emma Anholt <emma@anholt.net> # v1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14225>	2022-02-21 14:41:01 +00:00
Rhys Perry	a9ac270c5f	nir/validate: don't add instrs not present in shader to shader_gc_list This makes the set smaller and GC list validation faster. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13547>	2022-02-21 11:57:22 +00:00
Rhys Perry	925c5f817d	nir/validate: don't validate the GC list by default This seems really slow. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13547>	2022-02-21 11:57:18 +00:00
Samuel Pitoiset	e6853de6b0	radv: set profile_peak when capturing with SQTT Using new CTX OP to set/get stable pstates. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14038>	2022-02-21 11:16:11 +00:00
Samuel Pitoiset	5cf4f0cc91	radv/winsys: add support for new CTX OP to set/get stable pstates Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14038>	2022-02-21 11:16:11 +00:00
Samuel Pitoiset	cdf9a1a911	ac: add ac_gpu_info::has_stable_pstate Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14038>	2022-02-21 11:16:11 +00:00
Samuel Pitoiset	595c81166b	include/drm-uapi: update amdgpu_drm.h for new CTX OP to set/get stable pstates Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14038>	2022-02-21 11:16:11 +00:00
Samuel Pitoiset	0e060e7a08	meson: bump libdrm_amdgpu version to 2.4.110 For the new AMDGPU CTX OP stable pstates. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14038>	2022-02-21 11:16:11 +00:00
Samuel Pitoiset	4d38890c9e	ci: upgrade to libdrm 2.4.110 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14038>	2022-02-21 11:16:11 +00:00
Mihai Preda	665474b278	radeonsi/tests: update glcts baseline on vega20 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15075>	2022-02-21 10:56:59 +00:00
Mihai Preda	49183c113d	radeonsi/tests: update piglit baseline on vega20 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15075>	2022-02-21 10:56:59 +00:00
Mihai Preda	21b0153833	radeonsi/tests: print PCI-id of GPU device under test Allows to identify GPUs in a system with multiple devices of the same model. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15044>	2022-02-21 09:43:40 +00:00
Marcin Ślusarz	037e98a10c	anv: don't set color state when input state was requested Fixes: `814dc66935` ("anv: Allocate surface states per-subpass") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15081>	2022-02-21 08:45:05 +00:00
Dave Airlie	c108a68954	ci/lavapipe: fixup results after proper reference counting. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	ec8104c6b2	llvmpipe: allow vertex processing and fragment processing in parallel This removes the block at the end of fragment processing and allow multiple scenes to be queued to the rasterizer up to MAX_SCENES. Running heaven with this shows up to 39 scenes been allocated in the first few renders, but it also removes all stalls in rendering until the present stalls for completion. It reduces frame rendering times (not enough to make it > 1 fps) but looks to be about 10% increase. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	0dccec8e89	llvmpipe: add support for fence_server_sync. This fixes multi-context flushes with async fs processing. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	1318cb4c5a	lavapipe: pass partial results flags through. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	7ae32fdb9a	llvmpipe/query: add support for partial query waits. Without overlap, the fence should always be signalled so this won't be hit, but once overlap is enabled, then cases will start to fail due to missing this. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	2b658dea03	gallium: add partial bit to the query flags. This is to match the vulkan partial bit so lavapipe can work with new llvmpipe changes. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	d807226d37	llvmpipe: check framebuffer resources for all scenes for references. Some inflight scenes might be using the framebuffers as a dest, make sure to get flushing correct. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	fae5188de7	llvmpipe: add images to the scene resource tracker. This adds all the images to the scene resource tracker. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	eff32fa591	llvmpipe: add ssbo to resources reference by scenes. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	ef34719459	llvmpipe: pass ssbo write mask down into setup. this will be used to keep track of ssbo buffer references. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	403f9aea0e	llvmpipe: add writeable resource tracking to the scene. The scene tracks resource, but only currently tracks textures, in order for scene overlap to work properly, it needs to track fb, ssbo and image resources as well. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	486dada6bc	llvmpipe: size initial allocation and free scenes Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	4649a941e3	llvmpipe: handle dynamically creating scenes when needed This will create scenes from the slab allocator up the to max Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	8bdc40088b	llvmpipe: base the scene queue size of the max number of scenes. If the max scenes increases then the queue will get resized. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	d269634fcd	llvmpipe/scene: move to slab allocated objects for scenes. Currently we only allocate one scene, but I'd like to increase that so move it to a slab allocator. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	d89f91d54c	llvmpipe/flush: always finish whether for cpu/gpu access. Subsequent GPU access to resources for reading in the vertex shader may rely on previous fragment shaders being flushed out. Always finish here. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	ccdee8aade	llvmpipe: convert texture barrier to a finish. Need to flush the rasterizer and wait for everything to finish, with new overlap flush isn't enough. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	604ed15e56	lavapipe: handle non-timeline semaphores wait/signal. When llvmpipe is allowed execute fragment shaders overlapping with other stuff, we have to start using the pipe fences. With presentation, the acquire path needs to signal a semaphore that can be waited on by the user, so add support for passing signal/wait semaphores for non-timeline in, and just put a fence pointer in the semaphore for that case. This fixes rendering once we allow overlapping rasterization. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	70dfa8b32f	lavapipe: don't flush on transfer operations. The pipeline barrier/wait event code should handle this. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	20696aa170	lavapipe: execute a finish in pipeline barrier and event waiting. Refactor out the code for finishing a fence and used it in pipeline barrier and event waiting. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	1f1e62c15d	lavapipe: handle endless fence timeout properly. If the users ask for an infinte timeout, just pass it through to gallium. When llvmpipe ends up allowing async fragment shaders, it's important to get this right for lots of CTS tests. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	7a1426db66	lavapipe: fix pipeline statistic query results with availability. The availability is meant to be the last integer value written, but for pipeline stats this was being done wrong. calculate the availability position properly. With the old non-overlapping execution model queries never were unavailable. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Dave Airlie	aeed07f604	drisw: fence drawing to the swap/copy buffers. Currently neither llvmpipe or softpipe ever leave any drawing in the pipeline, but I'd like to change that for llvmpipe. This makes drisw block for completed rendering before sending data to the X server. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14923>	2022-02-21 03:52:02 +00:00
Ilia Mirkin	65c4b6a4c6	freedreno/ir3: document GETINFO's x/y results The zw were already known, but throw them in here too. I'm not extremely happy with the description of "y", feels like there's a simpler explanation there, but I couldn't find it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14672>	2022-02-21 02:09:19 +00:00
Qiang Yu	80974a5f1e	radeonsi: fix depth stencil multi sample texture blit This causes the flushed_depth_texture is allocated without multi sample. So the blit will cause VM fault. cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14990>	2022-02-21 01:47:23 +00:00
Dave Airlie	0f989a840e	crocus: fix leak on gen4/5 stencil fallback blit path. Noticed by Ilia. Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15100>	2022-02-21 10:21:56 +10:00
Ilia Mirkin	357dae424f	freedreno/a4xx: make luminance formats renderable, add missing L8A8_SNORM If the luminance formats aren't renderable, they back out to R* formats, but those will end up with a 1 in alpha rather than 0 when textured. So instead make them explicitly renderable, which will cause the correct texture format swizzle to be applied. Fixes query-rgba-signed-components and probably others. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15097>	2022-02-20 16:58:03 +00:00
Ilia Mirkin	56b1bd086f	freedreno/a4xx: use correct macro for color Doesn't actually matter since all the colors are encoded the same. But for consistency... Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15097>	2022-02-20 16:58:03 +00:00
Danylo Piliaiev	a814a4f9db	turnip: Add a refcount mechanism to BOs Until now we have lived without a refcount mechanism in the driver because in Vulkan the user is responsible for handling the life span of memory allocations for all Vulkan objects, however, imported BOs are tricky because the kernel doesn't refcount so user-space needs to make sure that: 1. When importing a BO into the same device used to create it (self-importing) it does not double free the same BO. 2. Frees imported BOs that were not allocated through the same device. Our initial implementation always freed BOs when requested, so we handled 2) correctly but not 1) on drm and we would double-free self-imported BOs because kernel doesn't return a unique gem_handle on each import. Beside this the submit ioctl checks for duplicates in the BO list and returns an error if there is one. This fixes the problem for good by adding refcounts to BOs so that self-imported BOs have a refcnt > 1 and are only freed when all references are freed. KGSL on the other hand does not have the same problems, at least not with ION buffers which are used for exportable BOs on pre 5.10 android kernels. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5936 Fixes CTS tests: dEQP-VK.drm_format_modifiers.export_import.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15031>	2022-02-19 15:16:55 +00:00
Lionel Landwerlin	2763a8af5a	anv/genxml/intel/fs: fix binding shader record entry Bit is flipped compared to all the other packets. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `705395344d` ("intel/fs: Add support for compiling bindless shaders with resume shaders") Fixes: `c3ac9afca3` ("anv: Create and return ray-tracing pipeline SBT handles") Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15078>	2022-02-19 13:50:56 +00:00
Chia-I Wu	5f3e50b27c	venus: trace vn_ring_wait_space It is good to know that we run out of ring space and have to wait. This happens easily with fossilize-replay because encoding a vkCreateGraphicsPipeline takes microseconds while executing it can take milliseconds, >100ms sometimes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14966>	2022-02-19 03:57:30 +00:00
Chia-I Wu	7cb2e9a8f0	venus: cache VkFormatProperties This is for fossilize-replay which keeps querying for the same formats. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14966>	2022-02-19 03:57:30 +00:00
Alyssa Rosenzweig	e392dd8237	pan/bi: Promote MUX to CSEL in the scheduler Helps scheduling, and makes scheduling more predictable when deciding between MUX and CSEL. total tuples in shared programs: 1523328 -> 1516256 (-0.46%) tuples in affected programs: 509800 -> 502728 (-1.39%) helped: 1977 HURT: 181 helped stats (abs) min: 1.0 max: 48.0 x̄: 3.71 x̃: 2 helped stats (rel) min: 0.04% max: 14.29% x̄: 1.98% x̃: 1.28% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.43 x̃: 1 HURT stats (rel) min: 0.14% max: 7.69% x̄: 1.40% x̃: 0.70% 95% mean confidence interval for tuples value: -3.47 -3.08 95% mean confidence interval for tuples %-change: -1.79% -1.60% Tuples are helped. total clauses in shared programs: 350552 -> 349906 (-0.18%) clauses in affected programs: 34839 -> 34193 (-1.85%) helped: 570 HURT: 49 helped stats (abs) min: 1.0 max: 16.0 x̄: 1.22 x̃: 1 helped stats (rel) min: 0.67% max: 20.00% x̄: 3.26% x̃: 2.22% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.92% max: 16.67% x̄: 4.31% x̃: 4.17% 95% mean confidence interval for clauses value: -1.13 -0.96 95% mean confidence interval for clauses %-change: -2.95% -2.38% Clauses are helped. total cycles in shared programs: 202589.37 -> 202512.25 (-0.04%) cycles in affected programs: 7644.46 -> 7567.33 (-1.01%) helped: 771 HURT: 147 helped stats (abs) min: 0.041665999999999315 max: 1.8333360000000027 x̄: 0.11 x̃: 0 helped stats (rel) min: 0.16% max: 14.29% x̄: 2.10% x̃: 1.35% HURT stats (abs) min: 0.041665999999999315 max: 0.3333340000000007 x̄: 0.07 x̃: 0 HURT stats (rel) min: 0.24% max: 7.41% x̄: 1.49% x̃: 1.11% 95% mean confidence interval for cycles value: -0.09 -0.07 95% mean confidence interval for cycles %-change: -1.69% -1.36% Cycles are helped. total arith in shared programs: 56755.96 -> 56585.50 (-0.30%) arith in affected programs: 18746.29 -> 18575.83 (-0.91%) helped: 1605 HURT: 352 helped stats (abs) min: 0.04166399999999726 max: 1.8333360000000027 x̄: 0.12 x̃: 0 helped stats (rel) min: 0.07% max: 20.00% x̄: 1.92% x̃: 1.12% HURT stats (abs) min: 0.041665999999999315 max: 0.3333340000000007 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.17% max: 33.33% x̄: 2.09% x̃: 1.08% 95% mean confidence interval for arith value: -0.09 -0.08 95% mean confidence interval for arith %-change: -1.34% -1.07% Arith are helped. total quadwords in shared programs: 1429737 -> 1424670 (-0.35%) quadwords in affected programs: 418175 -> 413108 (-1.21%) helped: 1682 HURT: 198 helped stats (abs) min: 1.0 max: 35.0 x̄: 3.17 x̃: 2 helped stats (rel) min: 0.04% max: 13.33% x̄: 1.72% x̃: 1.29% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.38 x̃: 1 HURT stats (rel) min: 0.15% max: 7.41% x̄: 1.30% x̃: 0.92% 95% mean confidence interval for quadwords value: -2.86 -2.53 95% mean confidence interval for quadwords %-change: -1.48% -1.32% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	a8418abd74	pan/bi: Revert "Fix load_const of 1-bit booleans" This reverts commit `29d319c767`. Now that we use nir_lower_bool_to_bitsize, we don't see 1-bit booleans anymore, so the issue this fixed doesn't apply. Actually, that issue was (in part) why I started looking into boolean handling in the first place. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	21bdee7bcc	pan/bi: Switch to lower_bool_to_bitsize Instead of ingesting 1-bit booleans and trying to force everything to be 16-bit, except when it isn't, and creating a mess in the backend... just use the NIR pass designed to select bitsize for booleans. Yes, this means we need to handle more NIR instructions, but the handling is easier and the conversion is more obvious (except for some edge cases like 16-bit vectorized b32csel). This generates noticeably better code, and the generated code will be easier to optimize. total instructions in shared programs: 90257 -> 88941 (-1.46%) instructions in affected programs: 49145 -> 47829 (-2.68%) helped: 201 HURT: 2 helped stats (abs) min: 1.0 max: 40.0 x̄: 6.57 x̃: 3 helped stats (rel) min: 0.29% max: 13.89% x̄: 2.57% x̃: 1.90% HURT stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 HURT stats (rel) min: 2.15% max: 2.74% x̄: 2.45% x̃: 2.45% 95% mean confidence interval for instructions value: -7.71 -5.26 95% mean confidence interval for instructions %-change: -2.84% -2.20% Instructions are helped. total tuples in shared programs: 73740 -> 72922 (-1.11%) tuples in affected programs: 36564 -> 35746 (-2.24%) helped: 184 HURT: 7 helped stats (abs) min: 1.0 max: 74.0 x̄: 4.49 x̃: 2 helped stats (rel) min: 0.30% max: 16.67% x̄: 2.86% x̃: 1.89% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.29 x̃: 1 HURT stats (rel) min: 0.12% max: 12.50% x̄: 4.26% x̃: 3.33% 95% mean confidence interval for tuples value: -5.29 -3.28 95% mean confidence interval for tuples %-change: -3.06% -2.13% Tuples are helped. total clauses in shared programs: 15993 -> 15928 (-0.41%) clauses in affected programs: 2464 -> 2399 (-2.64%) helped: 35 HURT: 16 helped stats (abs) min: 1.0 max: 27.0 x̄: 2.31 x̃: 1 helped stats (rel) min: 0.49% max: 18.88% x̄: 7.63% x̃: 5.88% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.79% max: 6.25% x̄: 1.91% x̃: 1.01% 95% mean confidence interval for clauses value: -2.46 -0.09 95% mean confidence interval for clauses %-change: -6.38% -2.90% Clauses are helped. total cycles in shared programs: 7622.13 -> 7594.75 (-0.36%) cycles in affected programs: 1078.67 -> 1051.29 (-2.54%) helped: 103 HURT: 4 helped stats (abs) min: 0.041665999999999315 max: 3.0833319999999986 x̄: 0.27 x̃: 0 helped stats (rel) min: 0.32% max: 21.05% x̄: 3.62% x̃: 2.44% HURT stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.05 x̃: 0 HURT stats (rel) min: 0.13% max: 7.14% x̄: 2.94% x̃: 2.25% 95% mean confidence interval for cycles value: -0.33 -0.19 95% mean confidence interval for cycles %-change: -4.14% -2.61% Cycles are helped. total arith in shared programs: 2762.46 -> 2728.08 (-1.24%) arith in affected programs: 1550.12 -> 1515.75 (-2.22%) helped: 197 HURT: 6 helped stats (abs) min: 0.041665999999999315 max: 3.0833319999999986 x̄: 0.18 x̃: 0 helped stats (rel) min: 0.32% max: 21.05% x̄: 2.93% x̃: 1.61% HURT stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.06 x̃: 0 HURT stats (rel) min: 0.13% max: 20.00% x̄: 5.78% x̃: 3.37% 95% mean confidence interval for arith value: -0.21 -0.13 95% mean confidence interval for arith %-change: -3.20% -2.15% Arith are helped. total quadwords in shared programs: 68155 -> 67555 (-0.88%) quadwords in affected programs: 27944 -> 27344 (-2.15%) helped: 151 HURT: 9 helped stats (abs) min: 1.0 max: 52.0 x̄: 4.09 x̃: 3 helped stats (rel) min: 0.23% max: 12.35% x̄: 2.87% x̃: 2.17% HURT stats (abs) min: 1.0 max: 5.0 x̄: 1.89 x̃: 1 HURT stats (rel) min: 0.20% max: 6.76% x̄: 1.91% x̃: 1.13% 95% mean confidence interval for quadwords value: -4.67 -2.83 95% mean confidence interval for quadwords %-change: -2.99% -2.21% Quadwords are helped. total threads in shared programs: 2232 -> 2233 (0.04%) threads in affected programs: 1 -> 2 (100.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	a64534754d	pan/bi: Handle vectorized u2f16/i2f16 Will be useful when we enable int16, I guess... No shader-db changes. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	6a05852f5b	pan/bi: Handle trivial i2i32 lower_bool_to_bitsize can generate i2i32 from a 32-bit source, which is trivial but needs to be handled explicitly to avoid going down the 8-bit conversion path. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	f7d44a46cd	pan/bi: Optimize replication Bifrost's 16-bit support comes in the form of vectorized instructions, so when we manipulate scalars, we usually replicate to both bottom and top halves of 32-bit registers. Add an analysis pass that detects replication. Then, use that replication pass to optimize out useless swizzle instructions (by changing them to plain moves, which can be copypropped). This optimization is a slight shader-db win on its own, and allows us to transition to lower_bool_to_bitsize without regressing shader-db. total instructions in shared programs: 90323 -> 90257 (-0.07%) instructions in affected programs: 2513 -> 2447 (-2.63%) helped: 20 HURT: 0 helped stats (abs) min: 1.0 max: 16.0 x̄: 3.30 x̃: 2 helped stats (rel) min: 1.25% max: 11.11% x̄: 4.80% x̃: 4.29% 95% mean confidence interval for instructions value: -5.05 -1.55 95% mean confidence interval for instructions %-change: -6.06% -3.54% Instructions are helped. total tuples in shared programs: 73769 -> 73740 (-0.04%) tuples in affected programs: 1611 -> 1582 (-1.80%) helped: 17 HURT: 0 helped stats (abs) min: 1.0 max: 9.0 x̄: 1.71 x̃: 1 helped stats (rel) min: 0.58% max: 16.67% x̄: 4.80% x̃: 3.33% 95% mean confidence interval for tuples value: -2.70 -0.71 95% mean confidence interval for tuples %-change: -7.06% -2.54% Tuples are helped. total clauses in shared programs: 15997 -> 15993 (-0.03%) clauses in affected programs: 27 -> 23 (-14.81%) helped: 4 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 7.69% max: 25.00% x̄: 18.17% x̃: 20.00% 95% mean confidence interval for clauses value: -1.00 -1.00 95% mean confidence interval for clauses %-change: -29.91% -6.44% Clauses are helped. total cycles in shared programs: 7623.13 -> 7622.13 (-0.01%) cycles in affected programs: 64.83 -> 63.83 (-1.54%) helped: 13 HURT: 0 helped stats (abs) min: 0.0416660000000002 max: 0.375 x̄: 0.08 x̃: 0 helped stats (rel) min: 1.02% max: 5.56% x̄: 2.82% x̃: 2.50% 95% mean confidence interval for cycles value: -0.13 -0.02 95% mean confidence interval for cycles %-change: -3.79% -1.85% Cycles are helped. total arith in shared programs: 2763.75 -> 2762.46 (-0.05%) arith in affected programs: 67.17 -> 65.88 (-1.92%) helped: 18 HURT: 0 helped stats (abs) min: 0.0416660000000002 max: 0.375 x̄: 0.07 x̃: 0 helped stats (rel) min: 1.02% max: 22.22% x̄: 5.68% x̃: 3.16% 95% mean confidence interval for arith value: -0.11 -0.03 95% mean confidence interval for arith %-change: -8.56% -2.80% Arith are helped. total quadwords in shared programs: 68173 -> 68155 (-0.03%) quadwords in affected programs: 1258 -> 1240 (-1.43%) helped: 14 HURT: 0 helped stats (abs) min: 1.0 max: 3.0 x̄: 1.29 x̃: 1 helped stats (rel) min: 0.42% max: 8.70% x̄: 3.88% x̃: 3.67% 95% mean confidence interval for quadwords value: -1.64 -0.93 95% mean confidence interval for quadwords %-change: -5.27% -2.49% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	35ff537814	pan/bi: Constant fold swizzles on constants This lets us avoid generating SWZ instructions. Those instructions could be constant folded but that complicates the replication analysis introduced in the next commit. Almost no shader-db changes. quadwords HURT: shaders/glmark/1-22.shader_test MESA_SHADER_FRAGMENT: 718 -> 722 (0.56%) total quadwords in shared programs: 68169 -> 68173 (<.01%) quadwords in affected programs: 718 -> 722 (0.56%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	62533a6e64	pan/bi: Lower swizzles on MUX.v2i16 We'll generate this in a moment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Alyssa Rosenzweig	8bd4976d98	pan/bi: Lower swizzles on CSEL.i32/MUX.i32 This is counter-intuitive, but required for correct operation when CSEL.i32 takes a 1-bit (stored 16-bit) boolean argument. The impedance mismatch ultimately is between CSEL.b32 (nir's bcsel, nonexistant in the hardware) and the lowering CSEL.i32. However, a similar problem exists even with MUX.i32 which lacks a good way of zero/sign-extending booleans. Cherry-picked from my Valhall branch though the issue also affects Bifrost. Fixes piglit shaders@glsl-vs-if-bool on Bifrost. Unfortunately, shader-db is quite unhappy :-( The proper fix is to use lower_bool_to_bitsize, but that can't be backported to mesa-stable. total instructions in shared programs: 157539 -> 158953 (0.90%) instructions in affected programs: 55621 -> 57035 (2.54%) helped: 2 HURT: 259 helped stats (abs) min: 2.0 max: 2.0 x̄: 2.00 x̃: 2 helped stats (rel) min: 2.11% max: 2.67% x̄: 2.39% x̃: 2.39% HURT stats (abs) min: 1.0 max: 40.0 x̄: 5.47 x̃: 2 HURT stats (rel) min: 0.36% max: 16.13% x̄: 2.55% x̃: 1.59% 95% mean confidence interval for instructions value: 4.44 6.40 95% mean confidence interval for instructions %-change: 2.21% 2.82% Instructions are HURT. total tuples in shared programs: 132322 -> 132907 (0.44%) tuples in affected programs: 31806 -> 32391 (1.84%) helped: 5 HURT: 152 helped stats (abs) min: 1.0 max: 2.0 x̄: 1.40 x̃: 1 helped stats (rel) min: 0.39% max: 3.03% x̄: 1.70% x̃: 1.61% HURT stats (abs) min: 1.0 max: 42.0 x̄: 3.89 x̃: 2 HURT stats (rel) min: 0.29% max: 18.18% x̄: 2.50% x̃: 1.79% 95% mean confidence interval for tuples value: 2.88 4.58 95% mean confidence interval for tuples %-change: 1.87% 2.85% Tuples are HURT. total clauses in shared programs: 28672 -> 28698 (0.09%) clauses in affected programs: 869 -> 895 (2.99%) helped: 1 HURT: 24 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 5.88% max: 5.88% x̄: 5.88% x̃: 5.88% HURT stats (abs) min: 1.0 max: 2.0 x̄: 1.12 x̃: 1 HURT stats (rel) min: 0.49% max: 33.33% x̄: 8.46% x̃: 3.59% 95% mean confidence interval for clauses value: 0.82 1.26 95% mean confidence interval for clauses %-change: 3.84% 11.93% Clauses are HURT. total cycles in shared programs: 15119.04 -> 15137.88 (0.12%) cycles in affected programs: 922.87 -> 941.71 (2.04%) helped: 4 HURT: 79 helped stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.05 x̃: 0 helped stats (rel) min: 0.40% max: 3.17% x̄: 1.57% x̃: 1.35% HURT stats (abs) min: 0.041665999999999315 max: 1.75 x̄: 0.24 x̃: 0 HURT stats (rel) min: 0.30% max: 20.00% x̄: 2.83% x̃: 2.12% 95% mean confidence interval for cycles value: 0.17 0.29 95% mean confidence interval for cycles %-change: 1.86% 3.37% Cycles are HURT. total arith in shared programs: 4922.71 -> 4947.71 (0.51%) arith in affected programs: 1423.79 -> 1448.79 (1.76%) helped: 5 HURT: 177 helped stats (abs) min: 0.0416669999999999 max: 0.0833330000000001 x̄: 0.06 x̃: 0 helped stats (rel) min: 0.40% max: 3.17% x̄: 1.82% x̃: 1.67% HURT stats (abs) min: 0.041665999999999315 max: 1.75 x̄: 0.14 x̃: 0 HURT stats (rel) min: 0.30% max: 22.22% x̄: 2.50% x̃: 1.52% 95% mean confidence interval for arith value: 0.11 0.17 95% mean confidence interval for arith %-change: 1.86% 2.90% Arith are HURT. total quadwords in shared programs: 120605 -> 120956 (0.29%) quadwords in affected programs: 26535 -> 26886 (1.32%) helped: 6 HURT: 143 helped stats (abs) min: 1.0 max: 7.0 x̄: 2.83 x̃: 1 helped stats (rel) min: 0.93% max: 6.33% x̄: 2.29% x̃: 1.71% HURT stats (abs) min: 1.0 max: 21.0 x̄: 2.57 x̃: 2 HURT stats (rel) min: 0.34% max: 13.79% x̄: 2.02% x̃: 1.22% 95% mean confidence interval for quadwords value: 1.86 2.86 95% mean confidence interval for quadwords %-change: 1.45% 2.24% Quadwords are HURT. total threads in shared programs: 4670 -> 4669 (-0.02%) threads in affected programs: 2 -> 1 (-50.00%) helped: 0 HURT: 1 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14576>	2022-02-19 03:02:10 +00:00
Emma Anholt	a2b7d9b9cd	ci/freedreno: Add a known spilling hangcheck flake. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15085>	2022-02-19 02:37:13 +00:00
Emma Anholt	b39d5e9705	ci/freedreno: Cut down pre-merge a630 VK coverage. We've got lots of VK coverage on 618, so take some of the load off (but leave a little bit of testing just to make sure we don't totally break 630). This should help with our Marge times since we've added some other coverage to 630 that's started overloading the runners. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15085>	2022-02-19 02:37:13 +00:00
Emma Anholt	04790ec8bb	ci/freedreno: Move a 60s timeout test to skips instead of flakes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15085>	2022-02-19 02:37:13 +00:00
Connor Abbott	7e8d885919	spirv: Rewrite determinant calculation The old calculation for mat3 was clever, but it turns out that a straightforward application of subdeterminants similar to how mat4 is handled is more efficient: on a scalar architecture with some sort of combined multiply+add instruction with a negate modifier (both fairly common), the new determinant is 9 instructions vs. 15 for the old one, and without the multiply-add it's 14 instructions vs. 18 for the old one. When used as a routine for inverse() the savings are compounded, because we now use the same method as used to compute the adjucate matrix and so CSE can combine most of the calculations with the adjucate matrix ones. Once mat3 and mat4 use the same method for computing determinants, we can combine them into a single recursive function. I also pulled up the mat_subdet() function because it was doing basically what we need, so it's now shared between determinant and inverse. This shrinks the implementation significantly, as can be seen from the diffstat. The real reason I want to change this, though, is that it fixes dEQP-VK.glsl.builtin.precision_fp16_storage16b.inverse.compute.mat3 with turnip. Qualcomm uses round-to-zero for 16-bit frcp, which combined with some inaccuracy in the old method of calculating the determinant led us to fail. Qualcomm's driver uses something like the new method to calculate the determinant in the inverse. We could argue that Mesa's method should be allowed, because round-to-zero for floating-point division is within spec and there are no precision guarantees given for determinant() or inverse(). However we might as well use the more efficient method. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14652>	2022-02-19 02:03:25 +00:00
Connor Abbott	c21065c87a	util/blob: Clarify rules on blob::data Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15028>	2022-02-19 01:25:46 +00:00
Connor Abbott	6761550357	nir/serialize: Don't access blob->data directly It won't work if the blob is fixed-size and we overrun the size, which will be the case with the Vulkan pipeline cache. This gets a bit tricky for the repeated-header optimization, because we can't read the header from the blob. Instead we have to store the header itself. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15028>	2022-02-19 01:25:46 +00:00
Alyssa Rosenzweig	9168dcbbc1	pan/bi: Disambiguate IDVS variants in shader-db Label IDVS variants as being MESA_SHADER_{POSITION, VARYING} stages; reserve the MESA_SHADER_VERTEX label for non-IDVS shaders. This reduces confusion where a single shader compiles to two MESA_SHADER_VERTEX shaders with different stats. While we're at it, de-vendor the blend shader stage name; these stats are internal anyway. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15086>	2022-02-19 00:01:07 +00:00
Alyssa Rosenzweig	01d1bf6228	asahi: Wire in pure integer texture formats Passes dEQP-GLES3.functional.texture.format.sized.2d.r* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:33 +00:00
Alyssa Rosenzweig	fded99b1c5	asahi: Support LOD clamps Passes: dEQP-GLES3.functional.texture.mipmap.2d.min_lod.* dEQP-GLES3.functional.texture.mipmap.2d.max_lod.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:33 +00:00
Alyssa Rosenzweig	cc3e98e201	asahi: Identify minimum/maximum LOD fields Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:33 +00:00
Alyssa Rosenzweig	6554790dfb	asahi: Add LOD clamp packing unit tests With GTest. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	e3a5c1b478	asahi: Add LOD type Automatically packs and unpacks float <==> clamped 4:6 fixed point, used for min/max LOD fields on the Sampler descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	db93090ffc	asahi: Allow GenXML to be used in C++ C++ requires explicit casts from integers to enums. Fixes errors like the following when trying to use Asahi GenXML from a GTest unit test. src/asahi/lib/agx_pack.h:554:23: error: assigning to 'enum agx_channels' from incompatible type 'uint64_t' (aka 'unsigned long long') values->channels = __gen_unpack_uint(cl, 0, 6); Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	055c5a59f8	agx: Round and clamp array indices Conforming with the GLSL spec. Fixes: dEQP-GLES3.functional.shaders.texture_functions.texture.sampler2darray_fixed_fragment (and probably others) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	a822b7b6cc	agx: Naturally align uniform pushes Required to pack correctly, e.g if we push a 16-bit value then a 64-bit value. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	0c2bbb470a	agx: Add agx_size_align_16 helper Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	9aeb5156bc	agx: Add typed move helper Useful for u2u16 in lowering code. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	830d16e9f0	asahi: Add AGX_PUSH_ARRAY_SIZE_MINUS_1 Required to clamp array indices against the array sizes per the GLSL spec. Metal also does this, implying it's required by the hardware for correct operation. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	7b4ea2fd38	asahi: Implement texturing with non-zero start level Unsure if this comes up anywhere. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	11072cfd21	asahi: Handle reloads of specific cube/mipfaces The texture descriptor we construct for reloading needs to respect the surface's texture/layer selection. Fix exactly the same bug as `b8c31ac06d` ("lima: fix glCopyTexSubImage2D"). Fixes: dEQP-GLES2.functional.texture.specification.basic_copytexsubimage2d.2d_rgb dEQP-GLES2.functional.texture.specification.basic_copytexsubimage2d.2d_rgba dEQP-GLES2.functional.texture.specification.basic_copytexsubimage2d.cube_rgb dEQP-GLES2.functional.texture.specification.basic_copytexsubimage2d.cube_rgba Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	062ca49ca7	asahi: Add agx_map_texture_{cpu,gpu} helpers Streamline access to particular layer/levels. These patterns show up across the driver and are easy to screw up, so add a helper. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	a8bf729f8a	asahi: Support 2D array and 3D textures As far as I can tell, these must be tiled. Other than that, the implementation is completely routine. Passes dEQP-GLES3.functional.texture.format.unsized.2d_array Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	204e2ffe1b	asahi: Track mipmap state explicitly Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	e714fae263	asahi: Pass correct tile shift to tiling routines Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	5f10ffd6e2	asahi: Handle page alignment of miptrees Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	2c490cd4e3	asahi: Align linear texture's strides to 64 bytes Required to pack the stride, and should improve cache performance. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	5d957011ff	asahi: Align allocations to effective tile size May be smaller than 64x64. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	25f48996a6	asahi: Rename bpp to blocksize Will matter for block compressed formats. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	856f64de24	asahi: Allow tiling of all bpps Use the usual macro trick via Panfrost. Fixes textures with formats with non-32-bit bpp, including: dEQP-GLES2.functional.texture.specification.basic_teximage2d.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	2028873ef6	asahi: Dynamically configure tile size We need to shrink the tile size when using small images (including due to mipmapping) or when using large block sizes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	d103d64df6	asahi: Add some notes to XML about mipmapping Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	aea6d7f17f	asahi: Handle tiling of 2D arrays and 3D Nothing special required, just need to respect the Z coordinate. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	06b2d97666	asahi: Add 2D Array and 3D texture dimensions Add to XML and translate in the driver. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	266382d252	asahi: Respect mip level when rendering Use hardware mip level field. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	1a3e21a4de	asahi: Identify Level field of render target descriptor Hardware support for rendering into nonzero mip levels. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	65368d2f9a	asahi: Don't redefine MIN2/MAX2 The tiling function was written before the Mesa driver... Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	570004175f	asahi: Streamline modifier selection We can only use linear for 2D images, not even 2D arrays. Even for 2D images, we only want to use linear if: * We are required to use linear due to window system requirements. * The texture is streaming. Otherwise, we want to use tiled textures. (Or better, compressed, but we don't support that yet.) Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14903>	2022-02-18 23:48:32 +00:00
Alyssa Rosenzweig	2d6233d04f	nir: Check all sizes in nir_alu_instr_is_comparison nir_alu_instr_is_comparison needs to consider all comparison opcodes regardless of size. Otherwise, they will be missed by nir_opt_move/sink. Without this change, lowering booleans to integers regresses register pressure (and spills/fills) significantly in certain shaders on Panfrost, like android/com.miHoYo.GenshinImpact/1420.shader_test. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15073>	2022-02-18 19:22:01 +00:00
Alyssa Rosenzweig	8f4b3c749e	pan/bi: Test avoiding FADD.v2f16 hazards in scheduler There are many of them, and integration testing of the scheduler won't hit every case. Add targeted unit tests for the various scheduling hazards of this funny instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	9d95561c93	pan/bi: Test avoiding *FADD.v2f16 hazard in optimizer This hazard exists but is obscure enough to be missed on our existing test coverage (e.g the conformance tests). Add piles of unit tests for it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	24d2bdb1e0	pan/bi: Avoid *FADD.v2f16 hazard in scheduler Obscure encoding restriction. Fixes crash (assertion fail when instruction packing) in asphalt9/2659.shader_test on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	8e0eb592d5	pan/bi: Avoid *FADD.v2f16 hazard in optimizer This is a very obscure encoding restriction in the Bifrost ISA. Unknown if any real apps or tests hit this, but we still need to get it right sadly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15072>	2022-02-18 16:15:04 +00:00
Alyssa Rosenzweig	c2178d09d0	pan/va: Identify LEA_TEX_IMM table Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	839f15259a	pan/va: Fix conservative branch handling Mixed up lanes and conservative branch combine. Fix that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	81a9c857c8	pan/va: Make subgroup 4-bits Future proofing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	9e851e75de	pan/va: Fix some units Remove the todos. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	47733ad1e1	pan/va: Parse units from the XML We need this information for cycle counting in Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15069>	2022-02-18 15:51:46 +00:00
Alyssa Rosenzweig	239d59ecdd	panvk: Don't use UBOs for meta_clear It must always be pushed, so constructing a uniform remap table is useless. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14913>	2022-02-18 15:29:48 +00:00
Alyssa Rosenzweig	030dadb5f4	pan/mdg: Remove todo we'll probably never get to Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:48 +00:00
Alyssa Rosenzweig	0e726d918f	pan/mdg: Assert that we don't see unknown jumps I still don't understand why we don't see continues. But in case we do, scream loudly so it can't be fixed. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:48 +00:00
Alyssa Rosenzweig	8b70e7491a	pan/mdg: Delete dedicated fdot2 lowering It's just lower_alu_to_scalar total instructions in shared programs: 72542 -> 72528 (-0.02%) instructions in affected programs: 673 -> 659 (-2.08%) helped: 4 HURT: 1 helped stats (abs) min: 1.0 max: 11.0 x̄: 3.75 x̃: 1 helped stats (rel) min: 0.28% max: 6.79% x̄: 3.07% x̃: 2.60% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 3.03% max: 3.03% x̄: 3.03% x̃: 3.03% 95% mean confidence interval for instructions value: -8.65 3.05 95% mean confidence interval for instructions %-change: -6.32% 2.62% Inconclusive result (value mean confidence interval includes 0). total bundles in shared programs: 32051 -> 32036 (-0.05%) bundles in affected programs: 207 -> 192 (-7.25%) helped: 3 HURT: 0 helped stats (abs) min: 1.0 max: 10.0 x̄: 5.00 x̃: 4 helped stats (rel) min: 3.28% max: 13.89% x̄: 8.29% x̃: 7.69% total quadwords in shared programs: 56496 -> 56487 (-0.02%) quadwords in affected programs: 422 -> 413 (-2.13%) helped: 2 HURT: 0 total registers in shared programs: 5106 -> 5104 (-0.04%) registers in affected programs: 8 -> 6 (-25.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	28cd2c9cca	pan/mdg: Delete stray comment Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	eb0ef85cb6	pan/mdg: Clarify some ISA unknowns Nothing usefully new here, just trying to improve signal:noise ratio on the disassembly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	3a53e46fcd	pan/mdg: Handle 8/16-bit UBO loads These will be seen by the compiler when we enable fp16 constant buffers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	8d949ecd3a	pan/mdg: Model zero/sign extension for 8/16-bit loads The destinations are packed as if 32-bit even for 8/16-bit loads, so the mask needs to be constructed accordingly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	ff970767a3	pan/mdg: Print optimized and scheduled shader To help identify problems across the compiler, print more forms of the shader with MIDGARD_MESA_DEBUG=shaders. Roughly matches the Bifrost compiler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Alyssa Rosenzweig	b707dabbac	pan/mdg: Pull out skip_internal boolean Aligns with Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14888>	2022-02-18 15:04:47 +00:00
Jose Maria Casanova Crespo	90f966e05f	v3dv/v3d: Fix copyright holder to Raspberry Pi Ltd Acked-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15057>	2022-02-18 11:50:07 +01:00
Kenneth Graunke	4ed9fd62c9	anv: Lower bufferImageGranularity to 1 from 64 The Vulkan 1.3 spec says: "The implementation-dependent limit bufferImageGranularity specifies a page-like granularity at which linear and non-linear resources must be placed in adjacent memory locations to avoid aliasing. Two resources which do not satisfy this granularity requirement are said to alias. bufferImageGranularity is specified in bytes, and must be a power of two. Implementations which do not impose a granularity restriction may report a bufferImageGranularity value of one. Note: Despite its name, bufferImageGranularity is really a granularity between "linear" and "non-linear" resources." We set this limit to 64 bytes (a cacheline) at the dawn of time, without any real rationale attached. There shouldn't be any restrictions here. Our tile sizes are typically 4K, and tiled resource addresses are aligned to the tile size, and the extent is also a multiple of the tile sized. So if a linear resource occurs before a tiled one, there will naturally be some space due to the alignment of the tiled resource's starting address. If a linear resource occurs after a tiled one, the tiled resource's ending address is already 4K aligned, which is already guaranteeing that they won't share a cacheline. So I think it should be fine to reduce this to 1. The other Vulkan driver for our hardware seems to advertise 1 here as well. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15066>	2022-02-18 09:52:00 +00:00
Juan A. Suarez Romero	bfdb1064c5	vc4/ci: make piglit test mandatory Make piglit test jobs to run always, as piglit testsuite offers more coverage for the VC4 driver. On the other hand, make the EGL testing manually, as we don't have enough devices to execute all the tests fast enough. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Acked-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15045>	2022-02-18 09:02:55 +00:00
Iago Toral Quiroga	750eeecf4e	broadcom/compiler: document that spill_base is used for spills and scratch Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	8883975209	broadcom/compiler: drop spill_count and add spilling boolean We added spill_count to handle uniform batch spills, which we no longer do. What we want now is a way to know if we are spilling registers. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	f3c3228522	broadcom/compiler: do not rebuild the interference graph after each spill Instead, we only recompute liveness and we add new nodes and interferences to the graph manually (we also need to patch register classes in some cases). To assist in this process, we also add an ip counter to our instructions that we also recompute after each spill, which we use to identify registers that cross thrsw boundries introduced with TMU spills and fills and adjust their register classes accordingly (removing their capacity to use accumulators). This significantly reduces the CPU cost of spills. Using shaders/closed/gputest/piano/7.shader_test as reference: Compile time up to the first successful compile strategy in main is ~24s and with this change it is ~11s. With this speed up, we can now try all 2-thread compile strategies (including the fallback scheduler) in only ~15s. A full shader-db run results in: Total CPU time (seconds): 9904.67 -> 9087.98 (-8.25%) Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	59caaa7fb3	broadcom/compiler: reset spill/fill counts after lowering thread count. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	92d819aaa0	broadcom/compiler: fix end of TMU sequence check We may be pipelining TMU writes and reads, in which case we can see both TMUWT and LDTMU at the end of a TMU sequence, so we should not assume that a TMUWT always terminates a sequence. Also, we had a bug where we were using inst instead of scan_inst to check if we find another TMUWT after the curent instruction. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	40e091267d	broadcom/compiler: define max number of tmu spills for compile strategies Instead of whether they are allowed to spill or not. This is more flexible. Also, while we are not currently enabling spilling on any 4-thread strategies, should we do that in the future, always prefer a 4-thread compile. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Iago Toral Quiroga	919aedbfec	broadcom/compiler: choose compile strategy with lowest spilling Until now we would only allow spilling as a last resort in the last 2 strategies, however, it is possible that in some cases earlier strategies may produce less spills if we allowed spilling on them. Likewise, the fallback scheduler can sometimes produce less spills than 2 threads with optimizations disabled. With this change, we start allowing all our 2-thread strategies to spill, and instead of choosing the first strategy that is successful, we choose the one that doesn't spill or the one with the least amount of spilling. It should be noted that this may incur in a significant increase of compile times. We will address this in a follow-up patch. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15041>	2022-02-18 08:38:19 +00:00
Alyssa Rosenzweig	294a357b33	panfrost,asahi,radv: Don't set internal=true manually nir_builder_init_simple_shader does this automatically now. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14936>	2022-02-17 23:30:46 +00:00
Alyssa Rosenzweig	7ec1d96e5e	nir: Set internal=true in nir_builder_init_simple_shader Matches the expected use by callers. We do need to fix up a few callers which use this call for external shaders. v2: Fix up a radv call site (Rhys). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [v1] Acked-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14936>	2022-02-17 23:30:46 +00:00
Ian Romanick	a01b262990	nir: Add missing dependency on nir_opcodes.py Commit `38800b38` changed nir_opcodes.py, but that doesn't seem to have triggered nir_opt_algebraic.py. The change in `75ef5991` depends on opt_algebraic lowering 16-bit versions of slt, but if opt_algebraic is not rebuilt, this may not happen. This resulted in some people seeing assertion failures in, for example, dEQP-VK.spirv_assembly.instruction.compute.float16.arithmetic_3.step, due to the backend seeing nir_op_slt that it didn't know how to handle. v2: Add nir_opcodes.py to nir_algebraic_py so that all the per-driver algebraic passes pick up the dependency too. Rename it to nir_algebraic_depends. Suggested by Emma. Closes: #6047 Fixes: `d1992255bb` ("meson: Add build Intel "anv" vulkan driver") Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15050>	2022-02-17 22:57:33 +00:00
Lionel Landwerlin	7a52286215	anv: add a custom AcquireNextImage2KHR func So that we can plug our intel_measure framework. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14998>	2022-02-17 22:15:23 +00:00
Felix DeGrood	6e939ca865	anv/measure: Fix INTEL_MEASURE for ANV INTEL_MEASURE broke while implementing the common sync and submit framework. Re-adding missing INTEL_MEASURE entry point for command buffer submit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14998>	2022-02-17 22:15:23 +00:00
Igor Torrente	aa2652958a	venus: add VK_EXT_custom_border_color extension Implements all the necessary code in the device initialization and feature/property query functions. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15026>	2022-02-17 21:02:37 +00:00
Igor Torrente	5252c6c009	venus: venus-protocol groundwork to VK_EXT_custom_border_color These are the changes automatically generated from the venus-protocol repository. Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15026>	2022-02-17 21:02:37 +00:00
Lionel Landwerlin	768930a73a	nir: fix lower_memcpy memcpy is divided into chunks that are vec4 sized max. The problem here happens with a structure of 24 bytes : struct { float3 a; float3 b; } If you memcpy that struct, the lowering will emit 2 load/store, one of sized 8, next one sized 16. But both end up located at offset 0, so we effectively drop 2 floats. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `a3177cca99` ("nir: Add a lowering pass to lower memcpy") Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15049>	2022-02-17 15:12:45 +00:00
Mike Blumenkrantz	bc63802596	zink: radv ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15060>	2022-02-17 14:56:51 +00:00
Lionel Landwerlin	d4b1d8bfc4	intel/dev: provide some default values for no_hw v2: Move into return (Tapani) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15027>	2022-02-17 07:49:08 +00:00
Cristian Ciocaltea	9ef8af357d	virgl/ci: Setup virtio-vsock based IPC The mechanism currently used to pass data from the dEQP child process executed in a crosvm guest environment towards the deqp-runner wrapper script that starts the crosvm instance is based on creating, writing and reading regular files. In addition to the main drawback of using the storage, this approach is potentially unreliable because the data cannot be transferred in real-time and there is no control on ending the transmission. It also requires a forced sleep for syncing the content, while the minimum amount of time necessary to wait cannot be easily and safely determined. Replace this with an IPC based on the virtio transport for virtual sockets (virtio-vsock). Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14995>	2022-02-17 06:32:30 +00:00
Cristian Ciocaltea	fcce90a095	ci: Enable kernel virtio transport for Virtual Sockets Enable support for Virtual Sockets over virtio in kernel configuration to optimize the data transfer between crosvm and host system. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14995>	2022-02-17 06:32:30 +00:00
Cristian Ciocaltea	5b9788c3c9	ci: Add socat utility Provide the 'socat' utility in 'debian/x86_test-gl' container to be used later for improving the inter-process communication with crosvm guest tasks based on the virtio transport for Virtual Sockets (virtio-vsock). Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14995>	2022-02-17 06:32:30 +00:00
Cristian Ciocaltea	4745638f18	ci: Ensure Mesa Shader Cache resides on tmpfs Having the Mesa Shader Cache stored on a tmpfs mount point reduces the tests execution duration by 2-3 %, while preventing several hundreds of megabytes to be written on the storage media. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14995>	2022-02-17 06:32:30 +00:00
Yiwei Zhang	8e138b8bd1	venus: add necessary format list for ahb image creation Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15017>	2022-02-17 01:45:45 +00:00
Yiwei Zhang	7c9f6c9964	venus: pass necessary format list at ahb image format query Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15017>	2022-02-17 01:45:45 +00:00
Yiwei Zhang	c144df0fa8	venus: clean up android wsi and ahb image builder Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15017>	2022-02-17 01:45:45 +00:00
Yiwei Zhang	31904d082d	venus: deep copy format list info for deferred image creation The img->deferred_info will out-live vn_CreateImage, so we need a deep copy of the VkImageFormatListCreateInfo struct. This change also avoids tracking VkImageFormatListCreateInfo struct with a zero viewFormatCount. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15017>	2022-02-17 01:45:44 +00:00
Dave Airlie	b805d3e6ab	lavapipe: reference gallium fences correctly. Make sure to take references in all the correct places to get right lifetimes for these objects and avoid leaks. Fixes: `94a4982805` ("lavapipe: implement timeline semaphores") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15046>	2022-02-17 00:01:39 +00:00
Guilherme Gallo	794009c9ee	ci: Add unit tests for lava_job_submitter These tests will explore some scenarios involving LAVA delays to submit the job to the device, some device delays outputting data to LAVA logs, and sensitive data protection. For example, the subtests from test_retriable_follow_job, "timed out more times than retry attempts" and "very long silence" caught a bug where a job retried until the limited attempts and the CI job still succeeded. https://gitlab.freedesktop.org/mesa/mesa/-/jobs/18325174 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14876>	2022-02-16 23:32:39 +00:00
Guilherme Gallo	694005343b	ci: Install pytest and freezegun plugin lava_job_submitter.py unit tests are written in pytest and uses freezegun in order to simulate timeouts in some tests scenarios. So, this commit adds the packages `python3-pytest` and `python3-freezegun` to fulfill this dependencies. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14876>	2022-02-16 23:32:39 +00:00
Guilherme Gallo	addac10443	ci: Make LAVA jobs fail CI job when retry is exhausted When the lava_job_submitter.py retry loop finishes normally (without falling through break-loop) it means that the submitter has exceeded the retry count limit. However, when it happens the script finishes normally. This patch adds a treatment to this case, warning the user what happened and forcing the job to fail. Moreover, this commit will make retry configurations configurable by CI job, as it can take the default value from the following variables: - LAVA_DEVICE_HANGING_TIMEOUT_SEC - LAVA_WAIT_FOR_DEVICE_POLLING_TIME_SEC - LAVA_LOG_POLLING_TIME_SEC - LAVA_NUMBER_OF_RETRIES_TIMEOUT_DETECTION Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14876>	2022-02-16 23:32:39 +00:00
Jason Ekstrand	df0e2a1565	anv: Don't assume depth/stencil attachments have depth If a secondary command buffer is used and the client provides a framebuffer and that framebuffer has a stencil-only attchment, we would try to get the aux usage for the depth component of that attachment and crash. Check the aspects of the image before looking at aux usage. This fixes at least the following SkQP tests on my Tigerlake: - vk_circular-clips - vk_filterfastbounds - vk_innershapes_bw - vk_lineclosepath - vk_multipicturedraw_rrectclip_simple - vk_pathinvfill - vk_quadclosepath - vk_rrect_clip_bw - vk_windowrectangles Fixes: `0d8b9c529c` ("anv: Allow PMA optimization to be enabled in secondary command buffers") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15048>	2022-02-16 23:02:09 +00:00
Alyssa Rosenzweig	3697907231	panfrost: Fix Malloc Vertex definition A few missing things and a few wrong things, nothing major. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	1ca2358d6b	panfrost: Flesh out compute jobs Valhall has a new twist on Mali's task splitting voodoo, plus compute offset support. On Bifrost + Vulkan, compute offsets needed lowering on Bifrost (gl_GlobalID). Valhall saves a few instructions here. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	6d5ddf69e2	panfrost: Update Shader Environment descriptor Disambiguate the name, add a missing field, shorten a field, remove a dated comment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	cf95a1c308	panfrost: Add Valhall fields to tiler descriptor Mostly to support layered rendering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	c011ea6c26	panfrost: Shuffle render target AFBC for Valhall I'm not sure why this is different, although it adds support for new AFBC modifiers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Alyssa Rosenzweig	1ee09eaca8	panfrost: Add Valhall additions to the framebuffer There are a few minor changes. Nothing fundamanetal. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15047>	2022-02-16 22:05:55 +00:00
Iván Briano	81f97905c3	intel/compiler: make CLUSTER_BROADCAST always deal with integers This way we don't run afoul of regioning restrictions around floating point types. Cc: 22.0 <mesa-stable> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15039>	2022-02-16 21:36:42 +00:00
Iván Briano	11544435ad	anv: only advertise 64b atomic floats if 64b floats are supported Cc: 22.0 <mesa-stable> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15039>	2022-02-16 21:36:42 +00:00
Samuel Pitoiset	377884c9c8	radv: do not enable per-vertex VRS if the FS uses gl_FragCoord It breaks postprocessing in some games like Ghostrunner, Deathloop, Street Fighter V. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15042>	2022-02-16 21:09:12 +00:00
Samuel Pitoiset	3cc53c047a	radv: allow to force per-vertex VRS in the tessellation stage It's more useful than I thought. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15042>	2022-02-16 21:09:12 +00:00
Dave Airlie	eebe298a87	llvmpipe: fix linear rast samples check. The checks didn't work for the samples == 0 case, just fix it to use the helper. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15040>	2022-02-16 20:48:52 +00:00
Emma Anholt	3f4bfecee6	nir: Add some notes about const/uniform array access rules in GL. I was doing some RE on freedreno and we had some questions about when the hardware might need non-uniform or non-constant array access for various descriptor types, so let's leave some notes for the next person. Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13621>	2022-02-16 20:06:21 +00:00
Emma Anholt	ca1ec72726	nv30/40: Switch to using NIR-to-TGSI by default. shader-db results (note that we expect many more loops unrolled, so instr count is probably understating the win): nv30: total instructions in shared programs: 16535069 -> 14299105 (-13.52%) instructions in affected programs: 16377286 -> 14141322 (-13.65%) total gpr in shared programs: 81255 -> 67268 (-17.21%) gpr in affected programs: 56714 -> 42727 (-24.66%) LOST: 0 GAINED: 824 nv40: total instructions in shared programs: 20907673 -> 18428749 (-11.86%) instructions in affected programs: 20755510 -> 18276586 (-11.94%) total gpr in shared programs: 104200 -> 82831 (-20.51%) gpr in affected programs: 80278 -> 58909 (-26.62%) 14130 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14130>	2022-02-16 17:22:40 +00:00
Samuel Pitoiset	80716b6f7e	radv: enable radv_disable_aniso_single_level for The Evil Within 1&2 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/6033 Fixes: `5ce4017a2b` ("radv,aco: do not disable anisotropy filtering for non-mipmap images") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15011>	2022-02-16 17:28:46 +01:00
Thierry Reding	108e6eaa83	tegra: Use private reference count for resources With the recent addition of the shortcuts aiming to avoid atomic operations, the reference count on resources can become unbalanced in the Tegra driver since they are wrapped and then proxied to the Nouveau driver. Fix this by keeping a private reference count. Fixes: `7688b8ae98` ("st/mesa: eliminate all atomic ops when setting vertex buffers") Reviewed-by: Karol Herbst <kherbst@redhat.com> Tested-by: Karol Herbst <kherbst@redhat.com>	2022-02-16 16:20:48 +01:00
Thierry Reding	e8ce0a3357	tegra: Use private reference count for sampler views With the recent addition of the shortcuts aiming to avoid atomic operations, the reference count on sampler views can become unbalanced in the Tegra driver since they are wrapped and then proxied to the Nouveau driver. Fix this by keeping a private reference count. Fixes: `ef5d427413` ("st/mesa: add a mechanism to bypass atomics when binding sampler views") Reviewed-by: Karol Herbst <kherbst@redhat.com> Tested-by: Karol Herbst <kherbst@redhat.com>	2022-02-16 16:20:40 +01:00
Matti Hamalainen	4e252cbc7d	aux/trace: fix dumping of pipe_texture_target I had missed a int -> enum conversion in one recently added function and it's probably nice to also dump the target value also in trace_dump_resource_template() so let's do just that. Signed-off-by: Matti Hamalainen <ccr@tnsp.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14980>	2022-02-16 14:06:16 +00:00
Timur Kristóf	1912503224	radv: Don't disturb dynamic primitive topology with mesh shading. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14653>	2022-02-16 13:42:39 +00:00
Timur Kristóf	da719792ad	radv: Disable IB2 on compute queues. The "IB2" indirect buffer command is not supported on compute queues according to PAL, and it indeed causes GPU hangs when task shaders are used together with vkCmdExecuteCommands. Cc: mesa-stable Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15006>	2022-02-16 13:09:20 +00:00
Pierre-Eric Pelloux-Prayer	3fad6325cd	radeonsi: use SI_PROFILE_CLAMP_DIV_BY_ZERO for viewperf Only one shader from the Creo subtests needs this. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14931>	2022-02-16 11:13:25 +00:00
Pierre-Eric Pelloux-Prayer	9685b5785b	radeonsi: add SI_PROFILE_CLAMP_DIV_BY_ZERO To enable divide by zero clamping per shader, instead of per app. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14931>	2022-02-16 11:13:25 +00:00
Corentin Noël	1367424501	ci: Uprev virglrenderer and crosvm Ensure that we are using a recent virglrenderer to catch potential regressions early. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15023>	2022-02-16 10:21:26 +00:00
Connor Abbott	3ef858a6f6	ir3/spill: Fix simplify_phi_nodes with multiple loop nesting Once we simplified a phi node, we never updated the definition it points to, which meant that it could become out of date if that definition were also simplified, and we didn't check that when rewriting sources. That could happen when there are multiple nested loops with phi nodes at the header. Fix it by updating the phi's pointer. Since we always update sources after visiting the definition it points to, when we go to rewrite a source, if that source points to a simplified phi, the phi's pointer can't be pointing to a simplified phi because we already visited the phi earlier in the pass and updated it, or else it's been simplified in the meantime and this isn't the last pass. This way we don't need to keep recursing when rewriting sources. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15035>	2022-02-16 09:55:39 +00:00
Tapani Pälli	d3b4202b63	mesa/st: always use DXT5 when transcoding ASTC format This fixes artifacts seen in games when using ASTC transcoding, we need to use DXT5 for proper alpha channel support. Number of components is a block specific property, there is no easy way to see if we will require >1bit alpha support or not, so simply use DXT5 to have support in place. Fixes: `91cbe8d855` ("gallium: Add a transcode_astc driconf option") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15029>	2022-02-16 08:50:28 +00:00
Samuel Pitoiset	103699677b	radv: allow to force per-vertex VRS if the config file is present This is needed to add the primitive shading rate output to the vertex or geometry shaders, even if the default value is 1x1. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:29 +01:00
Samuel Pitoiset	c50557d961	radv: allow applications to dynamically change RADV_FORCE_VRS This introduces inotify support in RADV to handle changes from the RADV_FORCE_VRS_CONFIG_FILE. This is similar to MangoHUD. I'm personally not sure it's the best solution but let's try this way and change it later if we have issues (or if we have a lightweight solution). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:29 +01:00
Samuel Pitoiset	5e2d9202e2	radv: add RADV_FORCE_VRS_CONFIG_FILE to configure per-vertex VRS Similar to RADV_FORCE_VRS but from a file. If present, this is used instead of RADV_FORCE_VRS. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:29 +01:00
Samuel Pitoiset	a862e15ee7	radv: rename RADV_FORCE_VRS_NONE to RADV_FORCE_VRS_1x1 and accept 1x1 It's the default value. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:27 +01:00
Samuel Pitoiset	b4f526cc19	radv: only re-emit the per-vertex VRS rates if necessary This reduces the overhead slightly when the VRS rates don't change. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:23 +01:00
Samuel Pitoiset	99d7c13051	radv: rework RADV_FORCE_VRS to make it more dynamic The VRS rates are now emitted from the command buffer via an user SGPR which will allow to change the rates dynamically in later changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:21 +01:00
Samuel Pitoiset	cbd5724a6d	aco: implement nir_intrinsic_load_vrs_rates_amd Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:19 +01:00
Samuel Pitoiset	772aeff2c4	ac/llvm: implement nir_intrinsic_load_vrs_rates_amd Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:17 +01:00
Samuel Pitoiset	85436896c4	radv: declare a new shader argument for loading the VRS rates Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:15 +01:00
Samuel Pitoiset	74b932f8d3	nir: add nir_intrinsic_load_vrs_rates_amd This intrinsic specific to RADV will be used to load VRS rates from an user SGPR when RADV_FORCE_VRS is enabled by the application. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14713>	2022-02-16 08:11:12 +01:00
Jason Ekstrand	9cbcdfb514	anv: use vk_image_view::format for creating dynamic renderpasses Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15007>	2022-02-16 00:14:50 +00:00
Jason Ekstrand	818c5dedf4	vulkan: Add back vk_image_view::format Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15007>	2022-02-16 00:14:50 +00:00
Jason Ekstrand	05e9e7767d	vulkan: Rename vk_image_view::format to view_format When I originally added vk_image_view, I was overly clever when it came to the format field. I decided to make it only contain the bits of the format contained in the selected aspects. However, this is confusing (not generally a good thing) and it's also not always what you want. The Vulkan 1.3.204 spec says: "When using an image view of a depth/stencil image to populate a descriptor set (e.g. for sampling in the shader, or for use as an input attachment), the aspectMask must only include one bit, which selects whether the image view is used for depth reads (i.e. using a floating-point sampler or input attachment in the shader) or stencil reads (i.e. using an unsigned integer sampler or input attachment in the shader). When an image view of a depth/stencil image is used as a depth/stencil framebuffer attachment, the aspectMask is ignored and both depth and stencil image subresources are used." So, while the restricted format makes sense for texturing, it doesn't for when the image is being used as an attachment. What we probably actually want is both versions of the format. We'll call the one given by the VkImageViewCreateInfo vk_image_view::format and the restricted one vk_image_view::view_format. This is just the first commit which switches format to view_format so the compiler will make sure we get them all. The next commit will re-add vk_image_view::format but this time unmodified. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15007>	2022-02-16 00:14:50 +00:00
Yiwei Zhang	9dd15295e3	venus: properly destroy deferred ahb image before real image creation Fixes: `19b7b09885` ("venus: prepare image creation helpers for AHB") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15037>	2022-02-15 21:32:02 +00:00
Emma Anholt	3652ff2fa1	draw: Don't look at .nir if !IR_NIR. I suspect this double-check and comment was due to originally using ir.nir as the condition, which might be uninitialized if !IR_NIR. You could only take the branch if IR_NIR was set, and you should always not take if it !IR_NIR, so it worked out in the end, but it would cause spurious valgrind warnings if you hadn't zeroed out your TGSI shader's struct. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14896>	2022-02-15 18:15:29 +00:00
Emma Anholt	780949c62b	i915g: Initialize the rest of the "from_nir" temporary VS struct. draw looked at the uninitialized XFB state, which should just be zeroed out since i915 doesn't have XFB. Fixes: `2b3fc26da8` ("i915g: Switch to using nir-to-tgsi.") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14896>	2022-02-15 18:15:29 +00:00
Emma Anholt	a6a651b96a	r300: Delete the loop unrolling. There were two paths in this code: One was transform_loops, which would try to detect loops with an iteration count it understood and unroll that many times. The other was emulate_loops, which would just figure out how many instructions the program could have and still compile (hopefully), and unroll this loop however many times would fit in that. The transform_loops had no analysis as good as GLSL or NIR loop unrolling have, so it shouldn't be missed -- any opportunity it found would only be due to bugs in the unrolling code. The emulate_loops path had an issue with computing the number of times it should try to unroll -- if you had more instrs than ALUs available already, you'd overflow and unroll approximately infinitely many times, OOMing the system. But, also, it's better to throw a compiler error about unsupported loops than to run the loop an incorrect number of times and call it a success. Fixes: #5883, #6018 Reviewed-by: Filip Gawin <filip.gawin@zoho.com> Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15004>	2022-02-15 17:27:17 +00:00
Mike Blumenkrantz	4f3f245b78	zink: radv ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15033>	2022-02-15 11:22:44 -05:00
Simon Ser	ffdac8bfa7	vulkan/wsi/wayland: ensure added formats have flags A format needs to be either alpha or opaque, but can't be neither. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14874>	2022-02-15 09:13:21 +00:00
Simon Ser	5617d5c885	vulkan/wsi/wayland: de-duplicate wsi_wl_display_add_wl_shm_format Re-use wsi_wl_display_add_drm_format_modifier from wsi_wl_display_add_wl_shm_format instead of maintaining two separate switches for DRM and shm formats. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14874>	2022-02-15 09:13:21 +00:00
Simon Ser	1bdf350197	vulkan/wsi/wayland: introduce wsi_wl_display_add_vk_format_modifier This is a helper to avoid repetitive code in wsi_wl_display_add_drm_format_modifier. No functional changes, just refactoring. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14874>	2022-02-15 09:13:21 +00:00
Simon Ser	8ac42a3b1b	vulkan/wsi/wayland: switch from alpha/opaque bools to bitfield This makes the numerous wsi_wl_display_add_vk_format calls easier to follow: "ALPHA" is easier to decode than "true, false". No functional changes, just refactoring. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14874>	2022-02-15 09:13:21 +00:00
Juan A. Suarez Romero	801d33b2d9	vc4/ci: update failing piglit tests See https://gitlab.freedesktop.org/mesa/mesa/-/issues/6038. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15012>	2022-02-15 09:21:23 +01:00
Tapani Pälli	ecc0041030	iris: fix a leak on surface states Cc: mesa-stable Closes:https://gitlab.freedesktop.org/mesa/mesa/-/issues/6013 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15010>	2022-02-15 07:06:50 +02:00
Dave Airlie	cb296047f2	gallivm: fix missing cast in 4-bit blending paths. This got noticed on an llvm debug build. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14987>	2022-02-15 04:09:33 +00:00
Mike Blumenkrantz	b7f1f5c77d	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15020>	2022-02-15 03:33:31 +00:00
Mike Blumenkrantz	e8ba9cee27	zink: always invalidate streamout counter buffer if not resuming this otherwise treates begin/end/begin the same as begin/pause/resume cc: mesa-stable fixes: KHR-GL46.texture_view.view_classes KHR-GL46.transform_feedback.capture_geometry_separate_test KHR-GL46.transform_feedback.capture_vertex_separate_test KHR-GL46.transform_feedback.query_geometry_separate_test KHR-GL46.transform_feedback.query_vertex_separate_test Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15020>	2022-02-15 03:33:31 +00:00
Mike Blumenkrantz	5e7f4e66cc	zink: export PIPE_SHADER_CAP_INDIRECT_TEMP_ADDR Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15020>	2022-02-15 03:33:31 +00:00
Mike Blumenkrantz	c99bd08921	zink: map R8G8B8X8_SRGB -> R8G8B8A8_SRGB this fixes a weird texstore bug that seems specific to this format Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15020>	2022-02-15 03:33:31 +00:00
Mike Blumenkrantz	89477baf95	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15018>	2022-02-14 20:57:51 -05:00
Mike Blumenkrantz	b054a233df	zink: activate conditional render for compute dispatch when necessary fixes: KHR-GL46.compute_shader.conditional-dispatching Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15018>	2022-02-14 20:56:55 -05:00
Mike Blumenkrantz	ef4ab90d04	zink: restart conditional render when crossing batch boundary this will only happen when conditional render was being used outside of a renderpass Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15018>	2022-02-14 20:56:55 -05:00
Mike Blumenkrantz	2cc309dd39	zink: always terminate conditional render when flushing a batch we might not know whether conditional render is active, so forcibly disable when necessary Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15018>	2022-02-14 20:56:55 -05:00
Mike Blumenkrantz	082b42fbda	zink: track internal conditional render state this allows no-oping redundant calls Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15018>	2022-02-14 20:56:55 -05:00
Iván Briano	db48dcb4f3	intel/compiler: remove what looks like a bad rebase This bit in the compiler looks like it was added by accident on one of the latest versions of the original commit, but it clearly doesn't belong there. Fixes: `03e1e19246` ("anv: Refactor descriptor copy") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15016>	2022-02-15 01:04:47 +00:00
Dave Airlie	127bcbed18	gallivm/st/lvp: add flags arg to get_query_result_resource api. Currently this just has wait, but in order to get the right answer for vulkan partial, lavapipe/llvmpipe need to pass a partial flag through here in the future. This just changes the API so that's possible. v2: use an enum (zmike) Acked-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15009>	2022-02-15 10:12:01 +10:00
Emma Anholt	b995a8eba4	nir_to_tgsi: Add support for FBFETCH. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	eaf6e3d3af	nir_to_tgsi: Don't vectorize 64-bit instructions, to keep virgl happy. virglrenderer makes invalid shaders when faced with vector 64-bit instructions, which GLSL-to-TGSI never produced. While this doesn't fix everything, it does get more tests running, and virgl probably the primary consumer of 64-bit TGSI. virgl may be deprecating its host 64-bit support, at which point we can drop this workaround. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	036d7172c2	virgl: Move double operands to a temp to avoid double-swizzling bugs. virglrenderer applies the swizzle when packing doubles. Since we use scan now, we can just use the file count for texture detection. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	e643cb40a3	virgl: Move tex immediate operands to a temp to avoid virglrenderer bug. Prior to the noted MR, virglrenderer encoded the tex operands in a limited-size temp buffer which we could easily overflow with TGSI immediates. GLSL-to-TGSI always emitted an extra MOV, so keep that behavior when nir-to-tgsi feeds us more optimized TGSI. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	bc912bace1	virgl: Add workarounds for virglrenderer input/sv signedness bugs. GLSL-to-TGSI would emit some MOVs that made things OK, but NTT successfully copy-propagates the inputs and sysvals more, triggering bugs in virglrenderer when the signedness of the input is different from that of the ALU op. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	f25ccac374	virgl: Apply TGSI transforms to compute shaders, too. We need to do this for the upcoming virglrenderer workarounds. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	c491afaeab	virgl: Add a workaround for virglrenderer output writemask bugs. Various workaround paths in virglrenderer assume that outputs are written with a full writemask, which is not required by TGSI. To work around that, store affected outputs in a temp and do a full write after each writemasked write. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	2fd9e072ce	virgl: Work around old virglrenderer's BARRIER counting bug. One less regression from doing nir-to-tgsi on CI's virglrenderer. Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Emma Anholt	af19774dd4	tgsi_translate: Make the procType public when translating. This means that tgsi_translate users can check the PIPE_SHADER stage without having to separately tgsi_scan(). Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15014>	2022-02-14 22:06:37 +00:00
Igor Torrente	e5405e6400	venus: Exposes VK_EXT_4444_formats extension Allows venus to passthrough the VK_EXT_4444_formats extension to the vulkan client. And add code to the device initialization and feature query functions. Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Signed-off-by: Igor Torrente <igor.torrente@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14954>	2022-02-14 21:56:22 +00:00
Yiwei Zhang	2a87a741ae	turnip: advertise VK_EXT_queue_family_foreign Both Venus and Android AHB requires this extension. Turnip ignores VK_SHARING_MODE_EXCLUSIVE so this is a no-op. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Acked-by: Rob Clark <robdclark@chromium.org> Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14836>	2022-02-14 21:27:35 +00:00
Danylo Piliaiev	0b2da9d795	ir3: Limit the maximum imm offset in nir_opt_offset for shared vars STL/LDL have 13 bits to store imm offset. Fixes crash in CS compilation in Monster Hunter World. Fixes: `b024102d7c` ("freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars.") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14968>	2022-02-14 20:56:46 +00:00
Marcin Ślusarz	b6557b80a5	intel/compiler: fix array & struct IO lowering in mesh shaders We really need offsets to be in dwords, not in vec4s. The bug manifests as random failure of func.mesh.clipdistance.5 crucible test, where stores to gl_MeshVerticesNV[x].gl_ClipDistance[4+n] actually write to gl_MeshVerticesNV[x].gl_ClipDistance[1+n]. Fixes: `1f438eb033` ("intel/compiler: Implement Mesh Output") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14997>	2022-02-14 19:18:23 +00:00
Timur Kristóf	e6cfd1ed64	spirv: Create PRIMITIVE_INDICES for NV_mesh_shader on-demand. The shader can have SpvOpWritePackedPrimitiveIndices4x8NV while the output variable may not exist. This seems to be a defect in the NV_mesh_shader SPIR-V spec, let's work around it by creating the variable on-demand. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:45 +01:00
Timur Kristóf	0445802ab2	compiler: Extract num_mesh_vertices_per_primitive function. Prevent code duplication. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15005>	2022-02-14 11:13:42 +01:00
Samuel Pitoiset	32155851f1	radv: remove set but unused radv_buffer::shareable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14937>	2022-02-14 09:28:17 +00:00
Samuel Pitoiset	af420833f0	radv: remove useless NULL checks in vkBind{Buffer,Image}Memory2() The memory object must be a valid vkDeviceMemory handle. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14937>	2022-02-14 09:28:17 +00:00
Samuel Pitoiset	2dcd12f38b	radv: fix finding shaders by PC Shaders are allocated contiguously in memory for a pipeline and the freelist.next pointer is a pointer to the pipeline now. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14950>	2022-02-14 08:31:14 +01:00
Samuel Pitoiset	306e153c18	radv: make the trap handler shader BO resident Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14950>	2022-02-14 08:31:12 +01:00
Samuel Pitoiset	a224b7a057	radv: fix allocating/uploading the trap handler shader Since shaders are allocated per pipeline, the trap handler shader was not uploaded at all. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14950>	2022-02-14 08:31:10 +01:00
Shmerl	da8c2f5ed3	docs/features: Mark VK_KHR_ray_query in progress Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15000>	2022-02-12 19:18:13 -05:00
Ilia Mirkin	3d41414d26	freedreno/ir3: split up load/store/atomic by generation Some bits are slightly different on a4xx. Use the encodings that work. Perhaps these can be combined at some point if we get a proper understanding of what they mean. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:11 -05:00
Ilia Mirkin	b91b036322	isaspec: add gen-based leaf bitset separation This is necessary for some ops which have slightly different encoding on a4xx/a5xx, but are otherwise identical. This helps keeping the compiler from having to worry about these details and creating separate ops. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:46:07 -05:00
Ilia Mirkin	40468430a4	isaspec: fix gen_max to be 2^32-1 The minus sign has higher preference than shift: >>> 1 << 32 - 1 2147483648 >>> (1 << 32) - 1 4294967295 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14789>	2022-02-12 13:45:57 -05:00
Alyssa Rosenzweig	9dc30f99ae	panfrost: Flesh out the Shader Program Descriptor Only breaking change since Bifrost is that the shader contains barrier? flag is now fragment-only, meaning it is just a spawn helper threads flag. This affects compute shaders slightly. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:55 -05:00
Alyssa Rosenzweig	60b37424d9	panfrost: Simplify Valhall preload descriptor Honestly, we could stand to do the same to Bifrost... Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:55 -05:00
Alyssa Rosenzweig	1e9a35648a	panfrost: Clarify unknowns in z/stencil descriptor Depth culling and clamping. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:55 -05:00
Alyssa Rosenzweig	733d5f061d	panfrost: Add more fields to Attribute Descriptor More XML Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:35 -05:00
Alyssa Rosenzweig	b31f6a821d	panfrost: Update primitive descriptor for Valhall Contains stuff needed for layered rendering. Unfortunately, there's no more provoking vertex per draw -- ugh! That's fine for Vulkan (just don't set provokingVertexModePerPipeline), but requires inserting extra flushes on desktop OpenGL. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15003>	2022-02-12 09:32:18 -05:00
Bas Nieuwenhuizen	d8d32cf773	radv: Only wait on CS/PS to finish if we wait on a semaphore. I think plain submission doesn't need it. Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14574>	2022-02-11 23:45:24 +00:00
Bas Nieuwenhuizen	79131b6ee6	radv: Fix preamble argument order. Used the wrong cmdbuffer in the wrong situation. Oops. Fixes: `915e9178fa` ("radv: Split out commandbuffer submission.") Reviewed-By: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14574>	2022-02-11 23:45:24 +00:00
Bas Nieuwenhuizen	7adb3c0f7f	radv: Use larger arena sizes. For some games that take like 400 MiB of shader binaries, the number of shader arenas ends up going >1500. Cut that down a bit by using larger arenas. 8 MiB should still be decent with small BAR and should still cut things down from ~1500 to ~50 buffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14591>	2022-02-11 23:20:21 +00:00
Erico Nunes	0f9756f480	lima/ppir: refactor bitcopy to use unsigned char This code does not work as expected when built with clang and -fstrict-aliasing. Redefine it in unsigned char operations so that it does not violate strict aliasing rules. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Cc: 22.0 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	7297f931f0	lima/ppir: initialize slots array for dummy/undef Some functions in ppir iterate the ppir_op_info slots arrays looking for the PPIR_INSTR_SLOT_END token. The dummy/undef internal ops may appear in the scheduling code and their slots arrays did not contain that token, which could result in invalid array reads. Reported by gcc -fsanitize=address. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Cc: 22.0 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	5b15849366	lima/gpir: avoid invalid write in regalloc Reported by gcc -fsanitize=address, sometimes gpir regalloc attempts to handle an uninitialized node->value_reg (containing the value -1), which results in an invalid array access. Avoid it for now to prevent crashes, but more investigation may be required later on. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Cc: 22.0 <mesa-stable> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	58c619c3d6	lima: remove an unneeded lima_job_get assignment Reported by scan-build. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	116f01c853	lima: add some checks for potential null pointer dereference scan-build complains about a potential null pointer dereference in some places around the lima code. None of those seem to be a real issue as of now, but let's add some asserts to cover for that and clean up the warning list. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	60888c95bd	lima: fix warning of garbage value access scan-build complains that an access of reg[j+1] in this code might return garbage. Let's take the chance to clean this open coded sorting code up and just use qsort. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	2d807979c9	lima/ppir: initialize spill_costs array in regalloc Static analysis complains that spill_costs might be accessed in non-initialized positions. It does not seem to be an issue with the current code which initializes it for every relevant register index, but we can also just initialize it to not have to worry about that. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	8a8f34c320	lima/ppir: avoid ppir_codegen_outmod implicit conversion Fix some clang -Wenum-conversion warnings like: warning: implicit conversion from enumeration type 'ppir_outmod' to different enumeration type 'ppir_codegen_outmod' [-Wenum-conversion] Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	3d77950b8b	lima/ppir: clean up override-init warnings Define ppir_op_unsupported as 0 so that we don't have to do the initialization to -1. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Erico Nunes	823be63216	lima/gpir: clean up override-init warnings Define gpir_op_unsupported as 0 so that we don't have to do the initialization to -1. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14894>	2022-02-11 21:55:10 +00:00
Chia-I Wu	8a30b1541c	venus: use 64KB alignment for suballocations TGL CCS surface addresses must be aligned to 64KB. Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/15001>	2022-02-11 21:26:45 +00:00
Yiwei Zhang	05a2cea14c	venus: no roundtrip needed for shmem backed by BLOB_MEM_HOST3D A successful DRM_IOCTL_VIRTGPU_MAP on BLOB_MEM_HOST3D implies a roundtrip. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14658>	2022-02-11 21:16:42 +00:00
Yiwei Zhang	a76d1e0e74	venus: init renderer_info at renderer creation (part 2) Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14658>	2022-02-11 21:16:42 +00:00
Yiwei Zhang	ddba7337c7	venus: init renderer_info at renderer creation (part 1) Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14658>	2022-02-11 21:16:42 +00:00
Daniel Schürmann	1bbbabedb7	aco/insert_exec_mask: refactor and remove some unnecessary WQM handling code Some cases cannot happen and don't need to be handled anymore. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	d7d7b9974a	aco/insert_exec_mask: refactor and simplify get_block_needs() Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	fcc5dec8d6	aco/insert_exec_mask: remove ever_again_needs and Exact_Branch This information is not required anymore. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	cbb1b095ca	aco/insert_exec_mask: remove some unnecessary WQM loop handling code These workarounds are were necessary to prevent infinite loops with helper lane registers containing wrong data. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	580a63b4ac	aco/insert_exec_mask: remove Preserve_WQM flag If WQM is needed anywhere after discard_if(), it will also be flagged as WQM. We can rely on that to preserve the WQM mask. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	a5a40beefa	aco: don't emit WQM for bool_to_scalar_condition This was only necessary to ensure that the source is computed in WQM if the current exec mask is in WQM. Totals from 23170 (17.17% of 134913) affected shaders: (GFX10.3) VGPRs: 1384464 -> 1383400 (-0.08%); split: -0.08%, +0.01% SpillSGPRs: 7575 -> 7574 (-0.01%) CodeSize: 146444752 -> 146317104 (-0.09%); split: -0.13%, +0.04% MaxWaves: 429870 -> 429868 (-0.00%) Instrs: 27202586 -> 27170316 (-0.12%); split: -0.17%, +0.05% Latency: 379488313 -> 379335412 (-0.04%); split: -0.07%, +0.03% InvThroughput: 69500561 -> 69487704 (-0.02%); split: -0.03%, +0.01% VClause: 473080 -> 473038 (-0.01%); split: -0.02%, +0.01% SClause: 1080576 -> 1080571 (-0.00%); split: -0.06%, +0.06% Copies: 1492865 -> 1504604 (+0.79%); split: -0.40%, +1.19% Branches: 711295 -> 716849 (+0.78%); split: -0.01%, +0.79% PreSGPRs: 1331362 -> 1328402 (-0.22%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	f816dd1be7	aco: don't propagate WQM for p_as_uniform This was needed, so that in case of active helper lanes, these contain the correct value. It is now handled implicitly. Totals from 1004 (0.74% of 134913) affected shaders: (GFX10.3) CodeSize: 7581020 -> 7580892 (-0.00%); split: -0.00%, +0.00% Instrs: 1454940 -> 1454908 (-0.00%); split: -0.00%, +0.00% Latency: 12984953 -> 12984894 (-0.00%); split: -0.00%, +0.00% InvThroughput: 3173037 -> 3173049 (+0.00%); split: -0.00%, +0.00% PreSGPRs: 47498 -> 47273 (-0.47%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Daniel Schürmann	825cd696dc	aco/insert_exec_mask: stay in WQM while helper lanes are still needed This patch flags all instructions WQM which don't require Exact mode, but depend on the exec mask as long as WQM is needed on any control flow path afterwards. This will mostly prevent accidental copies of WQM values within Exact mode, and also makes a lot of other workarounds unnecessary. Totals from 17374 (12.88% of 134913) affected shaders: (GFX10.3) VGPRs: 526952 -> 527384 (+0.08%); split: -0.01%, +0.09% CodeSize: 33740512 -> 33766636 (+0.08%); split: -0.06%, +0.14% MaxWaves: 488166 -> 488108 (-0.01%); split: +0.00%, -0.02% Instrs: 6254240 -> 6260557 (+0.10%); split: -0.08%, +0.18% Latency: 66497580 -> 66463472 (-0.05%); split: -0.15%, +0.10% InvThroughput: 13265741 -> 13264036 (-0.01%); split: -0.03%, +0.01% VClause: 122962 -> 122975 (+0.01%); split: -0.01%, +0.02% SClause: 334805 -> 334405 (-0.12%); split: -0.51%, +0.39% Copies: 275728 -> 282341 (+2.40%); split: -0.91%, +3.31% Branches: 92546 -> 90990 (-1.68%); split: -1.68%, +0.00% PreSGPRs: 504119 -> 504352 (+0.05%); split: -0.00%, +0.05% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14951>	2022-02-11 19:05:30 +00:00
Ian Romanick	59889eb3ae	Re-indentation after the previous commit Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:34 +00:00
Ian Romanick	912299cb39	glsl: Eliminate ir_assignment::condition Reformatting is left for the next commit. v2: Remove assignments from the contructors. :face_palm: Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	fb630cd783	glsl: Make ir_assignment::condition private And add get_condition(). This proof that nothing remains that could possibly set ::condition to anything other than NULL. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	5208c116f2	glsl: Don't visit rvalues in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	1c22f06970	glsl: Don't lower vector indexing in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	2652b9a83d	glsl: Don't split structures in the condition of an assignment At this point, this should always be NULL. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	d07551ad4e	glsl: Don't split arrays in the condition of an assignment At this point, this should always be NULL. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	41f6b42b08	glsl: Don't tree graft in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	97ffca80a8	glsl: Don't dead-built-in varying eliminate in the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	231459ad26	glsl: Remove unused condition parameter from ir_assignment constructor Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	ec5c9649ba	glsl: Don't constant-fold the condition of an assignment At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	697a460e49	glsl: Don't clone assignment conditions At this point, this should always be NULL. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	d27e3c6adc	glsl: Eliminate unused conditional assignment constructor Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	50a0d0d875	glsl: Remove the ability to read text IR with conditional assignments Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	710adf2e60	glsl: Add ir_assignment constructor that takes just a write mask The other constructor that takes a write mask and a condition will be removed shortly. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	afee5dc63f	glsl: Lower if to conditional select instead of conditional assignment Platforms that don't have flow control also don't have anything that could be written that has a side effect. It should be safe to implement these condition writes as foo = csel(condition, bar, foo); This should eliminate the last thing in the GLSL compiler that can create new conditions on assignments. Everything else that can store something in ir_assignment::condition derives it from a pre-existing condition. v2: Fix bad rebase. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	c3626e1580	glsl/ir_builder: Eliminate unused conditional assignment builders Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	46a9df6aac	glsl: Don't try to emit the "linear sequence" in lower_variable_index_to_cond_assign When there are four or fewer elements left in the array partition, the strategy changes from a binary search of nested flow control to sequence of conditional assignments like (assign, dest, src[constant_i+0], index == constant_i+0) (assign, dest, src[constant_i+1], index == constant_i+1) (assign, dest, src[constant_i+2], index == constant_i+2) (assign, dest, src[constant_i+3], index == constant_i+3) or (assign, dest[constant_i+0], src, index == constant_i+0) (assign, dest[constant_i+1], src, index == constant_i+1) (assign, dest[constant_i+2], src, index == constant_i+2) (assign, dest[constant_i+3], src, index == constant_i+3) Realistically, the first case should use ir_triop_csel instead. The second case will either get turned back into flow control like if (index == constant_i+0) (assign, dest[constant_i+0], src) if (index == constant_i+1) (assign, dest[constant_i+1], src) if (index == constant_i+2) (assign, dest[constant_i+2], src) if (index == constant_i+3) (assign, dest[constant_i+3], src) or a sequence of conditional selects like (assign, dest[constant_i+0], (csel, index == constant_i+0, src, dest[constant_i+0])) (assign, dest[constant_i+1], (csel, index == constant_i+1, src, dest[constant_i+1])) (assign, dest[constant_i+2], (csel, index == constant_i+2, src, dest[constant_i+2])) (assign, dest[constant_i+3], (csel, index == constant_i+3, src, dest[constant_i+3])) The former case should continue to use the binary search. The later case could be generated from the binary search by other lowering passes. At the end of the day, conditional assignments don't really help anything here, so stop using them. Radeon R430 total instructions in shared programs: 2398683 -> 2398419 (-0.01%) instructions in affected programs: 5143 -> 4879 (-5.13%) helped: 9 HURT: 8 total vinst in shared programs: 616292 -> 616010 (-0.05%) vinst in affected programs: 4467 -> 4185 (-6.31%) helped: 9 HURT: 8 total sinst in shared programs: 315417 -> 315667 (0.08%) sinst in affected programs: 2568 -> 2818 (9.74%) helped: 2 HURT: 15 total flowcontrol in shared programs: 1049 -> 1048 (-0.10%) flowcontrol in affected programs: 7 -> 6 (-14.29%) helped: 1 HURT: 0 total presub in shared programs: 47027 -> 47027 (0.00%) presub in affected programs: 127 -> 127 (0.00%) helped: 1 HURT: 1 total omod in shared programs: 3618 -> 3615 (-0.08%) omod in affected programs: 8 -> 5 (-37.50%) helped: 3 HURT: 0 total temps in shared programs: 450757 -> 451312 (0.12%) temps in affected programs: 837 -> 1392 (66.31%) helped: 8 HURT: 6 total consts in shared programs: 1031928 -> 1031920 (<.01%) consts in affected programs: 1211 -> 1203 (-0.66%) helped: 6 HURT: 7 The shaders that were hurt for temps... are all lies. None of those shaders should have compiled as all 6 had more than 32 temps to begin with. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	05703a49f9	glsl: Use csel in do_vec_index_to_cond_assign This matches what NIR does. See nir_vector_extract. This improves code generation for several reasons. First, it only requires 3 comparisons instead of 4 (vec3(i > 0, i > 1, i > 2) vs vec4(i == 0, i == 1, i == 2, i == 3)). Secondly, it shortens the liverange of some values, possibly quite dramatically. Consider a loop in the old version (after lowering if-statements to selects): loop { ... x = csel(i == 0, a[0], x); x = csel(i == 1, a[1], x); x = csel(i == 2, a[2], x); x = csel(i == 3, a[3], x); ... } x is live for the whole loop across iterations. In the new version, x is only live while its value is needed: loop { ... t0 = csel(i > 0 , a[1], a[0]); t1 = csel(i > 2 , a[3], a[2]); x = csel(i > 1, t1, t0); ... } Outside a loop, this also means more values of the array may have their liveness reduced sooner (by consuming two values at once). All Intel platforms had similar results. (Tigerlake shown) total instructions in shared programs: 21171336 -> 21163615 (-0.04%) instructions in affected programs: 89680 -> 81959 (-8.61%) helped: 40 HURT: 4 helped stats (abs) min: 1 max: 450 x̄: 193.68 x̃: 196 helped stats (rel) min: 0.41% max: 13.32% x̄: 6.01% x̃: 6.22% HURT stats (abs) min: 1 max: 12 x̄: 6.50 x̃: 6 HURT stats (rel) min: 0.50% max: 0.66% x̄: 0.58% x̃: 0.58% 95% mean confidence interval for instructions value: -229.68 -121.28 95% mean confidence interval for instructions %-change: -6.93% -3.89% Instructions are helped. total cycles in shared programs: 832879641 -> 829513122 (-0.40%) cycles in affected programs: 44738430 -> 41371911 (-7.52%) helped: 35 HURT: 2 helped stats (abs) min: 2 max: 189948 x̄: 96186.49 x̃: 116154 helped stats (rel) min: 0.37% max: 11.08% x̄: 5.88% x̃: 6.47% HURT stats (abs) min: 4 max: 4 x̄: 4.00 x̃: 4 HURT stats (rel) min: 0.69% max: 0.69% x̄: 0.69% x̃: 0.69% 95% mean confidence interval for cycles value: -112881.94 -69092.06 95% mean confidence interval for cycles %-change: -6.77% -4.27% Cycles are helped. total spills in shared programs: 8061 -> 7338 (-8.97%) spills in affected programs: 873 -> 150 (-82.82%) helped: 24 HURT: 0 total fills in shared programs: 7501 -> 6388 (-14.84%) fills in affected programs: 1389 -> 276 (-80.13%) helped: 24 HURT: 0 Radeon R430 total instructions in shared programs: 2449852 -> 2449136 (-0.03%) instructions in affected programs: 6285 -> 5569 (-11.39%) helped: 64 HURT: 0 helped stats (abs) min: 4 max: 12 x̄: 11.19 x̃: 12 helped stats (rel) min: 8.16% max: 21.62% x̄: 12.09% x̃: 10.91% total consts in shared programs: 1032517 -> 1032482 (<.01%) consts in affected programs: 966 -> 931 (-3.62%) helped: 35 HURT: 0 helped stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 helped stats (rel) min: 2.94% max: 10.00% x̄: 4.26% x̃: 3.57% Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Ian Romanick	ed66d7c385	glsl/lower_vector_derefs: Don't emit conditional assignments Use if-statements instead. Any hardware that supports this sort of tessellation has flow control, so it will probably emit the conditional assignment using an if-statement anyway. This is definitely what st_glsl_to_nir does. v2: Fix copy-and-paste bug in the ir_type_swizzle handling. This bug caused segfaults in tests/spec/arb_tessellation_shader/execution/variable-indexing/tcs-patch-vec4-swiz-index-wr.shader_test. Reviewed-by: Matt Turner <mattst88@gmail.com> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14573>	2022-02-11 17:25:33 +00:00
Pavel Ondračka	a04aa4bc08	r300: transform fs sin and cos input to [0,1) range in NIR shader-db stats with RV530 (together with the vs commit): total instructions in shared programs: 65194 -> 64721 (-0.73%) instructions in affected programs: 6718 -> 6245 (-7.04%) total consts in shared programs: 45363 -> 45353 (-0.02%) consts in affected programs: 466 -> 456 (-2.15%) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5982 Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14957>	2022-02-11 16:09:45 +00:00
Pavel Ondračka	3f97306b95	r300: transform vs sin and cos input to [-PI,PI] range in NIR The python generator is mostly copy-pasted from lima. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14957>	2022-02-11 16:09:45 +00:00
Pavel Ondračka	1109381e0e	r300: use nir lowering for sin and cos on R300 and R400 The nir approximation is a bit more precise so there is one more instruction for the scalar version but if the shader actually uses vector one or some other stuff like sin(x) followed by cos(x) we save more. This nir approximation importantly seems to have better precision so this should also fix some piglits/dEQPs. With my shader-db and faked R300: total instructions in shared programs: 67751 -> 65978 (-2.62%) instructions in affected programs: 8637 -> 6864 (-20.53%) total temps in shared programs: 9191 -> 9137 (-0.59%) temps in affected programs: 486 -> 432 (-11.11%) total consts in shared programs: 45427 -> 45412 (-0.03%) consts in affected programs: 856 -> 841 (-1.75%) total lits in shared programs: 2317 -> 2346 (1.25%) lits in affected programs: 69 -> 98 (42.03%) Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14957>	2022-02-11 16:09:45 +00:00
Samuel Pitoiset	aa3405e812	radv/winsys: fix initializing debug/perftest options if multiple instances Since the winsys uses refcount, options like RADV_DEBUG_ZERO_VRAM might have not been initialized if the first instance wasn't created with application info. This fixes missing zerovram for vkd3d-proton. Cc: 21.3 22.0 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14978>	2022-02-11 16:12:47 +01:00
Rohan Garg	4001d9ce1a	anv: Handle VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_VALVE for descriptor sets Ensure that surface state does not get created for VK_DESCRIPTOR_POOL_CREATE_HOST_ONLY_BIT_VALVE and we don't bind to the binding table for descriptor sets created with this pool. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14683>	2022-02-11 12:03:42 +00:00
Daniel Schürmann	af4b26c53a	radv: move nir_opt_shrink_stores from radv_optimize_nir() No need to call this pass in a loop. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:52 +01:00
Daniel Schürmann	2a92452a0e	nir/opt_shrink_vectors: Remove shrinking of store intrinsics data source This is done via nir_opt_shrink_stores. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:50:47 +01:00
Daniel Schürmann	2dba7e6056	nir: split nir_opt_shrink_stores from nir_opt_shrink_vectors This patch moves the shrinking of store sources into a separate pass. The reasoning behind this is that this pass usually only needs to be called once while nir_shrink_vectors might better be called several times. This allows to move the pass(es) out of the optimization loops. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14480>	2022-02-11 11:34:01 +01:00
Christian Gmeiner	b07372312d	Revert "nir: make tgsi_varying_semantic_to_slot(..) public" This reverts commit `edbdd97723`. As etnaviv's TGSI compiler is gone we make that function private again. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12889>	2022-02-11 08:48:10 +00:00
Christian Gmeiner	367b6c553d	etnaviv: drop TGSI based backend compiler I think it is time to switch to NIR per default and forget about the past. There might be stuff that will explode in different ways now but thats all fixable. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12889>	2022-02-11 08:48:10 +00:00
Jason Ekstrand	29393a40ee	v3dv: Use the common command pool implementation The only interesting information stored in v3dv_cmd_pool is the list of command buffers and that's already tracked by vk_command_pool. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	fcad979b72	v3dv: Don't use vk_alloc/free2 for command buffers The pool will always have a valid allocator. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	09b5f0f3ac	anv: Use the common vk_command_pool Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	3007f8e95f	anv: Don't call DestroyCommandBuffers in AllocateCommandBuffers Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	5a15c3eff7	anv: Drop anv_cmd_buffer::pool Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	2170ec2845	anv: Don't use vk_alloc/free2 for command buffers The pool will always have a valid allocator. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	37f3da90dd	vulkan: Implement of a bunch of VkCommandPool functions Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	f424d1e9ab	vulkan/queue: Assert command buffers have the right queue family We've got enough information in common code to track this now so we may as well throw in a helpful assert. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	bda4c4f6b6	vulkan: Take a vk_command_pool in vk_command_buffer_init() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:25 +00:00
Jason Ekstrand	d59caf5d11	turnip: Use vk_command_pool Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	6fb9e4e7ff	v3dv: Use vk_command_pool Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	2746be68f1	lavapipe: Use vk_command_pool Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	2a5ae138b4	panvk: Use vk_command_pool Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	29a164b088	radv: Use vk_command_pool Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	c5b8ee8810	anv: Use vk_command_pool Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	a10b6d1c6f	vulkan: Add a common vk_command_pool base struct Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Louis-Francis Ratté-Boulianne	5e263cc324	vulkan/runtime: Add a level field to vk_command_buffer Looks like 3 implementations already have that field in their private command_buffer struct, and having it at the vk_command_buffer opens the door for generic (but suboptimal) secondary command buffer support. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Jason Ekstrand	7b0e306854	anv: Call vk_command_buffer_finish if create fails This wasn't much of a problem before because vk_command_buffer_finish() doesn't do much on an empty command buffer. However, it's about to be responsible for managing the pool's list of command buffers so it will be critical to get this right. Fixes: `c9189f4813` ("anv: Use a common vk_command_buffer structure") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14917>	2022-02-11 08:06:24 +00:00
Paulo Zanoni	cb50e4ac4d	iris: use the same VM for every context Now that we have no overlapping memory addresses for different contexts we can go ahead and also share the same VM for every context. This should make VM binding much easier to implement once we have Kernel support for it. v2: We now have engines_ctx, set it there too. v3: Fix indenting (Ken). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12028>	2022-02-11 01:42:45 +00:00
Paulo Zanoni	782efa29e6	iris: have a single border color pool per bufmgr Have a single border color pool per bufmgr instead of per context. We want to have a single VM shared among every context and the border color pool is the last feature preventing us from having that. Previously we had 1024 colors per context but once the buffer was full we just waited for the buffer to be unused and restarted it. After this patch we have 4096 colors for every single context and we can't just flush buffers if they are full, so we simply return black. There are many strategies we could try to implement to help alleviate this new 4096 limit, none of which are implemented by this patch: - We could just expand the buffer to the full 16MB we can use, allowing 262144 colors. - We could use multiple buffers and make the contexts refcount them, so eventually older buffers would reach zero references and be recycled, moving us to a working set maximum from a lifetime maximum. - We could also make the border color pool be a standard memzone and then give smaller buffers to each context when they need, so the limit would be in the number of contexts that can use border color pools. This was my first implementation but Ken suggested I switch to the one provided by this patch, which is simpler. Keep it like this since border colors don't seem to be used very much and other Mesa drivers such as radeonsi also seem to employ the "return black once we reach the limit" strategy. As a last note, we could also move the contents of iris_border_color.c to iris_bufmgr.c in order to avoid breaking some abstractions we have in Iris, like we do with iris_bufmgr_get_border_color_pool(). I can do this in case we want it. v2: Switch from standard memzone to a per-screen thing (see above). v3: Actually make it per bufmgr. Just making it per screen is not enough, since screens can share the same VM, an example being the gputest benchmark suite. v4: Rebase. v5: Remove dead code, lock around hash table lookup (Ken). v6: Simple rebase. v7: Another rebase (for_each_batch). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12028>	2022-02-11 01:42:45 +00:00
Paulo Zanoni	70dcffde4e	iris: handle IRIS_MEMZONE_BINDER with a real vma_heap like the others We're moving towards a path where all contexts share the same virtual memory - because this will make implementing vm_bind much easier - , and to achieve that we need to rework the binder memzone. As it is, different contexts will choose overlapping addresses. So in this patch we adjust the Binder to be 1GB - per Ken's suggestion - and use a real vma_heap for it. As a bonus the code gets simpler since it just reuses the same pattern we already have for the other memzones. Credits to Kenneth Granunke for helping me with this change. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12028>	2022-02-11 01:42:45 +00:00
Mike Blumenkrantz	79b946a9d2	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	68b099b8eb	zink: implement generated tcs variants using spirv shortcut this codepath needs to update literally one spirv word to get the right shader (why can't this be a spec constant?) so just update that word instead of running all the nir passes and doing a full recompile fixes: KHR-GL46.tessellation_shader.tessellation_control_to_tessellation_evaluation.gl_MaxPatchVertices_Position_PointSize KHR-GL46.tessellation_shader.tessellation_control_to_tessellation_evaluation.gl_PatchVerticesIn Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	e1ef00aed7	zink: move pipeline tcs patch_vertices value to tcs shader key making this available for shader update use no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	63236f9ea9	zink: add a tcs shader key only applies for generated tcs no functional changes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	a9d2b86c2c	zink: store the spirv_shader to the zink_shader struct for generated tcs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	6c0740c129	zink: split off CreateShaderModule into util function Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	cd9b099038	zink: store the tcs_vertices_out spirv word to the spirv_shader struct Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	f2baa0638b	zink: store the tcs_vertices_out spirv word Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	64f1960915	zink: make spirv_builder_emit_exec_mode_literal() return the word for the param Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	25d6651324	zink: make spirv_buffer_emit_word() return the word that was written Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	5663fac5ca	zink: break out spirv shader dumping into separate function debugging++ Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14976>	2022-02-11 01:29:39 +00:00
Mike Blumenkrantz	ff92d2f188	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14974>	2022-02-11 01:17:28 +00:00
Mike Blumenkrantz	8ff96efcfd	zink: always set VkPipelineMultisampleStateCreateInfo::pSampleMask by initializing this on context creation, we can ensure that the correct value is always here cc: mesa-stable fixes: dEQP-GLES31.functional.texture.multisample.samples_1.sample_mask_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_1.sample_mask_and_sample_coverage dEQP-GLES31.functional.texture.multisample.samples_1.sample_mask_and_sample_coverage_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_1.sample_mask_only dEQP-GLES31.functional.texture.multisample.samples_2.sample_mask_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_2.sample_mask_and_sample_coverage dEQP-GLES31.functional.texture.multisample.samples_2.sample_mask_and_sample_coverage_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_2.sample_mask_only dEQP-GLES31.functional.texture.multisample.samples_3.sample_mask_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_3.sample_mask_and_sample_coverage dEQP-GLES31.functional.texture.multisample.samples_3.sample_mask_and_sample_coverage_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_3.sample_mask_only dEQP-GLES31.functional.texture.multisample.samples_4.sample_mask_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_4.sample_mask_and_sample_coverage dEQP-GLES31.functional.texture.multisample.samples_4.sample_mask_and_sample_coverage_and_alpha_to_coverage dEQP-GLES31.functional.texture.multisample.samples_4.sample_mask_only Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14974>	2022-02-11 01:17:28 +00:00
Dave Airlie	da0e00e0b9	gallivm: add coroutine attribute that llvm requires. Running llvm in debug mode asserts on this being missing. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14963>	2022-02-11 00:32:21 +00:00
Jesse Natalie	31e2c3f550	microsoft/compiler: Fill interpolation for sysval inputs to non-vertex shader Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5483 Reviewed-by: Michael Tang <tangm@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14984>	2022-02-11 00:19:17 +00:00
Jesse Natalie	9dba60f397	d3d12: Only force point sampling for emulated shadow samplers Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14981>	2022-02-10 23:33:35 +00:00
Iván Briano	e2a5e2d5a0	anv: make the pointer valid before we assign stuff into it Fixes: `665ffd4bf9` ("anv: Update VK_KHR_fragment_shading_rate for newer HW") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14982>	2022-02-10 23:06:13 +00:00
Caio Oliveira	12b4aad803	anv: Enable requiredSubgroupSize for Task/Mesh Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14979>	2022-02-10 22:37:03 +00:00
Kenneth Graunke	577d131bcf	anv: Increase maxBoundDescriptorSets to 32 We recently had a request to support a larger maxBoundDescriptorSets, specifically 32, and there doesn't appear to be a reason we need to restrict this to 8. According to vulkan.gpuinfo.org reports, most Vulkan drivers appear to support 32. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14952>	2022-02-10 13:56:46 -08:00
Jesse Natalie	a84265938b	driconf: Add Heaven entries for Windows .exe Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14965>	2022-02-10 20:19:00 +00:00
Jesse Natalie	2360c25d63	d3d12: Don't add a second dual-source output for Heaven Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14965>	2022-02-10 20:19:00 +00:00
Jesse Natalie	22fc534930	d3d12: Default newly-created resources to not-resident Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	e3a2cb4b67	d3d12: Implement residency management algorithm Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	40dafd0094	d3d12: Add a budget/usage callback to the screen Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	671deb541e	d3d12: Add residency info to d3d12_bo This is all currently immutable, but will be used to manage the residency of the underlying D3D objects in a future commit. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	f4c74f74f8	d3d12: Add sampler's textures to batch bo tracking This will be important for residency in a future change, but also is necessary for synchronize() to work correctly for TBOs. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	34e53d4c9c	d3d12: Move ID3D12Fence from context to screen There's already a single command queue for the screen, meaning that all commands are being serialized implicitly into that queue. There's no need to have separate fences for parallel contexts when those fences would all share the same underlying timeline. This adds an explicit lock to expand the scope of the implicit screen command queue ordering to include fence signals. Each context still gets its own submit sequence, which is used for 1 purpose right now: A uniqueness check in the state manager to see if states are coming from separate command lists, to apply promotion and decay logic. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	7ce2d5aece	d3d12: Forward wait condition from query -> result buffer The no-wait condition was wrong before. If the query was used in the current batch (query->fence_value == context->fence_value), we'd continue on with the operation instead of returning false. Then the buffer map would see that the bo is referenced in the current batch, and would flush and wait, even though we were asked not to wait. This fixes the condition by simply using the (correct) buffer map logic. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Jesse Natalie	5cbd7093af	d3d12: When mapping a resource used in the current batch without blocking, at least flush Also, resource_is_busy needs to opportunistically retire batches, so apps can spin on non-blocking resource maps and eventually succeed. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14959>	2022-02-10 20:06:15 +00:00
Ian Romanick	1cb3d1a6ae	nir: Produce correct results for atan with NaN Properly handling NaN adversely affects several hundred shaders in shader-db (lots of Skia and a few others from various synthetic benchmarks) and fossil-db (mostly Talos and some Doom 2016). Only apply the NaN handling work-around when the shader demands it. v2: Add comment explaining the 1.0*y_over_x. Suggested by Caio. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `2098ae16c8` ("nir/builder: Move nir_atan and nir_atan2 from SPIR-V translator") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	7d0d9b9fbc	nir: Properly handle various exceptional values in frexp frexp_sig of ±0, ±Inf, or NaN should just return the input unmodified. frexp_exp of ±Inf or NaN is undefined, and frexp_exp of ±0 should return the input unmodified. This seems to already work. No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `23d30f4099` ("spirv,nir: lower frexp_exp/frexp_sig inside a new NIR pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	93ed87af28	spirv: Produce correct result for GLSLstd450Tanh with NaN No shader-db or fossil-db changes on any Intel platform. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `9f9432d56c` ("Revert "spirv: Use a simpler and more correct implementaiton of tanh()"") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	e442b9d792	spirv: Produce correct result for GLSLstd450Modf with Inf GLSLstd450ModfStruct too. No shader-db or fossil-db changes on any Intel platform. v2: Fix handling 16-bit (and presumably 64-bit) values. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `f92a35d831` ("vtn: Fix Modf.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	75ef5991f5	spriv: Produce correct result for GLSLstd450Step with NaN NOTE: This commit needs "nir: All set-on-comparison opcodes can take all float types" or regressions will occur in other Vulkan SPIR-V tests. No shader-db changes on any Intel platform. NOTE: This commit depends on "nir: All set-on-comparison opcodes can take all float types". v2: Fix handling 16-bit (and presumably 64-bit) values. About 280 shaders in Talos are hurt by a few instructions, and a couple shaders in Doom 2016 are hurt by a few instructions. Tiger Lake Instructions in all programs: 159893290 -> 159895026 (+0.0%) SENDs in all programs: 6936431 -> 6936431 (+0.0%) Loops in all programs: 38385 -> 38385 (+0.0%) Cycles in all programs: 7019260087 -> 7019254134 (-0.0%) Spills in all programs: 101389 -> 101389 (+0.0%) Fills in all programs: 131532 -> 131532 (+0.0%) Ice Lake Instructions in all programs: 143624235 -> 143625691 (+0.0%) SENDs in all programs: 6980289 -> 6980289 (+0.0%) Loops in all programs: 38383 -> 38383 (+0.0%) Cycles in all programs: 8440083238 -> 8440090702 (+0.0%) Spills in all programs: 102246 -> 102246 (+0.0%) Fills in all programs: 131908 -> 131908 (+0.0%) Skylake Instructions in all programs: 134185495 -> 134186618 (+0.0%) SENDs in all programs: 6938790 -> 6938790 (+0.0%) Loops in all programs: 38356 -> 38356 (+0.0%) Cycles in all programs: 8222366923 -> 8222365826 (-0.0%) Spills in all programs: 98821 -> 98821 (+0.0%) Fills in all programs: 125218 -> 125218 (+0.0%) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `1feeee9cf4` ("nir/spirv: Add initial support for GLSL 4.50 builtins") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	38a94c82e6	intel/fs: Don't optimize out 1.0x and -1.0x This (sort of) matches the behavior of nir_opt_algebraic. This ensures that subnormal values are properly flushed to zero. With the aid of "nir/search: Float sources of texture instructions are float users" and "nir/search: Transitively apply is_only_used_as_float", there would have been no shader-db regressions on Intel platforms. However, those caused a significant increase in compile time. Since the instruction regressions were so small, I just dropped those commits rather than improve them. All Haswell and newer platforms had similar results. (Ice Lake shown) total instructions in shared programs: 20125042 -> 20125094 (<.01%) instructions in affected programs: 7184 -> 7236 (0.72%) helped: 0 HURT: 32 HURT stats (abs) min: 1 max: 4 x̄: 1.62 x̃: 2 HURT stats (rel) min: 0.11% max: 1.49% x̄: 0.85% x̃: 0.78% 95% mean confidence interval for instructions value: 1.39 1.86 95% mean confidence interval for instructions %-change: 0.74% 0.96% Instructions are HURT. total cycles in shared programs: 862745586 -> 862746551 (<.01%) cycles in affected programs: 109872 -> 110837 (0.88%) helped: 12 HURT: 23 helped stats (abs) min: 2 max: 774 x̄: 90.83 x̃: 19 helped stats (rel) min: 0.07% max: 25.23% x̄: 3.06% x̃: 0.40% HURT stats (abs) min: 2 max: 1106 x̄: 89.35 x̃: 12 HURT stats (rel) min: 0.08% max: 45.40% x̄: 3.01% x̃: 0.47% 95% mean confidence interval for cycles value: -60.09 115.23 95% mean confidence interval for cycles %-change: -2.21% 4.07% Inconclusive result (value mean confidence interval includes 0). All of the shaders hurt are in either UE4 shooter-game or shooter_demo. Tiger Lake Instructions in all programs: 159893213 -> 159893290 (+0.0%) SENDs in all programs: 6936431 -> 6936431 (+0.0%) Loops in all programs: 38385 -> 38385 (+0.0%) Cycles in all programs: 7019259514 -> 7019260087 (+0.0%) Spills in all programs: 101389 -> 101389 (+0.0%) Fills in all programs: 131532 -> 131532 (+0.0%) Ice Lake Instructions in all programs: 143624164 -> 143624235 (+0.0%) SENDs in all programs: 6980289 -> 6980289 (+0.0%) Loops in all programs: 38383 -> 38383 (+0.0%) Cycles in all programs: 8440082767 -> 8440083238 (+0.0%) Spills in all programs: 102246 -> 102246 (+0.0%) Fills in all programs: 131908 -> 131908 (+0.0%) Skylake Instructions in all programs: 134185424 -> 134185495 (+0.0%) SENDs in all programs: 6938790 -> 6938790 (+0.0%) Loops in all programs: 38356 -> 38356 (+0.0%) Cycles in all programs: 8222366529 -> 8222366923 (+0.0%) Spills in all programs: 98821 -> 98821 (+0.0%) Fills in all programs: 125218 -> 125218 (+0.0%) Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Fixes: `f5dd6dfe01` ("anv: enable VK_KHR_shader_float_controls and SPV_KHR_float_controls") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	38800b385c	nir: All set-on-comparison opcodes can take all float types Extend `4195a9450b` so that the next poor fool doesn't come along and say, "sge does the right thing for 16-bit sources, but slt gives a NIR validation failure. What the deuce?" NOTE: This commit is necessary to prevent regressions in GLSLstd450Step tests of 16-bit sources at "spriv: Produce correct result for GLSLstd450Step with NaN". Fixes: `4195a9450b` ("nir: sge operation is defined for floating-point types") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	97ce3a56bd	nir/search: Constify instr parameter to nir_search_expression::cond Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Ian Romanick	4dd4135551	nir: Constify def parameter to nir_ssa_def_bits_used Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13999>	2022-02-10 18:15:39 +00:00
Otavio Pontes	510d248299	nir: Use proper macro to set bits of variable correctly When slots is 64 only the first bit was being set, instead of setting all 64 bits of the variable, so for that case the function get_variable_io_mask() always returned 0. This behaviour caused variables that are being used both on producer and consumer to be considered unused and thus being removed on nir_remove_unused_io_vars(). Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14955>	2022-02-10 17:19:54 +00:00
Daniel Stone	7a0ace7d4e	Revert "ci: Disable Windows for now" This reverts commit `be385ab5bc`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14975>	2022-02-10 16:44:16 +00:00
Georg Lehmann	c2168f845e	nir/lower_mediump: Treat u2u16 like i2i16. There is a comment in nir_fold_16bit_sampler_conversions saying that these are the same, but the code only checks for i2i16. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14893>	2022-02-10 16:13:54 +00:00
Mike Blumenkrantz	532665c73c	zink: anv (icl) ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14973>	2022-02-10 16:01:04 +00:00
Danylo Piliaiev	b84f059680	freedreno/pps: Expose same counters as blob Expose most of the counters exposed by blob. By faking the value of counters returned from kgsl I found the exact underlying counters and constant coefficients being used. Note, coefficients for counters that depend on time are NOT verified. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14323>	2022-02-10 15:15:33 +00:00
Samuel Pitoiset	03ab9d895e	radv/ci: update CI lists for CTS 1.3.1.0 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Martin Roukala <martin.roukala@mupuf.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14946>	2022-02-10 14:52:42 +00:00
Daniel Schürmann	fce6ca0f3a	radv: remove exports without color attachment or writemask This lets us make use of NIR's more advanced DCE. This includes removing of CF constructs, PS inputs and VS outputs. Totals from 1959 (1.45% of 134913) affected shaders: (GFX10.3) VGPRs: 73464 -> 71944 (-2.07%); split: -3.79%, +1.72% SpillSGPRs: 6 -> 0 (-inf%) CodeSize: 4860324 -> 4675248 (-3.81%); split: -4.92%, +1.11% LDS: 2619904 -> 2781696 (+6.18%); split: -0.37%, +6.55% MaxWaves: 50614 -> 50852 (+0.47%); split: +1.63%, -1.16% Instrs: 924233 -> 887836 (-3.94%); split: -5.01%, +1.07% Latency: 5635532 -> 5418083 (-3.86%); split: -4.53%, +0.67% InvThroughput: 1107764 -> 1077542 (-2.73%); split: -3.44%, +0.71% VClause: 17361 -> 16163 (-6.90%); split: -8.38%, +1.47% SClause: 31886 -> 29323 (-8.04%); split: -8.52%, +0.48% Copies: 53529 -> 52127 (-2.62%); split: -5.30%, +2.68% Branches: 22993 -> 22802 (-0.83%); split: -3.44%, +2.61% PreSGPRs: 53123 -> 51395 (-3.25%); split: -3.60%, +0.35% PreVGPRs: 59699 -> 57424 (-3.81%); split: -5.13%, +1.32% Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14771>	2022-02-10 14:23:26 +00:00
Daniel Stone	be385ab5bc	ci: Disable Windows for now Docker on Windows is broken for some reason, so just disable it for now. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14970>	2022-02-10 12:51:06 +00:00
Lionel Landwerlin	137e170bcb	anv: update limit for maxVertexInputBindingStride Before: maxVertexInputBindingStride = 2048 (gen7+) After: maxVertexInputBindingStride = 2048 (gen7/gen8) maxVertexInputBindingStride = 4095 (gen9+) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14958>	2022-02-10 10:06:22 +00:00
Chia-I Wu	f93059b19f	venus: fix two VN_TRACE_SCOPE's in the same scope Make sure __LINE__ is expanded. Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14960>	2022-02-10 05:11:46 +00:00
Nanley Chery	987bc44954	iris: Drop the iris_resource aux usage bit fields A big reason we had these fields was to help create a set of surface states for a resource. That's largely being handled through other means now. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	ae763940e8	iris: Compute aux.possible_usages from aux.usage We're going to remove res->aux.possible_usages. This will simplify the commit in which we do so. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	d905018a2c	iris: Use iris_sample_with_depth_aux more often We're going to remove res->aux.sampler_usages. This will simplify the commit in which we do so. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	85a7fb1e19	intel/isl: Add format assertions for surfaces using CCS This caught some invalid CCS surface states created by iris. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	05b8b08ef4	iris: Avoid making some invalid CCS surface states Although a resource may support CCS with its original format, a texture view of that resource may have a format that doesn't support compression. Don't create CCS surface states for such texture views. This change affects iris' behavior when running piglit's arb_texture_view-rendering-formats_gles3 test on SKL. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	a9beb87dce	iris: Inline some surface_state.cpu references Now that we're using fill_surface_states, these aren't needed anymore. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	d705faad6c	iris: Add and use fill_surface_states This helper simplifies some repeated logic. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	eb51fd0414	iris: Add and use use_surface_state This helper simplifies some repeated logic. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	89ebdd67c4	iris: Add and use iris_surface_state::aux_usages An iris_surface_state can have a different set of possible aux usages than its iris_resource. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	b60af618a0	iris: Drop res param from surf_state_offset_for_aux This has been unused since commit `117a0368b0`. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	ce37e176f1	iris: Drop format param from fast_clear_color It's unused. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Nanley Chery	6778b3a379	iris: Don't fast clear with the view format Fast clear with the resource format instead. This is safe to do because can_fast_clear_color ensures that the clear color generates the same pixel with either the view format or the resource format. On SKL, this prevents us from using an invalid surface state. This platform doesn't support CCS_E with sRGB formats, but prior to this patch we allowed fast-clearing with this combination. Piglit's fcc-write-after-clear test can trigger this. Fixes: `230952c210` ("iris: Don't support sRGB + Y_TILED_CCS on gen9") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14806>	2022-02-10 04:47:14 +00:00
Mike Blumenkrantz	68c1b50e48	aux/draw: fix llvm tcs lane vec generation the idx param for LLVMBuildInsertElement is zero-indexed based on the value of 'vector_length' (always 4), and the vector length is (obviously) sized to 'vector_length', so this should be the member of the vec that is being inserted, not the invocation index cc: mesa-stable fixes (zink, but only on my one machine): KHR-GL46.tessellation_shader.single.max_patch_vertices KHR-GL46.tessellation_shader.tessellation_shader_tc_barriers.barrier_guarded_read_write_calls dEQP-GLES31.functional.tessellation.shader_input_output.barrier dEQP-GLES31.functional.tessellation.shader_input_output.patch_vertices_5_in_10_out dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_isolines_geometry_output_points dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_isolines_point_mode_geometry_output_triangles dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_quads_geometry_output_points dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_quads_point_mode_geometry_output_lines dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_triangles_geometry_output_points dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_triangles_point_mode_geometry_output_lines Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14949>	2022-02-10 04:14:28 +00:00
Bas Nieuwenhuizen	8d5be0a2b3	radv: Add submit locking with trace bo. Otherwise cmdbuffers from different queues can override the trace id from each other, making for a very confusing hang report. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14868>	2022-02-10 03:49:02 +00:00
Ian Romanick	e3cbc328e0	gallivm/nir: Call nir_lower_bool_to_int32 after nir_opt_algebraic_late All of the opcodes in nir_opt_algebraic_late are the unsized (1-bit) versions. If the lowering to int32 happens first, many of the optimizations and lowerings won't happen. Of particular importance is the lowering of fisfinite. If a shader happens to contain fisfinite of an fp16 value, it will assert later during compliation. Reviewed-by: Dave Airlie <airlied@redhat.com> Fixes: `78b4e417d4` ("gallivm: handle fisfinite/fisnormal") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14942>	2022-02-10 03:12:46 +00:00
Emma Anholt	d633eace3f	ci/freedreno: Try to detect a wedged MMU that's happened recently. Possibly since the VK-GL-CTS 1.3.1.0 uprev. It doesn't seem to recover, like it says. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14945>	2022-02-10 01:13:31 +00:00
Emma Anholt	b7278b2281	ci/lvp: Add a flake that's shown up a couple of times since VKCTS 1.3.1. Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14945>	2022-02-10 01:13:31 +00:00
Emma Anholt	2d15f9e3c2	ci/r300: Drop xfails that were fixed with the VK-GL-CTS 1.3.1.0 uprev. Acked-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Emma Anholt	20469009c7	nir: Delete the per-instr SSA liveness impl. It was introduced for nir-to-tgsi, and I found that it was the wrong approach. There's a reason nobody else does RA this way. Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Emma Anholt	74c02d99b2	nir_to_tgsi: Replace the NIR SSA liveness with TGSI reg-level liveness. Allocating NIR registers ends up being required for drivers like r600 and nv30, which don't do their own allocation (except in some cases on r600 where sb is used). Rather than add a NIR register liveness impl (https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14158), switch from NIR-based liveness to just doing the same channel-based liveness logic that the NIR registers needed at the TGSI level. The actual liveness code here basically comes straight out of brw_vec4_live_variables.cpp. Since we do the liveness in TGSI now, it also means we don't need to be careful about not reading SSA values from later TGSI instructions (which may be useful for doing some greedy instruction selection in generating TGSI instructions). i915g: total instructions in shared programs: 400719 -> 380730 (-4.99%) instructions in affected programs: 284760 -> 264771 (-7.02%) total tex_indirect in shared programs: 12289 -> 12290 (<.01%) tex_indirect in affected programs: 4 -> 5 (25.00%) total temps in shared programs: 32172 -> 22086 (-31.35%) temps in affected programs: 30647 -> 20561 (-32.91%) LOST: 0 GAINED: 148 r300: total instructions in shared programs: 1472463 -> 1459286 (-0.89%) instructions in affected programs: 507009 -> 493832 (-2.60%) total temps in shared programs: 212143 -> 201678 (-4.93%) temps in affected programs: 78007 -> 67542 (-13.42%) softpipe: total temps in shared programs: 517071 -> 294387 (-43.07%) temps in affected programs: 509324 -> 286640 (-43.72%) Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Emma Anholt	f4ce3178d9	nir_to_tgsi: Track our TGSI insns in blocks before emitting tokens. To do register allocation well, we want to have a point before ureg_insn_emit() to look at the liveness of the values and allocate them to TGSI temporaries. In order to do that, we have to switch from ureg_OPCODE() emitting TGSI tokens directly to a new ntt_OPCODE() that stores the ureg args in a block structure. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Emma Anholt	3f84c67af8	tgsi: Refactor out a tgsi_util_get_src_usage_mask(). The function operated on a tgsi_full_instruction, but for code generation in NIR-to-TGSI I want to reuse this logic using pieces of tgsi_ureg structs. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Emma Anholt	e92209f299	i915g: Report the temps usage This is another important metric for this driver, and we don't do our own RA so ours is just what TGSI uses. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14404>	2022-02-10 00:36:57 +00:00
Eric Engestrom	bfcc7c20c8	docs: update calendar and link releases notes for 21.3.6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14956>	2022-02-10 00:28:37 +00:00
Eric Engestrom	d66a22a02b	docs: add release notes for 21.3.6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14956>	2022-02-10 00:28:37 +00:00
Dylan Baker	aabc7034d7	docs: update calendar for 22.0.0-rc2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14953>	2022-02-09 20:42:46 +00:00
Danylo Piliaiev	97c90c514f	turnip: Depth/stencil formats should not expose any bufferFeatures From the Vulkan 1.3.205 spec, section 19.3 "43.3. Required Format Support": Mandatory format support: depth/stencil with VkImageType VK_IMAGE_TYPE_2D [...] bufferFeatures must not support any features for these formats See https://gitlab.khronos.org/vulkan/vulkan/-/merge_requests/4849 Fixes CTS tests: dEQP-VK.api.buffer.invalid_buffer_features.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14927>	2022-02-09 20:11:22 +00:00
Samuel Pitoiset	53dc5f774d	radv: only emit the per-vertex VRS state if the pipeline forced it If the primitive shading rate is not written by the last VGT stage (like if no FS), it's useless to emit the VRS state. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14907>	2022-02-09 17:40:37 +01:00
Samuel Pitoiset	d0171dffe1	radv: do not force per-vertex VRS if there is no pixel shader This has no effect. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14907>	2022-02-09 17:40:37 +01:00
Samuel Pitoiset	2451290bc4	radv: rewrite RADV_FORCE_VRS directly in NIR This introduces a small NIR pass that exports VARYING_SLOT_PRIMITIVE_SHADING_RATE if RADV_FORCE_VRS is used, instead of doing this in both backend compilers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14907>	2022-02-09 17:40:34 +01:00
Juan A. Suarez Romero	7955df28a6	v3dv/ci: Update failure list Add more failing tests to the expected failures. These are obtained after executing the full Vulkan CTS. v2: - Add comments in the tests (Alejandro) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14948>	2022-02-09 15:46:23 +00:00
Mike Blumenkrantz	5d7bae5ab3	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14852>	2022-02-09 09:13:37 -05:00
Mike Blumenkrantz	63fa2ab978	zink: add Sample decorations to fragment shader inputs with sample shading PIPE_CAP_FORCE_PERSAMPLE_INTERP is broken for the no-attachment case, so this is the only option fixes (lavapipe): KHR-GL46.sample_shading.render* dEQP-GLES31.functional.sample_shading.min_sample_shading* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14852>	2022-02-09 09:13:37 -05:00
Tomeu Vizoso	0cb5333a14	iris/ci: Enable Whiskey Lake boards by default The boards should be stable now. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14822>	2022-02-09 13:16:25 +00:00
Qiang Yu	fe560aeb12	radeonsi: workaround Specviewperf13 Catia hang on GFX9 The root cause is unknown but PAL always update IA_MULTI_VGT_PARAM whenever primitive type change. cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Singed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14944>	2022-02-09 12:51:38 +00:00
Jordan Justen	e2cd0c3a3c	intel/fs: Assert that old pull-const code is not used if devinfo->has_lsc Jason changed this to use LSC in: `f5876dfdb9` ("intel/fs: Lower uniform pull constant load message to LSC dataport") Cc: 22.0 <mesa-stable> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14384>	2022-02-09 10:39:17 +00:00
Tapani Pälli	562f7eef5b	iris: invalidate L3 read only cache when VF cache is invalidated When enabling the caching of index,vertex data in the L3 RO Cache (L3BypassDisable), we need to use L3ReadOnlyCacheInvalidationEnable to invalidate cache when buffer is modified by CPU/GPU. Ref: bspec 46314 Fixes: `ed8f2c4cbe` ("iris: Cache VB/IB in L3$ for Gen12") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14815>	2022-02-09 10:05:10 +00:00
Tapani Pälli	7a6ea04795	anv: invalidate L3 read only cache when VF cache is invalidated When enabling the caching of index,vertex data in the L3 RO Cache (L3BypassDisable), we need to use L3ReadOnlyCacheInvalidationEnable to invalidate cache when buffer is modified by CPU/GPU. Ref: bspec 46314 Fixes: `6c345ddbe4` ("anv: Cache VB/IB in L3$ for Gfx12") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5941 Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14815>	2022-02-09 10:05:10 +00:00
Tapani Pälli	442628b702	intel/genxml: add PIPE_CONTROL field for L3 read only cache invalidation Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14815>	2022-02-09 10:05:10 +00:00
Rohan Garg	03e1e19246	anv: Refactor descriptor copy Refactor descriptor copies to use the existing helper functions instead of rolling our own. In order to facilitate this, we need to store the appropriate buffer views for the relevant descriptors internally and reuse them in the helpers. Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14909>	2022-02-09 09:24:37 +00:00
Samuel Pitoiset	6fba52cfd2	radv: allow RADV_FORCE_VRS with pipeline VRS declared as dynamic This is for vkd3d which needs to always declare the VRS dynamic state because it's fully dynamic in DX12. Ignoring the VRS dynamic state when it's a no-op seems the best way to handle this, although it's definitely not perfect. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14910>	2022-02-09 08:17:17 +00:00
Kenneth Graunke	413ea503ba	iris: Disable PIPE_CAP_PREFER_BACK_BUFFER_REUSE This cap bit only affects DRI_PRIME setups. Since iris now uses the blitter to perform dGPU -> iGPU copies asynchronously, it's better to always use at least two backbuffers so the 3D engine can start rendering the next frame during the copy. See commit `d17e752857` where this change was made for radeonsi. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13877>	2022-02-09 07:45:43 +00:00
Kenneth Graunke	e3cb620b55	iris: Use the hardware blitter for DRI PRIME blits In a hybrid graphics setup, Mesa allocates two buffers for the window surface. The first is what the discrete card renders to; it lives in VRAM and is usually tiled and possibly compressed. The second is a shadow copy that lives in system memory (readable by the integrated card with the displays); it's usually linear and uncompressed. Mesa's window system code schedules blits to update the shadow copy when needed, typically at the end of a frame. These can be fairly costly when running a full-screen application at high resolutions. We'd like to use the blitter for these copies, as it lets us perform the copy asynchronously, letting the 3D engine race ahead and start rendering the next frame. If we used the 3D engine, the next frame could not start rendering until the PRIME blit finishes, giving us less time to draw the frame. Fortunately, Tigerlake introduced new blitter commands which can operate at full memory bandwidth. DRI PRIME blits happen via the Gallium blit() hook. We can detect that case by looking for the PIPE_BIND_PRIME_BLIT_DST flag on the destination resource. This patch detects that case and calls iris_copy_region() on IRIS_BATCH_BLITTER to handle it. We know a priori that the blitter can handle this operation (it's not a scaled blit, the formats match and should not be 96bpp, there's no combined depth stencil, or other weird edge cases). blorp_copy() will also assert that edge cases don't occur. Together with the next patch, this improves performance on DG1 Hybrid scenarios by about 5-6%. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13877>	2022-02-09 07:45:43 +00:00
Kenneth Graunke	f9eba6e2b5	iris: Allow IRIS_BATCH_BLITTER in iris_copy_region() This updates iris_copy_region() to support using the blitter batch. (Future patches will actually do so; for now, we keep using render.) Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13877>	2022-02-09 07:45:43 +00:00
Melissa Wen	70a219d4a3	broadcom/simulator: enable multisync in the simulator Use drmSyncobjSignal to signal out_syncobjs when a GPU job submission ends in the simulator. With this, we can enable multisync support in the simulator and keep the multisync approach to process fence by submitting a serialized no-op job that adds the fence to the array of out syncobjs, i.e. syncobjs to be signaled in the kernel when a job completes (job post deps). Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14768>	2022-02-09 07:22:42 +00:00
Ilia Mirkin	5200e1c212	translate: improve sse2 32-bit unsigned -> float conversion The existing logic would drop the low bit. Instead, let's drop the high bit, do the conversion, and then add the fixed constant back in if the value had the high bit set originally. Fixes KHR-GL45.direct_state_access.vertex_arrays_attribute_format on drivers that use this module to handle the format conversion. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Emma Anholt <emma@anholt.net> Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14922>	2022-02-09 06:04:25 +00:00
Ilia Mirkin	0b69f7b15d	rtasm: add pcmpgtd operation This will be used shortly by the translate code. Available in SSE2. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Emma Anholt <emma@anholt.net> Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14922>	2022-02-09 06:04:25 +00:00
Ilia Mirkin	55b735c51a	rtasm: fix printf specifier for ptrdiff_t In practice it's a small number, but new gcc versions complain. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Emma Anholt <emma@anholt.net> Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14922>	2022-02-09 06:04:25 +00:00
Mike Blumenkrantz	7d1727079c	zink: ci updates hooray Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14932>	2022-02-09 05:11:40 +00:00
Erik Faye-Lund	883017b67e	zink: do not copy colors through floats Copying per compoents might flush NaN values, leading to changes in the values, so it'd be safer to copy as unsigned integers here. But in one of the cases here we can do even better, and just copy the whole damn union instead. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14932>	2022-02-09 05:11:40 +00:00
Jason Ekstrand	745fc95659	zink: Re-interpret formats when using vkCmdClearColorImage() vkCmdClearColorImage() doesn't take a view format so it always uses the underlying format of the image. If there's texture views going on, we need to manually mangle the colors into the image format. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14932>	2022-02-09 05:11:40 +00:00
Ilia Mirkin	86eaff29c0	st/mesa: only enable ARB_enhanced_layouts if there are xfb buffers It really doesn't make sense without any xfb support. One could limp along, but our validation does not work as-is. Doesn't seem important to support this use-case. This disables GL_ARB_enhanced_layouts on crocus with gen4/5. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14869>	2022-02-09 03:31:16 +00:00
Ilia Mirkin	13c6f401cc	glsl: only validate xfb_buffer values when we have enhanced layouts XFB might not be supported, and the shader wouldn't be setting this flag. But validation would still fail, since the number of xfb buffers would be 0. So only validate if an xfb_buffer is set in the qualifiers. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5415 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14869>	2022-02-09 03:31:16 +00:00
Ilia Mirkin	c17a3392c4	glsl: simplify conditions for setting various allowed flags Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14869>	2022-02-09 03:31:16 +00:00
Emma Anholt	2883e8f33d	nir_to_tgsi: Add a flag for lowering fabs, and use it in r300/i915. Saves instructions if the same fabs value is used multiple times. i915g: total instructions in shared programs: 397005 -> 396525 (-0.12%) instructions in affected programs: 11061 -> 10581 (-4.34%) LOST: 0 GAINED: 22 r300 (not r500): total instructions in shared programs: 180286 -> 179767 (-0.29%) instructions in affected programs: 27102 -> 26583 (-1.91%) total temps in shared programs: 29692 -> 29638 (-0.18%) temps in affected programs: 356 -> 302 (-15.17%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14938>	2022-02-08 18:50:01 -08:00
Emma Anholt	f4ee7146f9	nir: Split the flag for lowering of fabs and fneg to source modifiers. i915 and r300 have fneg source modifier but not fabs, and doing it in NIR can save us some backend pain. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14938>	2022-02-08 18:50:01 -08:00
Emma Anholt	bd24f418c3	r300: Throw a compile error instead of an assert in r300 swizzle rewrites. I hit this on shader-db, but I really just want to get stats for unrelated changes. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14938>	2022-02-08 18:50:01 -08:00
Emma Anholt	4968f8c066	r300: Demote a compiler assert(0) to a compile failure. This triggers in shader-db and doesn't have an obvious fix. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14938>	2022-02-08 18:50:01 -08:00
Jesse Natalie	97d13b2deb	d3d12: Fix take_ownership semantic for constant buffers Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14941>	2022-02-09 02:14:34 +00:00
Emma Anholt	d52d500f17	r300: Request that nir-to-tgsi avoid generating TGSI_OPCODE_CMP. Given that our fcsels are on float-bools, we can emit the LRP directly and save the backend having to emit a SLT to turn the CMP src[0] into a bool. This required passing a codegen flags struct for nir-to-tgsi. I think this is a good way forward for it, as the alternative I think has mostly been adding flags to nir_shader_compiler_options (since adding PIPE_SHADER_CAPs is an unreasonable amount of pain). r300 shader-db: total instructions in shared programs: 1484320 -> 1472463 (-0.80%) instructions in affected programs: 243588 -> 231731 (-4.87%) total temps in shared programs: 212485 -> 212143 (-0.16%) temps in affected programs: 3845 -> 3503 (-8.89%) Acked-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14886>	2022-02-09 01:19:13 +00:00
Dave Airlie	4a1ba7914a	ci/lavapipe: update lvp asan results after leak fixes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14939>	2022-02-09 09:55:50 +10:00
Dave Airlie	2f9089f6de	lavapipe: fix sampler + sampler view leaks. The compute sampler views are using a different method of generation so have to be deleted explicitly. Fixes: `e94fd4cc65` ("lavapipe: rename vallium to lavapipe") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14939>	2022-02-09 09:54:21 +10:00
Pavel Ondračka	1f5330de3a	r300: fix transformation of abs modifiers with negate It is being overwritten by the memset. Just set the only remaining member RelAddr explicitly. Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip.gawin@zoho.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14915>	2022-02-08 22:54:02 +00:00
Emma Anholt	ef112db311	ci: Bump VK-GL-CTS to 1.3.1.0. The main thing is VK 1.3 testing, but also includes test bugfixes. The 1.3 CTS required an uprev of deqp-runner to handle a new style of test output, and that deqp-runner brings in some neat new features, too (piglit in your deqp-runner suite, and extension list checking). A bunch of VK tests got renamed, so I replaced panvk's custom test list with simple include filters on the main test list. Acked-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> (panvk) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:36 +00:00
Emma Anholt	3f34251495	ci/broadcom: Remove unused v3dv xfails file. It's actually in broadcom-rpi4-fails.txt. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:35 +00:00
Emma Anholt	648dd03e32	ci/panfrost: Add a flake a few of us have run into in the last couple days. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14920>	2022-02-08 22:16:35 +00:00
Jesse Natalie	60775780ae	d3d12: Allow 8bit index buffer conversions by vbuf Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	52766e020f	d3d12: Use CPU storage in TC for buffers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	96d68cb300	d3d12: Add a buffer busy callback to the bufmgr Not all cached buffers can be mapped, so using map with do-not-wait is a terrible heuristic. Use an explicit buffer busy callback which is always false, since buffers are only put into the cache once they're free. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	9d6febad5d	d3d12: Actually suballocate and cache buffers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	d0f4f8efae	d3d12: Fix offset for buf/image copies with suballocated buffers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	fb08bc8d76	d3d12: Don't suballocate TBO buffers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	58a9a63d9e	d3d12: Fix TBOs from suballocated buffers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	c35f77aa84	d3d12: Delete make_resource_writeable This never did anything useful AFAICT since we didn't actually suballocate buffers, and when this ended up being invoked it breaks the ability to read back XFB data. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	caae9b0e1f	d3d12: Always respect offsets when mapping a bo, not just when there's a range Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	b48aea0ec8	d3d12: Fix range calculation for suballocated buffers in d3d12_bo_unmap Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	2659098d6d	d3d12: Fix set constant buffers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Jesse Natalie	7ec0e2b893	tc: CPU storage needs to be freed with align_free Cc: mesa-stable Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14933>	2022-02-08 20:36:29 +00:00
Alyssa Rosenzweig	12446491c1	panfrost: Fix Depth Source enum As I suspected... sigh. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	32e58c2dd4	panfrost: Remove unused layout enums Folded into Valhall-specific plane descriptor enums. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	d27d46a266	panfrost: Remove some indexed formats on Valhall Block compressed formats like ETC2 are now indicated in the plane descriptor, rather than the pixel format descriptor. Various other minor formats were removed in Valhall; remove them from the XML so we don't accidentally try to use them. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	8c51b54bd1	panfrost: Update supported job types Remove a few that no longer exist, and rename IDVS helper to Malloc Vertex. The distinction between Malloc Vertex jobs and regular Indexed Vertex jobs is that the hardware allocates varying buffers dynamically for Malloc Vertex jobs. Regular IDVS and even legacy tiler jobs are also supported where desired. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	d70a48a706	panfrost: Flesh out tiler heap descriptor Merged with the Buffer descriptor, hence why it shares a type nibble. However, Bifrost uses a dedicated tiler heap descriptor, and I see no benefit to merging. So pretending it's a dedicated descriptor on Valhall too allows us to reuse the Bifrost code with no modifications. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	62173fa532	panfrost: Strip % in GenXML names A new Valhall enum will represent percentages, so allow that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Alyssa Rosenzweig	e514f4c0b1	panfrost: Flesh out Buffer descriptor Add fields required for structured buffers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14935>	2022-02-08 20:27:09 +00:00
Jason Ekstrand	4c61c8a0b8	vulkan,lavapipe: Simplify command recording code-gen The Entrypoint class already has utilities for gettingt he parameter list as either declarations or as comma-separated argument names for a call. Use that instead of hand-rolling it. The only modification we need to make is to add the ability to start the list somewhere other than at the beginning. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14919>	2022-02-08 19:50:57 +00:00
Mike Blumenkrantz	cb781fc350	lavapipe: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14911>	2022-02-08 18:38:20 +00:00
Mike Blumenkrantz	1532556eb0	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14911>	2022-02-08 18:38:20 +00:00
Mike Blumenkrantz	08c2b9d7cb	lavapipe: use util_pack_color_union() for generating clear colors this enables clamping for packed formats (e.g., RGB10_A2UI) where color values may exceed the width of the component cc: mesa-stable fixes (zink): KHR-GL45.direct_state_access.renderbuffers_storage* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14911>	2022-02-08 18:38:20 +00:00
Emma Anholt	6faaeca584	ci/freedreno: Add another unsizedArrayLength flake. Started appearing on Feb 1, but given that the rest of this test group flakes, I assume it's similar. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14934>	2022-02-08 18:04:37 +00:00
Andrii Pauk	cf4de7d8ff	venus: Allow usage of virtio-mmio based device Libdrm reports bustype as DRM_BUS_PLATFORM for virtio-mmio based device. DRM_BUS_PCI is reported only for virtio-pci based devices. Add possibility to use devices with DRM_BUS_PLATFORM. Signed-off-by: Andrii Pauk <Andrii.Pauk@opensynergy.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14531>	2022-02-08 17:53:21 +00:00
Daniel Schürmann	5e9df85b1a	aco: optimize discard_if when WQM is not needed afterwards Totals from 11560 (8.57% of 134913) affected shaders: (GFX10.3) CodeSize: 12092560 -> 11997652 (-0.78%) Instrs: 2205325 -> 2181598 (-1.08%) Latency: 15376048 -> 15356958 (-0.12%); split: -0.12%, +0.00% InvThroughput: 3526105 -> 3525120 (-0.03%); split: -0.03%, +0.00% Copies: 98543 -> 87601 (-11.10%) Branches: 16919 -> 16873 (-0.27%) PreSGPRs: 291584 -> 291532 (-0.02%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14805>	2022-02-08 16:16:07 +00:00
Daniel Schürmann	13c3137960	aco: merge block_kind_uses_[demote\|discard_if] These serve the same purpose. The new name is block_kind_uses_discard. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14805>	2022-02-08 16:16:07 +00:00
Daniel Schürmann	e7d1c8cc5e	aco: make Preserve_WQM independent from block_kind_uses_discard_if Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14805>	2022-02-08 16:16:07 +00:00
Daniel Schürmann	08b8500dfb	aco: remove block_kind_discard This case doesn't seem to happen in practice. No need to micro-optimize it. This patch merges instruction selection for discard/discard_if. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14805>	2022-02-08 16:16:07 +00:00
Daniel Schürmann	b67092e685	aco: emit nir_intrinsic_discard() as p_discard_if() This simplifies the code and emits a slightly better sequence in some cases. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14805>	2022-02-08 16:16:07 +00:00
Charles Baker	1895e17591	mesa: align constant/uniform uploads to driver expected alignment This fixed a problem for Zink where uniform buffer alignment varies by GPU, e.g. 64 bytes for an RTX 2070 SUPER but 256 bytes for a GTX 1070 Ti. Tested running Superposition on Windows 10 with Nvidia 1070 Ti with 496.13 driver. Without the fix Superposition soft locks on its splash screen. With the fix Superposition runs through its benchmark. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14674>	2022-02-08 15:08:21 +00:00
Charles Baker	418c77640b	zink: Fix MSVC RTC in zink_get_framebuffer_imageless() The bit fields in zink_framebuffer_state cause a false positive with MSVC's run-time checks enabled. setting state.num_attachments in zink_get_framebuffer_imageless(). Writing some bits of num_attachments involves reading bits from layers and samples that haven't been initialized. Fixed by assigning to num_attachments earlier in the function. Not quite sure why that makes a difference but at a guess there's a heuristic that considers assignment close to declaration as initialization. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14566>	2022-02-08 14:43:00 +00:00
Mike Blumenkrantz	86cb664cd8	zink: export PIPE_CAP_CULL_DISTANCE_NOCOMBINE fixes: KHR-GL46.cull_distance.functional Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14878>	2022-02-08 14:03:09 +00:00
Mike Blumenkrantz	7e9481eaac	gallium: add PIPE_CAP_CULL_DISTANCE_NOCOMBINE for drivers where separate cull distance variables are required, this lets them avoid having to write yet another pass to undo gallium's mangling of shader info Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14878>	2022-02-08 14:03:09 +00:00
Lionel Landwerlin	93a90fc85d	anv: fix conditional render for vkCmdDrawIndirectByteCountEXT We just forgot about conditional render for this entry point. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `2be89cbd82` ("anv: Implement vkCmdDrawIndirectByteCountEXT") Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14891>	2022-02-08 13:33:38 +00:00
Lionel Landwerlin	5d3e419378	anv: enable ray queries Only on platforms that support it. v3: Split out code setting up ray query shadow buffer (Caio) Don't forget to setup ray query globals even when no shadow buffer is used (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	c78be5da30	intel/fs: lower ray query intrinsics v2: Add helper for acceleration->root_node computation (Caio) v3: Update comment on "done" bit (Caio) Remove progress bool value for impl function (Caio) Don't use nir_shader_instructions_pass to search the shader (Caio) v4: Rename variable for if/else block (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	712d8fb043	intel/nir: document RT builder Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	e06f9d49bc	nir/lower_shader_calls: consider relocated constants as rematerializable After all they're constants. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	0465714790	intel/nir/rt: add more helpers for ray queries v2: Split stack_id helper in sync/async version (Caio) Fixup a few bit field mistake (Caio) Simplify some bitfield manipulations (Caio) v3: Remove duplicated helper (Caio) Simplify brw_nir_rt_set_dword_bit_at (Caio) Comment brw_nir_rt_query_mark_init (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	d5b994ec8a	intel/nir/rt: make RT manipulation helpers helper invocations ready Since we need to be able to perform ray queries in helper invocations, we need to have all the helpers properly tag their load/store operations so that they operate in helper lanes. v2: Switch from macros to inline functions (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	fb69fed65b	intel/nir: document committed argument Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	b0624e414f	intel/fs: make trivial shader complete tracing operations with missing shaders v2: Apply workaround only on < DG2-512-C0 & < DG2-128-B0 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	35bd19f53d	intel/nir/rt: load bvh_level value off mem_hit structure Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	6d9ae6ec1e	intel: add a new intrinsic to get the shader stage from bindless shaders We'll use this to apply ray tracing operations in our trivial return shader based on the stage we're in. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	b8f087b0e6	nir/builder: add nir_ior_imm() helper Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	bb40e999d1	intel/nir: use a single intel intrinsic to deal with ray traversal In the future we'll want to reuse this intrinsic to deal with ray queries. Ray queries will use a different global pointer and programmatically change the control/level arguments of the trace send instruction. v2: Comment on barrier after sync trace instruction (Caio) Generalize lsc helper (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	39f6cd5d79	intel/nir: fix shader call lowering We're replacing a generic instruction by an intel specific one, we need to remove the previous instruction. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c5a42e4010` ("intel/fs: fix shader call lowering pass") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	2665595244	intel/fs: limit FS dispatch to SIMD16 when using ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	57eed6698b	intel/compiler: tracker number of ray queries in prog_data Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:25 +00:00
Lionel Landwerlin	9b366243ed	intel/fs: load more fields from BVH instance leafs v2: Fixup mask (Caio) Drop old comment (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	c89024e446	intel/fs: don't set allow_sample_mask for CS intrinsics Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `77486db867` ("intel/fs: Disable sample mask predication for scratch stores") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	9d22f8ed23	intel/fs: add support for ACCESS_ENABLE_HELPER v2: Factor out fragment shader masking on send messages (Caio) Update comments (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	c199f44d17	intel/fs: name sources for A64 opcodes Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	23ce94ff7e	intel/nir/rt: add a new number of SIMD lanes per DSS helper v2: Add prefix brw_nir_rt (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	61c9b7a82e	intel/fs: add support for Eu/Thread/Lane id This index will be used for accessing ray query data in memory. v2: Drop a MOV (Caio) v3: Rework back code emission (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	3dabe93257	intel/fs: rework dss_id opcode into generic opcode We'll want different types of IDs based on topology. Let's make this more flexible and also move the bit shifting code a layer above where it's easier to do bitshifting operations, especially if you need to stash things into temporary registers. v2: Keep previous comment. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	4deb8e86df	nir: change intel dss_id intrinsic to topology_id This will allow to reuse the same intrinsic for various topology based ID. v2: fix intrinsic comment (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13719>	2022-02-08 12:55:24 +00:00
Lionel Landwerlin	fdb74a77d2	intel/ds: fix compilation with perfetto Forgot to test with perfetto... Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9da3d714b8` ("anv: add dynamic rendering traces") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5992 Reviewed-by: Rohan Garg <rohan.garg@intel.com> Tested-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14914>	2022-02-08 12:29:21 +00:00
Dylan Baker	a52e4871fe	meson: add radv to meson devenv I either rebased this out of the original PR, just failed to commit it and then reset it. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14879>	2022-02-07 21:22:12 -08:00
Mike Blumenkrantz	8335fdfeaf	vk/sync: add asserts for timeline semaphore count matching spec requires that the number of timeline waits/signals matches the base number of waits/signals if there are any timeline semaphores being processed by the submit, so asserting here is in line with what validation will yield failure to match these will also hang every driver I've tested, so asserting here potentially saves some people their desktop session Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14741>	2022-02-08 04:09:13 +00:00
Mike Blumenkrantz	388f23eabe	zink: min/max blit region in coverage functions these regions might not have the coords in the correct order, which will cause them to fail intersection tests, resulting in clears that are never applied cc: mesa-stable fixes: GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_all_buffer_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_color_and_depth_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_color_and_stencil_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_linear_filter_color_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_magnifying_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_minifying_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_missing_buffers_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_nearest_filter_color_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_negative_dimensions_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_negative_height_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_negative_width_blit GTF-GL46.gtf30.GL3Tests.framebuffer_blit.framebuffer_blit_functionality_scissor_blit Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14867>	2022-02-08 03:42:01 +00:00
Mike Blumenkrantz	b656ab75a6	zink: reject invalid draws cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14859>	2022-02-08 03:29:45 +00:00
Mike Blumenkrantz	e38c13830f	zink: fix PIPE_CAP_TGSI_BALLOT export conditional this requires VK_EXT_shader_subgroup_ballot cc: mesa-stable fixes (lavapipe): KHR-GL46.shader_ballot_tests.ShaderBallotAvailability KHR-GL46.shader_ballot_tests.ShaderBallotFunctionRead Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14858>	2022-02-08 03:16:06 +00:00
Mike Blumenkrantz	8907964dcd	zink: export PIPE_SHADER_CAP_TGSI_CONT_SUPPORTED this is supported and has been for a while Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14858>	2022-02-08 03:16:06 +00:00
Pierre-Eric Pelloux-Prayer	413bf889e7	radeonsi/blit: relax conditions to use sdma copy for prime buffers We don't need to check if it's imported: PIPE_BIND_DRI_PRIME is enough. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Pierre-Eric Pelloux-Prayer	3b27ad1504	radeonsi: create prime buffers as uncached `8791e831b1` marked imported prime buffers as uncached (useful when prime buffer is allocated by the display GPU), but they should also be created as uncached (useful when allocated by the render GPU). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Pierre-Eric Pelloux-Prayer	18c38bf78f	gallium: rename PIPE_BIND_DRI_PRIME The new name PIPE_BIND_PRIME_BLIT_DST is more precise. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Pierre-Eric Pelloux-Prayer	42c149e36b	gallium/dri: add missing PIPE_BIND_DRI_PRIME handling `e9c3dbd046` added PIPE_BIND_DRI_PRIME but it was only set when importing a prime buffer. This commit adds handling of this flag in the other codepath = the one where the prime buffer is allocated by the render GPU. With this change PIPE_BIND_DRI_PRIME is still only set for the render GPU - the display GPU will never see this flag; a future commit will rename it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14615>	2022-02-08 00:13:07 +00:00
Kenneth Graunke	3926be368e	ci/iris: Mark qbo tests as flakes These appear to have some kind of race condition and usually fail, but sometimes pass. We had already attempted to mark them as flakes on amly, but need to mark them as flakes on KBL+ too. See https://gitlab.freedesktop.org/mesa/mesa/-/jobs/18522605 and https://gitlab.freedesktop.org/mesa/mesa/-/jobs/18527737 where these unexpectedly passed on KBL, and also where the top-level test not being caught by the regex led to failures. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14916>	2022-02-07 14:30:52 -08:00
Zoltán Böszörményi	df1751a2bb	crocus: Enable compat profile the same way as core profile Signed-off-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11756>	2022-02-07 19:41:44 +00:00
Kenneth Graunke	604d97671b	iris: Add support for flushing the blitter (hackily) To flush the blitter, we need to use MI_FLUSH_DW rather than the usual PIPE_CONTROL we use on the 3D engine. Most of our code is set up to suggest flushes via PIPE_CONTROL commands, however, so we hackily just emit MI_FLUSH_DW when they ask for any kind of PIPE_CONTROL flush. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14912>	2022-02-07 09:50:01 -08:00
Kenneth Graunke	9c5dc4985b	blorp: Assert that blorp_copy() on the blitter can handle it Safeguards against callers that don't guarantee the necessary things. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14912>	2022-02-07 09:50:01 -08:00
Kenneth Graunke	d2646e147b	intel/genxml: Add missing MI_FLUSH_DW::Flush CCS field Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14912>	2022-02-07 09:50:01 -08:00
Rhys Perry	7ddad1b93a	radv: fix R_02881C_PA_CL_VS_OUT_CNTL with mixed cull/clip distances Matches radeonsi. Seems Vulkan CTS doesn't really test cull distances. Removing VARYING_SLOT_CULL_DIST0/VARYING_SLOT_CULL_DIST1 variables doesn't break any of dEQP-VK.clipping.*, except for tests which read the variables in the fragment shader. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5984 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14882>	2022-02-07 12:36:38 +00:00
Danylo Piliaiev	44bdac9849	tu: Implement VK_AMD_buffer_marker to support Graphics Flight Recorder Graphics Flight Recorder is: "The Graphics Flight Recorder (GFR) is a Vulkan layer to help trackdown and identify the cause of GPU hangs and crashes. It works by instrumenting command buffers with completion tags." This is a nice little tool which could help quickly identify the call which hanged. Or if command buffer is executed for too long. The tiling nature of our GPU shouldn't be a big issue aside from lower performance. For non-segfault case, if: - Hang happens at the same place in cmdbuf and draw/dispatch is not finished at that point - it is likely that there is an infinite loop in some of the shaders in this draw. - Hang happens always in different place - likely there is nothing wrong and command buffer just takes too long to execute and you should try increasing hangcheck_period_ms. If it doesn't help it is likely a synchronization issue. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13553>	2022-02-07 12:53:34 +02:00
Daniel Stone	b561946497	egl/wayland: Don't replace existing backbuffer in get_buffers If the surface already has a current backbuffer - say through a buffer_age query - we do not want to replace it in get_buffers, because it means the result we'd previously returned them is stale. If we already have a backbuffer set on the surface, keep it locked in no matter what until we hit SwapBuffers. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14873>	2022-02-07 09:57:41 +00:00
Daniel Stone	3da8300562	egl/wayland: Reset buffer age when destroying buffers A buffer age of 0 means that the buffer is uninitialised or has unknown content. We rely on the buffer age initially being 0 through zalloc when the surface is first created; when they are first used for a swap, we set their age to 1, and then we increment the age of every buffer in the chain with a non-zero age when we swap. Now that we can release buffers, both through dmabuf-feedback as well as detecting when we're using a deeper swapchain than the compositor needs, make sure to reset their age as they are released. Without doing this, the age will stay as it was before it was released and be incremented, returning the wrong age to the user the first time a previously-released buffer slot has been reused. Signed-off-by: Daniel Stone <daniels@collabora.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5977 Fixes: `22d796feb8` ("egl/wayland: break double/tripple buffering feedback loops") Fixes: `b5848b2dac` ("egl/wayland: use surface dma-buf feedback to allocate surface buffers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14873>	2022-02-07 09:57:41 +00:00
Emma Anholt	fa4390f7bf	ci/iris: Add skips and flakes notes for recent #intel-ci logs. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14904>	2022-02-07 09:33:59 +00:00
Emma Anholt	0cf32b5079	ci/crocus: Add recent flakes from #intel-ci Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14904>	2022-02-07 09:33:59 +00:00
Emma Anholt	6423045957	ci/softpipe,llvmpipe: Disable Xvfb server reset on piglit runs. The resets take time that we don't need to spend. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14904>	2022-02-07 09:33:59 +00:00
Samuel Pitoiset	9ea4029f9f	Revert "radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets."" The most famous RADV revert over the past months. This was an issue in RADV and not an use-after-free (descriptor set layouts can be destroyed almost at any time). This reverts commit `b775aaff1e`. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14621>	2022-02-07 08:24:36 +01:00
Samuel Pitoiset	66f7289d56	radv: add reference counting for descriptor set layouts The spec states that descriptor set layouts can be destroyed almost at any time: "VkDescriptorSetLayout objects may be accessed by commands that operate on descriptor sets allocated using that layout, and those descriptor sets must not be updated with vkUpdateDescriptorSets after the descriptor set layout has been destroyed. Otherwise, descriptor set layouts can be destroyed any time they are not in use by an API command." Based on ANV. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5893 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14621>	2022-02-07 08:24:36 +01:00
Dave Airlie	37c3be6947	crocus: find correct relocation target for the bo. If we have batch a + b, and writing to batch b, causes batch a to flush, all the bo->index get reset, and we try to submit a -1 to the kernel. Look the bo index up when creating relocations. Fixes crash seen in KHR-GL46.compute_shader.pipeline-post-fs and a trace from Wasteland 3 Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14905>	2022-02-07 18:01:36 +10:00
Zoltán Böszörményi	d774059a0c	crocus: enable GL46 tests for HSW in ci Signed-off-by: Zoltán Böszörményi <zboszor@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14889>	2022-02-06 23:46:28 +00:00
Alyssa Rosenzweig	0299600efb	asahi: Fix memory unsafety in delete_sampler_state The type is wrong, masked by a void*, meaning the free is completely wrong. ASan is rightfully unhappy. Fixes crashes destroying the context. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14901>	2022-02-06 15:46:55 -05:00
Alyssa Rosenzweig	786871c87e	agx: Don't kill helper threads in ld_var Apparently this is yet another .kill bit. Fixes: dEQP-GLES3.functional.shaders.derivate.dfdx.linear.* dEQP-GLES3.functional.shaders.derivate.dfdy.linear.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	367d93bcd4	agx: Handle texture array indices These need to be converted to integers. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	b459473bb9	agx: Implement nir_op_txb Like explicit LODs, biases must be 16-bit, so add a lowering rule for this. With the LOD mode selection updated for txb, we can then ingest biases like explicit LODs and allowlist txb. Passes: dEQP-GLES2.functional.shaders.texture_functions.fragment.texture2d_bias dEQP-GLES2.functional.texture.mipmap.2d.bias.* Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	e2903f66ec	agx: Translate LOD modes more generically Now includes support for auto_load_bias mode. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	d5b7d629d7	agx: Add AUTO_LOD_BIAS mode Automatic load with a bias. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14899>	2022-02-06 15:02:39 +00:00
Alyssa Rosenzweig	93f2ae1205	asahi: Correctly set IOGPU_ATTACHMENT::size Not sure what this is used for, but let's not lie to the kernel. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14898>	2022-02-06 09:48:34 -05:00
Alyssa Rosenzweig	daab41b80b	asahi: Identify IOGPU_ATTACHMENT::size Oops. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14898>	2022-02-06 09:48:34 -05:00
Charmaine Lee	945a1e0b8c	mesa: fix misaligned pointer returned by dlist_alloc In cases where the to-be-allocated node size with padding exceeds BLOCK_SIZE but without padding doesn't, a new block is not created and no padding is done to the previous instruction, causing a misaligned pointer to be returned. v2: Per Ilia Mirkin's suggestion, remove the extra condition in the first if statement, let it unconditionally pad the last instruction if needed. The updated currentPos will then be taken into account in the block size checking. This fixes crash seen with lightsmark and Optuma apitraces Fixes: `05605d7f53` (' mesa: remove display list OPCODE_NOP') Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Neha Bhende <bhenden@vmware.com> Tested-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14871>	2022-02-05 22:45:01 +00:00
Neha Bhende	9230b28533	svga: store shared_mem_size in svga_compute_shader instead of svga_context When new context was created, shared_mem_size was getting overwritten. This fixes glretrace failure seen with manhattan, aztec and BASS2_intro apitraces Fixes: `247c61f2d0` ('svga: Add support for compute shader, shader buffers and image views') Tested with glretrace, piglit Reviewed-by: Charmaine Lee <charmainel@vmware.com> (cherry picked from commit dd6793ec9218782b1b716a87582d7219bae4e75f) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14870>	2022-02-05 22:17:32 +00:00
Kenneth Graunke	c7a357787f	anv: Increase maxUniformBufferRange to 2^30 when not using the sampler The limit here is from the RENDER_SURFACE_STATE height/width/depth fields - it's 2^30 for ISL_FORMAT_RAW buffers, and 2^27 otherwise. anv_isl_format_for_descriptor_type() uses ISL_FORMAT_R32G32B32A32_FLOAT for uniform buffers when compiler->indirect_ubos_use_sampler is set (Icelake and earlier), but ISL_FORMAT_RAW when it isn't (Tigerlake+). So we can increase the limit on Tigerlake and later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14892>	2022-02-05 11:23:16 -08:00
Pavel Ondračka	49f4f1ec22	r300: fix deadcode elimination in loops with breaks We are updating the deadcode state while walking the program backwards. When encountering ENDLOOP, we scan the loop, mark everything in the loop as used and than continue as usuall. We were previously trying to be smart with the breaks. This was however not working as expected. Instead, save the most pesimistic deadcode state from the ENDLOOP and just restore it anytime we see a break. This keeps the code simple and more importantly does not touch the flat and IF(-ELSE)-ENDIF paths at all so reduces the chances of regression. No changes with my shader-db. Fixes piglits on RV530: shaders/ssa/fs-if-def-else-break.shader_test spec/glsl-1.10/execution/vs-loop-array-index-unroll.shader_test Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5832 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14661>	2022-02-05 15:39:26 +01:00
Lionel Landwerlin	9da3d714b8	anv: add dynamic rendering traces v2: Get rid of subpass_count Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14798>	2022-02-04 23:43:48 +00:00
Lionel Landwerlin	d0811ca046	anv: flush utrace before at device destroy Ensuring any remaining traces are displayed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14798>	2022-02-04 23:43:48 +00:00
Mike Blumenkrantz	960e72417f	zink: use scanout obj when returning resource param info embarrassing typo since the base obj has no modifier data available cc: mesa-stable fixes #5980 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14875>	2022-02-04 23:25:36 +00:00
Boris Brezillon	a8fbfcfbd3	pan/midg: Support 8/16 bit load/store Needed for panvk copy shaders to support 8 or 16bit formats. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	59ea6e2e27	pan/midg: Add a pass to lower non-logbase2 global/shared loads Compute shaders might do vec3(Xbits) loads which are translated to LD.<next_pow2(3 * X)> by the midgard compiler. This might cause out-of-bound accesses potentially leading to pagefaults if the access is at the end of a BO. One solution to avoid that (maybe not the best) is to split non-log2 loads to make sure we only read what's requested. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	3f9bce08e1	pan/midg: Fix swizzle packing on 64bit instructions with src-expansion + dst-shrinking In that case, the mask is specified on 32bit lanes, so we need to shift it if it's > 0x3. The expand modifier will take care of selecting the right side of the 32bit vector. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	da474d5d14	pan/midg: Fix the upper/lower limit on 8bit vectors If I'm correct, the lower/upper split on 8bit vectors is 8, not 4. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	b58c262144	pan/midg: Fix 64-bit swizzle printer Swizzling happens in 2 steps on Midgard: 1. Vector expansion/shuffling 2. Swizzling at the instruction-size granularity, but defined using the source size. Those size are different if the source is expanded. So, when we print 64 bit swizzles on an expanded source, we first need to apply an offset if the high part of the 32bit vector was selected, and then divide the result by 2 to account for vector expansion. To sum-up, swizzling on midgard is complicated, and I'm not sure I got it right, but it seems to print what I expect on the few compute shaders using 64bit arithmetic I debugged. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	39e4b7279d	pan/midg: Fix swizzling on 8-bit sources Even though 8-bit ALUs are not supported, we can have [un]pack_32_4x8 instructions which translate to IMOVs, and those operate on 8-bit vectors. The problem is, the swizzling granularity is 16 bit, which means we don't support MOV.i8 R0.xyzw, TMP0.xxxx, R1.zyxw and the compiler doesn't even complain, it just applies 8 bit swizzling directly, which obviously doesn't work. This is probably not the right way to fix that, but I thought I'd raised the issue with a hack to fix, so we can get the discussion started. (Found while debugging FB store lowering on Midgard). Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	65209b1adb	pan/midg: Prefix scalar immediates with '#' instead of '<' We already do that for scalar instructions, so let's do it for vector instructions with a single component too. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	36bb1ac453	pan/midg: Remove spurious printf() in print_vector_constants() Also tried to replace that one by an fprintf(fp, ...), but it pollutes the dump too. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:35 -05:00
Boris Brezillon	967eb4988e	pan/midg: Add intra-bundle interferences The register allocator assumes instructions are executed sequentially and allows one instruction to overwrite a portion of a register written by a previous instruction if this portion is never used. But scalar and vector ALUs might be executed in parallel if they are part of the same bundle, and when such instructions write to the same portion of the register file, the result is undefined. Let's add intra-bundle interferences to avoid this situation. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14885>	2022-02-04 17:12:34 -05:00
Danylo Piliaiev	183bc15bdb	turnip: Unconditionaly remove descriptor set from pool's list on free We didn't remove desc set from the pool's list if pool was host_memory_base. On the other hand in there is no point in removing desc set from the list in DestroyDescriptorPool/ResetDescriptorPool. Fixes: `da7a4751` ("turnip: Drop references to layout of all sets on pool reset/destruction") Fixes cts tests: dEQP-VK.api.buffer_marker.graphics.default_mem.bottom_of_pipe.memory_dep.draw dEQP-VK.api.buffer_marker.graphics.default_mem.bottom_of_pipe.memory_dep.dispatch Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14855>	2022-02-04 21:07:30 +00:00
Jesse Natalie	81061ed645	docs: Update d3d12 features Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	444e18beaa	d3d12: GL4.2 Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	126d992097	d3d12: Allow RGB VS inputs without an alpha channel Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	a3a3599a08	d3d12: When adding new output varyings, write 0s This avoids undefined behavior in some cases, and in the case where the new output varying is actually a sysval like viewport index, the DXIL validator will require it to be written. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	ccaa79a1ba	d3d12: Don't add arrayed VS outputs when next stage uses per-vertex inputs Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	1b11113efa	d3d12: Don't force a GS to be added for 'flat' sysvals Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	3feae7ec5d	d3d12: Update nir varying bitmasks when linking stages Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	27deedc104	d3d12: Fix location compares in MSAA disable Locations can only be compared against SYSTEM_VALUE_* if the var mode is system_value. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	a26f647caa	d3d12: Update depth invert to deal with multi-viewport Turn the context state and shader key into a bitmask. When lowering the depth invert into the shader, scan for writes to viewport index. If found, move position to the end of the function (or current vertex) and check if the current viewport needs the depth invert before applying. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	52b3e6be5f	d3d12: Fix linkage for viewport index Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	bafa0e0369	d3d12: Bind 16 scissor rects when scissor disabled Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	929985893a	d3d12: Enable BPTC (BC6/BC7) Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	918647000f	microsoft/compiler: Set flag for VP/RT array index from VS/DS Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Jesse Natalie	5954c8e524	microsoft/compiler: Handle SV_ViewportArrayIndex Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14881>	2022-02-04 20:49:23 +00:00
Chia-I Wu	47f4cb2405	zink: set needs_mesa_flush_wsi for venus venus relies on wsi_image_create_info and wsi_memory_signal_submit_info. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14872>	2022-02-04 19:58:40 +00:00
Chia-I Wu	737d94a545	zink: always chain wsi_image_create_info for scanout images Chaining wsi_image_create_info tells the drivers that the image can use VK_IMAGE_LAYOUT_PRESENT_SRC_KHR layout. We still use wsi_image_create_info::scanout to indicate whether this is legacy scanout or uses modifiers. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14872>	2022-02-04 19:58:40 +00:00
Chia-I Wu	66e6e8afe6	zink: set dma-buf bit for shared resources Set the dma-buf bit when supported. This is done because zink_resource_get_handle exports dma-bufs when WINSYS_HANDLE_TYPE_FD is requested. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14872>	2022-02-04 19:58:40 +00:00
Alyssa Rosenzweig	f8feaee0dd	agx: Call nir_lower_discard_if We still need to implement discard itself, but this means we don't need to worry about discard_if. This compiles down to the same idiom as the vendor compiler (Metal) generates Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14217>	2022-02-04 19:09:23 +00:00
Alyssa Rosenzweig	aaf00a1b4d	nir,zink: Make lower_discard_if a common pass This pass (lowering discard_if to control flow and unconditional discard) was originally written for Zink, but is useful for hardware that lacks conditional discard instructions like AGX. In theory AGX could implement a conditional discard with CSEL, but the vendor compiler uses a lowering like this one. Since I like not writing code, I'd like to use the pass that's already in tree. v2: Don't preserve dominance (Jason). Assert we don't see demotes or terminates (Jason). Add Mike's ack. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14217>	2022-02-04 19:09:23 +00:00
Adam Jackson	538356e3e6	glx: Use the new no-error driver interface Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12474>	2022-02-04 18:36:24 +00:00
Adam Jackson	b1d585ca36	egl: Use the new no-error driver interface Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12474>	2022-02-04 18:36:24 +00:00
Adam Jackson	3e3d75d16a	dri: Epoch how no-error context creation works The bug here is that the DRI context "flags" are intended to alias the GLX context flag values, and they don't, DRI's no-error flag is GLX's reset-isolation flag. GLX (and EGL!) treat no-error as a context attribute, and reset isolation predates Mesa's no-error implementation by several years. The GL_KHR_no_error spec does describe it as a "context flag", though, so maybe that's why we do it as a (DRI) context flag. In order to unalias these we need a new contract with the loader. We remove the old __DRI_NO_ERROR extension, and add a new __DRI_RENDERER_HAS_CONTEXT_NO_ERROR value to query. Loaders can key on that to know to pass no-error-ness through as a context attribute, matching the GLX/EGL calling convention. We go ahead and define __DRI_CTX_FLAG_RESET_ISOLATION as well, and update the drivers to refuse it since we don't support it yet. This means mismatched drivers/loaders will not be able to create no-error contexts. Too bad. If you want performance that badly you can build both things at once. Reviewed-by: Emil Velikov <emil.l.velikov@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12474>	2022-02-04 18:36:24 +00:00
Dylan Baker	5e994c5d98	meson: add LIBGL_DRIVERS_PATH to the devenv This allows using built dri OpenGL drivers with the `meson devenv` command Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14826>	2022-02-04 09:08:52 -08:00
Dylan Baker	2f916f2be6	meson: add support for `meson devenv` with vulkan Meson devenv is a feature added in meson 0.58 (thus the features is version guarded) that allows creating a shell environment with environment variables automatically setup for running the project inside the build dir. Some variables (such as LD_LIBRARY_PATH and PATH) are set automatically, others must be added by the project. For vulkan is is relativley simple, we create a new, uninstalled, icd file for each driver and set the VK_ICD_FILENAMES variable appropriately. This can be used with: ```sh meson devenv -C $builddir ``` then, vulkan applications will automatically use the uninstall vulkan driver, no need to install. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14826>	2022-02-04 09:08:47 -08:00
Erik Faye-Lund	3abe9ccbd4	vulkan/util: simplify multialloc init The syntax we're using doesn't work when included into C++ sources. So let's make it C++ compabible. It turns out, nobody needs this extra definition which is what's causing issues. Let's just make the initializer trivial without casting the struct. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14850>	2022-02-04 10:16:42 +00:00
Erik Faye-Lund	0d0ecbd987	vulkan/util: Add explicit casts to make c++ happy We're about to need including this header from a C++ source, so let's add some explicit casts for C++ compatibility. In one case we can make things a bit cleaner by moving the char-pointer-ism to the place that needs it, so let's clean that up while we're at it. Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14850>	2022-02-04 10:16:42 +00:00
Erik Faye-Lund	676c65d8d5	vulkan/util: Add extern "C" to allow inclusion from c++ We're about to need including this header from a C++ source, so let's wrap the whole thing in extern "C". Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14850>	2022-02-04 10:16:42 +00:00
Danylo Piliaiev	fded7a95c5	turnip: Expose VK_KHR_shader_non_semantic_info This is entirely implemented in the SPIR-V frontend. Relevant CTS tests: dEQP-VK.spirv_assembly.instruction.compute.non_semantic_info.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	ff059605aa	turnip: Implement VK_KHR_zero_initialize_workgroup_memory Moved nir_lower_compute_system_values to lower load_local_invocation_index which could be emitted by nir_zero_initialize_shared_memory. Relevant CTS tests: dEQP-VK.compute.zero_initialize_workgroup_memory.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	c6d1cac6e5	turnip: Expose VK_EXT_image_robustness VK_EXT_image_robustness is a strict subset of VK_EXT_robustness2 so we could just expose it. Relevant CTS tests: dEQP-VK.robustness.image_robustness.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	03f9deecb8	turnip: Use the shared helpers to expose 1.3 core extensions/limits Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Danylo Piliaiev	679713d5f9	turnip/doc: Update turnip extension list We were missing VK_EXT_subgroup_size_control and VK_EXT_extended_dynamic_state2. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14829>	2022-02-04 09:24:06 +00:00
Kenneth Graunke	fd0e4aedeb	iris: Make an iris_foreach_batch macro that skips unsupported batches IRIS_BATCH_BLITTER isn't supported prior to Tigerlake; in general, batches may not be supported on all hardware. In most cases, querying them is harmless (if useless): they reference nothing, have no commands to flush, and so on. However, the fence code does need to know that certain batches don't exist, so it can avoid adding inter-batch fences involving them. This patch introduces a new iris_foreach_batch() iterator macro that walks over all batches that are actually supported on the platform, while skipping the others. It provides a central place to update should we add or reorder more batches in the future. Fixes various tests in the piglit.spec.ext_external_objects.* category. Thanks to Tapani Pälli for catching this. Fixes: `a90a1f15` ("iris: Create an IRIS_BATCH_BLITTER for using the BLT command streamer") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14834>	2022-02-04 08:06:12 +00:00
Dave Airlie	c4b400285a	llvmpipe/triangle: don't store area in fixed_position. It doesn't get used again, just extract the sign and move on. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14846>	2022-02-04 06:31:15 +00:00
Dave Airlie	cd4d2e920c	llvmpipe: just move opaque alpha lookup closer to use. Saves looking this up before checking the variant. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14846>	2022-02-04 06:31:15 +00:00
Dave Airlie	a448695eee	llvmpipe: refactor lp_rast_shader_inputs. viewport_index won't be >= 16 layer should be < 2048 view_index should be < 2048 this leaves the last 64-bits as padding, which something expects, but not having to write to it means we have to write less memory every triangle. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14846>	2022-02-04 06:31:15 +00:00
Dave Airlie	fb17da6c50	llvmpipe/setup: remove opaque from setup triangle This isn't used in the rast side of things, so just pass it out here. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14846>	2022-02-04 06:31:15 +00:00
Dave Airlie	66188e8693	llvmpipe: inline retry_triangle_ccw This reduces some of the overheads in the callstack here. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14846>	2022-02-04 06:31:15 +00:00
Dave Airlie	086a9b7869	llvmpipe: optimise triangle setup a bit. the bboxpos = bbox copy was visible overhead on perf traces, but we don't need to really do it. Extract all the things from bbox needed early, then don't copy it just overwrite it. use boolean to fix power build. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14846>	2022-02-04 06:31:15 +00:00
Danylo Piliaiev	5e4bf6d100	turnip: Do not use hw binning if tiles per pipe are over the limit Otherwise GPU would hang. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5951 Freedreno commit as a reference: `39d00722b2` Fixes VK cts tests on a618 if their memory limit is raised to 1024 MB: dEQP-VK.pipeline.render_to_image.core.2d_array.huge.width_height.r8g8b8a8_unorm_d16_unorm dEQP-VK.pipeline.render_to_image.core.2d_array.huge.width_height.r8g8b8a8_unorm_s8_uint dEQP-VK.pipeline.render_to_image.core.2d_array.huge.width_height.r8g8b8a8_unorm_d24_unorm_s8_uint dEQP-VK.pipeline.render_to_image.core.2d_array.huge.width_height.r8g8b8a8_unorm_d32_sfloat_s8_uint dEQP-VK.pipeline.render_to_image.core.cube.huge.width_height.r8g8b8a8_unorm dEQP-VK.pipeline.render_to_image.core.cube.huge.width_height.r8g8b8a8_unorm_d16_unorm dEQP-VK.pipeline.render_to_image.core.cube.huge.width_height.r8g8b8a8_unorm_s8_uint dEQP-VK.pipeline.render_to_image.core.cube.huge.width_height.r8g8b8a8_unorm_d24_unorm_s8_uint dEQP-VK.pipeline.render_to_image.core.cube_array.huge.width_height.r8g8b8a8_unorm dEQP-VK.pipeline.render_to_image.core.cube_array.huge.width_height.r8g8b8a8_unorm_d24_unorm_s8_uint Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Tested-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14849>	2022-02-04 06:01:41 +00:00
Danylo Piliaiev	c6e8198f1b	turnip: Add TU_GMEM envvar to test different gmem sizes Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14849>	2022-02-04 06:01:41 +00:00
Omar Akkila	dac3e6f372	venus: Advertise VK_EXT_extended_dynamic_state support Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14866>	2022-02-04 04:23:18 +00:00
Omar Akkila	19e313e1c8	venus: Implement VK_EXT_extended_dynamic_state commands This implements hooks for the following commands: - vkCmdBindVertexBuffers2 - vkCmdSetCullMode - vkCmdSetDepthBoundsTestEnable - vkCmdSetDepthCompareOp - vkCmdSetDepthTestEnable - vkCmdSetDepthWriteEnable - vkCmdSetFrontFace - vkCmdSetPrimitiveTopology - vkCmdSetScissorWithCount - vkCmdSetStencilOp - vkCmdSetStencilTestEnable - vkCmdSetViewportWithCount Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14866>	2022-02-04 04:23:18 +00:00
Jesse Natalie	3affb69eaa	docs: Update d3d12 features Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	7430742b16	d3d12: ARB_gpu_shader_fp64 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	b8ecb8be79	d3d12: Handle structs in TCS variants Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	c448931d23	d3d12: Handle structs in GS variants Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	252a89a2c9	d3d12: Set lower full fp64 compiler options flag when needed Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	204102099a	d3d12: Lower [de]construction of doubles via math ops into pack/unpack ops Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	4daa3eac2c	d3d12: Add int64 support Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	397e117e96	d3d12: Get OPTIONS1 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	0144e7b18d	d3d12: Add a driver version to the screen to be used for workarounds Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	944832d3d7	d3d12: Cache a modifyable copy of the nir options in d3d12_screen Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	2529a0df89	d3d12: Use the right constant for GS varying limits Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	d6daa1cc7a	d3d12: Use a constant define for max anisotropy Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	bcfac68ce9	d3d12: Update max input, output, and varying caps The simple-varyings piglit test attempts to use GL_MAX_VARYING_FLOATS varyings, PLUS one additional vector for position (which is not used as input to the PS). "Reserve" that additional position vector by removing it from the max varyings and max PS inputs. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	0044e80b82	microsoft/compiler: Handle structs in I/O signatures Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	eb0cefae6d	microsoft/compiler: Map I/O base locations to input IDs When dealing with a vertex input that takes multiple rows, the value of nir_intrinsic_base points to a driver-location-based index, but we need to emit a location-based index (or more specifically, an index that increments once per input, not once per register). Add a mapping to the module of base -> ID. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	efe5c2d6f3	microsoft/compiler: Process signatures before the shader code This lets us set up some metadata based on I/O vars without having to do multiple passes over them. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	c154d403d3	microsoft/compiler: Handle I/O vars larger than a vec4 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	cdc49fb605	microsoft/compiler: Lower 64bit I/O to 32 and then run lower_pack Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	b24cfd0d40	microsoft/compiler: Handle b2f64 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	4d17393ba0	microsoft/compiler: Set dx11_1_double_extensions flag for dfma/ddiv Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	4c8935d325	microsoft/compiler: Fix dxil_nir_lower_double_math_instr pass for vectors Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	c8bd830dfb	microsoft/compiler: Fix make_double and split_double to respect swizzles Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	92191349e9	microsoft/compiler: Fix splitdouble struct name Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	dde3b04d44	microsoft/compiler: It's possible to have doubles without int64 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	3b9483e89d	microsoft/compiler: Add never-supported double ops to lower_doubles bitmask Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	ce6dbbabf9	microsoft/compiler: Only treat tess level location as special if it's a patch constant Fixes: `a550c059` ("microsoft/compiler: For load_input from DS, use loadPatchConstant") Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	0c711dc823	microsoft/compiler: Only prep phis for the current function Fixes: `41af9620` ("microsoft/compiler: Emit all NIR functions into the DXIL module") Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Jesse Natalie	87d22c2465	microsoft/compiler: Lower mul_2x32_64 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Reviewed-By: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14837>	2022-02-04 00:07:53 +00:00
Emma Anholt	a177f0de8f	ci: Uprev vulkan-cts to 1.2.8.0 This brings in some interesting new vulkan tests and fixes for the spurious KHR-GL TF failures. Also, reduces the runtime of dEQP-GLES31.functional.ssbo.layout.random.all_shared_buffer.36 so that it should stop timing out. Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Emma Anholt	3ce19d2db2	llvmpipe: Disable an assertion that may not be quite right. It triggered on uprevving VK-GL-CTS, and @airlied says it's tripped apparently spuriously before. There seems to be some interesting logic behind it, so leave the big comment for whoever can revisit the issue some day. Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Emma Anholt	6c2f6cd86f	ci/i915: Update rendering hash for plot3d trace. Its rendering changed slightly at some point, but it's fine. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Emma Anholt	940b9ff6c9	ci/freedreno: Reduce concurrency for a618 vk_full. This ran into OOM-kills with the CTS uprev. Looking at caselists at the time of fail, some had 500MB of system memory used by the CTS (mostly spirv string codegen), plus whatever BOs were allocated, and the lazors are only 4GB it looks like. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Emma Anholt	5039fc3dc7	ci/turnip: Extend the full-vk-run job timeouts. Between adding features and increased test coverage, we're hitting the 1-hour job limit. !13441 tried to increase the full run timeout for LAVA, but by having not bumped the gitlab-ci timeout value it ended up just letting the job keep running in LAVA after gitlab had given up on it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Emma Anholt	47d0e63c59	ci/freereno: Reduce run-by-default a630-vk coverage. In the autotune merge, we added another 1/15th run of a configuration knob, thinking that was small enough to be in the noise. But actually the main run is only 1/9th, so another 1/15th took us from nearly hitting the job runtime target, to totally missing it. Crank things back down to keep MRs flowing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13779>	2022-02-03 22:41:23 +00:00
Emma Anholt	4f22f4ca1a	r300: Simplify DCE by assuming all output writes are used. No change on shader-db. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14847>	2022-02-03 14:28:46 -08:00
Emma Anholt	17cea74b8c	r300: Set up shadow sampler lowering in precompiles. Otherwise you end up lowering all shadow samples to a MOV dst temp[0].0000, which is pretty silly. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14847>	2022-02-03 14:28:44 -08:00
Emma Anholt	5f55e7b845	r300: Fix missing \n in an error message. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14847>	2022-02-03 14:28:41 -08:00
Mike Blumenkrantz	41ed470f6f	zink: add synchronization for conditional render buffer doesn't seem to do anything on any drivers I've tested, but maybe it's needed somewhere Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14853>	2022-02-03 19:15:26 +00:00
Mike Blumenkrantz	1e96542390	zink: add VK_BUFFER_USAGE_CONDITIONAL_RENDERING_BIT_EXT for query binds required by spec cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14853>	2022-02-03 19:15:26 +00:00
Rhys Perry	0447a2303f	aco: don't encode src2 for v_writelane_b32_e64 Encoding src2 doesn't cause issues for print_asm() because we have a workaround there, but it does for RGP and it seems the developers are not interested in fixing it. https://github.com/GPUOpen-Tools/radeon_gpu_profiler/issues/61 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Tested-by: Tatsuyuki Ishi <ishitatsuyuki@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14832>	2022-02-03 16:52:00 +00:00
Rhys Perry	5e3b8eeac4	aco: add test for optimizations with casts Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	6b1dfa7eac	aco: fix neg(mul)/abs(mul) optimization with different bit-size Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	13bbc7c882	aco: don't combine add/mul of different bit-size Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	3d8a8c6fc1	aco: don't apply omod/clamp of different bit-size Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	7e30f99b0a	aco: don't combine fneg/fabs of different bit-size Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	27f1f5537d	aco/tests: implement sub-dword program inputs Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Rhys Perry	e86b88f85b	aco/tests: add a bunch more building helpers Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14810>	2022-02-03 16:02:04 +00:00
Alyssa Rosenzweig	1410d150e7	panfrost: Fix texel interleave flag on Valhall Interleave mode specified per-plane on Valhall. The texture descriptor proper merely has a flag specifying whether planes are somehow interleaved (u-interleaved, AFBC, or block compressed formats) or whether they are all linear (and uncompressed). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14851>	2022-02-03 15:43:27 +00:00
Alyssa Rosenzweig	3bf34a1494	panfrost: Add remaining ZS/CRC XML Flesh out the ZS/CRC XML, adding fields required for AFBC. Valhall allows AFBC compressing stencil buffers independent of depth buffers, which is a new feature since Bifrost. That results in a shuffling of the descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14851>	2022-02-03 15:43:27 +00:00
Alyssa Rosenzweig	bfba7533c7	panfrost: Add Valhall Plane Descriptor XML This looks superficially like the Bifrost "Surface" descriptor, but it additionally specifies the in-memory representation of blocks (clumps). If I understand correctly, decompression is controlled by the plane descriptor, rather than the texture descriptor level. This is a bit more flexible than Bifrost. Once the new fields here are wired up to Mesa, my dEQP-GLES2.functional.texture.* failures should go away... I hope! Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14851>	2022-02-03 15:43:27 +00:00
Alyssa Rosenzweig	c34381d8e8	panfrost: Fix alignments on Valhall Otherwise we get DATA_INVALID_FAULT trying to run even trivial null jobs. For each descriptor, set the correct alignment. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14851>	2022-02-03 15:43:27 +00:00
Alyssa Rosenzweig	a98f0e280e	panfrost: Remove blend shader return value on v9 Removed since there's a new ABI for blend shaders. Even if we always write 0, it's better not to pack this at all, and to denoise the dumps. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14851>	2022-02-03 15:43:27 +00:00
Alejandro Piñeiro	5d8c659678	v3d/drm-shim: remove drm-shim driver After starting to use a new version of the simulator, it got outdated. We made some initial effort to update it, but it was not working. Taking into account that no one is using it, it is better to just remove it. We keep the noop drm drivers, as they could have some value for developers that doesn't have access to the v3dv3 simulator. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14682>	2022-02-03 09:53:29 +00:00
Shirish S	6f17d8acc9	radeonsi: allocate protected buffer only if required protected buffer allocations need to be made if the context is secure Signed-off-by: Shirish S <shirish.s@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14848>	2022-02-03 10:34:12 +01:00
Pierre-Eric Pelloux-Prayer	eaa87b1a46	radeonsi: limit loop unrolling for LLVM < 13 Without this change LLVM 12 hits this error: """ LLVM ERROR: Error while trying to spill SGPR0_SGPR1 from class SReg_64: Cannot scavenge register without an emergency spill slot! """ when running glcts KHR-GL46.arrays_of_arrays_gl.AtomicUsage test. Fixes: `9ff086052a` ("radeonsi: unroll loops of up to 128 iterations") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14848>	2022-02-03 10:34:12 +01:00
Samuel Pitoiset	52c850445e	radv: stop setting streamout state when a new pipeline is bound It's required to have a valid graphics bound pipeline with XFB when vkCmdBeginTransformFeedbackKHR() is called. This removes extra work when binding a pipeline. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14610>	2022-02-03 07:57:07 +00:00
Iago Toral Quiroga	7561ea8fa1	broadcom/compiler: allow ldunifa with read-only SSBOs Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14830>	2022-02-03 07:35:07 +00:00
Iago Toral Quiroga	0a8449b07c	broadcom/compiler: fix offset alignment for ldunifa when skipping The intention was to align the address to 4 bytes (32-bit), not 16 bytes. Fixes: `bdb6201ea1` ("broadcom/compiler: use ldunifa with unaligned constant offset") Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14830>	2022-02-03 07:35:07 +00:00
Dylan Baker	04f6e91de0	docs: update calendar for 22.0.0-rc1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14845>	2022-02-02 23:54:10 +00:00
Dylan Baker	647df89664	docs: reset new_features.txt Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14843>	2022-02-02 23:46:15 +00:00
Mike Blumenkrantz	f8a9010410	llvmpipe: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14835>	2022-02-02 23:03:40 +00:00
Mike Blumenkrantz	9a75392cd8	llvmpipe: disable PIPE_SHADER_CAP_FP16_CONST_BUFFERS this cap is broken cc: mesa-stable fixes: GTF-GL46.gtf21.GL2Tests.glGetUniform.glGetUnifor Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14835>	2022-02-02 23:03:40 +00:00
Mike Blumenkrantz	9a38dab2d1	zink: disable PIPE_SHADER_CAP_FP16_CONST_BUFFERS this cap is broken cc: mesa-stable fixes: GTF-GL46.gtf21.GL2Tests.glGetUniform.glGetUniform Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14835>	2022-02-02 23:03:40 +00:00
Dylan Baker	366d83a30e	VERSION: bump version for 22.0 release Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14840>	2022-02-02 22:49:09 +00:00
Bas Nieuwenhuizen	0395c483d4	radv: Handle SDMA for padding. Also assert that nobody actually needs to chain an SDMA IB because we have not implemented non-PKT3 chaining. Fixes: `ef40f2ccc2` ("radv/amdgpu: Fix handling of IB alignment > 4 words.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5923 Tested-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14781>	2022-02-02 22:23:17 +00:00
Emma Anholt	dbcdededb2	intel: Add missing dep of gen_*_header.py on utils.py. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14725>	2022-02-02 11:21:57 -08:00
Emma Anholt	3d5ee08c15	freedreno/isaspec: Add missing dep of encode.py/decode.py calls on isa.py Fixes: #5921 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14725>	2022-02-02 11:21:56 -08:00
Caio Oliveira	242c7a6513	anv: Add experimental support for VK_NV_mesh_shader Enable setting ANV_EXPERIMENTAL_NV_MESH_SHADER=1 environment variable. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Caio Oliveira	d9416cd8bd	intel/dev: Enable Mesh Shading for DG2 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	da273b2b7b	anv: Put first few push constants directly into Task/Mesh InlineData Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	27c32fd14b	anv: include ClipDistance array in mesh shader per-vertex output Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	c95b4ac2eb	anv: tell the hardware about gl_[Clip\|Cull]Distance in mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	bbde9f2448	anv: Implement indirect dispatch for Mesh pipeline Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	18e628135d	anv: Add support for UBOs, SSBOs and push constants in Mesh pipeline Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	f12f5ae2f8	anv: Add support for non-zero firstTask in vkCmdDrawMeshTasksNV Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Marcin Ślusarz	97da3e0814	anv: Enable conditional rendering in vkCmdDrawMeshTasksNV Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Caio Oliveira	ef04caea9b	anv: Implement Mesh Shading pipeline The Mesh pipeline is implemented as a variant of the regular (primitive) Graphics Pipeline. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Caio Oliveira	325ac235a7	anv: Add boilerplate for VK_NV_mesh_shader Use minimum values for the properties. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Caio Oliveira	c93cbc77f7	intel/common: Add helper for URB allocation in Mesh pipeline Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Caio Marcelo de Oliveira Filho	b01c73fd0a	intel: Add INTEL_URB_DEREF_BLOCK_SIZE_MESH And corresponding value in XML. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13662>	2022-02-02 18:17:57 +00:00
Alyssa Rosenzweig	4c38229ac1	pan/va: Add ARM_shader_framebuffer_fetch asm test This is a nontrivial chunk of code that makes for a nice dis/assembler test case (and caught a bug already...). Add it to the observatory. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	a99eac8a49	pan/va: Handle shift lanes in assembler Noticed in a program using ARM_shader_framebuffer_fetch. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	b3c7159308	pan/va: Add lots of swizzle assembler tests The swizzle handling in ISA.xml was broken in a bunch of place. Now that we've fixed these issues, let's add tons of tests to validate. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	57bb3c7158	pan/va: Add 2-channel 8-bit swizzles for conversions Instructions like V2S8_TO_V2S16 need a special 4-bit special selecting any two bytes. The definition is the same as Bifrost. Let's call this a half-swizzle since we need a name, and it is indeed half a swizzle... Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	cdb7c4e42d	pan/va: Vectorize 8->16-bit conversions Matches Bifrost, too. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	441c47ff74	pan/va: Fix lane select for [US]_TO_[USF]32 The lane select is in bit 28, this is covered by the "16-bit swizzle" mode. However, the source type isn't inferred from the name in valhall.py, so explicitly annotate the source as 16-bit. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	46cd0ddcb6	pan/va: Fix MKVEC.v2i16 lane select The lanes are at bit 28 and bit 26 respectively. This matches the 16-bit "swizzle" encoding. In general the handling of widens/swizzles/lane/lanes on Valhall is rather confused but... one problem at a time. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	a95ca2402c	pan/va: Test LD_TILE assembly Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	dc61e362f4	pan/va: Add missing fields to LD_TILE Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	0f9007985d	pan/va: Add missing <clamp/> to V2F32_TO_V2F16 For parity with Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Alyssa Rosenzweig	3fff018c98	pan/va: Add .absolute bit to BRANCHZI Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14833>	2022-02-02 17:42:01 +00:00
Lionel Landwerlin	665ffd4bf9	anv: Update VK_KHR_fragment_shading_rate for newer HW Per primitive & attachment shading rate support added. v2: Rebase on KHR_dynamic_rendering Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	fc837e9f8b	anv/pass: rely on precomputed dynamic rendering pass/subpass more For instance, the current code in genX_cmd_buffer.c assumes that the depth/stencil attachments & resolves will be at the end of all attachments, but that won't be the case anymore with fragment rate shading. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	16763e8b8e	anv: force primitive shading rate write in last geometry stage v2: Use new helper to check if stage supports variable shading rate setting v3: Update comment & iterate backward (Caio) Apply only to relevant platforms (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	0cd93c59ef	intel/compiler: add primitive rate output support Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	cebf284ac1	intel/compiler: add a new pass to lower shading rate into HW format Rework: * Jason: Modernize brw_nir_lower_shading_rate_output: 1. Use nir_shader_instructions_pass() 2. Use *_imm builder helpers. 3. Use nir_intrinsic_base() instead of ->const_index[0] v2: Also lower loads (Caio) v3: Update stage check to trigger lowering (Caio) v4: Assert on != MESH (Caio) v5: Fixup instruction insertion (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	e227bb9fd5	nir/builder: add ishl_imm helper v2: add (y >= x->bit_size) condition (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	3ab7f4471c	isl: disable CPB surface compression Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	dff08cbf8e	isl: add support for coarse pixel control surfaces Those surfaces are used as attachment to rendering passes and describe the rate of coarse pixel shading for the pass. v2: Move CPB_BIT tile filtering to isl_gfx125_filter_tiling() (Nanley) v3: Drop unused macro (Nanley) s/isl_to_gen/isl_encode/ (Nanley) Remove pitch alignment 128B constraint already covered by tiling (Nanley) Move some asserts together (Nanley) v4: Disable miptail for now (Nanley) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	8d90fe587f	intel/dev: details CPS feature support DG2 introduces per primitive coarse pixel settings (in stages preceding the PS shader) and also a control surface specifying the rate at through the resulting surface. v2: update comment (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	8bdbc93a9d	genxml: add new 3DSTATE_PS_EXTRA bit Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	ea71fb0b4b	genxml: gen12.5 changes for CPS v2: Make genxml look more like BSpec (Caio) Fixup X_Focal/Y_Focal entries (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Lionel Landwerlin	7d8884800e	compiler: add VARYING bit for primitive shading rate Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13739>	2022-02-02 17:09:46 +00:00
Filip Gawin	9322b76fc4	r300: replace recursive calls with loops Recursive "loops" tend to be more difficult to follow and understand. Additionally iterative approach should be nicer for compiler. (Less to allocate on stack and easier to optimize) Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13226>	2022-02-02 16:50:03 +00:00
Nanley Chery	f724f95542	intel/isl: Add more PRM text for HiZ/STC requirement Add text describing why HierarchicalDepthBufferEnable must be set along with SeparateStencilBufferEnable. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14825>	2022-02-02 16:25:10 +00:00
Nanley Chery	bc9ce9705c	intel/isl: Fix depth buffer TiledSurface programming The assert for the TiledSurface field caught a programming error, but with a segfault instead of the usual route of assert-failing. We only set this field when we have a depth surface, but we also need to set it when one isn't provided. Fix this issue and drop the assert. Fixes: `b77d694223` ("intel/isl: Allow HiZ with Tile4/64 surfaces") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5950 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14825>	2022-02-02 16:25:10 +00:00
Nanley Chery	146213d0ee	intel/isl: Simplify Z-buffer tiling config during emit For SNB and prior, assert that the surface is Y-tiled and use constants when configuring the tiling parameters. This makes a follow-on commit clearer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14825>	2022-02-02 16:25:10 +00:00
Shmerl	06aaa2cead	docs/features: Add VK_KHR_acceleration_structure, VK_KHR_pipeline_library, VK_KHR_ray_query, VK_KHR_ray_tracing_pipeline. Closes: #5901 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14796>	2022-02-02 08:06:34 +00:00
Chia-I Wu	b41adbf211	venus: update venus-protocol to 1.3.204 There should be no visible functional change. Although an unrelated change in the codegen replaced vn_info_extension_spec_version by vn_info_extension_get. We have to adapt to that. Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14782>	2022-02-02 06:57:24 +00:00
Emma Anholt	d4b6d03408	r300/r600: Add drm-shim support. I was tired of swapping gpus around just to check shader-db results of MRs for these. I put it in src/amd since it doesn't make sense in either of r300 or r600. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14809>	2022-02-02 00:59:08 +00:00
Emma Anholt	be78087655	r300: Disable fp16 and int16 in swtcl vertex shaders. We already had them disabled for hwtcl, but in the swtcl case gallivm's param query would return (nir) support even though nir-to-tgsi couldn't handle it because TGSI doesn't do fp16/int16. Fixes: `7d2ea9b0ed` ("r300: Request NIR shaders from mesa/st and use NIR-to-TGSI.") Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14809>	2022-02-02 00:59:08 +00:00
Iván Briano	965d58b058	anv: Report the right conformance version Fixes: `df8ac77af8` ("anv: Advertise Vulkan 1.3") Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14827>	2022-02-01 14:36:18 -08:00
Iván Briano	ea0fa5c6bc	anv: Handle resolveImageLayout on dynamic rendering Fixes: `5d9e8bc9be` ("anv: implement the meat of VK_KHR_dynamic_rendering") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5942 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14812>	2022-02-01 19:51:23 +00:00
Mike Blumenkrantz	1285319394	docs: update features/relnotes for zink sparse texture clamp Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14813>	2022-02-01 18:55:50 +00:00
Mike Blumenkrantz	cda3c22a01	zink: ARB_sparse_texture_clamp Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14813>	2022-02-01 18:55:50 +00:00
Samuel Pitoiset	1cadd19197	radv/winsys: fix missing buffer_make_resident() for the null winsys With latest Fossilize everything should now be captured correctly but without this, all Fossilize databases that need VK_EXT_custom_border_color would just crash. Cc: mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14821>	2022-02-01 18:18:28 +00:00
Caio Oliveira	8bab8f6422	compiler, intel: Add gl_shader_stage_is_mesh() And replace the previous Intel-specific function. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14823>	2022-02-01 17:41:25 +00:00
Danylo Piliaiev	f917c73528	ir3: opt_deref in opt loop to remove unnecessary tex casts Otherwise we may be left with such casts: vec1 32 ssa_72 = deref_var &shadow_map (uniform sampler2D) vec1 32 ssa_73 = deref_cast (texture2D )ssa_72 (uniform texture2D) vec1 32 ssa_74 = deref_cast (sampler )ssa_72 (uniform sampler) vec1 32 ssa_76 = (float32)tex ssa_73 (texture_deref), ssa_74 (sampler_deref), ssa_75 (coord), ssa_64 (comparator) And crash in ycbcr lowering since we aren't able to follow deref chain. Fixes crash in GFXBench Aztec Ruins Vulkan tests. See issue: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5945 Cc: mesa-stable Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14819>	2022-02-01 17:02:36 +00:00
Connor Abbott	0248644c89	ir3,tu: Enable subgroup shuffles and relative shuffles We still don't use the fast path for relative shuffles, that's left for future work. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:46 +00:00
Connor Abbott	09731fc79a	ir3/cp: ir3: Prevent propagating shared regs out of loops harder We need to check the source of the copy, not the destination. That means this we need to move this check inside the ifs, where we know that the source is a copy. Fixes: `590efd180b` ("Prevent propagating shared regs out of loops") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:46 +00:00
Connor Abbott	b6e5a03499	ir3: Rewrite (jp) insertion The old code was both too aggressive and not aggressive enough in inserting (jp), and it wasn't based at all on a solid understanding of how the hardware operates. It inserted an extra unnecessary (jp) at the beginning of an if statement right after the conditional branch, which is unnecessary. On the other hand, the only case where it didn't insert a (jp) was a block with one predecessor that has only one successor. We usually don't emit these kinds of blocks, or if we do then one of the blocks is empty, but there is a case where we do and the difference actually matters: for something like while (true) { if (...) { // do stuff ... break; } } The instructions inside the if could be moved below the loop, except that they are supposed to execute before control flow is merged if the loop trip count is nonuniform. The subgroup reduce/scan macro does exactly this, and relies on the control flow being correct. We have to insert a (jp) after the loop, which this code wasn't doing, breaking the scan macro. Since we're now using the physical edges in a non-trivial way, we have to preserve them better when we modify control flow in ir3_legalize. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:46 +00:00
Connor Abbott	53e54898e0	ir3: Fix copy-paste mistakes in ir3_block_remove_physical_predecessor() Fixes: `2768a35e41` ("ir3: Add pass to remove unreachable blocks") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Connor Abbott	503a5bae59	nir: Add support for lowering shuffle to a waterfall loop Qualcomm doesn't natively support shuffle, but it does natively support relative shuffles where the delta is a constant. Therefore we'll expose emulated support for both. Add support for this emulation of subgroupShuffle() to NIR. Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Connor Abbott	913bec10c4	nir/lower_subgroups: Rename lower_shuffle to lower_relative_shuffle This option only applies to relative shuffles (up/down/xor), and in a moment we're going to add an option to lower normal shuffles, so rename it. While we're here, rename lower_shuffle() to lower_to_shuffle() for similar reasons. Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14412>	2022-02-01 16:27:45 +00:00
Emma Anholt	bf289e3123	turnip: Store the computed iova in the tu_image. Less of a big deal than for buffers, but let's be consistent in how we handle our bindings. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14816>	2022-02-01 15:30:12 +00:00
Emma Anholt	f460fb3f91	turnip: Store the computed iova in the tu_buffer. We recently had a bug of forgeting to add the buf->bo_offset. Just make the easiest field to get be the bo->iova + buf->bo_offset already. Plus, a little less work at emit time. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14816>	2022-02-01 15:30:12 +00:00
Rhys Perry	ba44634e4d	aco: fix v_mac_legacy_f32 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `f68797ead7` ("aco: create v_mac_legacy_f32/v_fmac_legacy_f32") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5952 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14820>	2022-02-01 15:08:04 +00:00
Qiang Yu	d1e46d34f7	radeonsi: enable ARB_sparse_texture_clamp Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	95d3617909	glsl/nir: convert ir_texture->clamp to nir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	10a71c6106	glsl: add ARB_sparse_texture_clamp builtin functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	391f2dffaa	glsl: _textureCubeArrayShadow support clamp Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	de37146c71	glsl: _texture support clamp parameter Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	92d6b2735b	glsl: ir_texture add clamp field For ARB_sparse_texture_clamp to hold the lodClamp parameter. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	13ffd46a0f	glsl: add ARB_sparse_texture_clamp extension Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	3729c3fa30	mesa: add ARB_sparse_texture_clamp extension Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Qiang Yu	d68087a1d9	gallium: add PIPE_CAP_CLAMP_SPARSE_TEXTURE_LOD For ARB_sparse_texture_clamp. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14488>	2022-02-01 10:28:05 +00:00
Lionel Landwerlin	d014efa33e	util/utrace: make generated code a tiny bit nicer to look at Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14818>	2022-02-01 09:28:30 +00:00
Lionel Landwerlin	d0363baefd	util/u_trace: make mako conditional code easier to read It was difficult to read the conditional code. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14818>	2022-02-01 09:28:30 +00:00
Lionel Landwerlin	928156bc9d	intel/tracepoint: simplify tracepoint descriptions Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Suggested-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14818>	2022-02-01 09:28:30 +00:00
Simon Ser	713a4363e5	vulkan/wsi/wayland: remove format switch from wl_shm_format_for_vk_format Instead of maintaining two similar switches (one for DRM formats, one for wl_shm formats), only maintain a single switch (for DRM) and convert DRM formats to enum wl_shm_format. This reduces the risk to have inconsistencies between these two functions. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14634>	2022-02-01 08:00:22 +00:00
Simon Ser	5a82232e5c	vulkan/wsi/wayland: use DRM_FORMAT_INVALID Instead of using the magic value 0, use the define. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14634>	2022-02-01 08:00:22 +00:00
Simon Ser	fda2aecb8b	vulkan/wsi/wayland: use enum wl_shm_format libwayland defines an enum for wl_shm formats. Let's use it instead of uint32_t. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14634>	2022-02-01 08:00:22 +00:00
Lionel Landwerlin	bbe97c3871	docs: update INTEL_DEBUG environment variable documentation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5929 Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14800>	2022-02-01 07:53:21 +00:00
Iago Toral Quiroga	ce99b1a746	v3dv: don't submit noop job if there is nothing to wait on or signal Also, do not unconditionally flag signaling for submits without any command buffers. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14802>	2022-02-01 07:28:46 +00:00
Chia-I Wu	4fa24747b9	glthread: call _mesa_glthread_BindBuffer unconditionally _mesa_marshal_GetIntegerv expects those states to be tracked. I am not sure if this covers all states that _mesa_marshal_GetIntegerv needs, but this fixes Civ5 for virgl at least. Fixes: `e48f676835` ("glthread: don't sync for more glGetIntegerv enums for glretrace") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14659>	2022-02-01 06:11:22 +00:00
Mike Blumenkrantz	061bf72a4f	mesa: stop truncating MESA_GLSL=dump this adds a new helper function that can be used to directly dump longer strings to the log file, such as shaders fixes #5614 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14755>	2022-02-01 04:05:58 +00:00
Mike Blumenkrantz	b1b8b712c1	aux/vbuf: add fastpath for skipping identical vbuf updates the overhead of comparing these is MUCH less than the overhead of queuing a driver method and performing the update Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14640>	2022-02-01 01:17:09 +00:00
Mike Blumenkrantz	b733a22636	aux/vbuf: move mask-clearing for vbuf updates after buffer scanning Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14640>	2022-02-01 01:17:09 +00:00
Mike Blumenkrantz	cf6a616122	aux/vbuf: use local var for modifying unaligned_vb_mask during update Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14640>	2022-02-01 01:17:09 +00:00
Paulo Zanoni	28f677e6dc	iris: implement inter-context busy-tracking Previously, no buffers were ever marked as EXEC_OBJECT_ASYNC so the Kernel would ensure dependency tracking for us. After we implemented explicit busy tracking in commit `89a34cb845`, only the external objects kept relying on the Kernel's implicit tracking and Iris did inter-batch busy tracking, meaning we lost inter-screen and inter-context synchronization. This seemed fine to me since, as far as I understdood, it is the duty of the application to synchronize itself against multiple screens and contexts. The problem here is that applications were actually relying on the old behavior where the Kernel guarantees synchronization, so `89a34cb845` can be seen as a regression. This commit addresses the inter-context synchronization case. v2: Rebase after the changes from `a90a1f15a7`. This new version is significantly different. Cc: mesa-stable (see the backport in MR !14783) References: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14783 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5731 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5812 Fixes: `89a34cb845` ("iris: switch to explicit busy tracking") References: `a90a1f15a7` ("iris: Create an IRIS_BATCH_BLITTER for using the BLT command streamer") Tested-by: Konstantin Kharlamov <hi-angel@yandex.ru> (v1) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14505>	2022-02-01 00:34:55 +00:00
Paulo Zanoni	06b279ccb2	iris: save some iris_syncobj_reference() calls at update_bo_syncobjs() Save some iris_syncobj_reference() calls at at update_bo_syncobjs() by refactoring the function, moving the unnecessary calls out of the for loop. Also improve the documentation and drive-by fix a white space issue. Commit `a90a1f15a7` incremented IRIS_BATCH_COUNT and adjusted the code involving it. But the way it was done at update_bo_syncobjs() means we'll now call iris_syncobj_reference() more than the amount of times we need. This isn't a problem since in the second call we're moving the reference from something to the same thing, but still we can get away with just not doing it, so why not? There is also the question of whether we should really iterate over IRIS_BATCH_COUNT or something else (to save time on platforms that don't have as many batches as IRIS_BATCH_COUNT), but this is a matter for a separate patch (see MR !14747). In this patch we reorder the operations a little bit so when the time comes there will be just one loop to adjust. I also changed the order so first we look at's in bo_deps and only later we write to bo_deps. This is more future-proof and will make the next patch easier to land and understand. Reference: `a90a1f15a7` ("iris: Create an IRIS_BATCH_BLITTER for using the BLT command streamer") Reference: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14747 Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14505>	2022-02-01 00:34:55 +00:00
Nanley Chery	db475c81b7	iris: Return non-zero stride for clear color plane Before this patch, when querying the clear color plane's stride, iris would return the aux surface stride. This was okay because the clear color plane wasn't really used for anything. This doesn't work on XeHP however. On that platform, the aux surface stride is zero (because it doesn't have an ISL surface for the CCS). This is a problem because mesa's implementation of eglCreateImage rejects strides of zero (see dri2_check_dma_buf_attribs) and this value may be queried from GBM and passed into EGL. When the DG2 clear color modifier is enabled, this avoids EGL_BAD_ACCESS errors when running the piglit test, ext_image_dma_buf_import-intel_modifiers. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14759>	2022-01-31 23:22:35 +00:00
Nanley Chery	2407a7e6c1	iris: Pick the right BO in iris_resource_get_param Pick the clear color BO if the clear color plane is being queried. This avoids picking a NULL aux BO on XeHP. When creating shared resources, we place the gallium-visible planes in the same buffer object. However, when importing them, we aren't very strict about each plane sharing the same BO. So, instead of just using res->bo, we use a couple ternaries to figure out the right one. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14759>	2022-01-31 23:22:35 +00:00
Nanley Chery	ea5ffa0128	iris: Refactor a ternary in iris_resource_get_param Suggested-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14759>	2022-01-31 23:22:35 +00:00
Melissa Wen	db9098f2ef	v3dv: move sems_info from event_wait job to wait_thread info Semaphores info was stored as an info of event_wait cpu jobs and this leads to mem leak when the same event_wait job in the same cmd buffer batch was submitted more than once. As a result, `dEQP-VK.api.command_buffers.record_simul_use_primary` fails due to a double-free of sems_info. In this patch, we no longer use v3dv_event_wait_cpu_job_info to store semaphores from a submit info, since semaphores is related to a queue submission and not to the event_wait job type. If we spawn a wait_thread, we copy semaphores to an auxiliary struct (v3dv_wait_thread_info) that will be used in wait_thread to get job and semaphores information. When the spawned thread finishes, it releases the related v3dv_wait_thread_info and the semaphores copy as well. Fixes: `d5bd18fb` ("v3dv: store wait semaphores in event_wait_cpu_job_info") Suggested-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14736>	2022-01-31 23:01:54 +00:00
Jesse Natalie	f7ed838a49	d3d12: ARB_transform_feedback3 Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:34:28 -08:00
Jesse Natalie	30a2071ac7	d3d12: Handle indexed queries Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:34:27 -08:00
Jesse Natalie	d2bc8b305c	d3d12: Fix xfb varying matching for vars with location_frac Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:34:27 -08:00
Jesse Natalie	f4654f2c1c	d3d12: Unpack multi-stream varyings DXIL doesn't support a single varying having components belonging to multiple streams. For geometry shader outputs, whenever we find this to be the case, break up that variable into multiple sub- variables, and update stores to that variable to write to the new sub-variables instead. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:34:27 -08:00
Jesse Natalie	36add3d002	microsoft/compiler: Support multiple GS output streams Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:34:27 -08:00
Jesse Natalie	895cdbd6f0	microsoft/compiler: Correctly support I/O on variables with location_frac Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:34:27 -08:00
Jesse Natalie	db77360796	d3d12: ARB_transform_feedback2 Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:32:10 -08:00
Jesse Natalie	00dd573594	d3d12: Switch primitives-generated query to use XFB, GS, and IA data Per https://www.supergoodcode.com/primitive-pain/ this is how it should work Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:32:10 -08:00
Jesse Natalie	ba82ffaba6	d3d12: Rewrite subquery logic The previous logic was a bit bolted-on, using a linked list of queries to implement cases where a GL query needed maybe multiple D3D queries. Once the sub-query was created, it was always begun/ended along with the parent query, but that's not sufficient or correct. Instead, we need to be able to have mutually-exclusive subqueries underneath a parent query. This will let us handle using different counters for the same GL query in different pipeline configs, and then accumulating the results together. Fixes the arb_transform_feedback2-pause-counting piglit test. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:32:10 -08:00
Jesse Natalie	8875a2fb25	d3d12: Compute transform UBO0 is actually binding 1 Since lower_uniforms_to_ubo will unconditionally increment UBO indices from 0 to 1. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:32:10 -08:00
Jesse Natalie	317c870f0f	d3d12: Implement DrawAuto aka DrawTransformFeedback Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:32:02 -08:00
Jesse Natalie	7009d23865	d3d12: Move "fake" SO buffer handling to compute transforms instead of CPU readback Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:31:41 -08:00
Jesse Natalie	c1b52d8c3a	d3d12: Move compute transform state save/restore to compute_transforms.cpp Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:31:22 -08:00
Jesse Natalie	2668eb8089	d3d12: Add a compute transform for draw auto Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:31:22 -08:00
Jesse Natalie	371d237ba5	d3d12: Add a couple compute transforms for "fake" SO buffers This solves 2 problems with the CPU readback we're doing for this now: 1. It's not on the CPU 2. It handles gaps, leaving the destination buffer intact in those gaps Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:31:22 -08:00
Jesse Natalie	57f6eeb3fb	d3d12: Add a comment for what the existing compute transform does Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:31:21 -08:00
Jesse Natalie	396205b0d6	d3d12: SO buffer filled size is only 32-bit Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:31:19 -08:00
Jesse Natalie	5f48e6d7a2	d3d12: Move indirect compute to real indirect dispatches Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:30:49 -08:00
Jesse Natalie	cab0ed52e8	d3d12: Support transform feedback pause/resume Don't unconditionally reallocate/zero the fill buffer count, only when a specific value is to be assigned. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:30:45 -08:00
Jesse Natalie	b2aa21362d	d3d12: Include SO buffer count as a PSO dirty bit ctx->gfx_pipeline_state::num_so_targets is used when compiling PSOs, so make sure that changes to that value result in PSO recompiles. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:13:43 -08:00
Jesse Natalie	1d43e75228	d3d12: Add UAV barriers for UAVs that are being used by compute transforms If an indirect arg buffer is being produced by a compute shader, then when we go to consume it as an SSBO in a compute transform pass, we need to insert a UAV barrier to prevent the two dispatches from overlapping. For app dispatches, this is the app's responsibility via explicit barrier APIs, and if they don't, then they're allowed to overlap. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:13:33 -08:00
Jesse Natalie	31aaf92b7d	d3d12: Fix compute transform for multi-draw indirect with dynamic count + state vars NIR validation will complain on the UBO range not being set. Fixes: `3a8c8d25` ("d3d12: Add a compute transformation to handle indirect draws that need draw params") Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:13:22 -08:00
Jesse Natalie	2d4ee41df0	microsoft/compiler: Fix UAV resource ID counting for static indexed handles Skip resource space 2 after computing the ID it would've used. Fixes: `e5f353f2` ("microsoft/compiler: Emit statically-indexed resource handles and scratch later") Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14787>	2022-01-31 13:13:12 -08:00
Mike Blumenkrantz	6f38ea4ac7	zink: use SpvScopeDevice over SpvScopeWorkgroup for atomic shader ops Workgroup is only allowed in compute shaders, and Device should be more in line with the intended use here the alternative would be to keep using Workgroup for compute and use Device otherwise, but this would effectively make atomic ops non-atomic, which seems like it isn't desirable cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14690>	2022-01-31 20:34:07 +00:00
Mike Blumenkrantz	2361c52b5e	zink: cast image atomic op params/results based on image type according to spec, these must match the texel pointer type cc: mesa-stable fixes (nvidia): dEQP-GLES31.functional.image_load_store.2d.atomic.exchange_r32f_return_value dEQP-GLES31.functional.image_load_store.2d_array.atomic.exchange_r32f_return_value dEQP-GLES31.functional.image_load_store.cube.atomic.exchange_r32f_return_value Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14690>	2022-01-31 20:34:07 +00:00
Mike Blumenkrantz	58e201c66e	zink: add warning printf for drivers missing VK_EXT_shader_atomic_float Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14690>	2022-01-31 20:34:07 +00:00
Mike Blumenkrantz	7e8d609f6e	zink: enable VK_EXT_shader_atomic_float this is needed for atomic ops, but we can let drivers that don't support it have some warning messages instead of gating features Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14690>	2022-01-31 20:34:07 +00:00
Mike Blumenkrantz	8e97f51c67	zink: handle swizzled offset/count values for shader bitfield ops glsl/nir automatically swizzle the value if a vecN is being used, but spirv requires a single scalar, so this has to be detected and unswizzled to avoid violating spec and crashing shader compilers that are less permissive than mesa's fixes (nvidia): dEQP-GLES31.functional.shaders.builtin_functions.integer.bitfieldextract* KHR-GL46.shader_bitfield_operation.bitfieldExtract* Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14690>	2022-01-31 20:34:07 +00:00
Boris Brezillon	11e2c4b502	microsoft/spirv_to_dxil: Define idep_libspirv_to_dxil So we can re-use it when we need to define a dependency on spirv_to_dxil. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Boris Brezillon	ef47a6800b	microsoft/spirv_to_dxil: Make sure the SampleMask is a uint DXIL doesn't like when SV_Coverage (AKA SampleMask in DXIL) is a signed integer. Fix the type while we're in the NIR domain. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Boris Brezillon	7e56d8c393	microsoft/spirv_to_dxil: Lower atomics to their dxil variants Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Boris Brezillon	c2eeba04c3	microsoft/spirv_to_dxil: Discard PSIZ accesses D3D12 doesn't support gl_PointSize, so let's consider PointSize is always 1.0 and discard any PointSize access. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Louis-Francis Ratté-Boulianne	5cd2bf837d	microsoft/spirv_to_dxil: Allow passing a vulkan -> d3d12 binding mapping table Vulkan bindings take only one slot per variable, but d3d12 ones take one slot per entry when the variable is an array. This forces us to pass an explicit vulkan -> d3d12 mapping table when dealing with vulkan shaders. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Louis-Francis Ratté-Boulianne	de1e941c59	microsoft/spirv_to_dxil: Lower push constant loads to UBO loads Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Louis-Francis Ratté-Boulianne	d11a417ded	microsoft/spirv_to_dxil: lower input attachments Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Louis-Francis Ratté-Boulianne	e65303c6e6	microsoft/spirv_to_dxil: check for variables r/w access Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Boris Brezillon	39592f8ad4	microsoft/spirv_to_dxil: Support [un]conditional YZ-flip The unconditional Y-flip is needed for Vulkan 1.0 since D3D12 and Vulkan coordinate systems differ. Conditional YZ-flip is needed if we want to support negative viewport height/depth. Prepare spirv_to_dxil() to support that, and while at it, prepare things for multi-viewport: the Y/Z flips are per-viewport and encoded in a 32bit bitmask, with the upper 16bits reserved for Z flips, and the lower 16bits reserved for Y flips. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Boris Brezillon	225867684a	microsoft/spirv_to_dxil: Allow dumping NIR Dumping NIR shaders is a useful debug feature. Let's tweak the spirv_to_nir() helper so we can pass debugging options and add one to allow dumping NIR. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Boris Brezillon	27790c4a7a	microsoft/spirv_to_dxil: Remove dead variables after the struct split pass Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14765>	2022-01-31 20:21:25 +00:00
Jason Ekstrand	d85a9d658f	anv/image: Call into WSI to create swapchain images This guarantees that we get an image that's created with exactly the swapchain image creation parameters instead of trying to emulate it inside the driver. Ideally, we'd use the fancy new bind helper too but our magic ANV_IMAGE_MEMORY_BINDING_PRIVATE gets in the way and we really do want to re-bind ourself. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	a2e986b6d9	anv/image: Add some asserts when binding swapchain images Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	27042d135e	vulkan/wsi: Add image create and bind helpers These are needed to properly implement the Vulkan 1.1 swapchain image create/bind functionality. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	ca791f5c5d	wsi/common: Set VK_IMAGE_CREATE_ALIAS_BIT With Vulkan 1.1, we have a VkImageSwapchainCreateInfoKHR struct which lets you create a new VkImage which aliases a swapchain image. However, there is no corresponding swapchain create flag so we have to set VK_IMAGE_CREATE_ALIAS_BIT all the time. We need to do a bit of work in ANV to prevent it from asserting the moment it sees one of these. Fortunately, they're already safe because WSI images go through a different bind path for VkBindImageMemorySwapchainInfoKHR. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	1abab1a28f	vulkan/wsi/drm: Drop wsi_create_native/prime_image Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	ed391d2a46	vulkan/wsi/win32: Break create_win32_image in pieces This is similar to the previous two commits that we did for DRM native images. It breaks it into configure/create/bind and calls wsi_create_image to walk through the three-step process. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	d7ad73d6b7	vulkan/wsi/win32: Delete unnecessary copy+paste from DRM The Win32 WSI just does a copy into the window on the CPU. There's no need for external memory or modifiers or implicit sync or any of that. While we're at it, rename to create_win32_image. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	d95e3fd98c	vulkan/wsi/display: Split image creation Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	b626a5be43	vulkan/wsi/wayland: Split image creation Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:55 +00:00
Jason Ekstrand	d67250d444	vulkan/wsi/x11: Split image creation Store the wsi_image_create_info in the swapchain and call wsi_configure_*_image once per swapchain and then use wsi_create_image for each image. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:54 +00:00
Jason Ekstrand	579578f10a	vulkan/wsi/drm: Break create_prime_image in pieces This is similar to the previous two commits that we did for DRM native images. It breaks it into configure/create/bind/finish and calls wsi_create_image to walk through the process. The primary difference is that prime images need fifth step in the process to set up the blit command buffer. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:54 +00:00
Jason Ekstrand	830d9967db	vulkan/wsi: Add a helper for the configure/create/bind pattern Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:54 +00:00
Jason Ekstrand	5b13d74583	vulkan/wsi/drm: Break create_native_image in pieces Instead of making create_native_image one monolithic function, break it into a configure stage and a create stage. The configure stage is further broken up, first into a common piece that constructs a simple VkImageCreateInfo and a couple chain-ins. The second adds the extra stuff for create_native_image. This is to prepare for eventually storing those structs in the swapchain itself. Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:54 +00:00
Jason Ekstrand	8299d5f37f	vulkan/wsi: Set MUTABLE_FORMAT_BIT in the prime path Fixes: `4bdf8547f4` "vulkan/wsi: Implement VK_KHR_swapchain_mutable_format" Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12031>	2022-01-31 19:46:54 +00:00
Caleb Callaway	7483c40ba0	vulkan/overlay: revise and reformat README Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14784>	2022-01-31 19:09:56 +00:00
Chia-I Wu	9eb1592e57	turnip: respect buf->bo_offset in transform feedback buf->bo->iova should always be offset by buf->bo_offset. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14786>	2022-01-31 18:31:54 +00:00
Georg Lehmann	cbe4943ae9	vulkan/wsi/wayland: Fix add_drm_format_modifier aplha/opaqueness. This had the opposite problem of the shm path. R8G8B8A8 was always support if either DRM_FORMAT_XBGR8888 or DRM_FORMAT_ABGR8888 was supported, but we need both. Fixes: `d944136f36` ("vulkan/wsi/wayland: don't expose surface formats not fully supported") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14588>	2022-01-31 17:50:01 +00:00
Georg Lehmann	9843fddfff	vulkan/wsi/wayland: Add modifiers for RGB formats. These formats get overwritten after the FALLTHROUGH, so no modifers got added to them at all. Fixes: `151b65b211` ("vulkan/wsi/wayland: generalize modifier handling") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14588>	2022-01-31 17:50:01 +00:00
Georg Lehmann	a881b6ac1f	vulkan/wsi/wayland: Convert missing vulkan formats to shm formats. Fixes: `6b36f35734` ("vulkan/wsi/wl: add wl_shm support for lavapipe.") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14588>	2022-01-31 17:50:01 +00:00
Georg Lehmann	4ae4e04e18	vulkan/wsi/wayland: Fix add_wl_shm_format alpha/opaqueness. We need both the SHM format with alpha and the opaque format to fully support a vulkan format with alpha. Previously no surface format was reported because the vulkan formats with aplha were never added as opaque. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5879 Fixes: `d944136f36` ("vulkan/wsi/wayland: don't expose surface formats not fully supported") Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14588>	2022-01-31 17:50:01 +00:00
Christian Gmeiner	1d75b459a6	etnaviv: add support for INTEL_blackhole_render Passes the following piglits: - spec@intel_blackhole_render@intel_blackhole-draw_gles2 - spec@intel_blackhole_render@intel_blackhole-draw_gles3 Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14792>	2022-01-31 16:52:29 +00:00
Boris Brezillon	a4c8508c37	microsoft/compiler: textureLoad() doesn't take a LOD on MS textures Make sure the LOD is zero in that case. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13993>	2022-01-31 16:37:47 +00:00
Boris Brezillon	951fd35012	microsoft/compiler: Skip images in redirect_texture_derefs() The input attachment lowering pass turns input attachment loads into texel fetch operation, and insert an image -> texture deref cast along the way. In this situation, we can end up with a texture deref chain pointing to an image variable, which is not a combined sampler+texture object. Bail out when an image type is found, like we do for bare textures. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13993>	2022-01-31 16:37:47 +00:00
Boris Brezillon	678b94c2d8	microsoft/compiler: Fix sampler/texture array emission Those need to be declared as sampler/SRV arrays, as we do for UAVs and CBVs. Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13993>	2022-01-31 16:37:47 +00:00
Louis-Francis Ratté-Boulianne	54c32aeba6	microsoft/compiler: Use SRVs for read-only images Acked-by: Enrico Galli <enrico.galli@intel.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13993>	2022-01-31 16:37:47 +00:00
Louis-Francis Ratté-Boulianne	8507afbd06	microsoft/compiler: Add subpass input types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13993>	2022-01-31 16:37:47 +00:00
Louis-Francis Ratté-Boulianne	ef5283d37d	microsoft/compiler: add support for load_layer_id We simply return 0 for now as we don't support multi-view yet. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13993>	2022-01-31 16:37:47 +00:00
Thomas H.P. Andersen	fd99c36351	svga: silence -Wsometimes-uninitialized Reviewed-by: Neha Bhende <bhenden@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	ac59a266cc	anv: drop a set but unused variable Fixes a warning with clang Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	bdfb1885b8	anv: drop a set but unused variable Fixes a warning with clang Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	e7e3e2072c	panfrost: mark two variables as unused The variables are currently unused as panvk does not support SSBOs yet. Silences a compile warning with clang Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	430b1157a1	broadcom: drop unused functions Fixes a clang warning about unused static inlined functions Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	3fdea42339	v3d: avoid warning about unused function This function is only used if V3D_VERSION < 40 Fixes a clang warning about unused static inlined functions. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	cc79959b09	v3d: avoid warning about unused function This function is only used if V3D_VERSION >= 40 Fixes a clang warning about unused static inlined functions. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	274e4e82d2	vc4: drop unused function Fixes a clang warning about unused static inlined functions Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Thomas H.P. Andersen	8e4b3cf832	anv: avoid warning about unused function This function is only used if GFX_VER == 7 Fixes a clang warning about unused static inlined functions. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14790>	2022-01-31 16:10:31 +00:00
Danylo Piliaiev	f14cae43ac	ci/freedreno: properly test sysmem and gmem paths After autotuner introduction most CTS tests are running in sysmem mode. Now we have to force gmem run and add a small forced sysmem run since it's not guaranteed that autotuner would select gmem in future. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12128>	2022-01-31 15:26:35 +00:00
Danylo Piliaiev	803055ccb4	tu: add debug option to force gmem With autotuner we now want to be able to force gmem rendering, it will respect existing constraints though. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12128>	2022-01-31 15:26:35 +00:00
Danylo Piliaiev	a4f9c54444	freedreno: Update gmem/sysmem debug options to be in line with turnip Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12128>	2022-01-31 15:26:35 +00:00
Danylo Piliaiev	dbae9fa7d8	tu: implement sysmem vs gmem autotuner The implementation is separate from Freedreno due to multithreading support. In Vulkan application may fill command buffer from many threads and expect no locking to occur. We do introduce the possibility of locking on renderpass end, however assuming that application doesn't have a huge amount of slightly different renderpasses, there would be minimal to none contention. Other assumptions are: - Application does submit command buffers soon after their creation. Breaking the above may lead to some decrease in performance or autotuner turning itself off. The heuristic is too simplistic at the moment, to find a proper one - we should run a bunch of traces with sysmem and gmem, and build better heuristic from gathered data. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12128>	2022-01-31 15:26:35 +00:00
Lionel Landwerlin	d6e457c0a4	anv: tidy long lines in descriptor code No functional change. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14799>	2022-01-31 14:55:48 +00:00
Boris Brezillon	7333d244be	d3d12: Fix "use of designated initializers requires at least '/std:c++20'" error Fixes: `3a8c8d25fd` ("d3d12: Add a compute transformation to handle indirect draws that need draw params") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14762>	2022-01-31 14:13:57 +00:00
Rhys Perry	16e0c312fa	aco: preserve pass_flags during format conversions This helps the "vopc() & exec" optimization. fossil-db (Sienna Cichlid): Totals from 1638 (1.21% of 134913) affected shaders: CodeSize: 3331804 -> 3327520 (-0.13%); split: -0.19%, +0.06% Instrs: 611807 -> 610096 (-0.28%) Latency: 5579326 -> 5574874 (-0.08%) InvThroughput: 936782 -> 936731 (-0.01%); split: -0.01%, +0.00% Copies: 43324 -> 43302 (-0.05%); split: -0.06%, +0.01% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14773>	2022-01-31 13:45:01 +00:00
Rhys Perry	1804c21fb5	aco: optimize abs(mul(a, b)) fossil-db (Sienna Cichlid): Totals from 18 (0.01% of 134913) affected shaders: CodeSize: 173924 -> 173852 (-0.04%) Instrs: 33864 -> 33846 (-0.05%) Latency: 122233 -> 122211 (-0.02%) InvThroughput: 22482 -> 22462 (-0.09%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14773>	2022-01-31 13:45:01 +00:00
Rhys Perry	452975f257	aco: fix neg(abs(mul(a, b))) if the mul is not VOP3 Previously, is_abs was just ignored if mul_instr->isVOP3()==false. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `93c8ebfa78` ("aco: Initial commit of independent AMD compiler") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14773>	2022-01-31 13:45:01 +00:00
Ella Stanforth	a53fd9b089	vulkan: Allow RegisterDisplayEventEXT before first page flip Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14685>	2022-01-31 12:49:26 +00:00
Connor Abbott	360f7c5d64	tu: Initial link-time optimizations This is mostly taken from radv, and cleaned up a bit: don't explicitly list every stage at the beginning, and name the shaders "producer" and "consumer" to reduce confusion. I also stripped out a lot of other stuff to get to the bare minimum of calling nir_link_opt_varyings, nir_remove_unused_varyings, and nir_compact_varyings and then cleaning up the fallout. In the future we may want to temporarily scalarize I/O like radv does, and add back a few things like the psize optimization. In the meantime this already provides a lot of benefit. Results from the radv fossil_db with some apps not compilable by turnip removed: Totals: MaxWaves: 1637288 -> 1668200 (+1.89%); split: +1.89%, -0.00% Instrs: 54620287 -> 54114442 (-0.93%); split: -0.98%, +0.05% CodeSize: 92235646 -> 91277584 (-1.04%); split: -1.07%, +0.03% NOPs: 11176775 -> 11185206 (+0.08%); split: -0.63%, +0.71% Full: 1689271 -> 1657175 (-1.90%); split: -1.92%, +0.02% (ss): 1318763 -> 1317757 (-0.08%); split: -1.40%, +1.32% (sy): 618795 -> 617724 (-0.17%); split: -0.70%, +0.53% (ss)-stall: 3496370 -> 3470116 (-0.75%); split: -1.37%, +0.62% (sy)-stall: 23512954 -> 23511164 (-0.01%); split: -1.04%, +1.03% STPs: 27557 -> 27461 (-0.35%) LDPs: 22948 -> 22804 (-0.63%) Cat0: 11823765 -> 11829681 (+0.05%); split: -0.62%, +0.67% Cat1: 3120042 -> 2991831 (-4.11%); split: -4.43%, +0.32% Cat2: 28605309 -> 28324829 (-0.98%); split: -0.98%, +0.00% Cat3: 7334628 -> 7252342 (-1.12%); split: -1.12%, +0.00% Cat4: 1216514 -> 1204894 (-0.96%) Cat5: 863976 -> 861926 (-0.24%) Cat6: 1648571 -> 1641457 (-0.43%) Totals from 23575 (16.16% of 145856) affected shaders: MaxWaves: 258806 -> 289718 (+11.94%); split: +11.94%, -0.00% Instrs: 7571190 -> 7065345 (-6.68%); split: -7.04%, +0.36% CodeSize: 13864308 -> 12906246 (-6.91%); split: -7.09%, +0.18% NOPs: 959185 -> 967616 (+0.88%); split: -7.35%, +8.23% Full: 313335 -> 281239 (-10.24%); split: -10.36%, +0.11% (ss): 154628 -> 153622 (-0.65%); split: -11.90%, +11.25% (sy): 69758 -> 68687 (-1.54%); split: -6.21%, +4.67% (ss)-stall: 322002 -> 295748 (-8.15%); split: -14.92%, +6.76% (sy)-stall: 3270366 -> 3268576 (-0.05%); split: -7.45%, +7.40% STPs: 3624 -> 3528 (-2.65%) LDPs: 1074 -> 930 (-13.41%) Cat0: 1022684 -> `1028600` (+0.58%); split: -7.13%, +7.71% Cat1: 531102 -> 402891 (-24.14%); split: -26.04%, +1.90% Cat2: 4090309 -> 3809829 (-6.86%); split: -6.86%, +0.00% Cat3: 1449686 -> 1367400 (-5.68%); split: -5.69%, +0.01% Cat4: 103543 -> 91923 (-11.22%) Cat5: 57441 -> 55391 (-3.57%) Cat6: 316096 -> 308982 (-2.25%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14767>	2022-01-31 12:19:55 +00:00
Timothy Arceri	bfa096a0b9	glsl/st: move st_nir_opts() into gl compiler common code This will allow us to use this in future NIR linker work. It also makes more sense to move it here as the classic drivers are gone, tgsi is going away and we are merging more of the st into the gl common code. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14785>	2022-01-31 11:49:21 +00:00
Christian Gmeiner	665ee002c3	etnaviv: add two new HI related perfmon counter These counter are available starting with kernel 5.10. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7398>	2022-01-31 10:14:57 +00:00
Christian Gmeiner	cb643e9239	etnaviv: use bytes for read TX data Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7398>	2022-01-31 10:14:57 +00:00
Christian Gmeiner	d0a5129fde	etnaviv: add multiply_with_8 flag There are some HW counters that are exposing things in terms of 8bytes bundles. From a user PoV those counters would be much more useful if we do the scaling to single bytes internally. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7398>	2022-01-31 10:14:57 +00:00
Lionel Landwerlin	17d143231b	docs/anv: add descriptor memory layout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14522>	2022-01-31 07:59:00 +00:00
Lionel Landwerlin	4999e4cd4c	docs/anv: list environment variables Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14522>	2022-01-31 07:59:00 +00:00
Lionel Landwerlin	a98045c31b	docs: start some documentation on Anv Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14522>	2022-01-31 07:59:00 +00:00
Mike Blumenkrantz	3a0888c62f	zink: fix waiting on current batch id - the current batch id is always 0 - there is always a current batch - a batch id can only be set at the time of submit thus when passing 0 to wait on the current batch, the submit must complete so that there is a batch id, and this must occur before the timeline wait path or else the timeline wait does nothing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14693>	2022-01-31 04:03:26 +00:00
Mike Blumenkrantz	b62b916945	zink: print an error when the device is lost how many hours were lost to this Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14693>	2022-01-31 04:03:26 +00:00
Mike Blumenkrantz	95bfb75688	zink: add vertex shader pipeline bit for generated barrier construction if the vertex buffer resource has writes, it needs this bit too cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14691>	2022-01-31 03:46:53 +00:00
Mike Blumenkrantz	27d405dc2f	zink: clamp tbo creation to maxTexelBufferElements for sparse buffers, the total buffer size will be huge, so this needs to only be the limit that the driver can support to avoid crashing or whatever cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14692>	2022-01-31 03:34:09 +00:00
Mike Blumenkrantz	fa74a6c554	zink: ci updates Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14745>	2022-01-30 19:18:45 +00:00
Mike Blumenkrantz	90bf30d7e4	zink: make pipe_buffer_write usage trigger compiler errors don't want to have to hunt this down ever again Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14745>	2022-01-30 19:18:45 +00:00
Mike Blumenkrantz	c9a0919499	zink: replace other pipe_buffer_write usage with pipe_buffer_write_nooverlap this is just to be consistent and avoid any pipe_buffer_write() usage so that grep won't find it Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14745>	2022-01-30 19:18:45 +00:00
Mike Blumenkrantz	3402558b47	zink: replace qbo pipe_buffer_write usage with tc_buffer_write this fixes flakiness with qbo readback due to the buffer being internally invalidated without tc being aware of it fixes: KHR-GL46.direct_state_access.queries_functional Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14745>	2022-01-30 19:18:45 +00:00
Lucas Stach	a40a6e551e	etnaviv: draw: only mark resources as read/written when the state changed If the relevant state for a resource has not been dirtied between the last and the current draw, we don't need to mark the resource as read or written, as they are guaranteed to be marked already by the last draw. This saves quite a bit of hashset operations for the resource tracking in draw heavy workloads. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14791>	2022-01-30 13:28:21 +00:00
Thomas H.P. Andersen	37989670b9	microsoft/compiler: fix -Wbitwise-instead-of-logical warning Replace the bitwise operation with a more explicit do-while loop. This fixes a warning with clang, and ensures that nir_opt_dead_cf and nir_opt_dce are called in the right order. Suggested-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14742>	2022-01-30 05:05:38 +00:00
Christian Gmeiner	708a33d132	etnaviv: fix FRONT_AND_BACK culling HW has no value to cull both faces (setting both CW/CCW bits results in only CCW being culled). The blob just skips triangle draws when FRONT_AND_BACK culling is enabled. Lets do the same in draw_vbo(..). Signed-off-by: Jonathan Marek <jonathan@marek.ca> Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14779>	2022-01-29 16:28:09 +00:00
Marcin Ślusarz	24fef8f33d	intel/compiler: Use Task/Mesh InlineData for the first few push constants Replace load_mesh_global_arg_addr_intel with a more general intrinsic load_mesh_inline_data_intel, since inline data now hold both a pointer descriptor information and the first few push constants. Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Marcin Ślusarz	1d9f47325b	intel/compiler: handle gl_[Clip\|Cull]Distance from mesh in fragment shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Marcin Ślusarz	baa17865de	intel/compiler: handle gl_[Clip\|Cull]Distance in mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Caio Oliveira	856a0cacb1	intel/compiler: Merge Per-Primitive attribute handling in Mesh case Just a refactor, no behavior change. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Caio Oliveira	2b8b884bcd	intel/compiler: Have specific mesh handling in calculate_urb_setup() Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14788>	2022-01-29 06:32:19 +00:00
Paulo Zanoni	83788b864d	iris: sprinkle some assertions for bufmgr->lock Assert the lock is held when we need it to be held. Also hold the lock during bufmgr destruction to keep the asserts happy. Now the only place where we're not holding the lock while manipulating bufmgr data structures is iris_bufmgr_create(), but this shouldn't trigger any of the new assertions. Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13486>	2022-01-29 05:00:51 +00:00
Paulo Zanoni	6f5af78e25	iris: improve error checking in functions that call vma_alloc() On these functions, for vm_bind I want to add another function call that can fail and needs to be handled. As a preparation for that, add real error checks around vma_alloc() and try to fix some of the other error-related issues in these functions. This way, it will be much simpler to just drop the new vm_bind-related function calls and error handling code. My fear is that having vma-related errors (and in the future vm_bind-related errors) go unannounced may create some hard-to-debug bugs. v2: Unlock only after bo_free() (Marcin Ślusarz). v3: Prefer goto over early returns (Marcin Ślusarz). Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13486>	2022-01-29 05:00:51 +00:00
Caio Oliveira	8599ded193	intel: Only reserve space for Compute Engine out of URB in Gfx12LP Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14756>	2022-01-28 14:52:17 -08:00
Chia-I Wu	c3a74e10f9	venus: updates to the doc VIRTGPU_PARAM_CONTEXT_INIT has been upstreamed. Misc updates to reflect the current status. Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14777>	2022-01-28 20:13:49 +00:00
Yiwei Zhang	3f9eb4fdf4	venus: make vn_QueueSubmit async for native submissions Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14657>	2022-01-28 19:15:52 +00:00
Yiwei Zhang	15e7750446	Revert "venus: remove vn_ring_wait_all" This reverts commit `7253e61d9d`. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14657>	2022-01-28 19:15:52 +00:00
Yiwei Zhang	5ba9309c29	venus: track whether a fence is external Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14657>	2022-01-28 19:15:52 +00:00
Yiwei Zhang	088ea93a59	venus: update some obsolete assumptions described Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14657>	2022-01-28 19:15:52 +00:00
Christian Gmeiner	be5b0e08fa	etnaviv: make use of nir_lower_tex_shadow Also force the texture filter to nearest when the lowering is used. This enables the GL_ARB_shadow extension for all GPU models. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14308>	2022-01-28 18:40:53 +00:00
Christian Gmeiner	e5f9cdac1f	nir/nir_lower_tex_shadow: support tex_instr without deref src Use texture_index if there is no deref src. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14308>	2022-01-28 18:40:53 +00:00
Christian Gmeiner	e67bca3fe7	nir: make lower_sample_tex_compare a common pass This pass was originally written for d3d12, but is useful for hardware that lacks sample compare support like some etnaviv GPU models. Also rename the lowering pass and some surrounding code to nir_lower_tex_shadow as suggested by Emma. I'd like to use the pass that's already in tree. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14308>	2022-01-28 18:40:53 +00:00
Alyssa Rosenzweig	a78861b0fb	docs/panfrost: Add new Midgard/Bifrost chips These should be fine. It's only the early chips that are too broken to see the light of day. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	0b16bf370e	panfrost: Add Mali-G51 support Just to prove it can be done in one line now :-) Thanks to Robin Murphy for breaking out the ol' FPGA. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	33399e95ae	pan/bi: Assume future Valhall is 16-wide warps This is true for v10 at least, per the public data sheet. It seems like a reasonable default going forward to eliminate a special case. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	8f0b5b4b19	pan/bi: Clean up quirks NO_PRELOAD isn't actually used. The other quirks are used, but they're implementation details only affecting a few older models. So we can default to no quirks and clean up a lot. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	94bb229c46	panfrost: Get performance counters from table Avoids yet another open-coded list of GPU IDs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	5e808f7b0a	panfrost: Make the GPU allowlist implicit Allowlist a GPU if and only if it has a model definition in-tree. This replaces the explicit allowlist, which is somewhat cumbersome to maintain. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	8c01a8a263	panfrost: Replace panfrost_model_name with model->name One less place to update GPU IDs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	6c0d433d19	panfrost: Centralize our model list Replaces panfrost-quirks.h Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	2b638c1eb3	panfrost: Don't pass quirks to pan_lower_framebuffer There is a single quirk it cares about. Pass just that, so the relevant quirk can be made a Midgard compiler quirk and not a global Panfrost quirk. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14726>	2022-01-28 17:47:46 +00:00
Alyssa Rosenzweig	2b699d0650	panfrost: Fix v9 "Stencil from shader" bit Typo. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Fixes: `96acad5cd5` ("panfrost: Add XML for Valhall data structures") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14776>	2022-01-28 17:30:37 +00:00
Alyssa Rosenzweig	23d52b47f1	panfrost: Make primary_shader boolean Fixes: `96acad5cd5` ("panfrost: Add XML for Valhall data structures") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14776>	2022-01-28 17:30:37 +00:00
Christian Gmeiner	52b36cb790	isaspec: Add support for special {:align=} field Make it possible to just do alignment handling without an actual field to print. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14668>	2022-01-28 16:33:18 +00:00
Mike Blumenkrantz	42ae116ac7	zink: fix vertex buffer mask computation for null buffers off by N affects: KHR-GL46.texture_cube_map_array.sampling Fixes: `53aade0ef0` ("zink: fix enabled vertex buffer mask calculation") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14721>	2022-01-28 15:53:43 +00:00
Mike Blumenkrantz	143c156409	aux/tc: add tc_buffer_write to replace pipe_buffer_write usage tc_buffer_write is the tc-safe version of this function which will avoid accidental invalidations that break behavior Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14770>	2022-01-28 14:58:20 +00:00
Mike Blumenkrantz	be5311972f	zink: remove tmp buffer rebinds this is no longer used/necessary Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14689>	2022-01-28 14:40:31 +00:00
Mike Blumenkrantz	5c28afdc7f	zink: simplify buffer case for zink_resource_object_init_storage() this is a no-op, but leave the case to simplify the caller Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14689>	2022-01-28 14:40:31 +00:00
Mike Blumenkrantz	bfde221952	zink: flag all buffer resources with PIPE_BIND_SHADER_IMAGE Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14689>	2022-01-28 14:40:31 +00:00
Mike Blumenkrantz	93edc63250	zink: use the storage buffer for bufferview creation when format allows this avoids issues where a buffer created for storage can't be used for texturing due to having the storage bit fixes (anv): KHR-GL46.texture_buffer.texture_buffer_texture_buffer_range crash -> fail Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14689>	2022-01-28 14:40:31 +00:00
Mike Blumenkrantz	94afa1632f	zink: always create a separate VkBuffer for storage use bufferview objects must be created with or without the storage bit as the driver permits, so create two buffer objects that can be used as needed Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14689>	2022-01-28 14:40:31 +00:00
Mike Blumenkrantz	3e12cb9ad9	zink: use VkImageViewUsageCreateInfo to remove attachment bits this avoids issues where the driver may not support attachment bits for texture views Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14719>	2022-01-28 14:23:51 +00:00
Mike Blumenkrantz	c639e82f18	zink: allow resource creation without VK_FORMAT_FEATURE_COLOR_ATTACHMENT_BIT as long as this isn't multisampled, it should be okay since it could be used with texture_subdata fixes (anv): dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.conversion_gpu dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.enabled dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.multiple_textures dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.skipped dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.texel_fetch dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.toggled dEQP-GLES31.functional.srgb_texture_decode.skip_decode.sr8.using_sampler dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8_linear dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8_linear_mipmap_linear dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8_linear_mipmap_nearest dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8_nearest dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8_nearest_mipmap_linear dEQP-GLES31.functional.texture.filtering.cube_array.formats.sr8_nearest_mipmap_nearest dEQP-GLES31.functional.texture.format.sized.cube_array.srgb_r8_npot dEQP-GLES31.functional.texture.format.sized.cube_array.srgb_r8_pot Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14719>	2022-01-28 14:23:51 +00:00
Mike Blumenkrantz	d4720c65ac	zink: flag has_work when a GL semaphore is signalled this is an async flush, so it has to have this flag set or else the subsequent (also async) flush won't do anything Fixes: `32597e116d` ("zink: implement GL semaphores") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14740>	2022-01-28 14:02:48 +00:00
Mike Blumenkrantz	794fabc8c2	zink: emit same number of timeline signals as semaphore signals spec requires these counts/arrays to match, and I did match them, but it was only in my kopper branch, which didn't propagate back to the original implementation Fixes: `32597e116d` ("zink: implement GL semaphores") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14740>	2022-01-28 14:02:48 +00:00
Iago Toral Quiroga	5974949c0d	v3dv: expose VK_KHR_depth_stencil_resolve Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14752>	2022-01-28 12:25:43 +00:00
Iago Toral Quiroga	668653f830	v3dv: fallback to blit resolve if render area is not aligned to tile boundaries Just as with all other TLB operations, we can only use the TLB if the render area is aligned to tile boundaries. If it is not, then the operation would overwrite pixels outside the render area, which is not allowed. In this case, we can't even emit a previous TLB load to fix this because the TLB has the multisampled attachment, not the resolve attachment, which is just a destination buffer for the tile store. Because the condition for tile alignment has to be determined for each subpass, we handle this by storing this information in the attachment state of the command buffer with the start of each subpass. We store whether the attachment is to be resolved and whether it can use the TLB (considering tile alignment restrictions). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14752>	2022-01-28 12:25:43 +00:00
Iago Toral Quiroga	7f87a1256e	v3dv: support resolving depth/stencil attachments This implements the bulk of VK_KHR_depth_stencil_resolve Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14752>	2022-01-28 12:25:43 +00:00
Tomeu Vizoso	743582d170	ci: Rebalance Iris jobs More boards have been added to Collabora's lab, and the reliability should be fine now. Use them to the max! Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14557>	2022-01-28 12:00:21 +01:00
Caio Oliveira	d6c31f05a2	anv: Fix subgroupSupportedStages physical property Use the proper Vulkan values that can be combined into a bitmask. Fixes: `f40a08d25c` ("anv: Don't advertise unsupported shader stages") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14761>	2022-01-28 01:13:07 -08:00
Tatsuyuki Ishi	89f376b506	radv/sqtt: Add and enable basic EXT_debug_utils support. Only CommandBuffer markers for now. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14645>	2022-01-28 07:47:10 +00:00
Vinson Lee	a97ec3eb13	v3dv: Add missing unlocks on errors. Fix defects reported by Coverity Scan. Missing unlock (LOCK) missing_unlock: Returning without unlocking. Fixes: `a7052dcf2c` ("v3dv: enable multiple semaphores for csd job") Fixes: `ad09e50129` ("v3dv: enable multiple semaphores for tfu job") Fixes: `ff8586c345` ("v3dv: enable multiple semaphores on cl submission") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14663>	2022-01-28 04:15:24 +00:00
Nanley Chery	dc70dd8c7d	iris: Support the XeHP media compression format The format on this platform is slightly different from the one used on TGL. Also it's part of the surface state instead of an aux-map. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Nanley Chery	7f46e569e5	intel/isl: Support the XeHP media compression format The format on this platform is slightly different from the one used on TGL. Also it's part of the surface state instead of an aux-map. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Nanley Chery	fde43bb194	intel: Rename a RenderCompressionFormat field The name of the bit field is CompressionFormat. The format subsections of the field specify the alternate names of RenderCompressionFormat or MediaCompressionFormat depending on the compression type. We're going to start programming this field for media compression, so we'd like to use either the bit field name or a new MediaCompressionFormat field. Either option seems fine, so we go with the first. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Nanley Chery	d8c6b2c394	iris: Use iris_format_for_usage in map_aux_addresses Enables dropping the format-mapping switch statement in iris_resource_finish_aux_import. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Nanley Chery	37a0185ec2	iris: Drop stale media compression import code With commit `f57c074270`, ("gallium/dri: Allow use of R8G8_R8B8 for YUYV and G8R8_B8R8 for UYVY"), iris stopped lowering media compressed surfaces to multiple planes. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Nanley Chery	cc973de5f6	intel/isl: Support YUV pipe-to-isl format mapping This will help to configure surfaces for media compression. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Nanley Chery	514897ef41	iris: Explicitly rely on gallium fallbacks for YUV iris_is_format_supported has been returning false for YUV pipe formats. We're going to update isl_format_for_pipe_format to map some YUV pipe formats, but we don't want iris_is_format_supported to start returning true for them. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14355>	2022-01-28 00:30:55 +00:00
Emma Anholt	b5e41c8c2d	ci/freedreno: Switch 2 default a630 VK jobs to being GLES and VK ASan jobs. The automatic VK coverage we care about is happening on a618, which is the HW we're shipping. Having the old 630 runners make sure we don't leak memory is a great use for them. Still, keep one default A630 VK job to make sure we don't totally trash it. Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14235>	2022-01-27 23:47:46 +00:00
Emma Anholt	8457667be9	ci: Use a dlclose-disabling preload library for leak checking in Vulkan. For GL, we disable the dlclose() call on the driver in asan builds so that leak reports get proper backtraces. For Vulkan, the dlclose() happens from libvulkan so you need a bigger hammer to keep our drivers loaded. Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14235>	2022-01-27 23:47:46 +00:00
Danylo Piliaiev	da7a475138	turnip: Drop references to layout of all sets on pool reset/destruction We dropped the references only for non-host_memory_base pools. Create a list of alive descriptor to account for all of them. Fixes: `1b513f49` ("tu: add reference counting for descriptor set layouts") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14235>	2022-01-27 23:47:46 +00:00
Emma Anholt	bdb8e615d1	vulkan: Fix leak of error messages Fixes: `0cad3beb2a` ("vulkan/log: Add common vk_error and vk_errorf helpers") Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14235>	2022-01-27 23:47:46 +00:00
Nanley Chery	749b82238b	isl: Enable compression with multisampled Tile64 We haven't tested the single-sampled case, so this doesn't enable any more uses of standalone CCS. This does however, allow multisampled surfaces with MCS or HIZ to get upgraded to MCS_CCS and HIZ_CCS, respectively. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14431>	2022-01-27 22:38:01 +00:00
Jordan Justen	792e294572	isl: Enable compression with Tile4 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14431>	2022-01-27 22:38:01 +00:00
Nanley Chery	793338266c	anv: Don't allocate VMA for CCS on XeHP On XeHP, CCS doesn't require VMA on XeHP. The HW provides anything allocated in LMEM a mapping to a CCS memory range for free. So, we: 1) use the implicit CCS framework to avoid adding an image memory binding for the CCS surface. 2) leave each BO sized as-is instead of adding on space for the CCS. Thankfully the framework only adds on space if an aux-map is present. XeHP has no aux-map, so this patch doesn't explicitly do anything for this. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14431>	2022-01-27 22:38:01 +00:00
Nanley Chery	382f6ccda8	anv: Require the local heap for CCS on XeHP This platform doesn't support CCS in system memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14431>	2022-01-27 22:38:01 +00:00
Nanley Chery	93f8d88fa3	anv: Disable the SMEM fallback for local memory The fallback is incompatible with allocations that use CCS on XeHP. On that platform, compression can't be used in SMEM. Apps should be okay with this change. They're able to manage local and system memory heaps directly (see VK_EXT_memory_budget). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14431>	2022-01-27 22:38:01 +00:00
Nanley Chery	63096b886f	anv: Drop redundant disabling of non-renderable CCS ISL reports no CCS support for these formats, except for R10G10B10A2_UNORM_SRGB. Anv doesn't support that format at all however. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14431>	2022-01-27 22:38:01 +00:00
Chia-I Wu	4f1cf6fd3b	vulkan/wsi/x11: fix x11_image_init return value on errors fail_pixmap is reached when xshmfence alloc/map fails. result is VK_SUCCESS and we need to pick an error code explicitly. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14748>	2022-01-27 22:06:19 +00:00
Pavel Ondračka	e020c1b979	r300: Set consistent PIPE_SHADER_CAP_PREFERRED_IR This efectivelly enables NIR to TGSI for swtcl chipsets. Otherwise on swtcl chipsets we can end with PIPE_SHADER_IR_TGSI for vertex stage and PIPE_SHADER_IR_NIR for fragment stage. This is not expected by core mesa and leads to issues most visible as crashing piglit atan tests. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5916 Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14732>	2022-01-27 21:04:32 +00:00
Pavel Ondračka	8b11d8a127	r300: Disable integers and indirect temporary addressing with swctl This is not needed with TGSI, but will be needed by the next commit after we switch to NIR. Copied from i915 including the comments. Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14732>	2022-01-27 21:04:32 +00:00
Chia-I Wu	bf32d3145c	venus: handle VkBindImageMemorySwapchainInfoKHR The common WSI advertises VK_KHR_swapchain v70. We must handle VkBindImageMemorySwapchainInfoKHR. Fixes dEQP-VK.wsi.*.image_swapchain_create_info. v2: try to match vn_wsi_create_image (Yiwei) and the common WSI v3: match modifier as well (Yiwei) Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14550>	2022-01-27 20:35:07 +00:00
Chia-I Wu	127263dc4a	venus: remember the memory bound to a swapchain image It will be needed for VkBindImageMemorySwapchainInfoKHR. Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14550>	2022-01-27 20:35:07 +00:00
Chia-I Wu	350dfb8c3c	venus: format with clang-format clang-format misaligns one of the comments in vn_physical_device_get_passthrough_extensions, but we can live with that. Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14550>	2022-01-27 20:35:07 +00:00
Chia-I Wu	4d15b52783	venus: fix VK_KHR_driver_properties VK-GL-CTS 1.2.7.1 does not really recognize VK_DRIVER_ID_MESA_VENUS, but it is harmless to lie. Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14749>	2022-01-27 20:24:57 +00:00
Danylo Piliaiev	24144f6f5c	turnip/trace: Delete unused start/end_resolve tracepoints Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14391>	2022-01-27 18:59:43 +00:00
Danylo Piliaiev	1989e1e6d8	turnip/perfetto: handle gpu timestamps being non-monotonic Perfetto requires time in clock snaphots to be monotonic, otherwise the clock would be excluded. GPU timestamps start from zero after every suspend-resume cycle which makes them non-monotonic. As a solution on msm we check whether GPU was just resumed and remember previous highest timestamp to then add it to the next timestamps. If the functionality to get whether gpu is resumed is unavailable or doesn't work - we fallback to a check for a discontinuity in timestamps. For kgsl we always use fallback. Fixes renderstage timeline disappearing in AGI. Or you could avoid the issue altogether by preventing GPU from going to sleep by increasing auto suspend delay e.g.: echo 5000 > /sys/devices/platform/soc\@0/3d00000.gpu/power/autosuspend_delay_ms Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14391>	2022-01-27 18:59:43 +00:00
Danylo Piliaiev	ba7faa6f43	turnip/trace: process u_trace chunks on queue submission tu_QueuePresentKHR was not the best place since application isn't required to call it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14391>	2022-01-27 18:59:43 +00:00
Danylo Piliaiev	a6482a3a6e	turnip: rename tu_drm_get_timestamp into tu_device_get_gpu_timestamp It is not drm specific and will be implemented in kgsl. Change parameter to tu_device along the way. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14391>	2022-01-27 18:59:43 +00:00
Danylo Piliaiev	f2c53c2a9b	turnip/trace: refactor creation and usage of trace flush data Fixes the case when last cmd buffer in submission doesn't have tracepoints leading to flush data not being freed. Added a few comments, renamed things, refactored allocations - now the data flow should be a bit more clean. Extracted submission data creation into tu_u_trace_submission_data_create which would be later used in in tu_kgsl. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14391>	2022-01-27 18:59:43 +00:00
Danylo Piliaiev	95896dee93	turnip/perfetto: Optimize timestamp synchronization We shouldn't do ioctl to get timestamp if perfetto isn't connected. Also it's better to sync timestamps after submission since the call could block until GPU is resumed. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14391>	2022-01-27 18:59:43 +00:00
Guilherme Gallo	2d75fd1e0a	virgl/ci: make crosvm-runner pass variables in a secure way crosvm-runner.sh was using `export -p` to create an environment script for the virtualized system, but this command will dump every declared environment variable in the system, which includes Gitlab's CI variables with sensitive data, such as passwords and auth tokens. Replacing `export -p` to `generate-env.sh`, which only exports the necessary variables for Mesa CI jobs. Extra changes: * Stop changing ${PWD} variable programmatically in scripts. ${PWD} is a variable used by most prolific coreutils and bash commands, such as `cd` and `pwd`, besides it is set by subshells [1]; changing this variable may lead to complex situations. As drop-in replacement for ${PWD}, use ${DEQP_BIN_DIR} to flag that there is a special folder where dEQP should be run. * Double quote path and array variables. See: https://github.com/koalaman/shellcheck/wiki/SC2086 * Do not export variables directly from commands output. See: https://github.com/koalaman/shellcheck/wiki/SC2155 [1] ``` $ cd /tmp $ export PWD=test; bash -c 'echo $PWD' /tmp ``` v2: - Revert $DEQP_BIN_DIR quoting in crosvm-runner.sh and crosvm-init.sh - Log all the passed variables to stdout, to help with debugging when new variable are needed to be put in `generate-env.sh` v3: - Revert $DEQP_BIN_DIR quoting leftovers Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14626>	2022-01-27 18:28:48 +00:00
Emma Anholt	7380d8e285	ci/freedreno: Update hashes for closed traces. These two had different pixel results from last time someone updated them, but things still look fine. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14757>	2022-01-27 17:46:52 +00:00
Connor Abbott	065785e689	tu: Report code size in pipeline statistics Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14754>	2022-01-27 17:16:18 +00:00
Lionel Landwerlin	583e4549fe	intel/ci: expected failure for 1.3 with older CTS Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14706>	2022-01-27 16:42:01 +00:00
Lionel Landwerlin	f3877182f7	relnotes/features: updates for Vulkan 1.3 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14706>	2022-01-27 16:42:01 +00:00
Jason Ekstrand	df8ac77af8	anv: Advertise Vulkan 1.3 Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14706>	2022-01-27 16:42:01 +00:00
Lionel Landwerlin	7d9cd208d5	anv: switch a bunch of struct/enum to 1.3 versions Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14706>	2022-01-27 16:42:01 +00:00
Jason Ekstrand	2e730167a4	anv: Implement 1.3 features/properties Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14706>	2022-01-27 16:42:01 +00:00
Caio Marcelo de Oliveira Filho	372faa4a23	anv: SPIR-V 1.6 shaders imply ALLOW_VARYING_SUBGROUP_SIZE Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14706>	2022-01-27 16:42:01 +00:00
Manas Chaudhary	cad053db61	panvk: Fix pointer corruption in panvk_add_wait_event_syncobjs nr_in_fences was being incremented to point to an illegal address Fixes: `1e23004600` ("panvk: Add vkEvents support") Cc: mesa-stable Signed-off-by: Manas Chaudhary <manas.chaudhary@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14744>	2022-01-27 14:16:17 +00:00
Mike Blumenkrantz	ece10a5467	zink: unify some context casts in zink_create_sampler_view no functional changes Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14746>	2022-01-27 13:32:54 +00:00
Mike Blumenkrantz	fe3984b8cd	anv: silence wsi debug logging this is triggered by mesa's own wsi handling, so stop printing nonsense Reviewed-by: Jason Ekstrand <jason.ekstrand@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14743>	2022-01-27 13:10:27 +00:00
Roman Gilg	6018d5c44a	vulkan/wsi/x11: document implementation To extend the shared understanding of our code base and ease contributing document purpose and flow for many of our internal functions. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6536>	2022-01-27 09:14:29 +00:00
Martin Roukala (né Peres)	978ea32acf	radv/ci: mark the dEQP fails related to a missing VKCTS 1.3 as expected Now that RADV is exposing Vulkan 1.3 by default, VKCTS is getting confused by it and fails 2 tests: - dEQP-VK.api.version_check.version: This version of CTS does not support Vulkan device version 1.3.204 (Fail) - dEQP-VK.info.device_properties: deviceProperties apiVersion not valid (Fail) Mark both of these failures as expected, while we wait for VKCTS 1.3 to be released. Signed-off-by: Martin Roukala (né Peres) <martin.roukala@mupuf.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14729>	2022-01-27 07:55:53 +00:00
Iago Toral Quiroga	764c8867b0	v3dv: document why we don't expose VK_EXT_scalar_block_layout And since this is an optional feature in Vulkan 1.2, fill in the corresponding feature query while we are at it. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14731>	2022-01-27 07:34:19 +00:00
Iago Toral Quiroga	06220a28e7	v3dv: rework Vulkan 1.2 feature queries Fill them into a VkPhysicalDeviceVulkan12Features struct like we do for Vulkan 1.1, and then read them from there. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14731>	2022-01-27 07:34:19 +00:00
Iago Toral Quiroga	692e0dfe27	v3dv: implement VK_KHR_imageless_framebuffer Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14704>	2022-01-27 07:11:20 +00:00
Iago Toral Quiroga	2ee9487ad7	v3dv: drop signature of undefined function This is a left over from when we added multi-version support in the driver, where we turned this helper into a versioned scheme. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14704>	2022-01-27 07:11:20 +00:00
Emma Anholt	af957d4a17	ci/traces: Always generate the junit XML. While it's not the primary interface to interpreting trace job failures, it was set in all the traces jobs it looks like and it's low cost anyway. Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	60f7bd3c09	ci/traces: Drop PIGLIT_REPLAY_UPLOAD_TO_MINIO. You have to do this as part of the traces workflow, otherwise there are no baseline images for your driver to compare to in the HTML summary. Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	d0af517a14	ci/traces: Drop the baseline file creation for trace results. It's always empty for traces. This reduces more noise in the job logs so people are more likely to see the link to the HTML. Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	d562b83a84	ci/traces: Clean up the failure report message. You really want to be reviewing the HTML summary with image diffs, not the junit XML (though we do still generate it so you get the results in the gitlab UI if that's how you like to interact with it). Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	7d6cd4a5cd	ci/traces: Drop the PIGLIT_PROFILES setting for traces replay. Now that the script just does traces, no need to conditionalize this stuff. Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	ccbf16124e	ci/traces: Rename the piglit/run.sh script to piglit-traces.sh. That's the only use of this script that's left. Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	d041630a37	ci/llvmpipe,softpipe: Switch piglit testing to piglit-runner. The new runner reduces the runtime by about 1/3 thanks to using rust instead of python, and includes automatic flake handling so you don't just have to skip flaky tests. The wrapper script also includes IRC flake reporting (so one can track and update the flakes list to improve CI reliability), always uploading results to CI for review (so you can diagnose flakes and look at timings), has a prettier regressions report and a helpful timing report, and is the same as what's used by all the HW runners as well. The downside is that by dropping the massive list of skips, you no longer get flagged if Mesa refactors end up accidentally disabling extensions and thus making tests skip. For that, I've started on https://gitlab.freedesktop.org/anholt/deqp-runner/-/merge_requests/33 so that hardware drivers get extension checking coverage too. Thanks to the perf improvement, we get to drop one of the jobs for llvmpipe. xfail lists were mostly sed-jobs from the prior expectations lists. The exceptions to that you'll find in the form of whitespace around the affected test group (usually changes of capitalization or special-characters), or an explanation for the more interesting changes (which thankfully we can now record in the xfails lists!). Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	87c3651674	ci/llvmpipe: Drop the skip of piglit edgeflag test. I think the fail was fixed by `39ea95330f` ("mesa: ensure parameter list capacity before associating uniform storage"), it doesn't reproduce for me any more. Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Emma Anholt	6ce7a6e725	Revert "ci: freedreno: Update a530 dEQP fail expectation list" This reverts commit `a35c5540e4`. Another patch doing so had already landed, so 530 was now failing all its (manual) tests. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14604>	2022-01-27 04:37:16 +00:00
Rob Clark	ae32c6229e	freedreno: Add missing generated header dependency Fixes: In file included from ../mesa-freedreno-22.0.0_pre/src/gallium/drivers/freedreno/ir3/ir3_cmdline.c:45: In file included from ../mesa-freedreno-22.0.0_pre/src/gallium/drivers/freedreno/ir3/ir3_gallium.h:34: In file included from ../mesa-freedreno-22.0.0_pre/src/gallium/drivers/freedreno/freedreno_util.h:31: ../mesa-freedreno-22.0.0_pre/src/freedreno/drm/freedreno_ringbuffer.h:35:10: fatal error: 'adreno_common.xml.h' file not found #include "adreno_common.xml.h" ^~~~~~~~~~~~~~~~~~~~~ 1 error generated. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14735>	2022-01-27 00:28:48 +00:00
Adam Jackson	6c984b1469	dri_interface: Remove the remaining DRI1 API definitions None of these are used anymore, and as a bonus we can drop the dance around the libdrm headers. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14738>	2022-01-26 23:56:16 +00:00
Nanley Chery	57445adc89	anv: Re-enable CCS_E on TGL+ Commit `e614789588` ("anv: Also disallow CCS_E for multi-LOD images") accidentally disabled CCS_E on TGL+ because it checked for image->vk.mip_levels > 0 instead of image->vk.mip_levels > 1. Instead of reverting it, we remove the code which disables CCS_E for mipmapped or arrayed images now that we've sufficiently handled the clear color issue in other ways. Fixes: `e614789588` ("anv: Also disallow CCS_E for multi-LOD images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14723>	2022-01-26 23:34:33 +00:00
Nanley Chery	c48401404c	anv: Use ANV_FAST_CLEAR_DEFAULT_VALUE for CCS on TGL+ On TGL, if a block of fragment shader outputs match the surface's clear color, the HW may convert them to fast-clears (see HSD 14010672564). This can lead to rendering corruptions if not handled properly. We restrict the clear color to zero to avoid issues that can occur with: - Texture view rendering (including blorp_copy calls) - Images with multiple levels or array layers Fixes: `e614789588` ("anv: Also disallow CCS_E for multi-LOD images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14723>	2022-01-26 23:34:33 +00:00
Nanley Chery	d68b2db89c	anv: Disable CCS_E for some 8/16bpp copies on TGL+ CCS_E is currently disabled on TGL+, but we'll enable it soon. We choose to explicitly disable it for certain copy operations to avoid CTS failures in the following groups: - dEQP-VK.drm_format_modifiers.export_import.* - dEQP-VK.synchronization* Fixes: `e614789588` ("anv: Also disallow CCS_E for multi-LOD images") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14723>	2022-01-26 23:34:33 +00:00
Nanley Chery	09ca089144	anv: Drop assert against modifier with aux on gfx12 Commit `b664349973` ("anv: Enable implicit CCS for external images") introduced support for implicit CCS with I915_FORMAT_MOD_Y_TILED. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14723>	2022-01-26 23:34:33 +00:00
Erik Faye-Lund	f07f4d5ec5	docs: use http-redirect when possible GitLab Pages has added a feature to do proper HTTP redirects, which are genreally better than the HTML redirects we currently use. Unfortunately, it doesn't support redirecting to other domains, all paths must start with a slash. So there's sadly one redirect this doesn't work for. So let's leave that one using a HTML redirect, and use HTTP redirects when we can. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14369>	2022-01-26 23:26:50 +00:00
Emma Anholt	53426d26c3	softpipe: Dispatch 4 CS invocations per tgsi_exec thread. We were executing 1 non-helper invocation and 3 helpers per CS tgsi_exec machine, which was a total waste of the CPU when we could trivially have all 4 invocations do real work (at least in the common case of a gl_WorkGroupSize.x >= 4). This didn't have the effect on dEQP that I was hoping for, as it turns out that its shaders are almost all 1x1x1 workgroups. However, it does reduce the runtime of piglit arb_compute_shader-local-id from 2:10 to 47 seconds on my system. Part of #4097 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14728>	2022-01-26 22:52:19 +00:00
Emma Anholt	62dc4be470	softpipe: Initialize the CS dispatch mask at machine setup time. It's not modified later, so no need to reset it per barrier or per workgroup. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14728>	2022-01-26 22:52:19 +00:00
Emma Anholt	2fe2a2b080	softpipe: Improve some local var naming in compute shaders. These aren't dimensions, they're gl_LocalInvocationID.xyz. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14728>	2022-01-26 22:52:19 +00:00
Emma Anholt	13b57a8cad	tgsi_exec: Fix shared var stores for >1 real invocation, and overflow checks. The shared var store overflow checks left a lot of overflowing opportunities available, while the buffer storage path did proper checking. But, more importantly for this branch, it always used the first invocation's offset for each invocation in the quad (which only worked so far because softpipe only dispatched a single non-helper invocation per quad). Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14728>	2022-01-26 22:52:19 +00:00
Hyunjun Ko	8a5b949a3e	turnip: fix leaks of submit requests. Fixes: `479a1c40` ("turnip: Porting to common vulkan implementation for synchronization.") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14727>	2022-01-26 22:22:33 +00:00
Mike Blumenkrantz	8bca29dcf0	zink: return 256 for PIPE_CAP_MIN_MAP_BUFFER_ALIGNMENT this isn't the minimum allowed by the driver, but zink doesn't return the minimum allowed by the driver anyway and hasn't in a very long time instead, it suballocates using a minimum alignment of 256 bytes, so use that instead fixes (in caselists): KHR-GL46.map_buffer_alignment.functional Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14720>	2022-01-26 22:05:53 +00:00
Michel Zou	a7ae3bcdf8	zink: fix unused variable warning fixes: `4ed30be3` Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14739>	2022-01-26 21:51:13 +00:00
Eric Engestrom	dcbabb9c9d	docs/release-calendar: add another 21.3.x since 22.0 has been delayed a bit Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14737>	2022-01-26 20:43:36 +00:00
Eric Engestrom	956c93e154	docs: update calendar and link releases notes for 21.3.5 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14737>	2022-01-26 20:43:36 +00:00
Eric Engestrom	b486dfd1a4	docs: add release notes for 21.3.5 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14737>	2022-01-26 20:43:36 +00:00
Jason Ekstrand	fe71f4d9ec	.mailmap: Switch Jason Ekstrand to @collabora.com Jason is starting at Collabora on the 24th. More details at https://www.jlekstrand.net/jason/blog/2022/01/hello-collabora/ Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14586>	2022-01-26 20:08:31 +00:00
Yiwei Zhang	96acd0933e	tu: VkExternalImageFormatProperties is optional ..even if external image info has valid external handles. Fixes: `26380b3a9f` ("turnip: Add driver skeleton (v2)") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14730>	2022-01-26 16:35:10 +00:00
Ruijing Dong	49e02d076d	radeon/vcn: Updating render_pic_list for correction In order to keep track of reference frame buffer address changing, using past_ref to compare with render_pic_list, once the one in past_ref is valid and if render_pic_list has that entry, it will need to update it to the latest one in ref[i]. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5868 Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14646>	2022-01-26 15:28:55 +00:00
Ruijing Dong	cf16368977	frontend/va: Keep surface buf addr before reallocation The reference buffer address is used as the indication in h264 DPB Tier2, when reference buffer was reallocated, h264 DPB would lose track of that reference picture. Adding a pointer obsolete_buf in vlVaSurface data structure for tracking this released buffer, also in h264_picture_desc adding a private field, which contains past_ref[16] for tracking previously released buffer vs current buffer for reference frames. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5868 Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14646>	2022-01-26 15:28:55 +00:00
Mike Blumenkrantz	8747715aec	zink: reorder fbfetch flag-setting to avoid null deref this avoids dereferencing pg->dd which is allocated a few lines later Fixes: `417477f60e` ("zink: always use lazy (non-push) updating for fbfetch descriptors") fixes (radv): dEQP-GLES31.functional.blend_equation_advanced.basic.multiply Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14718>	2022-01-26 14:59:56 +00:00
Rhys Perry	7a0cf7f6d1	radv: fix optimized MSAA copies with suballocated images Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `deb4685df3` ("radv: implement optimized MSAA copies using FMASK") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5829 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14705>	2022-01-26 10:54:33 +00:00
Mike Blumenkrantz	e4218e5c4d	zink: handle bogus xfb draws drawing unpopulated xfb data is legal(?) and tested in cts, and the correct operation is to just drop the draw, so do that here fixes (nvidia): GTF-GL46.gtf40.GL3Tests.transform_feedback2.transform_feedback2_api Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14694>	2022-01-26 04:00:42 +00:00
Iván Briano	61ece8f6a4	anv: Enable VK_KHR_dynamic_rendering Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	5d9e8bc9be	anv: implement the meat of VK_KHR_dynamic_rendering Includes a fake framebuffer allocation that's necessary for the code we still use from the regular render passes. v3: (Lionel) - Reuse the attachment count from the faux render pass, remove now unused function - Add a cmd_buffer_end_rendering function to match begin_rendering, making use of the split stuff from end_subpass v4: (Lionel) - Don't bother with mark_images_writen or resolves on suspend case - Remove flush at the end of end_rendering, it's not needed Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	4ad9ccd28a	anv: split end_subpass into more discrete components v3: Split cmd_buffer_end_subpass instead of doing parts conditionally (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	2f942f3217	anv: Split attachment clearing code into their own functions v3: Avoid recalculating parameters the caller already had (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	e41436beec	anv: allocate fake render pass for continuation command buffers v4: Assert if there's no VkCommandBufferInheritanceRenderingInfoKHR (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	b32023573d	anv: Split out state attachments allocation Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	5d9aaea31f	anv: allocate fake render pass on pipeline creation v3: (Lionel) - Handle VkPipelineRenderingCreateInfoKHR not being present - Rename dynamic_pass and set it for regular render passes too v4: C99 is good (Lionel) Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	73ed019bec	anv: add functions to set up fake render passes There's two of them because they can be created from three points in the code that provide different details and this is the least ugly way I could think of for now. v2: Avoid allocations (Lionel) v3: Move definition closer to its usage (Lionel) v4: (Lionel) - Simplify anv_dynamic_pass_init_full - Zero out pass/subpass to avoid stall pointers Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Iván Briano	b18bc028ee	anv: Remove unused struct member Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Jason Ekstrand	6612dcc425	anv/pass: Don't set first_subpass_layout for stencil-only attachments Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13980>	2022-01-25 18:13:51 -08:00
Alyssa Rosenzweig	ee044d445d	panfrost: Remove NO_BLEND_PACKS quirk Now unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	5a8d86f69e	panfrost: Simplify format class selection This was made way more complicated than it needs to be for a Midgard-only pass. The only caller doesn't care about the class, only if it's native or not. Simplify it appropriately. It really isn't that hard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	da9d6a643a	panfrost: Don't set NO_BLEND_PACKS on Bifrost It doesn't make sense on Bifrost -- the only consumer of the quirk is pan_lower_framebuffer, a Midgard-only pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	93f6c6586c	panfrost: Remove MIDGARD_{NO_TYPED_BLEND_STORES,MISSING_LOADS} These "quirks" are common for Midgard, yet are only consumed by pan_lower_framebuffer -- a Midgard-only pass. So the quirks should be removed and inlined into their users. Thid removes MIDGARD_QUIRKS altogether. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	db497c27dc	panfrost: Remove NO_TILE_ENABLE_MAP quirk Function of architecture. Add a comment to the sole consumer of the quirk bit about why it's used. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	bbe0517481	panfrost: Remove MIDGARD_BROKEN_FP16 quirk Unused. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	d4575bcc79	panfrost: Remove MIDGARD_SFBD quirk Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Alyssa Rosenzweig	47a7e26954	panfrost: Remove HAS_SWIZZLES quirk It's a function of the major arch. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14724>	2022-01-26 01:45:09 +00:00
Jesse Natalie	ed42b129ef	d3d12: Set caps for tesselation Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	278b30723f	d3d12: Handle input clip array size in the shader key Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	df302f5f90	d3d12: Update varying creation logic to handle location_frac When multiple variables are packed into the same location, we need to re-construct variables that read/write the same components of that register so that the DXIL signature is correct. We could try to merge these variables, but getting the types right sounds harder than just preserving the multiple individual variables. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	1a231ec805	d3d12: Add a state variable for patch_vertices_in Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	de438f381f	d3d12: Handle passthrough TCS in the case where eval is bound Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	224c1562c1	d3d12: Handle patch_vertices and patch topology Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	c83476ff13	d3d12: Link tesselation control and eval shaders GLSL puts a bunch of tessellation info in the eval shaders, because passthrough control shaders can exist. D3D12 puts it in the control (hull) shader instead. So, when specializing, copy info from domain to hull. For initial compiles (no domain shader), just make something up. D3D12 also requires the domain and hull shaders to have identical patch constant signatures. Use the existing infrastructure and extend it to also propagate patch constants. Notably, patch constant locations are outside of the 64-bit range value so they require a separate pass to avoid shifts larger than 64. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	0ed7b44f5c	d3d12: Initial plumbing for tesselation Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	22156821ea	d3d12: Enable PIPE_CAP_TGSI_TEXCOORD This is required to be able to use the necessary number of varyings, otherwise we hit asserts because mesa/st starts assigning varyings locations above 64 due to the +9 reserving these texcoords. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	973bff335f	microsoft/compiler: Handle clip/cull distance as an input to tess shaders In order to get the semantics right, we need to know how many of the clip/ cull fields are designated for which purpose. In the case of a shader that can receive these fields as both input and output, the shader_info property is reserved to store the output info. We could add a dedicated input field to shader_info, but since it'd probably only be useful for us, just send it through a side channel during shader linking. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	1c4667bc9f	microsoft/compiler: Location_frac needs to be included in sort order Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	4da51aa88f	microsoft/compiler: Primitive ID should only be added as a sysval in geometry shaders Docs say that its presence in signatures as a "shadow" element (meaning it's not accessed via load/store, but with a dedicated opcode) is legacy. It seems it wasn't carried forward when HS/DS were added in D3D11. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	ec415a274e	microsoft/compiler: Emit DS PSV validation and entrypoint metadata Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	9aca56b137	microsoft/compiler: Handle domain location intrinsic Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	8524d04783	microsoft/compiler: Handle load_output in the HS stage as reading a previously written patch constant Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	a39881b788	microsoft/compiler: Handle load_per_vertex_output as LoadOutputControlPoint Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	a550c059c7	microsoft/compiler: For load_input from DS, use loadPatchConstant Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	cc6104dd3f	microsoft/compiler: For store_output from HS, use storePatchConstant In HS, store_per_vertex_output maps to storeOutput in DXIL. The data that isn't per-vertex is patch constants. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	97b6ea71a0	microsoft/compiler: Add a pass for hull and domain shaders to shrink tess level vars DXIL validation will complain if the tess factor signature entries have the wrong number of components for the shader's domain. Make sure that both hull and domain shaders have the right number, and drop loads and stores from the removed components. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	bd2a4fb1b8	microsoft/compiler: Add patch constant signature into PSV and as container blob Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	26247d506e	microsoft/compiler: Gather patch const signature and handle tess factor in it Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	8e42891f69	microsoft/compiler: When sorting patch varyings, adjust location to be in normal varying range This way, patch varyings come before the patch sysvals (tess levels). Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	4bb4d0454d	microsoft/compiler: Overlap patch and non-patch varyings so both are separately 0-indexed Also add tess factors to the list of sysvals that can cause vars to be sorted last. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	b7da3f8647	microsoft/compiler: Fix I/O signatures for tess shaders - Skip patch variables, those go into a separate patch constant signature - Use nir_is_arrayed_io and only strip one level of array when it's true Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	b346f28453	microsoft/compiler: Emit HS PSV validation and entrypoint metadata Note that this requires the shader info "tess" data to be correct. For GLSL tess control shaders, only the output primitive count is automatically available. The rest will need to be either guessed or filled in from a matching tess eval (domain) shader. This is handled by the d3d12 driver in a later patch. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	6c58e1f448	microsoft/compiler: Delete misleading TODO comments about semantic table Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> We've been writing a valid semantic table for a while now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	f511354a64	microsoft/compiler: Split hull (tess ctrl) shaders into main and patch constant funcs Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	ad2233616c	microsoft/compiler: Handle store_per_vertex_output for HS outputs Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	41af962099	microsoft/compiler: Emit all NIR functions into the DXIL module Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	f6a333f010	microsoft/compiler: Emit functions with actual function names Once we start writing multiple functions, we can't keep calling all of them "main" Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	72812fe9b5	microsoft/compiler: Support emitting multiple functions into a DXIL module The instruction and block lists are moved into a new "function definition" struct, and the DXIL module tracks one at a time for adding instructions into. The NIR side still only emits the main function here though. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	e5f353f2f2	microsoft/compiler: Emit statically-indexed resource handles and scratch later The resource declarations are module-wide, but the resource handles are function-local. A future change will add multi-function support, but requires these handles to be potentially emitted multiple times. The alloca used for scratch is also function-local. This is the same pattern that the DXBC to DXIL converter uses. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	02f46b67cd	microsoft/compiler: Fix typo in enum entry Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	b5cb81f8c1	microsoft/compiler: Add mapping from MESA_SHADER_* to DXIL_*_SHADER for tessellation Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	8ad0393abe	microsoft/compiler: Getting a builtin function with an undeclared signature should be unreachable Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	4ad72b152c	microsoft/compiler: Multi-row output semantics need to write multiple never_writes_masks Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	5e3d64d067	microsoft/compiler: Semantic table should be de-duped for multi-row semantics too Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	218f8302d2	microsoft/compiler: Use driver_location instead of location for inter-stage varying index in GL In the case of two vars being packed into the same register / location, they'll still get unique driver_location, which is what we need. This does require some tweaks to stream output handling, which also needs to produce the varying index. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Jesse Natalie	14ed624ff3	microsoft/compiler: Force integer I/O vars to use flat/constant interpolation Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14399>	2022-01-26 01:31:35 +00:00
Dave Airlie	9b961b9d1d	mesa/st: refactor program translation into one file. This moves the notify callback into the file where it's all called from. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	8dfe3c83b6	mesa/st: move program new/delete into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	afce8654df	mesa/st: move st_vertex_program to gl_vertex_program in mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	5730772e36	mesa/st: move new ati fragment shader to mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Dave Airlie	3faa21bda7	mesa/st: collapse st_program into gl_program object. Remove the subclass for this. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14700>	2022-01-26 00:42:59 +00:00
Jordan Justen	8db5937f94	intel/genxml: Extend length of 3DSTATE_DEPTH_BUFFER for gfx12.5 The two added dwords are MBZ. Ref: bspec 46935 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14702>	2022-01-26 00:22:54 +00:00
Jordan Justen	315d632977	intel/genxml: Extend length of 3DSTATE_WM_HZ_OP for gfx12.5 The added dword is MBZ. Ref: bspec 46981 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14702>	2022-01-26 00:22:54 +00:00
Mike Blumenkrantz	0ca6273713	zink: add anv (icl) fails mesa/mesa#5918 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14722>	2022-01-26 00:07:58 +00:00
Mike Blumenkrantz	5e748770b9	zink: never use SpvOpImageQuerySizeLod for texel buffers this is illegal cc: mesa-stable affects KHR-GL46.texture_buffer.texture_buffer_texture_buffer_range Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14696>	2022-01-25 23:56:36 +00:00
Mike Blumenkrantz	5dc28ccb79	zink: update radv fails list Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14717>	2022-01-25 23:43:34 +00:00
Mike Blumenkrantz	0d2f854795	zink: update nv fails more passes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14716>	2022-01-25 23:01:24 +00:00
Caio Oliveira	448a840b39	intel/fs/xehp: Add unit test for handling of RaR deps across multiple pipelines. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Paulo Zanoni	d107a0bff8	intel/fs: Assert the GPU supports 64bit ops if present at lower_scoreboard time. On platforms where we don't support 64 bit instructions we shouldn't pass such instructions for the code generator to lower into supported instructions, because this makes their execution pipeline unpredictable to the scoreboard lowering pass on XeHP+ platforms. We really should be reducing all these 64 bit instructions before code generation, so here we add an assert to help us catch and fix these cases more easily. Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> [ Francisco Jerez: Also allow has_integer_dword_mul. ] Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	79fb7f9de8	intel/fs: Perform 64-bit CLUSTER_BROADCAST lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	bdf8ac2466	intel/fs: Honor strided source regions specified by the IR for CLUSTER_BROADCAST. This fixes a bug in the CLUSTER_BROADCAST code generation that causes the original IR region to be ignored, this will be a problem when we start lowering 64-bit CLUSTER_BROADCAST instructions at the IR level, since it will lead to instructions with non-trivial regioning. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	6c8782c135	intel/fs: Perform 64-bit SEL_EXEC lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	9449b71bdd	intel/fs: Perform 64-bit SHUFFLE lowering in the lower_regioning pass. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	d2d72fccf1	intel/fs: Fix destination suboffset calculations for non-trivial strides in SHUFFLE codegen. One of the two SHUFFLE implementations wasn't taking into account the destination stride at all, and the other (more commonly used) one was taking it into account incorrectly since brw_reg::hstride represents the stride logarithmically, so we need to use a left-shift operator instead of product. Found by inspection. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	d1038197f3	intel/fs: Take into account region strides during SIMD lowering decision of SHUFFLE. This fixes a bug in the handcrafted SIMD lowering done by the SHUFFLE code generation, which wasn't taking into account the source and destination region strides while deciding whether it needs to split an instruction. v2: Use new element_sz() helper instead of left shift. (Lionel) Fixes: `90c9f29518` ("i965/fs: Add support for nir_intrinsic_shuffle") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	44e48751d2	intel/fs: Teach the lower_regioning pass how to split instructions of unsuported exec type. This adds some generic infrastructure that allows splitting any instruction into a number of instructions of a smaller legal execution type. This is meant to replace several instances of handcrafted 64bit type lowering done manually in the code generator, which is rather error-prone, prevents scheduling of the lowered instructions, and makes them invisible to the SWSB pass on Gfx12+ platforms, which will become especially problematic on Gfx12.5+ since the EUs introduce multiple asynchronous execution pipelines which the SWSB pass needs to be able to synchronize to one another, so it's critical for the real execution type of the instruction to be visible to the SWSB pass. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	539c879a6b	intel/fs: Move legal exec type calculation into helper function in lower_regioning pass. Right now the execution type lowering functionality of this pass assumes that an integer type of the original bit size is always acceptable, however we'll want more complex behavior than that in order to leverage this pass to automate the lowering of unsupported 64-bit operations into multiple 32-bit operations. In order to do that calculate the closest legal execution type from a new helper function, and take advantage of that function from the has_invalid_exec_type() helper, along the lines of other lower_regioning() helpers structured as a pair of has_invalid_foo() + required_foo() functions. This shouldn't have any functional changes. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Francisco Jerez	3886e63033	intel/fs/xehp: Merge repeated in-order read dependencies instead of replacement. Previously the software scoreboard structure would drop previous dependencies for a given register and replace them with the most recent one for the same register when a new instruction (or set of instructions) is processed. This worked correctly on the Gfx12LP platforms this code was originally designed for, because a repeated dependency on the same register would either require the second instruction to synchronize against the first (so the first dependency could be disregarded from that point on) or require the dependency to be RaR and in-order, which allows the synchronization to be optimized out (the first dependency could still be disregarded as well, since the pipeline is in-order). However the latter assumption will break on upcoming Gfx12HP platforms, because they have multiple asynchronous FPU pipelines, so whenever we hit a RaR dependency we need to propagate forward both dependencies, since the order in which both reads will complete is not guaranteed by the hardware in cases where they occur from different asynchronous pipelines. Note that this dependency propagation change requires us to change the definition of dependency::done as well, since that constant is defined to discard any previous dependency information when used as argument for shadow(). This has been reported to fix the following conformance failures on DG2: KHR-GL46.shaders.uniform_block.random.all_per_block_buffers.19 dEQP-GLES3.functional.shaders.derivate.fwidth.* Reported-by: Tapani Pälli <tapani.palli@intel.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5670 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14273>	2022-01-25 22:40:44 +00:00
Alejandro Piñeiro	4ab6631949	vc4/nir_lower_blend: update write mask when we update num components As explained at the header of the lowering: "Once this pass is done, the color write will either have one component (for single sample) with packed argb8888, or 4 components with the per-sample argb8888 result." So in several cases the lowering was updating the number of components, so we need to update the writemask too. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14708>	2022-01-25 21:24:03 +00:00
Bas Nieuwenhuizen	67220077ed	radv/amdgpu: Use aligned sizing for IB buffers. Otherwise aligning might run over buffer size ... Fixes: `1f36f6b83f` ("radv/winsys: use same IBs padding as the kernel") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14644>	2022-01-25 21:02:41 +00:00
Bas Nieuwenhuizen	ef40f2ccc2	radv/amdgpu: Fix handling of IB alignment > 4 words. We reserved space for chaining by subtracting 4 words from max_dw, but then the new alignment code in radv_amdgpu_cs_finalize ended up running all over that. That resulted in going over buffer size when chaining. When lucky you'd get a crash, and when unlucky other stuff might happen. This always adds the 4 words at the end, but initializes with NOP by default. That way we still adhere to the alignment rules. Fixes: `1f36f6b83f` ("radv/winsys: use same IBs padding as the kernel") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14644>	2022-01-25 21:02:41 +00:00
Dave Airlie	06504fb9e2	mesa: consolidate setting no error state and checking suid. This makes MESA_NO_ERROR and mesa_no_error via drirc do the same thing. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14701>	2022-01-26 05:30:35 +10:00
Samuel Pitoiset	047992821b	radv/ci: mark dEQP-VK.api.version_check.version as expected failure on Stoney Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	08c6f437cf	radv: advertise Vulkan 1.3 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	923309e201	radv: bump conformance version to 1.3.0.0 for RDNA2 We can't report conformance for an older major API version and this is required to pass dEQP-VK.api.driver_properties.conformance_version. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	2a88e21570	radv: switch a bunch of struct/enum to 1.3 versions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	852197537e	radv: add a no-op version of vkGetPhysicalDeviceToolPropertiesEXT() It seems the vulkan common runtime code exposes VK_EXT_tooling but doesn't (yet) have a fallback if the backend doesn't enable this extension. Implement it as a no-op for a temporary workaround. This fixes crashes with dEQP-VK.api.tooling_info.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	9be4d36d5f	radv: report textureCompressionASTC_HDR as not supported To fix a mismatch with dEQP-VK.api.info.get_physical_device_properties2.features. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	2d12041967	radv: implement 1.3 features/properties Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Jason Ekstrand	cc8eb6f5df	vulkan/runtime: Implement 1.3 features/properties Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Samuel Pitoiset	6a3928615b	vulkan: Update the XML and headers to 1.3.204 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14707>	2022-01-25 15:57:53 +00:00
Michel Dänzer	a429b3dd33	Revert "wsi/x11: Avoid a class of deadlocks in the WSI queue thread" This reverts commit `272fba8e75`. Multiple regressions have been reported against this. Let's revert and maybe try again. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5910 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5913 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14710>	2022-01-25 14:55:12 +00:00
Jordan Justen	4e0eca7dc3	intel/dev: Add device info for RPL Ref: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=52407c220c44c8dcc6aa8aa35ffc8a2db3c849a9 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14664>	2022-01-25 12:47:44 +00:00
Danylo Piliaiev	1b513f4958	tu: add reference counting for descriptor set layouts The spec states that descriptor set layouts can be destroyed almost at any time: "VkDescriptorSetLayout objects may be accessed by commands that operate on descriptor sets allocated using that layout, and those descriptor sets must not be updated with vkUpdateDescriptorSets after the descriptor set layout has been destroyed. Otherwise, a VkDescriptorSetLayout object passed as a parameter to create another object is not further accessed by that object after the duration of the command it is passed into." Copied mostly from ANV. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5893 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14622>	2022-01-25 12:17:41 +00:00
Lionel Landwerlin	0513ff6564	anv: verify that the format supports multisampling We tightened the requirements for multisampling on Gfx7 but didn't format that at the Vulkan level. This will break more conformance tests on Gfx7, but we weren't conformant anyway. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `531b1b7511` ("intel/isl: Strengthen MCS SINT format restriction") Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14679>	2022-01-25 11:57:38 +00:00
Jordan Justen	03cc5a8295	intel/dev: Add device ids for ADL-N Ref: https://cgit.freedesktop.org/drm/drm-tip/commit/?id=7e28d0b26759846485978ada860ef4a427e06c8f Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14666>	2022-01-25 11:26:26 +00:00
Jordan Justen	fd646c2d2f	intel/dev: Add DG1 PCI id 0x4909 Ref: https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/?id=5f0d4214938db66969a50d4b1262307e39f4f2b2 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14665>	2022-01-25 10:41:39 +00:00
Iago Toral Quiroga	f666f70935	v3dv: support VK_KHR_8bit_storage Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	5cec893384	broadcom/compiler: update comment on load_uniform fast-path The comment for 16-bit applies to 8-bit uniforms as well. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	296fde31aa	broadcom/compiler: allow vectorization to larger scalar type Allow to vectorize operations from a smaller bit-size into scalar operations of a larger bit-size. This allows us to turn 2x8-bit into a equivalent scalar 16-bit load/store. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	a248ff0b5b	broadcom/compiler: support 8-bit loads via ldunifa This generalizes the support we added for 16-bit to also handle 8-bit loads via ldunifa. The story is the same: we align the address to 32-bit downwards and we skip any bytes that are not of interest. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	4630f5f016	broadcom/compiler: handle to/from 8-bit integer conversions Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	1b530d948d	broadcom/compiler: support 8-bit general store access Just like with 16-bit, this mode only supports scalar access, but we are already lowering all non 32-bit accesses to scalar. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	84adf89d33	v3dv: expose storagePushConstant16 feature from VK_KHR_16bit_storage Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	f7ff462421	broadcom/compiler: support 16-bit uniforms Since ldunif is a 32-bit instruction we need to demote these to UBO loads, like we do for indirect indexing, with the exception of scalar 16bit uniforms with an offset that is 32-bit aligned. For the exception where we can use lfdunif we read a 32-bit slot from memory where the uniform data is in the lower 16-bit and we will read garbage in the upper 16-bit which we won't use anyway. It should be noted that by using ldunif, we are consuming 32-bit from the uniform stream, but this is fine because if there is valid uniform data in the upper 16-bit (i.e. we had a ivec2 uniform aligned to a 32-bit address), since we scalarize 16-bit loads, we would see another load uniform with an unaligned offset for the second component, which we will demote to UBO. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	4f26f50ae4	v3dv: support VK_KHR_16_bit_storage Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	49a8fa152c	broadcom/compiler: support f32 to f16 RTZ and RTE rounding modes These are required by VK_KHR_16bit_storage. Our hardware, however, doesn't provide any mechanism to decide on the rounding mode of the conversion and it seems to be using RTE, so we implement RTZ in software. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	1f639d5310	broadcom/compiler: implement 32-bit/16-bit conversion opcodes Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	bdb6201ea1	broadcom/compiler: use ldunifa with unaligned constant offset If we know we have a load with a constant offset, then even if it is not aligned to 32-bit we can still produce an aligned offset and then skip over the bytes we don't need. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	2eb6910d96	broadcom/compiler: support ldunifa with some 16-bit loads Even though ldunifa is strictly 32-bit we may be able to use it to load 16-bit values that sit at 32-bit aligned addresses. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	2a420bdf92	broadcom/compiler: lower packing after vectorization The vectorization pass can inject 32_2x16 (un)packing opcodes upon successful vectorization of 16-bit operations into 32-bit counterparts, so make sure we lower these to something our backend can handle. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	4b24373137	broadcom/compiler: implement TMU general 16-bit load/store This allows us to implement 16-bit access on uniform and storage buffers. Notice that V3D hardware can only do general access on scalar 16-bit elements, which we currently enforce by running a lowering pass during shader compile. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	2443e45e76	broadcom/compiler: better document vectorization implications Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Iago Toral Quiroga	765d9feb46	broadcom/compiler: add lowering pass to scalarize non 32-bit general load/store V3D hardware doesn't support vector access for general TMU load/store operations like the ones we use for UBO and SSBO, so we need to split these to scalar operations. It should be noted that we also have a vectorization pass (which runs later, during optimization), that may reconstruct some of these into 32-bit operations when possible (i.e. when the resulting operation is 32-bit aligned). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14648>	2022-01-25 09:08:26 +00:00
Tapani Pälli	05e7e2245b	mesa: change GetProgramiv name length queries to use program resources Program resource queries provide equivalent code, gl_resource_name introduced by commit `dea558cbd2` takes care of ARB_gl_spirv special case where name information is not available. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14636>	2022-01-25 06:30:44 +00:00
Tapani Pälli	1b898d78d8	mesa: move GetProgramInterfaceiv as a shader_query function This matches how _mesa_get_program_resourceiv was done and this makes it possible to skip some validation and shader program lookup when calling it from glGetProgramiv. Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14636>	2022-01-25 06:30:44 +00:00
Emma Anholt	61400f8a2d	nir/lower_locals_to_regs: Do an ad-hoc copy propagate on our generated MOV. I noticed the inefficiency in NIR-to-TGSI output while trying to debug a failure handling some arrays in r600. While this makes reading CTS shaders easier, the effect in the real world is pretty limited. From softpipe shader-db: total instructions in shared programs: 2929840 -> 2929836 (<.01%) instructions in affected programs: 118 -> 114 (-3.39%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14321>	2022-01-25 06:01:13 +00:00
Chia-I Wu	ef325d4650	freedreno/drm, turnip: set DRM_RDWR for exported dma-bufs This allows the exported fds to be mapped for writing. My use case is for virtio-gpu blob resources where the fds are mapped rw and mappings are added to the guests using KVM_SET_USER_MEMORY_REGION. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14699>	2022-01-25 05:32:38 +00:00
Neha Bhende	bdf1163c2a	svga: enable PIPE_CAP_IMAGE_STORE_FORMATTED on gl43 capable device With upstream mesa PIPE_CAP_IMAGE_STORE_FORMATTED needs to be set to enable ARB_shader_image_load_store extension. This will reenable GL43 support for svga GL43 capable device Fixes: `3b81d2d30d` ('mesa/st: do not expose ARB_shader_image_load_store if not fully implemented') Tested with glretrace Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14688>	2022-01-25 03:37:26 +00:00
Thomas H.P. Andersen	f9ea6e92e9	ci: debian-android: drop -Wno-error=extern-initializer Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14654>	2022-01-25 00:26:45 +00:00
Thomas H.P. Andersen	23135aece1	vulkan/vk_extensions_gen: fix -Wextern-initializer warning vk_android_allowed_device_extensions is already declared as extern in vk_extensions.h Fixes a warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14654>	2022-01-25 00:26:45 +00:00
Kenneth Graunke	09072a0803	iris: Fix and refactor check for clear color being fully zero I missed updating this code to check res->aux.clear_color_unknown when I added it a while back. While we're here, also refactor this code into a helper function - I'll want to use it in another place shortly. Fixes: `e83da2d8e3` ("iris: Don't try to CPU read imported clear color BOs") Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	053251f18c	iris: Implement iris_blorp_exec() for the blitter engine This splits iris_blorp_exec() into separate functions for executing on the render command streamer and the blitter command streamer. A future patch could add a separate iris_blorp_exec_compute() path that skips a bunch of render-specific work. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	e00985d5d4	iris: Set BLORP_BATCH_USE_{COMPUTE,BLITTER} flags for the target batch This makes blits, copies, and (non-fast) clears set the appropriate BLORP_BATCH_USE_{COMPUTE,BLITTER} flag if their batch is either IRIS_BATCH_COMPUTE or IRIS_BATCH_BLITTER. We ignore the other operations for now as those don't support compute or blit yet. Of course, there is no code to attempt to launch BLORP operations on either the compute or blitter batches yet, but that will come in time. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	cc03726165	iris: Only have one blorp_batch_init/finish in iris_copy_region() This is a little simpler, and gives us one place to change flags. Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	a90a1f15a7	iris: Create an IRIS_BATCH_BLITTER for using the BLT command streamer We removed all the hardware blitter support from i965 years ago because the blitter was not worth using (limited functionality, bad performance, extra synchronization, and worse). However, on Tigerlake there are new blitter commands that are actually fast and allow us to do proper asynchronous copies while 3D is busy doing other work. So, reintroduce the blitter. We'll want to use it. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	31eeb72e45	blorp: Add support for blorp_copy via XY_BLOCK_COPY_BLT This introduces a new blorp_copy() path using the new XY_BLOCK_COPY_BLT blitter command introduced on Tigerlake. Unlike the blitter commands of old, this one is actually fast and worth using. Although it doesn't use shaders like the rest of BLORP, we still can use some surface-munging code from there, and BLORP also provides a nice place to put this which is shared among the drivers. To use the new path, set BLORP_BATCH_USE_BLITTER (much like Jordan's recent BLORP_BATCH_USE_COMPUTE bit) and target the batch at the copy engine. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	4d4f57b15c	isl: Add isl_dev->mocs.blitter_{src,dst} fields These will be used for XY_BLOCK_COPY_BLT on XeHP. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	abd71630fc	blorp: Add a blorp_address::local_hint flag This will be used as a performance hint for XY_BLOCK_COPY_BLT to indicate whether the source/destination surfaces are (likely) in device-local memory or system memory. We don't need to be precise here - it's okay to set the fields to LOCAL even if a buffer has been evicted out to system memory. We should set this from Vulkan too, but I haven't yet. There isn't a convenient anv_bo field like there is in iris... Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	5262475242	intel/dev: Add a has_flat_ccs flag This should only be set on XeHP. It implies that CCS works via based on the virtual addresses involved and a flat memory carve-out, rather than treating CCS like a surface, or using auxiliary maps. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	3e0bffbad3	intel/genxml: Add XY_BLOCK_COPY_BLT Color Depth enum values Requested by Jason. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Kenneth Graunke	79b199b333	intel: Allow copy engine class in intel_gem_create_context_engines() I want to use I915_ENGINE_CLASS_COPY in iris. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14687>	2022-01-24 23:27:25 +00:00
Mike Blumenkrantz	08ffbc055b	lavapipe: remove unused struct member Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14695>	2022-01-24 22:55:07 +00:00
Thomas H.P. Andersen	373847609c	ci: debian-android: drop -Wno-error=unused-label Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14639>	2022-01-24 22:23:33 +00:00
Thomas H.P. Andersen	73232d000f	anv: drop unused label Added in `053d4c328f` Last usage removed in `0967584549` Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14639>	2022-01-24 22:23:33 +00:00
Mike Blumenkrantz	3dcf275786	vulkan/wsi: add VK_IMAGE_USAGE_INPUT_ATTACHMENT_BIT for swapchain image caps I need this, and drivers can do it, so add it Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14641>	2022-01-24 21:53:21 +00:00
Dave Airlie	f6926efaad	mesa/st: move st_fb_orientation into a mesa function Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:27 +10:00
Dave Airlie	840aabe752	mesa/st: move invalidate_on_gl_viewport to ctx This is cleaner in the gl context now. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:23 +10:00
Dave Airlie	0ba5def21a	mesa/st: move manager colorbuffer interface to gl_context. This just avoids some st_context in main. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:19 +10:00
Dave Airlie	2f14e0d695	mesa/st: move renderbuffer format choosing wrapper into mesa. This moves this and cleans up the results. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:16 +10:00
Dave Airlie	f74648f912	mesa/st: move last of renderbuffer functionality into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:12 +10:00
Dave Airlie	98df4f7a83	mesa/st: migrate blit code into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:09 +10:00
Dave Airlie	5f675630d9	mesa/st: fixup viewport drawable invalidation This moves the code into more appropriate places Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:04 +10:00
Dave Airlie	018251908e	mesa/st: move some fbo helpers around. This inlines one, and moves the other to a more appropriate place Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:04:01 +10:00
Dave Airlie	9c7e79c4b7	mesa/st: move st_new_renderbuffer_fb to manager This is st_manager code really. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:57 +10:00
Dave Airlie	b70b738bd1	mesa/st: move map/unmap renderbuffer code into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:55 +10:00
Dave Airlie	90e4e7cf74	mesa/st: move st renderbuffer code into mesa renderbuffer Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:49 +10:00
Dave Airlie	f88fb21885	mesa/st: move DrawBufferAllocate into mesa. Little bit of refactoring here. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:45 +10:00
Dave Airlie	57dcaac31d	mesa/st: move st_ReadBuffer functionality into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:42 +10:00
Dave Airlie	e9b12fe20e	mesa/st: move validate/discard framebuffer into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:39 +10:00
Dave Airlie	caa7009cff	mesa/st: move render/finish_render texture in to mesa. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:36 +10:00
Dave Airlie	7645e24045	mesa/st: merge framebuffer objects from st to mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:32 +10:00
Dave Airlie	21d4dd8c39	mesa/st: move some renderbuffer code into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:27 +10:00
Dave Airlie	527b6ca412	mesa/st: merge st_renderbuffer into gl_renderbuffer. This touches lots of places Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14675>	2022-01-25 07:03:23 +10:00
Jesse Natalie	0b6caed85c	mesa/st: Lower user clip planes for tess eval too The logic in st_atom_shader.c leads me to believe this was supposed to work, but was incomplete to actually finish it. This fixes compatibility tess tests on d3d12. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14662>	2022-01-24 09:13:57 -08:00
Alyssa Rosenzweig	c85df67f27	pan/decode: Fix missing newlines in error messages Otherwise these error message lines end up truncated, which is a bit annoying. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14575>	2022-01-24 16:20:49 +00:00
Alyssa Rosenzweig	2bdfa4800d	pan/bi: Pull BLEND precolouring out of per-dest loop Indentation fail. This should happen once per instruction, not once per destination. In theory, this is a minor performance win; in practice, it's simply less wrong. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14575>	2022-01-24 16:20:49 +00:00
Alyssa Rosenzweig	1e5bb54f59	panfrost: Only cull polygons The spec says only polygons, not points/lines, should be culled when culling is enabled. The hardware does not make this distinction, so we have to. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14575>	2022-01-24 16:20:49 +00:00
Alyssa Rosenzweig	3f1abda631	panfrost: Use u_reduced_prim for primitive checks Use a Gallium helper that papers over the differences between primitive types, as required by hardware operation. [Cc'd to mesa-stable for use in the next commit.] Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14575>	2022-01-24 16:20:49 +00:00
Rob Clark	2dfebf3487	freedreno/a5xx: Fix clip_mask The clip_mask needs to also take into account rast->clip_plane_enable Fixes: `99838513ae` ("freedreno/a5xx: Add support for clip distances and use them for userclip.") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14643>	2022-01-24 15:51:47 +00:00
Rob Clark	d26cdfac2c	freedreno/a6xx: Fix clip_mask The clip_mask needs to also take into account rast->clip_plane_enable Fixes: `f2ae8d116a` ("freedreno/a6xx: Implement user clip/cull distances") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14643>	2022-01-24 15:51:47 +00:00
Rob Clark	5e2bd30ea4	freedreno: Add FD_DIRTY_RASTERIZER_CLIP_PLANE_ENABLE Add a dirty bit for clip_plane_enable changes. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14643>	2022-01-24 15:51:47 +00:00
Rob Clark	c190356040	freedreno: Pass shader cache key instead of shader key We are going to need to extend the cache key to add state that effects the program stateobj, but not necessarily the shader itself (ie. so ir3_shader_key wouldn't be the correct place to add it). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14643>	2022-01-24 15:51:47 +00:00
Rob Clark	26d591a669	mesa/st: Lowered ucp should still mark rast state dirty Lowered clip planes should respect the enabled/disabled GL_CLIP_PLANEn (aka GL_CLIP_DISTANCEn), which means updating the rast state as well. Signed-off-by: Rob Clark <robdclark@chromium.org> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14643>	2022-01-24 15:51:47 +00:00
Alyssa Rosenzweig	b4108e1d01	agx: Lower UBO loads to use per-element indexing This lets us support indirect access to UBOs easily. The existing constant special case disappears too, since the peephole optimizer can inline the constant later. (note: this is too conservative since we can go up to 16-bit immediates...) Unfortunately, nir_opt_algebraic can't seem to optimize expressions like "((a << 3) + 4) >> 2" to "(a << 1) + 1" which would be necessary for reasonable perf out of this... Fixes: dEQP-GLES2.functional.shaders.indexing.uniform_array.float_dynamic_loop_read_fragment Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14581>	2022-01-24 14:25:18 +00:00
Rohan Garg	1c825d14fb	docs: Update features and new_features for anv Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14681>	2022-01-24 14:18:33 +00:00
Iago Toral Quiroga	a6aa35a091	v3dv: implement VK_KHR_driver_properties Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14680>	2022-01-24 13:56:39 +00:00
Bas Nieuwenhuizen	492dc06fbe	radv: Remove VK_EXT_display_control support in favor of common impl. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14365>	2022-01-24 12:57:35 +00:00
Bas Nieuwenhuizen	3443557364	anv: Remove VK_EXT_display_control support in favor of common impl. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14365>	2022-01-24 12:57:35 +00:00
Bas Nieuwenhuizen	ce95871a50	vulkan/wsi/display: Add common implementation of VK_EXT_display_control. Based on the new vk_sync functions. Copied the version from anv as that seemed more thorough by using the temporary sync payload. However that does mean we have do use the vk_sync functions instead of being able to layer it on top of the dispatch table. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14365>	2022-01-24 12:57:35 +00:00
Rohan Garg	63e91148b7	anv: Enable VK_VALVE_mutable_descriptor_type This change introduces the anv_descriptor_size_for_mutable_type and anv_descriptor_data_for_mutable_type helpers to compute the size and data flags respectively for mutable descriptor types. In order to make handling these types easier we now store a precomputed descriptor stride for all types and use in the in appropriate places. We also need to adjust the compiler to take into account this descriptor stride. To that extent, we now pack the stride into the upper 16 bits alongside the index and the dynamic offset index and use it later to compute the correct offset. Closes: #4250 Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14633>	2022-01-24 12:24:53 +00:00
Jianxun Zhang	e1376b59ef	anv: refactor queue chain Simplify the buffer chaining process with a single loop and a helper function from Lionel Landwerlin's input. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14578>	2022-01-24 11:53:06 +00:00
Iago Toral Quiroga	be11948a95	broadcom/simulator: handle DRM_V3D_PARAM_SUPPORTS_MULTISYNC_EXT Without this the simulator wrapper will abort upon seeing this query, rendering the driver unusable in that context. Also, it seems the simulator environment doesn't quite work with multisync at present, so do not enable it until we figure out what the issue is. Reviewed-by: Melissa Wen <mwen@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14678>	2022-01-24 11:34:00 +00:00
Samuel Pitoiset	a0e8b774fc	radv: stop checking if pipelines are NULL during draws/dispatches This can't happen. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14617>	2022-01-24 11:15:00 +00:00
Kenneth Graunke	77b93cf3b1	iris: Directly access BOs rather than using iris_resource_bo(...) iris_resource_bo() is convenient when we only have a pipe_resource * variable, and don't need to do a lot with it other than get at the BO. When we need to do more with a resource, we usually cast it to iris_resource *, at which point we can just use res->bo instead. This patch updates iris_copy_region to use src_res->bo, dst_res->bo, rather than iris_resource_bo(src) and iris_resource_bo(dst), since we already have those cast versions on hand. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14667>	2022-01-24 10:28:50 +00:00
Iago Toral Quiroga	33c668a0e0	docs/features: flag VK_KHR_create_renderpass2 as implemented for v3dv This was implemented in `3c86292321`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14677>	2022-01-24 10:58:51 +01:00
Samuel Pitoiset	86b8fa9aee	radv: do not restore NULL compute pipelines after meta operations This should be safe as long as the driver restores descriptors and push constants correctly for compute pipelines. This might also reduce the number of compute pipeline changes if eg. consecutve subpass fast clears with compute. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14616>	2022-01-24 08:47:58 +00:00
Samuel Pitoiset	8f70143c50	radv/winsys: set GTT_WC flag for CS IBs on GFX6 It was missing it seems. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14612>	2022-01-24 08:27:03 +00:00
Samuel Pitoiset	447cddd455	radv: fix copying VRS rates if the ds attachment uses mips While the VRS image can't have mips (and no layers because still not supported by RADV), applications might still want to bind a depth/stencil attachment where the base level isn't 0. Found by inspection. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14518>	2022-01-24 08:06:38 +00:00
Samuel Pitoiset	86909babc9	radv: fix copying VRS rates to HTILE if the depth/stencil is cleared If the application binds a fragment shading rate attachment to a subpass and also clears the depth stencil attachment, the VRS rates would have been reinitialized to 1x1 with fast clears. It makes more sense to clear and then copy instead of the opposite. Found by inspection. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14518>	2022-01-24 08:06:38 +00:00
Samuel Pitoiset	7dd456b173	radv: disable attachmentFragmentShadingRate for RADV_DEBUG=nohiz Region based VRS can only work if HTILE is enabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14518>	2022-01-24 08:06:38 +00:00
Sagar Ghuge	c0849a0697	intel/genxml: Add Un-Typed Data-Port Cache Flush field to pipe control Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14676>	2022-01-23 23:46:54 -08:00
Sagar Ghuge	08429da731	intel/genxml: Add L1 Cache Control bit field Add L1 cache control bit field to RENDER_SURFACE_STATE and STATE_BASE_ADDRESS instruction. v1: (Jason) - Add prefix to bit field - Don't miss out STATE_BASE_ADDRESS instruction Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14676>	2022-01-23 23:43:28 -08:00
Thomas H.P. Andersen	2002e87cc3	ci: clean up debian-android no-error list I see no warnings of these in the log of a recent CI build Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14638>	2022-01-24 04:21:49 +00:00
Bas Nieuwenhuizen	51416b1a12	util/fossilize_db: Fix double free in error handling. If the file ptr is not NULL then foz_destroy will also try to destroy it. Fixes: `eca6bb9540` ("util/fossilize_db: add basic fossilize db util to read/write shader caches") Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14671>	2022-01-24 01:09:21 +00:00
Francisco Jerez	4198ca4b3f	iris/xehp: Implement workaround for 3D texturing+anisotropic filtering. Implements a workaround for HSDES#14014414195. Note that this change deviates heavily from the workaround suggested in the HSDES, since all of the suggestions are either costly at runtime or outright non-compliant, so they would require us to apply the workaround selectively for affected applications. Instead of adding hacks to the compiler that manually implement the LOD computation of 3D texturing operations in the shader, initialize an extra sampler state structure in the driver that has anisotropic filtering forced off, and use it instead of the normal sampler state structure whenever a 3D texture is bound to the same sampler unit. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14489>	2022-01-21 23:24:33 +00:00
Jesse Natalie	bbb12b5550	docs: Update d3d12 features Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	75e1948b23	d3d12: Set sample-rate shading and GLSL 400 caps Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	23918d933e	d3d12: When mapping a non-directly-mappable resource for write, readback first For non-discard writes, we need to make sure that the resource has valid contents so they can be updated, not overwritten. We have to skip this when doing asynchronous maps, but that should be okay, because the threading context should only do asynchronous map-write when the resource is known to be idle/empty. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	7f9ac5bba2	d3d12: Support dynamic UBO/SSBO indexing Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	ce93f114b6	d3d12: Run point sprite lowering pass on multi-stream GS when safe In the case of a multi-stream GS that is attempting to output wide points to stream 0, we can support this by lowering stream 0 to triangles and then removing the other streams. This is only valid to do if the other streams are not being written to stream output, either if they're not present in the SO info or no buffer is bound. Fixes the arb_gpu_shader5/arb_gpu_shader5-emitstreamvertex_nodraw piglit test which does this. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	2a81bd9397	d3d12: Apply GS point sprite lowering to fixed-function point size too Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	bcbfbb8efd	d3d12: Report number of GS streams Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	223a05dc53	d3d12: Temp resources for same-resource copies can be MSAA too Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	2da7db759b	d3d12: Relax multisampling direct copy requirements D3D has supported partial copying of MSAA resources as as long as the sample counts match since D3D10.1 Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	e5cf19fced	d3d12: Modify shaders when MSAA is disabled I couldn't find this in a spec but the builtin-gl-sample-mask piglit seems to expect writing to the output sample mask to do nothing when max num samples == 0. The ForcedSampleCount property should make everything appear as if MSAA is disabled. However, it's undefined behavior if depth is bound, so in that case, we can at least use a lowering pass to make things look like MSAA is off, unless you use atomics to count invocations. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	aef777c95d	d3d12: Report sample positions Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	173e373803	d3d12: Lower load_sample_pos to load_sample_pos_at_id D3D doesn't have an intrinsic for loading the current sample's position, only for loading a specific sample's intrinsic. Fortunately, we can also just get the current sample. So do that. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	7ad089c66e	d3d12: Sample mask output needs to be uint-typed Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	f915bc56a4	microsoft/compiler: Lower helper invocations DXIL adds this in SM6.6, so when we get around to being able to emit SM6.6, we can conditionally turn this off and support emitting the new intrinsic. Until then, this is easy. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	1022435845	microsoft/compiler: Handle msb/lsb/bfrev Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	ac94bcf046	microsoft/compiler: Use ibfe/ubfe for bitfield extract instead of lowering to shifts Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	80e782d5ed	microsoft/compiler: Handle bitfield_insert This almost matches what GLSL wants, except for the handling of large widths. You can see this in the lowering algorithm: (('bitfield_insert', 'base', 'insert', 'offset', 'bits'), ('bcsel', ('ult', 31, 'bits'), 'insert', ('bfi', ('bfm', 'bits', 'offset'), 'insert', 'base')), 'options->lower_bitfield_insert'), DXIL's 'bfi' instruction is an inseparable pairing of NIR's 'bfi(bfm(...), ...)', so we just apply the additional bcsel in the backend. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	8938c6e032	microsoft/compiler: Emit samplers as array types Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	7502b08900	microsoft/compiler: Handle load_invocation_id for GS and HS Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	fe6f7b5265	microsoft/compiler: Handle tex texture/sampler offset srcs Note that these offsets are 0-based, so to make them binding IDs we need to explicitly add the base ID to them. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	7748e4799d	microsoft/compiler: Handle input coverage Note that GL requires input coverage for sample execution mode to be only the single bit corresponding to the executing sample, so rearrange things to understand during shader emitting if we're executing per-sample to emit the right coverage value. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	777a42e309	microsoft/compiler: Handle textureGatherCmp Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	30a44e4c3d	microsoft/compiler: Handle 'pull model' explicit interpolation intrinsics Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	5b51842b58	microsoft/compiler: Always have at least one GS active stream DXIL validation will fail if there's no stream that has a valid primitive topology, which is what happens in the case of no active streams. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	a8373ca8de	microsoft/compiler: Handle load_sample_pos_at_id Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	852e243cae	microsoft/compiler: Handle variables declared per-sample Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14624>	2022-01-21 23:08:26 +00:00
Jesse Natalie	413f398cff	ci/windows: Use 2 container stages The first container stage ("build") is for dependencies of the build. These are infrequently-changing things like Visual Studio, LLVM, git, and also meson. The second container stage ("test") currently depends on the first, and adds test dependencies like piglit. This lets us rev piglit without having to rebuild LLVM. Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14637>	2022-01-21 22:38:16 +00:00
Ian Romanick	4d2937ff92	mesa: OpenGL ES 1.1 is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Ian Romanick	b04a186296	mesa: OpenGL 1.4 feature GL_EXT_point_parameters is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Ian Romanick	577b438f98	mesa: OpenGL 1.4 feature GL_EXT_blend_minmax is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Ian Romanick	d45cb3b440	mesa: OpenGL 1.4 feature GL_EXT_blend_func_separate is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Ian Romanick	fbd17ede10	mesa: OpenGL 1.4 feature GL_EXT_blend_color is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Ian Romanick	86db69e7e8	mesa: OpenGL 1.4 feature GL_ARB_texture_env_crossbar is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Ian Romanick	f80d45c515	mesa: OpenGL 1.4 feature GL_ARB_depth_texture is not optional Cheatsheet: _mesa_has_ARB_depth_texture() becomes (true && ctx->Extensions.Version >= _mesa_extension_table[...].version[ctx->API]). The last value is 0 when ctx->API is API_OPENGL_COMPAT and ~0 otherwise. The whole function effectively becomes (ctx->API == API_OPENGL_COMPAT). _mesa_has_OES_depth_texture() becomes (true && ctx->Extensions.Version >= _mesa_extension_table[...].version[ctx->API]). The last value is 0 when ctx->API is API_OPENGLES2 and ~0 otherwise. The whole function effectively becomes (ctx->API == API_OPENGLES2). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14213>	2022-01-21 22:08:33 +00:00
Jordan Crouse	3608bce137	turnip: Update the msm_kgsl.h header with the sanitized 4.19 version The current msm_kgsl.h header in the tree isn't sanitized and the kernel specific macros will confuse a compiler. Copy in the sanitized version of the header from the 4.19 kernel tree which also adds a few new API bits that are currently unused but may be useful some day. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14651>	2022-01-21 21:25:07 +00:00
Connor Abbott	ab5176ec40	tu/blit: Don't set CLAMPENABLE in sampler for 3d path This was copied from the blob before we understood what it did, and it has questionable utility: there's nothing in the GL, Vulkan, or D3D11 specs that require the result be clamped to the underlying range to account for imprecision. And it doesn't make sense at all for cubic filtering, because the result can legitimately be outside the range in some scenarios. Just remove it. This fixes a bunch of tests added in vulkan CTS 1.2.8 to test blitting from compressed textures, which use random inputs and therefore are more likely to hit the out-of-range condition. For example, dEQP-VK.api.copy_and_blit.core.blit_image.all_formats.color.2d.etc2_r8g8b8a8_unorm_block.r8g8b8a8_snorm.general_general_cubic. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14613>	2022-01-21 20:55:46 +00:00
Connor Abbott	bb41d47f2e	freedreno/a6xx: Name texture descriptor bit This appears to do the same thing as CLAMPENABLE on a3xx. That is, it clamps the result to [0, 1] for unorm formats and [-1, 1] for snorm formats after filtering. In particular it's now more easily observable with cubic filtering, because cubic filtering can produce values outside the original range. Presumably this only matters with linear filtering due to rounding errors when computing the weighted average. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14613>	2022-01-21 20:55:46 +00:00
Nanley Chery	f1f65e5bcf	intel/isl: Allow creating MCS in Tile4 memory This enables MCS support on XeHP, now that MCS can be created with a tiling supported by that platform. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14464>	2022-01-21 20:38:05 +00:00
Nanley Chery	f960e398d3	intel/gen125.xml: Increase Auxiliary Surface Pitch See Bspec 43862. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14464>	2022-01-21 20:38:05 +00:00
Nanley Chery	58a843ab14	Revert "intel/isl: Don't reconfigure aux surfaces for MCS" This reverts commit `2f0fbe06e6`. We don't handle the reconfiguration of existing HiZ surfaces, nor do we do so for CCS surfaces. This code path is unused, so we remove it. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14464>	2022-01-21 20:38:05 +00:00
Nanley Chery	531b1b7511	intel/isl: Strengthen MCS SINT format restriction The gfx7 MCS restriction for SINT formats actually applies to multisampling in general. Place the stronger restriction in the format support check and assert this in isl_surf_get_mcs_surf. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14464>	2022-01-21 20:38:05 +00:00
Nanley Chery	bf9466e285	intel/isl: Don't check pitch in isl_surf_get_mcs_surf The pitch check for 16x multisampled surfaces should already be covered by the pitch_in_range check in isl_calc_row_pitch. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14464>	2022-01-21 20:38:05 +00:00
Samuel Pitoiset	c47b8d7bf3	radv: optimize CPU overhead of si_cp_dma_prefetch() slightly Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5008 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14618>	2022-01-21 20:15:52 +00:00
Adam Jackson	272fba8e75	wsi/x11: Avoid a class of deadlocks in the WSI queue thread Here are two facts, each mildly unpleasant, quite nasty taken together: - xcb_wait_for_special_event retries its poll() if the fd woke up but no matching event arrived, without verifying that the special event queue is still registered. - Present gives no in-band notification of window destruction. Now if the window is destroyed before the swapchain we're in trouble. Our WSI thread might be stuck in xcb_wait_for_special_event as we're awaiting a completion that won't come (the pixmap was being presented as the window, and then the window was destroyed, so no more events can happen on that window). The solution is to use xcb_poll_for_special_event, which is non-blocking, and handle the appropriate edge cases. If we've run the event queue but we still don't have an image to acquire, we poke the X server with a request that gently verifies that the window exists, allowing the thread to exit gracefully in the above case. We detect when we're busy-looping, and poll on the X connection for up to 1ms in response to avoid burning the CPU. Acked-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13564>	2022-01-21 19:46:14 +00:00
Bas Nieuwenhuizen	d1530a3f3b	Revert "nir/algebraic: distribute fmul(fadd(a, b), c) when b and c are constants" This reverts commit `a1af902531`. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5423 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14532>	2022-01-21 16:58:11 +00:00
Charles Baker	1b88777e97	Revert "zink: handle vertex buffer offset overflows" This reverts commit `9823b970fb`. From VkPhysicalDeviceLimits [1]: > maxVertexInputAttributeOffset is the maximum vertex input attribute offset that can be added to the vertex input binding stride. The offset member of the VkVertexInputAttributeDescription structure must be less than or equal to this limit. The maxVertexInputAttributeOffset is a limit on the offset of a vertex attribute within a vertex rather than a limit on offsets for vertex buffer bindings. The code to bind temporary buffers can be removed. [1] https://www.khronos.org/registry/vulkan/specs/1.2-extensions/man/html/VkPhysicalDeviceLimits.html Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14572>	2022-01-21 16:44:14 +00:00
Rhys Perry	3712cd2767	radv: use 8x4 workgroups for wave32 RT Beginning of Quake II RTX, 50% resolution scale, RX 6800: 54 -> 56 FPS. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14011>	2022-01-21 16:25:16 +00:00
Rhys Perry	e7002b6f96	radv: use wave32 for raytracing Beginning of Quake II RTX, 50% resolution scale, RX 6800: 48 -> 54 FPS. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14011>	2022-01-21 16:25:16 +00:00
Rhys Perry	2298a96f9f	radv: fix raytracing with wave32 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5452 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14011>	2022-01-21 16:25:16 +00:00
Charles Baker	61e3f549c5	zink: Set vertex binding stride without dynamic state extensions EXT_vertex_input_dynamic_state When both EXT_vertex_input_dynamic_state and EXT_extended_dynamic_state are not available stride is never set and nothing is rasterized as all triangles are degenerate. This fix copies stride into the VkVertexInputBindingDescription array when the graphics pipeline is created when those extensions aren't available. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14549>	2022-01-21 15:30:54 +00:00
Charles Baker	0722cd7a30	zink: Avoid redundant cast to uint on PackHalf2x16 result For example the previous code generates the following sequence of SPIR-V instructions ending with a redundant cast to uint: %2018 = OpExtInst %uint %1 PackHalf2x16 %2017 %2019 = OpBitcast %uint %2018 The new code generates: %2018 = OpExtInst %uint %1 PackHalf2x16 %2017 Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14568>	2022-01-21 15:15:49 +00:00
Charles Baker	985dae7f41	zink: Output PackHalf2x16 to uint not float Fixes InconsistentSpirv validation errors reporting that PackHalf2x16 outputs uint rather than float. For example the previous code generates the following SPIR-V with PackHalf2x16 output as a float: %2018 = OpExtInst %float %1 PackHalf2x16 %2017 %2019 = OpBitcast %uint %2018 The new code generates: %2018 = OpExtInst %uint %1 PackHalf2x16 %2017 %2019 = OpBitcast %uint %2018 cc: mesa-stable Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14568>	2022-01-21 15:15:49 +00:00
Rhys Perry	9e171b6d49	ac/nir: use shorter builder names This makes a lot of lines shorter. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	533118413b	ac/nir: avoid providing an align_mul to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	c0a586bad7	ac/nir: avoid providing a write_mask to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	8951608f08	radv: avoid providing an align_offset to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	552e59aee3	radv: avoid providing an align_mul to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	7833759a41	radv: avoid providing a write_mask to intrinsic builders Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	af51efe195	nir/builder: assume scalar alignment if not provided Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Rhys Perry	e9e1a44872	nir/builder: set write mask if not provided Zero isn't really a valid write mask. If it's provided, use a full write mask. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14455>	2022-01-21 13:45:33 +00:00
Alejandro Piñeiro	275a18322d	v3dv: check correct format when load/storing on a depth/stencil buffer When we create a image view with D24S8 format we made a reformatting to RGBA8UI if the aspect selected is just STENCIL. But when configuring the stores we select the aspects based on the attachment format. Quoting from cmd_buffer_render_pass_emit_stores: /* From the Vulkan spec, VkImageSubresourceRange: * * "When an image view of a depth/stencil image is used as a * depth/stencil framebuffer attachment, the aspectMask is ignored * and both depth and stencil image subresources are used." * * So we ignore the aspects from the subresource range of the image * view for the depth/stencil attachment, but we still need to restrict * the to aspects compatible with the render pass and the image. / const VkImageAspectFlags aspects = vk_format_aspects(ds_attachment->desc.format); So we could ending trying to store on a Z+Stencil buffer, using a RGBA8UI format. So far this only affected some tests when using the simulator (assert). Those were working on the real hw, but probably would fail on other situations, so lets use the original image format on that case. v2 (Iago) Improve comment grammar * Do the same on load too (not just store) v3 (Iago) * Re-word comments. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14635>	2022-01-21 13:24:18 +00:00
Alejandro Piñeiro	5d04b76c09	v3dv: remove unused v3dv_descriptor_map_get_texture_format Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14635>	2022-01-21 13:24:18 +00:00
Melissa Wen	9319ffb53d	v3dv: signal fence when all submitted jobs complete execution We track last submitted jobs by queue type. After all cmd buffer batches have been submitted, we emit a noop job that waits all jobs submitted to each GPU queue complete and signals the fence. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	bce77e758a	v3dv: process signal semaphores in the very last job With multiple semaphores support, we can use a GPU job to handle multiple signal semaphores in the end of a cmd buffer batch. It means, the last job in the last cmd buffer will be in change of signalling semaphores as long as it meets some conditions: 1 - A GPU-job signals semaphores whenever we only have submitted jobs for the same queue (there is no syncobj created for any other type). Otherwise, we emit a noop job that waits on the completion of all jobs submitted and then signals semaphores. 2 - A CPU-job is never in charge of signalling semaphores. We process it first and emit a noop job that depends on all jobs previously submitted to signal semaphores. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	0ab98612ef	v3dv: handle wait semaphores in the first job by queue With multiple semaphore support, we can improve the way we handle wait semaphores considering different job types and cmd buffer batch scenarios, that means: - A GPU job depends on wait semaphores whenever it is the first job submitted to a queue in this command buffer batch (the `first` flag for the job's queue type is set). - For the first CPU job, if there are wait semaphores, we should wait for the CPU and GPU being idle to process the job. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	03a6a82740	v3dv: track submitted jobs by GPU queue type The order in which a GPU job is scheduled is guaranteed within the same queue type (CL, TFU, CSD), but the order of completion of jobs from different queues cannot be guaranteed. Since we have multiple semaphores support now, we can track the completion of the last job submitted to each queue and therefore better determine when gpu is idle. We do it using an array of syncobj (last_job_syncs) for each GPU queue (CL, TFU, CSD). With this, job serialization also become more accurate. We also keep tracking the very last job submitted (last_job_sync became an element of the last_job_syncs array as V3DV_QUEUE_ANY) for the case we don't have multisync support. To help in handling wait semaphores, we set a flag per queue to indicate we are starting a new cmd buffer batch and a job submitted to this queue will be the first. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	fd973218a6	v3dv: enable GPU jobs to signal multiple semaphores In addition to keep a copy of wait semaphores, extend v3dv_submit_info_semaphores to hold a copy of signal semaphores too. With a copy of wait and signal semaphores, we can enable GPU jobs to handle more than one wait and signal semaphores. By now, we don't change the way as we signalling semaphores when all jobs complete, i.e., we still use the master thread to signal semaphores. In this context, no GPU job is actually in charge of signalling, but the support for multiple signal semaphores is done here. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	a7052dcf2c	v3dv: enable multiple semaphores for csd job Whenever v3d kernel-driver supports multisync extension, use it to allow add multiple semaphores as csd job dependency. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	ad09e50129	v3dv: enable multiple semaphores for tfu job Whenever v3d kernel-driver supports multisync extension, use it to enable more than one semaphores in a tfu job. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	ff8586c345	v3dv: enable multiple semaphores on cl submission Whenever v3d kernel-driver supports multisync extension, use it to enable more than one semaphores in cl submission. In CL, we have two kind of job (bin and render), therefore, we need also to determine the stage to sync, that means to add job dependencies/wait semaphores. Also, as we currently process all signal semaphores of a cmd buffer batch together in the submit master thread (when the last wait thread completes), there isn't now a situation in which GPU jobs need to handle signal VkSemaphores. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	85c49db10d	v3dv: check multiple semaphores capability Check if kernel-driver supports multisync extension Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	264dedf690	drm-uapi/v3d: extend interface for multiple semaphores support Extends command submission ioctls to support multiple semaphores through generic ioctl extension design. In this approach, a multisync extension subclasses a generic ioctl extension struct (base) and enables more than one wait and signal semaphores. Multisync extension also uses v3d_queue to specify the wait_stage, i.e. which job should sync before start (wait semaphores). Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	d5bd18fbb3	v3dv: store wait semaphores in event_wait_cpu_job_info Instead of a boolean (sem_wait) in v3dv_event_wait_cpu_job_info, that is used to determine wait condition for jobs put in a wait thread, copy the wait semaphores data and store it as struct v3dv_submit_info_semaphores. In the following patches we enable multiple semaphores in GPU jobs, and therefore we need this data to add wait semaphores as job dependencies for pending jobs submitted from a wait thread. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	d148379edf	v3dv: wrap wait semaphores info in v3dv_submit_info_semaphores Instead of pass pSubmit to queue_submit_cmd_buffer, create a struct v3dv_submit_info_semaphores to wrap semaphores data from VkSubmitInfo. In the next commit, this struct will help to handle wait condition for jobs submitted in a wait event context, since we need to hold this data when handle wait events and pass it to queue_submit_job() called from wait threads. The main goal is to allow multiple wait semaphores in a job submission. Later, this struct will be extended to include a copy of signal semaphores too. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Melissa Wen	09991fc47b	v3dv: drop unused variable on handle_set_event_cpu_job is_wait_thread is passed, but not actually used; and cpu_queue_handle_idle is in charge to handle wait threads spawned before this one. Signed-off-by: Melissa Wen <mwen@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13178>	2022-01-21 10:59:17 +00:00
Pierre-Eric Pelloux-Prayer	3b4d4c7d84	mesa: use less temporaries in build_lighting Preallocating temporaries can cause "out of temporaries" error. Switch to on-demand allocation. Growing temp_in_use would be an alternative, but some drivers support less than 64 temps (see PIPE_SHADER_CAP_MAX_TEMPS) so this wouldn't help them. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5777 Fixes: `272acbed0e` ("mesa: merge STATE_LIGHTPROD parameters") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14590>	2022-01-21 09:59:16 +00:00
Samuel Pitoiset	b8c518f0fb	radv: fix computing the number of color samples if no attachments When no color attachments, the rasterization samples should be used. Fixes: `0222dace90` ("radv: Support VK_KHR_dynamic_rendering for pipeline creation.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5830 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14642>	2022-01-21 09:38:37 +00:00
Pierre-Eric Pelloux-Prayer	4e4a2d0f97	driconf: enable vs_position_always_invariant for Dirt Rally Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5648 Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14614>	2022-01-21 10:00:35 +01:00
Pavel Ondračka	b5b105df96	r300: properly initialize new_vs in r300_draw_init_vertex_shader Fixes the following valgrind warnings: Conditional jump or move depends on uninitialised value(s) at 0x5D1298C: draw_pt_so_emit_prepare (draw_pt_so_emit.c:95) by 0x5D89FD7: llvm_middle_end_prepare (draw_pt_fetch_shade_pipeline_llvm.c:319) by 0x5D13BFF: vsplit_prepare (draw_pt_vsplit.c:229) by 0x5D0D5D8: draw_pt_arrays.isra.0 (draw_pt.c:124) by 0x5D0DA08: draw_instances (draw_pt.c:484) by 0x5D0DEB9: draw_vbo (draw_pt.c:610) by 0x5E847D6: r300_swtcl_draw_vbo (r300_render.c:901) by 0x5D67125: u_vbuf_draw_vbo (u_vbuf.c:1470) by 0x5CFCAD3: cso_multi_draw (cso_context.c:1639) by 0x58FEC37: st_draw_gallium (st_draw.c:186) by 0x5A486BA: _mesa_draw_arrays.part.0 (draw.c:1323) by 0x48B4E27: stub_glDrawArrays (piglit-dispatch-gen.c:12421) Uninitialised value was created by a stack allocation at 0x5E90D3F: r300_draw_init_vertex_shader (r300_vs_draw.c:313) Conditional jump or move depends on uninitialised value(s) at 0x5D175D9: draw_create_vertex_shader (draw_vs.c:70) by 0x5E90E7A: r300_draw_init_vertex_shader (r300_vs_draw.c:366) by 0x5E8751F: r300_create_vs_state (r300_state.c:1950) by 0x594453B: st_create_common_variant (st_program.c:888) by 0x594720C: st_get_common_variant (st_program.c:973) by 0x59484B6: st_precompile_shader_variant (st_program.c:1994) by 0x59484B6: st_finalize_program (st_program.c:2056) by 0x58F0571: st_program_string_notify (st_cb_program.c:128) by 0x5928C9B: st_link_tgsi (st_glsl_to_tgsi.cpp:7514) by 0x5904B53: st_link_shader (st_glsl_to_ir.cpp:178) by 0x58D3E08: _mesa_glsl_link_shader (link_program.cpp:91) by 0x58865C7: link_program (shaderapi.c:1363) by 0x58865C7: link_program_error.part.0 (shaderapi.c:1474) by 0x48DBC5A: stub_glLinkProgram (piglit-dispatch-gen.c:34426) Uninitialised value was created by a stack allocation at 0x5E90D4F: r300_draw_init_vertex_shader (r300_vs_draw.c:313) Based on Filip Gawin's r300: set new_vs.type to PIPE_SHADER_IR_TGSI CC: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14620>	2022-01-21 03:35:29 +00:00
Dave Airlie	5decf569fc	mesa/st: move perf query test to st_context, drop files. This removes the unused cb perf query files Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:20 +00:00
Dave Airlie	3f8e7b8735	mesa/st: drop lots of perfquery wrappers Just direct call into pipe driver. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	78d13fdbb2	mesa/st: drop some bindless wrappers Just call directly into pipe Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	8467047f6d	mesa/st: move memory query into mesa. Drop the gl_memory_info type as it's equiv to the pipe one, and internal Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	6e99b10632	mesa/st: move shader completion into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	03f180d88b	mesa/st: inline st_max_shader_compiler_threads Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	74d40c68dc	mesa/ctx: store screen pointer in ctx as well This is actually useful to have. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	df9d5795c1	mesa/st: move evaluate depth values into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	a64e5c02bd	mesa/st/vdpau: direct call the vdpau functions. This provides versions when vdpau is turned off, removes dd.h entries Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	0f8a3a7175	mesa/st: drop release all sampler views wrapper Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	31d3e3ebeb	mesa/st: move st_TexParameter into mesa Some places this just passes an always true pname, so just call sampler view invalidate directly Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	96b57faf35	mesa/st: drop useless tex parameter calls. st_TexParameter never does anything for these Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	9244c792f9	mesa/dd: drop GetProgramBinaryDriverSHA1 Just call direct into state tracker Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	4a055c1940	mesa/st: move pin l3 cache to direct check/call. Drop another dd.h entry Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	10ac88b72f	mesa/st: drop emit string marker device table entry. Just check for the gallium callback instead Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	5618fac786	mesa/st: directly call the uuid get funcs. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	ec0d62ceb5	mesa/st: drop last user of st_Enable. Move the debug output piece into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	84fe99b2a0	mesa/st: migrate debug callback code into mesa Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	bc122e0769	mesa/st: remove st_context from debug callback Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Dave Airlie	e344a117af	mesa/st: move intel blackhole noop enable to frontend Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14632>	2022-01-21 01:18:19 +00:00
Mike Blumenkrantz	129e31cd4f	zink: hook up planar image format creation it'll explode if used for anything, but this is how it's done Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13865>	2022-01-21 01:02:18 +00:00
Mike Blumenkrantz	bff042fd43	zink: link with vulkan utils Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13865>	2022-01-21 01:02:18 +00:00
Ian Romanick	926d78a645	ntt: Extend ntt_compile::addr_declared and ntt_compile::addr_reg This was identified by Coverity. `4bb9c0a28a` added uses of a third address register, but the arrays for tracking address registers only have two slots. Add back a version of the assertion from before `4bb9c0a28a` to help prevent future problems. I don't think any drivers that would hit this path use NIR-to-TGSI yet, so it may be moot. Reviewed-by: Matt Turner <mattst88@gmail.com> CID: 1496942 CID: 1496944 Fixes: `4bb9c0a28a` ("nir_to_tgsi: Use the same address reg mappings as GLSL-to-TGSI did.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14487>	2022-01-21 00:25:38 +00:00
Rhys Perry	495debebad	nir/algebraic: optimize expressions using fmulz/ffmaz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	14b8227083	nir: add some missing nir_alu_type_get_base_type Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	f2fbba7920	nir/algebraic: optimize open-coded fmulz/ffmaz This pattern will be found in future versions of D3D9 DXVK. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	312a284980	nir/algebraic: add ignore_exact() wrapper Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	f68797ead7	aco: create v_mac_legacy_f32/v_fmac_legacy_f32 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	43e32ad074	aco: consider legacy multiplications in optimizer Optimize omod, -(ab), b2f(a)b, a1, a0 and create MAD/FMA. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	e7f91b194a	radv,aco,ac/llvm: implement fmulz and ffmaz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Rhys Perry	7f05ea3793	nir: add nir_op_fmulz and nir_op_ffmaz Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13436>	2022-01-20 22:54:42 +00:00
Ian Romanick	945fb51fb5	intel/fs: Fix gl_FrontFacing optimization on Gfx12+ It's not obvious why the (gl_FrontFacing ? -1.0 : 1.0) case was handled different for Gfx12+ than for previous generations, and it's not correct. It tries to negate the result as an integer, and it does this before the mask operation that clears the other bits in the value. When we eventually support dual-SIMD8 dispatch, the other front-facing bit is in g1.6 at bit 15, so similar code should be possible there. Reviewed-by: Matt Turner <mattst88@gmail.com> Fixes: `c92fb60007` ("intel/fs/gen12: Implement gl_FrontFacing on gen12+.") Closes: #5876 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14625>	2022-01-20 22:37:18 +00:00
Mike Blumenkrantz	4aaedc20c1	zink: fix non-modifer dmabuf usage drivers/hardware lacking VK_EXT_image_drm_format_modifier can still use dmabuf, but that setup has to do the old copy to linear scanout instead of copy to modifier scanout this requires a couple extra checks to be added to handle the case Fixes: 619438bf7ce ("zink: check EXT_image_drm_format_modifier for dmabuf support") fixes #5836 Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14597>	2022-01-20 21:33:54 +00:00
Renato Pereyra	4d95a7f800	anv: add helper methods related to enabling CCS for external images Also, clarify/improve related comments Signed-off-by: Renato Pereyra <renatopereyra@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14416>	2022-01-20 11:37:29 -08:00
Renato Pereyra	b664349973	anv: Enable implicit CCS for external images AUX and clear state is stored in the VkDevice private binding Signed-off-by: Renato Pereyra <renatopereyra@google.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14416>	2022-01-20 11:37:15 -08:00
Mike Blumenkrantz	a4c9276de2	docs: add features/relnotes for zink sparse texture support Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	f86e97ab34	zink: ARB_sparse_texture2 there is no vulkan driver that can currently pass all these tests, and some of the tests themselves are broken, but this seems like it should be correct Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	7c918a807b	zink: enable ARB_sparse_texture pipe caps Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	0959fd7f34	zink: handle sparse texture miptail commits basically just allocate pages for miptail levels (probably just one) and then bind them separately since they're probably never going to be batched Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	7e03554af0	zink: batch sparse texture binds do 10 binds per submit now (4 is enough for cts but yolo) Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	30bd4ff72e	zink: handle min_lod texture operands Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	ac30051a5d	zink: emit sparse residency cap in ntv Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	d76694a18f	zink: only allocate ntv residency info if it will be used odds are it will never be used, so don't bother allocating Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	7e7c94afaa	zink: add nir_intrinsic_image_deref_sparse_load to image scanning in compiler this flags the shader as having image use Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	60d0a0a8ce	zink: always pass shader info to ntv Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	0af2b7740b	zink: rename zink_so_info -> zink_shader_info start passing more useful info to ntv Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	73ef54e342	zink: handle residency return value from sparse texture instructions this one's a bit tricky since vulkan doesn't support vec5, the return from the instructions is a struct, and I don't want to add temp var support to zink now instead the process for these ops is: * rewrite the is_sparse_texels_resident instruction to read the first vec member of the texop * (temporarily) decrement num_components for sparse texop's dest to get real result size * wrap texop's return type in spirv-required struct(uint, result) * unwrap struct, store result normally + store residency info to separate array * for is_sparse_texels_resident, ignore the mov alu for src[0] and instead use the ssa index from the parent instr since this is the original texop that was used to store the residency result Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	dfc74d703e	zink: always set actual_dest_type for ntv tex instruction emission no-op for now Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	3c05646fbe	zink: implement sparse shader instructions in ntv this automatically wraps the results into the required struct(int, result) type, handling will come next note that there is no cts coverage for sparseImageLoadARB, so this is purely hypothetical Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	1bbcd68d5f	zink: fake sparse R9G9B9E5 support as needed these just allocate the whole thing now, which means they aren't actually sparse, but who cares because nobody but cts is actually going to use it and those tests pass just fine Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	d83b52928c	zink: stop allocating such massive staging regions for buffer maps this would allocate a new stream uploader for every map if the offset was large (e.g., all sparse buffer usage), which almost immediately consumes all vram cc: mesa-stable fixes KHR-GL46.CommonBugs.CommonBug_SparseBuffersWithCopyOps Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	7753ca2a45	zink: allow sparse buffers to be suballocated this is now symmetrical since the backing memory was being cached, and there's no reason not to allow this since memory is no longer in use by the time it gets returned to the cache Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	9fd155697a	zink: support sparse texture range commits this is a bit duplicated because the buffer and image commit code is a little shared but not enough to combine without becoming spaghetti this will only get worse once multisampling is supported Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	d320e8328d	zink: set up image create bits for sparse textures Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Mike Blumenkrantz	2e57c2c029	zink: add get_sparse_texture_virtual_page_size hook Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14381>	2022-01-20 15:51:30 +00:00
Danylo Piliaiev	cadcbed258	tu: expose VK_KHR_copy_commands2 Relevant CTS tests: dEQP-VK.api.copy_and_blit.copy_commands2.* Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14623>	2022-01-20 10:43:31 +00:00
Charles Giessen	6ea7a61d7a	v3dv: Update LoaderICDInterfaceVersion to v5 With the proper version checking in the common vulkan instance code (commit `88b9b68`) it is now possible to bring the reported interface version up to v5. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14563>	2022-01-20 07:25:07 +00:00
Charles Giessen	4e0604279d	freedreno, tu: Update LoaderICDInterfaceVersion to v5 With the proper version checking in the common vulkan instance code (commit `88b9b68`) it is now possible to bring the reported interface version up to v5. Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14563>	2022-01-20 07:25:07 +00:00
Charles Giessen	6eb8ceac87	lavapipe: Update LoaderICDInterfaceVersion to v5 With the proper version checking in the common vulkan instance code (commit `88b9b68`) it is now possible to bring the reported interface version up to v5. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14563>	2022-01-20 07:25:07 +00:00
Charles Giessen	0988e4ae09	anv: Update LoaderICDInterfaceVersion to v5 With the proper version checking in the common vulkan instance code (commit `88b9b68`) it is now possible to bring the reported interface version up to v5. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14563>	2022-01-20 07:25:07 +00:00
Charles Giessen	a2ec6bf60f	panvk: Update LoaderICDInterfaceVersion to v5 With the proper version checking in the common vulkan instance code (commit `88b9b68`) it is now possible to bring the reported interface version up to v5. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14563>	2022-01-20 07:25:07 +00:00
Charles Giessen	80a99ae906	radv: Update LoaderICDInterfaceVersion to v5 With the proper version checking in the common vulkan instance code (commit `88b9b68`) it is now possible to bring the reported interface version up to v5. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14563>	2022-01-20 07:25:07 +00:00
Dave Airlie	8733d19f53	meson: start building intel earlier. as intel perf is a big impact, start building the intel subdir earlier so there is less chance of long stalls at the end waiting for one file to link other things. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14630>	2022-01-20 06:41:17 +00:00
Dave Airlie	acc2d08cf9	intel/perf: use a function to do common allocations This cuts the compile time down for this file on my ryzen from real 1m4.077s to real 0m30.827s Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14630>	2022-01-20 06:41:17 +00:00
Tapani Pälli	521ede8451	mesa: refactor GetProgramiv to use program resource list This way we make sure glGetActiveUniform and glGetProgramiv are in sync about active uniform count. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5885 Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14589>	2022-01-20 05:52:53 +00:00
Emma Anholt	7a8d651d50	ci/softpipe: Drop the GS sampling known-flakes. They haven't appeared in the last half a year since I added the IRC flake reports. Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14627>	2022-01-20 05:41:07 +00:00
Emma Anholt	bee77d3a82	softpipe: Request that st fix up DST_ALPHA blending for RGB render targets. Fixes a render target of dEQP-GLES31.functional.draw_buffers_indexed.random.max_implementation_draw_buffers.0 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14627>	2022-01-20 05:41:07 +00:00
Emma Anholt	263faa3dfb	softpipe: respect !independent_blend_enable for color masks. blend_buf is the resolved "are we using independent blending?" index, cbuf is the RT we're drawing to. Cc: mesa-stable. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14627>	2022-01-20 05:41:07 +00:00
Mike Blumenkrantz	0c31ab34d2	lavapipe: fix ptralloc typo these calculations are so tricky I can't even type them again Fixes: `48fde98b79` ("lavapipe: replace hard pointer calcs in dynamic render with ptralloc") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14631>	2022-01-19 21:05:03 -05:00
Guilherme Gallo	9314950726	ci: Add docs for Linux Kernel uprevs Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14514>	2022-01-20 01:29:45 +00:00
Dave Airlie	4b8d84f71a	mesa/st: merge texture obj/image alloc/free into mesa This just drops the st wrappers for alloc/free of texture images and objects. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:54:08 +10:00
Dave Airlie	cd0961dce2	mesa/st: merge texture object/image structs into mesa This just merges the subclasses into main class Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:54:03 +10:00
Dave Airlie	ea3e700e35	mesa/st: cleanup last bits of st perfmon code. Just some small cleanups left to finish perfmon code movement. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:54:01 +10:00
Dave Airlie	bc9b176aef	mesa/st: move perfmon code from st into mesa This merges the perfmon code after the objects have been merged. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:58 +10:00
Dave Airlie	1e2ded21ba	mesa/st: merge perfmon groups init/cleanup from st into mesa This moves the init/cleanup code from st into mesa. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:55 +10:00
Dave Airlie	350dbb000e	mesa/st: merge perfmon counters/groups objects from st into mesa This merges subclassed or side allocated objects into the main ones. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:50 +10:00
Dave Airlie	a3099f885e	mesa/st: merge perfmon object from st into mesa This just merges the perf mon subclass into the base class, and cleans up the result. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:48 +10:00
Dave Airlie	0fb946da94	mesa/st: merge transform feedback code from st into mesa After the objects are merged, this moves the rest of the code from the state tracker into mesa. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:44 +10:00
Dave Airlie	ecb99724a3	mesa/st: merge st transform feedback object into gl one. This just merges the object subclass into the main class, this was left separate to ease review. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:42 +10:00
Dave Airlie	b6710b6549	mesa/st: merge condrender code from st into mesa. This merges conditional render code from state tracker into mesa. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:36 +10:00
Dave Airlie	1c73afa462	mesa/st: merge queryobj code from st into mesa. This merges all the query object code from st into mesa. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:33 +10:00
Dave Airlie	0af7c1e385	mesa/st: merge the syncobj code from st into mesa This merges all the syncobj code into the mesa from the state tracker. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:30 +10:00
Dave Airlie	adf0dc7801	mesa/st: merge semaphore objects from st into mesa Take all the semaphore objects code from state tracker and merge it into mesa. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:27 +10:00
Dave Airlie	addcc24f77	mesa/st: merge memoryobjects code from st into mesa This takes all the memory object code from state tracker and merges it into mesa, cleaning it up on the way. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14327>	2022-01-20 10:53:18 +10:00
Dave Airlie	ed0046c5b4	glsl: drop glheader.h include. This is unused. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	16fa442057	mesa: split struct gl_config into it's own header. avoids context.h/mtypes.h deps. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	4346f39299	mesa: more mtypes.h cleanups Add more from pepp Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	090f900173	docs: update docs for new extension header. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	07edaa1409	vbo: drop unused mtypes.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	7db6f9b8fc	glsl: drop some more context.h/mtypes.h interactions Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	7b7f627600	glsl/fp64: move context.h dependent checks into main. allows dropping context.h include. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	23b361ae12	glsl: move off mtypes.h in lots of places. This moves to the new split out header files, should mean less recompiling for unrelated changes. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	a7b9b4086c	mtypes: move gl_shader_variable to shader_types.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	79834f4def	mtypes: move bindless image/sampler objects to shader_types.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	0506c99ccd	mtypes: move uniform shader types to shader_types.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	57a7915fac	mtypes: move transform feedback internal structs to shader_types.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	cc38e6e7d3	mtypes: more gl_active_atomic_buffer to shader_types.h allows dropping mtypes.h in glsl compiler Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	0eb50f738d	mtypes: move gl_program to shader_types.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	d6dfa370ee	mtypes: move gl_linked_shader and gl_shader_program to new shader_types.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	fa5788b889	mesa: move ast_to_hir.cpp off mtypes.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	74fa9c0620	glsl: move ast_function.cpp off mtypes.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	b4aa10c089	glsl: avoid rebuilding builtin functions on mtypes.h changes. Restrict to when shader types or consts change Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	72123213ce	mesa: move some gl shader types to shader_types.h. This moves some of these to further removes mtypes.h from the GLSL compiler. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	b40958cbff	glsl: remove some deps on mtypes.h. This should reduce having to rebuild parts of the GLSL compiler when mtypes.h changes. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	02bab148f9	mesa/mtypes: move matrix enums to shader_enums.h These are used in the compiler backend also. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	379fc6b269	mtypes: split gl extensions and consts out into a separate header Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	717a720e9c	mesa: drop unused context parameter to shader program data reference. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Dave Airlie	3868b30fc4	glsl/parser: extract consts/exts/api out of context at start. This stores these pointers separately. in theory now gl_context can be made more opaque later, if we split header files ups. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14437>	2022-01-20 00:20:06 +00:00
Simon McVittie	949b5787ee	meson: Try to link all-targets module if Gallium OpenCL is enabled If we don't do this, and we are statically linking LLVM (-Dshared-llvm=disabled) while using a version of LLVM compiled for a general-purpose distribution, then the link fails with undefined references to the functions called by LLVMInitializeAllTargets(). Using a version of LLVM that was built specifically for Mesa will sometimes mask this: if the only backends built into LLVM are backends that we specifically link anyway, then the link will succeed. According to commit `80817b6e` "meson: Adjust Clover's required LLVM modules", all-targets is not available when using CMake to locate LLVM, so make it optional rather than mandatory. This partially reverts commit `80817b6e34`. Resolves: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3962 Resolves: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5609 Signed-off-by: Simon McVittie <smcv@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13720>	2022-01-19 23:00:48 +00:00
Emma Anholt	cac6f633b2	nir/opt_offsets: Use nir_ssa_scalar to chase offset additions. For nir_to_tgsi, I want to be able to fold into the base from a vector load_const, which the ad-hoc scalar chasing couldn't handle. r300: total instructions in shared programs: 1278731 -> 1256502 (-1.74%) instructions in affected programs: 457909 -> 435680 (-4.85%) total flowcontrol in shared programs: 8316 -> 8313 (-0.04%) flowcontrol in affected programs: 5 -> 2 (-60.00%) total temps in shared programs: 213687 -> 213774 (0.04%) temps in affected programs: 13140 -> 13227 (0.66%) total consts in shared programs: 952850 -> 949929 (-0.31%) consts in affected programs: 386352 -> 383431 (-0.76%) Fixes: #5781 Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	1048e6113e	nir_to_tgsi: Use nir_opt_offsets for load_ubo_vec4. This helps non-native-integers hardware where relative addressing of UBOs has a constant offset field, and having addressing math (particularly for D3D9) emitted as ALU ops ends up running us out of constants. For native-integers drivers (such as softpipe), the possible-overflow check typically triggers and we end up not folding. r300: total instructions in shared programs: 1279167 -> 1278731 (-0.03%) instructions in affected programs: 50834 -> 50398 (-0.86%) total temps in shared programs: 213736 -> 213687 (-0.02%) temps in affected programs: 598 -> 549 (-8.19%) total consts in shared programs: 952973 -> 952850 (-0.01%) consts in affected programs: 26776 -> 26653 (-0.46%) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	645ca56425	nir/opt_offsets: Also apply the max offset to top-level constant folding. nir_to_tgsi wants this for disabling folding into shared var accesses at all. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	ec4b9909f0	nir/opt_offsets: Disable unsigned wrap checks on non-native-integers HW. Since we don't have 32-bit ints, these checks for 32-bit unsigned wrapping don't help and just reduce optimization opportunities (particularly for DX9 addressing math). Doesn't affect any current consumers. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	700d2fbd0a	nir: Add a .base field to nir_load_ubo_vec4. This lets nir-to-tgsi fold the constant offset of addressing calculations into the CONST[] reference, which is important for D3D9-era compatibility: HW of that age has limited uniform space, and if we do the addressing math as math in the shader for dynamic indexing, the nir_load_consts end up taking up uniforms we don't have available. r300: total instructions in shared programs: 1279699 -> 1279167 (-0.04%) instructions in affected programs: 134796 -> 134264 (-0.39%) total instructions in shared programs: 1279699 -> 1279167 (-0.04%) instructions in affected programs: 134796 -> 134264 (-0.39%) total temps in shared programs: 213912 -> 213736 (-0.08%) temps in affected programs: 2166 -> 1990 (-8.13%) total consts in shared programs: 953237 -> 952973 (-0.03%) consts in affected programs: 45980 -> 45716 (-0.57%) Acked-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Emma Anholt	a98103c55d	nir/lower_dynamic_bo_access: Use copy_inst_indices for our cloned instrs. The ad-hoc index duplication was missing setup of things like the ACCESS or RANGE_BASE fields. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14309>	2022-01-19 22:28:34 +00:00
Dave Airlie	f83f72be8e	intel/brw: drop gl header from the brw backend. This shouldn't be used anywhere now once we drop the GLbitfield64 types. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Dave Airlie	ccbf700d6c	nir: remove gl.h include from nir headers. This saves a lot of pointless gl.h includes across the board, it moves the one place that needs GLenum into a separate file only used in those passes that require it. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Dave Airlie	39bfb25627	includes: add windows lean and mean guard. When we drop gl.h some files pick up windows.h from here without lean/mean and it causes conflicts. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Dave Airlie	1352e0ba0c	mesa/*: add a shader primitive type to get away from GL types. This creates an internal shader_prim enum, I've fixed up most users to use it instead of GL types. don't store the enum in shader_info as it changes size, and confuses other things. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Dave Airlie	d54c07b4c4	mesa/*: use an internal enum for tessellation primitive types. To avoid dragging gl.h into places it has no business being, defined tessellation primitive mode to an enum. This has a lot of fallout all over the place. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14605>	2022-01-19 21:54:58 +00:00
Stefan Brüns	537a0ee3b7	llvmpipe: Add get_{driver,device}_uuid implementations Commit `9da15aa3aa` ("llvmpipe: enable EXT_memory_object(_fd)") enabled the extension, but left this unimplemented. Leaving this unimplemented causes segfaults for anyone trying to retrieve the UUIDs, as the calling code in the state tracker does not check if the function is implemented. This affects e.g. current Wine versions. Set the UUID to all zeros. Although this slightly violates the vulkan specification (since 1.2.146), the UUIDs have to match the ones from lavapipe (lvp_get_physical_device_properties_1_1). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5865 Fixes: `9da15aa3aa` ("llvmpipe: enable EXT_memory_object(_fd)") Signed-off-by: Stefan Brüns <stefan.bruens@rwth-aachen.de> Reviewed-by: Dave Airlie airlied@redhat.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14558>	2022-01-19 21:36:41 +00:00
Dave Airlie	acd0afdba4	amd: move uvd decode definitions to common place This just makes sharing these easier later. Acked-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14607>	2022-01-20 07:07:32 +10:00
Dave Airlie	14551f2bde	amd: move vcn decoding regs + structs to a common file. This just moves the main regs + fw interface structs to a new shared file. Acked-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14607>	2022-01-20 07:07:30 +10:00
Guilherme Gallo	a35c5540e4	ci: freedreno: Update a530 dEQP fail expectation list The test `KHR-GLES31.core.shader_storage_buffer_object.basic-stdLayout_UBO_SSBO-case2-cs` was failing even before the kernel uprev Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14548>	2022-01-19 20:14:43 +00:00
Guilherme Gallo	6f74c95a84	ci: Uprev Kernel to v5.16 Changes in freedreno devices: - Adding CONFIG_DRM_PANEL_EDP, split from CONFIG_DRM_PANEL_SIMPLE Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14548>	2022-01-19 20:14:42 +00:00
Alex Xu (Hello71)	19f88eb858	meson: tlsdesc: minor reformatting, add comments Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14027>	2022-01-19 19:45:22 +00:00
Samuel Pitoiset	96cc300746	radv: fix missing destroy for the overallocation mutex Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14611>	2022-01-19 16:47:56 +00:00
Samuel Pitoiset	6619f855d9	vulkan/runtime: fix accessing NULL pointers detected by UBSAN Fixes: `7a84314c12` ("vulkan/runtime: Add sparse bind support.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14619>	2022-01-19 16:16:08 +00:00
Marcin Ślusarz	44ddc28fa8	Add new rules to .gitattributes This will make sure that all source files have native line endings when checked out locally. Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11779>	2022-01-19 15:17:17 +00:00
Marcin Ślusarz	a4b8f69bb5	radv/ci: add line endings exception for files generated with wine Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11779>	2022-01-19 15:17:17 +00:00
Marcin Ślusarz	745af88783	ci/windows: normalize line endings Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11779>	2022-01-19 15:17:17 +00:00
Marcin Ślusarz	f547122772	microsoft/compiler: normalize line endings Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11779>	2022-01-19 15:17:17 +00:00
Marcin Ślusarz	ed0edcc338	freedreno/rnn: normalize line endings in rules-ng.xsd Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11779>	2022-01-19 15:17:17 +00:00
Rhys Perry	b59764a9fc	aco: use p_extract for SGPR nir_op_unpack_half_2x16_split_y fossil-db (Sienna Cichlid): Totals from 7264 (5.40% of 134627) affected shaders: VGPRs: 548152 -> 548184 (+0.01%) SpillSGPRs: 7615 -> 7607 (-0.11%) CodeSize: 71025152 -> 70993036 (-0.05%); split: -0.05%, +0.00% Instrs: 13386799 -> 13298580 (-0.66%); split: -0.66%, +0.00% Latency: 177464497 -> 177091693 (-0.21%); split: -0.21%, +0.00% InvThroughput: 32185148 -> 32151873 (-0.10%); split: -0.10%, +0.00% VClause: 233167 -> 233166 (-0.00%); split: -0.00%, +0.00% SClause: 468423 -> 468426 (+0.00%); split: -0.00%, +0.01% Copies: 950727 -> 942753 (-0.84%); split: -0.85%, +0.02% Branches: 455919 -> 455901 (-0.00%); split: -0.01%, +0.00% fossil-db (Vega): Totals from 7264 (5.39% of 134762) affected shaders: SGPRs: 738800 -> 738816 (+0.00%) VGPRs: 550264 -> 550344 (+0.01%) SpillSGPRs: 11149 -> 11157 (+0.07%) CodeSize: 67487104 -> 67466772 (-0.03%); split: -0.03%, +0.00% Instrs: 13142106 -> 13061767 (-0.61%) Latency: 209278575 -> 208438854 (-0.40%); split: -0.40%, +0.00% InvThroughput: 81486405 -> 81265773 (-0.27%); split: -0.27%, +0.00% VClause: 222293 -> 222291 (-0.00%); split: -0.00%, +0.00% SClause: 447783 -> 447800 (+0.00%); split: -0.00%, +0.01% Copies: 1243760 -> 1238842 (-0.40%); split: -0.43%, +0.03% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14592>	2022-01-19 12:41:48 +00:00
Matti Hamalainen	bbc4ca5d7d	aux/trace: cosmetic cleanup Fix up some function argument indentation alignments and adjust few other small cosmetics. Signed-off-by: Matti Hamalainen <ccr@tnsp.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14398>	2022-01-19 11:57:17 +00:00
Matti Hamalainen	85a75e42db	aux/trace: implement missing trace calls Some call traces (resource_from_handle, resource_get_handle and resource_get_param) were TODO, so implement them while we are here. Signed-off-by: Matti Hamalainen <ccr@tnsp.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14398>	2022-01-19 11:57:17 +00:00
Matti Hamalainen	32d40f7c80	aux/trace: print enum names instead of integer values in gallium traces Having only magic constants instead of human-readable strings in traces not only hinders readability, but also may affect trace comparision of old and new traces if new enums have been added or modified (thus possibly changing the values of existing ones.) So we implement printing of enum names as strings instead. In order to have those strings, we need to add some new helper functions, which we will automatically generate from header file src/gallium/include/pipe/p_defines.h via a new Python script enums2names.py. We also bolt this all into the Meson build system. Link: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4609 Signed-off-by: Matti Hamalainen <ccr@tnsp.org> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14398>	2022-01-19 11:57:17 +00:00
Lionel Landwerlin	51f6288a2d	genxml: reduce amount of generated code $ wc -l build/src/intel/genxml/genX_bits.h 250581 build/src/intel/genxml/genX_bits.h (before) 5748 build/src/intel/genxml/genX_bits.h (after) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14469>	2022-01-19 10:57:48 +00:00
Lionel Landwerlin	acebea9cf1	anv: fix missing descriptor copy of bufferview/surfacestate content When doing copies of descriptors from one set to another, that contain either a UNIFORM_BUFFER or STORAGE_BUFFER, both the buffer view & surface state are allocated from the source descriptor. Therefore we need to copy their content otherwise we could run into lifecycle issues when the source descriptor is destroyed. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14585>	2022-01-19 10:40:37 +00:00
Lionel Landwerlin	2c875e6ad7	intel/dev: fix ppipe_mask computation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `e86ce98c6a` ("intel/devinfo: deal with i915 topology query change") Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14608>	2022-01-19 10:22:36 +00:00
Thomas H.P. Andersen	9e839620da	meson: add check kwarg to run_command run_command will change the default for the check arg to true in the future. If it is true then meson will exit if the command fails. It must be false here as we check the return code to provide a meaningful error message. With meson 0.61 we get the following warning: WARNING: You should add the boolean check kwarg to the run_command call. It currently defaults to false, but it will default to true in future releases of meson. See also: https://github.com/mesonbuild/meson/issues/9300 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14602>	2022-01-19 04:55:55 +00:00
Mike Blumenkrantz	71673cd2d4	zink: add deqp ci baseline for nv Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14603>	2022-01-19 04:41:55 +00:00
Mike Blumenkrantz	f9892b0f4f	zink: update nv ci baseline Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14603>	2022-01-19 04:41:55 +00:00
Mike Blumenkrantz	806946b094	lavapipe: replace hard pointer calcs in push descriptors with ptralloc Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13678>	2022-01-19 04:10:58 +00:00
Mike Blumenkrantz	48fde98b79	lavapipe: replace hard pointer calcs in dynamic render with ptralloc Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13678>	2022-01-19 04:10:58 +00:00
Mike Blumenkrantz	171ccdbf1c	util: add ptralloc many times it will be the case that an allocation for a block of data needs to be done in one alloc() call such that the members of a struct as well as some extra trailing data are all in the same allocation like ``` struct Test { unsigned a[4]; unsigned c; }; unsigned b; //ptr to uint[8] ``` should be allocated as a single block of (13 sizeof(unsigned)) memory using C pointer offsets to allocate the memory as ``` \| Test \| b \| ``` with something like ``` struct Test t = malloc(sizeof(struct Test) + (8 sizeof(unsigned))); ``` and then set `b` with ``` t->b = ((uint8_t)t) + sizeof(struct Test); ``` this is annoying, awful to read, and (at least for dum-dums like me) prone to errors, however, so having some utility functions which can deliver the same functionality with better readability helps out this case by transforming it to ``` unsigned b; void *ptrs[] = {(void)&b}; size_t sizes[] = {8 * sizeof(unsigned)); struct Test *t = ptralloc(sizeof(struct Test), 1, sizes, ptrs); ``` where `b` is now set to the appropriate offset in memory Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13678>	2022-01-19 04:10:58 +00:00
Kenneth Graunke	0bc7562466	iris: Do primitive ID overrides in 3DSTATE_SBE not SBE_SWIZ Broadwell introduced new fields in 3DSTATE_SBE which allow us to ask the hardware to override Primitive ID for us, rather than requiring us to turn on attribute swizzling and specify per-attribute overrides in 3DSTATE_SBE_SWIZ. We unconditionally enable attribute swizzling today, but this is a step toward letting us think about disabling it in the future. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14210>	2022-01-19 01:31:47 +00:00
Kenneth Graunke	223edb1ec1	iris: Use prog_data->inputs rather than shader info in SBE code. This should be the same thing, but requires looking up less data. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14210>	2022-01-19 01:31:47 +00:00
Kenneth Graunke	d475e839da	intel/fs: Reuse the same FS input slot for VUE header fields. VARYING_SLOT_{VIEWPORT,LAYER,PSIZ} all live in the same VUE header slot, and the FS is already set up to read the x/y/z/w component of that vec4. However, we were setting up the SBE to pass each of those items as a separate FS input, so hypothetically if a shader read all three, we would burn 3 FS inputs with redundant data. Not only was this passing extra data to the FS, but it would count as extra input slots for the "Do we have 16 or fewer attributes?" check for using SBE swizzling to rearrange them in a convenient manner. Now we make them share a single FS attribute and only count them once. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14210>	2022-01-19 01:31:47 +00:00
Emma Anholt	5f0bf2113e	r300: Add consts (uniforms) count to the shader-db output. This is one of the critical metrics for this driver. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14600>	2022-01-19 00:49:57 +00:00
Emma Anholt	cd7b260eeb	r300: Drop unused r300_get_stats() call. Unused since we switched to shader-db. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14600>	2022-01-19 00:49:57 +00:00
Jordan Justen	695ba644ab	intel/gem: Return length from intel_i915_query_alloc Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	83da7dc487	intel/dev: Recalculate max_cs_threads after applying hwconfig changes Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	ab22fb488b	intel/dev: Apply settings from hwconfig if devinfo::apply_hwconfig is set During debug builds, if apply_hwconfig is not set, then the devinfo value will be compared with the hwconfig value. If they don't match then a warning message will be logged to stderr. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	59f32c62c7	intel/dev: Set intel_device_info::apply_hwconfig for DG2 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	a15f18c886	intel/dev: Add intel_device_info::apply_hwconfig This will be used to conditionally use hwconfig values to update intel_device_info at runtime. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	0300351028	intel/dev: Print urb size with intel_dev_info Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	2dd78b9a4f	intel/dev: Add intel_print_hwconfig_table() Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	3a119f0c6d	intel/dev: Add intel_hwconfig_types.h from random post on the internet There is no spec for any of this, but we'll need this to interpret a blob that the kernel is giving us. Ref: https://lists.freedesktop.org/archives/intel-gfx/2021-September/277488.html Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	f0692365a2	anv,blorp,crocus,i965,iris: Use devinfo->max_threads_per_psd for gfx8+ Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Jordan Justen	08a897fa38	intel/dev: Add max_threads_per_psd field to devinfo for gfx8+ Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13866>	2022-01-19 00:29:35 +00:00
Pavel Ondračka	5e2834754e	r300: fix translate_LRP This was broken by `7daba1fe65` Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14596>	2022-01-19 00:19:46 +00:00
Neha Bhende	e6b60c2764	svga: enable GL43 on SVGA GL43 capable device This patch enables GL43 feature for VMware's future GL43 capable SVGA device. Tested with various GL43 apps. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Neha Bhende	1942c06f9c	svga: add GL43 resource validation at draw time This patch adds GL43 tracked states stack and supports GL43 resource validation at draw time. This patch is squash of in house patches to support GL43 on VMware driver. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Neha Bhende	1e99a30738	svga: shader translation for compute, image views and shader buffers This patch handles shader translation for compute, image views and shader buffers and updates the corresponding shader compile keys. It also includes support of using shader raw buffer for shader buffer used as constant buffer. This patch is squash of numerous in house patches. Reviewed-by: Charmaine Lee <charmainel@vmware.com> v2: As pointed out by Thomas, fix revert of `64292c0f` caused by this patch. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Neha Bhende	247c61f2d0	svga: Add support for compute shader, shader buffers and image views This commit is squash of commits which handles resource creation and management for compute shader, shader buffers and image views. It creates uavs for shader buffers and image views. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Neha Bhende	533a09541d	tgsi: Add hw_atomic_declared in tgsi_info This patch also adds hw_atomic_declared info in tgsi_info. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Neha Bhende	dc512f11ea	svga: Add utility to check for GL43 support GL43 support in SVGA driver requires vmwgfx kernel version 2.20 and GL43 capable SVGA device. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Neha Bhende	391a8bbc77	svga: Add GL43 commands support This patch updates SVGA header files and adds support for uav related commands. Reviewed-by: Charmaine Lee <charmainel@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14270>	2022-01-18 23:50:36 +00:00
Mike Blumenkrantz	440beb01d7	zink: enable EXT_external_objects pipe caps Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14498>	2022-01-18 23:31:01 +00:00
Mike Blumenkrantz	4ed30be31e	zink: implement external memory object resource handling this is super simple and can almost fully reuse the existing codepaths Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14498>	2022-01-18 23:31:01 +00:00
Mike Blumenkrantz	32597e116d	zink: implement GL semaphores this is basically just a wrapper around vulkan semaphores, so it maps fairly well the existing fence function was a big ??? and should never have been triggered like it was Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14498>	2022-01-18 23:31:01 +00:00
Mike Blumenkrantz	29285a0e85	zink: add driver/device uuid screen hooks Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14498>	2022-01-18 23:31:01 +00:00
Mike Blumenkrantz	65e34fa70e	zink: add VK_KHR_external_memory_capabilities to instance exts the props for this are weird so they have to be fetched "manually" Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14498>	2022-01-18 23:31:01 +00:00
Mike Blumenkrantz	f0f1a94fcf	zink: add VK_KHR_external_semaphore_fd to device exts Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14498>	2022-01-18 23:31:01 +00:00
Jordan Justen	a11dfc11cf	iris: Use mi_builder for load/store reg/mem/imm functions Ref: `06cf838cbd` ("intel/mi_builder: Support gen11 command-streamer based register offsets") Ref: `6ffdcc335e` ("iris: Use mi_builder in iris_load_indirect_location()") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14340>	2022-01-18 23:11:38 +00:00
Jordan Justen	e29ed39d63	iris: Use mi_builder to set 3DPRIM registers for draws Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14340>	2022-01-18 23:11:38 +00:00
Dave Airlie	fbcbbae662	crocus: only clamp point size on last stage. This fixes a regression in tests that pass unclamped point size values between stages. This keeps xfb broken since the real way it should work is to have the hw clamp after xfb, but this seems the least evil path. Fixes: `3077d96856` ("crocus: Clamp VS point sizes to the HW limits as required.") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14359>	2022-01-18 22:53:45 +00:00
Dave Airlie	f9f7f326fa	intel/compiler: add clamp_pointside to vs/tcs/tes keys. This will be used by crocus and iris to clamp pointsizes only on the last stage of the shader compile. Fixes: `3077d96856` ("crocus: Clamp VS point sizes to the HW limits as required.") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14359>	2022-01-18 22:53:45 +00:00
Dave Airlie	7b6cd912a5	mesa/st: get rid of ST_CALLOC_STRUCT use CALLOC_STRUCT Just tie in the CALLOC_STRUCT/FREE mechanism. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14325>	2022-01-18 22:20:18 +00:00
Dave Airlie	05c8fb6335	mesa/st/perfmon: rebalance CALLOC_STRUCT/FREE this was using FREE but ST_CALLOC_STRUCT, so fix the other way. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14325>	2022-01-18 22:20:18 +00:00
Dave Airlie	7c79c9bfb1	mesa: rebalance the CALLOC_STRUCT/FREE force. There were some CALLOC_STRUCT that were using free, rebalance this. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14325>	2022-01-18 22:20:18 +00:00
Dave Airlie	3b26dbf8f1	mesa/program: don't use CALLOC_STRUCT for instructions. Figuring out the frees here was too much for me. Reviewed-by: Jose Fonseca <jfonseca@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14325>	2022-01-18 22:20:18 +00:00
Cristian Ciocaltea	279cc37ac0	freedreno/ci: Fix dEQP tests expectations on A530 Add a new entry to the 'fails' list. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	da709e8aa7	panfrost/ci: Fix piglit tests expectations on G52 Remove the successful tests from the 'fails' list. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	214ea8e018	iris/ci: Fix piglit tests expectations on amly Found two tests that started to show a flaky behavior, although they are not detected as such by the test runner. Depending on their presence in the 'fails' list, they are reported as "UnexpectedPass" or "Fail". Let's fix this by moving them to the 'flaky' list. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	05c6dea68b	iris/ci: Fix whl dEQP expectations Remove the successful tests from the 'fails' list. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	9f0be629cd	virgl/ci: Fix identification of dEQP binary paths In some cases the file paths passed to crosvm for execution do not point to dEQP binaries, but can be wrapper scripts, like deqp-runner.sh. Detect such cases and skip changing the working directory. Additionally, use the POSIX compliant command substitution syntax instead of the obsolete variant based on backquotes. Fixes: `81f25d8f27` ("virgl/ci: Run each dEQP instance in its own VM") Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	aec7eaf3e3	virgl/ci: Do not hide crosvm output messages In some corner cases like the kernel oops, we do not get the relevant log messages from crosvm process to help with debugging. Note there is currently a double redirection of its stdout stream, but the content eventually ends up in /dev/null. Let's fix this by redirecting both stdout and stderr streams to a dedicated file, to avoid clobbering the output from the script/program running inside crosvm. This is particularly required for the scenario that involves deqp-runner starting crosvm via a *.toml suite. Additionally, drop the unnecessary usage of 'stdbuf' and set the 'quiet' kernel command-line parameter to get rid of the noise generated during crosvm boot process. Although not directly related, do some cleanup by removing the temporary folder on script exit. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	efd4bcad20	virgl/ci: Prevent static link of virglrenderer inside crosvm Ensure virglrenderer library is built before crosvm in order to allow dynamic linking. This is needed for the scenarios where a different virglrenderer library must be provided before launching crosvm, e.g.: the upcoming Virgl CI solution that shares Mesa CI containers. Additionally, this provides the virgl_test_server binary which is required by piglit-runner.sh and deqp-runner.sh scripts when using the virpipe Gallium driver. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	8d1ed9b753	virgl/ci: Force crosvm error when exit code file is missing crosvm-runner.sh doesn't correctly report the script execution status if the exit code file is missing. Fix this by returning 1 when there is no exit code available from the script that was executed. Fixes: `81f25d8f27` ("virgl/ci: Run each dEQP instance in its own VM") Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	de4e56fcf7	ci: Create results folder before starting virgl_test_server Move the statement responsible for creating the results output path before 'virgl_test_server' is started in 'piglit-runner.sh' script. This ensures the server log file will be available in the pipeline job artifacts archive. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	c74fb1da4f	ci: Do not remove cmake In order to enable container reuse in Virgl CI, keep 'cmake' in the container. Additionally, provide the 'check' utility. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	8729c6e981	ci: Support building and installing deqp-runner from source Add support for building and installing deqp-runner from a git source repository to facilitate further development & testing of new features or simply making use of the latest upstream changes which were not already tagged as a new package version. The git revision to be built must be specified by setting 'DEQP_RUNNER_GIT_REV' env variable. To specify a git tag name instead, 'DEQP_RUNNER_GIT_TAG' variable must be used. It is also possible to indicate a custom git repository via 'DEQP_RUNNER_GIT_URL'. If neither a git revision or tag name has been set, deqp-runner is installed from the rust package registry (the default behavior). v2: Make use of '--git' and '--rev' cargo args to automate the git checkout operation (Rohan). v3: Allow Git URL override by using the hardcoded URL only when DEQP_RUNNER_GIT_URL is not already set. v4: Override 'EXTRA_CARGO_ARGS' to optimize the script and avoid a second call to 'cargo install'. Additionally, add support for indicating git tags (Rohan). Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Cristian Ciocaltea	f40e7f0e7b	ci: Uprev deqp-runner to 0.11.0 The updated version offers, among others, improved logging support to help debugging failing tests. Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Rohan Garg	bce9cdf968	ci/piglit: Start vtest server if driver is set to virpipe This allows for running of piglit tests with vtest. Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Rohan Garg	217c03ce4b	ci: Do not remove wget Keep wget for reuse by Virgl CI downstream Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [cristian: Fix conflicts while rebasing on latest main] Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Rohan Garg	196dbb12fd	ci: Move common variables out into a separate file Moving common variables out allows for other projects like virglrenderer to be able to reuse Mesa CI's containers Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [cristian: fixed conflicts while rebasing on latest main; updated tags] Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Rohan Garg	b7bd6ee09d	ci: Do not remove libgbm-dev In order to enable container reuse in Virgl CI, keep libgbm-dev in the container. Signed-off-by: Rohan Garg <rohan.garg@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [cristian: discarded the update of MESA_IMAGE_TAG in debian/x86_build] Signed-off-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14413>	2022-01-18 18:42:05 +00:00
Charles Baker	dcb2f48ed6	zink: Enable VK_KHR_image_format_list for VK_KHR_imageless_framebuffer Validation layer reports that VK_KHR_imageless_framebuffer depends on VK_KHR_image_format_list and that the latter must be enabled explicitly with the former. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14570>	2022-01-18 18:30:22 +00:00
Mike Blumenkrantz	97da9e4bf4	Revert "zink: update gfx_pipeline_state.vertex_strides when necessary" This reverts commit `a21d2bfd77`. my brain was still on vacation and this doesn't do anything Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14593>	2022-01-18 18:18:56 +00:00
Connor Abbott	9c9e8c3349	nir: Reorder ffma and fsub combining It's relatively common to do something like "a * b - c", which on most GPUs can be implemented in a single instruction. Before opt_algebraic_late this will be something like "fadd(fmul(a, b), fneg(c))", and we want to turn it info "ffma(a, b, fneg(c))". But because the fsub pattern was first we instead turned it into "fsub(fmul(a, b), c)". Fix this by reordering them. Selected shader-db results on freedreno: total instructions in shared programs: 1561330 -> 1551619 (-0.62%) instructions in affected programs: 780272 -> 770561 (-1.24%) helped: 1941 HURT: 491 helped stats (abs) min: 1 max: 147 x̄: 7.98 x̃: 4 helped stats (rel) min: 0.07% max: 30.77% x̄: 4.36% x̃: 3.17% HURT stats (abs) min: 1 max: 307 x̄: 11.76 x̃: 5 HURT stats (rel) min: 0.09% max: 18.71% x̄: 2.26% x̃: 1.38% 95% mean confidence interval for instructions value: -4.57 -3.41 95% mean confidence interval for instructions %-change: -3.21% -2.84% Instructions are helped. total nops in shared programs: 358926 -> 356263 (-0.74%) nops in affected programs: 167116 -> 164453 (-1.59%) helped: 1395 HURT: 859 helped stats (abs) min: 1 max: 108 x̄: 6.80 x̃: 3 helped stats (rel) min: 0.17% max: 100.00% x̄: 19.15% x̃: 10.57% HURT stats (abs) min: 1 max: 307 x̄: 7.95 x̃: 3 HURT stats (rel) min: 0.00% max: 381.82% x̄: 20.04% x̃: 10.00% 95% mean confidence interval for nops value: -1.77 -0.59 95% mean confidence interval for nops %-change: -5.55% -2.87% Nops are helped. total non-nops in shared programs: 1202404 -> 1195356 (-0.59%) non-nops in affected programs: 496682 -> 489634 (-1.42%) helped: 1951 HURT: 265 helped stats (abs) min: 1 max: 39 x̄: 4.02 x̃: 3 helped stats (rel) min: 0.07% max: 15.38% x̄: 2.97% x̃: 1.96% HURT stats (abs) min: 1 max: 22 x̄: 2.97 x̃: 2 HURT stats (rel) min: 0.05% max: 10.00% x̄: 1.14% x̃: 0.75% 95% mean confidence interval for non-nops value: -3.38 -2.99 95% mean confidence interval for non-nops %-change: -2.60% -2.36% Non-nops are helped. total systall in shared programs: 288317 -> 292975 (1.62%) systall in affected programs: 87876 -> 92534 (5.30%) helped: 388 HURT: 431 helped stats (abs) min: 1 max: 214 x̄: 14.39 x̃: 8 helped stats (rel) min: 0.25% max: 100.00% x̄: 22.12% x̃: 11.96% HURT stats (abs) min: 1 max: 232 x̄: 23.77 x̃: 7 HURT stats (rel) min: 0.00% max: 1300.00% x̄: 51.71% x̃: 17.30% 95% mean confidence interval for systall value: 3.07 8.30 95% mean confidence interval for systall %-change: 9.49% 23.97% Systall are HURT. (The systall hurt is probably just due to having having fewer instructions to hide latency with.) Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Acked-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14554>	2022-01-18 17:44:50 +00:00
Mike Blumenkrantz	29cb1c7c13	zink: check EXT_image_drm_format_modifier for dmabuf support probably fixes https://gitlab.freedesktop.org/mesa/mesa/-/issues/5836 cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14595>	2022-01-18 17:12:50 +00:00
Juan A. Suarez Romero	68f16660f1	v3d: keep clear color untouched When invoking TLB clear, the color to clear could require swapping and clamping. Do the changes in a copy and leave the original color untouched, as it is passed as constant. This fixes local outside scope issues with Coverity. Fixes: `54cba7d2` ("v3d: clamp clear color") Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14534>	2022-01-18 15:22:07 +00:00
Marek Olšák	afdfcdd542	radeonsi: determine MEM_ORDERED after generating a shader variant because si_get_nir_shader runs NIR passes and some of them can introduce new loads. Fixes: `3fb77ef2e0` - radeonsi: do opt_large_constants & lower_indirect_derefs after uniform inlining Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14528>	2022-01-18 11:11:08 +00:00
Marek Olšák	e5dd32a48c	radeonsi: apply fbfetch/indirect_descriptor to uses_vmem_load_other earlier Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14528>	2022-01-18 11:11:08 +00:00
Marek Olšák	08cc73a218	radeonsi: rename uses_vmem_* flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14528>	2022-01-18 11:11:08 +00:00
Qiang Yu	ee040a6b63	radeonsi: enable ARB_sparse_texture2 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:11:53 +08:00
Qiang Yu	71a77aea79	radeonsi: enable multi sample sparse texture support Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	485ceb5c51	gallium: add multi_sample parameter to get_sparse_texture_virtual_page_size Instead of using actual sample count as parameter, we only use a bool to indicate if the target is multi sample. This is because we don't know the sample count when glGetInternalformativ() case. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	2c994e17c1	mesa/main: export _is_multisample_target for external usage Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	91661a3938	mesa/main: allow multi sample sparse texture Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	697e1cdeef	radeonsi: lower nir_intrinsic_is_sparse_texels_resident Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	7d4b0b7789	glsl/nir: convert is_sparse_texels_resident to nir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	634bb25123	glsl: add sparseTexelsResidentARB builtin function Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	a0b6515ec4	glsl/nir: adjust sparse texture nir_variable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	3dc950118c	glsl/nir: convert sparse image load to nir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:36 +08:00
Qiang Yu	f4a972b748	glsl/nir: convert sparse ir_texture to nir Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	f62bbe44c9	glsl: add vec5 glsl types Used by nir_variable holds sparse texture output which is up to vec5 size. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Singed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	59d94d8188	glsl: add sparse texture image load builtin functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	dcec15d79c	glsl: add _texelFetch related sparse texture builtin function Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	a26dbfc24b	glsl: add _textureCubeArrayShadow related sparse texture builtin func Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	640f909862	glsl: add _texture related sparse texture builtin functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	dd84769c55	glsl: ir_texture support sprase texture Sparse ir_texture will set is_sparse and use struct type to hold both residency code and sampled texel. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	67b4c88512	glsl: add ARB_sparse_texture2 extension Reviewed-by: Marek Olšák <marek.olsak@amd.com> Singed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	d44f725ea6	mesa/main: relax alignment check when ARB_sparse_texture2 available Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	6da5136c66	mesa: add ARB_sparse_texture2 extension Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	fef018c307	gallium: add PIPE_CAP_QUERY_SPARSE_TEXTURE_RESIDENCY Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	5dd9cb1069	gallium/dd_debug: add get_sparse_texture_virtual_page_size Otherwise GALLIUM_DDEBUG=always crash when sparse texture is used. Fixes: `eed8421bba` ("gallium: add screen get_sparse_texture_virtual_page_size callback") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Sigend-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Qiang Yu	2cee73f0f7	nir: fix nir_tex_instr hash not count is_sparse field This fixes nir_opt_cse miss replace a non-sparse tex instruction with a sparse tex instruction and fail the nir_validate_shader(). Fixes: `3a7972f72a` ("nir,spirv: add sparse texture fetches") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14362>	2022-01-18 16:10:35 +08:00
Marek Olšák	3cafa3e852	ac/surface: allow displayable DCC with any resolution (e.g. 8K) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14529>	2022-01-18 01:44:17 -05:00
Danylo Piliaiev	e4c582ee71	tu: support VK_EXT_primitive_topology_list_restart Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14556>	2022-01-17 15:21:03 +00:00
Rhys Perry	d95a0b52e4	nir/unsigned_upper_bound: don't follow 64-bit f2u32() Fixes Doom Eternal crash. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Fixes: `72ac3f6026` ("nir: add nir_unsigned_upper_bound and nir_addition_might_overflow") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14555>	2022-01-17 10:59:21 +00:00
Lucas Stach	0d65f229c5	egl/dri2: short-circuit dri2_make_current when possible If an application calls eglMakeCurrent with the same context and the same draw and read surfaces as the current ones, there is no need to go through all the work of unbinding/flushing the old context and binding the new one. While the EGL spec is a bit ambiguous here, it seems that the implicit flush of the outgoing context only needs to be done when the context is actually changed. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14379>	2022-01-17 10:32:01 +00:00
Lucas Stach	b33ed5406a	egl/dri2: remove superfluous flush when changing the context The flush of the outgoing GL context, as required by the EGL spec for eglMakeCurrent and extended by KHR_context_flush_control, is already performed in the unbindContext call. There is no need to pierce through the layers and unconditionally call glFlush() here. Getting the out-fence FD for explicit fencing needs to move behind the unbindContext, to make sure we are getting the fence for the most recently flushed commands. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14379>	2022-01-17 10:32:01 +00:00
Samuel Pitoiset	88f1918919	radv/winsys: fix zero submit if no timeline semaphore support Kernels that don't support timeline semaphores also don't support transferring syncobjs. Use export/import instead. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5845 Fixes: `967fc415fc` ("radv: Add new submission path for use by the common sync framework.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14538>	2022-01-17 08:38:29 +01:00
Emma Anholt	f6ffefba3e	nir: Apply nir_opt_offsets to nir_intrinsic_load_uniform as well. Doing this for ir3 required adding a struct for limits of how much base to fold in (which NTT wants as well for its case of shared vars), otherwise the later work to lower to the 1<<9 word limit would emit more instructions. The shader-db results are that sometimes the reduction in NIR instruction count results in the fewer sampler prefetches due to the shader being estimated to be shorter (dota2, nexuiz): total instructions in shared programs: 8996651 -> 8996776 (<.01%) total cat5 in shared programs: 86561 -> 86577 (0.02%) Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Emma Anholt	b024102d7c	freedreno/ir3: Use nir_opt_offset for removing constant adds for shared vars. Saves some work in carchase and manhattan31: instructions in affected programs: 2842 -> 2818 (-0.84%) nops in affected programs: 1131 -> 1105 (-2.30%) non-nops in affected programs: 1236 -> 1238 (0.16%) mov in affected programs: 57 -> 61 (7.02%) dwords in affected programs: 2144 -> 2150 (0.28%) cat0 in affected programs: 1195 -> 1169 (-2.18%) cat1 in affected programs: 151 -> 155 (2.65%) cat2 in affected programs: 142 -> 140 (-1.41%) sstall in affected programs: 190 -> 178 (-6.32%) (ss) in affected programs: 63 -> 63 (0.00%) systall in affected programs: 532 -> 511 (-3.95%) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14023>	2022-01-16 19:11:29 +00:00
Alyssa Rosenzweig	9645fa9107	agx: Handle discard intrinsics Lower to `sample_mask = 0`. Actually that implements a demote... doing discard correctly probably requires rewriting the shader control flow to insert a return where necessary... Also, possibly we should be lowering this in NIR to play nice with gl_SampleMask writes but that's a problem for when we understand the hardware better. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14219>	2022-01-16 18:23:28 +00:00
Alyssa Rosenzweig	f248f6623c	agx: Add sample_mask instruction Sets the output sample mask to a given 8-bit immediate or 16-bit register. Also used to implement discards, which is my ES2 interest. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14219>	2022-01-16 18:23:28 +00:00
Alyssa Rosenzweig	dcc12656e3	asahi: Route sample mask from shader Compiler-controlled bit in the cmdstream. Some other magic bits are needed for sample mask writes to work properly. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14219>	2022-01-16 18:23:28 +00:00
Alyssa Rosenzweig	9b57600502	asahi: Rectify confusing XML comment The field was split up... Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14219>	2022-01-16 18:23:28 +00:00
Alyssa Rosenzweig	3341abc5d7	asahi: Break out Fragment Parameters word What the other 31 bits are for is anyone's guess. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14219>	2022-01-16 18:23:28 +00:00
Alyssa Rosenzweig	fc5a72be2f	asahi: Add XML for unknown 0x4a packet Enough bits of this packet are known that open-coding hex bytes for it is annoying. Add some XML correpsonding to what we know. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14219>	2022-01-16 18:23:28 +00:00
Alyssa Rosenzweig	054c5be102	asahi: Warn when hacks mode is enabled I don't want your pesky bug report. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14579>	2022-01-16 12:43:55 -05:00
Alyssa Rosenzweig	011106640f	asahi: Fake more CAPs with dEQP hacks mode Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14579>	2022-01-16 12:43:55 -05:00
Krunal Patel	13b79266e4	frontend/va: Setting the size of VADRMPRIMESurfaceDescriptor Issue: objects[i].size is returned as '0' for all Root cause: The value of objects.size is hard coded to '0' in vlVaExportSurfaceHandle() Fix: Assigning the value by multiplying height and width of the surface Signed-off-by: Krunal Patel <krunalkumarmukeshkumar.patel@amd.corp-partner.google.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14313>	2022-01-16 15:44:17 +00:00
Krunal Patel	76b7e39354	frontends/va: use un-padded width/height in ExportSurfaceHandle Issue: VADRMPRIMESurfaceDescriptor width and height are rounded up to macroblock size (16). Rootcause: surf->buffer's width/height are rounded up to macroblock size (16), so they shouldn't be used here. Fix: Using surf->templ's width/height instead fixes incorrect surface dimensions being sent via VADRMPRIMESurfaceDescriptor. Signed-off-by: Krunal Patel <krunalkumarmukeshkumar.patel@amd.corp-partner.google.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14313>	2022-01-16 15:44:17 +00:00
Erik Faye-Lund	b6cc240db1	bin/gen_calendar_entries: fix newlines on windows The documentation[1] for the csv module specifies that we should specify newline='' when opening the output file. Without that, the module garbles the newlines, writing them as \r\r\n on Windows instead of \r\n. So let's do what the documentation says, and specify newline='' [1]: https://docs.python.org/3/library/csv.html#id3 Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12405>	2022-01-16 08:39:44 +00:00
Erik Faye-Lund	493f688331	ensure csv-files are crlf on disk According to RFC 4189 CSV files should be encoded using CRLF newlines, not LF. This helps compatibility with tools, like python's csv module, who always uses CRLF. While we're at it, normalize the one CSV that was CRLF in-repo to LF, and let git do the newline-normalization when needed instead. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12405>	2022-01-16 08:39:44 +00:00
Alyssa Rosenzweig	b8d37eb1bb	pan/bi: Schedule around blend shader register clobbering By software ABI, a blend shader is permitted to clobber registers R0-R15. The scheduler needs to be aware of this, to avoid moving a write to one of these registers past the BLEND itself. Otherwise the schedule is invalid. This bug affects GLES3.0, but is rare enough in practice that we had missed it. It requires a fragment shader to write to multiple render targets with attached blend shaders, and have temporaries register allocated to R0-R15 that are not read by the blend shader, but are sunk past the BLEND instruction by the scheduler. Prevents a regression when switching boolean representations on: dEQP-GLES31.functional.shaders.builtin_functions.integer.uaddcarry.uvec4_lowp_fragment Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14577>	2022-01-15 21:51:57 +00:00
Alyssa Rosenzweig	77a1514a37	pan/decode: Disassemble Bifrost quietly Although Bifrost clause packing and register assignment is tricky, the relevant code is by now extensively tested, and there's no remaining reverse-engineering here. So disassembling verbosely just adds tons of noise to pandecode without increasing the useful information. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Suggested-by: Icecream95 <ixn@disroot.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:48:32 -05:00
Alyssa Rosenzweig	58accc995b	pan/decode: Don't print Preload twice It's already printed in the RSD itself, no need to print it out-of-band a second time. Removes noise in the pandecode. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:46:12 -05:00
Alyssa Rosenzweig	330bb2c58b	panfrost: Remove FBD pointer on Bifrost XML It's a pointer to a thread storage descriptor, not a framebuffer descriptor. Unlike Midgard, these don't have to alias. The FBD pointer was unused anyway, so remove it to reduce noise in pandecode dumps. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:43:40 -05:00
Alyssa Rosenzweig	ae9316f812	pan/decode: Decode Valhall surface descriptor Instead of incorrectly falling down the Bifrost path. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:40:14 -05:00
Alyssa Rosenzweig	c947a52df4	pan/decode: Add pandecode_dump_mappings Add a helper to dump all mapped GPU memory. This is a blunt, seldom useful instrument ... but when it /is/ useful it's your only option. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:40:13 -05:00
Alyssa Rosenzweig	861fa2baec	pan/decode: Add hexdump helper I think I originally wrote this for Asahi? Should probably be moved to util/... Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:40:13 -05:00
Alyssa Rosenzweig	6752fcf179	pan/decode: Track mmaps with a red-black tree Rather than emulating page tables, poorly, with a hash table, use a red-black tree and store the intervals directly. This is deterministic instead of probabilistic, attaining O(log n) performance for n mapped intervals which is good enough. Unlike the hash table approach, this allows us to iterate intervals easily. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:40:13 -05:00
Alyssa Rosenzweig	a07473b79d	pan/decode: Include addresses for jobs Helpful for contextualizing fault pointers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:40:13 -05:00
Alyssa Rosenzweig	4af20895c5	pan/decode: Remove hierarchy mask check This has never been meaningful. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14543>	2022-01-15 11:40:13 -05:00
Adam Jackson	01f5fffbc6	mesa: Remove unused src/mesa/x86-64 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	a686946553	mesa: Remove unused _mesa_set_sampler_{filters,srgb_decode,wrap} Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	2db87ad344	mesa: Remove unused _mesa_is_front_buffer_{draw,read}ing Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	203df8c264	mesa: Remove unused _mesa_is_alpha_to_coverage_enabled Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	c630408945	mesa/math: Remove unused m_translate.c Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	c828490b26	mesa: Remove unused _mesa_delete_nameless_texture meta was the last user. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	c1fa6bbecf	mesa: Remove unused _mesa_all_varyings_in_vbos Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Adam Jackson	6e3cd05870	mesa: Remove unused _mesa_convert_colors Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14545>	2022-01-15 03:14:00 +00:00
Rob Clark	fcb3b87553	freedreno/decode: Handle chip-id For cmdstream traces from newer devices, we need to identify the gpu based on chip-id. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14564>	2022-01-14 23:17:03 +00:00
Lepton Wu	d15021435e	driconf: Fix unhandled tags in static conf A rule with executable_regexp tag would match every executable without this fix and force_glsl_extensions_warn would be always set to true which breaks some dEQP tests. Fixes: `5740ac3701` ("xmlconfig: Add static driconfig support") Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14562>	2022-01-14 22:09:50 +00:00
Nanley Chery	fe0a6b9606	anv: Don't fill lowered_storage_image_param on SKL+ The switch statement in anv_descriptor_data_for_type() shows that this field isn't used on SKL+. On XeHP, this avoids assert failures by preventing isl_surf_fill_image_param() from being called. That function doesn't expect Tile4 surfaces. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14546>	2022-01-14 21:47:48 +00:00
Felix DeGrood	1027655810	pps: increase intel.cfg buffer size Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	2e3490dd0f	iris: utrace/perfetto support v2: Fixup gpu_id computation, use minor of /dev/dri/* % 128 since we don't know whether we get card0 or renderD128 for instance. (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> (v1) Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	1d2fea6eae	tools/pps: limit intel cfg to 250ms of sampling Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	b462dafbd6	pps: enable anv source in example config file Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	e760c5b37b	anv: add perfetto source v2: Increase custom stall data (Felix) Fixup build (Felix) v3: Add API enum (Rohan) Fixup old comment (Rohan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	b70143f4e3	util/u_process: protect entrypoints for c++ Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	cf9956a8c5	intel/ds: use a per GPU clock ID Multi-GPU setups :) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	61766f9f90	intel/ds: use the right i915_drm.h include location Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	e35a65ae45	intel/ds: don't forget to reset upper dword timestamp read This could lead to confusing if the 32bits roll over (every ~6mn or so). Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `4ef6698a26` ("intel/ds: drop timestamp correlation code") Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	6eb554a9c7	intel/ds: allow user to select metric set at start time Rather than using always the same metric set, let the user choose when starting the producer with : INTEL_PERFETTO_METRIC_SET=RasterizerAndPixelBackend ./build/src/tool/pps/pps-producer Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	69df00b33b	intel/ds: reuse intel_ioctl() Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	cc5843a573	anv: implement u_trace support Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	bb541d1159	intel/blorp: add measure_end entry point Will be useful to figure out when blorp operations end. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	d3724de894	intel/dev,perf: Use a single timescale function Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	02a4d622ed	anv: expose a couple of emit helper to build utrace buffer copies We'll want to copy timestamp buffers when commands buffers are resubmitted multiple times. v2: Merge a couple of #if GFX_VER >= 8 (Rohan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	246e2c74d3	isl: add helpers to printout ops Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	e8ea6a899f	blorp: add description & helpers to printout ops Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Acked-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13996>	2022-01-14 20:17:44 +00:00
Lionel Landwerlin	30a8b8d2df	intel/fs: disable VRS when omask is written As indicated by VkPhysicalDeviceFragmentShadingRatePropertiesKHR::fragmentShadingRateWithShaderSampleMask our implementation will clamp to 1x1 when reading samplemask or writing to samplemask. This fixes vkd3d-proton tests test_sample_mask_dxbc & test_sample_mask_dxil Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b6332fc4a8` ("intel/compiler: handle coarse pixel in render target writes descriptors") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14553>	2022-01-14 19:14:06 +00:00
Chia-I Wu	37fa59fa6c	anv,lavapipe,v3dv: use wsi_common_get_image Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> (anv) Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> (v3dv) Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (lavapipe) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14544>	2022-01-14 17:41:42 +00:00
Chia-I Wu	e6d7e1ec63	vulkan/wsi: add wsi_common_get_image This can be useful with VkBindImageMemorySwapchainInfoKHR. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14544>	2022-01-14 17:41:42 +00:00
Jesse Natalie	dbad53ec6b	docs: Update d3d12 feature list Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14504>	2022-01-14 08:36:38 -08:00
Jesse Natalie	14b1319f29	d3d12: Support ARB_framebuffer_no_attachments Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14504>	2022-01-14 08:36:13 -08:00
Jesse Natalie	0cc79c9c1e	d3d12: When no framebuffer attachments are present, the viewport must be clamped to framebuffer size GL has separate no-attachment framebuffer size parameters, but D3D uses the viewport. Clamp the viewport dimensions to lie within GL's default framebuffer sizes. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14504>	2022-01-14 08:34:23 -08:00
Jesse Natalie	3f22038973	d3d12: When no framebuffer attachments are present, use ForcedSampleCount instead of SampleDesc.Count for MSAA Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14504>	2022-01-14 08:34:23 -08:00
Mike Blumenkrantz	a21d2bfd77	zink: update gfx_pipeline_state.vertex_strides when necessary only necessary without any of the dynamic states and still doesn't fix the problem, but it's a step cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14561>	2022-01-14 16:13:57 +00:00
Jesse Natalie	dd1c6bff29	docs: Update d3d12 features Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	c6e7cdcf38	d3d12: Enable draw and multi-draw indirect Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	3a8c8d25fd	d3d12: Add a compute transformation to handle indirect draws that need draw params Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	a4894fcbfc	d3d12: Handle indirect twoface draws Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	3dd292703b	d3d12: Handle draw indirect and multi-draw indirect Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	c9dc4fa7c1	d3d12: Add a command signature cache for indirect draws Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	529b078718	d3d12: Enable base instance and draw params extensions Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	aaceb10b0f	d3d12: Upgrade first vertex state var into all vertex draw params Add in base instance, draw ID, and is-indexed-draw Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	a98508d092	d3d12: Declare support for inverted conditional render Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	dbc5c3929e	d3d12: Predication fix: For boolean queries used for predication, D3D12 uses uint64, so clear at least a uint64 in the result Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	934dc7b6a0	d3d12: Predication fix: re-enable after restarting a batch if needed Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	9609387a4f	d3d12: Fix re-enabling predication after temporary disablement The equal-zero vs not-equal-zero property was previously lost Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jesse Natalie	07bf8b18b5	d3d12: Export d3d12_get_state_var from d3d12_nir_passes.c Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14486>	2022-01-14 15:54:33 +00:00
Jason Ekstrand	8b3d947267	spirv,radv: Fix some GL enum comments GL_LINE_STRIP_ADJACENCY is 0xB. 0xA is GL_LINES_ADJACENCY. While we're here, drop the ARB prefixes. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14157>	2022-01-14 15:08:09 +00:00
Jason Ekstrand	a1de102479	intel/fs: Use compare_func for wm_prog_key::alpha_test_func Because 0 is no longer a recognizable value (it's NEVER, which isn't a good default), we add an emit_alpha_test bool to tell the back-end when to bother alpha testing. This lets us only touch crocus with the change. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14157>	2022-01-14 15:08:09 +00:00
Jason Ekstrand	460a953df5	intel/compiler: Stop using GLuint in brw_compiler.h Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14157>	2022-01-14 15:08:09 +00:00
Mike Blumenkrantz	5f1ca03c45	aux/trace: add pipe_context::fence_server_signal tracing Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14539>	2022-01-14 14:21:36 +00:00
Danylo Piliaiev	3e7f6c9aeb	tu: implement wsi hook to decide if we can present directly on device This will prevent the driver to take the prime blit path for presentation in scenarios where it can avoid it. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Samuel Iglesias Gonsálvez <siglesias@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11091>	2022-01-14 12:19:57 +00:00
Danylo Piliaiev	fa75b2a027	vulkan/wsi: create a common function to compare drm devices Effectively moves most of v3dv_wsi_can_present_on_device to the common code to be used in other drivers. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11091>	2022-01-14 12:19:57 +00:00
Lionel Landwerlin	e86ce98c6a	intel/devinfo: deal with i915 topology query change i915 does not report slices accurately anymore on Gfx12.5+. Since this is information we need to have for performance queries, we need to rebuild it here. v2: Remove invalid change to pixel pipes computations (Jordan) v3: Fix index calculation (Curro) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14297>	2022-01-14 12:01:05 +00:00
Lionel Landwerlin	6d73426d2a	intel/devinfo: split out l3/pixelpipes counting v2: fix up subslice_total store (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> (v1) Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14297>	2022-01-14 12:01:05 +00:00
Jordan Justen	7609e88d5f	intel/devinfo: Adjust L3 banks for DG2 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Co-Authored-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14297>	2022-01-14 12:01:05 +00:00
Lionel Landwerlin	5f3b0327b7	intel/dev: extract slice/subslice total computation Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14297>	2022-01-14 12:01:05 +00:00
Iago Toral Quiroga	b0f7f1afac	v3d: implement double-buffer mode As with Vulkan, we are not enabling this by default since it may improve of hurt performance depending on the case. Instead, we can selectively enable it by using the V3D_DEBUG environment variable. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14551>	2022-01-14 10:57:27 +00:00
Iago Toral Quiroga	b9f9474577	v3dv: implement double-buffer mode Double buffer mode splits the tile buffer size in half so we can start processing the next tile while the current one is being stored to memory. This mode is available only if MSAA is not enabled and can, in theory, improve performance by reducing tile store overhead, however, it comes at the cost of reducing the tile size, which also causes some overhead of its own. Testing shows that this helps some cases (i.e the Vulkan Quake ports) but hurts others (i.e. Unreal Engine 4), so for the time being we don't enable this by default but we allow to enable it selectively by using V3D_DEBUG. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14551>	2022-01-14 10:57:26 +00:00
Alejandro Piñeiro	821c66e50c	vulkan: return default string for undefined enum Instead of a unreachable. This would avoid an assert on debug builds that uses vkfoo_to_str to print structure types. This will become more common as some tests will start to use VK_STRUCTURE_TYPE_MAX_ENUM to mark structures from unsupported extensions more often. v2 (Jason): * Include enum name on the default message * Handle MAX_ENUM as a special case v3 (Jason): * vk_ObjectType_to_ObjectName don't need to use ${enum.name} Reviewed-by: Jason Ekstrand <jason@jlekstrand.net Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14525>	2022-01-14 10:31:15 +00:00
Iago Toral Quiroga	fbe4d7ccf4	v3dv: implement VK_EXT_4444_formats Because these formats are introduced trough an extension, their enum values are exceedingly large and we cannot use them to index directly into the format table we had for core formats. Instead, we put these in a separate table and we always use the VK_ENUM_OFFSET helper to index into these tables. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14533>	2022-01-14 10:10:10 +00:00
Iago Toral Quiroga	25c46c465d	v3dv: handle formats with reverse flag Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14533>	2022-01-14 10:10:10 +00:00
Iago Toral Quiroga	872f08815b	v3dv: add swizzle helpers to identify formats wit R/B swap and reverse flags Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14533>	2022-01-14 10:10:10 +00:00
Charles Giessen	dbd3935b04	freedreno, tu: Export vk_icdGetPhysicalDeviceProcAddr Support Loader ICD Interface Version 4 by exporting the function vk_icdGetPHysicalDeviceProcAddr. Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14299>	2022-01-14 10:26:13 +01:00
Charles Giessen	b00fe27015	panvk: Export vk_icdGetPhysicalDeviceProcAddr Support Loader ICD Interface Version 4 by exporting the function vk_icdGetPhysicalDeviceProcAddr. Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14299>	2022-01-14 10:26:00 +01:00
Charles Giessen	5a37cc1186	v3dv: Update LoaderICDInterfaceVersion to v4 vk_icdNegotiateLoaderICDInterfaceVersion now correctly identifies the driver as supporting v4. Before, the driver did support the functionality but didn't report supporting it through the negotiate function. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14299>	2022-01-14 10:25:50 +01:00
Charles Giessen	9d013f2c24	radv: Update description of vk_icdNegotiateLoaderICDInterfaceVersion Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14299>	2022-01-14 10:25:41 +01:00
Mike Blumenkrantz	307cdb7147	zink: add some nv ci results just something so I have a vague idea Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14547>	2022-01-13 21:46:34 +00:00
Mike Blumenkrantz	f0eb07f98f	zink: remove SpvMemorySemanticsMakeVisibleMask from nir_intrinsic_memory_barrier this is invalid since vk memory model isn't used cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14540>	2022-01-13 14:57:56 -05:00
Samuel Pitoiset	74b0fbe2e9	radv: enable radv_disable_aniso_single_level for Battlefield 1 & V Both games seem to have a bug. This can also be reproduced with AMDVLK since PAL enabled anisotropic filtering for single level images. Fixes: `5ce4017a2b` ("radv,aco: do not disable anisotropy filtering for non-mipmap images") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5753 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14471>	2022-01-13 16:17:48 +00:00
Samuel Pitoiset	e6173ed1d2	radv: allow to disable anisotropic filtering for single level image with drirc Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14471>	2022-01-13 16:17:48 +00:00
Carsten Haitzler	874f4095c5	panfrost: Don't double-free when handling error for unsupported GPU Setting the screen ro member before we checked gpu id means the error case leads to a double-free because screen->ro is set and allocated by parent who hanbdles de-alloc a second time after we destroyed the screen we just created because ro was set. Cc: mesa-stable Signed-off-by: Carsten Haitzler <carsten.haitzler@foss.arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14516>	2022-01-13 16:01:08 +00:00
Carsten Haitzler	7553aee06b	kmsro: Add komeda DPU Arm Komeda (Mali) display units do exist on a few SoC's. This is the first time that I've seen Mesa brought up "in anger" on one of these, thus it has no such knowledge of these DPUs. This adds that to the kmsro set like a lot of other fairly standard split GPU/DPU devices. Signed-off-by: Carsten Haitzler <carsten.haitzler@foss.arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14515>	2022-01-13 15:25:44 +00:00
Carsten Haitzler	b22294f6d5	panfrost: Add GPU G76 to the set of known ids This is another working GPU so add the ID to the known set. Working on real silicon and tested. Signed-off-by: Carsten Haitzler <carsten.haitzler@foss.arm.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14517>	2022-01-13 14:59:48 +00:00
Mike Blumenkrantz	596d2ab0ad	util/vbuf: fix buffer translation sizing the original change here attempted to fix calculating the maximum bound for the mapped readback buffer by adding the maximum attribute size to the final element used by readback the calculation was erroneous, however, because it instead calculated the maximum offset instead of the size, which would cause a different kind of overrun Fixes: `3c5b7dca30` ("util/vbuf: fix buffer overrun in attribute conversions") fixes #5846 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14479>	2022-01-13 14:28:23 +00:00
Connor Abbott	8f18c72f9a	freedreno/fdl: Fix reinterpreting "size-compatible" formats It's allowed to reinterpret compressed formats as one of a few non-compressed formats with the same pixel size as the blocksize of the compressed format, and vice-versa. If we did this we'd wind up with an incorrect width/height. Fix that. Fixes dEQP-VK.image.sample_texture.*. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14520>	2022-01-13 13:44:14 +00:00
Alejandro Piñeiro	d7cbe17760	v3dv: simplify v3dv_debug_ignored_stype mesa_logd already handles having or not DEBUG defined, and also has a better empty option. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14512>	2022-01-13 13:18:05 +00:00
Samuel Pitoiset	49c1b40290	radv: only clear VRS_HTILE_ENCODING on GFX10.3+ On older chips like GFX9, bit 19 is RB_ALIGNED. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5859 Fixes: `9c746157ae` ("radv: reset VRS if the current subpass doesn't have a VRS attachment") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14530>	2022-01-13 12:57:39 +00:00
Vadym Shovkoplias	fd2fbc558b	glthread: Check out of bounds for MultiDrawElementsBaseVertex cmd Make sure draw count is not high so the MultiDrawElementsBaseVertex cmd can fit the queue buffer. v2: add the same check for _mesa_marshal_MultiDrawArrays and add additional assert to _mesa_glthread_allocate_command (Pierre-Eric) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5719 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14490>	2022-01-13 12:05:11 +00:00
Jordan Justen	70a4e64685	intel: Add disabled device ids for DG2 We are waiting for i915 to enable DG2 in upstream Linux, so for now we use an "#if 0" around the PCI ids. Reworks: * Merged Lionel's "intel/devinfo: store the different kind of DG2" Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14524>	2022-01-13 09:48:51 +00:00
Jordan Justen	4f9141607f	intel: Add device info for DG2 Reworks: * Lionel: simulator_id * Rafael: has_llc * Merged Lionel's "intel/devinfo: store the different kind of DG2" Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14524>	2022-01-13 09:48:51 +00:00
Juan A. Suarez Romero	cd678c7029	v3d/doc: do not expose ARB_shader_image_load_store V3D does not support formatless writings. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14288>	2022-01-13 09:12:35 +00:00
Juan A. Suarez Romero	bd70b4f27f	mesa: fix MAX_GEOMETRY_IMAGE_UNIFORMS check support MAX_GEOMETRY_IMAGE_UNIFORMS are supported if geometry shaders and either ARB_shader_image_load_store or GLES 3.1 are supported. v2: - MAX_GEOMETRY_IMAGE_UNIFORMS shouldn't be supported for GL 3.2 if ARB_shader_image_load_store is not supported (Ilia). - MAX_TESS_{CONTROL,EVALUATION}_IMAGE_UNIFORMS requires tessellation shader support (Anholt). v3: - Use _mesa_is_gles31() function (Ilia). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14288>	2022-01-13 09:12:35 +00:00
Juan A. Suarez Romero	3b81d2d30d	mesa/st: do not expose ARB_shader_image_load_store if not fully implemented So far we were checking ARB_shader_image_load_store is supported as requirement to expose GLES 3.1. But when this extension functionality was added in GLES 3.1 spec, it was relaxed, and one of its requirements, the support for formatless writing, was not included. So this means that a driver that support all the extension functionality except formatless writing, could expose GLES 3.1, but it couldn't expose the extension itself (nor GL 4.2, which requires fully implementation of the extension). v2: - Add the same exposure treatment to ARB_shader_image_size (Ilia). v3: - Remove dependency for OES_texture_buffer (Ilia). - Check image resources for GLES 3.1 (Ilia). v4: - Check for MaxImageUniforms in compute shader (Ilia). - Check for max combined image uniforms/ssbo (Ilia). v5: - Remove ARB_shader_image_load_store from check (Ilia). - ARB_shader_image_store and ARB_shader_image required for ARB_ES3_1_compatibility (Ilia). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14288>	2022-01-13 09:12:35 +00:00
Juan A. Suarez Romero	d9bc018854	d3d12: enable PIPE_CAP_IMAGE_STORE_FORMATTED Required to expose ARB_shader_load_image_store and thus ARB_compute_shader, which are supported by the driver. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14288>	2022-01-13 09:12:35 +00:00
Juan A. Suarez Romero	5e9b43f505	softpipe: enable PIPE_CAP_IMAGE_STORE_FORMATTED Required to expose ARB_shader_load_image_store and thus ARB_compute_shader, which are supported by the driver. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14288>	2022-01-13 09:12:35 +00:00
Lionel Landwerlin	1027b68418	anv: fix perf queries We're not saving the first pool. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `36ea90a361` ("anv: Convert to the common sync and submit framework") Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14420>	2022-01-13 07:54:22 +00:00
Jesse Natalie	545e73d72c	mesa/st: Assert that NIR drivers that support tess use tess levels as inputs Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14401>	2022-01-13 07:31:54 +00:00
Mike Blumenkrantz	01709464a4	aux/trace: copy over stream_output_target_offset method from context this can't be traced, so don't crash cc: mesa-stable Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14481>	2022-01-13 06:09:22 +00:00
Mike Blumenkrantz	d0bba3dfbf	zink: add flake smh Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14527>	2022-01-13 05:55:29 +00:00
Rob Clark	4dc406c748	freedreno: Update chip-ids Counterpoint to https://patchwork.freedesktop.org/series/98772/ Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14506>	2022-01-13 05:26:11 +00:00
Rob Clark	785a324deb	freedreno: Handle wildcard fuse-id in device matching A future kernel update will add fuse-id in the upper bits of the chip_id. Do avoid breaking device matching, add a way to include a wildcard/fallback fuse-id. (Note that this only effects un- released devices.) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14506>	2022-01-13 05:26:11 +00:00
Rob Clark	6b8e3aeeb7	freedreno: Rearrange dev_id_compare() logic We're going to need to add a couple more cases. Let's split up the existing two cases first, rather than piling on more logic to a single expression. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14506>	2022-01-13 05:26:11 +00:00
Rob Clark	9176e27dd2	freedreno: Small dev_id_compare() cleanup We don't really treat the two arguments identically, so rename them to make it clear which one is the device id coming from kernel, and which one is the reference id from the fd_dev_recs table. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14506>	2022-01-13 05:26:11 +00:00
Hyunjun Ko	0a82a26a18	turnip: Porting to common implementation for timeline semaphore Define struct tu_timeline_sync for emulated timeline support in common implementation that is on top of drm syncobj as a binary sync. Also implement init/finish/reset/wait_many methods for the struct. v1. Does not set MSM_SUBMIT_SYNCOBJ_RESET for waiting syncobjs since it's being managed in the common implementation already. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14105>	2022-01-13 04:01:44 +00:00
Hyunjun Ko	479a1c405e	turnip: Porting to common vulkan implementation for synchronization. This patch ports to common code for VkSemaphore, VkFence and relevant APIs like vkCreate(Destroy)Semaphore/Fence, vkGetSemaphoreFdKHR, etc. Accordingly, starts using common vkQueueSubmit with implementing driver-specific hook. Also remove all timeline semaphore codes so that we could use common code in the following patches. This way we could easily see what's modified in the following patch. Note that kgsl is not ported in this patch. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14105>	2022-01-13 04:01:44 +00:00
Hyunjun Ko	58aa920706	vulkan: fix typo Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14105>	2022-01-13 04:01:44 +00:00
Hyunjun Ko	f976f71fb0	turnip: Use the new common device lost tracking Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14105>	2022-01-13 04:01:44 +00:00
Jianxun Zhang	14a4600b62	intel: add swizzle flag into driver uuid Suggested by Lionel Landwerlin, we add has_bit6_swizzle as another input when computing driver uuid. Also fix miscalculation of the length of driver tag. Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13936>	2022-01-13 03:09:36 +00:00
Jianxun Zhang	f43c7185e0	intel: remove chipset_id The chipset_id should be named after i915 ioctl that's called to get the device id. In user space this field holds pci device id in reality. We now have a pci_device_id queried from drm instead using the ioctl, so there is no much reason to keep the chipset_id for the same purpose. Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13936>	2022-01-13 03:09:36 +00:00
Jianxun Zhang	ddfa3924b3	intel: dump PCI info in intel_dev_info Dump PCI bus and device info so that we can easily identify output in a multi-gpu system. Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13936>	2022-01-13 03:09:36 +00:00
Jianxun Zhang	3414ba9a81	anv: remove private pci fields These fields are in the base device struct 'intel_device_info' now. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5489 Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13936>	2022-01-13 03:09:36 +00:00
Jianxun Zhang	d86989bf73	intel: use PCI info to compute device uuid With the new input from PCI bus and device fields, we can compute device uuids in a multi-gpu system. Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13936>	2022-01-13 03:09:36 +00:00
Jianxun Zhang	db8405670a	intel: provide pci bus and dev info in base device struct Having PCI bus and dev info in the base struct 'intel_device_info' enables us to utilize the info across multiple drivers for several purposes, such as computing device uuids in a multi-gpu system. Signed-off-by: Jianxun Zhang <jianxun.zhang@linux.intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13936>	2022-01-13 03:09:36 +00:00
Yiwei Zhang	17b753459e	venus: VkExternalImageFormatProperties is optional It's optional even if VkPhysicalDeviceExternalImageFormatInfo is there. Fixes: `108f386a61` ("venus: initial support for VkPhysicalDevice commands") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14503>	2022-01-13 02:59:51 +00:00
Eric Engestrom	cd6377741d	docs: update calendar and link releases notes for 21.3.4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14526>	2022-01-13 02:55:35 +00:00
Eric Engestrom	7240b37948	docs: add release notes for 21.3.4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14526>	2022-01-13 02:55:35 +00:00
Daniel Schürmann	79a987ad2a	nir/opt_if: also merge break statements with ones after the branch This optimizations turns loop { ... if (cond1) { if (cond2) { do_work_1(); break; } else { do_work_2(); } do_work_3(); break; } else { ... } } into: loop { ... if (cond1) { if (cond2) { do_work_1(); } else { do_work_2(); do_work_3(); } break; } else { ... } } As this optimizations moves code into the NIF statement, it re-iterates on the branch legs in case of success. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7587>	2022-01-13 02:30:32 +00:00
Daniel Schürmann	dad609d152	nir/opt_if: merge two break statements from both branch legs This optimization turns loop { ... if (cond) { do_work_1(); break; } else { do_work_2(); break; } } into: loop { ... if (cond) { do_work_1(); } else { do_work_2(); } break; } Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7587>	2022-01-13 02:30:32 +00:00
Caleb Callaway	64a51293c8	vulkan/overlay: support Vulkan 1.2 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5602 Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14465>	2022-01-13 02:02:14 +00:00
Chia-I Wu	20db89b7c7	virgl: disable texture uploads with copy transfers This disables `cdc480585c` ("virgl/drm: New optimization for uploading textures") effectively. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Lepton Wu <lepton@chromium.org> Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14497>	2022-01-13 01:26:00 +00:00
Dylan Baker	d3398a8e03	docs: move the release for 22.0 out Between a troubles with Marge and FD.O, and requests for a bit more time for a few patches to land, we're going to bump the release out by three weeks. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14502>	2022-01-13 01:21:16 +00:00
Emma Anholt	c638d6f3bf	ci: Add paraview traces to several drivers. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14501>	2022-01-13 00:22:54 +00:00
Emma Anholt	ec29a9391e	ci/llvmpipe: Add a trace for the game JVGS, which got regressed recently. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14501>	2022-01-13 00:22:54 +00:00
Emma Anholt	e46fa37cf4	ci/llvmpipe: Sort the list of traces. so I don't have to deliberate about where to put new ones. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14501>	2022-01-13 00:22:54 +00:00
Rhys Perry	9675cef53b	radv: set radv_split_fma=true for Proton SotTR The game seems to expect fma to be unfused. Fixes depth-prepass artifacts. I haven't tested the D3D12 version, but I think it doesn't work and needs sparse depth/stencil images. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14458>	2022-01-12 23:50:35 +00:00
Rhys Perry	cc802cab7c	radv: add RADV_DEBUG=splitfma This splits application-provided FMA in vertex/geometry/tesselation/mesh shaders. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14458>	2022-01-12 23:50:35 +00:00
Christian Gmeiner	b2ae4b2ac4	lima: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY Occlusion queries are supported always but only the number of supported samples differ. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14361>	2022-01-12 23:19:22 +00:00
Christian Gmeiner	2a90f2702c	i915: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY Occlusion queries are supported always but only the number of supported samples differ. This also removes I915_LIE debug option. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14361>	2022-01-12 23:19:22 +00:00
Christian Gmeiner	b497e454b3	vc4: remove not needed lie about PIPE_CAP_OCCLUSION_QUERY Occlusion queries are supported always but only the number of supported samples differ. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14361>	2022-01-12 23:19:22 +00:00
Christian Gmeiner	41179b665b	broadcom/ci: use .test-manual-mr Allow the jobs to be available for MRs. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14361>	2022-01-12 23:19:22 +00:00
Christian Gmeiner	0186e9e1c5	mesa: always support occlusion queries Excerpt from ARB_occlusion_query.txt: An implementation can either set QUERY_COUNTER_BITS_ARB to the value 0, or to some number greater than or equal to n. If an implementation returns 0 for QUERY_COUNTER_BITS_ARB, then the occlusion queries will always return that zero samples passed the occlusion test, and so an application should not use occlusion queries on that implementation. This looks more sane for drivers wanting desktop gl 1.5 without real hw support then just faking it. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14361>	2022-01-12 23:19:22 +00:00
Daniel Stone	56886459c5	Revert "ci: disable vs2019 windows build" This reverts commit `567a9550d7`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14513>	2022-01-12 21:45:11 +00:00
Thomas H.P. Andersen	315d6ee66f	freedreno: drop dead assignment width0 was introduced in `e11a239e8c` Its use was dropped in `979e7e3680` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14500>	2022-01-12 21:20:23 +00:00
Thomas H.P. Andersen	d71c6eebe2	freedreno: silence sometimes-uninitialized warning Clang does not see that this is unreachable and thus thinks that opc will be used uninitialized later. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14500>	2022-01-12 21:20:23 +00:00
Ruijing Dong	50d4e44fa4	radeon/vcn: enable dynamic dpb Tier2 for hevc dec vaapi path keep omx hevc decoding using the current mode, set dpb Tier2 for vaapi hevc decoding mode as default. Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14484>	2022-01-12 20:16:50 +00:00
Ruijing Dong	be28a475c7	radeon/vcn: enable dynamic dpb Tier2 support for h264 dec vaapi path By disabling h264 enxtension flag to let vaapi application manage the dpb buffers. The calculation of the non_exist_flags for h264 reference frames needs to consider both frame number and POC in the reference picture list, set this flag only if both of the frame number and POC are not existed in the valid reference lists; otherwise, that reference frame is considered valid. Also enabled drm buffer in dynamic dpb Tier2. Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14484>	2022-01-12 20:16:50 +00:00
Ruijing Dong	2efddb5db0	frontends/va: preparing to disable h264 extension flag in vaapi dec path In frame reference frame, the top/bottom field reference flag also needs to be set, so does the long term reference flag. Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14484>	2022-01-12 20:16:50 +00:00
Ruijing Dong	fb1d1d3b1f	frontends/omx: preserve omx to keep current mode for avc decoding Preparing to disable h264 extension flag in vaapi path, and this change will not affect omx. Signed-off-by: Ruijing Dong <ruijing.dong@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14484>	2022-01-12 20:16:50 +00:00
Mike Blumenkrantz	2af033fd54	zink: ci updates Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Mike Blumenkrantz	d3bb5b5dd1	zink: use even more accurate stride values for query result copies this shouldn't be used at all, but some drivers get it wrong and I don't want to have to fix every driver Fixes: `039ed2de94` ("zink: always use type size for query result copy stride") Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Mike Blumenkrantz	2fa1bf60d6	Revert "zink: when performing an implicit reset, sync qbos" this appeared to fix some sort of bug related to preserving qbo data, but really there shouldn't have been any sort of bug anyway since the qbos all get read back, and thus the data is already preserved instead, it just preserved the query id, which overloaded the pools and crashed This reverts commit `79790e276f`. fixes #5669 Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Mike Blumenkrantz	b7a4faea9b	zink: skip readback of qbos with no results this is a no-op and also crashes Fixes: `93190be1b9` ("zink: rewrite query internals") Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Mike Blumenkrantz	f8d2770737	zink: fix availability buffer sizing/copying for xfb queries xfb queries have 2 results, and the availability bit is a 3rd result, so the buffer size has to be at least that big and the copy offset has to reflect the number of xfb results in the src offset cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Mike Blumenkrantz	bf9ac4dfcd	zink: always set number of timestamp results to 1 for internal qbo timestamp queries don't accumulate results Fixes: `93190be1b9` ("zink: rewrite query internals") Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Mike Blumenkrantz	8b46d83637	zink: add a better threshold for clamping query pool resets on suspend these pools should be dumped even if they aren't used Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14397>	2022-01-12 18:39:56 +00:00
Daniel Schürmann	8a78706643	nir: refactor nir_opt_move This patch is a rewrite of nir_opt_move. Differently from the previous version, each instruction is checked if it can be moved downwards and then inserted before the first user of the definition. The advantage is that less insert operations are performed, the original order is kept if two movable instructions have the same first user, and instructions without user in the same block are moved towards the end. v2: Only return true if an instruction really changed the position. Don't care for discards, this will be handled by another MR. v3: fix self-referring phis and update according to nir_can_move_instr(). v4: use nir_can_move_instr() and nir_instr_ssa_def() v5: deduplicate some code Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3657>	2022-01-12 13:41:54 +00:00
Lionel Landwerlin	8ef9350ff0	intel/devinfo: drop num_eus_per_subslice field This field is an average computation that is not actually useful for any of our driver code. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14510>	2022-01-12 12:53:21 +00:00
Lionel Landwerlin	5d5a1b660b	intel/devinfo: add a helper to check for slice availability Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14510>	2022-01-12 12:53:21 +00:00
Lionel Landwerlin	1c5b206366	intel/devinfo: printout devinfo struct size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14510>	2022-01-12 12:53:21 +00:00
Lionel Landwerlin	574ba30fb4	intel/devinfo: printout pixel pipes in info printout Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14510>	2022-01-12 12:53:21 +00:00
Pierre-Eric Pelloux-Prayer	d8ba48e447	radeonsi/tests: add expected results for vega20 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14454>	2022-01-12 11:39:53 +00:00
Pierre-Eric Pelloux-Prayer	d299d81919	radeonsi/tests: update expected results Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14454>	2022-01-12 11:39:53 +00:00
Pierre-Eric Pelloux-Prayer	86262b6eac	radeonsi,radv: fix usages of surf_pitch For linear textures, pitch[level] should be used instead. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14454>	2022-01-12 11:39:53 +00:00
Pierre-Eric Pelloux-Prayer	2f8982df0e	radeonsi/gfx10: fix si_texture_get_offset for mipmapped tex Pitch can be different per-level so adjust stride and offset. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5792 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14454>	2022-01-12 11:39:53 +00:00
Samuel Pitoiset	9cd8908c03	radv: fix computing the fb size in presence of dynamic VRS attachment This fixes dEQP-VK.fragment_shading_rate.dynamic_rendering.attachment_rate.*. Fixes: `e914a6710f` ("radv: Expose the VK_KHR_dynamic_rendering extension.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14421>	2022-01-12 11:16:31 +00:00
Samuel Pitoiset	9c746157ae	radv: reset VRS if the current subpass doesn't have a VRS attachment With a scenario like: BeginRP(DS + VRS att) Draw() EndRP() BeginRP(same DS) Draw() EndRP() The second draw shouldn't use VRS but it did because the VRS bit is always set during DS surface initialization if a surface can use VRS. So, it would have been using the previous copied VRS rates. Found by inspection. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14443>	2022-01-12 10:22:29 +00:00
Samuel Pitoiset	ec4dcd53db	radv: stop checking if dynamic states changed This is costly for the CPU and might hurt "good" applications that already avoid that like DXVK/vkd3d-proton. Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5009 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12441>	2022-01-12 09:48:29 +00:00
Lionel Landwerlin	567a9550d7	ci: disable vs2019 windows build Failing with : error during connect: In the default daemon configuration on Windows, the docker client must be run with elevated privileges to connect.: Post http://%2F%2F.%2Fpipe%2Fdocker_engine/v1.24/auth: open //./pipe/docker_engine: The system cannot find the file specified. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14509>	2022-01-12 09:04:29 +00:00
Nanley Chery	912acbf963	anv,iris: Flush HDC before color fast clears Needed for XeHP (see Bspec 47704). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14024>	2022-01-12 01:30:34 +00:00
Nanley Chery	f3c629733f	anv,iris: PSS Stall Sync around color fast clears Needed for XeHP (see Bspec 47704). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14024>	2022-01-12 01:30:34 +00:00
Nanley Chery	8ec8298ce4	intel: Rename the PSD bit in PIPE_CONTROL for XeHP The name of the field now starts with PSS Stall instead of PSD. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14024>	2022-01-12 01:30:34 +00:00
Nanley Chery	de5f1cdd31	anv,iris: Depth stall around color fast clears Needed for TGL (see Bspec 47704). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14024>	2022-01-12 01:30:34 +00:00
Nanley Chery	34c8371e2a	anv,iris: Flush tile cache after color fast clears Needed for TGL (see Bspec 47704). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14024>	2022-01-12 01:30:34 +00:00
Bas Nieuwenhuizen	38b3661b8f	radv: 256 byte push constants. This helps vkd3d-proton, especially when indirecting more stuff. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14485>	2022-01-12 01:08:39 +00:00
Bas Nieuwenhuizen	43f8e07765	radv: Use 16-bits to store push constant indices. Otherwise things horrible go wrong when we get 256 bytes of push constants. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14485>	2022-01-12 01:08:39 +00:00
Bas Nieuwenhuizen	3a36d0b787	radv: Use MAX_PUSH_CONSTANTS_SIZE for saved push constants. So that it can never again get out of sync. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14485>	2022-01-12 01:08:39 +00:00
Mike Blumenkrantz	b6499dff37	zink: use device-local heap for sparse backing allocations backing allocations are real allocations, so they shouldn't be initialized as sparse containers Fixes: `40fdb3212c` ("zink: add a suballocator") Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14394>	2022-01-12 00:03:00 +00:00
Marcin Ślusarz	f286ecf906	nir: handle per-view clip/cull distances Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	70a9710eee	spirv: mark [Clip\|Cull]DistancePerViewNV variables as compact Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	0d6f83cbf1	nir: remove invalid assert affecting per-view variables per-view variables can have arbitrary (but > 0) number of array levels Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	ce5a8bff77	spirv: handle multiview bits of SPV_NV_mesh_shader Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	4fed440724	nir: add load_mesh_view_count and load_mesh_view_indices intrinsics Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	e4ff7fd76a	spirv: add MeshViewCountNV/MeshViewIndidcesNV builtins from SPV_NV_mesh_shader Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	561de760fd	compiler: add new MESH_VIEW_COUNT/MESH_VIEW_INDICES system values Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Marcin Ślusarz	4cb7dcb097	spirv: handle ViewportMaskNV builtin/cap from SPV_NV_mesh_shader Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Acked-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14263>	2022-01-11 22:45:23 +00:00
Nanley Chery	1619279219	intel/isl: Return false more in isl_surf_get_hiz_surf Follow the CCS and MCS functions by returning false for unsupported cases. This reduces the burden on the caller. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Nanley Chery	b77d694223	intel/isl: Allow HiZ with Tile4/64 surfaces Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Nanley Chery	50967402cc	intel/isl: Require Y-tiling for depth on gfx4-5 This enables isl_surf_get_hiz_surf to be simplified. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Nanley Chery	267689a269	intel/isl: Use a new HiZ format on XeHP+ The new HiZ compresses twice as many rows of the depth surface compared to TGL (Bspec 47009). Also, its tiling needs to be specified in 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Nanley Chery	7d0e847856	intel/isl: Update comment for the XeHP HiZ block An 8x4 HiZ block doesn't fit in with the new formulas for sizing HiZ on XeHP. Update a comment which assumed this block size on SKL+. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Nanley Chery	e0ddeec209	intel/isl: Rework HiZ image align calculations * Check the format's compression type instead of the format directly to prepare for a new HiZ format on XeHP. * Adjust the gfx12+ calculations so that XeHP will automatically be handled. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Nanley Chery	30615794e4	blorp: Drop multisampled code in blorp_can_hiz_clear_depth Anv allows non-8x4-aligned depth buffer clears, but it has multisampled HiZ disabled for BDW. iris allows multisampled HiZ on BDW, but disallows non-8x4-aligned depth buffer clears. Drop the unused optimization for non-8x4-aligned clears of multisampled surfaces on BDW and use this opportunity to use some PRM text in the code comment. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Acked-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14091>	2022-01-11 20:06:02 +00:00
Felix DeGrood	0a01d2c04f	anv: increase binding table pool size to 64KB Binding table pool runs out of capacity quickly on modern games, requiring new Surface Base Address instructions to be sent. That is costly due to flushes and stalls. Increasing BT pool capacity to 64KB improves performance several workloads. Fallout4 +4% Shadow of the Tomb Raider +4% Borderlands3 +3% Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14483>	2022-01-11 19:47:30 +00:00
Lionel Landwerlin	d6c0d16791	intel/dev: fixup chv workaround We're using the wrong helper to get the subslice total count. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `c24ba6cecb` ("intel/dev: Handle CHV CS thread weirdness in get_device_info_from_fd") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14492>	2022-01-11 19:02:57 +00:00
Jason Ekstrand	c8d364cb9d	turnip: Use vk_common_QueueSignalReleaseImageANDROID for DRM It's identical to the one turnip copy+pasted from RADV. For KGSL, we still need to hand-roll because of all the emulated stuff. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14411>	2022-01-11 17:25:22 +00:00
Jason Ekstrand	5b8b6315e4	turnip: Use vk_common_AcquireImageANDROID It's got some bug fixes that turnip never picked up. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Hyunjun Ko <zzoon@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14411>	2022-01-11 17:25:22 +00:00
Pavel Ondračka	66ea0f84c2	r300: use point sprite coordinates only when drawing points (v5) Fixes piglit arb_point_sprite-interactions Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/364 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/370 Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14389>	2022-01-11 16:56:18 +00:00
Mike Blumenkrantz	3e5f4cebe8	zink: add extra synchronization for buffer descriptor binds "most" times it isn't necessary to insert any pipeline barriers when binding descriptors, as GL requires explicit barrier usage which comes through a different codepath the exception here is when the following scenario occurs: * have buffer A * buffer_subdata is called on A * discard path is taken \|\| A is not host-visible * stream uploader is used for host write * CmdCopyBuffer is used to copy the data back to A buffer A now has a pending TRANSFER write that must complete before the buffer is used in a shader, so synchronization is required any time TRANSFER usage is detected in a bind there's also going to be more exceptions going forward as more internal usage is added, so just remove the whole fake-barrier mechanism since it'll become more problematic going forward Cc: 21.3 mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14496>	2022-01-11 16:18:56 +00:00
Jesse Natalie	5028630bd6	d3d12/ci: Skip flaky tex-miplevel-selection and timestamp tests Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14494>	2022-01-11 15:51:40 +00:00
Mike Blumenkrantz	d15ff96da2	zink: always unset vertex shader variant key data when changing last vertex stage ensure that vertex key data is always zeroed when changing last stage since it will be updated before draw anyway and can only cause problems if left alone here fixes the following caselist: dEQP-GLES31.functional.shaders.builtin_constants.tessellation_shader.max_tess_evaluation_texture_image_units dEQP-GLES31.functional.tessellation_geometry_interaction.feedback.tessellation_output_quads_geometry_output_points dEQP-GLES31.functional.ubo.random.all_per_block_buffers.25 cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14482>	2022-01-11 15:21:08 +00:00
Mike Blumenkrantz	a9545153a0	zink: add some wsi instance extensions not used for now Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14426>	2022-01-11 14:54:24 +00:00
Mike Blumenkrantz	394c7f3f62	zink: add missing assert for 8bit vertex decompose verify that this bit was set above Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14380>	2022-01-11 14:16:34 +00:00
Pierre-Eric Pelloux-Prayer	7650e6fc4c	radv: implement wsi's private transfer queue using SDMA Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13959>	2022-01-11 12:18:35 +00:00
Pierre-Eric Pelloux-Prayer	7bd5aa111c	vulkan/wsi: add a private transfer pool to exec the DRI_PRIME blit The idea is to offer the driver a way to execute on a different queue than the one the app is using for Present. For instance, this could be used to make the DRI_PRIME blit asynchronous, by using a transfer queue. So instead of creating a command buffer to be executed on present using the supplied queue, this commit uses an internal transfer queue to perform the blit. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13959>	2022-01-11 12:18:35 +00:00
Pierre-Eric Pelloux-Prayer	0ad7ec56c9	vulkan/wsi: add use_prime_blit param to wsi_swapchain_init Instead of initializing it to false and overriding it later if needed. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13959>	2022-01-11 12:18:35 +00:00
Pierre-Eric Pelloux-Prayer	d6ea60d5a2	radv: allocate the prime buffer as uncached This is a write only buffer so caches aren't needed. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13959>	2022-01-11 12:18:35 +00:00
Pierre-Eric Pelloux-Prayer	7b5ab48c40	radv: partial sdma support SDMA code adapted from https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12763 The only supported use case is image (linear or tiled) -> buffer and only GFX9+ is supported (for now). Since RADV_QUEUE_TRANSFER aren't exposed to applications, this cannot be used, except by the driver. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13959>	2022-01-11 12:18:35 +00:00
Pierre-Eric Pelloux-Prayer	148b2d0040	amd: add SDMA_NOP_PAD And use it in amdgpu_cs.c. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13959>	2022-01-11 12:18:35 +00:00
Daniel Schürmann	4e2b624c10	aco: validate VOP3P opsel correctly Before RA, subdword operands must use .xx After RA, opsel can either be .xx or .yy Cc: mesa-stable Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14472>	2022-01-11 11:41:12 +00:00
Tapani Pälli	a5bc8c4be9	mesa: free vbo_save_vertex_list store prims Fixes a leak: ==47470== 60 bytes in 1 blocks are definitely lost in loss record 1,790 of 1,904 ==47470== at 0x484186F: malloc (vg_replace_malloc.c:381) ==47470== by 0x58EBA6A: compile_vertex_list (vbo_save_api.c:535) ==47470== by 0x58EDABF: wrap_buffers (vbo_save_api.c:1021) ==47470== by 0x58EDF97: upgrade_vertex (vbo_save_api.c:1134) ==47470== by 0x58EE52F: fixup_vertex (vbo_save_api.c:1251) ==47470== by 0x58EFE9E: _save_Normal3f (vbo_attrib_tmp.h:315) Fixes: `69615d92a0` ("vbo/dlist: realloc prims array instead of free/malloc") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14474>	2022-01-11 06:12:15 +00:00
Tapani Pälli	6e9cd801a6	mesa: free idalloc storage for display lists Fixes a leak: ==46154== 48 bytes in 1 blocks are definitely lost in loss record 1,571 of 1,905 ==46154== at 0x48466AF: realloc (vg_replace_malloc.c:1437) ==46154== by 0x5FC98EC: util_idalloc_resize (u_idalloc.c:43) ==46154== by 0x5FC9C16: util_idalloc_alloc_range (u_idalloc.c:125) ==46154== by 0x56FDB9F: _mesa_EndList (dlist.c:13681) Fixes: `b703d7c15f` ("dlist: store all dlist in a continuous memory block") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14474>	2022-01-11 06:12:15 +00:00
Francisco Jerez	074bde9989	intel/xehp: Switch to coarser cross-slice pixel hashing with table permutation. The coarser 32x32 cross-slice hashing mode seems to lead to better L1 and L2 utilization due to the improved execution locality, however it can also lead to a bottleneck in a single slice, especially in workloads that concentrate heavy rendering in small areas of the screen (e.g. SynMark2 OglGeomPoint, OglTerrain*) -- This effect is mitigated here by performing a permutation of the pixel pipe hashing tables that ensures that adjacent rows map to pixel pipes as far away as possible in the caching hierarchy. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	ef675e6857	anv: Program pixel hashing tables on XeHP. Note that this has an effect even for unfused native die platforms, since the pixel pipe hashing tables we intend to program aren't equivalent to the hardware's defaults on such configs. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	d149c5e6e0	iris: Program pixel hashing tables on XeHP. Unlike the Gen11 code, this requires us to allocate a pipe_resource for the pixel pipe hashing tables and hold a reference to it from the context, since we need to add it to the validation list of every batch, the tables may be accessed by the hardware at any time after they're specified via 3DSTATE_SLICE_TABLE_STATE_POINTERS. Note that this has an effect even for unfused native die platforms, since the pixel pipe hashing tables we intend to program aren't equivalent to the hardware's defaults on such configs. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	283d5bff4e	intel: Rename intel_compute_pixel_hash_table() to intel_compute_pixel_hash_table_3way(). For consistency with intel_compute_pixel_hash_table_nway(). Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	170468b4fe	intel: Minimal calculation of pixel hash table for arbitrary number of pixel pipes. This starts off with the simplest possible pixel hashing table calculation that just assigns consecutive indices (modulo N) to adjacent entries of the table, along the lines of the existing intel_compute_pixel_hash_table(). The same function will be improved in a future commit with a more optimal calculation. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	68cb551b1d	intel: Move pixel hashing table computation into common header file. In order to avoid some duplication between the GL and Vulkan driver, which will get worse as we introduce additional code in order to handle more recent generations. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:35 -08:00
Francisco Jerez	3d3c571db3	iris: Merge gfx11_ and gfx12_upload_pixel_hashing_tables() into the same function. Will save some boilerplate as we introduce another variant of this function. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:28:12 -08:00
Francisco Jerez	ae5fa3f518	intel/genxml: Fix SLICE_HASH_TABLE struct on XeHP. It's now an array with 7 tables, each table is intended to specify the pixel pipe hashing behavior for every possible slice count between 2 and 8, however that doesn't actually work, among other reasons due to hardware bugs that will cause the GPU to erroneously access the table at the wrong index in some cases, so in practice all 7 tables need to be initialized to the same value. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:27:41 -08:00
Francisco Jerez	a748b264e8	intel/blorp/gfx12+: Drop unnecessary state cache invalidation from binding table setup. The state cache invalidation shouldn't be necessary on recent platforms. On ICL it seems to be required to get the hardware to pick up an updated indirect clear color, so this change is only applied to TGL platforms and later for the moment. On some DG2 configs this seems to improve SynMark2/OglDrvRes by 16.0% ±0.1%, n=8. Acked-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:27:41 -08:00
Francisco Jerez	c6455cfec9	intel/fs: Don't assume packed dispatch for fragment shaders on XeHP. The current packed dispatch assumptions for fragment shaders seem to be the reason that the fs-readFirstInvocation-uint-loop Piglit test-case for the ARB_shader_ballot extension fails on DG2 in combination with the patches in this series that enable pixel pipe hashing (thanks Jordan for reporting the regression). I've confirmed that the brw_fs_test_dispatch_packing() test fails on DG2 hardware for fragment shaders, while it succeeds for other shader stages, indicating that the PSD hardware no longer guarantees packed dispatch. Disable it. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:27:41 -08:00
Francisco Jerez	ffa2ca8a77	intel/xehp: Update 3DSTATE_PS maximum number of threads per PSD. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13569>	2022-01-10 18:27:41 -08:00
Jesse Natalie	c503187388	docs: Update d3d12 extension list and new_features.txt Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	cc8219d1b4	d3d12: Enable compute Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	f399378c52	d3d12: Run DXIL shared atomic lowering pass Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	9f67f432d7	d3d12: Handle indirect dispatch Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	9cc6b17c8a	d3d12: Implement num workgroups as a state var Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	65a16a568c	d3d12: Implement launch_grid Some more refactoring in d3d12_draw.cpp to re-use a bunch of state and descriptor management, and some refactoring of the dirty states. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	570a042a94	d3d12: Hook up compute shader variations Currently only variable workgroup size is implemented Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	5f23b1d7cd	d3d12: Support compute root signatures Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	6d38a35afb	d3d12: Compile, bind, and cache compute PSOs Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	e350d1ab09	d3d12: Stop trying to set D3D12_DIRTY_SHADER during bindings We don't key off of it to try to figure out if we need to produce a new shader variant, so there's no need to set it when changing properties that feed into variants. If we do have a new shader or variant at draw time, we'll produce a new PSO without this. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	944c72ae4d	d3d12: Remove draw_info from selection_context It's not needed, and having it there can be misleading since sometimes it's null Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	fbc1d90f19	d3d12: Keep state vars last in the per-stage root parameters Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	166cd05071	d3d12: Limit sampler view count to 32 Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	2837e67b9b	microsoft/compiler: Handle more GL memory barriers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Jesse Natalie	fd50ef046b	microsoft/compiler: Move workgroup_size lowering from clc It doesn't depend on the clc data being provided externally, so no need to tie it there, we can re-use it for GL and Vulkan compute. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14367>	2022-01-11 01:36:56 +00:00
Rob Clark	5e18aafd26	freedreno: Report system memory as video memory This seems to be the approach that other UMA drivers have settled on, when there aren't some other constraints. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5675 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14478>	2022-01-11 01:15:31 +00:00
Emma Anholt	3563ae4b2d	nir_to_tgsi: Fix a bug in TXP detection after backend lowering. TGSI reserves 2 components for the coord in the first operand vector, even for 1D. Fixes r600 failure with shadow1d. Fixes: `390a3fcdc4` ("nir_to_tgsi: Add support for TXP.") Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14322>	2022-01-11 00:53:39 +00:00
Francisco Jerez	8e21cad39b	intel/xehp: Implement XeHP workaround Wa_14014148106. Actually, no, there's no need to do anything, just update some comments for the record. An earlier revision of this change that implemented the workaround text to the letter required no less than 8 new PIPE_CONTROLs throughout the tree. However Felix Degrood noticed that the cost of some of the PIPE_CONTROLs was showing up in workloads like Shadow of the Tomb Raider. The Windows driver wasn't emitting many of those pipe controls, contrary to the W/A instructions, so we engaged in a back and forth with the hardware team, who concluded that the original suggested workaround was unnecessarily strict, and the Windows driver's behavior acceptable. It turns out that Wa_1408224581 we had already implemented for TGL is roughly equivalent to the Windows behavior, so no need to do anything new after all. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14278>	2022-01-11 00:17:32 +00:00
Francisco Jerez	eeb3f4594d	intel/xehp: Implement XeHP workaround Wa_14013910100. XeHP platforms require the invalidation of the instruction cache after a STATE_BASE_ADDRESS change due to a hardware bug potentially leading to instruction cache pollution. Note that the workaround text says it's applicable "DG2 128/256/512-A/B", however it's also marked as permanent and not confirmed to be fixed in any specific steping, so we apply it to all Gfx12HP platforms. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14278>	2022-01-11 00:17:32 +00:00
Alyssa Rosenzweig	b550b3c89c	vc4: Use u_box_pixels_to_blocks helper Eliminates a ETC1 special case. In fact this unit conversion applies to all formats; the original code path works since ETC1 is the only format with blocks bigger than 1x1 supported by vc4 (I assume). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14370>	2022-01-10 23:16:56 +00:00
Alyssa Rosenzweig	6f07159a1d	v3d: Use u_box_pixels_to_blocks helper Rather than open-coding. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14370>	2022-01-10 23:16:56 +00:00
Alyssa Rosenzweig	b920ace4bc	lima,panfrost: Correct pixel vs block mismatches Different parts of our codebase disagree on whether spatial coordinates/dimensions are given in pixels or blocks, which differ by a constant factor for block-compressed formats. This disagreement manifests as incorrect results accessing block-compressed formats. To resolve this, define the public tiling routines to take their coordinates in pixels, and align the relevant code in Panfrost accordingly. Fixes rendering glitches in Factorio, as well as a pile of piglits on Panfrost. It should also fix glTexSubImage() with ETC1 on Lima, but there are no tests for this in dEQP/Piglit. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Tested-by: Vasily Khoruzhick <anarsoul@gmail.com> [dEQP/Lima] Tested-by: Erico Nunes <nunes.erico@gmail.com> [Piglit/Lima] Reported-by: Icecream95 <ixn@disroot.org> Closes: #5560 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14370>	2022-01-10 23:16:56 +00:00
Alyssa Rosenzweig	26c533f167	gallium/util: Add pixel->blocks box helper There is a lot of unit confusion in Gallium due to pixels versus blocks matching only with uncompressed textures. Add a helper to do a common pixels->blocks unit conversion required in multiple drivers. v2: Rename dst->blocks, src->pixels to avoid confusion about the units to casual readers (Mike). Note to mesa-stable maintainers: this is marked as Cc: mesa-stable so the next patch (a set of bug fixes for Lima and Panfrost) can be backported. It's not a bug fix in its own right, of course. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [v1] Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14370>	2022-01-10 23:16:56 +00:00
Thomas H.P. Andersen	7daba1fe65	replace 0 with NULL for NULL pointers This updates many places where 0 is used as NULL pointer. There are a few warnings left when I build the default configuration but they either relate to code outside of mesa or where "None" is used instead. Found with static analysis (smatch) Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12174>	2022-01-10 22:53:32 +00:00
Rhys Perry	60c711833f	aco: remove pack_half_2x16(a, 0) optimization This makes the compiler less predictable and should only have a very small effect on performance. fossil-db (Vega): Totals from 2410 (1.79% of 134756) affected shaders: CodeSize: 6911568 -> 6942840 (+0.45%) Fixes Horizon Zero Dawn artifacts. If a shader has: a = pack_half_2x16(a, 0) //rtne store(pack_half_2x16(0, b) \| a) //rtne a = unpack_2x16(a).x It will become: store(pack_half_2x16(a, b)) //rtz a = unpack_2x16(pack_half_2x16(a, 0)).x //rtne So a later shader with "unpack_2x16(load()).x" will use "a" rounded to zero, while the previous shader will use "a" rounded to the nearest even. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `2f125908b3` ("radv,aco: lower_pack_half_2x16") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14475>	2022-01-10 22:19:29 +00:00
Christian Gmeiner	6e08d8fc3d	ci: Uprev piglit to af1785f31 Brings in these changes: af1785f31 occlusion_query_conform: skip GetQueryCounterBits test if needed dad078717 occlusion_query_conform: convert to pilgit subtests b52c1c761 glsl-1.30: test nested preprocessor concat 6c4da153b texture-storage: Fix subtest result handling of skips. 4343f19db fbo-integer: Remove the invalid DrawPixels test. e3842f2fe arb_dsa: exclude stencil8 textures from test sets. ce8649be7 spec/ext_external_objects: Fix build on Debian systems 4e553838f glsl: add basic tests for desktop GLSL invariant qualifier linking 7e61e5199 Tests for variable in and out of loop scope f855ad1c8 fbo-mrt-alphatest: Only require GLSL 1.20 9be2fe999 glx: add glx-multi-display-single-pbuffer test bfe290725 glx: add glx-swap-pbuffer test efa64335e framework: Fix build on Windows when using waffle Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14468>	2022-01-10 21:52:42 +00:00
Jordan Justen	0fc93928f1	isl: Don't enable HDC:L1 caches on DG2 The MOCS entry used for this on Tigerlake doesn't exist on DG2. Ref: `aca31baafc` ("isl: Enable Tigerlake HDC:L1 caches via MOCS in various cases.") Suggested-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14467>	2022-01-10 21:20:03 +00:00
Rhys Perry	67fc7a1763	nir/uniform_atomics: fix is_atomic_already_optimized without workgroups dims_needed would have been zero, so this would always returned true for non-compute stages. Also fix this for variable workgroup sizes. Improves Shadow of the Tomb Raider RX 6800 performance by 10.6%, 11.5% and 4.5% (day_of_dead, jungle and paititi scenes). radv_perf before and after: {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'day_of_dead', 'avg_fps': '62.913333333333334', 'min_fps': '62.81', 'max_fps': '62.98', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'jungle', 'avg_fps': '64.02666666666666', 'min_fps': '63.93', 'max_fps': '64.11', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'paititi', 'avg_fps': '74.81666666666666', 'min_fps': '74.72', 'max_fps': '74.88', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'day_of_dead', 'avg_fps': '69.57', 'min_fps': '69.52', 'max_fps': '69.63', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'jungle', 'avg_fps': '71.41000000000001', 'min_fps': '71.31', 'max_fps': '71.5', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'paititi', 'avg_fps': '78.16666666666667', 'min_fps': '78.07', 'max_fps': '78.23', 'interations': '3'} Performance now seems slightly better than AMDVLK 2021.Q4.3: {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'day_of_dead', 'avg_fps': '68.02666666666666', 'min_fps': '67.95', 'max_fps': '68.16', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'jungle', 'avg_fps': '70.24666666666667', 'min_fps': '69.83', 'max_fps': '70.51', 'interations': '3'} {'app': 'SotTR', 'resolution': '3840x2160', 'preset': 'VeryHigh', 'antialiasing': 'off', 'scene': 'paititi', 'avg_fps': '77.19', 'min_fps': '77.18', 'max_fps': '77.2', 'interations': '3'} fossil-db (Sienna Cichlid): Totals from 40 (0.03% of 134621) affected shaders: CodeSize: 62676 -> 65996 (+5.30%) Instrs: 11372 -> 12111 (+6.50%) Latency: 144122 -> 142848 (-0.88%); split: -1.09%, +0.21% InvThroughput: 19686 -> 19847 (+0.82%); split: -0.06%, +0.87% VClause: 304 -> 306 (+0.66%) SClause: 603 -> 604 (+0.17%); split: -0.83%, +1.00% Copies: 780 -> 858 (+10.00%) Branches: 235 -> 329 (+40.00%) PreSGPRs: 1072 -> 1083 (+1.03%); split: -0.37%, +1.40% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14407>	2022-01-10 19:57:38 +00:00
Konstantin Seurer	aaa90c37e0	panvk: Fixed maxFragmentCombinedOutputResources Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14320>	2022-01-10 19:28:17 +00:00
Konstantin Seurer	651bec0971	turnip: Fixed maxFragmentCombinedOutputResources Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14320>	2022-01-10 19:28:17 +00:00
Konstantin Seurer	e0d590cafb	anv: Fixed maxFragmentCombinedOutputResources Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14320>	2022-01-10 19:28:17 +00:00
Konstantin Seurer	2b5cf84efd	lavapipe: Fixed maxFragmentCombinedOutputResources Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14320>	2022-01-10 19:28:17 +00:00
Rhys Perry	0f5d90c2a7	ac/nir: fix store_buffer_amd write_masks Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14447>	2022-01-10 19:01:04 +00:00
Rhys Perry	b00138090e	nir/lower_shader_calls: fix store_scratch write_mask Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14447>	2022-01-10 19:01:04 +00:00
Lucas Stach	d799a4be27	etnaviv: drm: defer destruction of softpin BOs When destroying a BO with a userspace managed address and thus freeing the VMA space, we need to make sure that the BO isn't in use by any active submit anymore, as the kernel will rightfully reject the next submit that re-uses the still active VMA. Keep the BO alive as long as it isn't fully idle to prevent the VMA being reused prematurely. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Tested-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14159>	2022-01-10 16:49:00 +00:00
Lucas Stach	98a2049c08	etnaviv: drm: rename _etna_bo_del Rename it to a somwhat more descriptive name, which makes it easier to distinguish between the etna_bo_del function in the public interface and the internal function. Also remove the duplicated forward declaration and move it to the common interal header. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Tested-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14159>	2022-01-10 16:49:00 +00:00
Lucas Stach	77ebbcbf9a	etnaviv: drm: export BO idle check function The ability to check if a BO is idle is not only useful in the buffer cache, but also in other parts of the winsys and even the pipe driver. Make this functionality available in the interface. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Tested-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14159>	2022-01-10 16:49:00 +00:00
Lucas Stach	1b1f8592c0	etnaviv: drm: properly handle reviving BOs via a lookup If a BO is removed from a cache bucket list via a lookup, we must handle it in the same way as if a allocation from the cache happened: tell valgrind that the buffer is active again and take a reference to the etna_device, which the BO had given up while being in the cache. Cc: mesa-stable Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Tested-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14159>	2022-01-10 16:49:00 +00:00
Lucas Stach	ccfd5054a4	etnaviv: drm: fix size limit in etna_cmd_stream_realloc The intended limit for command stream size is 64KB, as this is what old kernels can reliably do and what allows for maximum number of queued streams on newer kernels. However, due to unit confusion with the size member, which is in dwords, the submitted streams could grow up to ~128KB. Fix this by using the proper limit in dwords. Flushing due to some limits being exceeded is not an issue, but is expected with certain workloads, so lower the severity of the message being emitted in this case to debug level. Cc: mesa-stable Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14425>	2022-01-10 15:38:27 +00:00
Lucas Stach	22d796feb8	egl/wayland: break double/tripple buffering feedback loops Currently we dispose any unneeded color buffers immediately if we detect that there are more unlocked buffers than we need. This can lead to feedback loops between the compositor and the application causing rapid toggling between double and tripple buffering. Scenario: 2 buffers already queued to the compositor, egl/wayland allocates a new back buffer to avoid throttling, slowing down the frame. This allows the compositor to catch up and unlock both buffers. EGL detects that there are more buffers than currently needed, freeing the buffer, restarting the loop shortly after. To avoid wasting CPU time on rapidly freeing and reallocating color buffers break those feedback loops by letting the unneeded buffers sit around for a short while before disposing them. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14451>	2022-01-10 15:11:44 +00:00
Danylo Piliaiev	d77bfc117c	tu,ir3: Implement VK_KHR_shader_integer_dot_product - gen4 - has dp4acc and dp2acc, dp4acc is used to implement 4x8 dot product. - gen3 - has dp2acc, in OpenCL blob uses dp2acc for dot product on both get3 and gen4. - gen2 - unknown, lower everything. - gen1 - no dp2acc, lower everything. OpenCL blob doesn't advertise cl_qcom_dot_product8 but still generates code for it. The assembly is more verbose and uses yet to be documented mad32.u16 instruction. Passes: dEQP-VK.spirv_assembly.instruction.compute.opsdotkhr.* dEQP-VK.spirv_assembly.instruction.compute.opudotkhr.* dEQP-VK.spirv_assembly.instruction.compute.opsudotkhr.* dEQP-VK.spirv_assembly.instruction.compute.opsdotaccsatkhr.* dEQP-VK.spirv_assembly.instruction.compute.opudotaccsatkhr.* dEQP-VK.spirv_assembly.instruction.compute.opsudotaccsatkhr.* Only packed 4x8 unsigned and mixed versions are accelerated. However in theory we should be able to do better for signed version than current NIR lowering. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:21:24 +02:00
Danylo Piliaiev	e1f89a1da2	ir3: Make nir compiler options a part of ir3_compiler This would allow for sub-gens to have different options. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Danylo Piliaiev	b8d486f298	nir/algebraic: Separate has_dot_4x8 into has_sdot_4x8 and has_udot_4x8 Adreno GPUs has native instruction for unsigned and mixed dot_4x8 but not signed dot product. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Danylo Piliaiev	c1d5c318bc	ir3: New cat3 instructions * shrm - (src2 >> src1) & src3 * shlm - (src2 << src1) & src3 * shrg - (src2 >> src1) \| src3 * shlg - (src2 << src1) \| src3 * andg - (src2 & src1) \| src3 * dp2acc - dot product of two {i,u}8vec2 packed into SRC1 and SRC2, added to 32b SRC3 * dp4acc - dot product of two {i,u}8vec4 packed into SRC1 and SRC2, added to 32b SRC3 * wmm - vec4(x_1, x_2, x_3, x_4) * (y_1 + y_2 + y_3 + y_4), which is duplicated (1 << (SRC3 / 32)) times starting from DST register * wmm.accu - same as wmm but result is added to DST registers, however the first reg in each vec4 result is overwritten instead of accumulating. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13986>	2022-01-10 13:20:39 +02:00
Connor Abbott	c45c6e36eb	tu: Implement VK_EXT_subgroup_size_control Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Connor Abbott	1a1e25dcce	tu, ir3: Support runtime gl_SubgroupSize in FS We already supported it in the CS for computing the subgroup ID, but soon we'll need it in the FS too. Vertex stages will always have it lowered. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Connor Abbott	e6e34883a9	ir3: Add wavesize control This allows the wavesize to be controlled per-shader. This will be used by VK_EXT_subgroup_size_control, and freedreno will also need it if legacy ARB_shader_ballot is to be supported (since it forces a wavesize of 64 or less). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Connor Abbott	30237b3d9c	ir3: Pass shader to ir3_nir_post_finalize() We'll need to add shader-specific lowering for gl_SubgroupSize. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Connor Abbott	9ebc48005c	ir3, freedreno: Add options struct for ir3_shader_from_nir() We'll expand this in a moment. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13960>	2022-01-10 10:58:28 +00:00
Danylo Piliaiev	fe9c9ec83f	tu: fix workaround for depth bounds test without depth test Fixes: `bb4db22ff4` ("turnip: apply workaround for depth bounds test without depth test") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14390>	2022-01-10 09:36:59 +00:00
Lionel Landwerlin	07bc6b7ed9	anv: limit compiler valid color outputs using NIR variables This fixes a test from the vkd3d-proton test_dual_source_blending_dxbc test which asserts in the backend with : brw_fs_visitor.cpp:716: void fs_visitor::emit_fb_writes(): Assertion `!prog_data->dual_src_blend \|\| key->nr_color_regions == 1' failed. This is because there is 2 color attachments provided by the renderpass so we initially set nr_color_regions = 2. But once we've parsed the shader, we can see it's only using one output (with dual source color blending). This change looks at the output variables to update the valid output variables. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14417>	2022-01-10 09:38:32 +02:00
Tapani Pälli	b8f0459d6f	iris: unref syncobjs and free r/w dependencies array for slab entries Fixes memory leak with dependencies array: ==5224== 104 (96 direct, 8 indirect) bytes in 3 blocks are definitely lost in loss record 1,954 of 2,035 ==5224== at 0x484178A: malloc (vg_replace_malloc.c:380) ==5224== by 0x484670B: realloc (vg_replace_malloc.c:1437) ==5224== by 0x14DBAB9B: update_bo_syncobjs (iris_batch.c:819) ==5224== by 0x14DBADB8: update_batch_syncobjs (iris_batch.c:898) ==5224== by 0x14DBB3D5: _iris_batch_flush (iris_batch.c:1031) ==5224== by 0x14DB77D0: iris_transfer_map (iris_resource.c:2348) ==5224== by 0x157786FD: u_transfer_helper_transfer_map (u_transfer_helper.c:243) ==5224== by 0x14C479E7: tc_buffer_map (u_threaded_context.c:2252) ==5224== by 0x1434F3F8: pipe_buffer_map_range (u_inlines.h:393) ==5224== by 0x1435094A: _mesa_bufferobj_map_range (bufferobj.c:491) ==5224== by 0x143586D9: map_buffer_range (bufferobj.c:3737) ==5224== by 0x14358DA3: _mesa_MapBuffer (bufferobj.c:3947) ==5224== 240 (192 direct, 48 indirect) bytes in 6 blocks are definitely lost in loss record 1,984 of 2,035 ==5224== at 0x484178A: malloc (vg_replace_malloc.c:380) ==5224== by 0x484670B: realloc (vg_replace_malloc.c:1437) ==5224== by 0x14DBAB9B: update_bo_syncobjs (iris_batch.c:819) ==5224== by 0x14DBADB8: update_batch_syncobjs (iris_batch.c:898) ==5224== by 0x14DBB3D5: _iris_batch_flush (iris_batch.c:1031) ==5224== by 0x14FF72CC: iris_get_query_result (iris_query.c:631) ==5224== by 0x14C4396A: tc_get_query_result (u_threaded_context.c:880) ==5224== by 0x1458F4F7: get_query_result (st_cb_queryobj.c:273) ==5224== by 0x1458F7EB: st_WaitQuery (st_cb_queryobj.c:352) ==5224== by 0x144EFF66: get_query_object (queryobj.c:742) ==5224== by 0x144F01AE: _mesa_GetQueryObjectuiv (queryobj.c:811) And leak with syncobjs: ==13644== 8 bytes in 1 blocks are definitely lost in loss record 1 of 1,846 ==13644== at 0x484186F: malloc (vg_replace_malloc.c:381) ==13644== by 0x639789B: iris_create_syncobj (iris_fence.c:69) ==13644== by 0x63B213A: iris_batch_reset (iris_batch.c:512) ==13644== by 0x63B3637: _iris_batch_flush (iris_batch.c:1056) ==13644== by 0x65EF2BC: iris_get_query_result (iris_query.c:631) ==13644== by 0x623B970: tc_get_query_result (u_threaded_context.c:880) ==13644== by 0x5B874F7: get_query_result (st_cb_queryobj.c:273) ==13644== by 0x5B877EB: st_WaitQuery (st_cb_queryobj.c:352) ==13644== by 0x5AE7F66: get_query_object (queryobj.c:742) ==13644== by 0x5AE8150: _mesa_GetQueryObjectiv (queryobj.c:801) Fixes: `ce2e2296ab` ("iris: Suballocate BO using the Gallium pb_slab mechanism") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14387>	2022-01-09 13:43:45 +00:00
Christian Gmeiner	9cb91010ab	iris/ci: update piglit fails Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14442>	2022-01-07 23:12:37 +00:00
Christian Gmeiner	4d624f189e	i915g/ci: update piglit fails Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14439>	2022-01-07 23:00:16 +00:00
Emma Anholt	a2dbdc645f	ci: Shrink container/rootfs sizes. Cutting the extra VK mustpass files is 315MB out of 1.5GB of the amd64 rootfs. pip was 10MB. The rustup toolchains were massive (over a GB IIRC) on the x86 container images. Hopefully helps with #5837 Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14460>	2022-01-07 22:32:31 +00:00
Yiwei Zhang	48712b8cc5	venus: subtract appended header size in vn_CreatePipelineCache Use header->header_size to offset cache data as well in case the header struct extends on a newer driver but the cache data was appended with an old header. Fixes: `723f0bf74a` ("venus: initial support for module and pipelines") Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14463>	2022-01-07 22:10:53 +00:00
Danylo Piliaiev	3792fbfcf6	ir3: Assert that we cannot have enough concurrent waves for CS with barrier If we have a compute shader that has a big workgroup, a barrier, and a branchstack which limits max_waves - this may result in a situation when we cannot run concurrently all waves of the workgroup, which would lead to a hang. Blob just explodes in such case. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14110>	2022-01-07 18:40:15 +00:00
Danylo Piliaiev	9ed4d49c97	ir3: Be able to reduce register limit for RA when CS has barriers If barriers are used, it must be possible for all waves in the workgroup to execute concurrently. Thus we may have to reduce the registers limit. Fixes a hang in "Digital Combat Simulator". Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14110>	2022-01-07 18:40:15 +00:00
Hoe Hao Cheng	9323d2ea6d	zink/codegen: remove bogus print statement Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14434>	2022-01-07 17:51:45 +00:00
Hoe Hao Cheng	37f01832eb	zink/codegen: remove core_since in constructor the variable is now automatically filled in according to registry values Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14434>	2022-01-07 17:51:45 +00:00
Hoe Hao Cheng	029e871239	zink/codegen: support platform tags Some extensions are locked behind certain platforms, don't include them if the extension is unsupported. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14434>	2022-01-07 17:51:45 +00:00
Lionel Landwerlin	1d40d53e03	anv: don't leave anv_batch fields undefined Because the extend_cb vfunc is not initialized, there is a risk that the emission code calls into a random pointer. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Rohan Garg <rohan.garg@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14418>	2022-01-07 17:28:11 +00:00
Gert Wollny	8685a505e7	ntt: Set the output invariant flag according to the semantics This is used by virglrenderer to create the correct shaders on the host. Fixes: dEQP-GLES31.functional.primitive_bounding_box.triangles.tessellation_set_per_primitive.vertex_tessellation_fragment.fbo when using ntt with virgl. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14423>	2022-01-07 16:35:43 +00:00
Gert Wollny	6f348d9c99	nir_lower_io: propagate the "invariant" flag to outputs Ultimately this is consumed by nir-to-tgsi and needed by virglrenderer to correctly declare output variables. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14423>	2022-01-07 16:35:43 +00:00
Gert Wollny	5bfe292708	util/primconvert: map only index buffer part that is needed By putting vertex store and indices all in one buffer the larger part of the shared buffer might actually only be vertex data we are not interested in. Hence only map the part of the buffer that contains the index data for the currently active draw command. This helps drivers where a mapping operation is expensive, like e.g. virgl. v2: - add comment about ranged buffer mapping (Pierre-Eric) - keep passing direct_draws[i].start to direct_draw_func, it looks like the "start" parameter is properly set in util_prim_restart_convert_to_direct v3: Fix ws error (Mike) Related: #5825 Fixes: `f9d12bf50e` vbo/dlist: use a single buffer object Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14423>	2022-01-07 16:35:43 +00:00
Christian Gmeiner	86b19db459	etnaviv/ci: update piglit fails Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14441>	2022-01-07 16:24:12 +00:00
Rhys Perry	1756930a79	radv: increase maxTaskOutputCount to 65535 This is the minimum required by the spec. Fixes dEQP-VK.api.info.vulkan1p2_limits_validation.nv_mesh_shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14446>	2022-01-07 15:04:11 +00:00
Connor Abbott	cb45120556	ir3: Use (ss) for instructions writing shared regs The blob uses both nops and (ss). It turns out that in some rare cases the hardware does take more than 6 cycles, at least for movmsk, but adding nops is unnecessary. I believe the extra nops are only there due to the immaturity of the blob's implementation of subgroup ops, so we don't have to copy them - just handle shared reg producers the same as SFU instructions. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	d45678cac4	ir3/postsched: Rename tex/sfu to sy/ss Analogous to the previous commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	e6b35d606d	ir3/sched: Rename tex/sfu to sy/ss This now covers e.g. cat6 instructions as well, and ss will cover instructions writing shared regs as well. This is split out from the previous change to avoid too much churn and shouldn't cause any functional changes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	0cc4aca345	ir3: Use new (sy)/(ss) stall helpers in the compiler This fixes a few bad assumptions in the pre-RA and post-RA scheduler, for example that (sy) is only for texture instructions and (ss) is only for SFU instructions and (sy) and (ss) producers will always take the same number of cycles. This means we now start doing latency hiding for cat6 instructions like ldib and ldc. It also should make us hide latency more aggressively, since the number used for (sy) stall cycles was way lower than the real numbers for everything except ldc. Finally it unifies the various places (ss) soft nops were calculated. selected shader-db results: total nops in shared programs: 345278 -> 358959 (3.96%) nops in affected programs: 215622 -> 229303 (6.34%) helped: 690 HURT: 2430 helped stats (abs) min: 1 max: 125 x̄: 11.40 x̃: 5 helped stats (rel) min: 0.53% max: 100.00% x̄: 24.19% x̃: 18.52% HURT stats (abs) min: 1 max: 501 x̄: 8.87 x̃: 5 HURT stats (rel) min: 0.00% max: 9900.00% x̄: 52.36% x̃: 14.29% 95% mean confidence interval for nops value: 3.78 4.99 95% mean confidence interval for nops %-change: 28.21% 42.66% Nops are HURT. total mov in shared programs: 75049 -> 74110 (-1.25%) mov in affected programs: 15754 -> 14815 (-5.96%) helped: 566 HURT: 455 helped stats (abs) min: 1 max: 36 x̄: 4.52 x̃: 3 helped stats (rel) min: 0.83% max: 100.00% x̄: 35.85% x̃: 30.00% HURT stats (abs) min: 1 max: 35 x̄: 3.55 x̃: 3 HURT stats (rel) min: 0.00% max: 1100.00% x̄: 63.60% x̃: 25.00% 95% mean confidence interval for mov value: -1.25 -0.58 95% mean confidence interval for mov %-change: 2.92% 14.02% Inconclusive result (value mean confidence interval and %-change mean confidence interval disagree). total last-baryf in shared programs: 80468 -> 67670 (-15.90%) last-baryf in affected programs: 63676 -> 50878 (-20.10%) helped: 309 HURT: 147 helped stats (abs) min: 1 max: 260 x̄: 49.20 x̃: 24 helped stats (rel) min: 0.60% max: 98.81% x̄: 37.92% x̃: 40.91% HURT stats (abs) min: 1 max: 115 x̄: 16.35 x̃: 12 HURT stats (rel) min: 0.96% max: 1933.33% x̄: 45.55% x̃: 7.89% 95% mean confidence interval for last-baryf value: -33.03 -23.10 95% mean confidence interval for last-baryf %-change: -21.52% -0.50% Last-baryf are helped. total sstall in shared programs: 133997 -> 126398 (-5.67%) sstall in affected programs: 86866 -> 79267 (-8.75%) helped: 1893 HURT: 598 helped stats (abs) min: 1 max: 77 x̄: 6.06 x̃: 4 helped stats (rel) min: 0.71% max: 100.00% x̄: 32.82% x̃: 16.67% HURT stats (abs) min: 1 max: 65 x̄: 6.47 x̃: 6 HURT stats (rel) min: 0.00% max: 900.00% x̄: 65.51% x̃: 25.00% 95% mean confidence interval for sstall value: -3.39 -2.71 95% mean confidence interval for sstall %-change: -12.19% -6.24% Sstall are helped. total systall in shared programs: 350304 -> 288234 (-17.72%) systall in affected programs: 234855 -> 172785 (-26.43%) helped: 1456 HURT: 260 helped stats (abs) min: 1 max: 574 x̄: 46.42 x̃: 27 helped stats (rel) min: 0.19% max: 100.00% x̄: 39.43% x̃: 36.06% HURT stats (abs) min: 1 max: 757 x̄: 21.20 x̃: 8 HURT stats (rel) min: 0.00% max: 180.95% x̄: 24.82% x̃: 12.50% 95% mean confidence interval for systall value: -39.31 -33.03 95% mean confidence interval for systall %-change: -31.49% -27.90% Systall are helped. total waves in shared programs: 236732 -> 235142 (-0.67%) waves in affected programs: 6142 -> 4552 (-25.89%) helped: 535 HURT: 17 helped stats (abs) min: 2 max: 8 x̄: 3.08 x̃: 2 helped stats (rel) min: 12.50% max: 75.00% x̄: 28.78% x̃: 25.00% HURT stats (abs) min: 2 max: 6 x̄: 3.53 x̃: 4 HURT stats (rel) min: 16.67% max: 75.00% x̄: 37.35% x̃: 33.33% 95% mean confidence interval for waves value: -3.04 -2.72 95% mean confidence interval for waves %-change: -28.10% -25.39% Waves are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	7e60978d30	ir3: Introduce systall metric and new helper functions Add new centralized functions which will replace the various places we hardcode 10 for the number of (ss) nops, add numbers for soft (sy) nops based on similar computerator experiments with ldc, sam, and ldib (the most common (sy) producers), and add a "systall" metric which is analogous to sstall. This also fixes some cases where we'd erroniously count ldl* as (sy) producers instead of (ss) producers when calculating sstall. This only switches over the metric reporting to the new functions, so there is no behavior change. The following commit will switch over the rest of the compiler. While we're at it, remove max_sun as it's never set. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	603791bdeb	ir3: Bump type mismatch penalty to 3 After some experimentation with computerator, it seems on a618 that writing a full register and then reading half of it as a half register requires a delay of 6, the same as the delay for cat5/cat6 sources. The other direction only has a delay of 5, but just bump it unconditionally out of an abundance of caution. Fixes: `890de1a436` ("ir3/delay: Fix full->half and half->full delay") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Connor Abbott	d371d807eb	ir3/ra: Fix logic bug in compress_regs_left If we're allocating a source then we force is_killed to false, not to true. Fixes a regression in dEQP-GLES31.functional.synchronization.in_invocation.image_atomic_write_read later. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14246>	2022-01-07 14:26:08 +00:00
Tomeu Vizoso	c9adcb6051	anv/tests: Free BO cache and device mutex Was getting ASAN errors in CI when trying to add ANV to the debian-testing job: ==10993==ERROR: LeakSanitizer: detected memory leaks Direct leak of 4194304 byte(s) in 64 object(s) allocated from: #0 0x7f763c1bda3c in __interceptor_posix_memalign ../../../../src/libsanitizer/asan/asan_malloc_linux.cpp:226 #1 0x55f43d28627f in os_malloc_aligned ../src/util/os_memory_aligned.h:58 #2 0x55f43d28627f in _util_sparse_array_node_alloc ../src/util/sparse_array.c:107 #3 0x55f43d28627f in util_sparse_array_get ../src/util/sparse_array.c:143 #4 0x55f43d1fdaba in anv_device_lookup_bo ../src/intel/vulkan/anv_private.h:1335 #5 0x55f43d1fdaba in anv_device_import_bo_from_host_ptr ../src/intel/vulkan/anv_allocator.c:1843 #6 0x55f43d1ff571 in anv_block_pool_expand_range ../src/intel/vulkan/anv_allocator.c:534 #7 0x55f43d1ffcb5 in anv_block_pool_init ../src/intel/vulkan/anv_allocator.c:417 #8 0x55f43d18f082 in run_test ../src/intel/vulkan/tests/block_pool_no_free.c:123 #9 0x55f43d1862b6 in main ../src/intel/vulkan/tests/block_pool_no_free.c:152 #10 0x7f763b942d09 in __libc_start_main ../csu/libc-start.c:308 Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14121>	2022-01-07 13:33:32 +00:00
Tomeu Vizoso	8a7659a7a2	anv/ci: Test with deqp-vk on Tiger Lake Run half of the CTS in 10 Volteer Chromebook devices. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14121>	2022-01-07 13:33:32 +00:00
Jesse Natalie	ef27036b4c	shader_info: tess.spacing needs to be unsigned Otherwise MSVC will treat the bit-packed enum values as signed. Reviewed-by: Marek Olák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14402>	2022-01-07 12:16:41 +00:00
Philipp Zabel	1b808f1dea	etnaviv: fix emit_if in case the else block ends in a jump Fixes piglit test shaders@ssa@fs-if-def-else-break. Signed-off-by: Philipp Zabel <p.zabel@pengutronix.de> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12892>	2022-01-07 12:02:39 +00:00
Rohan Garg	af13119993	intel/fs: OpImageQueryLod does not support arrayed images as an operand When we lower SPIR-V to NIR for textures in vtn_handle_texture, we only bump the number of coordinate components when the op is not a lod query. Update the assert to take this into account. This fixes: - dEQP-VK.robustness.robustness2.bind.template.r32f.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.r32f.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.r32i.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.r32i.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.r32ui.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.r32ui.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rg32f.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rg32f.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rg32i.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rg32i.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rg32ui.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rg32ui.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rgba32f.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rgba32f.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rgba32i.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rgba32i.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rgba32ui.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.bind.template.rgba32ui.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.r32f.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.r32f.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.r32i.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.r32i.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.r32ui.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.r32ui.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rg32f.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rg32f.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rg32i.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rg32i.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rg32ui.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rg32ui.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rgba32f.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rgba32f.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rgba32i.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rgba32i.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rgba32ui.dontunroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag - dEQP-VK.robustness.robustness2.push.notemplate.rgba32ui.unroll.nonvolatile.sampled_image.no_fmt_qual.null_descriptor.samples_1.cube_array.frag Fixes: `231337a1` ("intel/fs/xehp: Assert that the compiler is sending all 3 coords for cubemaps.") Signed-off-by: Rohan Garg <rohan.garg@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13925>	2022-01-07 10:53:35 +00:00
Emma Anholt	558a600629	nir_to_tgsi: Enable fdot_replicates flag. That's how the TGSI math opcodes work. This lets lower_vec_to_regs coalesce the DP output into the .yzw channels, giving an impressive shader-db win on softpipe: total instructions in shared programs: 2929840 -> 2794036 (-4.64%) instructions in affected programs: 1651438 -> 1515634 (-8.22%) total temps in shared programs: 372730 -> 332744 (-10.73%) temps in affected programs: 118151 -> 78165 (-33.84%) and a minor one on r300: total instructions in shared programs: 51238 -> 51149 (-0.17%) instructions in affected programs: 2621 -> 2532 (-3.40%) total vinst in shared programs: 15655 -> 15618 (-0.24%) vinst in affected programs: 468 -> 431 (-7.91%) total temps in shared programs: 9838 -> 9828 (-0.10%) temps in affected programs: 59 -> 49 (-16.95%) and a bigger one on i915g: total instructions in shared programs: 398064 -> 395901 (-0.54%) instructions in affected programs: 29271 -> 27108 (-7.39%) total tex_indirect in shared programs: 12261 -> 12233 (-0.23%) tex_indirect in affected programs: 98 -> 70 (-28.57%) LOST: 0 GAINED: 5 The r300 change is less impressive because it does some backend copy-prop, but also because intermediate storage of DPs now takes a vec4 instead of a scalar. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14200>	2022-01-07 09:58:24 +00:00
Christian Gmeiner	85d7d520b9	panfrost/ci: update piglit fails Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14428>	2022-01-07 09:06:20 +00:00
Francisco Jerez	054eb9f346	intel/dev: Implement DG2 restrictions requiring additional DSSes to be disabled. Note that this causes a geometry slice to be disabled if any DSS is fused off within that slice, which may seem stricter than the BSpec quotation implies, but testing shows that pixel pipes with any faulted DSS don't work at all, and that using a slice with any faulted pixel pipe leads to serious graphics corruption. It would be better to query this geometry topology information from the hardware instead of trying to reconstruct it here, but the kernel interface for that is not available yet. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14436>	2022-01-07 07:58:27 +00:00
Francisco Jerez	e48c29acca	intel/dev: Add support for pixel pipe subslice accounting on multi-slice GPUs. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14436>	2022-01-07 07:58:27 +00:00
Francisco Jerez	f3274e94fd	intel/dev: Fix size of device info num_subslices array. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14436>	2022-01-07 07:58:27 +00:00
Dave Airlie	f7bb68e499	glsl/nir: don't pass gl_context to the convertor routine. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	ef3303385d	glsl/linker: remove a bunch more gl_context references. This leaves only one reference used for the ctx->Driver.NewProgram hook, this should likely be revisited. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	636b07943e	glsl/linker: drop unused gl_context. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	32702047d8	glsl/linker/uniform_blocks: don't pass gl_context around. just pass the constants Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	e943af6a55	glsl/nir/linker: avoid passing gl_context inside gl_nir linker Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	7c8017bbd1	glsl/linker: remove gl_context usage from more places. Just pass in consts, exts, api Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	34090712c6	glsl/linker: remove gl_context from check image resources Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	4d6e866b7b	glsl/linker: get rid of gl_context from atomic counters paths Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	e9ec1429ba	glsl/linker: get rid of gl_context from uniform assign paths Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	adbbee980d	glsl/linker: get rid of gl_context from link varyings Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	98f665e613	glsl/linker: remove direct gl_context usage in favour of consts/exts/api Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	d5e7c7351a	glsl/linker: move more ctx->Consts to consts. Don't pass gl contexts around as much Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	eb811bdaf4	glsl/linker: don't pass gl_context just for constants in xfb code Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	e83f0fc620	glsl: don't pass gl_context to lower shared references. this uses the consts only Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Dave Airlie	ff0771e253	glsl/linker: cleanup passing gl_context unnecessarily Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14433>	2022-01-07 06:19:49 +00:00
Jesse Natalie	c09c0b351f	nir_opt_dead_cf: Remove dead ifs An if that looks like: if (x) { } else { } That has no phis following it is dead. Currently these are only removed by peephole select, but that means that 'x' is considered used until that pass is run, which can make it difficult to apply sane lowering in the case where loading 'x' requires complex or expensive transformations, but 'x' is really unused. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14400>	2022-01-07 05:15:48 +00:00
Jesse Natalie	2bb2219aa7	d3d12: Set appropriate caps for shader images Note that currently there's no emulation if the D3D12 driver doesn't support the "UAV typed load" feature for all of the GL required formats. This is not a required D3D12 feature, so this support won't light up on all D3D12 hardware. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	0f8213cb10	d3d12: Handle bitcasting of shader images This is handled in 2 ways: * For casts in the same "class," we can just do the cast. There's also limited support for 4x8 => 1x32 and 2x16 => 1x32. * For casts that are just by size, use a lowering pass. This reads the data in the appropriate UINT format (except R11G11B10), retrieves the original bits, re-packs it into the dest, and returns it to the app for loads. For stores, the process is done in reverse - bits for the final format are computed and re-expanded to the format that should be used for the store. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	156ef05ec7	d3d12: Handle memory barriers This is a bit fragile. The algorithm is essentially: - Let the driver track state for non-dual-bound resources. - For resources that are dual-bound as SSBO/image and a second bind point, assume they're being used as UAVs. - When a MemoryBarrier is issued, dirty all destination bind points so they re-assert their state, and if the destination is not UAV, temporarily suppress UAV state re-assertion. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	63fe4888ea	d3d12: Lower cube images to 2D arrays via existing int cubemap lowering pass Note that the coordinates are not lowered, just types. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	d0a3794d6d	d3d12: Fill out shader image descriptor tables Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	ef44b93914	d3d12: Create textures as UAV-capable when appropriate Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	23f0c924cf	d3d12: Handle set_shader_images Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	5192c8dd1d	d3d12: Handle images in the root signature Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	def1f0743c	d3d12: Retrieve shader image dimensions during shader compiles Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	5e82a2ff2b	d3d12: Init null UAVs Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	9b62f88cf8	d3d12: Handle format support queries for shader images Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	951f6f2dba	d3d12: Figure out if we can support GL shader images Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	f7e50c8cf2	d3d12: Add missed SSBO binding enum value Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	6620c342ac	d3d12: Rename UAV -> SSBO to disambiguate with image UAVs Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	aa73203850	d3d12: Fix format table typeless-ness for A8 and RGBA1010102 Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	d1a5250c10	d3d12: Shrink 2D array size so that max-layer cube arrays can be created Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	02fc28625f	microsoft/compiler: Fix handling of fp16-in-32bit-val ops to handle high bits Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	20e374f4a3	microsoft/compiler: Hook up memory/control barriers Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	5d33a9fa47	microsoft/compiler: Handle forced early depth Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	0b88600a64	microsoft/compiler: Implement atomic image ops Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	1b62c86eb1	microsoft/compiler: Handle images as derefs for GL Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	a7311ceafe	microsoft/compiler: Fix array-of-array handling for derefs of textures/images Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	c9719b6d2e	microsoft/compiler: Emit SRVs/UAVs as arrays This is done regardless of size so that "dynamic" indexing with a uniform can work even on arrays of size 1. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	e3c14cf718	microsoft/compiler: Unify handle retrieval between images and UBO/SSBO Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	61872b5240	microsoft/compiler: Emit GL images in descriptor space 1 with driver_location instead of binding Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	83ed626b76	microsoft/compiler: Put SSBO and image handles in separate arrays In a future change, the bindings for images and SSBOs will start to overlap in GL (using a separate space to disambiguate them) Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Jesse Natalie	8a4b443c5b	microsoft/compiler: Change vulkan_environment bool to an enum There are currently 3 different "environments" supported by this backend, and they have slightly different semantics for how resources are accessed, which is only going to get a little weirder when GL images start getting supported. To try to make things less confusing, use an explicit enum with heavy documentation on what's different between them. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14342>	2022-01-07 03:31:16 +00:00
Caio Oliveira	87e2d2249d	anv/blorp: Apply pending pipe flushes after PIPELINE_SELECT Allows the PIPELINE_SELECT change to consume any outstanding flushes. In case it doesn't, we still apply the pipe flushses afterwards. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14301>	2022-01-07 03:14:55 +00:00
Caio Oliveira	313aeee84b	anv: Use pending pipe control mechanism in flush_pipeline_select() This removes the repeated implementation of a workaround and a per-platform case. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14301>	2022-01-07 03:14:55 +00:00
Caio Oliveira	9ba7bc17d3	anv: Add another case to INTEL_DEBUG=pc output Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14301>	2022-01-07 03:14:55 +00:00
Bas Nieuwenhuizen	85ca7fab29	radv: Add common entrypoint dependency. To ensure we have the header. The revert likely reintroduced compilation flakiness due to missing dependencies. Fixes: `a255f6f823` ("radv: do not use the common entrypoint for the Metro Exodus layer") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14430>	2022-01-07 02:21:36 +00:00
Bas Nieuwenhuizen	63101914f8	radv: Set optimal copy alignment to 1. I think we set it to 128 for no reason at all. The app is still required to align to the texel size. Note that we prefer 4 bytes for non-formatted buffer->buffer copy, but that isn't in scope for these properties according to the Vulkan spec. It also happens to help hide what looks like an application bug at this point with Baldurs Gate 3. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5509 Cc: mesa-stable Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14415>	2022-01-07 01:59:13 +00:00
Mike Blumenkrantz	05a5e5a2bc	radv: fix xfb query copy param ordering Fixes: `afff9dd0f0` ("radv: Use correct buffer size for query pool result copies.") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14422>	2022-01-06 17:44:09 +00:00
Samuel Pitoiset	f43b0d5cdc	radv/winsys: remove unused syncobj functions The vulkan common runtime code uses the drm functions directly. Fixes: `91fe0b5629` ("radv: Delete lots of sync code.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14375>	2022-01-06 09:13:50 +00:00
Samuel Pitoiset	da0c708d79	radv: remove remaining dead code related to the old sync code Fixes: `91fe0b5629` ("radv: Delete lots of sync code.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14375>	2022-01-06 09:13:50 +00:00
Pierre-Eric Pelloux-Prayer	dcbf2423d2	vbo/dlist: add vertices to incomplete primitives If a primitive is added with missing vertices, this will shift all the vertices and produce incorrect rendering. In issue #5786 the app uses GL_LINE_STRIPS with a single vertex. Adding extra vertices can make the initial estimation for the index buffer size incorrect, so this buffer can now be growed if needed. Fixes: `ac3d4c7635` ("vbo/dlist: convert LINE_STRIPS to LINES") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5786 Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14243>	2022-01-06 08:45:05 +00:00
Pierre-Eric Pelloux-Prayer	7a1d3d3abc	vbo/dlist: fix loopback crash The original code incorrectly adjusted only when Loopback was false, while primitives' start value is actually modified unconditionnally. Fixes: `3253594268` ("vbo/dlist: rework buffer sizes") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5754 Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14243>	2022-01-06 08:45:05 +00:00
Pierre-Eric Pelloux-Prayer	d84e0096a5	radeonsi/gfx8: use the proper dcc clear size dcc_fast_clear_size is assigned using addrlib's dccFastClearSize, which is computed using the whole surface size (including layers) so we don't need to multiply dcc_fast_clear_size by num_layers. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4530 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14409>	2022-01-06 09:25:57 +01:00
Vinson Lee	21047a4a06	isaspec: Remove duplicate return statement. Fix defect reported by Coverity Scan. Structurally dead code (UNREACHABLE) unreachable: This code cannot be reached: return val;. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14350>	2022-01-05 22:13:48 -08:00
Jordan Justen	d57b10ab98	intel/compiler: Adjust TCS instance-id for dg2+ Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14385>	2022-01-05 16:13:28 -08:00
Guilherme Gallo	9f58275f98	ci: skqp: Add documentation on how to maintain skqp jobs Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14146>	2022-01-05 20:15:04 +00:00
Guilherme Gallo	a6d05e6863	ci: Add a630_skqp jobs Start Xorg during skqp job, since it is needed to make rendered tests work. There are 1 new job, namely `a630_skqp` which runs GL and GLES backends and then the skqp GPU unittests. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5580 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14146>	2022-01-05 20:15:04 +00:00
Guilherme Gallo	8992cf5ab8	ci: Build skqp on ARM64 images This commit makes `kernel+rootfs_arm64` job build and install skqp on ARM64 devices rootfs. Skia repository has a tool to prepare skqp models located at `tools/skqp/cut-release`, which get files from [Skia Gold](https://skia.org/docs/dev/testing/skiagold/), generate files.checksum, rendertests.txt and unittests.txt. One gives a range of commits to let `cut-release` find the right resources to prepare skqp for the user. However, it is failing, since it fails when trying to get image packages from a range of commits via HTTPS from the host https://public-gold.skia.org but it responds with error 404 every time. I tried a range a thousand of commits, yet it still does not give results. The workaround employed was to recover the most recent `files.checksum` and `rendertests.txt` files from the git history and generate `unittests.txt` from `list_gpu_unit_tests` binary. `skqp` runs two lists of tests, `rendertests.txt` and `unittests.txt`. Both must be located inside the `skqp` assets folder. The first list uses GL and GLES to test rendering scenarios. The second runs some unit tests that do not render an image per se. In order to make the first `a630_skqp` to be green, the crashing tests were removed from the test lists and the expectations of the failing ones were updated. It is worth noting that `rendertests.txt` can bring some detail about each test expectation, so each test can have a max pixel error count, to tell `skqp` that it is OK to have at most that number of errors for that test. See also: https://github.com/google/skia/blob/main/tools/skqp/README_ALGORITHM.md As each render backend has a different error count, two different `rendertests.txt` files were created, `src/freedreno/ci/freedreno-a630-skqp-gl_rendertests.txt`, `src/freedreno/ci/freedreno-a630-skqp-gles_rendertests.txt` and , which one refers to GL and GLES tests respectfully. The unit tests file for a630 is located at `src/freedreno/ci/freedreno-a630-skqp_unittests.txt` ``` aaclip domain formats highcontrastfilter rectangle_texture yuv_make_color_space ``` ``` ProcessorOptimizationValidationTest VkProtectedContext_CreateNonprotectedContext VkYCbcrSampler_DrawImageWithYcbcrSampler VkYCbcrSampler_NoYcbcrSurface ``` Each test was updated with the max_error count equal to the first run result. ``` analytic_antialias_inverse async_rescale_and_read_dog_down async_rescale_and_read_dog_up async_rescale_and_read_rose async_rescale_and_read_text_down async_rescale_and_read_text_up async_rescale_and_read_text_up_large async_rescale_and_read_yuv420_rose complexclip2_path_bw encode-platform imageblur_large lcdtextsize onebadarc onefailarc scale-pixels surfaceprops textfilter_color textfilter_image ``` Considering all the following tests results as wrong. ``` async_rescale_and_read_no_bleed backdrop_imagefilter_croprect_persp complexclip2 imageblurrepeatmode mixerCF overdrawcolorfilter patch_alpha patch_primitive rrect_clip_bw scaledemoji_rendering yuv_splitter ``` v2: a) add link to HTML report on job log b) remove extraneous spaces diff c) remove unnecessary conditions from build-skqp.sh d) use fixed skqp source commit SHA v3: a) Use only main skia repository to fetch models and build skqp b) Use list_gpu_unit_tests binary to create a base unittests.txt file c) Remove crashing tests d) Set failing tests expectations for the first skqp run v4: a) Remove clang dependency b) Separate each skqp backend result into its folder c) Regroup a630_skqp in one job v5: a) Separate tests files per driver Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5580 Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14146>	2022-01-05 20:15:04 +00:00
Samuel Pitoiset	a255f6f823	radv: do not use the common entrypoint for the Metro Exodus layer This is incorrect, it will calls the function recursively. It seems there is random build failures with common runtime code and MSVC but that's a different issue. Closes: #5815 Fixes: `46c59e8fd6` ("radv: Remove dependencies on vk_common entrypoints.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14374>	2022-01-05 19:22:50 +00:00
Lucas Stach	c1f8bc67e2	etnaviv: initialize vertex attributes on context reset It seems that at least some GC400 come out of reset with random vertex attributes enabled and also don't disable them on the write to the first config register as normal. Enabling all attributes seems to provide the GPU with the required edge to actually disable the unused attributes on the next draw. Cc: mesa-stable Reported-by: Steven Walter Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14285>	2022-01-05 18:39:20 +00:00
Emma Anholt	105b48c85c	r300: Fix omod failing to increase the number of channels stored. In dEQP-GLES2.functional.shaders.operator.geometric.reflect.highp_vec2_fragment and friends this pass would turn: 0: DP3 temp[1].x, input[1].yx0_, input[0].wy0_; 1: MUL temp[2].xy, temp[1].xx__, const[0].xx__; into 0: DP3 temp[2].x * 2, input[1].yx0_, input[0].wy0_; 1: MUL temp[3].xy, temp[2].xy__, input[1].yx__; Note the attempt to use .y of temp[2]. Just bail when we more dst channels than src channels, since the rewrite can't generate more channels for us. Fixes this subset of tests (which I hadn't included in the xfails until now since results hadn't quite been stable). Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Filip Gawin <filip.gawin@zoho.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14405>	2022-01-05 17:07:38 +00:00
Emma Anholt	946fe209d9	ci/r300: Update xfails from a full dEQP run. I can now run the whole thing instead of doing subsets, so update for that. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14405>	2022-01-05 17:07:38 +00:00
Emma Anholt	7b7530a85a	r300: Use uif() instead of pointer aliasing in program printing. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14405>	2022-01-05 17:07:38 +00:00
Jason Ekstrand	ca1d0333db	v3dv: Use the common QueueSignalReleaseImageANDROID from RADV This is an actual functional change as we now plumb through the sync FD instead of doing a vkQueueSubmit and trusting in implicit sync. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Tested-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14372>	2022-01-05 16:36:10 +00:00
Jason Ekstrand	a9321b1309	anv: Use the common QueueSignalReleaseImageANDROID from RADV This is an actual functional change as we now plumb through the sync FD instead of doing a vkQueueSubmit and trusting in implicit sync. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14372>	2022-01-05 16:36:10 +00:00
Jason Ekstrand	b2073f5e5d	radv: Move QueueSignalReleaseImageANDROID to common code This is mostly a copy+paste job but with a few syntax changes to make it follow more closely with other common Vulkan code. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14372>	2022-01-05 16:36:10 +00:00
Jason Ekstrand	dfb1e1777c	anv,radv,v3dv: Move AcquireImageANDROID to common code All three implementations are identical. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14372>	2022-01-05 16:36:10 +00:00
Thong Thai	9caf110d4f	frontends/va/enc: default motion estimation parameters for performance Sets the default motion estimation parameters for the H.264 encoder to values that increase encoding performance - realistically, this only benefits the VCE encoder, as the parameters are used no where else. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3540 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2702 Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14339>	2022-01-05 16:03:45 +00:00
Michel Zou	4ff57e5aba	zink: fix -Warray-bounds warning It would seems msvc and mingw dont pack across disparate types so zink_bind_rasterizer_state is too big for int32 string.h:202:10: warning: ‘__builtin___memcpy_chk’ forming offset [4, 7] is out of the bounds [0, 4] of object ‘rast_bits’ with type ‘uint32_t’ {aka ‘unsigned int’} [-Warray-bounds] 202 \| return __builtin___memcpy_chk(__dst, __src, __n, __mingw_bos(__dst, 0)); \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ ../src/gallium/drivers/zink/zink_state.c: In function ‘zink_bind_rasterizer_state’: ../src/gallium/drivers/zink/zink_state.c:586:16: note: ‘rast_bits’ declared here Fixes: `9c5a2ab6` Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12609>	2022-01-05 14:46:03 +00:00
Marek Olšák	43d57189dd	radeonsi: print the number of param exports for shader-db Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	d96a346120	radeonsi: print all streamout info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	3c471c0eb5	ac/nir: move ac_are_tessfactors_def_in_all_invocs into radeonsi radv isn't going to use it because it's for the TCS epilog Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	116a05c721	ac: move ac_exp_param.h to ac_nir.h Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	d4a1766a5a	radeonsi: move the GS copy shader into shader variants This will allow further optimizations for shader variants that change GS outputs (affecting the copy shader), and this is mainly about sharing optimizations with NGG instead of having a totally separate codepath for legacy GS. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	1caa94f2a5	radeonsi: add into the disk cache key whether cached shaders contain LLVM IR Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	8ed9d38e73	radeonsi: move si_nir_scan_shader into si_shader_info.c Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	70919f30c1	radeonsi: change si_shader_output_values::vertex_stream to a bitmask to match si_shader_info. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	2c4926dfc8	radeonsi: use nir->scratch_size instead of ac_count_scratch_private_memory It's the same. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	3fb77ef2e0	radeonsi: do opt_large_constants & lower_indirect_derefs after uniform inlining because loop unrolling caused by uniform inlining can eliminate large constants and indirect derefs. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	198ad7e4dc	radeonsi: move smoothing to the main shader part to remove 1 live VGPR The samplemask VGPR that we had to pass to the epilog increased VGPR usage by 1 for all shaders. Do it in the main function by using the mono key structure, which causes on-demand compilation and stall, but we'll save the VGPR. 57794 shaders in 35145 tests Totals: SGPRS: 2715856 -> 2716272 (0.02 %) VGPRS: 1776168 -> 1718432 (-3.25 %) Spilled SGPRs: 3704 -> 3630 (-2.00 %) Spilled VGPRs: 1727 -> 1733 (0.35 %) Private memory VGPRs: 256 -> 256 (0.00 %) Scratch size: 2008 -> 2016 (0.40 %) dwords per thread Code Size: 61429584 -> 61393288 (-0.06 %) bytes Max Waves: 838645 -> 840484 (0.22 %) Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:31 +00:00
Marek Olšák	12b942bd16	radeonsi: pass sample_coverage VGPR index to the PS prolog instead of guessing The code was correct, but little confusing. This is cleaner. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	3283df1425	radeonsi: remove unused si_shader::prolog2 This became unused when the GS prolog was removed. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	054d50904b	radeonsi: don't bind the ESGS ring twice, handle the difference in the shader The other shader is changed to modify the descriptor to get the required values. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	18fadc8e6a	radeonsi: reorder slots for internal buffers, reuse a slot for GS_QUERY_BUF GFX10_GS_QUERY_BUF is also renamed. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	0246580106	radeonsi: simplify compacted_mrt_index in si_export_mrt_color Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	ca5f11ccd3	radeonsi: export mrtz before color exports Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	e762bc751e	radeonsi: remove unnecessary code that was used to find the last export We can do it trivially like this. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	fd14e0afb1	radeonsi: set done=1 for PS exports at the end of si_llvm_build_ps_epilog so that we can reorder the exports Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	fd02299055	radeonsi: clean up si_export_mrt_color Remove the intermediate args variable. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	7f6643ffd0	radeonsi: make get_thread_id_in_tg non-static for future work Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	6386b95f0f	radeonsi: modifiers can't disable DCC Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	d382ceea2b	ac/llvm: remove the num_channels parameter from ac_build_buffer_store_dword It was used when LLVM didn't support vec3 and we had to pass vec4 with num_channels=3. We no longer need to do that. This also removes the vec3 splitting or conversion to vec4 in callers. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	e6aac44051	ac/llvm: add vindex into ac_build_buffer_store_dword for future work Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Marek Olšák	bb017d5d16	amd/registers: work around an assertion in parse_kernel_headers.py Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14266>	2022-01-05 12:46:30 +00:00
Iago Toral Quiroga	44fa8304d4	v3dv: add a refcount mechanism to BOs Until now we have lived without a refcount mechanism in the driver because in Vulkan the user is responsible for handling the life span of memory allocations for all Vulkan objects, however, imported BOs are tricky because the kernel doesn't refcount so user-space needs to make sure that: 1. When importing a BO into the same device used to create it (self-importing) it does not double free the same BO. 2. Frees imported BOs that were not allocated through the same device. Our initial implementation always freed BOs when requested, so we handled 2) correctly but not 1) and we would double-free self-imported BOs. We tried to fix that in commit `d809d9f3` but that broke 2) and we started to leak BOs for some imports. This fixes the problem for good by adding refcounts to BOs so that self-imported BOs have a refcnt > 1 and are only freed when all references are freed. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5769 Tested-by: Roman Stratiienko <r.stratiienko@gmail.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14392>	2022-01-05 12:22:45 +00:00
Marek Olšák	946bd90a09	radeonsi: decrease the size of si_pm4_state::pm4 except for cs_preamble_state Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	a031cd3c2f	radeonsi: replace SI_PM4_MAX_DW with a max_dw field it will vary Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	005fc20a34	radeonsi: pack si_pm4_state Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	3ea5beca1f	radv: apply spi_cu_en to CU_EN Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	955c6de1c1	radv: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	384014bebe	radeonsi: apply spi_cu_en to CU_EN Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	b06b481dfe	radeonsi: program COMPUTE_STATIC_THREAD_MGMT_SE4..7 on Arcturus Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	5406ad93a9	radeonsi: set COMPUTE_DESTINATION_EN_SEn to spi_cu_en Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	470b61f3a9	ac/gpu_info: add AMD_CU_MASK environment variable to set CU_EN requested internally Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Marek Olšák	a68cb9db8d	ac/gpu_info: set cu_mask correctly for Arcturus Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14122>	2022-01-05 01:36:10 -05:00
Emma Anholt	b9e8936bfb	i915g: Turn off FP16 in the vertex shaders. This ended up being turned on in gallivm, but since we use nir_to_tgsi on the VS and TGSI doesn't have FP16, we can't let that happen. Fixes: `f814a2449e` ("llvmpipe: enable FP16 and update CL + traces piglit results.") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14403>	2022-01-05 03:15:15 +00:00
satmandu	d27805753f	Fix compilation on armv7l with gcc 11.2.0 Cc: mesa-stable Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12810>	2022-01-05 02:48:29 +00:00
Timothy Arceri	d2711f9b61	glsl/glcpp: make sure to expand new token after concatenation Previously the code was using a hack to change the token type from INDETIFIER -> OTHER in order to avoid getting in an infinite loop expanding the tokens. This worked ok until we got to a paste where the replacement parameters had already had their type changed to OTHER because the newly created paste token would then inherit the OTHER type and never get expanded inself. For example with the follow code: #define STEP_ONE() \ out_Color = vec4(0.0,1.0,0.0,1.0) #define GLUE(x,y) x ## _ ## y #define EVALUATE(x,y) GLUE(x,y) #define STEP(stepname) EVALUATE(STEP, stepname)() #define PERFORM_RAYCASTING_STEP STEP(ONE) This would get all the way to expanding PERFORM_RAYCASTING_STEP to STEP_ONE() but because it was created via the paste `x ## _ ## y` it would never get any further. To fix this we remove the OTHER hack and instead just track if the token has already been handled via a bool value `explanding`. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5724 Fixes: `28842c2331` ("glcpp: Implement token pasting for non-function-like macros") Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14101>	2022-01-04 23:36:42 +00:00
Emma Anholt	4dc6cd5933	tgsi/exec: Simplify indirects now that they always use the ADDR file. This was a lot of extra code in the hot path of getting though fetch_src_file_channel(). No significant perf difference in softpipe, though. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14360>	2022-01-04 23:05:41 +00:00
Emma Anholt	c00db99e0e	gallium: Delete PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS Softpipe was the only driver still using this feature. I had enabled it in `ba22f014f9` ("softpipe: Enable PIPE_CAP_TGSI_ANY_REG_AS_ADDRESS;") for an instr count win, but it's really not important to that driver and it's not worth keeping the knob around just for that. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14360>	2022-01-04 23:05:41 +00:00
Emma Anholt	4bb9c0a28a	nir_to_tgsi: Use the same address reg mappings as GLSL-to-TGSI did. It turns out r600 has a bunch of expectations about the Dimension being in ADDR[1].x, and sampler or atomic indirects being in ADDR[2].x. It's simpler to just use this static assignment than our dynamic one, anyway. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14360>	2022-01-04 23:05:41 +00:00
Biju Das	daf59694ac	kmsro: Add 'rcar-du' driver support rcar-du is used by Renesas R-Car and RZ/G SOCs. Driver is available in the mainline kernel [1] [1] https://elixir.bootlin.com/linux/latest/source/drivers/gpu/drm/rcar-du Suggested-by: Daniel Stone <daniels@collabora.com> Signed-off-by: Show Liu <show.liu.yj@renesas.com> Signed-off-by: Biju Das <biju.das.jz@bp.renesas.com> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14393>	2022-01-04 21:59:16 +00:00
Adam Jackson	69678d50d1	mesa: Remove unused _mesa_get_linear_format_srgb Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14220>	2022-01-04 19:45:55 +00:00
Adam Jackson	267f28e384	mesa: Remove unused _mesa_format_fallback_rgbx_to_rgba Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14220>	2022-01-04 19:45:55 +00:00
Adam Jackson	07d23c207a	mesa: Remove unused _mesa_bind_texture Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14220>	2022-01-04 19:45:55 +00:00
Adam Jackson	e44ce092be	mesa: Remove unused _mesa_AllocTextureStorage_sw Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14220>	2022-01-04 19:45:54 +00:00
Adam Jackson	c77da4de4b	mesa: Remove unused _mesa_allow_light_in_model ctx::_ForceEyeCoords is now never not false, so remove and simplify to match. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14220>	2022-01-04 19:45:54 +00:00
Timur Kristóf	d34d0f38e1	radv: Support VRS for mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14193>	2022-01-04 17:46:02 +00:00
Timur Kristóf	bc94c2718a	aco: Emit VRS rate when it's per-primitive. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14193>	2022-01-04 17:46:02 +00:00
Timur Kristóf	811c001049	radv: Lower primitive shading rate for mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14193>	2022-01-04 17:46:02 +00:00
Timur Kristóf	c6becb9574	radv: Note when a mesh shader writes the primitive shading rate. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14193>	2022-01-04 17:46:02 +00:00
Rob Clark	94f465856c	clover: Move min image support check If the gallium driver supports images, but not the minimum image requirements for CL, then simply don't claim to support images. This is better than not claiming to support CL at all. Signed-off-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14310>	2022-01-04 16:56:25 +00:00
Samuel Pitoiset	12ac44378d	radv: add UMR markers for the vertex prolog Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Samuel Pitoiset	14622b8bcc	radv: dump the VS prolog disassembly to the hang report Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Samuel Pitoiset	d659c2ef9c	radv: save the vertex prolog to the trace BO for debugging Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Samuel Pitoiset	2bf25e6f6e	radv,aco: keep track of the prolog disassembly if necessary Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Samuel Pitoiset	e836174077	aco: do not print prologs disassembly if no disassembler Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13376>	2022-01-04 07:50:07 +00:00
Samuel Pitoiset	3ef736c94e	aco: fix a dynamic-stack-buffer-overflow when printing instructions Detected by ASAN. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14329>	2022-01-04 08:04:50 +01:00
Dave Airlie	ad2902cbbe	mapi: generate correct dispatch for EXT_draw_instanced These APIs can be exposed in GLES2.0 via EXT_draw_instanced, they were incorrectly being stuck on GLES 3.0 only. Fixes piglit ext_draw_instanced-drawarrays Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14338>	2022-01-03 21:37:34 +00:00
Pavel Ondračka	96ad4f6437	r300: Remove broken optimization in rc_transform_KILL The logic was reversed so this was not only not working but it was also removing random instructions around. The special IF-KILP-ENDIF case this optimization was targeting is already transformed to KILL_IF in the TGSI, so just remove this altogether. This fixes piglit glsl-fs-discard-04 v2: Update the comment as well Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/343 Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Reviewed-by: Filip Gawin <filip.gawin@zoho.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14378>	2022-01-03 20:45:32 +00:00
Thomas H.P. Andersen	496947d0d4	ci: debian-clang: drop -Wno-error=absolute-value Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14302>	2022-01-03 20:20:37 +00:00
Thomas H.P. Andersen	c32c9014f5	broadcom/compiler: fix compile warning -Wabsolute-value fixes a compile warning with clang Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14302>	2022-01-03 20:20:37 +00:00
Thomas H.P. Andersen	574c4466f8	xa: fix compile warning for -Wabsolute-value fixes a compile warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14302>	2022-01-03 20:20:37 +00:00
Lionel Landwerlin	8092d58e2e	util/u_trace: protect against reentrant calls With the Iris driver, the first tracepoint triggers the tracepoint for "begin of command buffer" which leads to 2 tracepoints using the same idx. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14254>	2022-01-03 18:15:08 +00:00
Marek Olšák	13bdd8da5e	driconf: enable glthread for Minecraft-FTB, Stellaris, Battletech Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14368>	2022-01-03 17:45:59 +00:00
M Henning	d28677cfd9	nouveau/nir: Lower 64-bit phis Fixes arb_gpu_shader_fp64-fs-non-uniform-control-flow-ubo on kepler with NV50_PROG_USE_NIR=1 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14343>	2022-01-03 17:35:05 +00:00
Marek Olšák	ce72395008	radeonsi: add a debug option that disables DCC for all exported buffers requested internally Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14283>	2022-01-03 17:19:56 +00:00
Thomas H.P. Andersen	4cc0548046	zink: malloc/sizeof mismatch buffer_infos is VkBufferView* but malloc is using the size of VkImageView. The size is the same but this fixes a warning Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14335>	2022-01-03 17:00:55 +00:00
Thomas H.P. Andersen	495b2c537b	ci: debian-clang: -Wno-error for sometimes-uninitialized The last of this kind of warning is gone for CPP Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14296>	2022-01-03 14:34:22 +00:00
Thomas H.P. Andersen	9e2f1f05c6	r600/sb: silence a sometimes-uninitialized warning Fixes a warning from clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14296>	2022-01-03 14:34:22 +00:00
Mike Blumenkrantz	2a4c56e4e3	mesa/vbo: be more comprehensive for degenerate primitive conversion in dlists these shouldn't result in any sort of draw, and passing the unsupported primtype to the driver is invalid Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13845>	2022-01-03 13:59:48 +00:00
Alyssa Rosenzweig	29d319c767	pan/bi: Fix load_const of 1-bit booleans For historical reasons, we ingest 1-bit booleans in NIR but expand them to 16/32-bit booleans in the backend IR. We need to handle this case when loading boolean constants, extending from 1-bit to 16/32-bit as required. This issue is masked by effective constant folding for booleans, but is visible in a shader from Firefox WebRender. Fixes: `646e03c451` ("pan/bi: Temporarily switch back to 0/~0 bools") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 Closes: #5797 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14371>	2022-01-02 20:26:15 +00:00
Uday Kiran Pichika	78ef08a061	anv: enable adaptive sync for ANV Signed-off-by: Uday Kiran Pichika <pichika.uday.kiran@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6420>	2022-01-02 18:53:29 +00:00
Uday Kiran Pichika	89b54626fe	iris: enable adaptive sync for IRIS Signed-off-by: Uday Kiran Pichika <pichika.uday.kiran@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6420>	2022-01-02 18:53:29 +00:00
Alyssa Rosenzweig	795638767d	pan/bi: Use fused dual source blending Instead of emitting a pile of moves to fixed registers at codegen time and hoping everything works out, add a second staging source to the BLEND instruction in the intermediate representation containing the dual source colour, and modify register allocation appropriately. This better models the operation of blending render target #0 with two sources. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	af3863c658	pan/bi: Allow an extra staging source Useful for dual source blending. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	251b1c1753	pan/bi: Use is_staging_src helper This will allow us to have a separate set of staging sources, which will be useful for dual-source blending. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	c4da1b84d3	panfrost: Remove pan_nir_reorder_writeout Now that dual source stores are fused, there's no ordering hazard so we can remove this tricky code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	d3fb341b4a	panfrost: Combine dual source blends Reuse the store_combined_output_pan infrastructure for dual source blending as well. This simplifies ordering. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	24ea7cbb06	nir: Extend store_combined_output_pan Extend store_combined_output_pan to take a dual source blend input in addition to colour, depth, and stencil inputs. Use the last source for this, and represent the type with the DEST_TYPE index. This is a hack but there is no SRC2_TYPE and NIR doesn't seem to mind as long as we know what we mean. This allows the backend to emit a combined "blend render target #0" instruction taking two sources. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	e42133aa2f	panfrost: Simplify blend lowering pass Combine depth/stencil writeout redundancy. This simplifies the code and will allow us to add dual source blend handling without duplicating even more code. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	5c168f09eb	nir: Eliminate store_combined_output_pan BASE It's meaningless for this intrinsic and is just adding noise to the lowering pass. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	b3d7272753	pan/mdg: Don't read base for combined stores `base` is meaningless for combined stores, so don't read it. This allows us to remove the base index from the intrinsic and simplify the lowering code generating the combined store intrinsic. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Alyssa Rosenzweig	996645e479	pan/bi: Don't read base for combined stores `base` is meaningless for combined stores, so don't read it. This allows us to remove the base index from the intrinsic and simplify the lowering code generating the combined store intrinsic. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13714>	2022-01-02 01:12:05 +00:00
Tatsuyuki Ishi	31d839aacc	aco: lower masked swizzle to DPP8 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13971>	2021-12-31 20:56:39 +00:00
Tatsuyuki Ishi	da0412e55b	aco: support DPP8 Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13971>	2021-12-31 20:56:39 +00:00
Jesse Natalie	5c3dfb4ef5	gallium/aux: Move index offsetting from prim restart to primconvert Fixes: `b34fed64` ("u_prim_restart: Fix index scanning with start offset") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5799 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14346>	2021-12-31 17:29:46 +00:00
Bas Nieuwenhuizen	46c59e8fd6	radv: Remove dependencies on vk_common entrypoints. Fixes: `91fe0b5629` ("radv: Delete lots of sync code.") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14366>	2021-12-31 17:49:10 +01:00
Bas Nieuwenhuizen	8c4c63a6ed	radv: Rename submit2->submit. Cleanup after the big switcheroo. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	84ec547f69	radv: Remove syncobj reset mechanism. Now done with the vulkan runtime. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	91fe0b5629	radv: Delete lots of sync code. This make things all fall back to the common synchronization functions which will switch things to the new submission path and to use vk_fence and vk_semaphore. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> radv: Use common AcquireNextImage2KHR. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> radv: Use common functions for Metro Exodus layer. Needs to be squashed with the big "switch the world" deletion patch but kept it separate for review. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	967fc415fc	radv: Add new submission path for use by the common sync framework. Basically just a copy of radv's internal deferred submission handler. This uses the new winsys submit2 to later be able to move the sem_info to the winsys. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	93cde57f7a	radv: Add new cs_submit2 winsys call. A bit more runtime dependence so we can remove all the syncobj stuff in radv itself. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	9d75f23350	radv: Use vk_command_buffer for preambles. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	31da5c41b6	radv: Set horizontal sync types. We basically use 2 types for amdgpu: 1) syncobj. This supports both binary & timeline and autodetects all the features. 2) emulated timelines if (1) doesn't detect timeline syncobj support. Note that one has to put these in order of preference so that the common CreateFence/CreateSemaphore functions decide on the right type. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:19 +00:00
Bas Nieuwenhuizen	1965872b41	radv: Add function to allow WSI signalling fences/semaphores. Added but not enabled yet. Will enable in the big switchover. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	3055d2f9df	radv: Initialize vk device drm fd. For common sync. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	101a366e57	meson: Bump libdrm_amdgpu version req to 2.4.109. For getting the fd from the device. This uses the new function introduced with libdrm 2.4.109. This has already been updated in CI and is already required for mesa common bits so requiring it for libdrm_amdgpu is no big deal. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	1a84dd6358	radv: Use vulkan runtime for device lost. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	085f99b729	radv: Use dispatch table for wsi_display.c Pretty sure this could be moved into WSI. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	40558dbe91	radv: Use dispatch table for QueueWaitIdle in the SQTT layer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	7a84314c12	vulkan/runtime: Add sparse bind support. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Bas Nieuwenhuizen	73cdc302ab	vulkan/runtime: Refactor queue submit to take an argument struct. For merging with the sparse binding structs. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13974>	2021-12-31 15:14:18 +00:00
Daniel Schürmann	16a527deef	aco: don't split VOP3P definitions Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	7e02787a54	aco: use p_create_vector(v2b,v2b) in get_alu_src_vop3p() Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	e56d8b0b2e	aco: use explicit zero-padding for 64bit image loads in expand_vector() Previously, this only worked because of regClass mismatches in the allocated vector. Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	91f17e1c73	aco/optimizer: apply extract from subdword p_split_vector Totals from 1345 (1.00% of 134572) affected shaders: (GFX10.3) VGPRs: 76752 -> 76744 (-0.01%); split: -0.02%, +0.01% SpillSGPRs: 1459 -> 1460 (+0.07%) SpillVGPRs: 1776 -> 1784 (+0.45%); split: -0.39%, +0.84% CodeSize: 13310964 -> 13309420 (-0.01%); split: -0.06%, +0.05% Scratch: 178176 -> 179200 (+0.57%) Instrs: 2516874 -> 2516860 (-0.00%); split: -0.05%, +0.05% Latency: 23228506 -> 23230338 (+0.01%); split: -0.14%, +0.15% InvThroughput: 6002384 -> 6000158 (-0.04%); split: -0.24%, +0.21% VClause: 41115 -> 41117 (+0.00%); split: -0.28%, +0.29% SClause: 104639 -> 104664 (+0.02%); split: -0.07%, +0.09% Copies: 185121 -> 184862 (-0.14%); split: -0.69%, +0.55% Branches: 100740 -> 100735 (-0.00%); split: -0.01%, +0.00% PreVGPRs: 70119 -> 69968 (-0.22%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	fb622775b5	aco/optimizer: optimize extract(extract()) Totals from 53 (0.04% of 134572) affected shaders: (GFX10.3) SpillVGPRs: 1780 -> 1776 (-0.22%); split: -0.34%, +0.11% CodeSize: 968352 -> 963196 (-0.53%); split: -0.55%, +0.02% Scratch: 180224 -> 178176 (-1.14%) Instrs: 169800 -> 169158 (-0.38%); split: -0.39%, +0.01% Latency: 6186064 -> 6141408 (-0.72%); split: -1.16%, +0.44% InvThroughput: 2605044 -> 2582967 (-0.85%); split: -1.37%, +0.52% VClause: 4851 -> 4866 (+0.31%); split: -0.16%, +0.47% SClause: 1744 -> 1746 (+0.11%) Copies: 42874 -> 42325 (-1.28%); split: -1.40%, +0.12% Branches: 5762 -> 5765 (+0.05%); split: -0.02%, +0.07% Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	5ad9c20d4a	aco/optimizer: apply extract from p_extract_vector Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	11712729eb	aco/optimizer: keep instr_mod_labels after applying extract This allows to use clamp on SDWA and VOP3 opsel instructions. Totals from 32 (0.02% of 134572) affected shaders: (GFX10.3) SpillVGPRs: 1783 -> 1780 (-0.17%); split: -0.50%, +0.34% CodeSize: 881480 -> 881496 (+0.00%); split: -0.15%, +0.15% Instrs: 154400 -> 154388 (-0.01%); split: -0.13%, +0.12% Latency: 5021791 -> 5033485 (+0.23%); split: -0.67%, +0.90% InvThroughput: 2486454 -> 2492312 (+0.24%); split: -0.67%, +0.91% VClause: 4763 -> 4755 (-0.17%); split: -0.52%, +0.36% Copies: 42866 -> 42965 (+0.23%); split: -0.25%, +0.48% Branches: 5640 -> 5639 (-0.02%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Daniel Schürmann	1502c22e2c	aco: don't allow SDWA on VOP3P instructions Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13576>	2021-12-31 14:52:14 +00:00
Samuel Pitoiset	90994e4db7	radv: add drirc radv_disable_htile_layers and enable it for F1 2021 To workaround some flickering issues in the main menu. See https://github.com/HansKristian-Work/vkd3d-proton/issues/950 Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14354>	2021-12-31 14:27:32 +00:00
Samuel Pitoiset	ce5d7bf508	radv: fix copying mutable descriptors to sampler descriptors This fixes a heap-buffer-overflow detected by ASAN. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14330>	2021-12-31 13:39:56 +00:00
Timur Kristóf	6fee84bc2e	radv: Enable NV_mesh_shader with a perftest flag. We don't plan to support NV_mesh_shader officially on RADV, because it performs poorly on AMD hardware. However, we are implementing this extension to get some experience with mesh shader technology. Users should not rely on this support because we are going to remove it if/when a potential cross-vendor extension appears. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	f2dd1fbc63	radv: Implement NV_mesh_shader draw calls. The NV indirect command buffer format is not supported by the hardware, so we have to emit several copy packets to make it work. Otherwise it works by emulating the number of launched MS workgroups by setting the number of "input vertices". Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	8d238f5581	aco: Export per-primitive mesh shader output attributes. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	fc1424f1d8	aco: Use the correct outinfo for mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	92556d6067	aco: Add 1D workgroup_id support for mesh shaders. I'll add support for 3D workgroup_id later, but NV_mesh_shader only supports 1D workgroups. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	7759323b75	aco: Update README about NGG and mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	6766e6a985	aco: Add Mesh and Task shader stages. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	7ae425c5d4	radv: Add support for mesh shading pipelines in the command buffer. - Prefetch mesh shaders - Flush mesh shader descriptors - Assert that input assembly, etc. are ignored. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	8dc4f626ac	radv: Create mesh shading pipelines. - Fill gfx10_ngg_info - Allow NULL input assembly state - Assert that the correct shader stages are used - Program VGT_GS_MAX_VERT_OUT, GS_EN, GS_FAST_LAUNCH Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	e2df56f502	radv: Set output driver locations for mesh shaders. Ignore primitive count and indices which work differently. Then, assign driver locations to per-vertex and per-primitive outputs independently because we allocate separate LDS areas for them. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	11501fa686	radv: Compile mesh shaders and apply the necessary NIR lowerings. Mesh shaders use NGG, but the API allows many compute shader features such as workgroups and shared memory. Use the appropriate NIR lowerings for these, then call ac_nir_lower_ngg_ms. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	ae82d9d8f4	radv: Setup shader arguments for mesh shaders. Use the same code path as other NGG shaders, but with a few special cases. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	e3a80d9d29	radv: Add support for per-primitive mesh shader outputs. Generic per-primitive outputs: They work similarly to other NGG outputs. In the ISA they are param export instructions that are executed on the primitive threads. These per-primitive params must be sorted last among both mesh shader outputs and pixel shader inputs. PS can read these inputs using the same old VINTRP instructions. They use the same amount of LDS space as per-vertex PS inputs. Special per-primitive outputs: The VRS rate x, y, viewport and layer are special per-primitive outputs which must go to the second channel of the primitive export instruction, which is enabled by EN_PRIM_PAYLOAD. If the PS wants to read these, they must also be exported as a generic param. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	c7027156ea	radv: Cleanup VS output param assignment. Makes the code a little cleaner, and makes it easier to add per-primitive PS inputs. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	256dcc209f	radv: Cleanup PS input generation. This cleans up radv_pipeline_generate_ps_inputs. Makes the code a little cleaner, and makes it easier to add per-primitive PS inputs. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	459b4961bd	radv: Add mesh shader specific info. Use the same old outinfo structure as other NGG shaders. Additionally, store the output primitive type. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	32bc466dcb	radv: Add radv_pipeline_has_mesh helper. This will be used to determine whether a pipeline uses mesh shading or not. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	7aa42e023a	ac/nir/ngg: Lower NV mesh shaders to NGG semantics. Lower mesh shader outputs to shared memory. At the end of the shader, read the outputs from shared memory and export their values as NGG expects. We allocate separate shared memory (LDS) areas for per-vertex, per-primitive outputs, primitive indices, primitive count. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13580>	2021-12-31 13:05:09 +00:00
Timur Kristóf	1a1f5ad11d	gitlab-ci: Disable radv-fossils again. It gets a HTTP 504 error code when trying to access a git repo. Disable it again until this issue is resolved. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14364>	2021-12-31 13:38:32 +01:00
Markus_included	273edf76c2	Fixed you're to your Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14164>	2021-12-30 21:51:19 +00:00
Henry Goffin	fe617bcca0	intel/compiler/test: Fix build with GCC 7 Without this change, test_fs_scoreboard.cpp does not compile on GCC 7 due to the use of C99 initializers in a C++ source file. Fixes: `c847bfaaf5` ("intel/fs/gen12: Add tests for scoreboard pass") Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14349>	2021-12-30 19:59:52 +00:00
Jesse Natalie	8371591dc6	microsoft/compiler: Fix LOD instruction to return 2 values Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14161>	2021-12-30 09:50:17 -08:00
Jesse Natalie	8926cfc9f7	d3d12: Enable texture gather The CI changes are because WARP fails really hard at gather, with crashes when doing it on a 1x1 or Nx1 texture, and incorrectly applying swizzling to the result vector instead of the actual texture accesses. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14161>	2021-12-30 09:50:17 -08:00
Jesse Natalie	bd7fc35d6a	d3d12: Handle cubemap gather on int cubemaps There were 2 options: This, or double-bind the cubemap so we can reference it as both a cube and as a 2D array, and only run gathers on the cubemap. As ugly as this is, I think I like it better. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14161>	2021-12-30 09:49:33 -08:00
Jesse Natalie	9ba4569184	microsoft/compiler: Position should always be no-perspective Debug WARP asserts on this, and sure enough, the spec says it should be no-perspective. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14161>	2021-12-30 08:14:20 -08:00
Jesse Natalie	fe963f336c	d3d12: Enable cubemap arrays Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14161>	2021-12-30 08:14:11 -08:00
Jesse Natalie	5b32915055	d3d12: Replace pipe cap literals with D3D12 defines when available Also remove references to feature levels < 11.0, D3D12 doesn't support those Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14161>	2021-12-30 08:13:30 -08:00
Qiang Yu	71e2df73d9	radeonsi: enable ARB_sparse_texture Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	5a844f87a1	radeonsi: support texture resource commit Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	5f3c904ac8	radeonsi: implement get_sparse_texture_virtual_page_size Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	c11059c2b5	radeonsi: use staging buffer for sparse texture when transfer map Can't map sparse texture directly. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	c2bb232dae	radeonsi: support alloc a sparse texture Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	1876285c27	ac/surface: add prt_tile_depth For supporting 3D sparse texture. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	92d810fa74	ac/surface: fix prt_first_mip_tail calculation for gfx9+ Use firstMipIdInTail directly from addrlib which calculated this in a different way: Original way: either dimension size of mipmap should be less than the tile size. Addrlib way: all dimesion size of the mipmap should be less than the tile size and at lest one dimension size should be less than half of the tile size, so that all following mip levels can fit in one tile and any commit for level in the mip tail also commit for all levels in mip tail. Theoretically either way is OK but addrlib way needs less care about the mip tail commit and better align with the true memory layout given by itself. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	40928e452d	winsys/radeon: change surface_init flags to 64bit RADEON_SURF_PRT is (1ull << 32). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	30daded7e9	mesa/st: update NumSparseLevels from pipe_resource Add nr_sparse_levels in pipe_resource for updating the gl_texture_object->NumSparseLevels. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	90415c1a3a	mesa: implement glTexPageCommitmentARB/glTexturePageCommitmentEXT Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	82f2ab0bfc	mesa/st: add st_TexturePageCommitment interface Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	34e5c14f70	mesa: glTexStorage* support sparse texture allocation Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	482fe76300	mesa/st: add st_GetSparseTextureVirtualPageSize interface For ARB_sparse_texture implementation. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	c7c5f9e168	mesa: add ARB_sparse_texture texture param set/get Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	b1c32a6c8c	mesa: add ARB_sparse_texture query in glGetInternalformativ Add new parameter of glGetInternalformativ for ARB_sparse_texture. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	1a616c4b29	gallium: add get_sparse_texture_virtual_page_size for noop/rbug/trace Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	eed8421bba	gallium: add screen get_sparse_texture_virtual_page_size callback For ARB_sparse_texture extension to query driver supported page size for each texture format. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	f3ea79f65e	mesa: add ARB_sparse_texture constants Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	e1cecd8964	mesa: add ARB_sparse_texture extension Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	7f48aba641	gallium: add caps for sparse texture support Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Qiang Yu	bcaf9704ad	glapi: should not add alias function to static_data.py Alias function should not be assigned an offset, otherwise new added function will get error: Exception: entries are not ordered by slots Fixes: `757bc6d37a` ("mesa: Add support for EXT_clear_texture") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14223>	2021-12-30 16:11:19 +08:00
Vinson Lee	94351c94db	r600/sfn: Remove unused AluInstruction members. Fix defects reported by Coverity Scan. uninit_member: Non-static class member m_omod is not initialized in this constructor nor in any functions that it calls. uninit_member: Non-static class member m_pred_sel is not initialized in this constructor nor in any functions that it calls. Suggested-by: Emma Anholt <emma@anholt.net> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12294>	2021-12-29 23:31:33 -08:00
Dave Airlie	9130e4564b	crocus: set max clip planes to 6 for gen4. Fixes piglit gl-1.0-user-clip-all-planes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14344>	2021-12-30 16:07:56 +10:00
Dave Airlie	af3cbc5379	gallium/mesa: enhance PIPE_CAP_CLIP_PLANES to support override number There exists hardware intel gen4 specifically that has only 6 clip planes but supports GLSL 1.30. This enhances the CAP so that the current values of 0,1 remain the same, but giving it a larger number will override the max. This allows the gen4 intel to set this to 6 and leave everyone else alone. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14344>	2021-12-30 16:07:42 +10:00
Dave Airlie	f1c1fcfdce	crocus: don't create staging resources > half aperture The theory here is that a staging resource will need to be transferred into or out of a non-staging resource. If the staging resource is too big for 1/2 aperture threshold, then it the corresponding non-stage resource will be similiarly sized. This means there's no hope of fitting both of them in. This will cause the st blit transfer path to fall back and allow max-texture-size to pass. Fixes piglit max-texture-size Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14347>	2021-12-30 14:00:45 +10:00
Dave Airlie	d8a38edc48	crocus: fail resource allocation properly. Older gens have a limit of 2GB on surfaces, this results in isl_surf_init_s failing if the surface exceeds that, in this case this should fail all the way back up the stack. This fixes some cases of max-texture-size on crocus Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14347>	2021-12-30 14:00:41 +10:00
Dave Airlie	a2293e33fd	intel/genxml/gen4-5: fix more Raster Operation in BLT to be a uint This has been partly fixed twice before, but looks like some got missed. Fixes arb_copy_image on gen4 with crocus Fixes: `de625dddee` ("intel/genxml: fix raster operation field in blt genxml") Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14345>	2021-12-30 11:40:33 +10:00
Emma Anholt	c2217acaa5	ci/i915g: Add a couple more recent regressions. The new frontfacing one is just different UB for the same lack of frontfacing support. And for RGB10_A2, we have another fail in that area already as well. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14300>	2021-12-29 16:59:56 -08:00
Emma Anholt	177bf24569	ci: Enable reporting to the flakes IRC channel for i915g and crocus. Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14300>	2021-12-29 16:59:53 -08:00
Emma Anholt	d1a59c4077	ci/crocus: Add manual CI for the new HSW box I have at home. Also extend the set of traces being tested and add more notes about them, as well as add some g41 flakes since I did more runs. Acked-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14300>	2021-12-29 16:59:42 -08:00
Eric Engestrom	e573f54473	docs: update calendar and link releases notes for 21.3.3 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14341>	2021-12-29 21:51:22 +00:00
Eric Engestrom	e4135c265d	docs: add release notes for 21.3.3 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14341>	2021-12-29 21:51:05 +00:00
Filip Gawin	2ddfb9c256	r300: fix handling swizzle in transform_source_conflicts these tests are now passing: dEQP-GLES2.functional.shaders.operator.exponential.pow.highp_vec2_vertex,Fail dEQP-GLES2.functional.shaders.operator.exponential.pow.highp_vec3_vertex,Fail dEQP-GLES2.functional.shaders.operator.exponential.pow.highp_vec4_vertex,Fail dEQP-GLES2.functional.shaders.operator.exponential.pow.mediump_vec2_vertex,Fail dEQP-GLES2.functional.shaders.operator.exponential.pow.mediump_vec3_vertex,Fail dEQP-GLES2.functional.shaders.operator.exponential.pow.mediump_vec4_vertex,Fail Fixes: `1c2c4ddbd1` ("r300g: copy the compiler from r300c") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14282>	2021-12-29 18:27:39 +00:00
Pavel Ondračka	6f7421f50d	r300: Replace RADEON_NO_TCL with RADEON_DEBUG=notcl The old option was broken with shader cache. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14333>	2021-12-29 18:11:59 +00:00
Pavel Ondračka	44f134bc21	r300: Document the RADEON_DEBUG options Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Pavel Ondračka <pavel.ondracka@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14333>	2021-12-29 18:11:59 +00:00
Samuel Pitoiset	b249e700ac	radv: print number of levels with RADV_DEBUG=img Might help debugging image related issues like DCC or HTILE. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14336>	2021-12-29 15:24:42 +00:00
Samuel Pitoiset	4942e10890	radv: stop checking buffer size in vkCreateBuffer() This was added to fix dEQP-VK.api.buffer.basic.size_max_uint64 but with VK_KHR_maintenance4 this is now considered invalid usage. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14331>	2021-12-29 15:03:28 +00:00
Daniel Stone	05818bf7a4	Revert "gitlab-ci: disable radv-fossils" The fossils-db repository is fixed now. This reverts commit `7b8ea33848`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14337>	2021-12-29 13:48:03 +00:00
Eleni Maria Stea	0357a2e012	dri_drawable: missing header dri_util.h must be included in dri_drawable.h for __DRI* types and driDrawPriv to be defined. Signed-off-by: Eleni Maria Stea <elene.mst@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14334>	2021-12-29 13:55:40 +02:00
Hamish Arblaster	cafca76fa2	zink: Fix building on macOS This fixes building on macOS: Disable ZINK_USE_DMABUF on macOS, it is unsupported. Import vk_mvk_moltenvk.h for MVK_VERSION in zink_resource.c. Add additional build arguments (see meson.build) to build properly. To build on macOS, you will probably need to run: brew install bison llvm cmake libxext xquartz ninja xorgproto meson And you also need to setup meson with something like: -Dmoltenvk-dir=/Users/<Username>/VulkanSDK/<VersionNumber>/MoltenVK -Dc_std=c11 Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14255>	2021-12-29 03:21:35 +00:00
Lionel Landwerlin	eca7b24e74	intel/devinfo: adjust subslice array size Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14324>	2021-12-28 14:22:53 -08:00
Nanley Chery	cd182a31c3	iris: Use util packing fns in convert_clear_color A couple things happen as a result of this: 1. This function is more consistent. It aims to clamp clear color ranges to format-dependent values, but it didn't clamp the upper ranges of 10-bit and 11-bit floats (64512 and 65024 respectively). Those are handled now. 2. The clear colors are quantized. The quantization method seems to differ from what the HW uses for slow (and obviously fast) clears. Due to this change in quantization method, iris starts failing dEQP-EGL.functional.image.modify.renderbuffer_rgba4_renderbuffer_clear_color. That's okay however. That test was removed from the mustpass list because of threshold issues (see egl-test-issues.txt). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/7740>	2021-12-28 17:49:11 +00:00
Samuel Pitoiset	34151f9be9	radv: fix clears with value of "1" and different DCC signedness For example, if the driver clears a view of SINT with 127 and the image format is UNORM, the hw fills "1" instead of the clear value (127.0f / 255.0f). This CB feature is GFX9+ only. This should fix test_typed_srv_cast_clear from vkd3d-proton once it correctly sets the MUTABLE flag (which is still buggy as of c0a3fa8a). Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5676 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14228>	2021-12-28 16:35:51 +00:00
Georg Lehmann	554e1c2000	radv: Increase maxFragmentCombinedOutputResources. This also includes storage images and storage buffers, not just color attachments. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14311>	2021-12-28 16:15:26 +00:00
Bas Nieuwenhuizen	50658ed8da	radv/amdgpu: Use VkResult for wait_timeline_syncobj. So we can actually return errors. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14165>	2021-12-28 15:55:18 +00:00
Bas Nieuwenhuizen	20b51cdabe	radv: Skip wait timeline ioctl with 0 handles. Fixes: `55d8022878` "radv: Add winsys functions for timeline syncobj." Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14165>	2021-12-28 15:55:18 +00:00
Bas Nieuwenhuizen	afff9dd0f0	radv: Use correct buffer size for query pool result copies. 1. the dst stride may be too small if count=1. 2. the src stride may be too small due to the availability bit. So lets just compute the size needed explicitly and use it. Fixes: `90a0556c` ("radv: use pool stride when copying single query results") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14242>	2021-12-28 15:35:51 +00:00
Samuel Pitoiset	b775aaff1e	radv: re-apply "Do not access set layout during vkCmdBindDescriptorSets." Uplay needs this to avoid a crash because it does an use-after-free of a descriptor set layout. This was initially introduced by Bas to workaround a similar issue with Baldur's Gate 3, it seems needed again. Cc: 21.3 mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5789 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14318>	2021-12-28 15:56:55 +01:00
Samuel Pitoiset	01f1bd4dfd	radv: re-enable fast clears for images that support comp-to-single This optimized path was disabled, oops. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14176>	2021-12-28 14:24:40 +00:00
Alyssa Rosenzweig	311077c483	panfrost: Make pan_merge macro more robust Consider the following innocuous-looking code: pan_merge(packed, vtx->attributes[i], ATTRIBUTE); Under the current implementation, this code is completely broken. Why? The current implemention is a macro which hardcodes the loop index i, which shadows the i used to index attributes. Pull out a helper method so we do the right thing without resulting to further preprocessor abuse (__COUNTER__). While making things more robust, assert the crucial pan_merge invariant that the total size is a multiple of 4; if this fails, the routine risks memory corruption. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14119>	2021-12-28 13:55:21 +00:00
Jason2013	36f8f9b882	Fix typo Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14169>	2021-12-28 13:34:04 +00:00
Samuel Pitoiset	7b7debe8f9	radv: fix restoring subpass during hw/fs color resolves This fixes an stack-use-after-scope detect by ASAN because the subpass is used after the loop by radv_mark_noncoherent_rb(). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14317>	2021-12-28 12:53:12 +00:00
Samuel Pitoiset	030daf80b5	radv/winsys: remove radv_amdgpu_winsys_bo::is_shared This has never been used. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14287>	2021-12-28 12:51:29 +01:00
Samuel Pitoiset	847048fa24	radv/winsys: stop zeroing few structs in buffer_from_fd() Errors are correctly handled here. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14287>	2021-12-28 12:51:27 +01:00
Samuel Pitoiset	3368e522b4	radv: remove unnecessary NULL checks in vkMapMemory()/vkUnmapMemory() It's required to have a valid device memory handle and it would make no sense to call these functions with a NULL pointer. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14287>	2021-12-28 12:51:25 +01:00
Thomas H.P. Andersen	ff7aee2ac9	tu/clear_blit: use \|\| when working with bools Fixes a warning with clang Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14315>	2021-12-28 03:13:38 +00:00
Jesse Natalie	5c69c44a99	d3d12: Avoid a debug warning trying to unmap a not-mapped resource Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:25 +00:00
Jesse Natalie	0c5fde39e4	d3d12: Set SSBO support caps Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:25 +00:00
Jesse Natalie	416a807854	d3d12: Use DXIL load/store lowering pass Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Jesse Natalie	32375789e5	d3d12: Support setting SSBOs on the context and turning them into descriptors Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Jesse Natalie	49cf325d82	d3d12: Always create buffers as UAV-capable Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Jesse Natalie	de4c38c3a7	d3d12: Support SSBOs in root signatures Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Jesse Natalie	c375b05bfe	microsoft/compiler: Handle write masks in SSBO lowering pass Previously, the lowering was for 8/16/64-bit values, or 8/16-component vectors. Now, it also handles write masks on 32-bit 1/2/3/4-component vectors. DXIL looks like it supports putting an interesting write mask in the buffer store intrinsic, but DXC never generates stores with write masks, and multiple drivers completely ignore the write mask. Also, set the write mask properly on the output intrinsic. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Jesse Natalie	efc47571d4	microsoft/compiler: Hook up uavs-at-every-stage flag Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Jesse Natalie	72b0d0cda0	microsoft/compiler: Emit SSBOs from 0 -> count for GL (non-kernel, non-Vulkan) shaders Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14294>	2021-12-27 23:40:24 +00:00
Lionel Landwerlin	7b8ea33848	gitlab-ci: disable radv-fossils For some reason CI is unable to pull a git repo needed to run this. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14312>	2021-12-27 15:13:40 -08:00
Jesse Natalie	01821a2601	CI: Trigger Windows build on softpipe changes Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14312>	2021-12-27 12:26:02 -08:00
Jesse Natalie	d87a089579	softpipe: Add a dummy field to sp_fragment_shader_variant_key MSVC doesn't support 0-size structs in C. Fixes: `0b7a0d1a` ("softpipe: Use the draw module's poly stipple handling, like llvmpipe.") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14312>	2021-12-27 12:25:57 -08:00
Emma Anholt	79af19ab9e	softpipe: Drop duplicate decl of softpipe_find_fs_variant Reviewed-by: Zoltán Böszőrményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13731>	2021-12-27 09:57:57 -08:00
Emma Anholt	0b7a0d1a49	softpipe: Use the draw module's poly stipple handling, like llvmpipe. softpipe was using the draw helper module as a testbed for the draw helper module long ago, but we can just use the finished product. Reviewed-by: Zoltán Böszőrményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13731>	2021-12-27 09:57:53 -08:00
Emma Anholt	764d367a62	softpipe: Drop the quad pstipple stage. It's unused, and it doesn't have the information it needs ("what is the prim type after poly fill mode but without considering wide point/line-to-triangle conversion) to stipple correctly. Reviewed-by: Zoltán Böszőrményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13731>	2021-12-27 09:57:46 -08:00
Vinson Lee	222487fabe	radv: Fix memory leak on error path. Fix defects reported by Coverity Scan. Resource leak (RESOURCE_LEAK) leaked_storage: Variable signal_semaphore_infos going out of scope leaks the storage it points to leaked_storage: Variable wait_semaphore_infos going out of scope leaks the storage it points to. Fixes: `3da7d10d9b` ("radv: implement vkQueueSubmit2KHR()") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14260>	2021-12-24 00:15:53 -08:00
Dave Airlie	d2148af2ca	mesa/st: remove conditionals for driver state bits that are always set. Just removes some conditional checks that never work out now. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:57 +00:00
Dave Airlie	86a7a36164	mesa/st: drop multisample mask/locations state drivers bits Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:57 +00:00
Dave Airlie	ddade693d0	mesa/st: drop new framebuffer srgb driver state bit Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:57 +00:00
Dave Airlie	cbaf072971	mesa/st: drop clip plane driver state bits Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:57 +00:00
Dave Airlie	33991f0743	mesa/st: drop scissor/window rect driver state bits Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	42c7570eed	mesa/st: drop ssbo, image and sampler driver state flags bits Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	75255a1d06	mesa: drop unused transform feedback state driver flags Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	6e15cc69ec	mesa/st: drop new uniform driver state bit Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	f35d22b2ee	mesa/st: drop new tess state driver bit Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	562f01fbc7	mesa/st: drop poly stipple driver state bit Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	784ced98f0	mesa/st: drop new depth/stencil state bits Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	14e1f9cb98	mesa/st: drop NewBlend driver state flags Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	4f0316613f	mesa/st: remove the viewport driver state flags Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	c5af853cb9	mesa/st: drop the rasterizer driver flags Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	b2175609ba	mesa/st: drop the new array driver state bit Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	df4f0672d3	mesa/st: merge NewDepthClamp state flag Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	535a9d4203	mesa: drop optional tex/tnl maintains mode. These are always going to be on with gallium v2: drop call, tidy up switch (kwg) Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	1adff0d0db	mesa/st: move default enabled extensions into mesa. This just moves a bunch of true assignments into the core Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	87cc3ee964	mesa/draw: drop the multi draw with indices fallback. Gallium drivers don't need this. v2: drop some more code Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Dave Airlie	d17f45df1a	mesa: remove StripTextureBorder option. Always make this true. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14261>	2021-12-23 19:06:56 +00:00
Thomas H.P. Andersen	424941f0e4	ci: debian-clang: build more drivers Add gallium drivers: i915 + asahi Add vulkan drivers: swrast + panfrost These can now compile with the current no-error list Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14289>	2021-12-23 17:08:29 +01:00
Thomas H.P. Andersen	08f7d37fb9	panvk: cast negative value to unint8_t The index is a uint8_t but can be assigned a negative 1 value in panvk_pipeline_builder_parse_color_blend() The comparison to ~0 thus makes sense but clang will complain: "result of comparison of constant -1 with expression of type 'const uint8_t' (aka 'const unsigned char') is always true [-Wtautological-constant-out-of-range-compare]" Fix this by casting to a uint8_t before comparison. Fixes a warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14289>	2021-12-23 17:08:24 +01:00
Thomas H.P. Andersen	cea1df7d34	panvk: use FALLTHROUGH to stop a warning Fixes a warning with clang Reviewed-by: Alyssa Rosenzweig <alyssa.rosenzweig@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14289>	2021-12-23 17:08:01 +01:00
Thomas H.P. Andersen	bc19893f5d	i915g: avoid left shifting a negative number Fixes a warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14289>	2021-12-23 16:22:53 +01:00
Thomas H.P. Andersen	107c63aee8	lavapipe: fix string-plus-int warning Fixes a warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14289>	2021-12-23 16:22:53 +01:00
Alyssa Rosenzweig	944c8907ba	pan/bi: Don't call useless NIR passes Cargo culted from the Midgard compiler. nir_move_vec_src_uses_to_dest is intended for vec4 backends, which does not apply to Bifrost. nir_lower_locals_to_regs runs much earlier in the compiler and is a no-op here. total instructions in shared programs: 107252 -> 107242 (<.01%) instructions in affected programs: 2403 -> 2393 (-0.42%) helped: 10 HURT: 0 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.33% max: 0.57% x̄: 0.43% x̃: 0.42% 95% mean confidence interval for instructions value: -1.00 -1.00 95% mean confidence interval for instructions %-change: -0.49% -0.37% Instructions are helped. total tuples in shared programs: 89664 -> 89664 (0.00%) tuples in affected programs: 333 -> 333 (0.00%) helped: 1 HURT: 1 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.52% max: 0.52% x̄: 0.52% x̃: 0.52% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.70% max: 0.70% x̄: 0.70% x̃: 0.70% total cycles in shared programs: 8103.88 -> 8103.79 (<.01%) cycles in affected programs: 29.42 -> 29.33 (-0.28%) helped: 3 HURT: 1 helped stats (abs) min: 0.041665999999999315 max: 0.04166700000000034 x̄: 0.04 x̃: 0 helped stats (rel) min: 0.49% max: 0.55% x̄: 0.53% x̃: 0.54% HURT stats (abs) min: 0.04166700000000034 max: 0.04166700000000034 x̄: 0.04 x̃: 0 HURT stats (rel) min: 0.74% max: 0.74% x̄: 0.74% x̃: 0.74% 95% mean confidence interval for cycles value: -0.09 0.05 95% mean confidence interval for cycles %-change: -1.22% 0.80% Inconclusive result (value mean confidence interval includes 0). total arith in shared programs: 3376.42 -> 3376.33 (<.01%) arith in affected programs: 29.42 -> 29.33 (-0.28%) helped: 3 HURT: 1 helped stats (abs) min: 0.041665999999999315 max: 0.04166700000000034 x̄: 0.04 x̃: 0 helped stats (rel) min: 0.49% max: 0.55% x̄: 0.53% x̃: 0.54% HURT stats (abs) min: 0.04166700000000034 max: 0.04166700000000034 x̄: 0.04 x̃: 0 HURT stats (rel) min: 0.74% max: 0.74% x̄: 0.74% x̃: 0.74% 95% mean confidence interval for arith value: -0.09 0.05 95% mean confidence interval for arith %-change: -1.22% 0.80% Inconclusive result (value mean confidence interval includes 0). total quadwords in shared programs: 79681 -> 79681 (0.00%) quadwords in affected programs: 283 -> 283 (0.00%) helped: 1 HURT: 1 helped stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 helped stats (rel) min: 0.62% max: 0.62% x̄: 0.62% x̃: 0.62% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.82% max: 0.82% x̄: 0.82% x̃: 0.82% total threads in shared programs: 2226 -> 2227 (0.04%) threads in affected programs: 1 -> 2 (100.00%) helped: 1 HURT: 0 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14293>	2021-12-23 14:01:48 +00:00
Emma Anholt	f0f6aec545	glcpp: Disable the valgrind tests. We have the glcpp unit tests covered with ASan and MSan in CI, no need to make everyone doing a "meson test" suffer through valgrind slowly churning through these. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14236>	2021-12-23 09:10:11 +00:00
Emma Anholt	059b71b58d	ci: Enable a build with MSan. This will catch uninitialized data usage (such as `37855fd59d` ("glcpp: Fully initialize struct gl_context")) much faster than valgrind does. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14236>	2021-12-23 09:10:11 +00:00
Vinson Lee	6e6e16b317	isaspec: Sort field names to generate deterministic output. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14259>	2021-12-22 23:08:12 -08:00
Thomas H.P. Andersen	e0ec818cfd	microsoft/compiler: dxil_nir_opt_alu_deref_srcs: return progress dxil_nir_opt_alu_deref_srcs will always return false because the progress variable is declared both for the function and also inside the loop. Spotted by a unused-but-set-variable warning from clang Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14290>	2021-12-23 01:52:13 +00:00
Alyssa Rosenzweig	f5161f6cec	pan/va: Generalize LD_VAR_IMM_* to support flat varyings Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14292>	2021-12-22 22:01:20 +00:00
Alyssa Rosenzweig	8df1b42eb0	pan/va: Add .signed bit to right shift instructions This makes the RSHIFT_* family of instructions act like ARSHIFT.* on Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14292>	2021-12-22 22:01:20 +00:00
Alyssa Rosenzweig	73d7a046ff	pan/va: Rename LEA_ATTR to LEA_VARY This more accurately reflects the function of the instruction. Unlike earlier Malis, there don't seem to be attribute records for varyings. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14292>	2021-12-22 22:01:20 +00:00
Alyssa Rosenzweig	1e5a20a33b	pan/va: Remove extra LD_VAR_IMM_F32 source Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14292>	2021-12-22 22:01:20 +00:00
Dave Airlie	4392c24844	intel/compiler: drop unused decleration Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14202>	2021-12-22 21:37:55 +00:00
Dave Airlie	2692a5f8db	intel/compiler: don't lower swizzles in backend. These are lowered by crocus in the frontend, the key entries are still used. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14202>	2021-12-22 21:37:55 +00:00
Dave Airlie	e12b0d0d60	intel/compiler: remove gfx6 gather wa from backend. Crocus lowers this in the frontend, they key member is still used but reset prior to backend. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14202>	2021-12-22 21:37:55 +00:00
Dave Airlie	ebaba7a2fd	mesa/dd: drop unused InvalidateBufferSubData entry. I already removed the users of this, but forgot the entry. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14286>	2021-12-22 20:59:23 +00:00
Dave Airlie	57a730f4ad	mesa: drop unused _mesa_new_program. This isn't used since classic removal. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14286>	2021-12-22 20:59:23 +00:00
Dave Airlie	3c9bbe2fa1	mesa: drop unused new renderbuffer code. This isn't used anywhere currently Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14286>	2021-12-22 20:59:23 +00:00
Dave Airlie	be277ace89	mesa/st: use has_stencil_export instead of querying screen cap. We already query at context setup, just use that value. Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14286>	2021-12-22 20:59:23 +00:00
Dave Airlie	37190ad932	mesa: drop texformat code this isn't used. v2: dropped comment (Tapani) Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14286>	2021-12-22 20:59:23 +00:00
Timur Kristóf	739bf4d0be	spirv: Allow VRS with mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14291>	2021-12-22 21:29:22 +01:00
Thomas H.P. Andersen	c8c00fc6c3	draw: drop unused function Introduced in `381e9fe6`. Never used. Fixes a compile warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	a7c5645dd3	gallium/tgsi_exec: drop unused function Introduced in `9ca6cf0f` and last usage dropped in `31369987` Fixes a compile warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	887b838632	gallium/u_threaded: drop unused function tc_drop_sampler_view_reference is unused. It was introduced in `340703e0` and its last usage was dropped in `bb89cf4bf3` Fixes a compile warning with clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	2a5b867594	glx: remove a set but not used variable total_sent was never used. It was introduced in `fdb07636f2` Fixes a warning in clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	46a0f6384e	r600: remove a set but not used variable grid_size was never used. Not even when introduced in `6a829a1b72` Fixes a warning on clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	ea33eceb32	r300: remove a set but not used variable The use of phase_refmask was removed 12 years ago in `b7cf887ca7` Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	202ebf9969	i915g: fix implicit-fallthrough warning Fixes a warning on clang. Uses FALLTHROUGH like the surrounding code. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Thomas H.P. Andersen	57f6ef69b9	lavapipe: fix implicit-fallthrough warning Fixes a warning on clang Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14252>	2021-12-22 17:02:00 +00:00
Marcin Ślusarz	a48f1d51e2	intel/compiler: disable workaround not applicable to gfx >= 11 There's nothing in bspec that would suggest this is still needed. It only affected gfx 9 and 10. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14280>	2021-12-22 10:13:25 +00:00
Guido Günther	7440fbd596	etnaviv: Use mesa_log* Makes it consistent with the DRM bits Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10546>	2021-12-22 09:16:07 +01:00
Guido Günther	01bb981d57	entaviv/drm: Use same log format as gallium bits We prefix with __func__:__LINE__ there. Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10546>	2021-12-22 09:16:04 +01:00
Guido Günther	b5eacfc469	etnaviv/drm: Use mesa_log* for debugging This makes sure errors, warnings and info messages don't get compiled out in non debug builds. Signed-off-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10546>	2021-12-22 09:16:00 +01:00
Guido Günther	afbccdf8f9	etnaviv/drm: Print gpu model at debug verbosity Otherwise we print it at every application start. Signed-off-by: Guido Günther <guido.gunther@puri.sm> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10546>	2021-12-22 09:15:57 +01:00
Guido Günther	a93c0e1860	etnaviv/drm: Add some bo debug output This makes it simpler to trace BO usage. Signed-off-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10546>	2021-12-22 09:15:53 +01:00
Guido Günther	87879b0633	etnaviv/drm: Use etna_mesa_debug for debugging messages We use the variable from gallium but fall back to a weak symbol in case there's usage out of galllium in the future. Signed-off-by: Guido Günther <agx@sigxcpu.org> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Acked-by: Lucas Stach <l.stach@pengutronix.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10546>	2021-12-22 09:15:02 +01:00
Tapani Pälli	ebd1f202ae	glsl: fix invariant qualifer usage and matching rule for GLSL 4.20 I noticed that GLSL version referenced here was wrong, version 4.20 is first spec that does not allow invariant keyword for inputs. v2: fix all comments (Timothy Arceri) Fixes: `f9f462936a` ("glsl: Fix invariant matching in GLSL 4.30 and GLSL ES 1.00.") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14241>	2021-12-22 06:01:37 +00:00
Thomas H.P. Andersen	0bc5e8cddc	ci: debian-clang: drop -Wno-error for self-assign Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14272>	2021-12-22 03:34:23 +00:00
Thomas H.P. Andersen	f1dfc6a780	gallivm: avoid a self-assign warning Fixes a clang warning Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14272>	2021-12-22 03:34:23 +00:00
Vinson Lee	9f8a204645	panfrost: Avoid double unlock. Fix defect reported by Coverity Scan. Double unlock (LOCK) double_unlock: pthread_mutex_unlock unlocks dev->indirect_draw_shaders.lock while it is unlocked. Fixes: `2e6d94c198` ("panfrost: Add helpers to support indirect draws") Suggested-by: Alyssa Rosenzweig <alyssa@collabora.com> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14262>	2021-12-22 02:01:57 +00:00
Vinson Lee	1d6f6f9102	ir3: Make shift operand 64-bit. Fix defect reported by Coverity Scan. Unintentional integer overflow (OVERFLOW_BEFORE_WIDEN) overflow_before_widen: Potentially overflowing expression 2 << W with type int (32 bits, signed) is evaluated using 32-bit arithmetic, and then used in a context that expects an expression of type uint64_t (64 bits, unsigned). Signed-off-by: Vinson Lee <vlee@freedesktop.org> Acked-by: Rob Clark <robclark@freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14258>	2021-12-22 01:19:46 +00:00
Timur Kristóf	b293299776	aco/optimizer_postRA: Fix applying VCC to branches. Fixes: `a93092d0ed` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14281>	2021-12-21 22:53:23 +00:00
Timur Kristóf	ce4daa259c	aco/optimizer_postRA: Fix combining DPP into VALU. Fixes: `4ac47ad1cd` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14281>	2021-12-21 22:53:23 +00:00
Thomas H.P. Andersen	14a204b19d	ci: clean up debian-clang no-error list I see no warnings for these on a local build Reviewed-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14269>	2021-12-21 22:24:16 +00:00
Caio Oliveira	ac90519e35	anv: Simplify assertions related to graphics stages In all three cases, COMPUTE was on the table but with an invalid value (zero). Drop it from the tables and the extra assertion, so if a COMPUTE is passed it will just fail the ARRAY_SIZE assertion. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14274>	2021-12-21 18:25:05 +00:00
Daniel Schürmann	d36a43598c	aco/ra: fix get_reg_for_operand() in case of stride mismatches We have to clear the register file from the previous operand as otherwise, there might be no space left. Totals from 5 (0.00% of 134572) affected shaders: (GFX10.3) CodeSize: 21144 -> 21000 (-0.68%); split: -0.72%, +0.04% Instrs: 3738 -> 3720 (-0.48%); split: -0.51%, +0.03% Latency: 517229 -> 516319 (-0.18%); split: -0.18%, +0.00% InvThroughput: 49068 -> 48902 (-0.34%); split: -0.38%, +0.04% Copies: 501 -> 483 (-3.59%); split: -3.79%, +0.20% Cc: mesa-stable Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14279>	2021-12-21 17:15:45 +00:00
Jesse Natalie	664810ebb0	d3d12: Fix NV12 resource importing Fixes: `a6db8054` ("d3d12: Handle opening planar resources") Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14276>	2021-12-21 16:56:45 +00:00
Daniel Schürmann	17ecd0b31a	nir/opt_algebraic: lower fneg_hi/lo to fmul This pattern, found in the FSR upscaling shader, helps the vectorization efforts by keeping the chain of vectorized instructions intact. Radeon can optimize it to per-component fneg modifiers. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13688>	2021-12-21 13:23:37 +01:00
Daniel Schürmann	30a7199e37	aco/optimizer: propagate and fold inline constants on VOP3P instructions This patch aims to propagate and fold constants on VOP3P instructions by using omod selection and the fneg modifier. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13688>	2021-12-21 13:23:36 +01:00
Daniel Schürmann	62bcfcd0a8	aco: change fneg for VOP3P to use fmul with +1.0 This will be useful to be able to also apply fneg_lo and fneg_hi. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13688>	2021-12-21 13:23:36 +01:00
Daniel Schürmann	193bd740ab	aco/optimizer: fix fneg modifier propagation on VOP3P Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13688>	2021-12-21 13:23:36 +01:00
Caio Oliveira	de916d827f	anv: Refactor dirty masking in cmd_buffer_flush_state Instead of masking the dirty variable itself, use an appropriate mask in the users of dirty. This will avoid extra tracking when dealing with Task/Mesh later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14275>	2021-12-21 11:07:31 +00:00
Caio Oliveira	37fca614b8	anv/blorp: Split blorp_exec into a render and compute And set the relevant push_constants_dirty for each case. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14275>	2021-12-21 11:07:31 +00:00
Roman Stratiienko	2686c5419d	v3dv: add Android support Acknowledgements to android-rpi team and lineage-rpi maintainer (KonstaT) for creating/testing initial vulkan support. Their experience was used as a baseline for this work. Most of the code is a copy of turnip and anv. Improved by cleaning dEQP failures: - Improved gralloc support (use allocation time stride, size, modifier). - Fixed some dEQP crashes due to memory allocation issues. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14016>	2021-12-21 09:24:43 +00:00
Emma Anholt	658b2ca467	r300/vs: Fix flow control processing just after an endloop. We tried to step over the instruction we just generated, except we didn't always just generate one. In the sequence_vertex tests, that meant we skipped processing the next BGNLOOP and then underflowed our stack. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14271>	2021-12-21 01:05:07 +00:00
Emma Anholt	b48852436e	r300/vs: Reuse rc_match_bgnloop(). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14271>	2021-12-21 01:05:07 +00:00
Emma Anholt	e41a53cd19	r300/vs: Allocate temps we see a use as a source, too. This is a quick hack for a bunch of the fail in #5766. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14271>	2021-12-21 01:05:07 +00:00
Emma Anholt	e3d0ceccc9	ci/r300: Add another xfail on the main branch. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14271>	2021-12-21 01:05:07 +00:00
Francisco Jerez	e7470a40c5	intel/fs: Add physical fall-through CFG edge for unconditional BREAK instruction. This adds a missing CFG edge that represents a possible physical control flow path the EU might take under some conditions which isn't part of the logical CFG of the program. This possibility shouldn't have led to problems on platforms prior to Gfx12, since the missing control flow edge cannot possibly influence liveness intervals. However on Gfx12+ it becomes the compiler's responsibility to resolve data dependencies across instructions, and the missing physical control flow paths may lead to a WaR data hazard currently not visible to the software scoreboard pass, which could lead to data corruption. Worse, the possibility for this path to be taken by the EU increases on Gfx12+ due to a hardware bug affecting EU fusion -- However the same physical path can be potentially taken on earlier platforms as well, so this patch extends the CFG on all platforms for consistency, even though the lack of this edge shouldn't lead to any functional issues on platforms earlier than Gfx12. There are no shader-db changes on earlier platforms, so there seems to be no disadvantage from using the same CFG representation as on later platforms. This issue has ben reported on TGL with the following conformance test, thanks to Ian for bringing the FULSIM dependency check warning to my attention: dEQP-VK.graphicsfuzz.spv-stable-pillars-volatile-nontemporal-store Fixes: `4d1959e693` ("intel/cfg: Represent divergent control flow paths caused by non-uniform loop execution.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4940 Reported-by: Tapani Pälli <tapani.palli@intel.com> Reported-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14248>	2021-12-21 00:43:29 +00:00
Emma Anholt	f568d80986	glsl: Retire unused modes for lower_64bit_integer_instructions. Unused since `424ac809bf` ("i965: Do int64 lowering in NIR") Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com>. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14249>	2021-12-20 14:56:35 -08:00
Emma Anholt	97242b39f9	glsl: Remove comment about non-existing DFREXP_TO_ARITH Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com>. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14249>	2021-12-20 14:56:35 -08:00
Emma Anholt	b82b3a327e	glsl: Remove dead prototype for old do_discard_simplification(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com>. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14249>	2021-12-20 14:47:57 -08:00
Emma Anholt	6db1f93699	glsl: Delete the optimize_redundant_jumps pass. Nothing here that NIR doesn't do. No effect on shader-db of hsw or softpipe. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com>. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14249>	2021-12-20 14:47:57 -08:00
Emma Anholt	c2ead6c9b5	glsl: Delete the vectorization opt pass. Nothing uses it, and i965 was the last thing to. Even if I enable it for softpipe or crocus, it quickly causes NIR validation failures in shader-db from swizzles outside the bounds of vectors. Retire it in favor of nir_opt_vectorize(). Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com>. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14249>	2021-12-20 14:47:57 -08:00
Rob Clark	8a21b2fda0	freedreno/ir3: Dump const state with shader disasm Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Rob Clark	9766a5721d	freedreno/computerator: Mark shader bo for dumping Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Rob Clark	d1edc6d9a1	freedreno/computerator: Fix @buf header Order is important in the grammar, the more specific match needs to go first. Fixes: `ba1c989348` ("freedreno/computerator: pass iova of buffer to const register") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Rob Clark	78c53f4888	freedreno/ir3: Handle instr->address when cloning Without this, a cloned instruction that takes full regs will trigger an ir3_validate assert. This can happen, for ex, if an instruction that writes p0.x and has a relative src gets cloned in ir3_sched. Fixes an assert in Genshin Impact with a debug build. Fixes: `9af795d9b9` ("ir3: Make ir3_instruction::address a normal register") Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14231>	2021-12-20 19:47:35 +00:00
Alyssa Rosenzweig	1d21de788d	pan/bi: Specialize shaders for IDVS We need to compile multiple variants and report them together in a common shader info. To do so, we split off per-variant shader infos and combine at the end. glmark2 is very happy: https://people.collabora.com/~alyssa/idvs-g52.txt Highlights include -bshading up 41% fps and -bbump:bump-render=high-poly up 62% faster Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	c59977c3bf	pan/bi: Add helper to decide if IDVS should be used Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	a211d2b4e4	pan/bi: Use position shader ST_CVT path We need to use a preload instead of the LEA_ATTR. Not sure why. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	fe8ec31114	pan/bi: Split out varying store paths This means we don't need to special case IDVS quite so hard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	e0771d5832	pan/bi: Remove the "wrong" stores in IDVS variants Position shaders should only write gl_Position (and gl_PointSize on Valhall), varying shaders should only write varyings. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	fba5936fdb	pan/bi: Add IDVS mode to bi_context Various parts of the compiler switch behaviour based on IDVS variant. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	c7fae2c896	pan/bi: Allow UBO pushing to run multiple times For IDVS. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	e2f7871bcf	pan/bi: Extract bi_finalize_nir Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	62d46c7ee6	panfrost: Add panfrost_compile_inputs->no_idvs option panvk will want IDVS support eventually, but not right now. Allow the driver to opt out of IDVS in the mean time. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	2ff3c4a636	panfrost: Align instance size for IDVS Hardware requirement. Failing to do this raises a DATA_INVALID_FAULT. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	79356b2e5f	panfrost: Skip rasterizer discard draws without side effects Minor optimization, but more importantly fixes an interaction of IDVS with rasterizer discard. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	f5412409db	panfrost: Extract panfrost_batch_skip_rasterization Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	3a49f4798c	panfrost: Emit IDVS jobs When trying to draw with an IDVS capable shader. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	e5b0c514d8	panfrost: Extract panfrost_draw_emit_vertex_section To be shared with IDVS. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	4d8d987f1a	panfrost: Set secondary_* fields for IDVS Easy now that we've split everything out nicely. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	29f63c6283	panfrost: Remove regalloc from v6.xml These fields were not introduced until v7, fix that. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	83356c58f8	panfrost: Split out regalloc/preload helpers The logic gets duplicated if IDVS is in use. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	9e65ebb67a	panfrost: Add IDVS fields to shader_info This lets the compiler decide if IDVS should be used. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	dc4fe86a01	panfrost: Treat IDVS jobs as tiler for scoreboarding These need to be chained and need to provoke a fragment job when we're done. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:41 +00:00
Alyssa Rosenzweig	8dc1936faa	panfrost: Fix Secondary Shader field Off-by-one on the start. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reported-by: Icecream95 <ixn@disroot.org> Fixes: `73e80994d5` ("panfrost: Add secondary shader XML fields") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Alyssa Rosenzweig	b27bbbe0c9	panfrost: Remove unused shader info bits These were only used to infer preloading and can be deleted. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Alyssa Rosenzweig	7358edad42	panfrost: Set preload descriptor more accurately Preload exactly what the shader needs, based on the compiler's mask of uninitialized registers, rather than trying to sync pan_shader.h with the behaviour of code gen. Would've saved me some debugging over the years... As a bonus this avoids preloading unnecessary registers, particularly in compute shaders. In theory this should reduce power consumption. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Alyssa Rosenzweig	52fe998aa6	panfrost: Track preloaded registers We already collect this information. We may as well make use of it. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Alyssa Rosenzweig	e8566f7529	pan/indirect_draw: Support IDVS jobs Handle as tiler jobs with an extra vertex DCD at the end. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Alyssa Rosenzweig	37ab248c77	pan/indirect_draw: Split out update_dcd This is common between vertex/tiler jobs and needs to be duplicated for IDVS jobs. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Alyssa Rosenzweig	d696183d4d	pan/indirect_draw: Don't upload garbage UBO There should never be a CPU pointer in GPU memory, let's say that... Fixes: `2e6d94c198` ("panfrost: Add helpers to support indirect draws") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14154>	2021-12-20 18:21:40 +00:00
Rafael Antognolli	e9b509755b	intel: Emit 3DSTATE_BINDING_TABLE_POOL_ALLOC for XeHP On XeHP+, Binding Table Pointers are an offset relative to the Surface State Base Address anymore. Instead, they are relative to the State Binding Table Pool Address, which is set by the command above. We emit that command (pointing to the same address as the Surface State Base Addresss), and everything should stay working as before. Reworks: * Jordan: Add iris * Jordan: Drop i965 * Ken: Set MOCS to avoid a major perf impact. (Found by Felix DeGrood.) * Jordan: Shrink size from 2MiB to actual iris, anv usage * Lionel: Add BINDING_TABLE_POOL_BLOCK_SIZE Ref: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4995 Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> [jordan.l.justen@intel.com: Add Iris, adjust sizes] Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13992>	2021-12-20 17:58:13 +00:00
Jordan Justen	e6fc231184	anv: Add BINDING_TABLE_POOL_BLOCK_SIZE Suggested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13992>	2021-12-20 17:58:13 +00:00
Jordan Justen	1ed7a65e6d	intel/genxml/12.5: Remove bt-pool enable from 3DSTATE_BINDING_TABLE_POOL_ALLOC This was dropped in gfx12.5. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13992>	2021-12-20 17:58:13 +00:00
Alyssa Rosenzweig	9da1787488	docs/macos: Update for recent Mesa changes - Default c_std is now c11, no need to workaround `89b4f337d5` ("c_std=c11 in meson default_options") - gallium-xlib has been renamed to xlib: `76791db088` ("mesa/x11: Remove the swrast-classic-based fake libGL") Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14216>	2021-12-20 17:53:27 +00:00
Jason Ekstrand	88b9b68f30	vulkan/runtime: Validate instance version on 1.0 implementations This isn't something that ANV or RADV have cared about in a long time but, as people bring up new Vulkan drivers, shipping Vulkan 1.0 is still a thing that happens in Mesa. The common code should also implement the 1.0 rules. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14150>	2021-12-20 16:45:55 +00:00
Jesse Natalie	64991d44a8	microsoft/compiler: Load synthesized sysvals via lowered io Reviewed-by: Enrico Galli <enrico.galli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14175>	2021-12-20 08:20:59 -08:00
Jesse Natalie	8d5b7450a4	microsoft/compiler: Delete non-sysval deref load/store code Reviewed-by: Enrico Galli <enrico.galli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14175>	2021-12-20 08:20:55 -08:00
Jesse Natalie	f30768f1d6	microsoft/compiler: Lower io Reviewed-by: Enrico Galli <enrico.galli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14175>	2021-12-20 08:20:52 -08:00
Jesse Natalie	f4d247c2e3	microsoft/compiler: Support lowered io (nir_intrinsic_load_input/store_output) Reviewed-by: Enrico Galli <enrico.galli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14175>	2021-12-20 08:20:11 -08:00
Simon Ser	0a9886cc45	renderonly: write down usage rules The renderonly helpers are extremely easy to mis-use. Write down the expectations. I've seen many mistakes in the past, including: - Forgetting to create the scanout resource on import [1] [2], causing bugs such as [3]. - Assuming the scanout resource always exists [4]. - Returning a GEM handle valid for the driver's internal DRM FD, but invalid for the caller's DRM FD [5]. - Not implementing resource_get_param, breaking stride/offset/modifier queries when no scanout resource is available [6] [7]. Signed-off-by: Simon Ser <contact@emersion.fr> Reviewed-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Daniel Stone <daniels@collabora.com> [1]: `4aac98f8a6` [2]: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12018 [3]: https://github.com/swaywm/wlroots/issues/2795 [4]: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12081 [5]: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12074 [6]: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12362 [7]: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12370 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12418>	2021-12-20 12:42:03 +01:00
Dave Airlie	d4af7d2519	mesa/st: move st strings handling into mesa Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14257>	2021-12-20 04:35:41 +00:00
Dave Airlie	8956b7f38f	mesa/st: migrate barrier code into mesa Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14257>	2021-12-20 04:35:41 +00:00
Dave Airlie	294dc8fa04	mesa/st: move msaa functionality into multisample.c This moves some state track code into main Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14257>	2021-12-20 04:35:41 +00:00
Dave Airlie	f5eda36760	mesa/st: move get sample position code to static in mesa Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14257>	2021-12-20 04:35:41 +00:00
Dave Airlie	b6fd811d2c	mesa/compute: refactor compute launch to look more like draw Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14256>	2021-12-20 13:41:02 +10:00
Dave Airlie	56f5e69497	mesa/st: migrate compute dispatch to mesa Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14256>	2021-12-20 13:40:57 +10:00
Dave Airlie	20de14c57e	mesa/st: refactor compute dispatch to fill grid info earlier. This fills the grid info earlier and uses info in validation Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14256>	2021-12-20 13:40:42 +10:00
Kostiantyn Lazukin	e9cc1633a2	util/ra: Fix numeric overflow during bitset allocation Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Kostiantyn Lazukin <kostiantyn.lazukin@globallogic.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5752 Fixes: `d4a4cd20d5` ("util/ra: use adjacency matrix for undirected graph") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14224>	2021-12-19 13:10:26 -08:00
Thomas H.P. Andersen	84b21fea46	meson: drop a temp formatting variable This was only needed in meson < 0.50 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14240>	2021-12-18 10:34:44 +00:00
Thomas H.P. Andersen	44dba714f5	docs: update the required meson version Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14240>	2021-12-18 10:34:44 +00:00
Thomas H.P. Andersen	1867a0cebf	meson: drop a comment relating to old meson version This comment was related to an if/else on meson version that has already been removed in `c1a290bdd5` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14240>	2021-12-18 10:34:44 +00:00
Thomas H.P. Andersen	88d0aeab6d	meson: drop compatability with < 0.48 Before meson 0.48 the cpu_family() would return 'ppc64le' on little endian power8. In newer versions it returns 'ppc64' and endianness should be checked with endian() We now require meson >= 0.53 so we can drop the compatability with older versions. The old behavior was added in `e430a034b9` Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14240>	2021-12-18 10:34:44 +00:00
Jason Ekstrand	eebb2dedb2	intel/fs: Add a NONE scheduling mode While our LIFO scheduling mode attempts to optimize for register pressure, it's often hard for a scheduling algorithm to do better than the instruction order provided by the shader author. Shader authors often do perfectly reasonable things like using texture results immediately after fetching them or constructing texture coordinates immediately before the texture op. When we throw all the instruction ordering information away, we loose any help the author may have given us. By attempting NONE before we fall back to the worst case LIFO mode. And, yes, I tried this with NONE both before and after LIFO and doing NONE before LIFO is substantially better, according to shader-db. total instructions in shared programs: 19673152 -> 19665202 (-0.04%) instructions in affected programs: 33669 -> 25719 (-23.61%) helped: 20 HURT: 0 helped stats (abs) min: 15 max: 4609 x̄: 397.50 x̃: 107 helped stats (rel) min: 2.33% max: 67.50% x̄: 14.60% x̃: 9.12% 95% mean confidence interval for instructions value: -867.61 72.61 95% mean confidence interval for instructions %-change: -21.74% -7.46% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 935562500 -> 935020920 (-0.06%) cycles in affected programs: 18620349 -> 18078769 (-2.91%) helped: 104 HURT: 48 helped stats (abs) min: 88 max: 60986 x̄: 8031.48 x̃: 3680 helped stats (rel) min: 0.61% max: 51.44% x̄: 14.95% x̃: 8.87% HURT stats (abs) min: 10 max: 54724 x̄: 6118.62 x̃: 1530 HURT stats (rel) min: 0.13% max: 46.45% x̄: 10.28% x̃: 6.46% 95% mean confidence interval for cycles value: -5724.34 -1401.71 95% mean confidence interval for cycles %-change: -9.86% -4.10% Cycles are helped. total spills in shared programs: 12158 -> 10327 (-15.06%) spills in affected programs: 1831 -> 0 helped: 20 HURT: 0 total fills in shared programs: 14749 -> 12635 (-14.33%) fills in affected programs: 2114 -> 0 helped: 20 HURT: 0 LOST: 8 GAINED: 649 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13734>	2021-12-18 01:46:19 +00:00
Jason Ekstrand	e6ddee764e	intel/fs: Reset instruction order before re-scheduling The way the current scheduler loop is implemented, each scheduling pass starts with what the previous pass had. This means that, if PRE screwed everything up majorly, PRE_NON_LIFO would have to try to fix it. It also meant that tiny changes to one pass would affect every later pass. Instead, reset the order of the instructions before each scheduling pass. This makes the passes entirely independent of each other. Shader-db results on Ice Lake: total instructions in shared programs: 19670486 -> 19670648 (<.01%) instructions in affected programs: 25317 -> 25479 (0.64%) helped: 2 HURT: 7 helped stats (abs) min: 4 max: 4 x̄: 4.00 x̃: 4 helped stats (rel) min: 0.07% max: 0.07% x̄: 0.07% x̃: 0.07% HURT stats (abs) min: 8 max: 70 x̄: 24.29 x̃: 12 HURT stats (rel) min: 0.41% max: 4.95% x̄: 1.47% x̃: 0.87% 95% mean confidence interval for instructions value: -1.28 37.28 95% mean confidence interval for instructions %-change: -0.04% 2.30% Inconclusive result (value mean confidence interval includes 0). total cycles in shared programs: 935535948 -> 935490243 (<.01%) cycles in affected programs: 421994824 -> 421949119 (-0.01%) helped: 1269 HURT: 879 helped stats (abs) min: 1 max: 12008 x̄: 259.38 x̃: 52 helped stats (rel) min: <.01% max: 28.02% x̄: 1.12% x̃: 0.14% HURT stats (abs) min: 1 max: 29931 x̄: 322.46 x̃: 20 HURT stats (rel) min: <.01% max: 32.17% x̄: 1.74% x̃: 0.22% 95% mean confidence interval for cycles value: -71.37 28.81 95% mean confidence interval for cycles %-change: -0.11% 0.21% Inconclusive result (value mean confidence interval includes 0). total spills in shared programs: 12403 -> 12430 (0.22%) spills in affected programs: 1355 -> 1382 (1.99%) helped: 2 HURT: 7 total fills in shared programs: 15128 -> 15182 (0.36%) fills in affected programs: 3294 -> 3348 (1.64%) helped: 2 HURT: 7 LOST: 21 GAINED: 28 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13734>	2021-12-18 01:46:19 +00:00
Jason Ekstrand	d49d092259	Revert "intel/fs: Do cmod prop again after scheduling" This reverts commit `ba2fa1ceaf`. Doing optimizations after scheduling but before RA means doing them in the middle of the scheduling loop which introduces additional dependencies between one scheduling iteration and the next. That won't work if we want to make the scheduling modes independent, at least not unless we have some way of fully cloning the IR. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13734>	2021-12-18 01:46:19 +00:00
Jason Ekstrand	e6f0def97d	intel/eu: Don't double-loop as often in brw_set_uip_jip brw_find_next_block_end() scans through the instructions to find the end of the block. We were calling it for every instruction in the program which is, if you have a single basic block, makes the whole mess a nice clean O(n^2) when it really doesn't need to be. Instead, only call brw_find_next_block_end() as-needed. This brings it back to O(n) like it should have been. This cuts the runtime of the following Vulkan CTS on my SKL box by 5% from 1:51 to 1:45: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13734>	2021-12-18 01:46:19 +00:00
Jason Ekstrand	cf98a3cc19	intel/fs: Use OPT() for split_virtual_grfs Now that we're being conservative in the pass, it's easy to tell when it makes progress and we can put it in the OPT() macro. This way, we get nice INTEL_DEBUG=optimizer dumps for it. While we're here, fix the header comment which is massively out-of-date. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13734>	2021-12-18 01:46:19 +00:00
Jason Ekstrand	38fa18a7a3	intel/fs: Be more conservative in split_virtual_grfs Instead of modifying every single instruction, keep track of which VGRFs are actually split in a bit-set, and only modify the instructions that actually touch split regs. This cuts the runtime of the following Vulkan CTS on my SKL box by 45% from 3:21 to 1:51: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13734>	2021-12-18 01:46:19 +00:00
Caio Oliveira	c9c50f89b2	spirv: Use the incorporated names Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14209>	2021-12-17 16:37:14 -08:00
Caio Oliveira	53f38d3683	spirv: Identify non-temporal image operand added in SPIR-V 1.6 Map it to the existing ACCESS_STREAM_CACHE_POLICY access mode. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14209>	2021-12-17 16:37:14 -08:00
Caio Oliveira	729df14e45	nir: Handle volatile semantics for loading HelperInvocation builtin SPV_EXT_demote_to_helper_invocation added OpDemoteToHelperInvocation operation to turn an invocation into a helper invocation, but the value of HelperInvocation (a builtin from Input storage class) couldn't be modified dynamically without breaking compatibility. For the extension the operation OpIsHelperInvocation was added to get the dynamic value. For SPIR-V 1.6, the demote operation was promoted, but now to get the dynamic value the shader must issue a load to HelperInvocation with Volatile memory access semantics. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14209>	2021-12-17 16:37:14 -08:00
Caio Oliveira	49e0dd6d42	spirv: Update headers and metadata to SPIR-V 1.6, revision 1 Acked-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14209>	2021-12-17 16:37:14 -08:00
Eric Engestrom	96d28f4cde	docs: update calendar and link releases notes for 21.3.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14250>	2021-12-17 23:41:49 +00:00
Eric Engestrom	7dbd3d73d4	docs: add release notes for 21.3.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14250>	2021-12-17 23:41:49 +00:00
Caio Oliveira	3415b51b1c	ci/windows: Remove line numbers of SPIR-V errors in spirv2dxil tests Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14245>	2021-12-17 23:04:55 +00:00
Rhys Perry	fa4e08112e	aco: remove SMEM constant/addition combining out of the loop There's no reason for this optimization to be in this loop. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13755>	2021-12-17 22:14:36 +00:00
Rhys Perry	dd18925f86	aco: skip &-4 before SMEM The hardware ignores the low 2 bits. I'm not sure if they are ignored before or after the address is calculated, but this optimization should be cautious enough. fossil-db (Sienna Cichlid): Totals from 259 (0.19% of 134572) affected shaders: SpillSGPRs: 1381 -> 1382 (+0.07%) SpillVGPRs: 1783 -> 1782 (-0.06%); split: -0.67%, +0.62% CodeSize: 1598612 -> 1596084 (-0.16%); split: -0.30%, +0.14% Scratch: 180224 -> 179200 (-0.57%); split: -1.14%, +0.57% Instrs: 284885 -> 284268 (-0.22%); split: -0.34%, +0.12% Latency: 6585634 -> 6603388 (+0.27%); split: -0.48%, +0.75% InvThroughput: 2638983 -> 2648474 (+0.36%); split: -0.58%, +0.94% VClause: 6797 -> 6820 (+0.34%); split: -0.15%, +0.49% SClause: 6569 -> 6574 (+0.08%); split: -1.11%, +1.19% Copies: 50561 -> 50586 (+0.05%); split: -0.61%, +0.66% Branches: 10058 -> 10062 (+0.04%); split: -0.01%, +0.05% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13755>	2021-12-17 22:14:36 +00:00
Rhys Perry	cf5fc4b973	aco: disallow SMEM offsets that are not multiples of 4 These can't be encoded on GFX6/7, and combining these additions causes CTS failures on GFX10.3. I think the low 2 MSBs are ignored before the addition, not after, so load(a + 3, 0) becomes load(a, 3), which is the same as load(a, 0). No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13755>	2021-12-17 22:14:36 +00:00
Bas Nieuwenhuizen	860532c5a1	radv: Add safety check for RGP traces on VanGogh. To avoid accidental hangs. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5260 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13730>	2021-12-17 21:25:01 +00:00
Emma Anholt	3077d96856	crocus: Clamp VS point sizes to the HW limits as required. Fixes piglit vs-point-size-zero. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14238>	2021-12-17 19:41:54 +00:00
Emma Anholt	39ea803f9f	ci/crocus: Add support for manual CI runs on my G41. Uses a shared runner at my house so we have an easy way to minimally test crocus. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14238>	2021-12-17 19:41:54 +00:00
Rhys Perry	a65285f54b	nir/opt_access: infer CAN_REORDER for global access fossil-db (Sienna Cichlid): Totals from 352 (0.26% of 134621) affected shaders: VGPRs: 17240 -> 17272 (+0.19%) CodeSize: 1753640 -> 1755744 (+0.12%); split: -0.04%, +0.16% Instrs: 323190 -> 323801 (+0.19%); split: -0.03%, +0.22% Latency: 3241205 -> 3241293 (+0.00%); split: -0.10%, +0.10% InvThroughput: 568927 -> 568067 (-0.15%); split: -0.16%, +0.00% SClause: 12109 -> 10444 (-13.75%); split: -13.76%, +0.01% Copies: 27802 -> 27717 (-0.31%); split: -0.56%, +0.26% PreSGPRs: 14699 -> 14690 (-0.06%) PreVGPRs: 15793 -> 15799 (+0.04%) fossil-db (Polaris10): Totals from 348 (0.26% of 135668) affected shaders: SGPRs: 21446 -> 21574 (+0.60%); split: -0.15%, +0.75% VGPRs: 17004 -> 16996 (-0.05%); split: -0.09%, +0.05% CodeSize: 1782796 -> 1783060 (+0.01%); split: -0.03%, +0.05% Instrs: 337828 -> 337921 (+0.03%); split: -0.03%, +0.06% Latency: 3726328 -> 3726721 (+0.01%); split: -0.09%, +0.10% InvThroughput: 1307917 -> 1299841 (-0.62%); split: -0.62%, +0.00% VClause: 4327 -> 4337 (+0.23%); split: -0.09%, +0.32% SClause: 12178 -> 10529 (-13.54%); split: -13.55%, +0.01% Copies: 40227 -> 40244 (+0.04%); split: -0.19%, +0.24% PreSGPRs: 14946 -> 14937 (-0.06%) PreVGPRs: 15637 -> 15643 (+0.04%) fossil-db (Pitcairn): Totals from 351 (0.26% of 135668) affected shaders: SGPRs: 20382 -> 20619 (+1.16%); split: -0.79%, +1.95% CodeSize: 1789732 -> 1789836 (+0.01%); split: -0.04%, +0.04% MaxWaves: 1947 -> 1949 (+0.10%) Instrs: 352274 -> 352318 (+0.01%); split: -0.04%, +0.06% Latency: 4057829 -> 4058226 (+0.01%); split: -0.08%, +0.09% InvThroughput: 1332245 -> 1317578 (-1.10%); split: -1.11%, +0.01% VClause: 8581 -> 8583 (+0.02%); split: -0.13%, +0.15% SClause: 12187 -> 10552 (-13.42%); split: -13.43%, +0.02% Copies: 44906 -> 44915 (+0.02%); split: -0.24%, +0.26% PreSGPRs: 16571 -> 16562 (-0.05%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14227>	2021-12-17 18:51:24 +00:00
Rhys Perry	403ae3b48e	nir/algebraic: optimize more 64-bit imul with constant source Two 64-bit shifts and an addition are usually faster than the several multiplications nir_lower_int64 creates. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14227>	2021-12-17 18:51:24 +00:00
Rhys Perry	c56cf157c5	nir/opt_load_store_vectorize: improve ssbo/global alias analysis If either the global access or the ssbo access is restrict, they shouldn't alias. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14227>	2021-12-17 18:51:24 +00:00
Samuel Pitoiset	ac8afe3f44	radv: fix dynamic rendering global scissor Make sure to clamp the global scissor to the render area. This fixes dEQP-VK.draw.dynamic_rendering.*oversized. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14170>	2021-12-17 16:51:00 +00:00
Jason Ekstrand	288a670f17	anv/pipeline: Get rid of sample_shading_enable Putting it in the pipeline is a bit of a lie. We no longer need it for nir_lower_wpos_center. The only other user is pipeline_has_coarse_pixel and that is used to build the shader key which we construct before we've processed any NIR so we don't have accurate information at that time anyway. Instead, look at ms_info->sampleShadingEnable directly in pipeline_has_coarse_pixel and trust the back-end to deal with disabling coarse when we need per-sample dispatch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	deec7a590b	anv,nir: Use sample_pos_or_center in lower_wpos_center Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	3c89dbdbfe	intel/fs: Implement the sample_pos_or_center system value Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	a580fd55e1	intel/fs: Rework emit_samplepos_setup() This rolls compute_sample_position into emit_samplepos_setup, its only caller, by using a loop instead of calling it twice. We also early-return for the !persample_dispatch case instead of doing it as part of the sample calculation. This means that we don't call fetch_payload_reg() to get sample_pos_reg unless we're actually going to use it so the function is safe to call even if we haven't set up sample_pos_reg. This will be important for the next commit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	ac7255ed1e	intel/fs: Return fs_reg directly from builtin setup helpers There's no good reason why we're allocating them on the heap and returning a pointer. Return the fs_reg directly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	e8acc5a7ea	nir: Add a new sample_pos_or_center system value Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Jason Ekstrand	732b234ddb	radeonsi/nir: Check for VARYING_SLOT_PRIMITIVE_ID not SYSTEM_VALUE This function is called on load/store_input/output. It makes no sense for it to get a SYSTEM_VALUE enum. This only doesn't explode because SYSTEM_VALUE_PRIMITIVE_ID happens to be below VARYING_SLOT_VAR0 so it doesn't interact with any actual varyings. The next commit is going to add another system value which will push SYSTEM_VALUE_PRIMITIVE_ID up by one so it will equal VARYING_SLOT_VAR0 and then the first FS input will always get smashed to flat which isn't what we want. Fixes: `b59bb9c07a` ("radeonsi: force flat for PrimID early in si_nir_scan_shader") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14198>	2021-12-17 16:02:16 +00:00
Pierre-Eric Pelloux-Prayer	41cc6a4c7f	glthread: only log glthread destroy reason when it's not NULL Fixes: `670759a208` ("glthread: inline _mesa_glthread_restore_dispatch and merge disable & destroy") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14226>	2021-12-17 11:56:24 +00:00
Pierre-Eric Pelloux-Prayer	c1860a6848	radeonsi: don't use perp. end caps when line smoothing is on The line smoothing algorithm causes the diagonal line to be visible. See: https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700#note_1187405 Fixes: `4571778008` ("radeonsi: set PERPENDICULAR_ENDCAP_ENA for wide AA lines") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14226>	2021-12-17 11:56:24 +00:00
Rhys Perry	94603786c5	aco: fix check_vop3_operands() for f16vec2 ffma fneg combine For v_pk_fma_f16, we should consider all three operands, not the first two. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `15a375b4c8` ("radv,aco: don't lower some ffma instructions") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14229>	2021-12-17 11:16:12 +00:00
Marcin Ślusarz	504e5cb4e8	nir/print: print const value near each use of const ssa variable Without/with NIR_DEBUG=print,print_const: -vec4 32 ssa_60 = fadd ssa_59, ssa_58 +vec4 32 ssa_60 = fadd ssa_59 /(0xbf800000, 0x3e800000, 0x00000000, 0x3f800000) = (-1.000000, 0.250000, 0.000000, 1.000000)/, ssa_58 Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13880>	2021-12-17 10:04:50 +00:00
Marcin Ślusarz	23f8f836e0	nir/print: group hex and float vectors together Vectors are much easier to follow in this format, because developer cares either about hex or float values, never both. Before/after: -vec4 32 ssa_222 = load_const (0x00000000 /* 0.000000 /, 0x00000000 / 0.000000 /, 0x3f800000 / 1.000000 /, 0x3f800000 / 1.000000 /) +vec4 32 ssa_222 = load_const (0x00000000, 0x00000000, 0x3f800000, 0x3f800000) = (0.000000, 0.000000, 1.000000, 1.000000) -vec1 32 ssa_174 = load_const (0xbf800000 / -1.000000 */) +vec1 32 ssa_174 = load_const (0xbf800000 = -1.000000) Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13880>	2021-12-17 10:04:50 +00:00
Marcin Ślusarz	d2b4051ea9	nir/print: move print_load_const_instr up ... to avoid forward declarations in future commit Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13880>	2021-12-17 10:04:50 +00:00
Juan A. Suarez Romero	bc11dc7187	broadcom/ci: restructure expected results Sort/rename the files so expected tests are classified by device. No need to split the tests by driver (e.g., V3D vs V3DV). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13983>	2021-12-17 09:15:34 +00:00
Bas Nieuwenhuizen	ed7c48f94a	radv/amdgpu: Only wait on queue_syncobj when needed. If signalled on the same queue it is totally useless, so only wait if we have a syncobj that is explicitly being waited on, which can be from potentially another queue/ctx. (Ideally we'd check but there is no way to do so currently. Might revisit when we integrate the common sync framework) Fixes: `7675d066ca` ("radv/amdgpu: Add support for submitting 0 commandbuffers.") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14214>	2021-12-17 08:54:08 +00:00
Jason Ekstrand	3878094eb1	anv: Drop anv_sync_create_for_bo The older helper is unused so we can roll it all into anv_create_sync_for_memory. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14237>	2021-12-17 00:55:31 +00:00
Lionel Landwerlin	b00086d393	anv,wsi: simplify WSI synchronization Rather than using 2 vfuncs, use one since we've unified the synchronization framework in the runtime with a single vk_sync object. v2 (Jason Ekstrand): - create_sync_for_memory is now in vk_device Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14237>	2021-12-17 00:55:31 +00:00
Jason Ekstrand	9ae1e621e5	anv: Implement vk_device::create_sync_for_memory Fixes: `36ea90a361` ("anv: Convert to the common sync and submit framework") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14237>	2021-12-17 00:55:31 +00:00
Jason Ekstrand	2188829d29	vulkan/queue: Handle WSI memory signal information We handle it by asking the driver to create a vk_sync that wraps a VkDeviceMemory object and gets passed as one of the signal ops. Fixes: `9bffd81f1c` ("vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14237>	2021-12-17 00:55:31 +00:00
Lionel Landwerlin	cdf101455d	vulkan: fix missing handling of WSI memory signal Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `b996fa8efa` ("anv: implement VK_KHR_synchronization2") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5744 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14237>	2021-12-17 00:55:31 +00:00
Ian Romanick	ff44547ea4	intel/stub: Implement shell versions of DRM_I915_GEM_GET_TILING and DRM_I915_SEM_GET_TILING This is necessary to use intel_stub_gpu with Crocus. v2: Remove unused i915_bo::swizzle_mode. Noticed by Emma. Fixes: `953a4ca6fe` ("intel: Add has_bit6_swizzle to devinfo") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14218>	2021-12-16 23:06:38 +00:00
Ian Romanick	2dc7c24b80	intel/stub: Silence "initialized field overwritten" warning src/intel/tools/intel_noop_drm_shim.c:459:36: warning: initialized field overwritten [-Woverride-init] 459 \| [DRM_I915_GEM_EXECBUFFER2_WR] = i915_ioctl_noop, \| ^~~~~~~~~~~~~~~ Fixes: `0f4f1d70bf` ("intel: add stub_gpu tool") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14218>	2021-12-16 23:06:38 +00:00
Emma Anholt	9c722a06ed	ci/freedreno: Add known flakes from the last month. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14239>	2021-12-16 22:37:53 +00:00
Adam Jackson	c77e5af7a3	glx: Fix GLX_NV_float_buffer fbconfig handling Since we didn't record this attribute from the server, we wouldn't account for it in glXChooseFBConfig, and glXGetFBConfigAttrib wouldn't know about it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14221>	2021-12-16 22:05:20 +00:00
Chia-I Wu	108881cbcc	venus: add some trace points Add trace points for - vn_AcquireNextImage2KHR and vn_QueuePresentKHR - vn_AcquireImageANDROID and vn_QueueSignalReleaseImageANDROID - vn_BeginCommandBuffer and vn_EndCommandBuffer - vn_Wait - vn_Queue* - vn_instance_wait_roundtrip - shmem allocations and cache miss/skip v2: fix cache miss/skip trace points (Ryan) Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> (v1) Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14215>	2021-12-16 19:27:56 +00:00
Michel Zou	631b3fe3e9	meson: correctly detect linker arguments Fixes: `22673a98` ("meson: Check arguments before adding") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13961>	2021-12-16 17:19:28 +00:00
Emma Anholt	7a22967de3	r300: Remove unused RC_OPCODE_DPH Nothing generates it in the backend. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	9312bfb5fb	r300: Remove unused RC_OPCODE_SFL Nothing generates it in the backend. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	495d119aa9	r300: Remove unused RC_OPCODE_CLAMP. Nothing generates it in the backend. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	9ed55c0c15	r300: Remove unused RC_OPCODE_SWZ. Nothing generates it in the backend. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	a982d0baf3	r300: Remove unused RC_OPCODE_XPD. Nothing generates it in the backend. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	2e2b755ecb	r300: Remove unused RC_OPCODE_ABS. Nothing generates it in the backend. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	7a0c3b1024	r300: Remove support for SCS. Nothing generates this meta-op in the backend, so we don't need it. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Emma Anholt	acef6b6bb3	r300: Remove some dead compiler code. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14211>	2021-12-16 16:57:02 +00:00
Marcin Ślusarz	f7e63ec5d8	nir/print: compact printing of intrinsic indices Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14222>	2021-12-16 09:43:13 +00:00
Marcin Ślusarz	d8fa625bb3	nir/print: expand printing of io semantics.gs_streams gs_streams can be set for at least 2 other intrinsics. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14222>	2021-12-16 09:43:13 +00:00
Marcin Ślusarz	be25db9f0f	nir/print: simplify printing of IO semantics Some of the tested flags are set for other intrinsics and they are printed only when set, so there's no point in checking exact intrinsic name or shader stage. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14222>	2021-12-16 09:43:13 +00:00
Kenneth Graunke	7325179bcb	intel/compiler: Use uppercase enum values in brw_ir_performance.cpp This is by far the more common style in Mesa. It also gives a cue that e.g. num_dependency_ids is a fixed definition rather than some kind of local variable maintaining a count. While hre, we also rename the enums to have full prefixes to prepare for a future where we use them in multiple files for future backend work. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14182>	2021-12-16 09:00:57 +00:00
Kenneth Graunke	d3f4f23ca3	intel/vec4: Inline emit_texture and move helpers to brw_vec4_nir.cpp emit_texture() only has one caller, nir_emit_texture(). We may as well inline that. Move the associated helper functions for emitting sampler messages there as well, to keep associated code nearby. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5183 Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14191>	2021-12-16 00:09:45 -08:00
Kenneth Graunke	92d194427d	intel/vec4: Use nir_texop in emit_texture instead of translating We eliminated the GLSL IR -> vec4 backend ages ago, so the only caller uses a nir_texop enum. Drop a layer of translating. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14191>	2021-12-16 00:09:44 -08:00
Kenneth Graunke	2729a741fc	intel/vec4: Use ir_texture_opcode less in emit_texture() This replaces a bunch of uses of the GLSL IR ir_texture_opcode enum with the backend opcode, in preparation for removing it altogether. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14191>	2021-12-16 00:09:36 -08:00
Samuel Pitoiset	5ce4017a2b	radv,aco: do not disable anisotropy filtering for non-mipmap images This fixes dEQP-VK.texture.filtering_anisotropy.single_level.anisotropy_*.mag_linear_min_linear. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14171>	2021-12-16 07:20:50 +00:00
Samuel Pitoiset	8a327722d5	ac/nir: add an option to disable anisotropic filtering for single level images Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14171>	2021-12-16 07:20:50 +00:00
Pierre-Eric Pelloux-Prayer	1cb5c1775b	glx: fix querying GLX_FBCONFIG_ID for Window This commit fixes apps using the following sequence: 1. XCreateWindow(dpy) -> win 2. glXCreateContextAttribsARB(dpy, ...) -> ctx 3. glXMakeCurrent(dpy, win, ctx) 4. glXQueryDrawable(dpy, win, GLX_FBCONFIG_ID, ...) glXQueryDrawable returned 0 (while correctly returning a valid GLXFCONFIG_ID for other types of drawables). This commit adds the same dance as driInferDrawableConfig to get the GLX visual from the Window, and then the GLXFBCONFIG_ID of this visual. This fixes: * piglit: glx-query-drawable --attr=GLX_FBCONFIG_ID --type=WINDOW * Maya which uses the config ID from step 4 as an input to glXChooseFBConfig. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14174>	2021-12-16 01:21:36 +00:00
Adam Jackson	6c5b3c6bb5	dri: Remove unused driGetRendererString Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14199>	2021-12-15 19:43:42 -05:00
Adam Jackson	7cc42a8dd4	dri: Remove unused driUpdateFramebufferSize Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14199>	2021-12-15 19:43:41 -05:00
Adam Jackson	7e1e3722fb	dri: Remove unused driContextSetFlags Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14199>	2021-12-15 19:43:39 -05:00
Adam Jackson	69aad97788	mesa: Remove unused _mesa_initialize_visual Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14199>	2021-12-15 19:43:28 -05:00
Sagar Ghuge	cd38b6e2e8	anv, iris: Implement Wa_14014890652 for DG2 Workaround is to set: 3DSTATE_VFG::GranularityThresholdDisable = 1 3DSTATE_VFG::DistributionGranularity = BATCH 3DSTATE_VF::GeometryDistributionEnable = 1 Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14212>	2021-12-16 00:00:23 +00:00
Anuj Phogat	40b66a4499	anv, iris: Add Wa_22011440098 for DG2 Rework: * Jordan: Set MOCS after `7b78b2fcac` ("intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+") Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14212>	2021-12-16 00:00:22 +00:00
Anuj Phogat	17a1df79ba	anv, iris: Add Wa_16011773973 for DG2 Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14212>	2021-12-16 00:00:22 +00:00
Caio Oliveira	b1156f23a2	Revert "nir: disable a NIR test due to undebuggable & locally unreproducible CI failures" This reverts commit `6eb3fe2d4f`. The root cause was a bug in Meson when using the new gtest protocol and a test failed before producing the XML file expected by it. This was fixed in later versions of Meson, so we've bumped the required meson version to use that feature. The failure should now be properly identified, so re-enabling the NIR test. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14204>	2021-12-15 23:28:09 +00:00
Caio Oliveira	49c356a335	meson: Bump version required for gtest protocol The feature was added in 0.55 but there was a bug when tests crashed (and no XML file was generated) that was only fixed in 0.59.2. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14204>	2021-12-15 23:28:09 +00:00
Caio Oliveira	dcc7b19cae	nir: Initialize nir_register::divergent Fixes: `c7fc44f9eb` ("nir/from_ssa: Respect and populate divergence information") Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14205>	2021-12-15 22:39:06 +00:00
Emma Anholt	3ffd6f3fa6	nir_to_tgsi: Set the TGSI Precise flag for exact ALU instructions. This flag is used by the nv50, r600, and svga backends for instruction exactness. It was easier to plumb it in as an override in tgsi_ureg than to make all of ALU instruction emit do it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14201>	2021-12-15 21:58:04 +00:00
Ian Romanick	af4d277ccc	mesa: OpenGL 1.3 and OpenGL ES 1.0 are not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14203>	2021-12-15 20:25:19 +00:00
Ian Romanick	5f14e98780	mesa: OpenGL 1.3 feature GL_ARB_texture_env_dot3 is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14203>	2021-12-15 20:25:19 +00:00
Ian Romanick	61a3e68767	mesa: OpenGL 1.3 feature GL_ARB_texture_env_combine is not optional v2: GL_SRC_COLOR, GL_ONE_MINUS_SRC_COLOR, and GL_ONE_MINUS_SRC_ALPHA should always be supported now. Noticed by Marek. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14203>	2021-12-15 20:25:19 +00:00
Ian Romanick	7649ab1f03	mesa: OpenGL 1.3 feature GL_ARB_texture_cube_map is not optional Cheatsheet: _mesa_has_ARB_texture_cube_map() becomes (true && ctx->Extensions.Version >= _mesa_extension_table[...].version[ctx->API]). The last value is 0 when ctx->API is API_OPENGL_COMPAT and ~0 otherwise. The whole function effectively becomes (ctx->API == API_OPENGL_COMPAT). _mesa_has_OES_texture_cube_map() becomes (true && ctx->Extensions.Version >= _mesa_extension_table[...].version[ctx->API]). The last value is 0 when ctx->API is API_OPENGLES and ~0 otherwise. The whole function effectively becomes (ctx->API == API_OPENGLES). Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14203>	2021-12-15 20:25:19 +00:00
Ian Romanick	c11641ab24	mesa: OpenGL 1.3 feature GL_ARB_texture_border_clamp is not optional Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14203>	2021-12-15 20:25:19 +00:00
Ian Romanick	2ca13abcce	intel/fs: Use HF as destination type for F32TOF16 in fquantize2f16 Having an integer destination type instead of a float destination type confuses the SWSB code. This causes problems on some Intel GPUs. Fix this by using the correct type in the destination of the F32TOF16 opcode. Gfx7 doesn't have the HF type, so continue to emit W on that platform. The assertions in brw_F32TO16 (brw_eu_emit.c) are updated to reflect this. In scalar mode, UD is never emitted as a destination type for this opcode, so remove it from the allowed types in the assertion. I also condidered doing something like `de55fd358f` ("intel/fs/xehp: Teach SWSB pass about the exec pipeline of FS_OPCODE_PACK_HALF_2x16_SPLIT."), but Curro recommended that just using the correct types is a better fix. I agree. v2: Add missing changes to fs_generator::generate_pack_half_2x16_split. I'm not sure how I (and the Intel CI) missed that the first time. :( v3: Fix copy-and-paste issue in the v2 fix. Noticed by Tapani. Reviewed-by: Francisco Jerez <currojerez@riseup.net> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14181>	2021-12-15 20:03:51 +00:00
Chia-I Wu	9c81de7df2	venus: cache shmems Shmems are allocated internally and are only for CPU access. They can be easily cached. Venus have 4 sources of shmem allocations - the ring buffer - the reply stream - the indirection submission upload cs - one cs for each vn_command_buffer The first one is allocated only once. The other three reallocate occasionally. The frequencies depend on the workloads. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14179>	2021-12-15 19:02:29 +00:00
Chia-I Wu	7bec2a0b23	venus: add VN_CS_ENCODER_STORAGE_SHMEM_POOL for VkCommandBuffer It suballocates from a shmem pool owned by vn_instance. The goals are to speed up shmem allocations for VkCommandBuffer and to reduce the number of BOs. Both are crucial when shmems are HOST3D BOs, because they require roundtrips to the renderer to allocate and they take up KVM memslots. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14179>	2021-12-15 19:02:29 +00:00
Chia-I Wu	487926aa86	venus: add vn_cs_encoder_storage_type It generalizes cs->indirect. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14179>	2021-12-15 19:02:29 +00:00
Chia-I Wu	1fe8f0fea0	venus: use vn_renderer_shmem_pool for reply shmems Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14179>	2021-12-15 19:02:29 +00:00
Chia-I Wu	35c430e75a	venus: add vn_renderer_shmem_pool It provides shmem suballocations. It is designed to be used with short-lived shmems. A long-lived shmem can hold on to some large allocation while only using a likely small region of the large allocation. v2: cleanups suggested by Yiwei Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> (v1) Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14179>	2021-12-15 19:02:29 +00:00
Chia-I Wu	511fb6b8e9	venus: add vn_renderer_util.[ch] Move helpers built on top of vn_renderer.h to the new files. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14179>	2021-12-15 19:02:29 +00:00
Dave Airlie	ee283c49b7	mesa: inline mesa_initialize_buffer_object. This has no other users now. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	131efc204d	mesa/st: remove st_cb_bufferobjects* this has all been merged into mesa now Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	8c1472bc84	mesa/bufferobj: move invalidate buffer to optional feature Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	a79f5d9016	mesa/st: rename access flag to transfer flag function Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	fdd31298d3	bufferobj: cleanup subdata copies This moves the common dst min/max invalidation and renames to be a bit more consistent Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	955ddc02e4	bufferobj: inline page commitment Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	55bb6cc8e1	bufferobj: inline buffer clearing Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	22e516b846	bufferobj: make sw clear buffer static, move it and rename it Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	ff8c2a1748	mesa/bufferobj: rename bufferobj functions to be more consistent. After all the refactoring, start consolidating a bit and get the API names more consistent Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	f6840bb940	mesa/st: make static the buffer object funcs that can be Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	7288fcdc72	mesa/st: migrate most of state tracker buffer objects into mesa This moves all of non-optional st functions into the main bufferobj.c file. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	8232cf1b4d	mesa: add pointer to cso_context to gl_context Makes migrating code easier Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	22607020f5	mesa: add a pointer to st_config_options to gl_context Allows porting out of st code easier Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	f6c608dd24	mesa: add a pipe_context pointer to gl context This will be used to move more code over Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	90cb1493b7	mesa/st: start moving bufferobject alloc/free/reference to main. This moves these out of the state tracker code Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Dave Airlie	970daedb1d	mesa/st: merge st buffer object into GL Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14133>	2021-12-15 13:29:33 +00:00
Alejandro Piñeiro	1c4f76672d	broadcom/compiler: avoid unneeded sint/unorm clamping when lowering stores They are being used on integer to integer stores. From Vulkan sec, final paragraph of 16.4.4 "Texel Output Format Conversion": "Each component is converted based on its type and size (as defined in the Format Definition section for each VkFormat). ... Integer outputs are converted such that their value is preserved. The converted value of any integer that cannot be represented in the target format is undefined." I didn't find a equivalent quote for OpenGL as all conversion entries are forcused on float to integer, fixed-point to integer, etc, and not on integer to integer. Didn't find any test failure with this change. We didn't get any shader-db stats change with shaderdb (even overriding to OpenGL 4.4 to get more shaders built), so as a reference Vulkan shader-db stats with the pattern dEQP-VK.image..with_format..* total instructions in shared programs: 37534 -> 36522 (-2.70%) instructions in affected programs: 12080 -> 11068 (-8.38%) helped: 241 HURT: 0 Instructions are helped. total uniforms in shared programs: 9100 -> 8550 (-6.04%) uniforms in affected programs: 3004 -> 2454 (-18.31%) helped: 229 HURT: 0 total max-temps in shared programs: 6110 -> 6014 (-1.57%) max-temps in affected programs: 402 -> 306 (-23.88%) helped: 43 HURT: 0 Max-temps are helped. total nops in shared programs: 1523 -> 1526 (0.20%) nops in affected programs: 21 -> 24 (14.29%) helped: 3 HURT: 6 Inconclusive result (value mean confidence interval includes 0). Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14194>	2021-12-15 11:53:20 +00:00
Samuel Pitoiset	cca8803e6b	radv/winsys: update sparse mappings with OP_REPLACE instead of OP_MAP/OP_UNMAP When the BO is NULL, AMDGPU will reset the PTE VA range to the initial state. Otherwise, it will first unmap all existing VA that overlap the requested range and then map. This seems better than using MAP/UNMAP. This reduces stuttering in Forza Horizon 5. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14116>	2021-12-15 08:17:29 +01:00
Samuel Pitoiset	a6ca0a8c52	radv/winsys: stop using reference counting for virtual BOs This shouldn't be necessary because applications have to manage resources and memory themselves. This also prevented memory to be freed if an application doesn't unbind a sparse memory object and free it, which is legal as long as the resource isn't used afterwards. This was introduced to unmap the sparse mappings when destroying a virtual BO, but now that the driver uses OP_CLEAR it's no longer needed. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14116>	2021-12-15 08:17:26 +01:00
Samuel Pitoiset	a931d5a4a4	radv/winsys: clear the PRT VA range when destroying a virtual BO Instead of unmapping every range. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14116>	2021-12-15 08:17:24 +01:00
Samuel Pitoiset	782474782b	radv/winsys: remove useless has_sparse_vm_mappings checks Sparse is only exposed on GFX8+, so this is always TRUE. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14116>	2021-12-15 08:17:22 +01:00
Jason Ekstrand	b05d228695	Revert "anv: Stop doing too much per-sample shading" This reverts commit `1f559930b6`. Turns out, this approach won't work. Fixes: `1f559930b6` ("anv: Stop doing too much per-sample shading") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14196>	2021-12-14 18:09:03 +00:00
Marek Olšák	ae4065f0b2	mesa: use nop dispatch for ColorTable/Convolution/Histogram The nop dispatch generates GL_INVALID_OPERATION too. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:29:00 -05:00
Marek Olšák	7994b6c893	mesa: remove all GL func forward declarations because they are autogenerated Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:28:58 -05:00
Marek Olšák	0ca96f5cf6	mesa,vbo: make ES wrapper functions static Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:28:57 -05:00
Marek Olšák	4c91c6162b	glapi: add missing no_error settings for implemented functions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:28:49 -05:00
Marek Olšák	9a9d14fa4d	mesa: remove COPY_DISPATCH code that doesn't do anything When we get into create_beginend_table, ctx->Exec only contains nops set by _mesa_alloc_dispatch_table. This function calls _mesa_alloc_dispatch_table too, so table and ctx->Exec are identical, and then it copies identical entries from one table to the other. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:01:09 -05:00
Marek Olšák	933a88f76c	mesa: rename _ae_ArrayElement -> _mesa_ArrayElement to match glapi Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:01:08 -05:00
Marek Olšák	e49d9c0fed	mesa: use ctx->GLThread.enabled now that it's correct Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:01:07 -05:00
Marek Olšák	d052612317	glthread: disable glthread if the context is lost Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:01:06 -05:00
Marek Olšák	9d8301d602	glthread: fix restoring the dispatch in destroy when the context is not current also remove an invalid comment in mtypes.h Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:01:05 -05:00
Marek Olšák	670759a208	glthread: inline _mesa_glthread_restore_dispatch and merge disable & destroy No change in behavior. This fixes ctx->GLThread.enabled, which was only set to false by destroy. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:01:02 -05:00
Marek Olšák	7b123ad16a	glthread: set marshal functions in dispatch only if they exist in the API We now have proper nop dispatch for the unset functions. The autogenerated code looks like this: if ((ctx->API == API_OPENGLES2 && ctx->Version >= 31)) { if (_gloffset_DepthRangeArrayfvOES >= 0) ((_glapi_proc )(ctx->MarshalExec))[_gloffset_DepthRangeArrayfvOES] = (_glapi_proc)_mesa_marshal_DepthRangeArrayfvOES; if (_gloffset_DepthRangeIndexedfOES >= 0) ((_glapi_proc )(ctx->MarshalExec))[_gloffset_DepthRangeIndexedfOES] = (_glapi_proc)_mesa_marshal_DepthRangeIndexedfOES; } if (_mesa_is_desktop_gl(ctx)) { if (_gloffset_AlphaToCoverageDitherControlNV >= 0) ((_glapi_proc )(ctx->MarshalExec))[_gloffset_AlphaToCoverageDitherControlNV] = (_glapi_proc)_mesa_marshal_AlphaToCoverageDitherControlNV; if (_gloffset_AttachObjectARB >= 0) ((_glapi_proc )(ctx->MarshalExec))[_gloffset_AttachObjectARB] = (_glapi_proc)_mesa_marshal_AttachObjectARB; if (_gloffset_BeginQueryIndexed >= 0) ((_glapi_proc )(ctx->MarshalExec))[_gloffset_BeginQueryIndexed] = (_glapi_proc)_mesa_marshal_BeginQueryIndexed; if (_gloffset_BindBufferOffsetEXT >= 0) ((_glapi_proc )(ctx->MarshalExec))[_gloffset_BindBufferOffsetEXT] = (_glapi_proc)_mesa_marshal_BindBufferOffsetEXT; ... Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:42 -05:00
Marek Olšák	e93a9b422c	glthread: add nop dispatch so that glthread behaves the same as the main dispatch. Also fix the SetError function for GLES 1.0. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:40 -05:00
Marek Olšák	dd3709dcfd	vbo: expose all exec entrypoints for glthread and match api_exec_decl.h names Autogenerated glthread code will call these directly. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:38 -05:00
Marek Olšák	bade2407fa	mesa: remove GLvertexformat Function pointers were first set in GLvertexformat, and then GLvertexformat was copied to the dispatch. This just sets the function pointers in the dispatch directly, skipping the intermediate GLvertexformat structure. The code with SET_* calls is autogenerated by api_vtxfmt_init_h.py. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:35 -05:00
Marek Olšák	a87e5d437e	glapi: autogenerate install_vtxfmt with python This is a prerequisite for the GLvertexformat removal. The autogenerated file looks like this: if (_mesa_is_desktop_gl(ctx) \|\| (ctx->API == API_OPENGLES2 && ctx->Version >= 30)) { SET_VertexAttribI4iEXT(tab, NAME(VertexAttribI4iEXT)); SET_VertexAttribI4ivEXT(tab, NAME(VertexAttribI4ivEXT)); SET_VertexAttribI4uiEXT(tab, NAME(VertexAttribI4uiEXT)); SET_VertexAttribI4uivEXT(tab, NAME(VertexAttribI4uivEXT)); } if (ctx->API == API_OPENGLES2) { SET_VertexAttrib1fARB(tab, NAME_ES(VertexAttrib1fARB)); SET_VertexAttrib1fvARB(tab, NAME_ES(VertexAttrib1fvARB)); SET_VertexAttrib2fARB(tab, NAME_ES(VertexAttrib2fARB)); SET_VertexAttrib2fvARB(tab, NAME_ES(VertexAttrib2fvARB)); SET_VertexAttrib3fARB(tab, NAME_ES(VertexAttrib3fARB)); SET_VertexAttrib3fvARB(tab, NAME_ES(VertexAttrib3fvARB)); SET_VertexAttrib4fARB(tab, NAME_ES(VertexAttrib4fARB)); SET_VertexAttrib4fvARB(tab, NAME_ES(VertexAttrib4fvARB)); } if (ctx->API == API_OPENGL_COMPAT) { SET_ArrayElement(tab, NAME_AE(ArrayElement)); SET_Begin(tab, NAME(Begin)); SET_CallList(tab, NAME_CALLLIST(CallList)); SET_CallLists(tab, NAME_CALLLIST(CallLists)); SET_Color3b(tab, NAME(Color3b)); SET_Color3bv(tab, NAME(Color3bv)); ... Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:29 -05:00
Marek Olšák	1f33948733	glapi: autogenerate all _mesa_* forward declarations in api_exec_decl.h We could remove them from other header files now. This purposefully omits "_exec" in _mesa_exec such as _mesa_exec_Begin to make it pretty. Later commits will remove _exec from names, e.g. it will become _mesa_Begin. The only other variants are really just save_Begin (dlist) and _save_Begin (vbo). The autogenerated file looks like this: void GLAPIENTRY _mesa_NewList(GLuint list, GLenum mode); void GLAPIENTRY _mesa_EndList(void); void GLAPIENTRY _mesa_CallList(GLuint list); void GLAPIENTRY _mesa_CallLists(GLsizei n, GLenum type, const GLvoid * lists); void GLAPIENTRY _mesa_DeleteLists(GLuint list, GLsizei range); GLuint GLAPIENTRY _mesa_GenLists(GLsizei range); void GLAPIENTRY _mesa_ListBase(GLuint base); void GLAPIENTRY _mesa_Begin(GLenum mode); void GLAPIENTRY _mesa_Bitmap(GLsizei width, GLsizei height, GLfloat xorig, GLfloat yorig, GLfloat xmove, GLfloat ymove, const GLubyte * bitmap); void GLAPIENTRY _mesa_Color3b(GLbyte red, GLbyte green, GLbyte blue); void GLAPIENTRY _mesa_Color3bv(const GLbyte * v); ... Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:25 -05:00
Marek Olšák	5603d3e42c	mesa: remove api_exec.h and move its contents into context.h Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:24 -05:00
Marek Olšák	898649c145	glapi: autogenerate api_save.h with save_* function declarations This is planned to be used by glthread for its own dispatch mechanism. The autogenerated file looks like this: void GLAPIENTRY save_NewList(GLuint list, GLenum mode); void GLAPIENTRY save_ListBase(GLuint base); void GLAPIENTRY save_Bitmap(GLsizei width, GLsizei height, GLfloat xorig, GLfloat yorig, GLfloat xmove, GLfloat ymove, const GLubyte * bitmap); void GLAPIENTRY save_RasterPos2d(GLdouble x, GLdouble y); void GLAPIENTRY save_RasterPos2dv(const GLdouble * v); void GLAPIENTRY save_RasterPos2f(GLfloat x, GLfloat y); void GLAPIENTRY save_RasterPos2fv(const GLfloat * v); void GLAPIENTRY save_RasterPos2i(GLint x, GLint y); void GLAPIENTRY save_RasterPos2iv(const GLint * v); ... Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:20 -05:00
Marek Olšák	df3447c331	glapi: autogenerate _mesa_initialize_save_table with python The generated file looks like this: SET_NewList(table, save_NewList); SET_ListBase(table, save_ListBase); SET_Bitmap(table, save_Bitmap); SET_RasterPos2d(table, save_RasterPos2d); SET_RasterPos2dv(table, save_RasterPos2dv); SET_RasterPos2f(table, save_RasterPos2f); SET_RasterPos2fv(table, save_RasterPos2fv); SET_RasterPos2i(table, save_RasterPos2i); SET_RasterPos2iv(table, save_RasterPos2iv); SET_RasterPos2s(table, save_RasterPos2s); ... Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:19 -05:00
Marek Olšák	d7c5161242	glapi: move reusable glapi printing code to apiexec.py This will be used by all new scripts. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:18 -05:00
Marek Olšák	ac622b8536	vbo: rename ES vertex functions to match GL dispatch names vbo_init_tmp.h will be autogenerated. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:16 -05:00
Marek Olšák	62eba623b5	vbo: rename vertex functions to match GL dispatch names vbo_init_tmp.h will be autogenerated. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:13 -05:00
Marek Olšák	dd6b1ae110	mesa: add EXT suffix to VertexAttribIEXT to match glapi name I don't wanna do it the other way and potentially break the libGL - _dri ABI. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:11 -05:00
Marek Olšák	77c2a7c2e4	glapi: replace dispatch.h inline functions with macros for faster compilation A change in dispatch.h now takes 11.7% less user+sys time to compile. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:11 -05:00
Marek Olšák	1aa0b587cd	glapi: move apiexec API condition determination to common code it will be used elsewhere Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:10 -05:00
Marek Olšák	6e4238f99a	glapi: rename gl_genexec.py to api_exec_init.py, api_exec.c to api_exec_init.c this seems cleaner Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:09 -05:00
Marek Olšák	9cef21e33f	mesa: rename dlist functions to match dispatch function names _mesa_initialize_save_table will be autogenerated. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:08 -05:00
Marek Olšák	12b1feb03e	mesa: don't set CallList* redundantly in _mesa_initialize_save_table It's set by _mesa_install_save_vtxfmt. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 12:00:07 -05:00
Marek Olšák	b8ad4fd59d	glapi: rename exec="dynamic" to exec "vtxfmt" to make it self-explanatory Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:58 -05:00
Marek Olšák	8fa0332965	mesa: move the ES2 check from vbo_init_tmp.h to install_vtxfmt It's where other API checks are done. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:57 -05:00
Marek Olšák	0f4891d77d	mesa: inline _vbo_install_exec_vtxfmt also remove unused vbo_initialize_exec_dispatch Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:57 -05:00
Marek Olšák	43a9f6a938	mesa: move _mesa_initialize_vbo_vtxfmt calls to a common place and inline Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:56 -05:00
Marek Olšák	e7c543c60f	mesa: inline _mesa_install_dlist_vtxfmt Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:56 -05:00
Marek Olšák	31baf7bc4d	mesa: inline _mesa_install_eval_vtxfmt Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:55 -05:00
Marek Olšák	086c03e839	mesa: inline _mesa_install_arrayelt_vtxfmt Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:54 -05:00
Marek Olšák	d07b0d7dd7	mesa: inline vbo_initialize_save_dispatch and rename the functions _mesa_initialize_save_table will be autogenerated. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:52 -05:00
Marek Olšák	6fcec5900e	mesa: include less stuff in dlist.c Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14000>	2021-12-14 11:59:51 -05:00
Gert Wollny	1b80e1716e	virgl: Enable higher compatibility profiles if host supports it v2: Update CI expectations Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Italo Nicola <italonicola@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12542>	2021-12-14 12:32:26 +00:00
Gert Wollny	b8b70fc1b9	ci: pin virglrenderer version Currently we always just pull in whatever version of virglrenderer happens to be TOT in googlesource. Instead, pin a specific version, and this should also trigger an update of the container when this versions is changed. v2: Fix spelling error (tomeu) Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12542>	2021-12-14 12:32:26 +00:00
Rhys Perry	451e6c1b32	radv: have the null winsys set more fields I copied stuff from ac_gpu_info.c until there were no Sienna Cichild or Polaris10 fossil-db changes between real hardware and RADV_FORCE_FAMILY. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14126>	2021-12-14 12:11:50 +00:00
Bas Nieuwenhuizen	9e8fa8168b	radv: Expose the ETC2 emulation. As needed on Android (as it is required) and by driconf flag otherwise. The non-Android case would be on the host side for an Android VM. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14071>	2021-12-14 11:30:48 +00:00
Bas Nieuwenhuizen	a8078bab91	radv: Deal with border colors with emulated ETC2. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14071>	2021-12-14 11:30:48 +00:00
Bas Nieuwenhuizen	1153db23f5	radv: Add ETC2 decode shader. To make sure that apps actually get something when the HW doesn't support ETC2. To do that we decompress after every copy operation. Includes a quite complicated decode shader. It is not bit-to-bit equivalent to AMD APUs that support ETC2, but close enough to pass CTS. Likely missing bits are related to the R11 and R11G11 formats where we decode to 16 bits but likely do the extension differently. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14071>	2021-12-14 11:30:48 +00:00
Bas Nieuwenhuizen	42a71be793	radv: Add extra plane for decoding ETC images with emulation. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14071>	2021-12-14 11:30:48 +00:00
Bas Nieuwenhuizen	c55ebdb76d	radv: Use the correct base format for reintepretation. Going to hit it when emulating ETC2 through another plane. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14071>	2021-12-14 11:30:48 +00:00
Bas Nieuwenhuizen	7c5fe66f8a	radv: Set up ETC2 emulation wiring. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14071>	2021-12-14 11:30:48 +00:00
Roman Stratiienko	ef3b31c967	v3d: Don't force SCANOUT for PIPE_BIND_SHARED requests This was workaround for the users of gbm_bo_create_with_modifiers(), which were unable to specify the buffer usage (GPU / GPU+DISPLAY). But after the commit [1] this become possible. And forcing usage to GBM_BO_USE_SCANOUT migrated directly into gbm_bo_create_with_modifiers [2], allowing us to remove such workarounds from the drivers. This makes possible to allocate the buffers in VRAM using {gbm_bo_create_with_modifiers2 \| gbm_bo_create} and providing correct use flag thus saving CMA memory. This should also enable tiling for such buffers. [1]: `268e12c605` ("gbm: add gbm_{bo,surface}_create_with_modifiers2") [2]: `ad50b47a14` ("gbm: assume USE_SCANOUT in create_with_modifiers") Signed-off-by: Roman Stratiienko <roman.o.stratiienko@globallogic.com> Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14151>	2021-12-14 10:55:37 +00:00
Roman Stratiienko	2cbbfd23ce	v3dv: Hotfix: Rename remaining V3DV_HAS_SURFACE->V3DV_USE_WSI_PLATFORM This was somehow missed by me and during review. Fixes `fcfc4ddfcc`: ("v3dv: Fix V3DV_HAS_SURFACE preprocessor condition") Signed-off-by: Roman Stratiienko <roman.o.stratiienko@globallogic.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14190>	2021-12-14 10:33:28 +00:00
Iago Toral Quiroga	2630c8f546	broadcom/compiler: improve thrsw merge Instead of stopping the merge process when we find an instruction with an incompatible signal (such as an small immediate), keep going and see if we can merge the thrsw in a previous instruction that is compatible. total instructions in shared programs: 13409835 -> 13356648 (-0.40%) instructions in affected programs: 3556860 -> 3503673 (-1.50%) helped: 17457 HURT: 18 Instructions are helped. total max-temps in shared programs: 2353971 -> 2352956 (-0.04%) max-temps in affected programs: 13960 -> 12945 (-7.27%) helped: 703 HURT: 0 Max-temps are helped. total spills in shared programs: 12301 -> 12301 (0.00%) total sfu-stalls in shared programs: 32596 -> 32499 (-0.30%) sfu-stalls in affected programs: 225 -> 128 (-43.11%) helped: 79 HURT: 3 Sfu-stalls are helped. total nops in shared programs: 347204 -> 325234 (-6.33%) nops in affected programs: 99834 -> 77864 (-22.01%) helped: 11515 HURT: 158 Nops are helped. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14172>	2021-12-14 09:50:17 +00:00
Kostiantyn Lazukin	d4a4cd20d5	util/ra: use adjacency matrix for undirected graph Since this graph is actually not oriented, its adjacency matrix can be represented using less than half bits required by full adjacency matrix. It reduces memory consumption and number of cache misses. It also simplifies logic of growing this matrix - no need to touch adjacency bits for previously allocated number of nodes. Move adjacency bits from nodes to graph to reduce the number of allocations. No changes to shader-db. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Signed-off-by: Kostiantyn Lazukin <kostiantyn.lazukin@globallogic.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14189>	2021-12-14 09:19:01 +00:00
Tomeu Vizoso	f71713d43c	lvp: Free the driver_data pointer for all commands We were only freeing it for commands that had a struct as their parameter, but all commands can have driver_data. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5715 Reported-by: Jose Fonseca <jfonsec@vmware.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14081>	2021-12-14 09:46:39 +01:00
Juan A. Suarez Romero	b8f6685bb5	nir: use call_once() to init debug variable For data-race safety, let's use this function to ensure NIR debug is initialized only once. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14057>	2021-12-14 08:01:17 +00:00
Juan A. Suarez Romero	18c039b2e1	tgsi-to-nir: initialize NIR_DEBUG envvar This envvar is initialized when creating a NIR shader, but it needs to be used before. So initialize it here. v2 (Juan): - Use static variable for first initialization. Fixes: `f77ccdfb4a` ("nir: add NIR_DEBUG envvar") Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14057>	2021-12-14 08:01:17 +00:00
Nanley Chery	42a865730e	iris: Disable the SMEM fallback for CCS on XeHP On XeHP, CCS is only supported in local memory. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	9a188b10a5	iris: Rework the DEVICE_LOCAL heap Split it into a local-only heap (which keeps the original enum) and a local-preferred heap (which has a new enum). Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	305677e242	iris: Add and use bucket_info_for_heap Add a helper that maps a heap to the related cache bucket information. This avoids complicating existing ternaries when new cache buckets are added. Rework: * Jordan: Add default and set pointers in default branch of bucket_info_for_heap to prevent "may be used uninitialized" warning in release builds. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	cd787b4e68	iris: Add and use BUCKET_ARRAY_SIZE This improves an assert in add_bucket. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	5885a2cf13	iris: Replace "local" with "heap" in bufmgr fn params We'll want to describe more than two placement options for BOs. Switch to using the more flexible heap enum. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	7a8bf62ac8	iris: Use a num_buckets pointer in add_bucket Store a pointer to the appropriate cache bucket counter, then increment the integer it points to. This keeps us from having to add code for incrementing when a new cache bucket is added. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	b77935a83d	iris: Add and use flags_to_heap Reduces duplicated calculations. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	14ebd81ee3	iris: Replace bo->real.local with bo->real.heap We'll want to describe more than two placement options for BOs. Switch to using the more flexible heap enum. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Nanley Chery	f93892c5d3	iris: Free the local cache bucket in bufmgr_destroy Fixes: `55be94dcab` ("iris/bufmgr: Add new set of buckets for local memory.") Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14012>	2021-12-14 07:37:42 +00:00
Chia-I Wu	65576eec2e	venus: fix vn_buffer_get_max_buffer_size The binary search can lead to infinite loop. Fixes dEQP-VK.api.object_management.alloc_callback_fail.device where vn_CreateBuffer can always fail. Fixes: `a74f2495ca` ("venus: implement vn_buffer_get_max_buffer_size") Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14184>	2021-12-14 05:49:08 +00:00
Alyssa Rosenzweig	d92e353a11	pan/mdg: Fix definition of UBO unpack Needed to link the disassembler separate from the rest of the compiler, as in out-of-tree pandecode builds. Which I haven't done for Midgard in well over a year, enough time for this to bit rot. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14185>	2021-12-14 03:42:28 +00:00
Rafael Antognolli	a026d2d11c	intel/compiler: Assert that unsupported tg4 offsets were lowered for XeHP Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14142>	2021-12-13 16:59:44 -08:00
Jordan Justen	52a55f097f	intel/compiler: Use nir_lower_tex_options::lower_offset_filter for tg4 on XeHP Based on Rafael's: * "nir/lower_tex: Add option to lower offset for tg4 too." * "intel/compiler: Lower offsets for tg4 on gen9+." * "WIP: Do not lower basic offsets." * "WIP: intel/compiler: Enable lowering offsets restriction." But, with these changes: * Fixed range checking to be signed 4 bits * Converted to filter * Apply only to gfx12.5+ * Use nir_src_is_const / nir_src_comp_as_int (s-b Jason) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14142>	2021-12-13 16:59:37 -08:00
Jordan Justen	211e0606c7	nir/lower_tex: Add filter for tex offset lowering Rework: * Add callback_data (s-b Jason) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14142>	2021-12-13 16:56:23 -08:00
Jordan Justen	abace2b8a4	iris: Align buffer VMA to 2MiB for XeHP Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14155>	2021-12-13 22:29:18 +00:00
Jordan Justen	c17e2216dd	anv: Align buffer VMA to 2MiB for XeHP Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14155>	2021-12-13 22:29:18 +00:00
Jordan Justen	f94ff2cc03	iris: Not all gfx12+ have aux_map_ctx This code matches other similar cases in iris. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14152>	2021-12-13 13:30:48 -08:00
Jesse Natalie	36425c43c9	glapi: Never use dllimport/dllexport for TLS vars on Windows Fixes: `c691149f` ("win32: Fixes thread local on win32 with clang/mingw") Reviewed-by: Yonggang Luo <luoyonggang@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14162>	2021-12-13 16:56:06 +00:00
Rhys Perry	15a375b4c8	radv,aco: don't lower some ffma instructions GFX10.3 has no v_mad_f32 and we can't recombine exact ffma into a v_fma_f32 if they're split. GFX9+ only has v_fma_f16 and no generation has a 64-bit MAD. fossil-db (GFX10.3): Totals from 84040 (57.46% of 146267) affected shaders: VGPRs: 3717256 -> 3688064 (-0.79%); split: -0.87%, +0.08% SpillSGPRs: 10419 -> 10403 (-0.15%) CodeSize: 263064884 -> 262442820 (-0.24%); split: -0.31%, +0.07% MaxWaves: 2036908 -> 2038374 (+0.07%); split: +0.10%, -0.03% Instrs: 49849448 -> 49572182 (-0.56%); split: -0.60%, +0.04% Latency: 908130602 -> 907764246 (-0.04%); split: -0.18%, +0.14% InvThroughput: 207051300 -> 206762704 (-0.14%); split: -0.24%, +0.10% fossil-db (GFX10): Totals from 2 (0.00% of 146267) affected shaders: Latency: 8123 -> 8107 (-0.20%) fossil-db (GFX9): Totals from 2 (0.00% of 146401) affected shaders: (no statistics affected) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Rhys Perry	165ca5088b	radv,aco: implement nir_op_ffma Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Rhys Perry	c5f02a1cd3	aco: swap multiplication operands if needed to create v_fmac_f32/etc For v_pk_fma_f32 and v_fma_f32 from nir_op_ffma, we don't try to put scalars in the first operand. No fossil-db changes. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Rhys Perry	f4f5d577fc	aco: swap operands if necessary to create v_madak/v_fmaak Also rewrite the check_literal logic to be more straightforward. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Rhys Perry	2665320c78	aco: create v_fmamk_f32/v_fmaak_f32 from nir_op_ffma Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Rhys Perry	a487747ebd	aco: use more predictable tiebreaker when forming MADs fossil-db (GFX10.3): Totals from 84981 (58.10% of 146267) affected shaders: VGPRs: 3829896 -> 3820480 (-0.25%); split: -0.33%, +0.08% CodeSize: 270860472 -> 270850132 (-0.00%); split: -0.08%, +0.08% MaxWaves: 2035822 -> 2042516 (+0.33%); split: +0.39%, -0.06% Instrs: 51285526 -> 51308869 (+0.05%); split: -0.03%, +0.08% Latency: 931503706 -> 932556231 (+0.11%); split: -0.19%, +0.30% InvThroughput: 217084232 -> 217070849 (-0.01%); split: -0.12%, +0.11% fossil-db (GFX10): Totals from 85520 (58.47% of 146267) affected shaders: VGPRs: 3729132 -> 3725344 (-0.10%); split: -0.21%, +0.10% CodeSize: 272796500 -> 272783084 (-0.00%); split: -0.09%, +0.08% MaxWaves: 2246410 -> 2249012 (+0.12%); split: +0.17%, -0.05% Instrs: 51643962 -> 51664865 (+0.04%); split: -0.04%, +0.08% Latency: 932331949 -> 933274979 (+0.10%); split: -0.19%, +0.29% InvThroughput: 214187040 -> 214130994 (-0.03%); split: -0.13%, +0.11% fossil-db (GFX9): Totals from 84619 (57.80% of 146401) affected shaders: SGPRs: 5366240 -> 5366944 (+0.01%); split: -0.09%, +0.10% VGPRs: 3765608 -> 3764972 (-0.02%); split: -0.23%, +0.22% CodeSize: 263634732 -> 263616320 (-0.01%); split: -0.08%, +0.08% MaxWaves: 546617 -> 547091 (+0.09%); split: +0.18%, -0.09% Instrs: 51426195 -> 51458334 (+0.06%); split: -0.03%, +0.10% Latency: 1164445660 -> 1161923480 (-0.22%); split: -0.46%, +0.24% InvThroughput: 542964697 -> 542329595 (-0.12%); split: -0.26%, +0.14% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9805>	2021-12-13 11:22:33 +00:00
Samuel Pitoiset	9a388beda7	radv: ignore dynamic inheritance if the render pass isn't NULL From the Vulkan spec: "If the pNext chain of VkCommandBufferInheritanceInfo includes a VkCommandBufferInheritanceRenderingInfoKHR structure, then that structure controls parameters of dynamic render pass instances that the VkCommandBuffer can be executed within. If VkCommandBufferInheritanceInfo::renderPass is not VK_NULL_HANDLE, or VK_COMMAND_BUFFER_USAGE_RENDER_PASS_CONTINUE_BIT is not specified in VkCommandBufferBeginInfo::flags, parameters of this structure are ignored." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14109>	2021-12-13 10:48:44 +00:00
Samuel Pitoiset	841949e50b	radv: fix dynamic rendering inheritance if the subpass index isn't 0 The driver will always create only one subpass in the render pass for inheritance but the subpass index isn't always zero. This fixes dEQP-VK.multiview.dynamic_rendering.secondary_cmd_buffer*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14109>	2021-12-13 10:48:44 +00:00
Samuel Pitoiset	43022ecc3a	radv: enable lower_lod_zero_width This fixes dEQP-VK.glsl.texture_functions.query.texturequerylod.*. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14147>	2021-12-13 10:00:07 +00:00
Samuel Pitoiset	be53b3d1bf	nir/lower_tex: add lower_lod_zero_width On AMD, the hardware will return 0 for the raw LOD if the sum of the absolute values of derivatives is 0 but Vulkan expects the value to be in the [-inf, -22.0f] range. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14147>	2021-12-13 10:00:07 +00:00
Pierre-Eric Pelloux-Prayer	51e772586c	radeonsi: use max_zplanes after the last write Fixes: `c0f723ce2b` ("radeonsi: allow and finish TC-compatible MSAA HTILE") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14089>	2021-12-13 09:13:46 +00:00
Pierre-Eric Pelloux-Prayer	84fea554e3	radeonsi: silence a warning Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14089>	2021-12-13 09:13:46 +00:00
Pierre-Eric Pelloux-Prayer	573d645133	radeonsi: fix fast clear / depth decompression corruption Insert a flush after a depth decompression pass if the texture was fast cleared. This fixes a corruption which seems to only affect gfx10.3 chips. Ideally we should also clear tex->need_flush_after_depth_decompression after a flush but there's no easy way for this so this commit will introduce extra flushes. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14089>	2021-12-13 09:13:46 +00:00
Marcin Ślusarz	87f03b1662	nir: limit lower_clip_cull_distance_arrays input to traditional stages Compute, task, mesh & raytracing stages don't support ClipDistance/CullDistance as input. This change is not needed for correctness. Just something I stumbled on. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14149>	2021-12-13 08:32:23 +00:00
Roman Stratiienko	fcfc4ddfcc	v3dv: Fix V3DV_HAS_SURFACE preprocessor condition Currently V3DV_HAS_SURFACE is always defined. There is no WSI for Android in mesa3d, therefore WSI related extensions should not be exposed. 1. Define V3DV_HAS_SURFACE only for platforms which has WSI implemented. 2. Rename V3DV_HAS_SURFACE -> V3DV_USE_WSI_PLATFORM to align naming with other platforms. Fixes dEQP-VK.wsi.android.surface#query_protected_capabilities Fixes: `79e4451430` ("v3dv: move extensions table to v3dv_device") Signed-off-by: Roman Stratiienko <roman.o.stratiienko@globallogic.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14144>	2021-12-13 07:11:20 +00:00
Caio Oliveira	2ad11b39bd	intel/compiler: Use a struct for brw_compile_bs parameters Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14139>	2021-12-13 01:08:16 +00:00
Caio Oliveira	58c4a95320	intel/compiler: Use a struct for brw_compile_gs parameters Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14139>	2021-12-13 01:08:16 +00:00
Caio Oliveira	acf2d3c78b	intel/compiler: Use a struct for brw_compile_tes parameters Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14139>	2021-12-13 01:08:16 +00:00
Caio Oliveira	7372a48a4a	intel/compiler: Use a struct for brw_compile_tcs parameters Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14139>	2021-12-13 01:08:16 +00:00
Dave Airlie	76da456954	crocus: cleanup bo exports for external objects This might have led to a leak in firefox/webrender/webgl scenarios Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14167>	2021-12-13 10:31:12 +10:00
Marek Olšák	9ff086052a	radeonsi: unroll loops of up to 128 iterations It's not exactly 128 because longer loop bodies scale the number down. This improves perf for VP13/Creo and Piano. Most other tests either didn't show any difference or are CPU-bound. v2: - The lowering passes had to be moved to the optimization loop because unrolling creates lowerable variables. - Piano has some pattern that looks like corruption and the pattern changed with loop unrolling. The pattern is present on other drivers as well. v3: - I removed the Piano test from CI traces because the image is random. The output was wrong even before this MR, and now it's randomly wrong. \| PERCENTAGE DELTAS \| Shaders \| SGPRs \| VGPRs \|SpillSGPR \|SpillVGPR \| PrivVGPR \| Scratch \| CodeSize \| MaxWaves \| \|------------------------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\| \| alien_isolation \| 2936\| . \| 0.02 %\| . \| . \| . \| . \| 0.83 %\| . \| \| deadcore \| 76\| 18.47 %\| . \| . \| . \| . \| . \| 167.69 %\| . \| \| deus_ex_mankind_div.. \| 1410\| 0.10 %\| 0.15 %\| . \| . \| . \| . \| 1.70 %\| . \| \| f1-2015 \| 775\| 0.37 %\| 0.16 %\| . \| . \| . \| . \| 3.25 %\| -0.07 %\| \| hitman \| 1413\| 0.10 %\| -0.03 %\| 6.45 %\| . \| . \| . \| 0.61 %\| 0.03 %\| \| metro_2033_redux \| 2670\| . \| . \| . \| . \| . \| . \| 0.13 %\| 0.01 %\| \| pixmark-piano-0.7.0 \| 2\| . \| 14.29 %\| -100.00 %\| . \| . \| . \| 78.07 %\| -4.76 %\| \| reflections_subway \| 98\| -0.53 %\| . \| . \| . \| . \| . \| 7.64 %\| . \| \| thea \| 172\| 0.12 %\| -0.81 %\| . \| . \| . \| . \| 0.65 %\| 0.15 %\| \| ubershaders \| 54\| . \| . \| . \| . \| . \| . \| 61.13 %\| . \| \| ue4_effects_cave \| 290\| 0.05 %\| . \| . \| . \| . \| . \| 2.62 %\| . \| \| vp13-creo \| 26\| -3.38 %\| -4.20 %\| . \| . \| . \| . \| 88.56 %\| 2.62 %\| \| vp13-sw \| 100\| -0.36 %\| -9.14 %\| . \| -100.00 %\| . \| -100.00 %\| -17.97 %\| 0.39 %\| \| vp20-creo \| 22\| -0.82 %\| -3.33 %\| . \| . \| . \| . \| 81.59 %\| 1.51 %\| \| vp20-sw \| 296\| -4.51 %\| -0.63 %\| . \| . \| . \| . \| 58.93 %\| 0.20 %\| \|------------------------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\| \| All affected \| 189\| 3.05 %\| -2.87 %\| 500.00 %\| -100.00 %\| . \| -100.00 %\| 135.61 %\| 1.32 %\| \|------------------------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\|----------\| \| Total \| 57794\| 0.01 %\| -0.02 %\| 0.27 %\| -3.13 %\| . \| -2.89 %\| 1.73 %\| . \| Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	af9ec3c45d	radeonsi: add shader profiles that disable binning Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	4fd8171f64	radeonsi: print more stats for shader-db Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	b3b2f97f2e	radeonsi: add Wave32 heuristics and shader profiles This generally works well. There are new cases that select Wave32, and there are shader profiles which adjust that. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	e2a1883337	glsl: fix setting compiled_source_sha1 without a shader cache We need to set it even if Cache == NULL. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	2785141c16	nir: add nir_has_divergent_loop function Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	26b522eae5	nir: serialize divergent fields Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	6eb3fe2d4f	nir: disable a NIR test due to undebuggable & locally unreproducible CI failures debian-vulkan but not any other CI pipeline consistently fails with: FileNotFoundError: [Errno 2] No such file or directory: 'nir_tests.xml' I have to assume that either debian-vulkan is broken, or the NIR test infrastructure is broken. That's not all. I got the same failure when I wanted to add a new test, which means the CI is preventing us from adding new NIR tests, which is a very serious problem with the CI or NIR tests. The python error doesn't imply that it's a test failure, so something else is broken. If you don't want such commits to happen again, print better error messages. See also the discussion in the MR. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Marek Olšák	2ab310b78b	nir: handle more intrinsics in divergence analysis Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13966>	2021-12-11 20:07:35 +00:00
Italo Nicola	f0eb163ae0	drisw: do an MSAA resolve when copying the backbuffer When calling glXCopySubBuffer, we must resolve the backbuffer before copying it the frontbuffer. Fixes piglit's glx/glx-copy-sub-buffer on virgl. Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11714>	2021-12-11 17:49:00 +00:00
Italo Nicola	6740f34568	virgl: flush cmd buffer when flushing frontbuffer When a resource is multisampled, we usually submit a multisampling resolving blit before we present it or use it in some other way, but currently we don't always flush the cmd buffer before flushing the frontbuffer, this commit fixes that. Fixes piglit's glx/glx-copy-sub-buffer MSAA cases on vtest, in conjunction with other commits of this series. Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11714>	2021-12-11 17:49:00 +00:00
Italo Nicola	0577a142de	virgl/vtest: implement resource_create_front This is required for glXCopySubBufferMESA to work. Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11714>	2021-12-11 17:49:00 +00:00
Italo Nicola	b6d0447027	virgl/vtest: use correct resource stride in flush_frontbuffer The displaytarget's resource stride is alignment is currently 64-bytes, where the shared resource stride is unaligned. Signed-off-by: Italo Nicola <italonicola@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11714>	2021-12-11 17:49:00 +00:00
Caio Oliveira	a235e02787	util: Use ralloc for strings in cache test Also avoid warnings about asprintf result not being checked. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14054>	2021-12-11 08:06:10 +00:00
Caio Oliveira	51351760c2	util: Convert cache test to use gtest Replace a bunch of helper functions for checking results with ones from GTest. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14054>	2021-12-11 08:06:10 +00:00
Jason Ekstrand	88e97d75d0	intel/dev: Add gtt_size to devinfo Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13647>	2021-12-11 05:05:19 +00:00
Jason Ekstrand	1f559930b6	anv: Stop doing too much per-sample shading We were setting anv_pipeline::sample_shading_enable based on sampleShadingEnable without looking at minSampleShading. We would then pass this value into nir_lower_wpos_center which would add sample_pos to frag_coord. Then the back-end compiler picks up on the existence of sample_pos and forces persample dispatch. This leads to doing per-sample dispatch whenever sampleShadingEnable = VK_TRUE regardless of the value of minSampleShading. This is almost certainly costing us perf somewhere. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14022>	2021-12-11 04:40:20 +00:00
Nanley Chery	5197809302	iris: Update the initial CCS state on XeHP We can't map the CCS on this platform to initialize it into the PASS_THROUGH state. This can cause issues with optimizations in the driver that rely on this state. For example, after rendering to a surface with AUX_NONE, we can then render to it with AUX_CCS_E without an ambiguate in between (if the CCS in the PASS_THROUGH state). If that state was incorrect and the aux was actually compressed, there can be rendering corruption because the contents may be misinterpreted on the second render. Use a more accurate initial aux state to avoid these issues. One notable change in behavior here is that aux surfaces can be created with fast-cleared blocks even though the caller may specify a modifier that doesn't support fast clears. This should be fine, so long as all HW units that can access these surfaces can handle that bit-pattern. We haven't seen an applicable restriction yet. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	eef4399afd	iris: Modify the comment about zeroing CCS Among other changes, we highlight the fact that we'll map the CCS - something we can't do on XeHP. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	cba6d6cad3	iris: Don't assert a NULL aux BO during aux config The assert was introduced in a function that allocated an auxiliary surface BO, iris_resource_alloc_aux. After refactors, the function it's in now, iris_resource_configure_aux, no longer does this allocation. Drop the assert because its purpose is unclear and it's no longer relevant for CCS on XeHP. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	7d3200a37d	iris: Don't allocate and initialize CCS on XeHP The memory for CCS on XeHP can't be mapped by the CPU. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	656d34a811	iris: Drop row pitch param from iris_get_ccs_surf This parameter won't be used for XeHP, because we can't directly control the row pitch of the CCS independently from the main surface. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	7d57c9959e	iris: Don't allocate a clear color BO for some Z/S The only depth/stencil aux usage that can actually use the BO is ISL_AUX_USAGE_HIZ_CCS_WT. Even with that aux usage, iris may disable sampling depending on the surface configuration. Allocate the clear color BO when it'd be usable, not just when the auxiliary surface size is non-zero on ICL+. This prepares for CCS on XeHP, which won't have an auxiliary surface. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	fecd6ae38e	iris: Simplify iris_get_aux_clear_color_state_size isl_dev.ss.clear_color_state_size is already zero on BDW and SKL. Drop the redundant platform check and return the field directly. We're going to have this function return zero more often and it will do so uniformly using if-statements. We choose to remove the redundant expression instead of adding a redundant if-statement. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	4027337004	iris: Move some BO setup to iris_resource_init_aux_buf To ease verification, place the assignment and reference of the aux BO right before the same operations are done for the clear color BO. Also, move the call to map_aux_addresses that's in the same if-block. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	9acf0316ec	iris: Use the aux BO and surf less during init res->aux.bo and res->aux.surf will be NULL and zeroed, respectively, for CCS on XeHP. Move and modify iris_resource_init_aux_buf to support this. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	02bbdb0e92	iris: Change a param of iris_resource_init_aux_buf Have iris_resource_init_aux_buf compute the clear color state size (with an iris_screen struct) instead of passing it in directly. We're going to move the function call soon. This keeps us from having to move a passed in variable along with it. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	18231fc548	intel/blorp: Modify get_fast_clear_rect for XeHP The alignment and scale down values have changed on this platform. To support drivers that won't use a CCS surface on this platform, this patch computes the CCS fast clear rectangle using the main surface. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	c35b8a2889	intel/blorp: Modify the SKL+ CCS resolve rectangle According to Bspec 2424, "Render Target Resolve": The Resolve Rectangle size is same as Clear Rectangle size from SKL+. Use get_fast_clear_rect in blorp_ccs_resolve for SKL+. Note that the Bspec differs from Vol7 of the Sky Lake PRM, which only specifies aligning by the scaledown factors. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Nanley Chery	91128b1a0f	intel/isl: Require aux map for some 64K alignment The comment states that 64K alignment of surfaces is required when an aux map is present on the platform. However, the code checks for GFX12 instead of dev->info->has_aux_map. Update the code to match the comment. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13555>	2021-12-11 04:14:20 +00:00
Jesse Natalie	46ef1e8389	ci/windows: Remove line numbers from assertions in spirv2dxil tests Reviewed-by: Enrico Galli <enrico.galli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14156>	2021-12-11 03:43:11 +00:00
Lucas Stach	fc17f79f2c	etnaviv: fix alpha blend with dither on older GPUs While setting up DITHER_MODE allows alpha blending to work properly together with dithering on new GPUs (those with PE_DITHER_FIX), older cores still change the render target. As dithering is optional and implementation defined we can simply disable it on the affected GPUs, when alpha blending is enabled to work around this bug. Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13396>	2021-12-11 03:06:15 +00:00
Emma Anholt	d199d65c3a	nir/nir_opt_move,sink: Include load_ubo_vec4 as a load_ubo instr. We weren't doing much motion in nir-to-tgsi because we considered all our lowered load-ubos as unmovable. softpipe shader-db: total temps in shared programs: 563942 -> 563136 (-0.14%) temps in affected programs: 9833 -> 9027 (-8.20%) r300 shader-db: instructions in affected programs: 22858 -> 23575 (3.14%) temps in affected programs: 2039 -> 1813 (-11.08%) (NIR had given r300 -19% instrs for +40% temps, so this feels like a worthwhile trade back). Reivewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14138>	2021-12-11 02:12:27 +00:00
Erico Nunes	013900c4d2	mesa: fix GL_MAX_SAMPLES with GLES2 EXT_multisampled_render_to_texture on GLES2 allows the GL_MAX_SAMPLES_EXT enum to be used. Move the condition from the GLES3 section to the GLES2 one so that it stops returning GL_INVALID_ENUM in that case. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13967>	2021-12-11 01:31:59 +00:00
Silvestrs Timofejevs	c02b75d22e	egl: add config debug printout Feature to print out EGL returned configs for debug purposes. 'eglChooseConfig' and 'eglGetConfigs' debug information printout is enabled when the log level equals '_EGL_DEBUG'. The configs are printed, and if any of them are "chosen" they are marked with their index in the chosen configs array. v2: a) re-factor the code in line with review comments b) rename function _snprintfStrcat, split it out and put into the src/util/u_string.h, make it a separate patch. v3: remove unnecessary 'const' qualifiers from the function prototype v4: a) re-factor the code in line with review comments b) move util_strnappend from utils back to eglconfigdebug.c. In my opinion this function is the best way of achieving the desired result, so it still used but made private to eglconfigdebug.c. v5: a) drop unused parameter from function signature b) more const annotations c) directly access config parameters instead of going through _eglGetConfigKey Signed-off-by: Silvestrs Timofejevs <silvestrs.timofejevs@imgtec.com> Signed-off-by: Lucas Stach <l.stach@pengutronix.de> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13705>	2021-12-11 00:25:04 +00:00
Silvestrs Timofejevs	f927a5c3d2	egl: introduce a log level getter function Being able to retrieve the log level can be useful to enable/disable debug code. The alternative, which is calling 'getenv' function every time to retrieve the log level, is more "expensive". Signed-off-by: Silvestrs Timofejevs <silvestrs.timofejevs@imgtec.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13705>	2021-12-11 00:25:04 +00:00
Jordan Justen	fd2a558bf8	intel/l3: Make DG1 urb-size exception more generic Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14136>	2021-12-11 00:09:50 +00:00
Rhys Perry	c7fa15b381	aco: improve clrx disassembly - remove uninteresting lines of output - remove binary offset before instructions, for easier diffing - replace generated labels with block numbers - add encoded instructions at the end of lines, like LLVM dissaembly - print constant data instead of trying to disassemble it Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14042>	2021-12-10 23:46:30 +00:00
Jesse Natalie	11eb03c44e	microsoft/compiler: Remove algebaric pass for inot Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14140>	2021-12-10 23:23:29 +00:00
Jesse Natalie	45354be410	microsoft/compiler: Implement inot Fixes: `cb283616` ("nir/algebraic: Small optimizations for SpvOpFOrdNotEqual and SpvOpFUnordEqual") Reviewed-by: Enrico Galli <enrico.galli@intel.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14140>	2021-12-10 23:23:29 +00:00
Khem Raj	249556dad8	v3dv: account for 64bit time_t on 32bit arches This makes is a bit more portable, especially on 32bit architectures with 64bit time_t defaults. Especially on musl its a must. Fixes ../mesa-21.3.0/src/broadcom/vulkan/v3dv_bo.c:71:15: error: format specifies type 'long' but the argument has type 'time_t' (aka 'long long') [-Werror,-Wformat] time.tv_sec); ^~~~~~~~~~~ Also reported here [1] [1] https://github.com/agherzan/meta-raspberrypi/issues/969 Signed-off-by: Khem Raj <raj.khem@gmail.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14118>	2021-12-10 22:26:56 +00:00
Samuel Pitoiset	e2ad92eb22	radv: do not perform depth/stencil resolves for suspended render pass From the Vulkan spec: "Store and resolve operations are only performed at the end of a render pass instance that does not specify the VK_RENDERING_SUSPENDING_BIT_KHR flag." Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14112>	2021-12-10 22:03:00 +00:00
Samuel Pitoiset	c27348a3f7	Revert "radv: Add bufferDeviceAddressMultiDevice support." This was a workaround for fixing a crash with Baldur's Gate 3 at start but the game fixed it since. This reverts commit `1fe375e7cf`. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14134>	2021-12-10 21:38:00 +00:00
Jason Ekstrand	b8d04863e2	intel/fs: Drop high_quality_derivatives We've never bothered to hook it up in crocus or iris. If we do in the future, it should probably be a NIR pasa anyway. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jason Ekstrand	6dc9958bf3	intel/compiler: Get rid of wm_prog_key::frag_coord_adds_sample_pos This hasn't actually done anything for a while. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jason Ekstrand	278d12f991	intel/fs,vec4: Drop prog_data binding tables Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jason Ekstrand	e49f65dfe0	intel/blorp: Stop depending on prog_data binding tables Instead, set BLORP_TEXTURE_BT_INDEX on the texture instructions directly. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jason Ekstrand	4fa58d27a5	intel/fs,vec4: Drop support for shader time Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jason Ekstrand	8f3c100d61	intel/fs,vec4: Drop uniform compaction and pull constant support The only driver using these was i965 and it's gone now. This is all dead code. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14056>	2021-12-10 21:20:47 +00:00
Jason Ekstrand	4175ed5099	crocus: wm_prog_key::key_alpha_test uses GL enums Fixes: `f3630548f1` ("crocus: initial gallium driver for Intel gfx 4-7") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14141>	2021-12-10 21:09:00 +00:00
Danylo Piliaiev	c82d7e3617	turnip: Fix operator precedence in address calculation macros for queries Fixes crash in Oblivion, Skyrim, Crysis running through DXVK on 32b systems. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5723 Fixes: `937dd76426` "turnip: Implement VK_KHR_performance_query" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14148>	2021-12-10 17:45:02 +00:00
Emma Anholt	5887768f48	nir_to_tgsi: Enable nir_opt_move. This moves some ops down to when they're needed, generally reducing the number of temps in use. It's not always a win -- sometimes you can end up moving a generator of a component used by a nir_op_vec down, which means that op's sources stay live while the vec (whose register likely gets coalesced with the ops creating it) is also live. But it's generally good. softpipe results: temps in affected programs: 18115 -> 18026 (-0.49%) imm in affected programs: 19 -> 22 (15.79%) r300 results: instructions in affected programs: 174 -> 178 (2.30%) vinst in affected programs: 156 -> 160 (2.56%) sinst in affected programs: 54 -> 50 (-7.41%) temps in affected programs: 2634 -> 2169 (-17.65%) Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14096>	2021-12-09 22:15:53 +00:00
Emma Anholt	7d2ea9b0ed	r300: Request NIR shaders from mesa/st and use NIR-to-TGSI. This brings us into parity on state tracker paths with most other supported drivers, and a lot of additional optimization on our shaders. Results on a subset of shader-db that doesn't crash: instructions in affected programs: 59502 -> 47991 (-19.35%) vinst in affected programs: 17633 -> 15197 (-13.82%) sinst in affected programs: 9296 -> 7319 (-21.27%) flowcontrol in affected programs: 627 -> 310 (-50.56%) presub in affected programs: 4220 -> 1554 (-63.18%) temps in affected programs: 5775 -> 8570 (48.40%) lits in affected programs: 215 -> 37 (-82.79%) The temps (register usage) increase is unfortunate, but it seems that instruction counts tend to be our limit before reg counts are. Fixes: #3354 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14096>	2021-12-09 22:15:53 +00:00
Emma Anholt	e68a9b0339	r300: Disable loop unrolling on r500. It's buggy, and we should just trust GLSL or NIR to do unrolling for us. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14096>	2021-12-09 22:15:53 +00:00
Emma Anholt	495a4cfbc3	nir_to_tgsi: Make !native_integers front face input match glsl_to_tgsi. Avoids regression on r300, which has 0.0 vs 1.0 frontface despite what tgsi.rst says. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14096>	2021-12-09 22:15:53 +00:00
Emma Anholt	f1647525ab	nir/nir_to_tgsi: Add support for "if" statements with !native_integers Previously we've only used this on HW that had all ifs lowered. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14096>	2021-12-09 22:15:53 +00:00
Emma Anholt	0b651db795	r300/ci: Add some piglit expectations. Not a full run, but a bit of sanity-check for the NTT change. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14096>	2021-12-09 22:15:53 +00:00
Adam Jackson	231ccb6100	docs: Remove no-longer-accurate text about the xlib driver Mesa hasn't supported color-index rendering in a long time, and gallium's xlib target doesn't respect MESA_GAMMA. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14125>	2021-12-09 15:08:17 -05:00
Ian Romanick	c5d731ac5c	intel/stub: Implement I915_PARAM_HAS_USERPTR_PROBE Just say no for now. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:57 -08:00
Ian Romanick	832db9d900	intel/stub: Implement DRM_I915_QUERY_MEMORY_REGIONS Borrowed from sim-drm. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:55 -08:00
Ian Romanick	4c429b6be6	intel/stub: Implement DRM_I915_QUERY_ENGINE_INFO Borrowed from sim-drm. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:53 -08:00
Ian Romanick	12d35892e0	intel/stub: Suppress warnings about DRM_I915_QUERY_PERF_CONFIG There's not a useful way to implement this, so just silence the warning to cleanup shader-db runs. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14132>	2021-12-09 10:57:48 -08:00
Rhys Perry	786d434397	aco: don't create unnecessary addition in indirect get_sampler_desc() I don't think this has any effect on GFX9+ because the addition is combined into the load. fossil-db (polaris10): Totals from 12595 (9.29% of 135627) affected shaders: SGPRs: 1054348 -> 1054860 (+0.05%); split: -0.02%, +0.07% VGPRs: 667240 -> 667320 (+0.01%); split: -0.01%, +0.02% CodeSize: 82761508 -> 82512816 (-0.30%); split: -0.30%, +0.00% MaxWaves: 62182 -> 62181 (-0.00%) Instrs: 16072934 -> 16010764 (-0.39%); split: -0.39%, +0.00% Latency: 582819635 -> 582287964 (-0.09%); split: -0.13%, +0.04% InvThroughput: 276460536 -> 276417613 (-0.02%); split: -0.06%, +0.05% VClause: 261656 -> 261654 (-0.00%); split: -0.01%, +0.01% SClause: 680952 -> 680854 (-0.01%); split: -0.05%, +0.04% Copies: 1727202 -> 1727742 (+0.03%); split: -0.12%, +0.15% Branches: 547050 -> 547033 (-0.00%); split: -0.01%, +0.00% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14043>	2021-12-09 17:58:54 +00:00
Timur Kristóf	77db4e27b1	aco: Clean up and fix quad group instructions with WQM. According to the Vulkan spec chapter 9.25 Helper Invocations, quad group operations have to be executed by helper invocations. This commit cleans up the code for quad group instructions by unifying the code path of quad broadcast with the others, and then calling emit_wqm just once at the end. Fixes: `93c8ebfa78` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5570 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13929>	2021-12-09 17:36:51 +00:00
Emma Anholt	af163d7220	loader: Restore i915g support. The cleanup of i915c cleaned up our PCI ID list. Fixes: `0cad451f00` ("classic/i915: Remove driver") Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14131>	2021-12-09 09:05:23 -08:00
Qiang Yu	c50bdacbda	glx: fix regression for drawable type detection Newer version of XServer supporting GLX_DRAWABLE_TYPE query also support query with raw X11 window ID besides GLXWindow ID. So we should not limit the suppported type to GLXPbuffer when query success. Otherwise can't start GLX application on newer XServer with: libGL error: GLX drawable type is not supported libGL error: GLX drawable type is not supported X Error of failed request: GLXBadContext Major opcode of failed request: 149 (GLX) Minor opcode of failed request: 5 (X_GLXMakeCurrent) Serial number of failed request: 35 Current serial number in output stream: 35 Fixes: `6625c960c5` ("glx: check drawable type before create drawble") Tested-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14120>	2021-12-09 00:52:17 +00:00
Ian Romanick	ff74d5dd1b	intel/compiler: Don't store "scalar stage" bits on Gfx8 or Gfx9 Since `1d71b1a311`, only Gfx7 and earlier have any vec4 stages ever. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14128>	2021-12-08 14:59:32 -08:00
Ian Romanick	4563261ad1	intel/compiler: Don't predicate a WHILE if there is a CONT Previously a predicated BREAK that appeared immediately before the WHILE would get merged into the WHILE. This doesn't work if other flow control (e.g., a CONT) can transfer directly to the WHILE. On Intel platforms, this fixes the CTS test dEQP-VK.graphicsfuzz.stable-binarysearch-tree-nested-if-and-conditional. No shader-db changes on any Intel platform. When this commit was first created (over a month before it is going to land), there were some regressions that were prevented by other commits in MR !13095. That does not appear to be the case now, so I don't know what changed. Basically, the treatment of discard as a combination of demote and terminate causes additional continues in some loops, and those continues trigger this bug. The other commits from that MR prevent those continues from being generated in the first place. All Intel platforms had simlar fossil-db results. (Ice Lake shown) Instructions in all programs: 144419989 -> 144419995 (+0.0%) SENDs in all programs: 6947332 -> 6947332 (+0.0%) Loops in all programs: 38277 -> 38277 (+0.0%) Spills in all programs: 204075 -> 204075 (+0.0%) Fills in all programs: 319480 -> 319480 (+0.0%) A few shaders in Doom 2016 were hurt by one instruction each. It seems likely that these shaders would have experienced at least some mis-rendering. Closes: #4213 Fixes: `d13bcdb3a9` ("i965/fs: Extend predicated break pass to predicate WHILE.") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14128>	2021-12-08 14:56:32 -08:00
Dave Airlie	d051854cca	treewide: drop mtypes/macros includes from main These aren't required in lots of places, so remove them. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14127>	2021-12-08 22:14:45 +00:00
Roman Stratiienko	72db15913f	v3dv: Fix dEQP-VK.info#instance_extensions test When mesa3d is built without VK_USE_PLATFORM_DISPLAY_KHR definition, dEQP test fails: dEQP : Test case 'dEQP-VK.info.instance_extensions'.. dEQP : Fail (Extension VK_KHR_get_display_properties2 is missing dependency: VK_KHR_display) dEQP : DONE! Enable KHR_get_display_properties2 only if VK_USE_PLATFORM_DISPLAY_KHR is enabled. Fixes: `f884c2e3be` ("v3dv: implement VK_KHR_get_display_properties2") Signed-off-by: Roman Stratiienko <roman.o.stratiienko@globallogic.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14047>	2021-12-08 21:34:47 +00:00
Chia-I Wu	b14dd3a866	venus: prefer VIRTGPU_BLOB_MEM_HOST3D for shmems They are logically contiguously in the host. More importantly, they enable host process isolation. Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13800>	2021-12-08 21:23:05 +00:00
Jesse Natalie	fe3a800ad3	d3d12: Use overall resource format + plane format to get format info Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14123>	2021-12-08 20:46:22 +00:00
Jesse Natalie	0312142d96	d3d12: Allow creating planar resources Also handle opening planar resources with a single handle, instead of per-plane handles. Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14123>	2021-12-08 20:46:22 +00:00
Jesse Natalie	a6db805469	d3d12: Handle opening planar resources Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14123>	2021-12-08 20:46:22 +00:00
Jesse Natalie	fb6479544b	d3d12: Force emulation of all YUV formats using per-plane formats Reviewed-by: Sil Vilerino <sivileri@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14123>	2021-12-08 20:46:22 +00:00
Bas Nieuwenhuizen	7f34b70474	radv: Use the winsys 0 cmdbuffer submission support. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14097>	2021-12-08 20:21:05 +00:00
Bas Nieuwenhuizen	7675d066ca	radv/amdgpu: Add support for submitting 0 commandbuffers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14097>	2021-12-08 20:21:05 +00:00
Bas Nieuwenhuizen	6eb0821315	radv/winsys: Add queue family param to submit. Extracting it from the first CS will not go over well once we try submitting 0 of them. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14097>	2021-12-08 20:21:05 +00:00
Bas Nieuwenhuizen	c03d258046	radv/amdgpu: Add a syncobj per queue. For merging our own dependencies in without submitting. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14097>	2021-12-08 20:21:05 +00:00
Dave Airlie	6d1a15f7fa	mesa/st: drop Draw from dd function table. Draw is only called in the feedback paths now, and the only thing it is set to is the st feedback draw path. So just always have the fallback call the draw feedback path and get rid of the draw entry. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	5312511ea5	mesa/st: move draw indirect and xfb to direct calls. These don't get used any other way. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	132ffc46dc	mesa/st: move compute to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	28bd406f50	mesa/st: move msaa functions to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	d16f514f19	mesa/st: convert DrawTex to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	601c30c8c1	mesa/st: convert the non-optional egl image to direct calls Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	7b066ebfb3	mesa/st: move blit function to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	c9fec99c4a	mesa/st: replace most of buffer funcs with direct calls. This leaves invalidate buffer for later Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	f60add48ee	mesa/st: move program calls to direct call Can't move NewProgram yet as the GLSL standalone compiler relies on it, can probably fix that up later. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	878cf8cae9	mesa/st: move copy image sub data to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	81eae71936	mesa/st: move viewport to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	ce9a50e6f8	mesa/st: move some context functions to direct calls Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	cb400f368a	mesa/st: move clear/flush/finish to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	37fb8058fb	mesa/st: move pixel/bitmap functions to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	4e13c7d46a	mesa/st: move Clear to new direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	889ac0f1b9	mesa/st: move texture APIs to direct st calls This is a bit more spreadout, but is still pretty straightforward Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	2d912b5caa	mesa/st: move fbo code to direct calling Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	6d79ce5a58	mesa/dd: drop purgeable interface This whole extension can likely be dropped, just remove dd.h bits for now. Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	b79d12bbb2	mesa/st: move perfomance monitor to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	097646cf7c	mesa/st: move perf query to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	df28e0a082	mesa/st: move query memory info to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	d0e48ef34f	mesa/st: move Enable to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
Dave Airlie	cc4964847b	mesa/st: move rendermode to direct call Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14100>	2021-12-08 19:06:48 +00:00
mwezdeck	cdc480585c	virgl/drm: New optimization for uploading textures 1. We can create resource with size of "1" on drm, because size is not passed to the renderer. 2. We can't create resource with size of "1" on vtest, because shmem is created based on that. 3. If renderer supports copy_transfer_from_host, then use staging buffer for transfer in both ways to and from host. This will allow to reduce memory consumption in the guest. v2: - add inline function for checking if we can use this optimization - add check in readback path. If renderer doesn't support copy transfer from host, then we need to go with previous path in readback (through transfer_get ioctl) v3: - fix logic for readback v4: - refactor the implementation to integrate it more to existing code base v5: - reuse COPY_TRANSFER3D in both directions v6: - encode direction in COPY_TRANSFER3D if host supports it v7: - renamed cap bit - introduced COPY_TRANSFER3D_SIZE_LEGACY define Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13689>	2021-12-08 14:01:48 +00:00
Rhys Perry	420170fabc	radv: initialize workgroup_size in radv_meta_init_shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
Rhys Perry	85161fb8ac	radv: clone shader in radv_shader_compile_to_nir This way, radv_shader_compile_to_nir doesn't alter the NIR. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
Rhys Perry	2020a1799b	radv: include RT shaders in RADV_DEBUG=shaders,shaderstats Instead of using module->nir or nir->info->name to determine if it's a meta shader, use nir->info->internal. This also has an effect of disabling printing of meta shaders with NIR_DEBUG=print. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
Rhys Perry	d74498e617	radv: add radv_meta_init_shader Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14087>	2021-12-08 11:07:40 +00:00
James Jones	c2550d1b7c	gbm: Don't pass default usage flags on ABIs < 1 Older drivers will not expect any flags from the GBM front-end when modifiers are in use, and will likely fail the allocation or handle them incorrectly as a result. Only specify usage flags when allocating from a backend with an ABI >= 1, as that's the ABI version that added support for specifying usage flags along with modifiers. Fixes: `ad50b47a14` ("gbm: assume USE_SCANOUT in create_with_modifiers") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5709 Signed-off-by: James Jones <jajones@nvidia.com> Reviewed-by: Simon Ser <contact@emersion.fr> Tested-by: Olivier Fourdan <ofourdan@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14049>	2021-12-08 09:49:47 +00:00
Samuel Pitoiset	31efef7b3a	radv: mark GFX10.3 (aka RDNA2) as conformant products with CTS 1.2.7.1 https://www.khronos.org/conformance/adopters/conformant-products#submission_599 Also bump the CTS version for GFX8, GFX9 and GFX10 because they also pass 1.2.7.1. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14076>	2021-12-08 09:27:26 +00:00
Samuel Pitoiset	d2612bb424	radv: fix resume/suspend render pass with depth/stencil attachment Shouldn't clear on resume. This fixes dEQP-VK.dynamic_rendering.*resuming. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14113>	2021-12-08 09:06:27 +00:00
Samuel Pitoiset	e18e857292	radv: add initial SPM support on GFX10+ RGP doesn't support previous generations. This can be enabled with RADV_THREAD_TRACE_CACHE_COUNTERS=true. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13737>	2021-12-08 08:40:51 +00:00
Samuel Pitoiset	f1fe91b847	radv: add few helpers for configuring performance counters Only used for SPM but VK_KHR_performance_query will be added there. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13737>	2021-12-08 08:40:51 +00:00
Samuel Pitoiset	8a0052f099	radv/sqtt: always dump pipelines and shaders ISA Even if instruction timing is disabled, both features are unrelated. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13648>	2021-12-08 08:18:26 +00:00
Alex Xu (Hello71)	3aab34171d	Fix TSD stubs for non-initial-exec case (fixes #5667 ). ppc64le TSD disabled for now since I am insufficiently familiar with ppc asm. x86 pthread stubs deleted because adding USE_ELF_TLS to the #if is isn't worth the effort since it only saves a single function call on initial entry (TSD stubs are not used for read-only text now). also potentially fix non-pthread TSD builds (_glapi_Dispatch was undefined), but untested (could still be broken). Tested-by: Jesse Natalie <jenatali@microsoft.com> Tested-by: Jan Beich <jbeich@freebsd.org> Signed-off-by: Alex Xu (Hello71) <alex_y_xu@yahoo.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13935>	2021-12-08 00:10:29 -05:00
Adam Jackson	b713fca495	glapi: Remove remnants of EXT_paletted_texture and the imaging subset The GLX code had to special case these for uninteresting reasons, but we don't support them anymore in Mesa so all this would do is keep them sorta-working over GLX protocol. Given that Mesa hasn't supported them on the renderer side since ~2011 let's stop pretending they're real. If we get around to modernizing the indirect GLX code (hah) we can revisit these then. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14085>	2021-12-08 03:42:44 +00:00
Thong Thai	7ba0c68e31	radeon/vcn: implement encoder dpb management Previously, the number of previously encoded frames the encoder handled was 1 - the encoder now supports many more encoded pictures, so the encoder now has to keep track of multiple reconstructed pictures. v2: Add a check to make sure an array index is not negative (Boyuan) Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13915>	2021-12-08 03:27:42 +00:00
Thong Thai	96b276b327	radeon: hardcode uvd/vce encoder not_referenced value to false Sets the not_referenced parameter to be the same as the previously hardcoded frontends/va value (false) to ensure UVD/VCE encoding functionality remains unaffected by the change in frontends/va code. This commit will eventually be reverted once more testing is completed. Fixes: a90802ef644 ("frontends/va/enc: allow for frames to be marked as (not) referenced") Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13915>	2021-12-08 03:27:42 +00:00
Thong Thai	e44fef8dd6	frontends/va/enc: allow for frames to be marked as (not) referenced Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13915>	2021-12-08 03:27:42 +00:00
Thong Thai	ad3ed91b1f	radeon/vcn: increase encoder dpb size Base the number of reconstructed pictures the encoder allocates based on the number of reference pictures to be used for encoding. Also move the calculation and allocation of reconstructed pictures to VCN 1, from VCN 2. v2: Add back the accidentally deleted 'two_pass_search_center_map_offset' (Boyuan) Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13915>	2021-12-08 03:27:42 +00:00
Thong Thai	31b033eec2	frontends/va/enc: hardcode h265 encoder ref pic list size Set the size of the ref pic list0 for the h265 encoder to 1. Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13915>	2021-12-08 03:27:42 +00:00
Thong Thai	c5b7fb998f	frontends/va: disable packed header support for h264 encoder Packed headers for the H.264 encoder has not been implemented yet, so don't report packed header support for the H.264 encoder Signed-off-by: Thong Thai <thong.thai@amd.com> Reviewed-by: Boyuan Zhang <boyuan.zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13915>	2021-12-08 03:27:42 +00:00
Cherser-s	934bc24fe9	radv: handle VK_DESCRIPTOR_TYPE_SAMPLER in VK_VALVE_mutable_descriptor_type extension Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13842>	2021-12-08 03:03:43 +00:00
Francisco Jerez	de55fd358f	intel/fs/xehp: Teach SWSB pass about the exec pipeline of FS_OPCODE_PACK_HALF_2x16_SPLIT. This virtual instruction is translated into multiple half float physical instructions, even though its destination is typically of integer type, which prevents the software scoreboard pass from inferring the correct execution pipeline for the virtual instruction on XeHP+ platforms. Teach the SWSB lowering pass about this inconsistency between the IR and physical instruction types. Fixes among other tests: dEQP-GLES31.functional.shaders.builtin_functions.pack_unpack.packhalf2x16_compute Fixes: `d4537770bb` ("intel/fs: Add helper functions inferring sync and exec pipeline of an instruction.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5685 Reported-by: Tapani Pälli <tapani.palli@intel.com> Tested-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14002>	2021-12-08 02:47:11 +00:00
Emma Anholt	7e9158761a	r300/ci: Update loop expectations from running "-t loops" which I hadn't totally covered before. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	8ddefb8ea5	r300: Route shader stats output to ARB_debug_output. This lets us use shader-db to compare stats on shaders, rather than having to manually review the RADEON_DEBUG=pstat output. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	141302e61f	r300: Precompile the FS at shader creation time. This should reduce jank at first draw, and is also good prep for doing shader-db. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	e9dd776ef9	r300: Remove the non_normalized_coords from the shader key. TexSrcTarget has to be RECT when this is set, anyway. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	26b3e2f7cd	r300: Also consider ALU condition modifiers for loop DCE. Since we typically use an ALU op to set the condition modifier for the IF-BRK-ENDIF, we were particularly likely to remove the increment of the loop counter! Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	d6fed4ab7d	r300: Ensure that immediates have matching negate flags too. We only have one bit of negate, so we have to make sure that immediate usage has matching negates on all used channels (or rewrite to do so). Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	915af8de8b	r300: Cache the var list in the peephole_mul_omod() loop. If we're not making progress (which the function was already giving us!), then there's no need to recompute the list. Reduces pixmark-piano/7.shader_test compile time from 50 seconds to 10. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Emma Anholt	42e8f48be7	r300: Move the instruction filter for r500_transform_IF() to the top. rc_get_variables() is slow, don't call it if we're going to just exit immediately anyway. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14117>	2021-12-08 02:35:52 +00:00
Mike Lothian	1050686afe	meson: Fix dri.pc dridriverdir Change dridriversdir to dridriverdir Fixes: `3ae3569d82` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5726 Signed-off-by: Mike Lothian <mike@fireburn.co.uk> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Tested-by: Marcus Seyfarth <m.seyfarth@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14111>	2021-12-08 01:30:00 +00:00
Ilia Mirkin	0db2e78788	freedreno/ci/a306: increase concurrency No harm from using more threads, but not enough benefit to reduce parallelism unfortunately. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14067>	2021-12-08 00:50:25 +00:00
Ilia Mirkin	3db30ea877	freedreno/ci/a306: add more skips These come up with increased concurrency. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14067>	2021-12-08 00:50:25 +00:00
Dave Airlie	34804e1266	intel/crocus: push main/macros.h out to the users Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	9105cf1955	intel/compiler: drop shader_info.h from compiler header include it explicitly in the correct places Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	9265d1d62d	brw/compiler: drop mtypes.h from compiler This adds a bunch of other headers in, and adds mtypes.h to iris for perf query object. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	3f35b5fdc9	anv: include futex.h explicitly in allocator. This file needs futexes so make an explicit include, so it doesn't come via the compiler Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Dave Airlie	244fa81c13	mesa: move _mesa_varying_slot_in_fs to shader_enums This doesn't need anything from mtypes.h, just changes types to non GL equivalents Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14104>	2021-12-07 23:59:58 +00:00
Nanley Chery	0733266706	intel/isl: Drop extra devinfo checks for CCS support These checks are done in isl_format_supports_ccs_*. Since isl_surf_supports_ccs calls these functions, it doesn't need to check them itself. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14082>	2021-12-07 23:31:23 +00:00
Nanley Chery	99b320fc68	iris: Drop the YCRCB cases in finish_aux_import We recently added native support for these formats in gallium and ISL. See commits: * (gallium/dri) `f57c074270` * (intel/isl) `3fa16b3025` Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14082>	2021-12-07 23:31:23 +00:00
Nanley Chery	0e075227d8	intel/isl: Restore CCS_E support for YUYV and UYVY These formats are used when creating surfaces with the I915_FORMAT_MOD_Y_TILED_GEN12_MC_CCS modifier. Makes iris pass the out-of-tree piglit test, ext_image_dma_buf_import-intel-modifiers. Fixes: `1433fe7860` ("intel/isl: Unify fmt checks in isl_surf_supports_ccs") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14082>	2021-12-07 23:31:23 +00:00
Erik Faye-Lund	5439e3c412	docs: remove stale notice about deleted dir We're not going to move this directory like this comment suggests, as the suggested target no longer exists. Let's just drop the mention. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14079>	2021-12-07 22:54:27 +00:00
Erik Faye-Lund	d16263cdee	docs: remove mentions of deleted code Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14079>	2021-12-07 22:54:27 +00:00
Erik Faye-Lund	dc81cd1931	ci: remove testing of deleted code Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14079>	2021-12-07 22:54:27 +00:00
Erik Faye-Lund	e03334e977	CODEOWNERS: remove ownership of deleted code Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14079>	2021-12-07 22:54:27 +00:00
Michel Zou	558bc2227e	meson: check -mtls if has_exe_wrapper Fixes: `60d95c5d` (Auto-enable TLSDESC support) Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14086>	2021-12-07 22:24:52 +00:00
Manas Chaudhary	def254b05f	panvk: Add check for null fence Signed-off-by: Manas Chaudhary <manas.chaudhary@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14108>	2021-12-07 21:18:44 +00:00
Danylo Piliaiev	c749da6135	ir3,turnip: Add support for GL_KHR_shader_subgroup_quad Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	3dfd4230bb	ir3,turnip: Enable subgroup ops support in all stages on gen4 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	ded51fd39e	ir3: Use getfiberid for SubgroupInvocationID on gen4 Since it requires (ss) categorize it as is_sfu() and not is_mem(). Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Danylo Piliaiev	d1c49901df	ir3: Add gen4 new subgroup instructions * getlast.w8 #4 - Perform jump for the first (CLUSTER_SIZE-1) fibers in a subgroup * brcst.active.w8 - necessary to implement arithmetic subgroup operations with prefix sum. * quad_shuffle.brcst - subgroupQuadBroadcast * quad_shuffle.horiz - subgroupQuadSwapHorizontal * quad_shuffle.vert - subgroupQuadSwapVertical * quad_shuffle.diag - subgroupQuadSwapDiagonal * getfiberid - gl_SubgroupID Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13817>	2021-12-07 20:45:53 +00:00
Samuel Pitoiset	943ef0edbd	radv: avoid prefixing few VkXXX structures by struct Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14115>	2021-12-07 20:23:48 +00:00
Lionel Landwerlin	a3886fa471	util/u_vector: prevent C++ warning on cast from void* to something else v2: fix windows build v3: duplicate foreach macro for C/C++ v4: Extract casting macro Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> (v3) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13997>	2021-12-07 19:41:06 +00:00
Dave Airlie	55b396e743	mesa/crocus/iris/blorp: drop minify macro in favour of u_minify This macro is duplicated, clean it up. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14103>	2021-12-07 19:04:01 +00:00
Adam Jackson	d9f0991744	mesa: Make _mesa_generate_mipmap_level static Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	3731d21c68	mesa: Remove unused execmem code Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	68b7fabbe2	mesa/program: Dead code cleanup Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	087f196a08	mesa/vbo: Always use buffer objects for storage Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	b4ae8ee43e	mesa: Remove unused _vbo_current_binding Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	d88398413a	mesa: Remove unused _es_{,Get}TexGenfv Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	3f35fd71c3	mesa: Remove unused _es_RenderbufferStorageEXT Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	5b8eeaac3a	mesa: Remove unused _es_color4ub Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	5d09812c2f	mesa: Remove unused _mesa_compressed_image_address Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	74c29e1bf7	mesa: Remove unused _mesa_apply_ci_transfer_ops Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	1455d1cd30	mesa: Remove unused _check_TexGenOES Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	98ef86b3af	mesa: Remove unused _mesa_DrawTexx{,v} Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	8f92e81718	mesa: Remove unused _mesa_get_render_format Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Adam Jackson	8aa776ab1f	mesa: Remove unused _mesa_all_buffers_are_unmapped Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14098>	2021-12-07 18:21:52 +00:00
Samuel Pitoiset	31ad50d989	radv: fix dynamic rendering with VRS The structure type was wrong. This fixes a bunch failures in dEQP-VK.fragment_shading_rate.dynamic_rendering.*. Fixes: `7f3aba37d2` ("radv: Support Begin/EndRendering.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14106>	2021-12-07 17:30:47 +00:00
Samuel Pitoiset	92d84f189c	radv: constify radv_vs_input_state() in more places Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13857>	2021-12-07 14:27:29 +00:00
Samuel Pitoiset	b83caef6d2	radv: constify radv_vertex_binding in CmdSetVertexInputEXT() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13857>	2021-12-07 14:27:29 +00:00
Samuel Pitoiset	45f181c482	radv: move a comment at the right place in CmdBindVertexBuffers2EXT() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13857>	2021-12-07 14:27:29 +00:00
Danylo Piliaiev	e63ffc2f04	freedreno,tu: Limit the amount of instructions preloaded into icache Inferring from blob's cmdstream the size of shader instruction cache for: - a630 is 64 - a650 is 128 - a660 is 128 On a650 and a660 gpu could hang if we exceed the limit. Though it is not reproducible with computerator or a single amber test. Also while blob limits the size to 128 - Turnip still hangs with it but does not hang with the limit of 127. On a630 there seem to be no hang when limit is exceeded. Fixes the hang of compute shader in Alien Isolation on a650/a660. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14044>	2021-12-07 13:48:35 +00:00
Dave Airlie	da24bb17a8	mesa/st: move external objects to direct calls This moves the memory and semaphore objects to direct calls Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	830e2038fc	mesa/st: move transformfeedback to direct calls Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	12f8475ff9	mesa/st: move barriers to direct call Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	8fe4ff2fda	mesa/st: direct call sync object functions Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	bd9adf3919	mesa/dd/st: direct wire queries/timestamp/condrender. These were all interrelated, avoid the indirect calls here, and call directly between main and state tracker Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	0d8610b099	mesa/dd/st: move get strings pointer out of dd.h Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	b03db92720	meson: make mesa/tests/glx depend on gallium If I start direct linking the state tracker to mesa, the tests fail to build because they don't have gallium linked. Reviewed-by: Dylan Baker <dylan.c.baker@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14073>	2021-12-07 13:03:53 +00:00
Dave Airlie	9bb375b0be	intel/compiler: drop glsl options from brw_compiler Only the nir options are used now, since i965 was dropped, the glsl options come from the state tracker Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14102>	2021-12-07 08:52:36 +00:00
Emma Anholt	de33205f88	nir/algebraic: Move all the individual transforms to a common table. Cuts 28% of the remaining relocations in libvulkan_intel.so, shrinks binary size by 290kb. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	a29b54f014	nir/algebraic: Mark the automaton's filter tables as const. Moves it to .rodata instead of .data. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	45a8d11b6e	nir/algebraic: Pack various bitfields in the nir_search_value_union. This gets our union's size down to 22 bytes (now smaller than any of the union's types were before we made the union!). Cuts another 48kb off of the drivers. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	53f49b7066	nir/algebraic: Move relocations for variable conds to a table. This helps concentrate the dirty pages from the relocations, reduces how many relocations there are, and reduces the size of each variable assuming variables mostly don't have conditions or the conditions are mostly reused). Reduces libvulkan_intel.so size by 49kb. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	8485a78977	nir/algebraic: Move relocations for expression conds to a table. This helps concentrate the dirty pages from the relocations, reduces how many relocations there are, and reduces the size of each expression (assuming expressions mostly don't have conditions or the conditions are mostly reused). Reduces libvulkan_intel.so size by 8.7kb. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	7635379dc7	nir/algebraic: Remove array-of-cond code You can't have an array of them after removing many-comm-expr, there's no space in the struct. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	5d82c61a30	nir/algebraic: Replace relocations for nir_search values with a table. Even with packing all 3 types into a 40-byte union (nir_search_constant being 24 bytes and nir_search_expression having formerly been 32), and having a single array of them, this cuts 1.7MB from each of libvulkan_intel.so and libgallium_dri.so. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:09:00 +00:00
Emma Anholt	e7d8717375	nir/algebraic: Drop the check for cache == None. The cache is always set. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:08:59 +00:00
Emma Anholt	a263474d3b	nir/algebraic: Move some generated-code algebraic opt args into a struct. I'm going to be adding some more tables to reduce relocations in the generated code, so move the current tables to a struct for arg-passing sanity. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13987>	2021-12-07 07:08:59 +00:00
Emma Anholt	4b5692fa71	nouveau/nir: Use the address reg for indirect scratch access. Fixes the dEQP regressions in dEQP-GLES2.functional.shaders.indexing.*. TGSI used the address reg for these offsets too. Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14083>	2021-12-07 06:57:27 +00:00
Timothy Arceri	ca16c271fa	mesa: make struct in gl_program a union and remove FIXME Now that the classic drivers that were mixing the use of these asm and glsl shader fields are gone we can finally use a union here. This basically reverts commit `9d99dc4bc1` but also moves a read of IsPositionInvariant inside an arb asm only code block for safety. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14059>	2021-12-07 05:24:47 +00:00
Qiang Yu	df6ff88fd0	loader/dri3: support glx pbuffer swap Double buffered pbuffer need to update the front buffer, otherwise we always get wrong value when glReadPixels(). Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	e0311746cd	loader/dri3: stop doing anything in swap buffer for some drawable We are sure to have a back buffer in swap buffer now. Reviewed-by: Adam Jackson <ajax@redhat.com> Singed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	d67d1dddbe	loader/dri3: rename dri3_fake_front_buffer Sometimes this is the real front buffer depends on the place called. Since it's the same LOADER_DRI3_FRONT_ID slot, just name it dri3_front_buffer. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	c7d5e91b6b	loader/dri3: replace is_pixmap with drawable type Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	d19013a199	loader/dri3: setup present event with drawable type info If we already know the drawable type, setup event in a simpler way. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	6508e5b4a1	loader/dri3: pack window present event setup into a function For simplicity and latter commits. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	9faa2892b9	loader/dri3: remove unused present capability query The query result is not used anywhere. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	13bf30583c	loader/dri3: add drawable type set by GLX and EGL Drawable type include more information which can be used to distinguish pixmap and pbuffer which both treated as pixmap before. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	887f5a6320	glx: add drawable type argument when create drawable For distinguish different behavior of pixmap and pbuffer in latter commits. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	6625c960c5	glx: check drawable type before create drawble If glxDrawable is not a X window ID, we can only support GLXPbuffer now. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Qiang Yu	bc8a51a79a	glx: no need to create extra pixmap for pbuffer XServer already created a pixmap with same id as pbuffer, so that other client can use the pbuffer id to do glXMakeCurrent(). But with a hidden pixmap, we can't do this. Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13750>	2021-12-07 03:26:13 +00:00
Timothy Arceri	51f71e0483	util: add workaround for SNK HEROINES Tag Team Frenzy The game makes use of builtin functions that were moved to compatibility shaders in GLSL 4.20 in its GLSL 4.20 shaders without declaring the compatibility token. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5706 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14040>	2021-12-07 02:53:04 +00:00
Timothy Arceri	f225e0679a	util: add dri config option force_compat_shaders This allows us to force all shaders to offer shader features only provided to compatibility shaders. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14040>	2021-12-07 02:53:04 +00:00
Kenneth Graunke	2d7c25fb9d	isl: Move some genxml surface state helpers into an include file On XeHP, the XY_BLOCK_COPY_BLT command has a number of fields that describe the layout of the surface, much like SURFACE_STATE does. Several of them are encoded in such a similar manner that we really would like to reuse the isl helpers for emitting those. This commit moves them into a new isl_genX_helpers.h file which I can include from the BLORP code. (The alternative would be to add XY_BLOCK_COPY_BLT filling commands to isl, but that...seems more like a BLORP feature.) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14094>	2021-12-06 17:23:56 -08:00
Kenneth Graunke	b3b63c795f	iris: Rename is_render_target to is_dest in a few blit functions When targeting the blitter or compute engines, the destination is not really a render target. But it's still useful to know whether we're talking about the source or destination. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14094>	2021-12-06 17:23:56 -08:00
Emma Anholt	65e343dda3	r300: Fix mis-optimization turning -1 - x into 1 - x. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14092>	2021-12-07 01:08:01 +00:00
Emma Anholt	0e0a49039b	r300: Turn a comment about presub into an assert. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14092>	2021-12-07 01:08:01 +00:00
Emma Anholt	ce0e228ff4	r300: Add deqp expectations for RV515. This may not be a complete set, as I haven't been able to run dEQP-GLES2 to completion (GPU hangs at some point, no particular test seems to be guilty). But this will help me assess NIR-to-TGSI for the driver. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14092>	2021-12-07 01:08:01 +00:00
Timothy Arceri	3580ffaa44	doc: update source tree doc to reflect recent classic/swrast deletions Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14078>	2021-12-07 01:01:37 +00:00
Dylan Baker	2636e8681a	fixup! gallium/swr: Remove driver source Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11264>	2021-12-06 23:37:50 +00:00
Dylan Baker	fb4fc1fd50	new_features: Add OpenSWR removal Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11264>	2021-12-06 23:37:50 +00:00
Dylan Baker	7dd12694f1	CODEOWNERS: remove OpenSWR Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11264>	2021-12-06 23:37:50 +00:00
Jan Zielinski	ce4c96ea1b	gallium/swr: clean up the documentation after SWR removal from main Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11264>	2021-12-06 23:37:50 +00:00
Jan Zielinski	e2de00876a	gallium/swr: Remove common code and build options This commit removes all OpenSWR references from common Mesa code and build system. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11264>	2021-12-06 23:37:50 +00:00
Jan Zielinski	855793c6c6	gallium/swr: Remove driver source The OpenSWR will be maintained on a classic/LTS branch. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11264>	2021-12-06 23:37:50 +00:00
Pierre Moreau	d22d328859	nv50/nir: Switch to the common NIR options Reviewed-by: Karol Herbst <kherbst@redhat.com> Signed-off-by: Pierre Moreau <dev@pmoreau.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14069>	2021-12-06 23:25:28 +00:00
Alyssa Rosenzweig	46b758cbcc	pan/va: Add table parameter to LD_ATTR_IMM ..and test the instruction. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14065>	2021-12-06 23:06:59 +00:00
Alyssa Rosenzweig	a5084127eb	pan/va: Add sample/update modes to LD_VAR ..and test the new instructions. As usual, the semantics are the same as bifrost, but the encoding is simpler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14065>	2021-12-06 23:06:59 +00:00
Alyssa Rosenzweig	3485e9dd3d	pan/va: Make LD_VAR index more fine-grained Index in bytes instead of vec4s, since varyings on Valhall are no longer vec4 based like on previous Malis. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14065>	2021-12-06 23:06:59 +00:00
Alyssa Rosenzweig	7d157ae50e	pan/va: Add more assembler tests For new patterns Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14065>	2021-12-06 23:06:59 +00:00
Alyssa Rosenzweig	f3d4d074da	pan/va: Disambiguate sign of CSEL instructions The naming scheme is a bit simpler than Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14065>	2021-12-06 23:06:59 +00:00
Alyssa Rosenzweig	9da627fd6d	pan/va: Improve assembler unit test output Instead of using Python hex() to print the result, print the result in the same format as the disassembler for easy visual comparison. This means we don't need to reprint the expectation. This gives output like: 7c 7d 11 33 04 80 66 00 LD_ATTR_IMM.v4.f16.slot0 @r0:r1, `r60, `r61, index:0x1 7c 7d 10 33 04 80 66 00 Incorrect assembly Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14065>	2021-12-06 23:06:59 +00:00
Dylan Baker	3ae3569d82	meson: restore dri.pc file Which was accidentally deleted. Fixes: `ea8fa10edd` Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5717 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14090>	2021-12-06 22:42:34 +00:00
Dave Airlie	2feee3b0b0	mesa/externalobject: delete unused functions Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14075>	2021-12-06 22:08:40 +00:00
Dave Airlie	2db58c3f89	mesa/barrier: remove unused barrier functions Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14075>	2021-12-06 22:08:40 +00:00
Dave Airlie	26bd234d06	mesa/transformfeedback: remove unused transform feedback code Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14075>	2021-12-06 22:08:39 +00:00
Dave Airlie	b7eb7bd47b	mesa: remove unused buffer object code. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14075>	2021-12-06 22:08:39 +00:00
Dave Airlie	a7c7f55a3b	mesa/syncobj: drop unused syncobj code. This is all done in the state tracker now Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14075>	2021-12-06 22:08:39 +00:00
Dave Airlie	ae9c96ea17	mesa/query: remove all the mesa queryobj code. This is all unused in favour of the state tracker code. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14075>	2021-12-06 22:08:39 +00:00
Dave Airlie	711176bc0c	iris/ci: comment out iris-cml-traces-performance due to hw unavailable This job seems to be timing out, daniels said hw was having some availability issues, so turn off for now. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14088>	2021-12-06 21:19:51 +00:00
Alyssa Rosenzweig	9b068f186a	panfrost: Add Valhall support to pandecode Valhall v9 introduces a number of new data structures since Bifrost v7, and removes a number of traditional data structures. Add decode routines for the new Valhall data structures, and condition the old routines on (PAN_ARCH <= 7) to remain buildable and warning-free. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Alyssa Rosenzweig	745d7db748	panfrost: Don't shadow Mesa's fui() Will fix a compiler error when we #include the Valhall disassembler header from pandecode. Fixes: `688827f3c5` ("pan/va: Add disassembler generator") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Alyssa Rosenzweig	244f3704d4	panfrost: Zero initialize disassembler stats Keep it simple for introducing new support. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Alyssa Rosenzweig	96acad5cd5	panfrost: Add XML for Valhall data structures Fork the latest canonical XML (Bifrost v7) and adapt to the data structures found in the earliest Valhall GPU I could get my hands on (Valhall v9). This should minimize the churn needed for the port by keeping the Valhall model close to the Bifrost we already supported. It is not known what happened to v8. It appears to have been yeeted from existence. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Alyssa Rosenzweig	6eb0770be8	panfrost: Add "hex" type to GenXML Although known fields wouldn't be given the type "hex", it is useful as the default type for unknown fields while reverse-engineering, and as such is used in the Valhall XML. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Alyssa Rosenzweig	72b3f21cd4	pan/va: Only hex dump when verbosely disassembling Closer behaviour to Bifrost, making the entrypoints symmetric. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Alyssa Rosenzweig	7fa5382ad6	pan/bi: Link with Valhall disassembler For pandecode's use. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14063>	2021-12-06 20:46:09 +00:00
Marek Olšák	de5c863a52	mesa: use simple_mtx_t for TexMutex (v2) change mtx_recursive -> mtx_plain, there's no recursive locking Let's try this again! This was originally landed as `f6abb3445b` ("mesa: use simple_mtx_t for TexMutex") and then reverted with `781c0eafcf` ("Revert "mesa: use simple_mtx_t for TexMutex"") because it broke i965. Now that i965 is no longer in the tree, we can restore it. Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> (v1) Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Kristian H. Kristensen <hoegsberg@google.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14053>	2021-12-06 20:17:02 +00:00
Ian Romanick	b88202b0e4	nir/constant_folding: Optimize txb with bias of constant zero to tex v2: Fail gracefully when bias_idx < 0. See comment in the code for the rationale. See also issue #5722. All Haswell and newer Intel GPUs had similar results. (Ice Lake shown) total instructions in shared programs: 19757733 -> 19753431 (-0.02%) instructions in affected programs: 277248 -> 272946 (-1.55%) helped: 1644 HURT: 1 helped stats (abs) min: 1 max: 16 x̄: 2.62 x̃: 2 helped stats (rel) min: 0.05% max: 11.11% x̄: 2.11% x̃: 1.61% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 0.35% max: 0.35% x̄: 0.35% x̃: 0.35% 95% mean confidence interval for instructions value: -2.72 -2.51 95% mean confidence interval for instructions %-change: -2.19% -2.03% Instructions are helped. total cycles in shared programs: 938517439 -> 938384079 (-0.01%) cycles in affected programs: 19548849 -> 19415489 (-0.68%) helped: 1358 HURT: 269 helped stats (abs) min: 1 max: 2328 x̄: 133.01 x̃: 16 helped stats (rel) min: <.01% max: 41.12% x̄: 1.40% x̃: 0.48% HURT stats (abs) min: 1 max: 1302 x̄: 175.70 x̃: 30 HURT stats (rel) min: <.01% max: 69.03% x̄: 6.24% x̃: 1.04% 95% mean confidence interval for cycles value: -99.14 -64.79 95% mean confidence interval for cycles %-change: -0.47% 0.19% Inconclusive result (%-change mean confidence interval includes 0). LOST: 21 GAINED: 32 All Ivy Bridge and older Intel GPUs had similar results. (Ivy Bridge shown) total instructions in shared programs: 15302017 -> 15301485 (<.01%) instructions in affected programs: 22565 -> 22033 (-2.36%) helped: 168 HURT: 0 helped stats (abs) min: 1 max: 7 x̄: 3.17 x̃: 3 helped stats (rel) min: 0.04% max: 4.39% x̄: 3.05% x̃: 3.27% 95% mean confidence interval for instructions value: -3.45 -2.89 95% mean confidence interval for instructions %-change: -3.19% -2.91% Instructions are helped. total cycles in shared programs: 550119761 -> 549989147 (-0.02%) cycles in affected programs: 12834251 -> 12703637 (-1.02%) helped: 164 HURT: 0 helped stats (abs) min: 20 max: 4547 x̄: 796.43 x̃: 294 helped stats (rel) min: 0.23% max: 53.84% x̄: 2.05% x̃: 0.37% 95% mean confidence interval for cycles value: -942.62 -650.24 95% mean confidence interval for cycles %-change: -3.17% -0.94% Cycles are helped. fossil-db results: Tiger Lake, Ice Lake, and Skylake had similar results. (Ice Lake shown) Instructions in all programs: 142073649 -> 141307526 (-0.5%) SENDs in all programs: 6876848 -> 6876778 (-0.0%) Loops in all programs: 38283 -> 38283 (+0.0%) Cycles in all programs: 8410049681 -> 8402902960 (-0.1%) Spills in all programs: 190623 -> 190599 (-0.0%) Fills in all programs: 297780 -> 297756 (-0.0%) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> [v1] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14025>	2021-12-06 19:50:42 +00:00
Michel Zou	fadb4b92c5	llvmpipe: Fix Wpointer-to-int-cast Fixes: `2771fd4a` (gallium, windows: Use HANDLE instead of FD for external objects) Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14021>	2021-12-06 19:00:29 +00:00
Emma Anholt	9b2600da87	mesa/st: Remove GL_ARB_depth_clamp emulation support. This was useful for emulating GL 3.2 in virgl on a GLES3 host renderer, before GL_EXT_depth_clamp introduced the ability for hardware drivers to expose the feature on GLES. Now that we have that, the desktop-GL-capable HW that virgl cares about can expose desktop GL even on its GLES renderer on the host without this emulation. I don't think anyone particularly cares about hitting higher GL versions on actually-core-GLES hosts with virgl. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13729>	2021-12-06 18:17:25 +00:00
Yonggang Luo	c691149f3e	win32: Fixes thread local on win32 with clang/mingw (!14062 ) The mingw compiling error: ``` ../../src/mapi/glapi/glapi.h:86:66: error: '_glapi_tls_Dispatch' cannot be thread local when declared 'dllimport' _GLAPI_EXPORT extern __THREAD_INITIAL_EXEC struct _glapi_table * _glapi_tls_Dispatch; ^ ../../src/mapi/glapi/glapi.h:88:51: error: '_glapi_tls_Context' cannot be thread local when declared 'dllimport' _GLAPI_EXPORT extern __THREAD_INITIAL_EXEC void * _glapi_tls_Context; ``` Fixes: `c47fd3dc` ("windows: Use TLS context/dispatch with shared-glapi") Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14062>	2021-12-06 14:55:54 +00:00
Jesse Natalie	9626595026	nir: Add an 'external' texture type for parity with samplers Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14046>	2021-12-06 14:21:38 +00:00
Alyssa Rosenzweig	ae4d46d457	panfrost: Only build GPU indirect kernels for v7 These kernels aren't tested (and are probably broken) elsewhere. Don't waste cycles trying to compile for other architectures. This reduces the amount of code that needs to be ported to a new architecture. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14064>	2021-12-06 13:54:25 +00:00
Lionel Landwerlin	79421616e8	docs/envvars: update after INTEL_DEBUG cleanup Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14080>	2021-12-06 13:19:10 +00:00
Corentin Noël	7fa60cd7ce	virgl: Disable cache for VIRGL_BIND_SAMPLER_VIEW this currently makes the dEQP-GLES31.functional.image_load_store.buffer.image_size.readonly_12 test fail when used simultaneously with other tests that lead to hitting the cache. For instance the combination of: dEQP-GLES31.functional.image_load_store.buffer.atomic.or_r32i_result dEQP-GLES31.functional.image_load_store.buffer.atomic.or_r32i_return_value dEQP-GLES31.functional.image_load_store.buffer.image_size.readonly_12 results in a failure of the readonly_12 test. Deflag dEQP-GLES31.functional.image_load_store.buffer.image_size.{read,write}only_12 as flakes. Signed-off-by: Corentin Noël <corentin.noel@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14045>	2021-12-06 13:02:30 +00:00
Jakob Bornecrantz	555f93cdcd	vulkan-device-select: Don't leak drmDevicePtr ASAN found a leak: ``` Direct leak of 1440 byte(s) in 10 object(s) allocated from: #0 0x4a9a92 in calloc (build-Monado-CMake/src/xrt/targets/service/monado-service+0x4a9a92) #1 0x7fdf82afed06 in drmDeviceAlloc build-drm/../drm/xf86drm.c:3933:14 #2 0x7fdf82b00203 in drmProcessPciDevice build-drm/../drm/xf86drm.c:3965:11 #3 0x7fdf82b00203 in process_device build-drm/../drm/xf86drm.c:4359:16 #4 0x7fdf82b0485e in drmGetDevice2 build-drm/../drm/xf86drm.c:4528:15 #5 0x7fdf70751113 in device_select_find_xcb_pci_default ../src/vulkan/device-select-layer/device_select_x11.c:95:13 #6 0x7fdf70751113 in get_default_device ../src/vulkan/device-select-layer/device_select_layer.c:395:21 #7 0x7fdf70751113 in device_select_EnumeratePhysicalDevices ../src/vulkan/device-select-layer/device_select_layer.c:456:33 ``` Cc: mesa-stable Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14068>	2021-12-06 11:21:03 +00:00
Erik Faye-Lund	8bd0446d00	docs: update trademark disclaimer It's a long time since SGI was the copyright holder for the OpenGL trademark. And we implement more APIs by now, so let's update the disclaimer to instead redirect to the Khronos licensing page for details. While we're at it, soften the language on legal status as a formal implementation, as we currently have conformant drivers for most of the APIs by now. But refer to the Khronos website for details, as conformance status for drivers are subject to change. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13833>	2021-12-06 11:15:47 +00:00
Timothy Arceri	74a1f103b6	mesa: update or remove out of date references to ir_to_mesa Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	bf1f809d7f	mesa: rename ir_to_mesa.{cpp,h} -> link_program.{cpp,h} The only code now left in this file is the linking function. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	5cad5db97b	mesa: tidy up ir_to_mesa.{cpp,h} includes, comments, etc Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	79abf6a17e	mesa: move _mesa_ensure_and_associate_uniform_storage() to uniform_query.cpp This is where all the other functions that handle uniform storage live. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	7abe527ab4	mesa/st: move _mesa_generate_parameters_list_for_uniforms() code to st The classic drivers that shared the code are now gone and the only user is the tgsi linker so here we move the code to where it is used. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	4b895dc895	mesa: remove GLSL IR to Mesa IR code The last user of this was dropped with the classic drivers. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	d3e0cfaa08	mesa: make _mesa_associate_uniform_storage() static The function is no longer called directly outside of this file. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Timothy Arceri	33cbab854e	mesa: remove _mesa_ir_link_shader() The final use of this was removed when the classic drivers were dropped. Reviewed-by: Dave Airlie <airlied@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14066>	2021-12-06 10:15:08 +00:00
Lionel Landwerlin	d44478483c	genxml: protect _length defines in genX_bits.h Those defines exist in the packing headers too and some parts of the code (like mi_builder.h) include both. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13954>	2021-12-06 08:02:59 +00:00
Lionel Landwerlin	e9b58116ea	genxml: fix compilation with P/I defines Those names are a bit too common and sometimes clash variables. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13954>	2021-12-06 08:02:59 +00:00
Lionel Landwerlin	365903ebbb	intel/debug: reclaim 7 unused bits from classic driver Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14060>	2021-12-06 09:44:04 +02:00
Dave Airlie	d72c420f70	mesa/light: make _mesa_light static do_light. This is unused outside this now. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14074>	2021-12-06 16:13:50 +10:00
Dave Airlie	86bbd14b8e	mesa/dd: remove NewSamplerObject This was always calling directly into the mesa version now Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14074>	2021-12-06 16:13:46 +10:00
Dave Airlie	67f971e6ad	mesa/dd: remove some fbo driver hooks. These are assign to core mesa functions by st, so just direct call Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14074>	2021-12-06 16:13:40 +10:00
Dave Airlie	279471bda6	mesa/dd: burn a bunch of legacy driver interfaces down None of these are used anymore in the gallium world, there are some more to get rid off but this is a good start. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14074>	2021-12-06 16:13:33 +10:00
Dave Airlie	e2c05539fe	mesa: drop unused sw extensions init This isn't used since swrast went away. Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14072>	2021-12-06 15:10:07 +10:00
Dave Airlie	bf35f0cb7a	mtypes: drop some context pointers that are unused now Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14072>	2021-12-06 15:09:43 +10:00
Timothy Arceri	80719f08a7	mesa: remove old tnl device driver header files The last users of these were removed when the classic drivers were dropped. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14070>	2021-12-06 13:05:40 +11:00
Bas Nieuwenhuizen	e914a6710f	radv: Expose the VK_KHR_dynamic_rendering extension. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13721>	2021-12-05 17:21:32 +00:00
Bas Nieuwenhuizen	483a08d552	radv: Support dynamic rendering inheritance info. Straightforward, just converting to a renderpass as well. Note that we now own the renderpass so I also added a bool to check if we own it so we can destroy it after recording. Doing the destruction at destroy & reset time, as reset can be called during recording, and destroy all the time. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13721>	2021-12-05 17:21:32 +00:00
Bas Nieuwenhuizen	7f3aba37d2	radv: Support Begin/EndRendering. This is just the naive implementation that create a new renderpass and then destroys it at the end. I do it this way because in meta operations we are still creating temporary subpasses for a renderpass for e.g. the resolve. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13721>	2021-12-05 17:21:32 +00:00
Bas Nieuwenhuizen	0222dace90	radv: Support VK_KHR_dynamic_rendering for pipeline creation. The approach here is to include a wrapper converting the legacy renderpass info to the new structures. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13721>	2021-12-05 17:21:32 +00:00
Bas Nieuwenhuizen	403a5c1a79	radv: Do not use VK_FORMAT_UNDEFINED in meta passes. Is used in VK_KHR_dynamic_rendering to indicate non-presence of color attachments. Wasn't really valid Vulkan so we otherwise don't need a workaround in the renderpass->dynamic rendering conversion. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13721>	2021-12-05 17:21:32 +00:00
Bas Nieuwenhuizen	6968c87e97	radv: Add named constants for max framebuffer width/height. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13721>	2021-12-05 17:21:32 +00:00
Lionel Landwerlin	4c703686db	spirv: handle ray query intrinsics v2: Fixup comment (Caio) Use generated builders (Caio) v3: Update spirv2dxil CI expectations Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	0cbcc15afe	nir: add a ray query optimization pass Just remove queries that are never used or proceeded with. The latter case leading to undefined values. v2: Don't use nir_shader_instructions_pass() to find variables (Caio) Simplify replacement (Caio) v3: Don't track all the queries intrinsic effects (Caio) Rename things to represent only read queries (Caio) Use set instead of hash_table (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	5a9cdab170	nir: track variables representing ray queries v2: Fix missing ray_query variable check (Caio) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	0d6f050b46	nir: add intrinsics for ray queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	0800ec2c77	nir: add a new access flag to allow access in helper invocations v2: Add nir_print support Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	54489b3c09	nir/print: printout ACCESS_STREAM_CACHE_POLICY Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	f98984ad13	nir/lower_io: include the variable access in the lowered intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Lionel Landwerlin	7661237a31	intel/nir: preserve access value when duping intrinsic Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `6339aba775` ("intel/compiler: Lower SSBO and shared loads/stores in NIR") Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13718>	2021-12-04 20:46:35 +00:00
Yonggang Luo	6e7ffa760f	vulkan: Open registry XML files as UTF-8 "C:\CI-Tools\msys64\mingw64\bin/python3.EXE" "../../src/vulkan/util/gen_enum_to_str.py" "--xml" "../../src/vulkan/registry/vk.xml" "--outdir" "C:/work/xemu/xemu-opengl/mesa/build/windows-mingw64/src/vulkan/util" Traceback (most recent call last): File "C:\work\xemu\xemu-opengl\mesa\src\vulkan\util\gen_enum_to_str.py", line 473, in <module> main() File "C:\work\xemu\xemu-opengl\mesa\src\vulkan\util\gen_enum_to_str.py", line 462, in main f.write(template.render( UnicodeEncodeError: 'gbk' codec can't encode character '\xa9' in position 107: illegal multibyte sequence Signed-off-by: Yonggang Luo <luoyonggang@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14015>	2021-12-05 03:28:38 +08:00
Yiwei Zhang	8665910a63	venus: move bo allocation for mappable memory to vn_MapMemory This change defers the bo allocation for non-external mappable memory direct allocation. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13874>	2021-12-04 01:48:16 +00:00
Yiwei Zhang	19d6b497fb	venus: track memory type property flags in vn_device_memory Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13874>	2021-12-04 01:48:16 +00:00
Yiwei Zhang	86b3a4e6aa	venus: defer roundtrip waiting to vkFreeMemory time bo allocations for the below cases are after device memory allocation: 1. direct non-external mappable memory allocation 2. pool grow 3. exportable memory allocation For (1) and (2), the bo is for mapping, which is a pure kernel operation to happen later. So roundtrip waiting can be deferred until free memory. For (3), the bo is for either fd export or mapping, which are both pure kernel operations. So roundtrip waiting can also be deferred until free memory. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13874>	2021-12-04 01:48:16 +00:00
Yiwei Zhang	9fa3e3df9e	venus: simplify device memory pool alloc and refcount The behavior we stick to is that the base_bo is always created for pool memory so that to keep it alive during suballocates and frees. This CL does the below: 1. rename pool_alloc to pool_suballocate and align the api interface 2. rename simple_alloc to pool_grow_alloc to make it pool specific 3. refactor pool_free and simple_free into a pair of pool_ref and pool_unref to simplify that vkFreeMemory is only called after the pool bo gets destroyed. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13874>	2021-12-04 01:48:16 +00:00
Yiwei Zhang	e019780626	venus: refactor vn_device_memory_simple_alloc Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13874>	2021-12-04 01:48:16 +00:00
Ilia Mirkin	eb28ac0f88	nv50: don't claim support for format-less stores This is not supported, nor is there any need to support it -- ES 3.1 doesn't need it, and we're in no danger of supporting ARB_shader_image_load_store (among other things, it requires frag images). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14050>	2021-12-04 01:34:17 +00:00
Ilia Mirkin	03acfa4aac	nv50,nvc0: add new caps to list Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14050>	2021-12-04 01:34:17 +00:00
Marcin Ślusarz	bd2c11dfa8	intel/compiler: Load draw_id from XP0 in Task/Mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Marcin Ślusarz	b717872e08	intel/compiler: Get mesh_global_addr from the Inline Parameter for Task/Mesh Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Marcin Ślusarz	28e0c63a4c	intel/compiler: extract brw_nir_load_global_const out of rt code Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	1f438eb033	intel/compiler: Implement Mesh Output Use the same URB access helpers that were added for Task Output. The Arrayed I/O (per-primitive and per-vertex) is handled by applying the pitch from the MUE layout into the NIR intrinsics and including the non-arrayed offset on top of it. After that, the index src can be used directly for lowering. Because we keep around the non-arrayed offset AND the pitch is aligned, we can identify cases where the access is indirect but guaranteed to be aligned, and dispatch a single message. Added a TODO to explore that later. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	70ace2bbcd	intel/compiler: Implement Task Output and Mesh Input Implement the output written by the task workgroup and available to all the mesh workgroups dispatched from that task. We currently ignore any layout annotations (since they are not really testable) and produce a (packed) layout ourselves. The URB messages are only SIMD8, so for larger SIMDs, the functions will produce multiple messages. Making this lowering here instead of the generic lower_simd_width() since it is not just a matter of zip/unzip, e.g. the offset must be adjusted. Indirect writes/reads are implemented by handling one component at a time and using the PER_SLOT variant of the messages. Note that VK_NV_mesh_shader allows reading outputs, so add support for that as well. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	171bdd2ec6	intel/compiler: Lower Task/Mesh local_invocation_{id,index} The Invocation index is provided by the payload, so we can skip the usual math done to get to it. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	db23c41537	intel/compiler: Add backend compiler basics for Task/Mesh Task/Mesh stages are CS-like stages, and include many builtins (e.g. workgroup ID/index) and intrinsics (e.g. workgroup memory primitives) originally present only in CS. This commit add two new stages (task and mesh) that 'inherit' from CS by embedding a brw_cs_prog_data in their own prog_data structure, so that CS functionality can be easily reused. They also currently use the same helpers to select the SIMD variant to use -- that was recently added for CS. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	827cf65a26	intel/compiler: Export brw_nir_lower_simd Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	09dd05a219	intel/compiler: Make MUE available when setting up FS URB access Allows to assert its existence for per-primitive variables and will later be useful to implement the "more than 16 attributes" case for Mesh. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	79e5e353e4	intel/compiler: Add structs to hold TUE/MUE Used to specify the layout of 'Task URB Entry' and 'Mesh URB Entry'. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	fcc1ccf541	intel/compiler: Don't lower Mesh/Task I/O to temporaries Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	18e1c9c542	intel/compiler: Don't stage Task/Mesh outputs in registers Since the outputs are shared among the whole workgroup, these can't be staged in registers as they will not be always visible for all the invocations (to read/flush). If they ever need to be staged, we should use SLM for that. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	be89ea3231	intel/compiler: Handle per-primitive inputs in FS In Fragment Shader, regular inputs are laid out in the thread payload in a one dword per each half-GRF, that gives room for having the two delta dwords needed for interpolation. Per-primitive inputs are laid out before the regular inputs, and since there's no need to have delta information, they are packed. So half-GRF will be fully filled with 4 dwords of input. When num_per_primitive_inputs is zero (the default case), behavior should be the same as before. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	7938c38778	intel/compiler: Properly lower WorkgroupId for Task/Mesh Task/Mesh currently only support a single dimension (both in NV API and HW), so make Y and Z be zero. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Caio Oliveira	76f55d7556	intel: Add INTEL_DEBUG=task,mesh Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13661>	2021-12-04 00:41:46 +00:00
Dylan Baker	ea8fa10edd	mesa: move common/dri into gallium There are no other consumers, so we can just move this into gallium and out of mesa. Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	01b44d66b1	mesa: Merge libmesa_gallium and libmesa_common Since we don't have libmesa_classic anymore, we don't nee to split these, and can save a target/ar invocation by not having two targets. Plus it's just conceptually simpler Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	e030d5ba8a	mesa: Delete libmesa_classic We no longer have any classic drivers, so we no longer need libmesa_classic Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	bc2d3e7b5f	mesa/main/tests: remove dispatch sanity Thsi test uses a bunch of the classic infastructure, which is about to be deleted. Since gallium will be the sole user, it will likely be refactored out anyway. Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	a63760f01a	include/pci_ids: Move PCI ids supported by both i965 and iris to iris As crocus won't support any of these (BDW+) they should go into iris. This also allows us to remove the "prefer_iris" option, as iris is now the only option Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	bf97868062	mesa/dri: remove mega driver stub As it is now unused. Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	cdde031ac2	classic/i965: Remove driver Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	0cad451f00	classic/i915: Remove driver This is only going to be supported in the Amber branch Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	f464871932	classic/nouveau: Remove driver This will now only be available in the Amber branch Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	184a690fca	classic/r200: Delete driver This will now only be available on the Amber branch Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	4d45b280bf	classic/r100: Delete driver This is now only going to be available in the Amber branch Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Adam Jackson	76791db088	mesa/x11: Remove the swrast-classic-based fake libGL If you want this you will almost certainly be happier with the gallium version, giving you llvmpipe instead of swrast-classic. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Acked-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Dylan Baker	901e0d6a11	mesa/tests: ensure that util_cpu_detect has been called I think that this test was passing in some cases because of loader side effects, that stop happening after classic is removed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10153>	2021-12-03 23:53:06 +00:00
Ilia Mirkin	268fc8e5c1	gitlab-ci: detect a3xx gpu hang recovery failure But don't bail immediately, instead print out some more lines after the hang, hopefully catching info about the cause of the hang. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14033>	2021-12-03 23:26:27 +00:00
Ilia Mirkin	eb0b08ea1a	gitlab-ci: serial close can leave an active read So instead cancel the read first, and then close. Make sure the serial-reading properly detects this cancelled condition under all circumstances and exits. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14033>	2021-12-03 23:26:27 +00:00
Jesse Natalie	1abd6375c9	d3d12: Handle depth readback on drivers that require full-resource copies for depth Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14051>	2021-12-03 23:08:37 +00:00
Timur Kristóf	f28adc711f	nir: Print task and mesh shader I/O variable names. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14007>	2021-12-03 21:34:45 +00:00
Ilia Mirkin	a7180bd4a6	freedreno/a5xx: enable OES_gpu_shader5 This extension is controlled by the ESSL feature level. Bump it up since all parts of OES_gpu_shader5 should be supported. This also avoids lowering all of the "advanced" functions (which should probably not be lowered in the first place since they're part of ES 3.1...) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14035>	2021-12-03 20:04:17 +00:00
Timur Kristóf	078f9d9eeb	radv: Use util_widen_mask. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14005>	2021-12-03 18:29:13 +00:00
Timur Kristóf	c3eebc860a	aco: Use util_widen_mask. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14005>	2021-12-03 18:29:13 +00:00
Timur Kristóf	6cde424945	util: Add util_widen_mask function. Widens the given bit mask by a multiplier, meaning that it will replicate each bit by that amount. For example: 0b101 widened by 2 will become: 0b110011 This is typically used in shader I/O to transform a 64-bit writemask to a 32-bit writemask. Moving this function here because it is used in multiple places. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14005>	2021-12-03 18:29:13 +00:00
Timur Kristóf	7e66da89f8	nir: Fix sorting per-primitive outputs. Fixes: `59860d4873` Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14006>	2021-12-03 17:06:47 +00:00
Daniel Stone	76ffc72742	CI: Don't stream wget directly into bash If our environment has pipefail set, bash could exit early before wget has completed, and we will die with a broken pipe. Work around this by first downloading from wget, and then executing from bash. Signed-off-by: Daniel Stone <daniels@collabora.com> Acked-by: Michel Dänzer <mdaenzer@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14036>	2021-12-03 16:11:42 +00:00
Juan A. Suarez Romero	11287475c8	v3d: enable ARB_texture_view v2 (Iago): - Add comments to failing tests Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Alejandro Piñeiro	7f1525f086	v3d: enable ARB_texture_buffer_object and ARB_texture_buffer_range Through their specific PIPE_CAP. v2 (Iago) - Add comment in test failure Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	5cd161164b	st/pbo: set layer coord for array textures Even for drivers not supporting layers, we need to include the layer coordinate (zero in this case) when using array textures in the download/upload FS. Otherwise we are missing a component to get the texture, which ends up in a malformed NIR shader. v5 (Ilia): - Do not emit needless layer code. v6 (Ilia): - Add comment clarifying the code logic behind. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	fd47c939f4	st/pbo: add the image format in the download FS In the V3D driver there is a NIR lowering step for `image_store` intrinsic, where the image store format is required for doing the proper lowering. Thus, let's define it for the download FS instead of keeping it as NONE. v2 (Illia) - Use format only for drivers not supporting format-less writing. v4 (Illia): - Use PIPE_CAP_IMAGE_STORE_FORMATTED to reduce combinations. v5 (Ilia): - Use indirect array for download FS in not formatless-store support drivers. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	38c953e287	gallium: add new PIPE_CAP_IMAGE_STORE_FORMATTED This capability is enabled for drivers supporting formatless image writing in shader. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	54cba7d297	v3d: clamp clear color On clearing a color buffer, clamp the passed color values to the allowed ones. Hardware do clamping for TLB values, but not for clear values. v2 (Iago) - Add comment about hardware-based clamping on clear values. v3 (Iago): - Use format utils to simplify clamping - Move clamp color function as utility Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	fa1cd83fef	gallium/util: add helper to clamp colors to valid range v3 (Iago): - Fix comment. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	da0cdc84d5	st/pbo: do not use GS for NIR preferred shaders If PBO require to use a GS, this is created with TGSI. For drivers preferring NIR shaders, they need to translate it from TGSI to NIR. But unfortunately TGSI to NIR for GS is not implemented, which makes it crash. So let's use a GS only for drivers preferring TGSI. v3: - Add comment (Iago and Alejandro) Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Juan A. Suarez Romero	2000ea7e27	mesa: allow TEXTURE_BUFFER target for ARB_texture_buffer_range While TEXTURE_BUFFER do not support texture parameters in ARB_texture_buffer_object specification, it does in ARB_texture_buffer_range, specifically TEXTURE_BUFFER_OFFSET and TEXTURE_BUFFER_SIZE. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Alejandro Piñeiro	982c630cd5	v3d: restrict formats supported for PIPE_BIND_SHADER_IMAGE So far we were relying on the supported formats filtering when creating images from the user API. But for some other (internal) uses, some of the formats that pass the filter need to be restricted when binding them as shader images, as they are not supported for this case. v3 (Iago): - Change commit message. Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Alejandro Piñeiro	bb8285c258	v3d: add support for no buffer object bound From the GL_OES_texture_buffer spec: "If no buffer object is bound to the buffer texture, the results of the texel access are undefined." this can be interpreted as allowing any result to come back, but not terminate the program. The current solution is not entirely complete, as it could still try to get a wrong address for the shader state address. This can be checked with piglit test arb_texture_buffer_object-render-no-bo; the test is skip because it requires OpenGL 3.1, but if overriding the version then it will crash. Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Alejandro Piñeiro	60a1968fa1	v3d: support for texture buffer objects This commit handles the support for texture buffer objects. In general it is mostly about using the buffer info from the pipe_image_view instead of the texture info. v2: - Rework some assertions (Iago) - Remove needless comment (Alejandro) - Fix comment typos (Iago) v3: - Fix typos (Iago) Signed-off-by: Alejandro Piñeiro <apinheiro@igalia.com> Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13409>	2021-12-03 15:32:36 +00:00
Rhys Perry	a2d8c5b26d	nir/algebraic: optimize a*#b & -4 fossil-db (Sienna Cichlid): Totals from 611 (0.47% of 128647) affected shaders: CodeSize: 3096680 -> 3090976 (-0.18%) Instrs: 570494 -> 569249 (-0.22%) Latency: 5765865 -> 5759619 (-0.11%) InvThroughput: 969840 -> 967608 (-0.23%) VClause: 9690 -> 9688 (-0.02%) Copies: 42884 -> 42894 (+0.02%); split: -0.01%, +0.03% PreVGPRs: 28290 -> 28288 (-0.01%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13752>	2021-12-03 13:41:07 +00:00
Rhys Perry	2368c36427	nir/opt_offsets: remove need to loop try_extract_const_addition fossil-db (Sienna Cichlid): Totals from 1 (0.00% of 134572) affected shaders: no stat changes Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14009>	2021-12-03 11:51:49 +00:00
Rhys Perry	5c0fe11072	nir/opt_offsets: fix try_extract_const_addition recursion This initially looks like a miscompilation bug, but I don't think it's actually possible for it to create incorrect code. fossil-db (Sienna Cichlid): Totals from 32 (0.02% of 134572) affected shaders: VGPRs: 1336 -> 1320 (-1.20%) CodeSize: 90552 -> 89468 (-1.20%) Instrs: 17007 -> 16852 (-0.91%); split: -0.92%, +0.01% Latency: 429040 -> 428136 (-0.21%); split: -0.21%, +0.00% InvThroughput: 84966 -> 84572 (-0.46%); split: -0.47%, +0.00% Copies: 1458 -> 1468 (+0.69%); split: -0.07%, +0.75% Branches: 382 -> 384 (+0.52%) PreSGPRs: 970 -> 968 (-0.21%) PreVGPRs: 1029 -> 1011 (-1.75%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14009>	2021-12-03 11:51:49 +00:00
Juan A. Suarez Romero	f77ccdfb4a	nir: add NIR_DEBUG envvar Move all the NIR related debug environmental variables in a single NIR_DEBUG one. Use NIR_DEBUG=help to print all the available options. v2: - Use a macro to simplify (Marcin, Jason) - Remove wrong changes (Marcin) v3 (Marcin): - Remove rendundant NIR mentioning in option descriptions. - Unwrap option descriptions. - Ensure the constant is unsigned. - Use extern array to remove switch. v4: - Add missing kernel shader (Jason). - Add unlikely() (Marcin). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13840>	2021-12-03 11:15:29 +00:00
Iago Toral Quiroga	cc7db1fc53	broadcom/compiler: improve documentation for Z writes Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14037>	2021-12-03 10:39:08 +00:00
Iago Toral Quiroga	d7b79f3531	v3d,v3dv: don't disable EZ for passthrough Z writes The early-Z test uses Z values produced from FEP, so when we write Z from a shader we need to disable EZ. However, there are some instances where want to write the FEP-Z from the shader, in which case we would not need to disable EZ. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14037>	2021-12-03 10:39:08 +00:00
Iago Toral Quiroga	a65c605365	broadcom/compiler: track passthrough Z writes In some cases we need to make the shaders write the Z value produced from rasterization (FEP). Track these instances because they are relevant to early EZ setup. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14037>	2021-12-03 10:39:08 +00:00
Iago Toral Quiroga	6d4a645c90	broadcom/compiler: emit passthrough Z write if shader reads Z Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14037>	2021-12-03 10:39:08 +00:00
Tapani Pälli	d44d2e823f	anv: allow VK_IMAGE_LAYOUT_UNDEFINED as final layout From VK_KHR_synchronization2: "Image memory barriers that do not perform an image layout transition can be specified by setting oldLayout equal to newLayout. E.g. the old and new layout can both be set to VK_IMAGE_LAYOUT_UNDEFINED, without discarding data in the image." v2: make assert more readable (Lionel Landwerlin) Cc: mesa-stable Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14008>	2021-12-03 09:10:32 +00:00
Jordan Justen	5c4c8bdc4c	iris/batch: Add support for engines contexts As described in "intel: Add intel_gem_create_context_engines", this should make it easier to support an I915_ENGINE_CLASS_FOO engine in the future. For example, maybe something like: `98c3bbd5b5` Reworks: * Tweak engine counting logic (s-b Ken) * Tweak init of engine_classes in iris_init_engines_context (s-b Ken) * Add STATIC_ASSERT on engine_classes (Jordan) * Paulo: Call iris_hw_context_set_unrecoverable() for engines context * Rename to has_engines_context (s-b Paulo) * Jordan: Handle creating a new engines context when the context needs to be replaced. * Ken: Tweak context destroy code paths. * Call iris_lost_context_state on every batch. (s-b Ken) Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:35:32 -08:00
Jordan Justen	9f0070e9e8	iris: Make iris_kernel_context_get_priority() public Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:34:46 -08:00
Jordan Justen	f0bec1dd1e	iris: Destroy all batches with a new iris_destroy_batches() function Suggested-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:34:42 -08:00
Jordan Justen	5b4914aaf7	iris: Move away from "hw" for some context terminology Kernel contexts can take two forms now. In the older case a kernel context will have a single hardware context. With an "engines" based context, the context can now have 1 or more hardware contexts. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:31:03 -08:00
Jordan Justen	3643450dc0	iris/batch: Add exec_flags field Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:57 -08:00
Paulo Zanoni	dd89c6ca65	iris: extract iris_hw_context_set_unrecoverable() We're going to add a second caller. Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:57 -08:00
Jordan Justen	e88dcb38a1	iris/batch: Move kernel context init to iris_init_non_engine_contexts Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:49 -08:00
Jordan Justen	5b87f5c88a	iris: Add iris_init_batches Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:44 -08:00
Jordan Justen	0634cb741b	intel: Add intel_gem_create_context_engines Engines based contexts operate somewhat different for executing batches. Previously, we would specify a bitmask value such as I915_EXEC_RENDER to specify to run the batch on the render ring. With engines contexts, instead this becomes an array of "engines", and when the context is created we specify the class and instance of the engine. Each index in the array has a separate hardware-context. Previously we had to create separate kernel level contexts to create multiple hardware contexts, but now a single kernel context can own multiple hardware contexts. Another forward looking advantage to using the engines based contexts is that the kernel does not plan to add new supported I915_EXEC_FOO masks, whereas they instead plan to add new I915_ENGINE_CLASS_FOO engine classes. Therefore some rings may only be usable with an engine based class. Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:38 -08:00
Jordan Justen	9a9042a904	intel: Add intel_gem_count_engines Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12692>	2021-12-02 16:30:31 -08:00
Dylan Baker	76d13e1330	docs: Add calendar entries for 22.0 release candidates. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14029>	2021-12-02 15:54:18 -08:00
Chia-I Wu	bfa245e0c1	venus: fix vn_instance_wait_roundtrip when seqno wraps Signed-off-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14026>	2021-12-02 15:02:15 -08:00
Daniel Stone	47ed98f540	zink/ci: Add GL4.6 tessellation flake Seen in https://gitlab.freedesktop.org/mesa/mesa/-/jobs/16318152#L636 which is extremely unrelated. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14010>	2021-12-02 19:00:44 +00:00
Guilherme Gallo	dabc068e6c	ci: Use ci-fairy minio login via token file For every CI job, put JWT content into a file and unset CI_JOB_JWT environment var ======= * virgl jobs: - Share JWT token file to crosvm instance - Keep using `export -p` due to high complexity in the scripts of these jobs. At least, the CI_JOB_JWT will not be leaked, since it is being unset at the `before_script` phase of each Mesa CI job. * iris jobs: Update lava_job_submitter to take token file as argument - generate-env with CI_JOB_JWT_TOKEN_FILE - create token file during baremetal init stage * baremetal jobs: Copy token file to bare-metal NFS Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14004>	2021-12-02 18:01:29 +00:00
Guilherme Gallo	cdf8a14bff	ci: Uprev piglit Bring up the piglit replay jwt-file argument feature. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14004>	2021-12-02 18:01:29 +00:00
Guilherme Gallo	19cb49c280	ci: Update ci-fairy to version with --token-file support Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Cristian Ciocaltea <cristian.ciocaltea@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14004>	2021-12-02 18:01:29 +00:00
Alex Xu (Hello71)	3161bc5c1a	meson: check for lld split TLSDESC bug (fixes #5665 ) Reviewed-by: Emma Anholt <emma@anholt.net> Tested-by: Michel Dänzer <mdaenzer@redhat.com> Signed-off-by: Alex Xu (Hello71) <alex_y_xu@yahoo.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13934>	2021-12-02 09:18:50 -08:00
Leandro Ribeiro	7c57346dfd	egl/wayland: fix surface dma-buf feedback error exits This fixes a leak that was introduced in 89d15b9a "egl/wayland: add initial dma-buf feedback support". Do not leak dri2_surf->wl_dmabuf_feedback when we have to bail out because of allocation issues. Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13985>	2021-12-02 14:18:55 +00:00
Leandro Ribeiro	81361490ef	egl/wayland: do not try to bind to wl_drm if not advertised This fixes a bug that was introduced in 89d15b9a "egl/wayland: add initial dma-buf feedback support". Sometimes we have to fallback to wl_drm. But do not try to bind to it when it is not advertised by the compositor. This issue was found by n3rdopolis and reported here: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5697 Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13985>	2021-12-02 14:18:55 +00:00
Samuel Pitoiset	3fa2220838	radv: upload shader binaries of a pipeline contiguously in memory RGP expects shaders to be contiguous in memory, otherwise it explodes because we have to generate huge captures with lot of holes. This reduces capture sizes of Cyberpunk 2077 from ~3.5GiB to ~180MiB. This should also help for future pipeline libraries. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	a7f0463612	radv: pass a pointer to a pipeline for the create/insert cache functions Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	13143b3c11	radv: upload shader binaries after they are all compiled Instead of mixing compilation and upload. This will allow us to upload all shader binaries contiguously in memory and also for future pipeline libraries work. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	ff61b36ba2	radv: add a helper function to upload a shader binary Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Samuel Pitoiset	dd66de6017	radv: remove never reached free() when compiling shaders binary_out is never NULL and binaries are freed from the pipeline after they are added to the cache. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13690>	2021-12-02 07:17:04 +00:00
Ilia Mirkin	fc2cc39a0f	freedreno/ci/a306: split off snorm blending failures The hardware doesn't support this. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13990>	2021-12-02 03:39:28 +00:00
Ilia Mirkin	bbe5b745dc	freedreno/ci/a306: split off the f32 blend / texturing failures The hardware doesn't support this. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13990>	2021-12-02 03:39:28 +00:00
Ilia Mirkin	1f79c36dae	freedreno/ci/a306: separate msaa fails The driver does not implement MSAA. When that happens these can be split up further. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13990>	2021-12-02 03:39:28 +00:00
Jesse Natalie	c47fd3dc00	windows: Use TLS context/dispatch with shared-glapi However they have to be called via _glapi_get_dispatch/context. This would be safe to do on any platform, but the extra indirection is only necessary on Windows since TLS vars can't be exported from a DLL. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13634>	2021-12-02 03:03:14 +00:00
Ilia Mirkin	58aad3f403	freedreno/a3xx: add some legacy formats These can be used in "legacy" buffer textures. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13989>	2021-12-02 02:29:50 +00:00
Ilia Mirkin	41aa583edf	freedreno/ci/a306: add additional skip which hangchecks I was having trouble getting a run to complete without this. Was working earlier, not sure what changed. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13989>	2021-12-02 02:29:50 +00:00
Emma Anholt	59ba7a2ad8	freedreno/a6xx: Set the tess BO ptrs in the program stateobj. Saves some draw-time work for tess. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13851>	2021-12-02 01:47:38 +00:00
Emma Anholt	5495359085	freedreno/a6xx: Skip emitting tess BO pointers past the shader's constlen. Some shaders don't want these pointers, and going past the constlen would potentially overwrite consts from other draws. This is a port of a fix from turnip. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13851>	2021-12-02 01:47:38 +00:00
Emma Anholt	d7226e9a9e	freedreno/a6xx: Allocate a fixed-size tess factor BO. Saves per-batch allocations, avoids reallocation for various vertex counts, and avoids needing the indirect tess addrs constobj so that we could emit the relocs to the tess BO after we'd emitted all the draws. Also apparently it fixes one of our CTS fails. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13851>	2021-12-02 01:47:38 +00:00
Bas Nieuwenhuizen	577a0a7352	radv: Don't emit framebuffer state if there is no renderpass active. The framebuffer state could still be dirty from when the previous renderpass was bound. Fixes: `5632359959` ("radv: Remove the skipping of framebuffer emission if we don't have a framebuffer.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5702 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13969>	2021-12-02 00:35:28 +00:00
Jesse Natalie	c3e014670f	d3d12: Support compat level 330 Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/14001>	2021-12-01 23:48:57 +00:00
Ryan Neph	996e855e66	venus: ignore framebuffer for VkCommandBuffer executed outside of render pass The vulkan spec states[1]: > If the VkCommandBuffer will not be executed within a render pass instance, > or if the render pass instance was begun with vkCmdBeginRenderingKHR, > renderPass, subpass, and framebuffer are ignored. but venus will still try to encode them, resulting in a guest-side assert or host-side command stream error. [1]: https://www.khronos.org/registry/vulkan/specs/1.2-extensions/man/html/VkCommandBufferInheritanceInfo.html#_description Signed-off-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Yiwei Zhang <zzyiwei@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13988>	2021-12-01 23:35:42 +00:00
Emma Anholt	06fe04b4d7	nir: Make nir_build_alu() variants per 1-4 arg count. This saves a bunch of generated code to pack up the extra NULLs to get to 4 args, and saves executing the conditions in nir_build_alu() to then skip those NULLs. Saves another 27kb on disk. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13916>	2021-12-01 22:12:19 +00:00
Emma Anholt	e770ec1182	nir: Uninline a bunch of nir.h functions. I aimed for "things that look like big switch statements, or cases where the compiler is unlikely to be able to constant-propagate an argument into something useful." Saves another 80kb on disk. No perf difference on iris shader-db, n=23. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13916>	2021-12-01 22:12:19 +00:00
Nanley Chery	c394f2f0ea	iris: Drop redundant iris_resource_disable_aux call Drop the call to iris_resource_disable_aux in iris_resource_configure_aux. With the previous patches, we no longer create CCS surfaces and pick the AUX_NONE usage. As a result, if the aux usage is NONE, all iris_resource fields already indicate that aux is disabled. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12398>	2021-12-01 20:36:38 +00:00
Nanley Chery	137a054c94	iris: Enable CCS_E on 32-bpc float formats on TGL+ Allow CCS_E on these formats on TGL+ for a couple reasons: 1) TGL doesn't have the option to fall back to CCS_D/fast-clears like prior platforms do. 2) The CCS compression scheme on TGL improves to encode more than 3 levels of compression. This should help floating point formats. In my measurements, enabling this on TGL results in a minor performance improvement on Paraview (+0.06%) rather than a major regression like on prior platforms. The improvement was measured by taking the average of 3 runs of: waveletvolume.py -d 256 -f 600. Also, the Intel performance CI reports a 3.81% ±0.12% FPS improvement in Bioshock Infinite. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12398>	2021-12-01 20:36:38 +00:00
Nanley Chery	1433fe7860	intel/isl: Unify fmt checks in isl_surf_supports_ccs On TGL+, require that the surface format supports CCS_E in order to support CCS. This aligns with the ISL code that pads the primary surface for CCS on this platform. Pre-TGL, require support for either CCS_D or CCS_E. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12398>	2021-12-01 20:36:38 +00:00
Eric Engestrom	d9eaabf05d	docs: update calendar and link releases notes for 21.3.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13998>	2021-12-01 19:32:36 +00:00
Eric Engestrom	897dde881c	docs: add release notes for 21.3.1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13998>	2021-12-01 19:32:36 +00:00
Jesse Natalie	70dd119abd	CI/d3d12: Add a quick_shader run Refactor the YML for some DRY, and rename the existing pass from "-windows" to "-quick_gl" to disambiguate it. Reviewed-by: Enrico Galli <enrico.galli@intel.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13902>	2021-12-01 18:26:15 +00:00
Jesse Natalie	7afb4aba3f	CI/windows: Move reference files to relevant ci subdirectories Reviewed-by: Enrico Galli <enrico.galli@intel.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13902>	2021-12-01 18:26:15 +00:00
Jesse Natalie	c70e31c4d5	CI/windows: Move SPIRV-to-DXIL test YML to microsoft folder Reviewed-by: Enrico Galli <enrico.galli@intel.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13902>	2021-12-01 18:26:15 +00:00
Jesse Natalie	214168621d	CI/windows: Move D3D12 test YML to D3D12 driver folder Reviewed-by: Enrico Galli <enrico.galli@intel.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13902>	2021-12-01 18:26:15 +00:00
Rob Clark	145b0711fc	freedreno/crashdec: Basing GMU log decoding Looks like each entry is four dwords, with the second dword being a timestamp. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Rob Clark	8c654d02a3	freedreno/crashdec: Fallback to chip_id for GPU id Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Rob Clark	f33d5256dd	freedreno/crashdec: HFI queue decoding Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Rob Clark	2133d34b11	freedreno/crashdec: Split out mempool decoding Before we start adding GMU HFI decoding, lets split the other big section specific decoding (mempool) out into it's own file. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13937>	2021-12-01 17:53:21 +00:00
Emma Anholt	b234c538e8	turnip: Move CP_SET_SUBDRAW_SIZE to vkCmdBindPipeline() time. Now that the subdraw size is constant for a pipeline, this lets tess draws avoid the slow path in vkCmdDraw*(). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6089>	2021-12-01 16:57:30 +00:00
Jonathan Marek	fd11d99254	turnip: use SUBDRAW_SIZE and constant sized tess bos This fixes the problem of large indirect draws, and at the same time avoids allocating too large buffers for tessellation. Reworked by @anholt to use a separate tess factor BO so we can skip the WFIs to set the TESSFACTOR_ADDR. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6089>	2021-12-01 16:57:30 +00:00
Emma Anholt	3748b8afce	freedreno/ir3: Make a shared helper for the tess factor stride. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6089>	2021-12-01 16:57:30 +00:00
M Henning	17de0841ae	nouveau/nir: Use natural alignment for scalars We used to request vec4 alignment for everything on the nir codepath, but this triggers an assertion failure since `a0b82c24b6`, which prohibits vec4 alignment on scalars. Since requiring vec4 alignment on scalars is a little silly anyway, this patch relaxes the alignment to naturally aligned for scalars. Fixes about 27 crashing tests in piglit and deqp on kepler, including eg piglit/tests/spec/glsl-1.30/execution/fs-large-local-array.shader_test Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13883>	2021-12-01 16:10:57 +00:00
Lionel Landwerlin	698343edc5	util/u_trace/perfetto: add new env variable to enable perfetto When using the Vulkan API, command buffers can be recorded way before perfetto is enabled. This can be problematic if you want already recorded command buffers to produce traces. This new environment variable makes perfetto enabled internally so that command buffers are recorded with timestamps, even though no perfetto recording happens. v2: rename to GPU_TRACE_INSTRUMENT (Rob) v3: Move instrumentation check to generated headers (Danylo) Decouple instrumentation enabling from tracing (Danylo) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13911>	2021-12-01 15:14:05 +00:00
Lionel Landwerlin	65697d6141	util/u_trace: add end_of_pipe property to tracepoints In order to capture the timestamp when things actually end on Intel GPU HW, we need to know whether the timestamp should be capture at the top or end of pipeline. v2: use one line python if/else (Danylo) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13911>	2021-12-01 15:14:05 +00:00
Viktoriia Palianytsia	6f54ebe44f	glsl: fix for unused variable in glsl_types.cpp Unused variable vector_elements is now used in return from function decode_type_from_blob instead of encoded.basic.vector_elements. In the code we can see how those variables were equated and then the operations were made exactly to vector_elements. But variable didn't pass into any other variables or functions. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5536 Signed-off-by: Viktoriia Palianytsia <v.palianytsia@globallogic.com> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13725>	2021-12-01 09:33:41 +00:00
Marcin Ślusarz	4f58cc82e2	spirv: handle SpvOpMemberName Now we can see field names in structs instead of generic "fieldN" with NIR_PRINT=1. Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13941>	2021-12-01 08:56:29 +00:00
Lionel Landwerlin	8e568d3f00	nir/opt_deref: don't try to cast empty structures Found while running valgrind : ==3583454== Invalid read of size 4 ==3583454== at 0xF48336: glsl_get_struct_field_offset (nir_types.cpp:84) ==3583454== by 0xC7CD0D: opt_replace_struct_wrapper_cast (nir_deref.c:1068) ==3583454== by 0xC7CDD9: opt_deref_cast (nir_deref.c:1087) ==3583454== by 0xC7DD8E: nir_opt_deref_impl (nir_deref.c:1369) ==3583454== by 0xC7DF4E: nir_opt_deref (nir_deref.c:1428) ==3583454== by 0xA63F3C: brw_kernel_from_spirv (brw_kernel.c:325) ==3583454== by 0xA3BC2C: main (intel_clc.c:481) ==3583454== Address 0xe4f7e88 is 24 bytes after a block of size 48 in arena "client" Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13952>	2021-12-01 08:24:39 +00:00
Boris Brezillon	69ec384bba	gallium/d3d12: Don't use designated initializers Use of designated initializers requires at least '/std:c++20', and mesa is using c++14 by default. Fixes: `8d3a3e7a00` ("microsoft/compiler: Use textures for SRVs") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13912>	2021-12-01 08:51:17 +01:00
Boris Brezillon	83280b8e23	microsoft/compiler: Fix dxil_nir_create_bare_samplers() _mesa_hash_table_u64_search() returns the data directly, not an hash_entry object. We also need to take the descriptor set into account for this pass to work properly on Vulkan shaders. Fixes: `46bc7cf678` ("microsoft/compiler: Rewrite sampler splitting pass to be smarter and handle derefs") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13912>	2021-12-01 08:51:05 +01:00
Ilia Mirkin	c868bff36a	freedreno/ci: add piglit runs for a306 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13920>	2021-11-30 20:06:07 -05:00
Mauro Rossi	1ba231fb75	android: define cpp_rtti=false because libLLVM is built w/o RTTI (v2) libLLVM for Android is built without RTTI, but after commit `ad86267` mesa inherits meson default RTTI enabled state. cpp_rtti=false is added to meson options in android/mesa3d_cross.mk (v2) Add Fixes tag and use spaces instead of tabs for aligning the trailing \ Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Fixes: `ad862674` ("meson: Don't override built-in cpp_rtti option, error if it's invalid") Cc: "21.3" "21.2" mesa-stable Reviewed-by: Marijn Suijten <marijn.suijten@somainline.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13901>	2021-11-30 22:58:30 +01:00
Mauro Rossi	be7a0b23c8	Revert "android: define cpp_rtti=false because libLLVM is built w/o RTTI" This reverts commit `f659d00000`. The revert is done because essential Fixes tag was missing and to apply a better version that could be picked for mesa-stable. Acked-by: Marijn Suijten <marijn.suijten@somainline.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13901>	2021-11-30 22:58:23 +01:00
Rhys Perry	6afba80534	aco: don't create DPP instructions with SGPR operands Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `2e6834d4f6` ("aco: combine DPP into VALU before RA") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13976>	2021-11-30 20:11:48 +00:00
Alyssa Rosenzweig	8d2be391e3	panfrost: Add empty tile flags to GenXML These flags control special CRC handling for empty tiles using the CRC clear colour field added on Bifrost. Their use depends on CRC being used. We missed these flags earlier; let's add them since they are used by the Valhall DDK but are not new to Valhall. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13982>	2021-11-30 15:43:59 +00:00
Samuel Pitoiset	8f00f19da5	radv: fix resetting the entire vertex input dynamic state If there is holes, eg. the application firsts set vertex attributes 0 and 1, then vertex attributes 0 and 7, the format of vertex attribute 1 is still the previous one, while it should be FORMAT_INVALID to avoid a GPU hang. This fixes a GPU hang with Yuzu. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5627 Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13856>	2021-11-30 15:10:39 +00:00
Nanley Chery	18cd0a5409	anv: Drop code from get_blorp_surf_for_anv_buffer The code to handle ASTC surfaces hasn't been needed since commit `dd92179a72` ("anv: Canonicalize buffer formats for image/buffer copies"). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	355f318843	anv: Allow transfer-only linear ASTC images Some apps depend on this to run. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/2397 Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	bdf8b36c4c	anv: Require transfer features for transfer usages In order for an image to support the transfer usage, require that its format can be used for blits or copies. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	8171535c45	iris: Allow GPU-based uploads of ASTC textures ISL recently started allowing linear ASTC surfaces to be created. With that in place, iris can perform GPU-based uploads to ASTC textures in the same way it does so with other compressed surfaces. We're not aware of any reason to continue special-casing ASTC texture uploads, so we get rid of the code which does so. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Nanley Chery	caa998ca8f	intel/isl: Allow creating non-Y-tiled ASTC surfaces The sampler can only decode ASTC surfaces that are Y-tiled. ISL has been asserting this restriction at surface creation time. However, some drivers want to create a surface that is only used for copying compressed data. And during the copy, the surface won't have a compressed format. To enable this behavior, we choose to move the tiling assertion to the moment a surface state is created for the sampler. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13881>	2021-11-30 13:36:35 +00:00
Kenneth Graunke	574c5d1540	blorp: Disallow multisampling for BLORP compute blits and copies. We don't support typed image writes for multisampling, so we can't handle multisampled destinations. We also usually handle MSAA by running the fragment shader per-sample, which we aren't accounting for in our compute shaders, so we can't handle MSAA sources either. We could do both of these things if we really wanted to, but we don't. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Kenneth Graunke	f0744ebef2	blorp: Assert that BLORP_BATCH_PREDICATE_ENABLE isn't set for compute We don't support this, so make sure it isn't happening. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Kenneth Graunke	5dc36e5e93	blorp: Don't try to use the 3D stencil write hardware for compute When we're doing a stencil blit via a fragment shader, we can avoid W-tiling shenanigans by using the stencil write hardware on Skylake and later. Of course, the compute engine doesn't have stencil fragment writes, so it can't do that. Just fall back to the detiling shenanigans. Caught by Piglit's arb_copy_image-formats when forcing iris to use BLOCS for resource_copy_region on Icelake. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Kenneth Graunke	d832209a78	blorp: Fix compute-blits for rectangles not aligned to the workgroup When dispatching compute shaders to do a blit, our destination rectangle may not line up perfectly with the workgroup size. For example, we may round the left x0 coordinate down to a multiple of the workgroup width, and the right x1 coordinate up to the next multiple of the workgroup width. Similarly for y0/y1 and workgroup height. This means that we may dispatch additional invocations which should not actually do any blitting. We need to set key->uses_kill to bounds check and drop those. Caught by Piglit's arb_copy_image-simple when forcing iris to perform resource_copy_region via BLOCS and running with INTEL_DEBUG=norbc on Icelake. Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13524>	2021-11-30 12:30:50 +00:00
Filip Gawin	80c2b27438	iris: fix mapping compressed textures This code was originally made for crocus by Dave Airlie. Iris is also affected, so this commit ports the fix. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12993>	2021-11-30 10:07:35 +00:00
Qiang Yu	fcc062235c	ci: remove egl-copy-buffers from fail list egl-copy-buffers test has been fixed for dri3. So remove it from broadcom and freedreno ci fail list to prevent the gitlab ci test fail: spec@egl 1.4@egl-copy-buffers,UnexpectedPass Also remove it from radeonsi ci fail list since I verified on radeonsi. Acked-by: Daniel Stone <daniels@collabora.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13868>	2021-11-30 01:58:42 +00:00
Qiang Yu	24aa03e9e8	loader/dri3: fix piglit egl-copy-buffer test In the test no front buffer has been allocated on the client side, so we get a segfault when access it directly. eglCopyBuffers() just need to do server side copy, so we don't really need to create a client side front buffer to perform the copy. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13868>	2021-11-30 01:58:42 +00:00
Mykhailo Skorokhodov	391569e911	nir: Fix read depth for predecessors In some non-trivial cases (the amber script file in the merge request description) phi instruction has more than 32 elements in predecessors tree and that isn't recursion, just large tree. In that case, phis not fully converted into a register or mov, but successfully removed. The fix removes the counter and adds container of visited blocks. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3690 Cc: mesa-stable Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13710>	2021-11-30 00:12:48 +00:00
Ilia Mirkin	e31d08d307	ci: move windowoverlap exclusion to all-skips The test is just plain not built by our containers. Skip it everywhere. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13919>	2021-11-29 18:08:49 -05:00
Rhys Perry	32a8b391e3	nir/tests: add DCE test for loops following a jump Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10284>	2021-11-29 22:22:24 +00:00
Rhys Perry	cc5dd15417	nir/cf: fix insertion of loops/ifs after jumps Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10284>	2021-11-29 22:22:24 +00:00
Rhys Perry	2fe13aa2ad	nir/dce: fix DCE of loops with a halt or return instruction in the pre-header If there is a halt or return instruction right before a loop with a single continue, we would have taken the fast path intended for loops without continues. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Tested-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `71a985d80b` ("nir/dce: perform DCE for unlooped instructions in a single pass") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10284>	2021-11-29 22:22:24 +00:00
Vasily Khoruzhick	34a75ce15c	lima: fix blending with min/max ops It turns out that BLEND_MIN and BLEND_MAX in Utgard take blend factors into account. My guess is that actual equation looks like: OP(As * S + Ad * D, Ad) for alpha, and OP(Cs * S + Cd * D, Cd) for color. So we have to set S factor to 1 and D factor to 0 to be compliant with GL spec. Fixes following piglit tests: spec@!opengl 1.4@blendminmax spec@arb_blend_func_extended@arb_blend_func_extended-fbo-extended-blend (with patch my for ES2_compatibility and EXT_blend_func_extended) Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13873>	2021-11-29 19:31:59 +00:00
Vasily Khoruzhick	5f9434b611	lima: use 1 as blend factor for dst_alpha for SRC_ALPHA_SATURATE As per [1] alpha blend factors for Sa and Da should be 1 for SRC_ALPHA_SATURATE [1] https://www.khronos.org/registry/OpenGL/extensions/ARB/ARB_blend_func_extended.txt Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13873>	2021-11-29 19:31:59 +00:00
Vasily Khoruzhick	d1d3ebb48c	lima: implement dual source blend It was a bit trickier to RE, since blob doesn't expose this functionality at all, however we had a clue from the very beginning: lima_blend_factor is 3 bits, i.e. 8 values, but only 5 of them were used, it just waited till someone tried what 3 unused values do. Interestingly enough, it turns out "5" works just as "0" (which is PIPE_BLENDFACTOR_SRC_), but only if output register for gl_FragColor is $0, So it looks suspiciously similar with PIPE_BLENDFACTOR_SRC1_ behavior, and looks like secondary output is taken from $0. Since output regs for all other outputs are configured via RSW, there must be a field in RSW for output register for secondary color, it's likely 4 bits and it's currently set to 0 for reg $0. Then it was just a matter of brute-forcing various consecutive 4 bits in RSW - and indeed, setting top 4 bits of rsw->aux0 to the index of gl_FragColor output register fixes blending tests when we use "5" blend factor instead of "0". So it must be a register number for gl_SecondaryFragColor. Unlike gl_FragColor, the field is only repeated once in RSW. Wire it up in compiler, and piglit arb_blend_func_extended now passes. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13873>	2021-11-29 19:31:59 +00:00
Rhys Perry	65a78b2252	aco: properly update use counts if a extract is still used Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13909>	2021-11-29 18:52:12 +00:00
Vasily Khoruzhick	b8f4d36ee4	lima: disasm: call util_cpu_detect() to init CPU caps It's needed by _mesa_half_to_float(), without this change it hits assertion failure in util_get_cpu_caps(). Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13968>	2021-11-29 18:34:58 +00:00
Vasily Khoruzhick	711a4ccddb	lima: disasm: use last argument as a filename Otherwise it fails to open a file. Fixes: `9660427ab7` ("lima: Print usage if --help is any of the arguments.") Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13968>	2021-11-29 18:34:58 +00:00
Vasily Khoruzhick	437b97de1c	lima: fix crash with sparse samplers Fixes following piglit tests: spec@arb_fragment_program@fp-fragment-position spec@arb_fragment_program@sparse-samplers Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13939>	2021-11-29 18:19:19 +00:00
Marek Olšák	e48f676835	glthread: don't sync for more glGetIntegerv enums for glretrace This makes glretrace faster with glthread. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13970>	2021-11-29 14:28:12 +00:00
Iago Toral Quiroga	996f147fef	broadcom/compiler: relax restriction on VPM inst in last thread end slot According to the documentation, only vpmwt is disallowed in the last delay slot of the thread end. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13975>	2021-11-29 14:06:43 +00:00
Filip Gawin	57969c6dad	radv: dont call calloc when BVH is empty Usage of pointer returned by calloc(0) is UB. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13503>	2021-11-29 13:38:37 +00:00
Samuel Pitoiset	8eae431720	radv/llvm: constify radv_shader_info Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:07 +00:00
Samuel Pitoiset	6c1cd2fd3a	radv/llvm: stop trying to eliminate VS outputs This has no effects, except for XFB but that shouldn't really matter. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:06 +00:00
Samuel Pitoiset	5a94271069	radv: constify radv_shader_info in radv_declare_shader_args() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:06 +00:00
Samuel Pitoiset	096c02bcf5	radv: copy the user SGPRs locations outside of radv_declare_shader_args() The shader locations are now directly stored in radv_shader_args which makes sense because they are tied to the arguments. The locations are then copied to radv_shader_info but they will be moved into a new radv_shader_binary_info with upcoming changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:06 +00:00
Samuel Pitoiset	3bbc226d7a	radv: configure the number of SGPRs/VGPRs directly from the arguments Instead of copying the values to radv_shader_info. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13542>	2021-11-29 10:10:06 +00:00
Samuel Pitoiset	990a8ee5eb	radv: add a workaround to fix a segfault with Metro Exodus (Linux native) The game calls vkGetSemaphoreCounterValue() with an invalid semaphore handle and it crashes. This is an invalid Vulkan usage and it should be fixed in the game. I reported the issue to the developers. Workaround this temporarily (hopefully) by ignoring vkGetSemaphoreCounterValue() if the semaphore is NULL from an internal RADV layer. Cc: 21.3 mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5119 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13956>	2021-11-29 09:01:30 +00:00
Samuel Pitoiset	a58f68fc68	radv: fix accessing NULL pointers when destroy the VRS image Detected by UBSAN. ../src/amd/vulkan/radv_private.h:2939:1: runtime error: member access within null pointer of type 'struct radv_device_memory' ../src/amd/vulkan/radv_private.h:2926:1: runtime error: member access within null pointer of type 'struct radv_buffer' ../src/amd/vulkan/radv_private.h:2945:1: runtime error: member access within null pointer of type 'struct radv_image' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13965>	2021-11-29 08:55:02 +01:00
Iago Toral Quiroga	6923dd687c	broadcom/compiler: allow color TLB writes in last instruction Only Z writes are disallowed. total instructions in shared programs: 11578449 -> 11577369 (<.01%) instructions in affected programs: 38132 -> 37052 (-2.83%) helped: 1080 HURT: 0 Instructions are helped. total max-temps in shared programs: 2334416 -> 2334395 (<.01%) max-temps in affected programs: 218 -> 197 (-9.63%) helped: 21 HURT: 0 Max-temps are helped. total inst-and-stalls in shared programs: 11607890 -> 11606810 (<.01%) inst-and-stalls in affected programs: 38265 -> 37185 (-2.82%) helped: 1080 HURT: 0 Inst-and-stalls are helped. total nops in shared programs: 338316 -> 337236 (-0.32%) nops in affected programs: 2625 -> 1545 (-41.14%) helped: 1080 HURT: 0 Nops are helped. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13964>	2021-11-29 06:44:07 +00:00
Ilia Mirkin	f533d7a446	freedreno/ir3: get the post-lowering clip/cull mask The variant may include a lowered gl_Clip/CullDistance array. So we have to use the variant's info (which is not available). However we save off the clip/cull masks already, so just reuse those. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13891>	2021-11-28 02:55:58 -05:00
Ilia Mirkin	13fb587b8a	freedreno/ir3: indicate that clipdist arrays are in use We expose the compact array cap, which means that we get compact clipdist arrays. Indicate this to the lowering pass so that it works for gl_ClipDistance from fs, among others. Fixes, among others, on a420, tests/spec/glsl-1.30/execution/clipping/fs-clip-distance-interpolated.shader_test Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13891>	2021-11-28 02:55:58 -05:00
Ilia Mirkin	b7f423006a	nir/lower_clip: support clipdist array + no vars This runs after the "to io" lowering on freedreno. Support this case. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13917>	2021-11-28 04:44:56 +00:00
Ilia Mirkin	7efb1c4b29	nir/lower_clip: increment num_inputs/outputs by appropriate amount The inputs/outputs are meant to be in vec4 units. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13917>	2021-11-28 04:44:56 +00:00
Ilia Mirkin	3bf47700e2	nir/lower_clip: location offset goes into offset, not base Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13917>	2021-11-28 04:44:56 +00:00
Ilia Mirkin	a8930e6302	nir/lower_clip: replace bogus comment about gl_ClipDistance reading in GL gl_ClipDistance most definitely can be read in fragment shaders since GLSL 1.30. This is also accessible in ES with EXT_clip_cull_distance. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13917>	2021-11-28 04:44:56 +00:00
Leandro Ribeiro	b5848b2dac	egl/wayland: use surface dma-buf feedback to allocate surface buffers As explained in "egl/wayland: add initial dma-buf feedback support", we still don't use the per-surface dma-buf feedback. In this patch we start to use it. If per-surface dma-buf feedback is advertised, use it to allocate surface buffers. Also, the dma-buf protocol states that the feedback is resent only when the client is using a suboptimal format/modifier pair. So listen for new per-surface feedback events and reallocate the surface buffers based on them. We can't change the format of a buffer, but we can pick a new modifier. This patch is based on previous work of Scott Anderson (@ascent). Signed-off-by: Scott Anderson <scott.anderson@collabora.com> Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	5b56fc748a	egl/wayland: move loader_dri_create_image() calls to separate functions In get_back_bo() we have two calls to loader_dri_create_image() and some overhead. As in the next commit we add another call to this same function and more overhead to get_back_bo(), it starts to lose legibility. So move loader_dri_create_image() calls to separate functions, allowing us to have an easier to read get_back_bo(). It also adds some minor style changes. Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	83916ae0b9	egl/wayland: add initial dma-buf feedback support This bumps the supported dma-buf version up to 4 and adds the initial dma-buf feedback implementation. It follows the changes in the dma-buf protocol extension [1] to include the dma-buf feedback interface, which should be incorporated by most Wayland compositors in the future. From version 4 onwards, the dma-buf modifier events are not sent by the compositor anymore, so we use the default feedback to pick the set of formats/modifiers supported by the compositor. Also, we try to avoid the wl_drm device event and instead use the dma-buf feedback main device. We only fallback to wl_drm when the compositor advertises a device that does not have a render node associated. In this initial dma-buf feedback implementation we still don't do anything with the per-surface dma-buf feedback, but in the next commits we add proper support. It's important to mention that this also bumps the minimal supported version of wayland-protocols to 1.24, in order to include [1]. [1] https://gitlab.freedesktop.org/wayland/wayland-protocols/-/merge_requests/8 This patch is based on previous work of Scott Anderson (@ascent). Signed-off-by: Scott Anderson <scott.anderson@collabora.com> Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	a25d4dd276	loader: add function to get render node from dev_t Add function loader_get_render_node() to help us to get a render node from dev_t. If the device does not expose a render node, this new function returns NULL. As this function uses drmGetDeviceFromDevId(), we bump libdrm minimal version to 2.4.109. Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	cd39180cfa	egl/wayland: remove unused constant EGL_DRI2_NUM_FORMATS Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	aa6c900625	egl/wayland: move formats and modifiers to a separate struct This will allow us to remove EGL_DRI2_NUM_FORMATS (as explained in "egl/wayland: remove unused constant EGL_DRI2_NUM_FORMATS") and it will also help to add the dma-buf feedback support. Both changes happen in the next commits. Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	6c37784400	egl/wayland: do not try to access memory if allocation failed Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	af1ee8e010	egl/wayland: deprecate drm_handle_format() and drm_handle_capabilities() Most Wayland compositors have already implemented the dma-buf protocol extension. So we can stop relying on the wl_drm events and start to depend only on the dma-buf interface to receive format/modifier pairs and create wl_buffer's. So we can deprecate drm_handle_format() and drm_handle_capabilities(). Note that we still use the wl_drm interface to find out the DRM device that the compositor is using, so we can't deprecate it fully for now. In the future (when the dma-buf feedback interface is added to the dma-buf protocol extension [1] and most compositors incorporate it) we may be able to fully deprecate wl_drm. [1] https://gitlab.freedesktop.org/wayland/wayland-protocols/-/merge_requests/8 Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	3022ad7e15	egl/wayland: replace EGL_DRI2_MAX_FORMATS by EGL_DRI2_NUM_FORMATS Currently we have a weird design. We have a hardcoded array (named dri2_wl_visuals) with all the formats that we support, which are 9. And we also have EGL_DRI2_MAX_FORMATS, which is a constant set to 10. In patches in which people added new formats to dri2_wl_visuals, this constant had its value increased. This is confusing, as its name gives the idea that we can't support more formats. This constant is only used to define the bitset size of dri2_egl_display::formats. And it should work just fine if we created this bitset with the number of formats supported. To make things clearer, replace EGL_DRI2_MAX_FORMATS by EGL_DRI2_NUM_FORMATS, which must be equal to ARRAY_SIZE(dri2_wl_visuals) (i.e. the number of supported formats). In the next commits we get rid of this constant completely, as it is prone to errors. Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Scott Anderson	42a0e5caa9	egl/wayland: Remove unused wayland enum Signed-off-by: Scott Anderson <scott@anderso.nz> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Leandro Ribeiro	c53733b004	egl: remove unnecessary spaces after types If we want to add a variable with a type name that is too long, we have to realign all the variables within these structs. The other option is to ignore and don't realign, and then we end up with a very ugly code. So get rid of these unnecessary spaces, as they don't bring anything useful. Instead, they are annoying. Signed-off-by: Leandro Ribeiro <leandro.ribeiro@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Daniel Stone	be1acac84c	ci: Upgrade to libdrm 2.4.109 Required for being able to use drmGetDeviceFromDevId in the loader. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Daniel Stone	a2fd507973	ci: Consistently build Wayland and protocols Rather than relying on distro packages, build libwayland and wayland-protocols from known versions everywhere we need it. The only place we do not do so but rely on distro packages is the LAVA rootfs, for which it does not matter right now since the version is sufficiently new, but this could/should be cleaned up later. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Daniel Stone	9bab991be0	ci: Use common build script for libwayland Rather than open-coding libwayland install for each container, create a common build script like the rest, using both git and meson like the rest. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11248>	2021-11-26 16:06:09 +00:00
Samuel Pitoiset	db3d76c42d	radv: advertise VK_KHR_synchronization2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:23 +00:00
Samuel Pitoiset	52b4185012	radv: switch the remaining stages/access to VK_PIPELINE_STAGE_2/VK_ACCESS_2 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:23 +00:00
Samuel Pitoiset	ad3cd581f4	radv: add support for VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL_KHR Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:23 +00:00
Samuel Pitoiset	936ac58b77	radv: add support for new pipeline stages and access masks Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	3f29ae2d31	radv: add support for creating device-only events Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	e99c66ad19	radv: add support for VkMemoryBarrier2KHR Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	3da7d10d9b	radv: implement vkQueueSubmit2KHR() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	8df17163c7	radv: implement vkCmdWaitEvents2KHR()/vkCmdPipelineBarrier2KHR() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	57575974fd	radv: implement vkCmdWriteBufferMarker2AMD() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	cff81c863b	radv: implement vkCmd{Reset,Set}Event2KHR() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Samuel Pitoiset	a0ac03676f	radv: implement vkCmdWriteTimestamp2KHR() Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13549>	2021-11-26 13:41:22 +00:00
Marek Olšák	1df7c0ce7e	radeonsi: print the shader stage for shader-db dumps Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	59926f25fa	radeonsi: print source_sha1 as part of shader dumps It's not part of the shader key, but I don't know where else to put it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	e54264c84f	nir: add shader_info::source_sha1, its initialization and printing Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	ba8075031a	util: add SHA1 printing and comparison functions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	0b196b40a3	mesa: don't compute the same SHA1 twice in glShaderSource We can just use original_sha1. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	67f01e02d9	mesa: add gl_linked_shader::linked_source_sha1 Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	4364449677	mesa: add shader source SHA1s that are propagated up to glCompileShader glCompileShader can use two different sources, so we need 2 different SHA1s there. Successful compilation sets compiled_source_sha1. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	d473b31fe7	mesa: rename gl_shader::sha1 to disk_cache_sha1 there will be more sha1s Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	0bb580754c	mesa: remove SourceChecksum from shader structures it will be replaced by sha1. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13869>	2021-11-26 11:58:27 +00:00
Marek Olšák	cd86f1dc2b	radeonsi: rename si_get_shader_wave_size and make it non-inline Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	676d4ddcf8	radeonsi: centralize wave size computation in si_get_shader_wave_size The big comment was not really true. The other debug options are unused right now, but will be used again in the future. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	b5665bd46c	radeonsi: don't use compute_wave_size directly It will be removed. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	1ef027851d	radeonsi: propagate si_shader::wave_size to VGT_SHADER_STAGES instead of hardcoding them Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	913e1b9138	radeonsi: clean up compute_wave_size use in si_compute_blit.c Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	8290cae2b7	radeonsi: don't use si_get_wave_size in si_get_ir_cache_key Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	d08b09cb7e	radeonsi: use si_shader::wave_size Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	bc57488936	radeonsi: add si_shader::wave_size because it will vary Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	41523773f5	radeonsi: add wave32 flag into prolog/epilog keys It will vary between shaders. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	3b2a6e1b21	radeonsi: don't print uninitialized inlined_uniform_values We don't set them and we don't read them if they are disabled, so don't print them either. This silences valgrind warnings. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13878>	2021-11-26 11:35:05 +00:00
Marek Olšák	94dd401287	driconf: enable glthread for Basemark GPU Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13947>	2021-11-26 10:54:49 +00:00
Marek Olšák	5a970dbac3	driconf: enable glthread for Minecraft +30% performance Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13947>	2021-11-26 10:54:49 +00:00
Marek Olšák	9c4d92508d	driconf: enable glthread for all Unigine benchmarks It helps. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13947>	2021-11-26 10:54:49 +00:00
Samuel Pitoiset	add883bf9b	aco: fix right shift of exponent 32 detected by UBSAN src/amd/compiler/aco_optimizer.cpp:1316:17: runtime error: shift exponent 32 is too large for 32-bit type 'unsigned int' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13951>	2021-11-25 16:15:30 +00:00
Samuel Pitoiset	d18720897f	radv: fix OOB access for inline push constants detected by UBSAN src/amd/vulkan/radv_cmd_buffer.c:3232:75: runtime error: index 252 out of bounds for type 'uint8_t [128]' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13951>	2021-11-25 16:15:30 +00:00
Samuel Pitoiset	8e3fbe7cc8	ac/nir: fix left shift of 1 by 31 places detected by UBSAN src/amd/common/ac_nir_lower_ngg.c:1135:62: runtime error: left shift of 1 by 31 places cannot be represented in type 'int src/amd/common/ac_nir_lower_ngg.c:622:20: runtime error: left shift of 1 by 31 places cannot be represented in type 'int' Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13951>	2021-11-25 16:15:30 +00:00
Marius Hillenbrand	a46d155329	util/cpu_detect, gallium: use cpu_family CPU_S390X instead of separate flag to also get rid of the additional function that I introduced before. Fixes: `82b261417e` ("util/cpu_detect: Add flag for IBM Z (s390x)") Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13958>	2021-11-25 12:57:20 +00:00
Danylo Piliaiev	a78c36ecc6	ir3/cp: Prevent setting an address on subgroup macros These macros expand to a mov in an if statement which breaks address assumption that instruction which produces address and consumes it are in the same block. Fixes test: dEQP-VK.subgroups.ballot_broadcast.framebuffer.subgroupbroadcast_intvertex Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13931>	2021-11-25 12:18:48 +00:00
Marek Olšák	d14c09f7f6	mesa: add a more straightforward callback for replacing shaders Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13946>	2021-11-25 10:39:30 +00:00
Connor Abbott	969369e962	ir3/lower_subgroups: Fix potential infinite loop I was trying to be clever here, skipping ahead to the newly-created block and processing the remaining instructions after the split in the same loop. But if the last instruction in a block was lowered, the saved next instruction would be the head of the block before the split, not the new block, and we would compare it to the new block so we wouldn't stop like we were supposed to. Stop being so clever, and just restart processing with the new block after lowering an instruction. Because we're wrapping the actual transform in yet another loop, and the restarting logic is a bit tricky, refactor the actual lowering into a separate lower_instr function. Otherwise we'd be mixing the two and indenting the actual logic even more. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13928>	2021-11-25 10:16:48 +00:00
Dylan Baker	fc8e789b21	docs/release-calendar: remove additional 21.2 releases 21.3 is here, and will be releasing the .1 release before any of the additional 21.2 releases would be made anyway. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13950>	2021-11-24 21:25:02 -08:00
Dylan Baker	ed8bac5eb3	docs: update calendar and link releases notes for 21.2.6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13950>	2021-11-24 21:24:37 -08:00
Dylan Baker	6d37d741a4	docs: add sha256 sums for 21.2.6 relnotes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13950>	2021-11-24 21:24:15 -08:00
Dylan Baker	0481c17c6c	docs: add release notes for 21.2.6 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13950>	2021-11-24 21:24:13 -08:00
Alejandro Piñeiro	f8009d3db2	meson: bump meson requirement to 0.53.0 Needed to avoid the following error: meson.build:936:0: ERROR: Tried to access unknown option "cpp_rtti" Fixes: `ad86267412` ("meson: Don't override built-in cpp_rtti option, error if it's invalid") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13905>	2021-11-24 21:45:37 +00:00
Bas Nieuwenhuizen	cf6a14de0c	radv: Set RB+ registers correctly without framebuffer. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13699>	2021-11-24 18:22:10 +00:00
Bas Nieuwenhuizen	5632359959	radv: Remove the skipping of framebuffer emission if we don't have a framebuffer. This was plain broken. The solution is to not require any framebuffer changes. Silently skipping results in broken behavior. e.g: (secondary cmdbuffer with no framebuffer) ClearAttachment 2 translated into bind attachment 2 as attachment 0 (skipped) clear attachment 0 restore original bindings (skipped) which results in clearing attachment 0, not what we wanted. It is a small wonder CTS doesn't find it until VK_KHR_dynamic_rendering. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13699>	2021-11-24 18:22:10 +00:00
Bas Nieuwenhuizen	728590403e	radv: Stop using a subpass for color clears. They might not be available in secondary cmdbuffers with inheritance. To avoid binding anything we need to create pipelines per attachment index. I've excluded these from the "compile on device creation" set because I think almost nobody will need them. Alternative solution would be to reuse the same shader but muck with a bunch of registers to shift them for the attachment index. That is however a lot of complexity and has to execute on every pipeline change, which is probably more expensive in overhead and definitely in complexity. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13699>	2021-11-24 18:22:10 +00:00
Bas Nieuwenhuizen	fcd2e69f7d	radv: Avoid using a new subpass for ds clears. If we have an inherited subpass in a cmdbuffer we can't really emit any framebuffer data if it isn't provided. Note we still do it for resolve clears, but we can't have that in a secondary cmdbuffer with inherited renderpass. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13699>	2021-11-24 18:22:10 +00:00
Lionel Landwerlin	14e45cb21e	util/u_trace: refcount payloads When cloning a chunk of tracepoints, we cannot just copy the elements of the traces[] array. We also need the payloads associated with those. This change introduces a new u_trace_payloaf_buf object that is refcounted so that we can easily import traces[] elements and their payloads from one utrace to another. v2: use u_vector (Danylo) v3: Delete outdate comment (Danylo) Fix assert (Danylo) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `0565c993f9` ("u_trace: helpers for tracing tiling GPUs and re-usable VK cmdbuffers") Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13899>	2021-11-24 17:53:08 +00:00
Lionel Landwerlin	87888c0b3f	anv: fix execbuf syncobjs/syncobj_values array leak Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `36ea90a361` ("anv: Convert to the common sync and submit framework") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13945>	2021-11-24 17:29:46 +00:00
Rhys Perry	34510ce3cc	nir/lower_subgroups: fix left shift of -1 Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5365 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12901>	2021-11-24 16:45:05 +00:00
Rhys Perry	811a7a2d31	nir/lower_tex: don't calculate texture_mask for texture_index>=32 With Vulkan, texture_index can be 32 or larger, which creates a shift exponent larger than 31 (undefined behaviour). Since we don't use texture_mask with Vulkan, just initialize it to 0. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5365 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12901>	2021-11-24 16:45:04 +00:00
Rhys Perry	26d2e22eea	radv: stop running copy-propagation before nir_opt_deref spirv_to_nir() now does this. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13924>	2021-11-24 15:43:51 +00:00
Rhys Perry	b425100781	spirv: run nir_copy_prop before nir_rematerialize_derefs_in_use_blocks_impl spirv_to_nir sometimes wraps derefs in vec2 or mov instructions as part of its texture handling. These get in the way of nir_rematerialize_derefs_in_use_blocks_impl. Running copy propagation should get rid of the extra move instructions and get us back to intact deref chains for everything except variable pointer use-cases. fossil-db (Sienna Cichlid): Totals from 6 (0.00% of 134572) affected shaders: CodeSize: 92656 -> 93088 (+0.47%) Instrs: 17060 -> 17138 (+0.46%) Latency: 224408 -> 227539 (+1.40%) InvThroughput: 37402 -> 37924 (+1.40%) VClause: 408 -> 402 (-1.47%) Copies: 1065 -> 1107 (+3.94%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5668 Fixes: `14a12b771d` ("spirv: Rework our handling of images and samplers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13924>	2021-11-24 15:43:51 +00:00
Sergii Melikhov	7fb6eafbc4	vulkan: Unlock before return. Fix defect reported by Coverity Scan CID-1494382. Missing unlock (LOCK): Returning without unlocking queue->submit.mutex. Fixes: `9bffd81f1c` ("vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle") Signed-off-by: Sergii Melikhov <sergii.v.melikhov@globallogic.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13933>	2021-11-24 15:15:46 +00:00
Rhys Perry	d5b41cbb4a	radv: fix max_render_backends for Sienna Cichlid null winsys This affects NGG culling. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13926>	2021-11-24 14:27:39 +00:00
Rhys Perry	f67b09e37c	radv: make RADV_FORCE_FAMILY case-insensitive So I don't have to update my scripts each time I switch between before/after `cfc5c2abfd`. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13926>	2021-11-24 14:27:39 +00:00
Marek Olšák	694731ac13	ac/surface: allow gfx6-8 to enter the gfx9 DCC codepath for SI_FORCE_FAMILY Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13871>	2021-11-24 13:55:23 +00:00
Marek Olšák	d830d213b6	ac/gpu_info: don't fail on amdgpu_query_video_caps_info failures When VCN is unsupported, we don't want to break GL or Vulkan. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13871>	2021-11-24 13:55:23 +00:00
Alejandro Piñeiro	a9b4aef0f2	broadcom/compiler: make shaderdb debug output compatible with shaderdb's report tool Even although the option is called shaderdb, it is not really used by shaderdb (for V3D shaderdb uses the debug option "precompile"). And in fact, right now the output format is not compatible with shaderdb. This commit tries to fix that, and as we are here, also try to make the option more useful for the Vulkan case, as that debug option also works with v3dv. We can't really fully imitate shaderdb use with OpenGL (run with a set of glsl shader tests), but we can at least assign a unique name (the pipeline sha1 in text format) so we can compare executions of the same vulkan application. For that remember to disable the on-disk cache. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13938>	2021-11-24 13:02:08 +00:00
Marek Olšák	6c78ec4eac	mesa: add allow_glsl_compat_shaders for shader-db Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13870>	2021-11-24 10:28:15 +00:00
Marek Olšák	09342dcfc0	mesa: don't add attenuation constants if ffvp doesn't use them This slightly decreases the size of constant buffers. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13872>	2021-11-24 09:57:39 +00:00
Samuel Pitoiset	deb4685df3	radv: implement optimized MSAA copies using FMASK Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12555>	2021-11-24 08:03:47 +00:00
Samuel Pitoiset	51612b0e95	radv: make radv_copy_buffer() a non-static function Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12555>	2021-11-24 08:03:47 +00:00
Samuel Pitoiset	e8c0eb9598	radv: make radv_break_on_count() a non-static function Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12555>	2021-11-24 08:03:47 +00:00
Georg Lehmann	d106e5c732	amd/addrlib: Use get_supported_arguments to get compiler args. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11609>	2021-11-24 07:03:54 +00:00
Georg Lehmann	a6f783948d	meson: Remove some unnecessary loops. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11609>	2021-11-24 07:03:54 +00:00
Georg Lehmann	6c89f09b7b	meson: Use get_supported_arguments more often. Signed-off-by: Georg Lehmann <dadschoorse@gmail.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11609>	2021-11-24 07:03:54 +00:00
Vasily Khoruzhick	3b15fb3575	lima/ppir: implement gl_FragDepth support Mali4x0 supports writing depth and stencil from fragment shader and we've been using it quite a while for depth/stencil buffer reload. The missing part was specifying output register for depth/stencil. To figure it out, I changed reload shader to use register $4 as output and poked RSW bits (or rather consecutive 4 bit groups) until tests that rely on reload started to pass again. It turns out that register number for gl_FragDepth/gl_FragStencil is in rsw->depth_test and register number for gl_FragColor is in rsw->multi_sample and it's repeated 4 times for some reason (likely for MSAA?) With this knowledge we now can modify ppir compiler to support multiple store_output intrinsics. To do that just add destination SSA for store_output to the registers list for regalloc and mark them explicitly as output. Since it's never read in shader we have to take care about it in liveness analysis - basically just mark it alive from the time when it's written to the end of the block. If it's live only in the last instruction, mark it as live_internal, so regalloc doesn't clobber it. Then just let regalloc do its job, and then copy register number to the shader state and program it in RSW. The tricky part is gl_FragStencil, since it resides in the same register as gl_FragDepth and with the current design of the compiler it's hard to merge them. However gl_FragStencil doesn't seem to be part of GL2 or GLES2, so we can just leave it not implemented. Also we need to take care of stop bit for instructions - now we can't just set it in every instruction that stores output, since there may be several outputs. So if there's any store_output instructions in the block just mark that block has a stop, and set stop bit in the last instruction in the block. The only exception is discard - we always need to set stop bit in discard instruction. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13830>	2021-11-24 02:26:08 +00:00
Vasily Khoruzhick	98a7c4c6f8	lima/ppir: check if mul node is a source of add node before inserting We can't insert mul node into add node instruction if it's a virtual dep (sequence or write_or_read dep), so use ppir_node_has_single_src_succ in addition to ppir_node_has_single_succ. We can't use ppir_node_has_single_src_succ alone, since node may have a virtual dependency in addition to source dependency, and we can't insert it either in this case. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13830>	2021-11-24 02:26:08 +00:00
Thomas H.P. Andersen	64292c0f05	svga: fix bitwise/logical and mixup The function need_temp_reg_initialization looks suspecious. It will only ever return true if we get past this if: if (!(emit->info.indirect_files && (1u << TGSI_FILE_TEMPORARY)) ... Using the logical && means the intended initialization done based on the result of this check is not performed. This code was both introduced and altered in MR 5317. `ccb4ea5a` introduces the function. `ba37d408` is a collection of performance improvements and misc fixes. This altered the if from using bitwise to logical and. This commit changes it back to bitwise. Spotted from a compile warning. Fixes: `ba37d408da` ("svga: Performance fixes") Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12157>	2021-11-24 01:59:36 +00:00
Thomas H.P. Andersen	6b4294daf0	nine: remove dead code This line gets the cap but does not store it. The line has existed unchanged since the original import in `fdd96578`. Fixes a compile warning Acked-by: Axel Davy davyaxel0@gmail.com Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12155>	2021-11-23 22:38:19 +00:00
Roman Stratiienko	32ec0fffa6	android.mk: Add missing variables to the make target Android build system may use different internal variables to specify cflags/cppflags. Small change in product confguration may force Android to use diffrent set of variables, therefore we should keep all of them attached to the make rule's target. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5549 Fixes: `8621bd8d5e` ("android: Add scripts to build using meson") Signed-off-by: Roman Stratiienko <r.stratiienko@gmail.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Marijn Suijten <marijn.suijten@somainline.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13914>	2021-11-23 21:06:09 +00:00
Michel Zou	0daed2dc6b	lavapipe: fix unused variable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13892>	2021-11-23 19:11:05 +00:00
Michel Zou	d7957df318	vulkan: fix uninitialized variables Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13892>	2021-11-23 19:11:05 +00:00
Danylo Piliaiev	d5757c965a	turnip: implement VK_KHR_buffer_device_address We don't advertise bufferDeviceAddressCaptureReplay capability and neither does blob, because at the moment there is no way to allocate bo with predefined iova. There is no support of any arithmetic with addresses since shaderInt64 is not enabled. However, we could enable int64 support whenever we want. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Danylo Piliaiev	99388f0c27	freedreno/ir3: handle global atomics Only for a6xx since we don't know the instructions for global atomics on previous gens. Per Qualcomm's docs in OpenCL atomics are only supported since a5xx together with Generic memory space. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Danylo Piliaiev	5d5b1fc472	freedreno/ir3: add a6xx global atomics and separate atomic opcodes Separating atomic opcodes makes possible to express a6xx global atomics which take iova in SRC1. They would be needed by VK_KHR_buffer_device_address. The change also makes easier to distiguish atomics in conditions. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8717>	2021-11-23 18:26:37 +00:00
Marius Hillenbrand	c5d6e57e42	llvmpipe: Use lp_build_round_arch on IBM Z (s390x) LLVM has all the required intrinsics available on IBM Z, so use them for rounding operations (they will be implemented as a single instruction). This change makes the test case lp_test_arit pass, because it avoids using the buggy generic code. v2: update .gitlab-ci/cross-xfail-s390x to reflect passing lp_test_arit Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13927>	2021-11-23 17:49:02 +00:00
Marius Hillenbrand	82b261417e	util/cpu_detect: Add flag for IBM Z (s390x) As preparation for changing the behavior of LLVMpipe on IBM Z, add a flag to detect that platform. As it is always known at compile-time, we do not add it to the struct for cpu flags to avoid inflating that struct's size. Signed-off-by: Marius Hillenbrand <mhillen@linux.ibm.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13927>	2021-11-23 17:49:02 +00:00
Ilia Mirkin	be048ec112	freedreno/ir3: remove unused actual_in counting Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13918>	2021-11-23 17:20:32 +00:00
Antonio Caggiano	902c5bf468	virgl: Link shader program Add a new command associated to glLinkProgram. With this we should be able to compile and link shaders when requested by the user, thus avoiding that to happen in the middle of a frame. Together with the command we pass an array of shader handles attached to the program, where each position of the array corresponds to a pipe shader type. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13674>	2021-11-23 16:14:16 +00:00
Antonio Caggiano	0de0440b7c	gallium: add a link shader hook Allow drivers to register a callback for when a shader program is linked. Signed-off-by: Antonio Caggiano <antonio.caggiano@collabora.com> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13674>	2021-11-23 16:14:16 +00:00
Iago Toral Quiroga	79dee14cc2	broadcom/compiler: don't move ldvary earlier if current instruction has ldunif If we did, we would have the instruction coming right after ldvary write to the same implicit destination as ldvary at the same time. We prevent this when merging instructions, but we should make sure we prevent this when we move ldvary around for pipelining too. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13921>	2021-11-23 10:52:24 +00:00
Samuel Pitoiset	aee25471b9	radv: fix emitting VBO when vertex input dynamic state is used In the following scenario: CmdBindPipeline() CmdBindVertexBuffers() CmdSetVertexInput() CmdDraw() CmdBindVertexBuffers() CmdSetVertexInput() CmdDraw() The VBO won't be updated for the second draw because the state is cleared when the dynamic state is emitted and the pipeline isn't dirty. Found by inspection. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13855>	2021-11-23 08:39:13 +00:00
Samuel Pitoiset	d36119716d	radv/winsys: report the real family name instead of OVERRIDDEN When RADV_FORCE_FAMILY is used, this helps pre-compiling shaders to make sure cache entries will match real hardware. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13812>	2021-11-23 08:07:41 +00:00
Samuel Pitoiset	cfc5c2abfd	ac: change family names to uppercase in ac_get_family_name() To print the same device name as real hw. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13812>	2021-11-23 08:07:41 +00:00
Samuel Pitoiset	8e5bb2d6ac	radv: convert remaining enums/structs to 1.2 versions Some were missing. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13882>	2021-11-23 08:27:19 +01:00
Sagar Ghuge	0d0eae07be	intel/compiler: Prepare disasm for 16-bit sampler params v2: - Update descriptor helper (Jason) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	2fa68cb7da	intel/fs: Define and set correct sampler simd mode Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	31e3e32625	intel/compiler: Deprecate ld2dms and use ld2dms_w instead Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	261dd6c8f8	intel/compiler: Add new variant for TXF_CMS_W This allows, for example, fs_inst::components_read() without passing devinfo as extra argument. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	24831bbd40	intel/compiler: Prepare ld2dms_w for 4 mcs components Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	dfe0ba9080	intel/compiler: Demote sampler params to 16-bit for CMS/UMS/MCS Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	0374b56faa	intel/compiler/fs: Add support for 16-bit sampler msg payload For SIMD8 half float payload, each component takes a full register, so we can use existing LOAD_PAYLOAD infrastruture for required padding by alternating plain 8-wide half float vector and null vector. Also this patch removes an unwanted assertion from opt_copy_propagation_local for LOAD_PAYLOAD. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	936412af27	intel/compiler: Add helper to support half float payload with padding To support SIMD8 half float payloads, each component takes one full 32bit wide register in both SIMD8H and SIMD16H mode. So we can make use of existing LOAD_PAYLOAD infrastructure alternating a half float vector and a null vector, in order to handle required padding. v2: (Francisco) - Skip header sources - Fix comparision units - Don't allocate VGRF for padded source Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggested-by: Francisco Jerez <currojerez@riseup.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	75c73fcdc4	intel/compiler: Fix instruction size written calculation We are always aligning to REG_SIZE but when we have payload sources less than REG_SIZE, size written is miscalculated. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	be2bfe5fe8	intel/compiler: Don't hardcode padding source type to 32bit We can use LOAD_PAYLOAD infrastructure in order to handle 16bit float payload. Let's rely on source type for padding sources, if not set previously then default one would be 32-bit. This patch will be used later in the series to handle 16-bit float payloads. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Topi Pohjolainen	0e61d1fbbb	intel/compiler: Handle new sampler descriptor fields for 16bit sampler Update return format field and add SIMD Mode [2] field in sampler descriptor. Now we can tell sampler to return data in either 32/16 bit format precision. v1: - Drop unnecessary descriptor fields (Jason) - Handle return format (Jason) Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Sagar Ghuge	f78e33aa1a	intel/compiler: Set correct return format for brw_SAMPLE on GFX8 onwards, we have only single bit to determine correct return format. v2: - Define macro and use it instead of hardcoded value. (Lionel) Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11766>	2021-11-22 21:27:30 -08:00
Emma Anholt	7603187aec	nir: Un-inline more of nir_builder.h. Cuts another 470KB of libnir.a in my release build. Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13889>	2021-11-22 20:40:47 +00:00
Emma Anholt	d9bfcf5f5b	nir: Un-inline nir_builder_alu_instr_finish_and_insert() This function is big and I don't think it will won't get meaningfully constant-propagated during inlining without LTO. Move it to a .c file so we just have one copy, saving 2.8MB from libnir.a on an amd64 release build. text data bss total filename before: 18953406 7768312 687260 27408978 build-release/driver-symlinks/iris_dri.so 9734366 5542453 481692 15758511 build-release/lib/libvulkan_intel.so 28687772 13310765 1168952 43167489 (TOTALS) after: 15478350 7767864 687260 23933474 build-release/driver-symlinks/iris_dri.so 6810366 5541685 481692 12833743 build-release/lib/libvulkan_intel.so 22288716 13309549 1168952 36767217 (TOTALS) No statistically significant performance difference on iris shader-db, n=8. Reviewed-by: Matt Turner <mattst88@gmail.com> Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13889>	2021-11-22 20:40:47 +00:00
Ilia Mirkin	3b5b4b5d45	nir: apply interpolated input intrinsics setting when lowering clipdist For drivers that use this in fragment shaders, load_input is going to produce incorrect results (flat-shaded values). Fixes clipping tests on a4xx. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13900>	2021-11-22 20:11:19 +00:00
Ilia Mirkin	df934873e1	nir: always keep the clip distance array size updated Drivers expect to know the number of clip distances irrespective of whether compact arrays are used or not. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13900>	2021-11-22 20:11:19 +00:00
Rhys Perry	cc2894345f	aco/spill: use spills_entry instead of spills_exit to kill linear VGPRs If a predecessor has only spilled constants (no temporaries), spills_exit will be empty. fossil-db (Sienna Cichlid): Totals from 2 (0.00% of 128647) affected shaders: Latency: 139106 -> 139104 (-0.00%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5633 Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13821>	2021-11-22 19:46:22 +00:00
Ilia Mirkin	bb6fb6065f	freedreno/a[345]xx: fix unorm/snorm blend factors when they're "over" The float value may be out of range, so must be clamped to the allowed range. Unclear if a3xx also has a SNORM factor that we're just missing there, but that will be a separate investigation. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13903>	2021-11-22 18:09:44 +00:00
Ilia Mirkin	43f94ee9f1	freedreno/a5xx: add missing L8A8_UNORM format to support TBOs Fixes arb_texture_buffer_object-formats test. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13906>	2021-11-22 17:44:59 +00:00
Ilia Mirkin	c87967bf17	freedreno/a4xx: add some missing legacy formats to help TBOs Unlike with regular textures, we really have to support all the formats directly for TBOs to work properly. Add the missing formats to fix arb_texture_buffer_object-formats piglit. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13906>	2021-11-22 17:44:59 +00:00
Ilia Mirkin	5a69f34aeb	freedreno/a4xx: add missing SNORM formats to help tests pass Otherwise some of these fall back to RGBA_SNORM, which can screw up blend factors. Fixes spec@ext_texture_snorm@fbo-blending-formats. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13904>	2021-11-22 17:18:56 +00:00
Alyssa Rosenzweig	c6ca2d1929	panfrost: Handle AFBC_FEATURES in drm-shim Fixes the warning with drm-shim: Unknown DRM_IOCTL_PANFROST_GET_PARAM 40 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13894>	2021-11-22 13:12:20 +00:00
Alyssa Rosenzweig	a777e38cf9	panfrost: Collapse 0 parameters in drm-shim Makes the code a bit more readable, since this is a sensible default for many parameters. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13894>	2021-11-22 13:12:20 +00:00
Iago Toral Quiroga	7fec4f4135	broadcom/compiler: fix scoreboard locking checks According to the spec the hardware locks the scoreboard on the first or last thread switch (selected via shader state) and any TLB accesses executed before this are not synchronized by hardware. This change updates the logic to ensure we respect this requirement and that we don't assume that the lock is acquired automatically on the first TLB access, which is not valid at least since V3D 4.1+. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13910>	2021-11-22 12:53:43 +00:00
Iago Toral Quiroga	bd7584c16b	broadcom/compiler: don't allow RF writes from signals after thrend Writes to physical registers are not allowed after thread end. We were checking this for ALU writes, but we need to check it for signal writes too. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13910>	2021-11-22 12:53:43 +00:00
Danylo Piliaiev	ed16eedb2d	ir3: print half-dst/src for ldib.b/stib.b So it would print: ldib.b.untyped.1d.u16.1.imm.base0 hr0.z, r0.x, 0 instead of: ldib.b.untyped.1d.u16.1.imm.base0 r0.z, r0.x, 0 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13876>	2021-11-22 12:32:15 +00:00
Lionel Landwerlin	5a2cff9bc8	intel: move timestamp scaling helper to intel/perf Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	6126742648	intel/ds: remove verbose messages At high frequency sampling, this generates a lot of messages. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	bd104d5b1a	intel/pps: tweak intel config some more Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	3d71e35857	intel/ds: isolate intel/perf from the pps-producer Otherwise we need to include intel headers in generic code. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	ed9116e545	intel/ds: drop unused constructors Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	215dbfd131	intel/perf: track end timestamp of queries Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	4ef6698a26	intel/ds: drop timestamp correlation code Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	21a1c6995c	pps: fixup sporadic missing counters The issue seems to be that without proper timestamps & clock_id, the recording might discard some packets if they go backward in time. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	120f24cb36	intel/perf: add a helper to read timestamp from reports On newer HW it will require more work than just reading a dword. It could also vary depending on the report format. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Lionel Landwerlin	8657fa6b86	pps: allow drivers to report timestamps in their own time domain For this each driver must : - report its clock_id (if no particular clock just default to cpu boottime one) - be able to sample its clock (gpu_timestamp()) The PPSDataSource will then emit timestamp correlation events in the trace ensuring perfetto is able to display GPU & CPU events appropriately on its timeline. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13831>	2021-11-22 11:52:46 +00:00
Juan A. Suarez Romero	457dbb81f5	broadcom/compiler: apply constant folding on early GS lowering This solves a case where a NIR geometry shader was storing the output in a non-constant: vec4 32 ssa_1 = load_const (0xc0800000 /* -4.000000 /, 0xc1100000 / -9.000000 /, 0x40400000 / 3.000000 /, 0x40e00000 / 7.000000 /) vec1 32 ssa_7 = load_const (0x00000000 / 0.000000 /) vec1 32 ssa_8 = load_const (0x00000001 / 0.000000 /) vec1 32 ssa_9 = iadd ssa_7, ssa_8 vec1 32 ssa_19 = mov ssa_1.x intrinsic store_output (ssa_19, ssa_9) (1, 1, 0, 160, 288) / base=1 / / wrmask=x / / component=0 / / src_type=float32 / / location=32 slots=2 gs_streams(x=0 y=0 z=0 w=0) / When lowering the VPM output we check if the destination (ssa_9 in this case) is a constant to add to the VPM offset. We run a constant folding optimization in an earlier VS lowering, and we should do the same for GS. This fixes multiple dEQP-VK.pipeline.interface_matching. failures. Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13884>	2021-11-22 09:32:50 +00:00
Juan A. Suarez Romero	7b21635057	broadcom/compiler: handle array of structs in GS/FS inputs While fragment and geometry shader were handling structs as inputs, they weren't doing for it arrays of structures. This fixes multiple dEQP-VK.pipeline.interface_matching.* failures and assertions. v2: - Fix style (Iago). Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13884>	2021-11-22 09:32:50 +00:00
Lionel Landwerlin	c5a42e4010	intel/fs: fix shader call lowering pass Now that we removed the intel intrinsic and just use the generic one, we can skip it in the intel call lowering pass and just deal with it in the intel rt intrinsic lowering. v2: rewrite with nir_shader_instructions_pass() (Jason) v3: handle everything in switch (Jason) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `423c47de99` ("nir: drop the btd_resume_intel intrinsic") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12113>	2021-11-22 08:17:26 +00:00
Jesse Natalie	724a38eb94	CI/windows: Upload result.txt as an artifact Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13893>	2021-11-20 20:30:59 +00:00
Jesse Natalie	1e3db7923f	CI/windows: Uprev piglit Acked-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13893>	2021-11-20 20:30:59 +00:00
Alex Xu (Hello71)	60d95c5d0f	Auto-enable TLSDESC support TLSDESC speeds up access to dynamic TLS. This is especially important for non-glibc targets, but is also helpful for non-initial-exec TLS variables. The entry asm does not support TLSDESC, but it only accesses initial-exec symbols, so it is not necessary to handle that separately. Acked-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12722>	2021-11-20 11:57:40 -05:00
Alex Xu (Hello71)	8570a2a280	Use initial-exec TLS for glibc only, enable TLS elsewhere It is not portable to use initial-exec TLS in dlopened libraries. glibc and FreeBSD allocate extra memory for extra initial-exec variables specifically for libGL, but other libcs including musl do not. Keep initial-exec disabled on FreeBSD since it is apparently broken for some reason: https://gitlab.freedesktop.org/mesa/mesa/-/issues/966#note_394512 `81dbdb15d5` Enable TLS on OpenBSD and Haiku based on the u_thread.h comment that emutls is better than pthread_getspecific, which seems plausible given that emutls has strictly more information to work with. Fixes #966. Acked-by: Tapani Pälli <tapani.palli@intel.com> Acked-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12722>	2021-11-20 11:56:34 -05:00
Ilia Mirkin	df005c2a65	mesa: move around current texture object fetching We have to validate the target before fetching the current texture object. Move this so that it happens later. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13767>	2021-11-19 20:45:35 -05:00
Ilia Mirkin	d814539c2b	mesa: check target/format for Tex(ture)StorageMem* Noticed while doing an audit around _mesa_get_current_tex usage. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Acked-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13767>	2021-11-19 20:45:13 -05:00
Mauro Rossi	f659d00000	android: define cpp_rtti=false because libLLVM is built w/o RTTI libLLVM for Android is built without RTTI, but after commit `ad86267` mesa inherits meson default RTTI enabled state cpp_rtti=false is added to meson options in android/mesa3d_cross.mk Signed-off-by: Mauro Rossi <issor.oruam@gmail.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13888>	2021-11-20 02:25:12 +01:00
Marek Olšák	cdeecadcb6	radeonsi: deduplicate min_esverts code in gfx10_ngg_calculate_subgroup_info Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Marek Olšák	9d7ac70ffb	radeonsi: implement shader culling in GS It already does compaction, so we just need to load vertex positions and cull. This was easier than expected. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Marek Olšák	492a61fe72	radeonsi: don't use ctx.stage outside of si_llvm_translate_nir si_llvm_translate_nir() changes ctx.stage, so the outside code shouldn't use it. This hasn't caused any issues yet. Since ctx.stage starts as 0, the first use in this commit was a tautology. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Marek Olšák	1c5899900d	radeonsi: simplify si_get_vs_key_outputs for GS ngg_culling is always 0 when GS is enabled. This will change in the future. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Marek Olšák	a368385b23	radeonsi: add is_gs parameter into si_vs_needs_prolog and disable the VS prolog code for GS. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Marek Olšák	f96d1757bb	radeonsi: restructure code that declares merged VS-GS and TES-GS SGPRs no change in the SGPR layout Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Marek Olšák	2418da2d4a	radeonsi: separate culling code from VS/TES (to be reused by GS) Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13829>	2021-11-20 00:03:45 +00:00
Nicholas Bishop	37c3e16d35	mesa/get: allow NV_pixel_buffer_object constants in GLES2 The NV_pixel_buffer_object extension can be available in a GLES2 context, so the PIXEL_PACK_BUFFER_BINDING/PIXEL_UNPACK_BUFFER_BINDING constants should also be available. Tested on 8086:2e12, "Mesa DRI Intel(R) Q45/Q43 (ELK)". Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5655 Signed-off-by: Nicholas Bishop <nicholasbishop@google.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13862>	2021-11-19 23:21:52 +00:00
Jesse Natalie	b8f41c5c4e	d3d12: Validate opened D3D12 resource matches pipe template Unlike Linux dma-bufs, D3D12 resources are strongly typed, and can't necessarily just reinterpret the memory arbitrarily. Allow importing resources with no description coming from the frontend, and populate the resource desc from the driver instead. If there was a template, make sure that it matches the incoming resource. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	9740141b2e	d3d12: Generate a pipe format -> typeless mapping table too Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	ca7d4fcb3f	d3d12: Generate format table using a macro list Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	25bcc56027	d3d12: Make format list all use macros Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	96012b686e	d3d12: Handle import/export of fd shared handles Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	31c7a04b47	winsys/d3d12: Populate winsys handle format All other winsys handle users do so, and a future commit will start caring about it. Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	2771fd4a3f	gallium, windows: Use HANDLE instead of FD for external objects Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	5bfbf4bec9	microsoft/compiler: Handle GLES external textures Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	2188607014	d3d12: Support RGBX formats mapped to RGBA Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	ab9948997a	d3d12: Support PIPE_CAP_MIXED_COLOR_DEPTH_BITS Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	e0576ec148	d3d12: Support BGRA 555 and 565 formats Reviewed-by: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13054>	2021-11-19 22:54:46 +00:00
Jesse Natalie	d0bc4974fa	android: Allow forcing softpipe When dealing with swrast, there's two possibilities: If you have LLVM, you get llvmpipe, which is pretty fast. If you don't, you get softpipe, which is slow, but does have a couple nice qualities, like being smaller and not needing executable memory for JIT. If you're building a driver that requires LLVM like radeonsi then you need the LLVM stub for the build to find LLVM. But for swrast, since it can mean either softpipe/llvmpipe, you don't strictly need LLVM. So this just makes the Android build files flexible like the Meson build files (where you can specify -Dllvm=disabled even if LLVM is findable). Reviewed-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13532>	2021-11-19 21:21:35 +00:00
Jesse Natalie	33e5a4378e	android,d3d12: Support using DirectX-Headers dependency from AOSP Note that the Android build system apparently lowercases stuff, so add a lowercase "directx-headers" dependency which is searched first, before falling back to the proper-cased "DirectX-Headers" dependency. Reviewed-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13532>	2021-11-19 21:21:35 +00:00
Jesse Natalie	6138b047e2	mesa/main, android: Log errors to logcat Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13532>	2021-11-19 21:21:35 +00:00
Jesse Natalie	9e82a56745	android: Add a BOARD CFlags option so build can be customized Reviewed-by: Roman Stratiienko <r.stratiienko@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13532>	2021-11-19 21:21:35 +00:00
Mike Blumenkrantz	81cc94b8f0	zink: be consistent about waiting on context queue on context destroy Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13885>	2021-11-19 18:56:10 +00:00
Mike Blumenkrantz	e92b8956c7	zink: set batch state queue on creation make this easier to find Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13885>	2021-11-19 18:56:10 +00:00
Emma Anholt	b8ffd7a888	freedreno/a5xx: Emit MSAA state for sysmem rendering, too. This looked obviously wrong, we want to set the sample counts for sysmem too just like we do on 6xx. Turns out it fixes some piglits. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13867>	2021-11-19 17:24:11 +00:00
Emma Anholt	5071d39cb2	freedreno/a5xx: Document the sRGB bit on RB_2D_SRC/DST info. Noticed while looking through my set of traces for where the average bit might be. Same spot as on a6xx. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13867>	2021-11-19 17:24:11 +00:00
Emma Anholt	1ef6465665	freedreno/a5xx: Define a5xx_2d_surf_info like a6xx has. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13867>	2021-11-19 17:24:11 +00:00
Emma Anholt	cad0b6e2e5	freedreno/a6xx: Disable sample averaging on non-ubwc z24s8 MSAA blits. The fallback path we averages unorm textures, but if we don't have ubwc on either then we can just cast them to uint which then just takes sample 0. The proper UBWC format I think ends up averaging, though. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13867>	2021-11-19 17:24:11 +00:00
Emma Anholt	93eb697a8d	freedreno/a6xx: Disable sample averaging on z/s or integer blits. We can't generally force fd_blitter_blit() to not average in our fallback blits, but this should at help some cases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13867>	2021-11-19 17:24:11 +00:00
Connor Abbott	c98adc56f4	ir3/lower_pcopy: Fix bug with "illegal" copies and swaps If the source and destination were within the same full register, like hr90.x and hr90.y (which both map to r45.x), then we'd perform the swap/copy with the wrong register. This broke dEQP-VK.ssbo.phys.layout.random.16bit.scalar.35 once BDA is enabled. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13818>	2021-11-19 16:59:54 +00:00
Connor Abbott	65da866ad9	ir3/lower_pcopy: Fix shr.b illegal copy lowering The immediate shouldn't be half-reg because the other source isn't. Fixes an assertion failure with dEQP-VK.ssbo.phys.layout.random.16bit.scalar.35. Fixes: `0ffcb19b9d` ("ir3: Rewrite register allocation") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13818>	2021-11-19 16:59:54 +00:00
Connor Abbott	9912c61362	ir3/spill: Support larger spill slot offset This is required by dEQP-VK.ssbo.phys.layout.random.all_shared_buffer.47, where we need to spill a lot of pointers due to NIR CSE being a little too aggressive and creating a large register pressure across basic blocks, too large to fit within the boundaries of ldp/stp offsets. Note that this will be a lot more difficult with support for "real functions" because the base register will become unknown at compile time. However this hack gets things working for the time being. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13818>	2021-11-19 16:59:54 +00:00
Connor Abbott	29d3889bbb	ir3/ra: Add missing asserts to ra_push_interval() This would've caught the previous issue earlier. We checked that the physreg made sense when inserting via ra_file_insert() but not ra_push_interval() which is used for live-range splitting. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13818>	2021-11-19 16:59:54 +00:00
Connor Abbott	9d88b98b08	ir3/ra: Consider reg file size when swapping killed sources Don't swap a 2-component vector of half-regs with a full reg if that would result in the half regs going outside of the allowable half-reg space. Fixes: `d4b5d2a020` ("ir3/ra: Use killed sources in register eviction") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13818>	2021-11-19 16:59:54 +00:00
Jesse Natalie	f9a46ad22a	meson: Allow mismatching RTTI for MSVC This might be safe to relax to all Windows compilers, but I didn't test Clang or MinGW, so scoping to MSVC for now. For MSVC, this is safe to mismatch, because the vftables are emitted into all objects with "pick largest," and the definition with RTTI is larger than the one without. This is different than the Itanium ABI, which only emits one copy of the typeinfo in the object which defines the key method. Acked-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13064>	2021-11-19 15:36:59 +00:00
Jesse Natalie	ad86267412	meson: Don't override built-in cpp_rtti option, error if it's invalid Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13064>	2021-11-19 15:36:59 +00:00
Lionel Landwerlin	21ec880bf9	anv: initialize anv_bo_sync base fields v2: zalloc Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `cbb13fae33` ("anv: Add a BO sync type") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13875>	2021-11-19 15:09:43 +00:00
Lionel Landwerlin	04bd5bb69b	anv: don't try to close fd = -1 CID: 1464334 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13879>	2021-11-19 14:52:29 +00:00
Samuel Pitoiset	ddbc84d5a0	radv: ignore the descriptor set layout when creating descriptor template From the Vulkan spec: "This parameter is ignored if templateType is not VK_DESCRIPTOR_UPDATE_TEMPLATE_TYPE_DESCRIPTOR_SET." This fixes an assertion about the base object type when running Yuzu with Vulkan validation layers enabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13846>	2021-11-19 13:52:36 +00:00
Samuel Pitoiset	2436cafffe	radv: allow TC-compat CMASK with storage images on GFX10+ Hardware seems to support it. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12173>	2021-11-19 13:30:40 +00:00
Mike Blumenkrantz	22d9d0f8b5	zink: add a compiler pass to scan for shader image use other frontends and internal shaders won't set this Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13864>	2021-11-19 13:14:46 +00:00
Mike Blumenkrantz	e386a57769	zink: explicitly init glsl need this to be able to use other frontends Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13864>	2021-11-19 13:14:46 +00:00
Alejandro Piñeiro	ff89dc3523	vulkan: move common format helpers to vk_format v3dv, radv, and turnip are using several C&P format helpers (most of them wrappers over util_format_description based helpers). methods. This commit moves the common helpers to the already existing common vk_format.h. For the case of v3dv we were able to remove the vk_format header. For turnip and radv, a local vk_format.h header remains, with methods that are only used for those drivers. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13858>	2021-11-19 12:23:19 +01:00
Samuel Pitoiset	04c90f292e	util/queue: fix a data race detected by TSAN when finishing the queue Thread sanitizer complains if it detects that the pthread_barrier is destroyed when a thread might still blocked on the barrier. Fix this by destroying the barrier only if pthread_barrier_wait returns PTHREAD_BARRIER_SERIAL_THREAD which is the value for success. In practice this shouldn't fix anything serious given that this code is only called when the disk cache is destroyed. Original patch from Timothy Arceri. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4342 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13861>	2021-11-19 09:02:23 +01:00
Qiang Yu	cee1dd92bd	glx/dri3: fix glXQueryContext does not return GLX_RENDER_TYPE value Cc: mesa-stable Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13772>	2021-11-19 01:37:27 +00:00
Emma Anholt	e277b13182	freedreno: Stop exposing MSAA image load/store on desktop GL. GLES doesn't support it, and blob VK doesn't support it. We could theoretically lower it, but don't bother since it's not required. Fixes various piglit image load/store tests. Suggested-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13852>	2021-11-18 23:47:58 +00:00
Alyssa Rosenzweig	81d22da6de	asahi: Fix BIND_PIPELINE sizing and alignment Fix a bug in BIND_PIPELINE XML reported by Dougall, which cleans up a bit of both decoder and driver. Instead of... * 17 bytes BIND_PIPELINE (17) * An unused 8 byte record (25) * A set of N 8 byte records (25 + 8 * N) * Oops, 1 byte too many! One just disappeared (24 + 8 * N) It seems to instead be * 24 bytes BIND_PIPELINE (24) * A set of N 8 byte records (24 + 8 * N) without the sentinel record. These means the 8 byte records themselves are shuffled, with the high byte of the pointers split from the low word, but that's less gross than an off-by-one. It's still not clear what the last 8 bytes of the BIND_VERTEX_PIPELINE structure mean, or the last 4 byte of the BIND_FRAGMENT_PIPELINE structure which seems to be a bit shorter. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	3b108393a2	asahi: Remove obnoxious workaround Now that we're not hardcoded any magic BO IDs, there is no minimum number of allocations needed. Remove the unneeded -- and obnoxious -- workaround of allocating unused BOs on startup. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	a28775046c	asahi: Remove silly magic numbers These are unnecessary now that the structure of agx_map_* is better understood. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	d55a1a77bd	asahi: Fix agx_map_* structures Dougall Johnson observed these structures make more sense with indices[] first in the entries and indices[] absent from the header. Then the sentinel entry disappears, nr_entries makes more sense, and a few magic numbers pop out. Many thanks to Dougall's astute eyes. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	6637fbb211	asahi: Allocate special scratch buffers Seem to be used for preemption. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	30433ae716	asahi: Deflake addresses Reported by Dougall. Signed-off-by: Alyssa Rosenzweig <alyssa@rosenzweig.io> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	df1824046a	asahi: Rename PANDECODE->AGXDECODE Fix remnant of the Panfrost decoder fork. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13784>	2021-11-18 23:35:25 +00:00
Alyssa Rosenzweig	e346ca5b41	pan/bi: Add XML for LD_BUFFER Encoded like LOAD. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	69ddbc4341	pan/bi: Suppress uniform validation for LD_BUFFER Seems to be ok and used by the DDK... Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	36486f54e9	pan/bi: Confirm IDP unit on Valhall Based on Anandtech which gives 8-bit dot product throughput on Valhall under FMA and not consistent with SFU. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	b8ba909ca6	pan/bi: Forbid unaligned staging registers on Valhall Would've saved me some debugging with the computerator. I keep forgetting about this nuance. Enforce it in the assembler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	df807cb839	pan/bi: Add XML for assembling Valhall image stores Not complete yet but let's get some tests in early. Document the new instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	58b65a340c	pan/bi: Add Valhall's special FMA_RSCALE instructions Like Bifrost, but exposed as separate physical instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	aee819d54c	pan/bi: Add sqrt form of Valhall FREXPM Like Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	137053c4f4	pan/bi: Add full form of Valhall MUX instruction Like Bifrost. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Alyssa Rosenzweig	855ab23d9a	pan/bi: Annotate Valhall instructions with units Based on analyzing the cycle counts reported by the Mali offline compiler. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13802>	2021-11-18 23:16:20 +00:00
Mike Blumenkrantz	04cc1b93b1	zink: enable PIPE_TEXTURE_TRANSFER_COMPUTE on non-cpu drivers Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13859>	2021-11-18 22:12:58 +00:00
Mike Blumenkrantz	ea761a40d5	zink: use pb_slab_alloc_reclaimed(reclaim_all) for BAR heap sometimes this forces a full slab reclaim any time the device is known to have a too-small BAR in order to keep memory usage at a minimum when it might otherwise balloon out and crash us Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13850>	2021-11-18 21:22:30 +00:00
Mike Blumenkrantz	da9acf7088	aux/pb: add a new slab alloc function for reclaiming all bo objects sometimes a driver might want to always reclaim all bo objects in the course of allocating a new bo. this is useful when it's known that a given memory heap is very small and will likely need to keep its usage minimized Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13850>	2021-11-18 21:22:30 +00:00
Roland Scheidegger	b7e2214b3c	llvmpipe: adjust rounding for viewport scissoring Some apps may try to use a viewport adjusted by 0.5 pixels (among other things) to emulate d3d9 pixel center, and in this case we would end up with incorrect "fake scissor" box (shifted by 1 pixel), hence pixels being incorrectly scissored away when permit_linear_rasterizer is set (this happens even if the linear rasterizer is not used in the end). So adjust the offset so that the half-way points get rounded down instead of up. (This is all a bit iffy I think since we don't use fractional boxes (with 8 subpixel bits) anywhere yet, but at least without msaa it should work out.) Reviewed-by: Brian Paul <brianp@vmware.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13794>	2021-11-18 19:23:13 +00:00
Eric Engestrom	f6dd931640	docs: add 22.0 branchpoint date for perspective Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13848>	2021-11-18 19:00:02 +00:00
Eric Engestrom	f75e657e69	docs: add 21.3.x release schedule Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13848>	2021-11-18 19:00:02 +00:00
Eric Engestrom	9aea588900	docs: update calendar and link releases notes for 21.3.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13848>	2021-11-18 19:00:02 +00:00
Eric Engestrom	1061f3ad0d	docs: add release notes for 21.3.0 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13848>	2021-11-18 19:00:02 +00:00
Samuel Pitoiset	341278f069	radv: disable HTILE for D32S8 format and mipmaps on GFX10 Stencil texturing with HTILE doesn't work with mipmapping on Navi10-14, it's a hw bug. RadeonSI and PAL have a workaround too. This fixes 35 piglit failures with Zink on Navi10. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13814>	2021-11-18 14:45:54 +00:00
Tomeu Vizoso	d63cd245e1	ci: Uprev Crosvm And use my fork while we upstream some improvements to Crosvm that make it more appropriate for using in CI. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12828>	2021-11-18 13:36:24 +00:00
Tomeu Vizoso	81f25d8f27	virgl/ci: Run each dEQP instance in its own VM Currently we run deqp-runner inside a single VM, which makes very poor use of the available CPUs because Virgl has a bottleneck in the VMM that serializes everything. With this change, we can run several Crosvm instances in a runner and make full use of the CPUs. Getting the same coverage with 3 runners instead of 6. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12828>	2021-11-18 13:36:24 +00:00
Tomeu Vizoso	4bfcbe3f69	ci: Remove syslogd Crosvm doesn't need it any more. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12828>	2021-11-18 13:36:24 +00:00
Tomeu Vizoso	d542e978e9	virgl/ci: Set GALLIVM_PERF=nopt,no_quad_lod nopt will disable some shader optimizations that slow down test runs for no gain. no_quad_lod will disable some speed hacks that can cause inaccurate results. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12828>	2021-11-18 13:36:24 +00:00
Tomeu Vizoso	8a3cccc3f4	ci: Don't set GALLIVM_PERF in the scripts Instead, let gitlab-ci.yml files define it. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12828>	2021-11-18 13:36:24 +00:00
Tomeu Vizoso	8acdf1f71c	ci: Create symlink to /install early So we can use well-known absolute paths in configuration files. Otherwise, the install dir is within $CI_PROJECT_DIR, which changes between jobs. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Corentin Noël <corentin.noel@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12828>	2021-11-18 13:36:24 +00:00
Mike Blumenkrantz	e7b9561959	gallium: implement compute pbo download this reworks PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER into an enum as PIPE_CAP_TEXTURE_TRANSFER_MODES, enabling drivers to choose a (sometimes) faster, compute-based download mechanism based on a new pipe_screen hook compute pbo download is implemented using shaders with a prolog to convert the input format to generic rgb float values, then an epilog to convert to the output value. the prolog and epilog are determined based on a vec4 of packed ubo data which is dynamically updated based on the API usage currently, the only known limitations are: * GL_ARB_texture_cube_map_array is broken somehow (and disabled) * AMD hardware somehow can't do depth readback? otherwise it should work for every possible case Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11984>	2021-11-18 08:00:07 -05:00
Mike Blumenkrantz	ed65b5e839	mesa/st: make some pbo functions public Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11984>	2021-11-18 08:00:05 -05:00
Mike Blumenkrantz	f7a51e9469	mesa/st: make sampler_type_for_target public Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11984>	2021-11-18 07:58:29 -05:00
Mike Blumenkrantz	c9a47c85da	gallium: rename PIPE_CAP_PREFER_BLIT_BASED_TEXTURE_TRANSFER this is now a bitfield enum for more functionality Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11984>	2021-11-18 07:58:29 -05:00
Mike Blumenkrantz	0b30a7a243	gallium: add pipe_screen::is_compute_copy_faster hook this can be used to query whether a driver expects a given texture copy to be faster as a compute shader or using cpu/gfx transfers Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11984>	2021-11-18 07:58:29 -05:00
Dylan Baker	a854cbc7b5	turnip: don't use mesa/macros.h to get utils/rounding.h For hopefully obvious reasons. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13853>	2021-11-18 10:46:51 +00:00
Pierre-Eric Pelloux-Prayer	df8aeb4598	radeonsi/sqtt: increase the default buffer size to 32MB Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13838>	2021-11-18 10:53:37 +01:00
Pierre-Eric Pelloux-Prayer	56382ec071	radeonsi: unreference framebuffer state after use util_copy_framebuffer_state increases refcounts, so we have to decrement them afterwards. Fixes: `b1b491cdbb` ("radeonsi: add a faster clear path for glClearTexImage") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5631 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13838>	2021-11-18 10:53:34 +01:00
Iago Toral Quiroga	5e536c97a9	broadcom/compiler: fix early fragment tests setup When early fragment tests are mandated by the shader, we must use the Z value produced by the FEP even if there are elements that would typically require late fragment tests (such as discards, sample to coverage, etc). This change means we also need to be a bit more careful when we promote shaders to use early fragment tests so we don't promote anything with discards for example. Fixes: dEQP-VK.fragment_operations.early_fragment.discard_early_fragment_tests_depth dEQP-VK.fragment_operations.early_fragment.discard_early_fragment_tests_stencil Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13837>	2021-11-18 07:39:32 +00:00
Joshua Ashton	68ec867181	radv: Implement VK_EXT_image_view_min_lod Signed-off-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13820>	2021-11-18 01:05:06 +00:00
Joshua Ashton	d0f217ad80	vulkan: Update the XML and headers to 1.2.199 Signed-off-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13820>	2021-11-18 01:05:06 +00:00
Joshua Ashton	4e76d0cb56	radv: Expose min_lod in *_make_texture_descriptor We'll need this going forward for VK_EXT_image_view_min_lod. Signed-off-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13820>	2021-11-18 01:05:06 +00:00
Joshua Ashton	c6471ef918	radv: Refactor S_FIXED to radv_float_to_{s,u}fixed We'll need to use this in radv_image for VK_EXT_image_view_min_lod. Additionally, creates signed/unsigned variants to avoid sign-extending where we don't need to. Signed-off-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13820>	2021-11-18 01:05:06 +00:00
Mike Blumenkrantz	35ffadb9e7	zink: clamp to 500 max batch states on nvidia I've been advised that leaving this unclamped will use up all the fds allotted to a process Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13844>	2021-11-18 00:00:16 +00:00
Mike Blumenkrantz	a3be30665f	zink: fail context creation more gracefully handle some cases where context creation fails earlier than expected cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13844>	2021-11-18 00:00:16 +00:00
Mike Blumenkrantz	72a88c77de	zink: fix memory availability reporting this shouldn't report the budgeted available memory, it should return the total memory, as that's what this api expects Fixes: `ff4ba3d4a7` ("zink: support PIPE_CAP_QUERY_MEMORY_INFO") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13849>	2021-11-17 22:59:43 +00:00
Mike Blumenkrantz	5f140a723d	zink: use IMMUTABLE for dummy xfb buffer this is never getting read back or anything so don't waste BAR allocation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13849>	2021-11-17 22:59:43 +00:00
Mike Blumenkrantz	1eb2f0d41e	zink: demote BAR allocations to device-local on oom ideally this shouldn't happen, but it's better than crashing even if it may crash later from attempting to map Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13849>	2021-11-17 22:59:43 +00:00
Mike Blumenkrantz	8f97af050e	zink: set zink_resource_object::host_visible based on actual bo placement the properties determined before allocation may not be the same as what gets allocated Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13849>	2021-11-17 22:59:43 +00:00
Mike Blumenkrantz	74d2e89201	zink: always use slab allocation placement for domains this allows the actual bo to have its memory type changed if necessary Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13849>	2021-11-17 22:59:43 +00:00
Mike Blumenkrantz	4fc216b4ba	zink: add error for bo allocation failure Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13849>	2021-11-17 22:59:43 +00:00
Marek Olšák	83278b5661	glx: add a workaround to glXDestroyWindow for Viewperf2020/Sw This fixes: X Error of failed request: GLXBadWindow Major opcode of failed request: 152 (GLX) Minor opcode of failed request: 32 (X_GLXDestroyWindow) Serial number of failed request: 9667 Current serial number in output stream: 9674 Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13611>	2021-11-17 21:26:54 +00:00
Iván Briano	0388783a03	intel/nir: also allow unknown format for getting the size of a storage image Fixes: `fa251cf111` ("intel/nir: allow unknown format in lowering of storage images") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13847>	2021-11-17 21:06:23 +00:00
Ian Romanick	04f5c543de	glsl/nir: Don't build soft float64 when it cannot be used Fixes: `82d9a37a59` ("glsl/nir: Add a shared helper for building float64 shaders") Closes: #5556 Reviewed-by: Matt Turner <mattst88@gmail.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13828>	2021-11-17 12:23:29 -08:00
Mike Blumenkrantz	b1a32d1432	zink: implement multiplanar modifier handling it turns out this is trivial as long as dri gives usable resource templates Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13799>	2021-11-17 19:22:02 +00:00
Mike Blumenkrantz	b1675608c3	dri2: set dimensions on dmabuf import planes this is unusable for some drivers without the plane size attached Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13799>	2021-11-17 19:22:02 +00:00
Mike Blumenkrantz	943f6a038d	zink: always set matching resource export type for dmabuf creation both of these need to be set if one is cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13799>	2021-11-17 19:22:02 +00:00
Mike Blumenkrantz	11c79a8bd7	zink: stop using VK_IMAGE_LAYOUT_PREINITIALIZED for dmabuf this is illegal cc: mesa-stable Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13799>	2021-11-17 19:22:02 +00:00
Jason Ekstrand	63baeffc2d	vulkan/sync: Rework asserts a bit ANV currently smashes off the TIMELINE bit depending on whether or not the i915 interface supports them, triggering assert(!type->get_value). Instead of requiring ANV to smash off function pointers, let the extra function pointers through and then assert on the feature bits before the function pointers get used. This should give us roughly the same amount of assert protection while side-stepping the feature disabling problem. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13839>	2021-11-17 16:52:29 +00:00
Filip Gawin	0c74f80645	glsl: fix trivial strict aliasing warning Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13827>	2021-11-17 16:22:10 +00:00
Omar Akkila	58a0d8d0de	llvmpipe: page-align memory allocations Allows memory allocated by llvmpipe_allocate_memory_fd to be mappable to guests in virtualized environments like KVM which requires page-aligned memory. llvmpipe_allocate_memory is updated similarly for consistency. Signed-off-by: Omar Akkila <omar.akkila@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13793>	2021-11-17 09:25:37 -05:00
Connor Abbott	23a5f1a5ac	ir3: Stop inserting nops during scheduling Not necessary since nothing uses it anymore. This might have a slight effect on spilling with multiple blocks, but no shader-db difference because nothing spills. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	e0eeba6cbb	ir3/postsched: Only prefer tex/sfu if they are soft-ready Otherwise we schedule an SFU depending on a tex as soon as the tex is scheduled, which is very much not what we want. Note that sstall is helped more than nops are hurt, and the shaders with the largest nop regressions also have sstall helped. However (sy) is also very much helped. total nops in shared programs: 345482 -> 345986 (0.15%) nops in affected programs: 5731 -> 6235 (8.79%) helped: 15 HURT: 81 helped stats (abs) min: 1 max: 9 x̄: 3.27 x̃: 3 helped stats (rel) min: 0.50% max: 16.00% x̄: 8.55% x̃: 10.26% HURT stats (abs) min: 1 max: 72 x̄: 6.83 x̃: 4 HURT stats (rel) min: 0.57% max: 400.00% x̄: 32.50% x̃: 13.16% 95% mean confidence interval for nops value: 3.34 7.16 95% mean confidence interval for nops %-change: 13.07% 39.10% Nops are HURT. total sstall in shared programs: 133804 -> 132381 (-1.06%) sstall in affected programs: 4743 -> 3320 (-30.00%) helped: 68 HURT: 24 helped stats (abs) min: 1 max: 153 x̄: 21.88 x̃: 8 helped stats (rel) min: 1.79% max: 100.00% x̄: 33.20% x̃: 28.00% HURT stats (abs) min: 1 max: 11 x̄: 2.71 x̃: 2 HURT stats (rel) min: 1.02% max: 200.00% x̄: 17.73% x̃: 5.59% 95% mean confidence interval for sstall value: -22.05 -8.89 95% mean confidence interval for sstall %-change: -27.60% -12.22% Sstall are helped. total (ss) in shared programs: 35471 -> 35481 (0.03%) (ss) in affected programs: 462 -> 472 (2.16%) helped: 9 HURT: 15 helped stats (abs) min: 1 max: 2 x̄: 1.11 x̃: 1 helped stats (rel) min: 4.17% max: 33.33% x̄: 14.00% x̃: 7.69% HURT stats (abs) min: 1 max: 3 x̄: 1.33 x̃: 1 HURT stats (rel) min: 1.19% max: 50.00% x̄: 12.27% x̃: 8.33% 95% mean confidence interval for (ss) value: -0.14 0.97 95% mean confidence interval for (ss) %-change: -5.11% 9.94% Inconclusive result (value mean confidence interval includes 0). total (sy) in shared programs: 13522 -> 13288 (-1.73%) (sy) in affected programs: 422 -> 188 (-55.45%) helped: 22 HURT: 1 helped stats (abs) min: 1 max: 21 x̄: 10.68 x̃: 10 helped stats (rel) min: 8.00% max: 94.44% x̄: 56.58% x̃: 56.94% HURT stats (abs) min: 1 max: 1 x̄: 1.00 x̃: 1 HURT stats (rel) min: 25.00% max: 25.00% x̄: 25.00% x̃: 25.00% 95% mean confidence interval for (sy) value: -13.18 -7.17 95% mean confidence interval for (sy) %-change: -65.48% -40.59% (sy) are helped. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	6f5c0d209c	ir3/postsched: Rewrite delay handling Analogous to the pre-RA scheduler. Unfortunately this time it's a bit more involved because we have to correctly handle (rptN), which is already relevant for swz. This means we need the index of the destination register that conflicts with the source register, to handle swz, and we need to expose that part of ir3_delay. But once that's done, we can delete ir3_delay_calc_postra. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	140e117f2b	ir3/delay: Ignore earlier definitions to the same register We have a situation in some skia shaders like: add.f r0.x, ... (rpt2)nop mul.f ..., r0.x sam (xyzw) r0.x, ... rcp ..., r0.x Notice that rcp uses the result of the sam instruction, not the add.f, but we didn't keep track of which instructions kill the sources in ir3_delay, so we'd add an extra nop, resulting in a disagreement betwen ir3_delay and the scheduling graph. Since postsched is correct, fix ir3_delay. This only results in some very slight shader-db changes but keeps the next commit from changing things. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	a54e7baa65	ir3/postsched: Handle sync dependencies better We want to model soft dependencies, but because of how there's only a single bit to wait on all of them, there may be unnecessary delays inserted when a (sy)-consumer follows an unrelated (sy)-producer. Previously there was some code to try to work around this, but we can just model it directly using the sfu_delay and tex_delay cycle counts that we have to maintain anyway and delete it. This also gets rid of the calls to ir3_delay_postra with soft=true which would be more complicated to handle in the next commit. There is a functional change here: the idea of preferring less nop's over critical path length (max_delay) up to 3 nops is kept (and we delete the TODO which is already sort-of resolved by it), but delays due to (ss)/(sy) and nops are now treated equally, rather than always preferring nops over syncs. So if our estimate indicates that scheduling an (ss) consumer will result in a wait of one cycle and there's another instruction that will require one nop, we will treat them otherwise equal and choose based on max_delay instead. This results in more sstall, but the decrease in nops is much greater. total nops in shared programs: 376613 -> 345482 (-8.27%) nops in affected programs: 275483 -> 244352 (-11.30%) helped: 3226 HURT: 110 helped stats (abs) min: 1 max: 78 x̄: 9.73 x̃: 7 helped stats (rel) min: 0.19% max: 100.00% x̄: 19.48% x̃: 13.68% HURT stats (abs) min: 1 max: 16 x̄: 2.43 x̃: 2 HURT stats (rel) min: 0.00% max: 150.00% x̄: 13.34% x̃: 4.36% 95% mean confidence interval for nops value: -9.61 -9.06 95% mean confidence interval for nops %-change: -19.01% -17.78% Nops are helped. total sstall in shared programs: 126195 -> 133806 (6.03%) sstall in affected programs: 79440 -> 87051 (9.58%) helped: 300 HURT: 1922 helped stats (abs) min: 1 max: 15 x̄: 4.72 x̃: 4 helped stats (rel) min: 1.05% max: 100.00% x̄: 17.15% x̃: 14.55% HURT stats (abs) min: 1 max: 29 x̄: 4.70 x̃: 4 HURT stats (rel) min: 0.00% max: 900.00% x̄: 25.38% x̃: 10.53% 95% mean confidence interval for sstall value: 3.22 3.63 95% mean confidence interval for sstall %-change: 17.50% 21.78% Sstall are HURT. total (ss) in shared programs: 35190 -> 35472 (0.80%) (ss) in affected programs: 6433 -> 6715 (4.38%) helped: 163 HURT: 401 helped stats (abs) min: 1 max: 2 x̄: 1.06 x̃: 1 helped stats (rel) min: 1.92% max: 33.33% x̄: 11.53% x̃: 10.00% HURT stats (abs) min: 1 max: 3 x̄: 1.13 x̃: 1 HURT stats (rel) min: 1.56% max: 100.00% x̄: 15.33% x̃: 12.50% 95% mean confidence interval for (ss) value: 0.41 0.59 95% mean confidence interval for (ss) %-change: 6.22% 8.93% (ss) are HURT. total (sy) in shared programs: 13476 -> 13521 (0.33%) (sy) in affected programs: 669 -> 714 (6.73%) helped: 30 HURT: 78 helped stats (abs) min: 1 max: 2 x̄: 1.13 x̃: 1 helped stats (rel) min: 4.00% max: 50.00% x̄: 21.22% x̃: 21.11% HURT stats (abs) min: 1 max: 2 x̄: 1.01 x̃: 1 HURT stats (rel) min: 3.45% max: 100.00% x̄: 31.93% x̃: 25.00% 95% mean confidence interval for (sy) value: 0.23 0.60 95% mean confidence interval for (sy) %-change: 11.19% 23.15% (sy) are HURT. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	b9f61d7287	ir3/postsched: Fix copy-paste mistake Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	d9a91318b1	ir3/sched: Rewrite delay handling The old code walked the instructions between each ready instruction and each of its parents for every instruction, which can quickly become accidentally quadratic. Instead we keep track of the current "instruction pointer" of the to-be-scheduled instruction, and for each ready instruction calculate an "earliest possible IP" which is the IP that needs to be reached before we can schedule it. Because this stays constant as soon as an instruction becomes ready, we never have to recompute it and each call to ir3_delay_calc_prera() becomes a simple comparison and subtract. We only need to iterate over the children and update their earliest_ip when scheduling an instruction, and we already do that in util_day_prune_head() so it should be cheap. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	b8fc7a08f9	util/dag: Add dag_add_edge_max_data This will be useful for when the edge data represents a delay of some sort, like it will with ir3. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Connor Abbott	508f917d8c	util/dag: Make edge data a uintptr_t Nobody was actually using it as a pointer, and I'm going to introduce a shared function which relies on it not being a pointer so let's fix this once and for all. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13722>	2021-11-17 13:41:47 +00:00
Erico Nunes	ee2e14b352	ci: temporarily disable lima CI The lima board farm will be unavailable for a few days, so disable it to avoid CI failures. Signed-off-by: Erico Nunes <nunes.erico@gmail.com> Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13595>	2021-11-17 11:40:19 +00:00
Kenneth Graunke	3b78f17532	iris: Tidy code in iris_use_pinned_bo a bit Now that we aren't short-circuiting most of the code, we should probably reorganize it a little bit. Tagged with fixes just so we pull all the refactors together as one group. Fixes: `b21e916a62` ("iris: Combine iris_use_pinned_bo and add_exec_bo") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13808>	2021-11-17 02:43:30 -08:00
Kenneth Graunke	6e90984934	iris: Check for cross-batch flushing whenever a buffer is newly written. We need to perform cross-batch flushing if any batch writes to a BO while others refer to it. We checked this case when recording a new BO in the list which we'd never seen before. However, we neglected to handle the case when we already read from a BO, but then began writing to it. That new write may provoke a conflict between existing reads in other batches, so we need to re-check the cross-batch flushing. Caught by Piglit's copyteximage when forcing blits and copies to use a new IRIS_BATCH_BLITTER that isn't upstream yet. But this bug could be provoked by render/compute work today...we just hadn't noticed it. Fixes: `b21e916a62` ("iris: Combine iris_use_pinned_bo and add_exec_bo") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13808>	2021-11-17 02:43:30 -08:00
Kenneth Graunke	76030964a6	iris: Make a helper function for cross-batch dependency flushing This should have no functional change, but it's tagged with Fixes anyway because it's needed for the bug fix in the next patch. Fixes: `b21e916a62` ("iris: Combine iris_use_pinned_bo and add_exec_bo") Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13808>	2021-11-17 02:43:30 -08:00
Alejandro Piñeiro	cbf0d83eac	v3d,v3dv: move TFU register definition to a common header We are using the same definitions for both OpenGL and Vulkan, so let's move it to common. As we are here we are also adding versioning on the TFU register definition. Those are basically register bit places, so really likely to change between versions. Adding 33 as it is the first version they got defined. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13832>	2021-11-17 11:04:31 +01:00
Samuel Pitoiset	ffbad81305	radv: simplify re-using cache entries in radv_pipeline_cache_insert_shaders() If entry->shaders[i] is NULL, shaders[i] should be also NULL, so the else condition is a no-op. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13823>	2021-11-17 08:15:53 +01:00
Iago Toral Quiroga	836a4b5836	v3dv: fix internal bpp of D/S formats Depth/stencil formats can, at worse (d32/d24s8), be exactly 32bpp, which is the minimum we can program for the internal format. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13816>	2021-11-17 06:57:48 +00:00
Pavel Asyutchenko	8ee7309e57	llvmpipe: enable PIPE_CAP_FBFETCH_COHERENT llvmpipe's fragment shaders are always run sequentially and in API order for a single tile, so it's impossible to have out of order render target writes requiring fetch barriers. Issues fixed in previous commits were actually breaking most piglit/deqp tests for coherent extension variant. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	e403c1c23e	llvmpipe: remove dead args from load_unswizzled_block They were only used in fs_fb_fetch. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	ea6eeb70e6	llvmpipe: fix FB fetch with non 32-bit render target formats Use lp_build_fetch_rgba_soa instead of lp_build_unpack_rgba_soa. This one was failing most of deqp framebuffer_fetch tests. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	2b3a020928	llvmpipe: protect from doing FB fetch of missing buffers Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	3ebd6498c4	llvmpipe: fix gl_FragColor and gl_LastFragData[0] combination Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Pavel Asyutchenko	b1de61dd38	llvmpipe: fix wrong assumption on FB fetch shader opacity In certain cases variant->opaque could be set to true, which reset command list for tiles fully covered by a triangle with this shader. This is obviously wrong in presence of framebuffer fetch. Signed-off-by: Pavel Asyutchenko <sventeam@yandex.ru> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13252>	2021-11-17 04:08:54 +00:00
Mike Blumenkrantz	86eb1549ef	zink: implement pipe_context::draw_vertex_state rough implementation, but it should be a decent start Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13692>	2021-11-17 03:16:13 +00:00
Vasily Khoruzhick	02e5f4fb10	lima: add more wrap modes Using 1 bit per wrap mode looked very suspicious and after some experiments it turns out it's 3-bit enum. Border color is also here, it sits right after depth field. For some reason it uses 16 bit per channel just like for clear color in RSW GL_CLAMP mode is broken for nearest filter just as on Midgard, so add the same workaround - use GL_CLAMP_TO_EDGE for nearest filter. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	cbed4d784e	lima: handle 1D samplers It's just a matter of changing number of dimensions in texture descriptor. Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Vasily Khoruzhick	fa86a2a94d	lima: add support for 3D textures It looks like MBS format used by blob doesn't distinguish sampler2D from sampler3D, so load texture instruction is the same for 2D and 3D textures. So all we need to RE is texture descriptor for 3D textures, but blob doesn't implement it, so we need to do some guesswork: - unknown_3_1 looks like a depth since it sits after height/width and always set to 1 - unknown_2_2 is exactly 3 bits and it follows wrap_t, so it must be wrap_r - missing part is texture type for 3D textures. By trial and error it seems to be 4. First bit is only set for cubemap, so it's likely a separate flag, and rest 2 bits look like number of tex dimensions akin to midgard and later (thanks, panfrost!) with 0 for 1D, 1 for 2D and 2 for 3D. Put it all together and we have working 3D textures on lima! Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13213>	2021-11-16 22:58:12 +00:00
Mike Blumenkrantz	97b92c9c32	zink: set suballocator bo size to aligned allocation size this is the actual memory size cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13824>	2021-11-16 22:29:20 +00:00
Mike Blumenkrantz	eb6f1d5348	zink: block suballocator caching for swapchain/dmabuf images these have pNext pointers which makes their memory uncacheable cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13824>	2021-11-16 22:29:20 +00:00
Marek Olšák	ba6d389fa7	radeonsi: don't use GS SGPR6 for the small prim cull info use a user SGPR instead. This will be needed in the future. Also don't upload small_prim_precision because it's passed via VS_STATE_BITS. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	0690a44e69	radeonsi: inline declare_vs_specific_input_sgprs I think it was getting a little hard to follow. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	513bd6acca	radeonsi: cull against clip planes, clipvertex, clip/cull distances in shader The downside is that this duplicates shader code for clip/cull distances in both the position and parameter portions of the shader. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Marek Olšák	881c459191	radeonsi: unify how ngg_cull_flags are set Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13811>	2021-11-16 19:41:07 +00:00
Jesse Natalie	a818f7b686	d3d12: Fix incorrect hash table usage I'd assumed that since insert didn't take a deleter, it was find-or-insert, not insert-or-replace. This caused a bo reference leak if the same bo was used more than once in a batch. Fixes: `fde36d7992` ("d3d12: Don't wait for GPU reads to do CPU reads") Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13819>	2021-11-16 19:27:16 +00:00
Vasily Khoruzhick	764760314d	lima: add native txp support Currently lima uses generic TXP lowering that results in downgrading coords precision to FP16 since we have to do some calculations with coords instead of loading them directly from varying. Mali4x0 has native TXP support, however coords and projector have to come from a single source. Add NIR lowering pass that combines coords and projector into a single backend-specific source and use it instead of generic lowering. Unfortunately this change regresses one test, but it also fails in blob and disassembly is now identical. shader-db diff: total instructions in shared programs: 15623 -> 15603 (-0.13%) instructions in affected programs: 877 -> 857 (-2.28%) helped: 7 HURT: 0 helped stats (abs) min: 2 max: 8 x̄: 2.86 x̃: 2 helped stats (rel) min: 0.87% max: 10.53% x̄: 4.93% x̃: 1.85% 95% mean confidence interval for instructions value: -4.95 -0.76 95% mean confidence interval for instructions %-change: -9.31% -0.55% Instructions are helped. total loops in shared programs: 3 -> 3 (0.00%) loops in affected programs: 0 -> 0 helped: 0 HURT: 0 total spills in shared programs: 136 -> 137 (0.74%) spills in affected programs: 0 -> 1 helped: 0 HURT: 1 total fills in shared programs: 598 -> 602 (0.67%) fills in affected programs: 0 -> 4 helped: 0 HURT: 1 Tested-by: Denis Pauk <pauk.denis@gmail.com> Reviewed-by: Andreas Baierl <ichgeh@imkreisrum.de> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13111>	2021-11-16 19:13:42 +00:00
Rob Clark	fac9d22773	isaspec: Add prototypes for expr evaluators Add function prototypes for generated expr evaluators, to avoid a use- before-declaration issue if an expression references a derived field. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13787>	2021-11-16 18:44:22 +00:00
Ilia Mirkin	aa93896156	freedreno/ir3: adjust condition for when to use ldib We have to use it any time that the image is writable. Otherwise writes from the same invocation won't have posted into the texture cache. See: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5629 Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13807>	2021-11-16 18:22:29 +00:00
Samuel Pitoiset	011ea32585	nir: fix constant expression of ibitfield_extract This fixes dEQP-VK.graphicsfuzz.cov-condition-bitfield-extract-integer. For example, nir_ibitfield_extract(3, 1, 2) should return 1. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13791>	2021-11-16 17:32:21 +00:00
Jason Ekstrand	8a11d2a31b	vulkan: Add a dummy sync type This is useful in WSI scenarios where you want to trivially signal a fence or semaphore. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	2a910071bc	vulkan,anv: Auto-detect syncobj features Instead of having a bunch of const vk_sync_type for each permutation of vk_drm_syncobj capabilities, have a vk_drm_syncobj_get_type helper which auto-detects features. If a driver can't support a feature for some reason (i915 got timeline support very late, for instance), they can always mask off feature bits they don't want. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	36b4d12f02	anv: Simplify submit_simple_batch() BO waits aren't going away any time soon so using a syncobj here doesn't really gain us anything. It just makes it more complicated. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	623e9ecd6d	anv: Remove unnecessary syncobj wrappers These are entirely unused except for the syncobj in submit_simple_batch. We can use the libdrm wrappers for that as they're basically equivalent and the core Vulkan sync code already depends on them. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	c5ac1d1669	vulkan: Add an emulated binary vk_sync type This wraps a timeline vk_sync type and turns it into a binary one. This is useful for, for instance, driver layered on D3D12. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	36ea90a361	anv: Convert to the common sync and submit framework This is, unfortunately, a large flag-day mega-commit. However, any other approach would likely be fragile and involve a lot more churn as we try to plumb the new vk_fence and vk_semaphore primitives into ANV's submit code before we delete it all. Instead, we do it all in one go and accept the consequences. While this should be mostly functionally equivalent to the previous code, there is one potential perf-affecting change. The command buffer chaining optimization no longer works across VkSubmitInfo structs. Within a single VkSubmitInfo, we will attempt to chain all the command buffers together but we no longer try to chain across a VkSubmitInfo boundary. Hopefully, this isn't a significant perf problem. If it ever is, we'll have to teach the core runtime code how to combine two or more VkSubmitInfos into a single vk_queue_submit. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	87cf858946	anv: Use helpers in util/os_time.h in the query code These are about to be the only use of anv_gettime_ns() so lets switch them over so the next patch can delete the helper outright. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	213edd1841	anv: Remove the last remnants of in/out fences This should have been dropped as part of `d44ea09e61` ("anv: Drop unused sync_file and BO semaphore code") but we missed a few bits. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	cbb13fae33	anv: Add a BO sync type This is copy+paste from the BO implementation of anv_fence Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	dc62695c3a	anv: Delete ANV_SEMAPHORE_TYPE_DUMMY It's never set as a semaphore type. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	e3fda7ae78	vulkan/wsi/display: Wrap wsi_display_fence in a vk_sync This gets rid of the bespoke vfunc interface in the WSI code. Eventually, I'd love to get rid of wsi_display_fence entirely but this at least makes it a lot more palatable. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	b1addc425a	wsi/display: Rework wsi_fence a bit Get rid of most of the guts of the base class and just leave it as a vtable. We can also drop some of wsi_display_fence. One functional change here is that we're now using VK_SYSTEM_ALLOCATION_SCOPE_INSTANCE which is more correct anyway because, thanks to the funky reference counting we do with destroyed and event_received, its lifetime is tied to the physical device, at best. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	2bd3434fa2	vulkan/wsi: Drop wsi_common_get_current_time() Use os_time_get_nano() instead. At this point, it's just a wrapper around os_time_get_nano() anyway. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	7b19c9d19b	vulkan/device: Log the timeline mode when lost Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	9bffd81f1c	vulkan: Add common implementations of vkQueueSubmit and vkQueueWaitIdle This adds a new vk_queue_submit object which contains a list of command buffers as well as wait and signal operations along with a driver hook which takes a vk_queue and a vk_queue_submit and does the actual submit. The common code then handles spawning a submit thread if needed, waiting for timeline points to materialize, dealing with timeline semaphore emulation via vk_timeline, etc. All the driver sees are vk_queue.submit calls with fully materialized vk_sync objects which it can wait on unconditionally. This implementation takes a page from RADV's book and only ever spawns the submit thread if it sees a timeline wait on a time point that has not yet materialized. If this never happens, it calls vk_queue.submit directly from vkQueueSubmit() and the thread is never spawned. One other nicety of the new framework is that there is no longer a distinction, from the driver's PoV, between fences and semaphores. The fence, if any, is included as just one more signal operation on the final vk_queue_submit in the batch. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	673b5e97ec	vulkan: Add a common implementation of VkSemaphore This is built on the new vk_sync primitives. In the vk_physical_device, the driver provides a null-terminated array of vk_sync_type pointers in priority order. The semaphore implementation then selects the first type that meets the necessary criterion. In particular, semaphores may or may not be timelines depending on the VkSemaphoreType. It also auto-selects the semaphore type based on the external handle types provided and can down-grade as needed to support a particular external handle. The implementation itself is mostly copy+pasted from ANV. The primary difference is the fact that anv_semaphore_impl has been replaced with vk_sync. The permanent vk_sync is still embedded (like ANV) but the temporary one is a pointer. This makes stealing the temporary state as part of VkQueueSubmit a bit easier. All of the interesting stuff around waits, signals, etc. is implemented by the vk_sync interface. All this code does is wrap it all in the annoyingly detailed VkFence rules so we can provide the correct Vulkan entrypoint behavior. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:54:27 -06:00
Jason Ekstrand	60e7359db0	vulkan: Add a common implementation of VkFence This is built on the new vk_sync primitives. In the vk_physical_device, the driver provides a null-terminated array of vk_sync_type pointers in priority order. The fence implementation then selects the first type that meets the necessary criterion. In particular, fences can't be timelines and need to support reset and CPU wait. It also auto-selects the fence type based on the external handle types provided and can down-grade as needed to support a particular external handle. The implementation itself is mostly copy+pasted from ANV. The primary difference is the fact that anv_fence_impl has been replaced with vk_sync. The permanent vk_sync is still embedded (like ANV) but the temporary one is a pointer. This makes stealing the temporary state as part of VkQueueSubmit a bit easier. All of the interesting stuff around waits, resets, etc. is implemented by the vk_sync interface. All this code does is wrap it all in the annoyingly detailed VkFence rules so we can provide the correct Vulkan entrypoint behavior. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	301e20f7a3	vulkan: Add an emulated timeline sync type This is mostly copy+paste (and a bit of re-typing) from ANV. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	f3843b11c9	c11/threads: Re-align return values for timed waits They're supposed to return thrd_timedout (which we mistakenly named thrd_timeout), not thrd_busy. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	13ab3757bb	vulkan: Add a common vk_drm_syncobj struct Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	fdb636e3a6	vulkan/vk_device: Add a drm_fd field Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	95dee5150a	vulkan/util: Include stdlib.h It's needed for malloc() which is used by STACK_ARRAY Fixes: `f695171e38` ("vulkan: add common entrypoints for sparse image requirements/properties") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	8134feb9a2	vulkan/meson: Re-arrange libvulkan_util deps a bit Rename files_vulkan_runtime to vulkan_runtime_files and add a new vulkan_runtime_deps array for dependencies. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	3cf5fced4c	vulkan: Add a vk_sync base class This doesn't map directly to any particular Vulkan object but is, instead, a base class for the internal implementations of both VkFence and VkSemaphore. Its utility will become evident in later patches. The design of vk_sync will look familiar to anyone with significant experience in DRM. The base object itself is just a pointer to a vfunc table with function pointers providing the implementation of the various operations. Depending on how the vk_sync will be used some of of those vfuncs are optional. If it's only going to be used for VkSemaphore, for instance, there's no need for reset(). Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	236ca76376	anv: Wire up the new status check Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	e9662e0154	vulkan/device: Add a check_status hook Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	955f329fbe	anv: Use the new common device lost tracking Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	dd89ef96d7	vulkan: Pull the device lost framework from ANV It's a bit on the over-complicated side but the objective is to make the debug log messages show up in the same thread as the first VK_ERROR_DEVICE_LOST so we don't massively confuse the app. It's unknown if this is actually ever a problem but, with submit happening off on its own thread, logging errors from threads the client doesn't know about doesn't seem like a massively great plan. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13427>	2021-11-16 10:02:08 -06:00
Jason Ekstrand	e6b4678fdb	anv: Move device memory maps back to anv_device_memory This effectively partially reverts `13fe43714c` ("anv: Add helpers in anv_allocator for mapping BOs") where we both added helpers and reworked memory mapping to stash the maps on the BO. The problem comes with external memory. Due to GEM rules, if a memory object is exported and then imported or imported twice, we have to deduplicate the anv_bo struct but, according to Vulkan rules, they are separate VkDeviceMemory objects. This means we either need to always map whole objects and reference-count the map or we need to handle maps separately for separate VkDeviceMemory objects. For now, take the later path. Fixes: `13fe43714c` ("anv: Add helpers in anv_allocator for mapping BOs") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5612 Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13795>	2021-11-16 15:18:03 +00:00
Mike Blumenkrantz	32c0c5fcd9	mesa: convert unsupported primtypes during display list compilation this adds primitive type translation in before the draw reaches gallium, which massively increases performance by avoiding any sort of buffer readback fixes #5249 Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13741>	2021-11-16 14:12:03 +00:00
Mike Blumenkrantz	97ba2f2fd4	move util/indices to core util these are useful tools to have outside of gallium Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Adam Jackson <ajax@redhat.com> Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13741>	2021-11-16 14:12:03 +00:00
Kenneth Graunke	88e4d3809c	intel/genxml: Decode VALIGN/HALIGN values in XY_BLOCK_COPY_BLT For easier readability in INTEL_DEBUG=bat. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>	2021-11-16 11:38:30 +00:00
Kenneth Graunke	406ff7473a	intel/genxml: Fix XY_BLOCK_COPY_BLT destination tiling field type Fixes: `2f58a63b2f` ("intel/genxml: Add XY_BLOCK_COPY_BLT on Tigerlake and later.") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>	2021-11-16 11:38:30 +00:00
Kenneth Graunke	29025f66fd	intel/genxml: Fix MI_FLUSH_DW to actually specify the length properly Fixes: `569afd37f1` ("intel/genxml: Copy gen12.xml to gen125.xml") Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>	2021-11-16 11:38:30 +00:00
Kenneth Graunke	ebc0099d89	intel/genxml: Collapse leading underscores on prefixed value defines We prefix names with an underscore to make them "safe" C identifiers when necessary. For example, a value of "32x32" would become "_32x32". However, when specifying something like <field ... prefix="BLOCK_SIZE"> <value name="32x32" value="0"/> </field> we already have a prefix that makes the field name safe. We'd rather generate a name with a single underscore, i.e. #define BLOCK_SIZE_32x32 0 rather than #define BLOCK_SIZE__32x32 0 This also fixes up affected defines in crocus. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>	2021-11-16 11:38:30 +00:00
Kenneth Graunke	cd7d3c7ae3	intel/genxml: Simplify prefix handling for field value lists When a <field> tag has multiple <value> children, listing symbolic names for possible field values, we generate #defines for each value, with an optional prefix. I don't know why, but this code was checking whether self.default is None. We want to generate the same list of #defines, with a prefix, regardless of whether the field has a default value specified or not. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13809>	2021-11-16 11:38:30 +00:00
Kenneth Graunke	f4004fde26	iris: Fix parameters to iris_copy_region in reallocate_resource_inplace We had accidentally passed <x, y, z, l> instead of <l, x, y, z>. Fixes: `b8ef3271c8` ("iris: Move suballocated resources to a dedicated allocation on export") Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13815>	2021-11-16 11:22:04 +00:00
Bas Nieuwenhuizen	68b7b4fb38	radv: Don't crash if VkExternalImageFormatProperties isn't provided. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5623 Stable: 21.2 21.3 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13786>	2021-11-16 10:23:37 +00:00
Alejandro Piñeiro	fef9ef48dd	gallium/u_blitter: clean up texcoords ZW when filling up just XY To avoid a scenario like this: * One blit needed the four components => XYZW filled up with 4 values * Following blit needing two components => ZW uses the previous values We detected this using the v3d driver with the arb_framebuffer_srgb-blit test, specifically: ./bin/arb_framebuffer_srgb-blit texture linear_to_srgb msaa enabled render -auto -fbo The main linear to srgb with msaa (not doing the resolve yet) blit requires the four components. At the end (after a resolve copy), the test uses glReadPixels, and internally it uses the blitter with two components, but the shader still uses lod on the texel fetch, so it gets the one used for the main blit, when it should be zero. Right now v3d works fine even with that wrong value, and I assume that any other driver too. But we can't ensure that would keep happening on the future, so let's use correct values. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13753>	2021-11-16 10:17:18 +01:00
Timur Kristóf	59860d4873	nir: Group per-primitive outputs at the end for driver location assign. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Acked-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	f23f7ef316	nir: Don't compact per-vertex and per-primitive outputs together. Prevent nir_compact_varyings from putting per-vertex and per-primitive output components in the same slot. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	e1e461d11c	nir: Lower cull and clip distance arrays for mesh shaders. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	6a502a0a2c	nir: Add new option to lower invocation ID from invocation index. Add this as an option to nir_lower_compute_system_values_options instead of just relying on the shader's options. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	7562e34463	nir, spirv: Don't mark NV_mesh_shader primitive indices as per-primitive. They are not per-primitive in NV_mesh_shader, but a flat array. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Oliveira <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	d79d9a7a06	nir: Fix nir_lower_io with per primitive outputs. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	9cf4124be0	nir: Print Mesh Shader specific info. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Timur Kristóf	5aa39253cb	nir: Rename nir_get_io_vertex_index_src and include per-primitive I/O. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13466>	2021-11-16 07:46:55 +00:00
Vinson Lee	008f5a127c	ac/rgp: Initialize clock_calibration with memset. Fix defect reported by Coverity Scan. Uninitialized scalar variable (UNINIT) uninit_use_in_call: Using uninitialized value clock_calibration. Field clock_calibration.reserved is uninitialized when calling fwrite. Fixes: `1ee85e8bab` ("ac/rgp: add support for clock calibration") Suggested-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13783>	2021-11-15 22:59:05 -08:00
Ilia Mirkin	bf14a63e1d	freedreno/a4xx: hook up sample mask/id, used to determine helper invocs This fixes the various gl_HelperInvocation-based tests. There's a lowering pass which converts it to (1 << sampleid) & samplemask. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>	2021-11-16 05:08:26 +00:00
Ilia Mirkin	a95a9f0cc6	freedreno/a4xx: include guesses from a3xx for some of the constid's The ones that are untested are left as comments. The ones that rename values were tested manually. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>	2021-11-16 05:08:26 +00:00
Ilia Mirkin	45606b51cc	freedreno/a4xx: indicate whether outputs are uint/sint Unclear whether this fixes anything, but the blob does seem to set these. (Discovered while trying to determine if value clamping was missing for non-32-bit integer formats, which fail in some tests.) Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>	2021-11-16 05:08:26 +00:00
Ilia Mirkin	14087cb9ea	freedreno/a4xx: fix stencil-textured border colors These are implemented with unusual sampler formats, so the usual approach of looking at the format descriptors fails. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13806>	2021-11-16 05:08:26 +00:00
Ilia Mirkin	20e8e11d64	freedreno/a6xx: re-express buffer textures more logically Same as a5xx, move one bit into the tex type, one as a separate named BUFFER bit. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13805>	2021-11-16 04:44:23 +00:00
Ilia Mirkin	8c041f4bf3	freedreno/a5xx: re-express buffer textures more logically Instead of treating it as 2 bits to enable, make BUFFER a type (and extend the bitfield width), and then add a separate BUFFER bit (ostensibly to perform the width/height concatenation but who knows). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13805>	2021-11-16 04:44:23 +00:00
Ilia Mirkin	6566eae933	freedreno/a4xx: add proper buffer texture support Rather than faking it as a 1d texture, add the buffer texture type, and allow a full range of sizes. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13805>	2021-11-16 04:44:23 +00:00
Marek Olšák	42dbfd7206	radeonsi: make si_llvm_emit_clipvertex non-static it will be used in culling code Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	d3d5777536	radeonsi: remove an incorrect comment at lds_byte0_accept_flag Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	20e83abf06	radeonsi: improve memory instruction tracking Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	901697654a	radeonsi: add dcc_msaa option to enable DCC for MSAA Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	5a5263d65d	radeonsi: unify GFX9_VSGS_NUM_USER_SGPR and GFX9_TESGS_NUM_USER_SGPR Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	9151ac3531	ac,radeonsi: cull small lines in the shader using the diamond exit rule It also splits clip_half_line_width into X and Y components for tighter view culling. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	701a0b5165	radeonsi: add si_state_rasterizer::ngg_cull_flags_lines and rename the others Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	3166d4428d	radeonsi: set EXTRA_DX_DY_PRECISION for lines where it's supported Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:46 +00:00
Marek Olšák	4571778008	radeonsi: set PERPENDICULAR_ENDCAP_ENA for wide AA lines This is more correct. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:45 +00:00
Marek Olšák	3338956268	radeonsi: make si_get_small_prim_cull_info static Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:45 +00:00
Marek Olšák	963b7475a9	radeonsi: use ac_build_load_to_sgpr in gfx10_emit_ngg_culling_epilogue This is more correct because we are loading constants into an SGPR even though there is no effect on behavior in this case. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:45 +00:00
Marek Olšák	f8a0aa6852	radeonsi: fix view culling for wide lines We need to cull wide lines as quads, but only for view culling. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:45 +00:00
Marek Olšák	8f687bb5dc	radeonsi: fix shader culling with integer pixel centers Only Nine was using them. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13700>	2021-11-16 02:11:45 +00:00
Ilia Mirkin	185826a400	nir: remove double-validation of src component counts The nir_tex_instr_src_size helper already sorts this out correctly, no need to do it twice, and validate_src takes care of it. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13781>	2021-11-16 01:23:41 +00:00
Bas Nieuwenhuizen	6e3266709a	radv: Add more checking of cache sizes. Hopefully prevents things. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13789>	2021-11-16 00:58:08 +00:00
Bas Nieuwenhuizen	9494c566c2	radv: Fix memory corruption loading RT pipeline cache entries. Oops. Forgot to account for the size here. Fixes: `ca2d96db51` ("radv: Add caching for RT pipelines.") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13789>	2021-11-16 00:58:08 +00:00
Ilia Mirkin	8c9a86cb57	freedreno/ir3: fix image-to-tex flags, remove 3d -> array hack The function would return both the 3d and array flags set for 2d array, and would return just 3d for cubes. Fix the flags so that they are appropriate for images. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13804>	2021-11-16 00:33:31 +00:00
Paulo Zanoni	a9c1cc63c6	iris: call brw_process_intel_debug_variable() earlier We're currently only calling it after creating the screen and the bufmgr. There are a few cases where Iris checks for the DEBUG_BUFMGR bit before we call brw_process_intel_debug_variable(), which means intel_debug is 0 and so we don't run the debug code. Today, these are all related to the creation of the workaround bo and its mmap. I found this in a custom branch after I converted to INTEL_DEBUG an environment variable that I had. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13780>	2021-11-15 23:33:18 +00:00
Eric Engestrom	9ae34651f7	docs: update branchpoint instructions Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13745>	2021-11-15 23:24:03 +00:00
Vasily Khoruzhick	15013958d0	lima: enable PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS Mali4x0 PP doesn't have a swizzle for load_input, so use POT-aligned varyings to avoid unnecessary movs for vec3 and precision downgrade in case if this vec3 is coordinates for a sampler shader-db: total instructions in shared programs: 15707 -> 15623 (-0.53%) instructions in affected programs: 3906 -> 3822 (-2.15%) helped: 47 HURT: 18 helped stats (abs) min: 1 max: 9 x̄: 3.09 x̃: 2 helped stats (rel) min: 1.49% max: 23.53% x̄: 8.20% x̃: 6.45% HURT stats (abs) min: 1 max: 7 x̄: 3.39 x̃: 3 HURT stats (rel) min: 0.78% max: 20.59% x̄: 10.45% x̃: 10.97% 95% mean confidence interval for instructions value: -2.18 -0.41 95% mean confidence interval for instructions %-change: -5.70% -0.38% Instructions are helped. total spills in shared programs: 146 -> 136 (-6.85%) spills in affected programs: 39 -> 29 (-25.64%) helped: 6 HURT: 0 total fills in shared programs: 617 -> 598 (-3.08%) fills in affected programs: 125 -> 106 (-15.20%) helped: 6 HURT: 0 HURT shaders are vertex shaders where we may need more instructions for non-packed vec3s. It's acceptable trade-off since we don't get precision downgrade if this varying is coordinates for a sampler. Reviewed-by: Erico Nunes <nunes.erico@gmail.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13151>	2021-11-15 22:52:55 +00:00
Vasily Khoruzhick	3bb192a15b	gallium: add PIPE_CAP_PREFER_POT_ALIGNED_VARYINGS Driver should enable this cap if it prefers varyings to be aligned to power of two in a slot, i.e. vec4 in .xyzw, vec3 in .xyz, vec2 in .xy or .zw Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Signed-off-by: Vasily Khoruzhick <anarsoul@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13151>	2021-11-15 22:52:55 +00:00
Eric Engestrom	df93e7aeee	docs/submittingpatches: mention use of the `-x` flag of `git cherry-pick` when backporting a commit Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13746>	2021-11-15 22:47:39 +00:00
Eric Engestrom	d47bfd56de	docs/submittingpatches: add formatting around the release branches names Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13746>	2021-11-15 22:47:39 +00:00
Eric Engestrom	162622bf63	docs/submittingpatches: add link to section describing how to make a backport MR Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13746>	2021-11-15 22:47:39 +00:00
Emma Anholt	42753be1e7	freedreno/a6xx: Fix a bunch of 3D texture layout to match blob behavior. This doesn't get all of the texelfetch sampler3d testcases working, but it's sure a lot more. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Emma Anholt	a3717c1496	freedreno/cffdump: Handle the TILE_ALL flag in unit test generation. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Emma Anholt	e42450a255	freedreno/cffdump: Fix up formatting of texturator unit test script output. Now I don't need to re-clang-format as I generate testcases. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Emma Anholt	7a6fc25daa	freedreno/fdl: Add support for unit testing 3D texture array strides. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Emma Anholt	0d7c6eedc7	freedreno/cffdump: Fix 64-bit reg decode in script mode. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Emma Anholt	f63fd3425d	freedreno: Fix the texturator unit test script. We no longer have reg defs for the HI fields, so all we can access from lua is the low 32 bits. LUA has only double-precision floats for numbers, so we can't fix that. However, the high bits are almost always the same, so it's not that big of a deal to be ignoring them for this script. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Emma Anholt	3ddefb4ae3	freedreno/fdl: Dump the generated layout when a layout test fails. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13733>	2021-11-15 22:25:08 +00:00
Dave Airlie	4b27ebee7f	util/vl: move gallium vl_vlc.h and vl_rbsp.h to shared code. For vulkan video I need these to parse slice headers, so move them somewhere easier to get at them. drops pointer_to_uintptr in favour of a cast. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13768>	2021-11-15 21:57:28 +00:00
Jordan Justen	29c2f32a57	intel/dev: Add platform enum with DG2 G10 & G11 Based on Lionel's "intel/devinfo: store the different kind of DG2". Ref: `361b3fee3c` ("intel: move away from booleans to identify platforms") Ref: https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=9e22cfc5e9b92556a56d8a564cdab31045f29010 Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13797>	2021-11-15 21:39:27 +00:00
Mike Blumenkrantz	43c457a6ec	zink: always add VK_IMAGE_CREATE_2D_ARRAY_COMPATIBLE_BIT for 3D images there's no way to know what an image will be used for, so this bit needs to always be added fixes KHR-GL46.packed_pixels.varied_rectangle.compressed_rgb cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13798>	2021-11-15 21:24:05 +00:00
Mike Blumenkrantz	93a55537f2	zink: stop running discard_if in generated tcs just embarrassing smh Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13798>	2021-11-15 21:24:05 +00:00
Samuel Pitoiset	df526aae1b	zink: skip one GLES31 subset to avoid GPU hangs on Navi10 Weird bug... I will figure out later. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13796>	2021-11-15 20:33:22 +00:00
Dave Airlie	bf7b6dd73f	intel/genxml: generate video headers This just generates the video engine pieces. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Dave Airlie	8f9006804a	intel/genxml: fix gen6 LD->VLD typo. Pointed out by Ilia Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Matt Turner	2bb8aa2942	intel/genxml: capitalize decoder mode select properly Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Dave Airlie	2268fc1bb6	intel/genxml: fix Picure->Picture typo Ilia pointed this out. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Dave Airlie	dc32a164c8	intel/genxml: align QM field names across gens. This just picks a consistent name. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Dave Airlie	0f3f8b4591	intel/genxml: fix some missing address from the 75 xml Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Dave Airlie	5d956d65b6	intel/genxml: cleanup video xml collisions. When you enable video genxml, lots of warnings about redefined things appear, just clean those up before things get started. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13788>	2021-11-15 20:13:46 +00:00
Rhys Perry	d89461208b	aco: consider pseudo-instructions reading exec in needs_exec_mask() No matter the format, this should return true if the instruction has an exec operand. Otherwise, eliminate_useless_exec_writes_in_block() could remove an exec write in a block if it's successor begins with: s2: %3737:s[8-9] = p_parallelcopy %0:exec s2: %0:exec, s1: %3738:scc = s_wqm_b64 %3737:s[8-9] Totals from 3 (0.00% of 150170) affected shaders (GFX10.3): CodeSize: 23184 -> 23204 (+0.09%) Instrs: 4143 -> 4148 (+0.12%) Latency: 98379 -> 98382 (+0.00%) Copies: 172 -> 175 (+1.74%) Branches: 95 -> 97 (+2.11%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Fixes: `bc13049747` ("aco: Eliminate useless exec writes in jump threading.") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5620 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13776>	2021-11-15 18:58:37 +00:00
Daniel Schürmann	7876886bdc	radv: use nir_fold_16bit_sampler_conversions() for now only for texture dest and if there is no rounding mode required. Totals from 2 (0.00% of 150170) affected shaders: (GFX10.3) CodeSize: 7980 -> 7948 (-0.40%) Instrs: 1441 -> 1422 (-1.32%) Latency: 7703 -> 7626 (-1.00%) InvThroughput: 2336 -> 2302 (-1.46%) VClause: 34 -> 36 (+5.88%) Copies: 57 -> 58 (+1.75%) Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Daniel Schürmann	ab21183b5d	aco: implement D16 texture loads Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Daniel Schürmann	626aa7b648	aco: workaround GFX9 hardware bug for D16 image instructions Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Daniel Schürmann	8f1483cd5c	aco: add more D16 load/store instructions to RA and validator This enables correct handling for buffer_load/store_format_d16_x and D16 Image instructions. Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Daniel Schürmann	1e4c6e059e	nir/fold_16bit_sampler_conversions: skip sparse residency tex instructions The residency return value mismatches between NIR and Radeon. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13592>	2021-11-15 18:28:20 +00:00
Rob Clark	f53e1823c2	freedreno: caps for clover Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12500>	2021-11-15 18:06:39 +00:00
Rob Clark	9e7f5b75ec	freedreno: Add PIPE_SHADER_IR_NIR_SERIALIZED support Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12500>	2021-11-15 18:06:39 +00:00
Ilia Mirkin	31d6cd224a	a5xx: remove astc srgb workaround logic This was copied from a4xx, which only needs it on one chip model (A420). Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13782>	2021-11-15 17:31:53 +00:00
Samuel Pitoiset	cb56b83572	zink: update the CI lists for RADV Lot of GPU hangs fixed lately. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13792>	2021-11-15 16:19:29 +00:00
Jesse Natalie	f0e5bc228c	microsoft/clc: Add a test for arg metadata Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13759>	2021-11-15 07:47:06 -08:00
Jesse Natalie	53d4dc7feb	clc: Use kernel_arg_type_qual string to add const type qualifier to arg metadata Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13759>	2021-11-15 07:47:00 -08:00
Iago Toral Quiroga	f384c763fc	v3d,v3dv: move tile size calculation to a common helper We had this code replicated in 3 places across both drivers. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13790>	2021-11-15 11:40:39 +00:00
Samuel Pitoiset	a9c4e0c371	ac/spm: fix determining the counter slot Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `e928f475cc` ("ac: add initial SPM support") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13758>	2021-11-15 11:24:36 +01:00
Samuel Pitoiset	11c6a32759	ac/spm: fix determing the SPM wire One SPM wire holds two 16-bit counters. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Fixes: `e928f475cc` ("ac: add initial SPM support") Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13758>	2021-11-15 11:24:33 +01:00
Samuel Pitoiset	e94a899c0e	radv: fix a sync issue on GFX9+ by clearing the upload BO fence If the same cmdbuf is submitted more than once, they were waiting on the same fence value. Fix this by clearing the value when beginning a new command buffer. This might fix spurious GPU hangs, especially on GFX9. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5401 Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13777>	2021-11-15 09:59:53 +01:00
Timothy Arceri	9d9de15a02	mesa: fix buffer overrun in SavedObj texture obj array Fixes: `3be42f9ca1` ("mesa: rewrite glPushAttrib/glPopAttrib to get rid of malloc") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5621 Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13773>	2021-11-15 00:09:57 +00:00
Dave Airlie	27903abbb6	llvmpipe: fix compressed image sizes. VK CTS just added some new tests to write to a compressed image from a compute shader, which was overrunning memory. The image width/height need to be sized according to the block sizes to avoid overwriting memory. dEQP-VK.image.sample_texture.bit_compressed Cc: mesa-stable Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13618>	2021-11-15 07:15:36 +10:00
Dave Airlie	53a8faafc1	llvmpipe: disable 64-bit integer textures. This fixes some crashes in VK-GL-CTS where it doesn't deal with these. Cc: mesa-stable Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13617>	2021-11-14 20:47:15 +00:00
Timur Kristóf	d80c7f3406	aco: Fix how p_is_helper interacts with optimizations. p_is_helper doesn't have any operands, so ACO's value numbering and/or the pre-RA optimizer could incorrectly recognize two such instructions as the same. This patch adds exec as an operand to p_is_helper in order to achieve correct behavior. Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5570 Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13577>	2021-11-13 16:32:02 +01:00
Emma Anholt	01d36149cd	ci/freedreno: Add a link to the issue for color_depth_attachments. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13747>	2021-11-12 20:26:22 +00:00
Emma Anholt	1847700d3c	ci/freedreno: Add notes explaining the KHR-GL* failures. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13747>	2021-11-12 20:26:22 +00:00
Emma Anholt	32b51d5e60	freedreno/a6xx: Do sparse setup of the TFB program. We don't need to init the whole program RAM, just the locations we are actually writing from. Syncs this code up with tu a bit more. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13747>	2021-11-12 20:26:22 +00:00
Emma Anholt	943449fb8e	ci/freedreno: Enable the tes-input/tcs-input tests. They seem to be mostly passing these days. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13747>	2021-11-12 20:26:22 +00:00
Emma Anholt	2ce44a0298	freedreno/ir3: Fix an off-by-one in so->outputs_count safety assert. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13747>	2021-11-12 20:26:22 +00:00
Emma Anholt	02079cbb77	freedreno/a6xx: Add some notes about piglit failures. Hopefully this helps others save time looking at piglit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13747>	2021-11-12 20:26:22 +00:00
Rhys Perry	11b533cb19	aco: optimize load_local_invocation_index with single-wave workgroups fossil-db (Sienna Cichlid): Totals from 668 (0.52% of 128647) affected shaders: CodeSize: 2201912 -> 2193336 (-0.39%) Instrs: 403124 -> 402325 (-0.20%) Latency: 4510940 -> 4510214 (-0.02%); split: -0.02%, +0.00% InvThroughput: 681057 -> 679453 (-0.24%); split: -0.24%, +0.00% VClause: 6470 -> 6467 (-0.05%) SClause: 12759 -> 12755 (-0.03%) Copies: 26348 -> 26218 (-0.49%); split: -0.50%, +0.00% PreSGPRs: 26140 -> 26101 (-0.15%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel-schuermann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13757>	2021-11-12 18:59:51 +00:00
Rhys Perry	2d07bcad66	radv: lower load_local_invocation_index with 1D workgroups For 1D workgroups, we can just load from an input VGPR. fossil-db (Sienna Cichlid): Totals from 226 (0.18% of 128647) affected shaders: CodeSize: 1200476 -> 1195696 (-0.40%); split: -0.49%, +0.09% Instrs: 223817 -> 223328 (-0.22%); split: -0.29%, +0.07% Latency: 2552394 -> 2549606 (-0.11%); split: -0.15%, +0.04% InvThroughput: 533989 -> 532670 (-0.25%); split: -0.27%, +0.02% VClause: 5191 -> 5188 (-0.06%) SClause: 7637 -> 7636 (-0.01%) Copies: 18165 -> 18182 (+0.09%); split: -0.22%, +0.31% Branches: 10446 -> 10442 (-0.04%) PreSGPRs: 8049 -> 8041 (-0.10%); split: -0.17%, +0.07% PreVGPRs: 7785 -> 7767 (-0.23%); split: -0.32%, +0.09% Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel-schuermann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13757>	2021-11-12 18:59:51 +00:00
Rhys Perry	719b48f85d	nir/lower_system_values: replace local_invocation_id components with zero fossil-db (Sienna Cichlid): Totals from 360 (0.28% of 128647) affected shaders: VGPRs: 7912 -> 7272 (-8.09%); split: -8.59%, +0.51% CodeSize: 542456 -> 544688 (+0.41%); split: -0.32%, +0.73% MaxWaves: 10866 -> 10952 (+0.79%) Instrs: 95973 -> 96010 (+0.04%); split: -0.34%, +0.38% Latency: 4366023 -> 4344664 (-0.49%); split: -0.90%, +0.41% InvThroughput: 19656659 -> 18297185 (-6.92%); split: -6.92%, +0.00% VClause: 3242 -> 3116 (-3.89%); split: -4.04%, +0.15% SClause: 3422 -> 3504 (+2.40%); split: -0.20%, +2.60% Copies: 8854 -> 9376 (+5.90%); split: -0.89%, +6.79% Branches: 2329 -> 2326 (-0.13%); split: -0.39%, +0.26% PreSGPRs: 7620 -> 7841 (+2.90%); split: -0.43%, +3.33% PreVGPRs: 5765 -> 5504 (-4.53%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel-schuermann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13757>	2021-11-12 18:59:51 +00:00
Connor Abbott	a9b4a507fe	tu: Expose Vulkan 1.2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13756>	2021-11-12 18:14:34 +00:00
Connor Abbott	c6216c941c	tu: Add VK_KHR_buffer_device_address stubs dEQP-VK.api.version_check.entry_points requires us to return a function pointer, even though the feature is optional in Vulkan 1.2. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13756>	2021-11-12 18:14:34 +00:00
Connor Abbott	952ab4f64f	tu: Enable subgroupBroadcastDynamicId It's a Vulkan 1.2 only feature, but it's trivially supported. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13756>	2021-11-12 18:14:34 +00:00
Ilia Mirkin	170e1aa647	freedreno/a[345]xx: add R8/RG8 SRGB formats These enable the GL_EXT_texture_sRGB_R8 / GL_EXT_texture_sRGB_RG8 extensions. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13765>	2021-11-12 17:22:02 +00:00
Ilia Mirkin	8db29109be	freedreno: prefer float immediates when float values are involved Using double immediates can cause a natively-float value to have to get upgraded to a double unnecessarily. Use float immediates where possible. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13764>	2021-11-12 16:48:49 +00:00
Alyssa Rosenzweig	4e83584092	pan/mdg: Remove duplicate compiler option Noted by clang. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	a4d3a29647	pan/bi: Enable dual texture fusing pass Everything is in place for it now -- ship it! Our Bifrost cycle model is coarse, so take the shader-db results with a massive spoonful of salt: total instructions in shared programs: 107504 -> 107252 (-0.23%) instructions in affected programs: 39692 -> 39440 (-0.63%) helped: 191 HURT: 1 helped stats (abs) min: 1.0 max: 20.0 x̄: 1.32 x̃: 1 helped stats (rel) min: 0.11% max: 9.52% x̄: 1.21% x̃: 0.98% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 2.04% max: 2.04% x̄: 2.04% x̃: 2.04% 95% mean confidence interval for instructions value: -1.60 -1.02 95% mean confidence interval for instructions %-change: -1.37% -1.01% Instructions are helped. total tuples in shared programs: 89864 -> 89664 (-0.22%) tuples in affected programs: 27787 -> 27587 (-0.72%) helped: 146 HURT: 6 helped stats (abs) min: 1.0 max: 17.0 x̄: 1.54 x̃: 1 helped stats (rel) min: 0.14% max: 15.38% x̄: 1.83% x̃: 1.25% HURT stats (abs) min: 1.0 max: 11.0 x̄: 4.17 x̃: 2 HURT stats (rel) min: 0.54% max: 3.55% x̄: 1.29% x̃: 0.87% 95% mean confidence interval for tuples value: -1.64 -0.99 95% mean confidence interval for tuples %-change: -2.06% -1.36% Tuples are helped. total clauses in shared programs: 18253 -> 18044 (-1.15%) clauses in affected programs: 5127 -> 4918 (-4.08%) helped: 164 HURT: 1 helped stats (abs) min: 1.0 max: 19.0 x̄: 1.28 x̃: 1 helped stats (rel) min: 0.78% max: 28.57% x̄: 6.73% x̃: 5.88% HURT stats (abs) min: 1.0 max: 1.0 x̄: 1.00 x̃: 1 HURT stats (rel) min: 2.94% max: 2.94% x̄: 2.94% x̃: 2.94% 95% mean confidence interval for clauses value: -1.50 -1.04 95% mean confidence interval for clauses %-change: -7.42% -5.91% Clauses are helped. total cycles in shared programs: 8118.54 -> 8103.88 (-0.18%) cycles in affected programs: 414.96 -> 400.29 (-3.53%) helped: 43 HURT: 27 helped stats (abs) min: 0.041665999999999315 max: 4.375 x̄: 0.41 x̃: 0 helped stats (rel) min: 0.24% max: 50.00% x̄: 11.49% x̃: 5.26% HURT stats (abs) min: 0.041665999999999315 max: 1.1666639999999973 x̄: 0.10 x̃: 0 HURT stats (rel) min: 0.43% max: 4.71% x̄: 1.42% x̃: 1.28% 95% mean confidence interval for cycles value: -0.35 -0.07 95% mean confidence interval for cycles %-change: -9.50% -3.53% Cycles are helped. total arith in shared programs: 3375.67 -> 3376.42 (0.02%) arith in affected programs: 345.29 -> 346.04 (0.22%) helped: 24 HURT: 32 helped stats (abs) min: 0.041665999999999315 max: 0.5833329999999997 x̄: 0.09 x̃: 0 helped stats (rel) min: 0.24% max: 14.29% x̄: 2.82% x̃: 1.50% HURT stats (abs) min: 0.041665999999999315 max: 1.1666639999999973 x̄: 0.09 x̃: 0 HURT stats (rel) min: 0.43% max: 4.71% x̄: 1.42% x̃: 1.28% 95% mean confidence interval for arith value: -0.04 0.07 95% mean confidence interval for arith %-change: -1.19% 0.39% Inconclusive result (value mean confidence interval includes 0). total texture in shared programs: 1275 -> 1157 (-9.25%) texture in affected programs: 725.50 -> 607.50 (-16.26%) helped: 192 HURT: 0 helped stats (abs) min: 0.5 max: 10.0 x̄: 0.61 x̃: 0 helped stats (rel) min: 2.86% max: 50.00% x̄: 25.20% x̃: 25.00% 95% mean confidence interval for texture value: -0.72 -0.51 95% mean confidence interval for texture %-change: -27.12% -23.27% Texture are helped. total vary in shared programs: 537.88 -> 536.12 (-0.33%) vary in affected programs: 2.75 -> 1 (-63.64%) helped: 1 HURT: 0 total quadwords in shared programs: 79762 -> 79681 (-0.10%) quadwords in affected programs: 10261 -> 10180 (-0.79%) helped: 59 HURT: 18 helped stats (abs) min: 1.0 max: 14.0 x̄: 1.88 x̃: 1 helped stats (rel) min: 0.38% max: 8.20% x̄: 1.95% x̃: 1.43% HURT stats (abs) min: 1.0 max: 4.0 x̄: 1.67 x̃: 1 HURT stats (rel) min: 0.46% max: 8.89% x̄: 2.22% x̃: 1.21% 95% mean confidence interval for quadwords value: -1.57 -0.53 95% mean confidence interval for quadwords %-change: -1.59% -0.37% Quadwords are helped. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	0c215813f7	pan/bi: Test dual texture fusing These patterns are quite tricky, so let's make sure we're testing adequately. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	9146bafbb4	pan/bi: Add dual texture fusing pass Bifrost supports a special "dual texture" instruction, sampling from two textures at once at the same coordinate. Each subinstruction is highly restricted (a subset of TEXS_2D); together, they are represented by TEXC with a special dual texture operation descriptor. Add an optimization pass to fuse these instructions. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	e6c6a1afb4	pan/bi: Fix up dual texturing registers This must be done after RA. How delightful. Use the GenXML strategy to just OR the birds. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	ce8d2b96c1	pan/bi: Add bi_dual_tex_as_u32 helper Type safe cast, making dual texture descriptors easier to manipulate. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	9b2a383af8	pan/bi: Support dual texture scheduling Teach the scheduler about dual texturing to avoid an artifical "must not last" constraint causing suboptimal scheduling like clause_1: ds(0) nbb tex ncph dwb(0) { NOP t0 +TEXC.skip t1, r0, r1, 0xf1e00144, @r4 NOP t0 +NOP t1 } Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	98c1b3e7e1	pan/bi: Use BIFROST_TEXTURE_OPERATION_SINGLE enum Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	9245b39ccf	pan/bi: Add bifrost_dual_texture_operation struct This is the other state of the texture operation descriptor. We must pack it in the compiler when fusing dual texturing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	3612880ea3	pan/bi: Add bifrost_texture_operation_mode enum Differentiates single/dual texturing. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	7dc90b68d9	pan/bi: Add second destination to TEXC Used to model dual texturing, which writes to separate sets of staging registers. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	8e02731624	pan/bi: Add secondary staging count Useful for instructions with two independent sets of staging registers (like dual source blending or dual texturing). Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	238f6d80a7	pan/bi: Make bi_index padding explicit Avoids reliance on UB. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Alyssa Rosenzweig	b0022f1c6b	pan/bi: Fix typo in helper invocation analysis Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13723>	2021-11-12 16:30:02 +00:00
Ilia Mirkin	a315d04308	mesa: add just a tiny bit of debug info to some _mesa_problem calls I hit these on a4xx with dEQP-GLES2/3 runs. Having more info on them would make everyone's life easier. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13766>	2021-11-12 15:48:37 +00:00
Ilia Mirkin	269b4dec9e	nv50,nvc0: expose R8/RG8_SRGB formats for texturing This enables the GL_EXT_texture_sRGB_R8/RG8 extensions. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13769>	2021-11-12 15:34:45 +00:00
Hyunjun Ko	ddb3d30d47	turnip: Enable VK_KHR_separate_depth_stencil_layouts We now start handling depth/stencil layouts separately when adding implicit subpass dependancies. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13057>	2021-11-12 13:16:23 +00:00
Alyssa Rosenzweig	e257344a82	nir/lower_pntc_ytransform: Support PointCoordIsSysval Pattern match the point coord sysval and support lowering it as well. This is required to handle flipped framebuffers on Bifrost. However, what this pass normalizes to is the opposite of the hardware mode we used on Bifrost before, so we need to swap modes at the same time to prevent regressions. Fixes Piglit glsl-fs-pointcoord and glsl-fs-pointcoord_gles2 Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13073>	2021-11-12 12:34:14 +00:00
Ilia Mirkin	43322ceccd	mesa: add missing state to state string computation This is an internal state, so does not need to be made available in the string itself (same as the wpos_y_transform). But it needs to be listed to avoid the mesa_problem. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13073>	2021-11-12 12:34:14 +00:00
Iago Toral Quiroga	7490bcad37	v3dv: don't use a global constant for default pipeline dynamic state Some of these may change across V3D versions, so it is not practical. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13775>	2021-11-12 11:04:07 +00:00
Iago Toral Quiroga	4b3931ee6c	v3dv: account for multisampling when computing subpass granularity The granularity is defined by the tile size, which is also determined by multisampling. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13775>	2021-11-12 11:04:07 +00:00
Iago Toral Quiroga	0cb58f80d2	v3d: use V3D_MAX_DRAW_BUFFERS instead of hardcoded constant Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13775>	2021-11-12 11:04:07 +00:00
James Park	fc106d86a5	meson: Update libelf wrap for Windows Newer libelf update supports 32-bit Windows. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13749>	2021-11-12 09:46:10 +00:00
James Park	0aaaee09a4	radv: Match function definitions to declarations Fixes compiler errors for 32-bit Windows. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13749>	2021-11-12 09:46:10 +00:00
James Park	195a379a7e	ac: Align ADDR_FASTCALL with addrlib Fixes linker errors for 32-bit Windows. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13749>	2021-11-12 09:46:10 +00:00
Qiang Yu	aa2f5cd1a3	driconf: support META application Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13686>	2021-11-12 09:01:58 +00:00
Qiang Yu	3900551894	radeonsi: add radeonsi_force_use_fma32 driconf option fma32 only round once so has 0.5UP accuracy. mad32 round twice so has 1UP accuracy. This accuracy difference sometimes make the result different at the last bit. Applications like META need more accuracy for display right result. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13686>	2021-11-12 09:01:58 +00:00
Christian Gmeiner	a0634a3c85	ci/bare-metal: switch to common .baremetal-test-arm64 Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13751>	2021-11-12 08:22:29 +00:00
Christian Gmeiner	d39904ea30	ci/bare-metal: add .baremetal-test-arm64 Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13751>	2021-11-12 08:22:29 +00:00
Christian Gmeiner	92fcdfd38b	ci/etnaviv: no need to force nir anymore It is the new default. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13751>	2021-11-12 08:22:29 +00:00
Christian Gmeiner	3d1c1137e5	ci/etnaviv: armhf: switch to .baremetal-test-armhf Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13751>	2021-11-12 08:22:29 +00:00
Christian Gmeiner	8bc284fe5b	ci/bare-metal: armhf: move BM_ROOTFS to generic place Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13751>	2021-11-12 08:22:29 +00:00
Mike Blumenkrantz	34c5ba8850	aux/primconvert: support pipe_context::draw_vertex_state Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13742>	2021-11-12 02:43:14 +00:00
Mike Blumenkrantz	e1948c9a71	aux/primconvert: break out primconvert internals into util function this should (ideally) be no functional changes Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13742>	2021-11-12 02:43:14 +00:00
Ilia Mirkin	d903eb156a	freedreno/a4xx: fix min/max/bias lod sampler settings This makes a4xx look more like a3xx for these settings. Most importantly it adds the workaround for allowing the hw to decide between min and mag filtering. This fixes a number of dEQP texture filtering tests. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13763>	2021-11-12 01:12:35 +00:00
Ilia Mirkin	4ffcef821c	freedreno/ir3: fix setting the max tf vertex when there are no outputs Fixes dEQP-GLES3.functional.transform_feedback.* on a4xx. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13760>	2021-11-11 23:49:19 +00:00
Ilia Mirkin	c0de7ea0ab	freedreno: check batch size after the fallback blitter clear When force-flushing after every draw, this would otherwise hit a NULL batch in fd_blitter_clear. Tested on a4xx. Suggested-by: Rob Clark <robdclark@chromium.org> Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13761>	2021-11-11 23:26:00 +00:00
Lionel Landwerlin	d2ff2b9e4a	anv: fix multiple wait/signal on same binary semaphore We need to guarantee that when vkQueueSubmit() returns the application can actually wait on a signaled semaphore/syncobj. When using a thread to do the submission to i915, this gets a bit tricky in the following case : A syncobj is used both as a wait & signal semaphore and has been signaled once already. It contains a fence before entering vkQueueSubmit(). This means we need to reset the syncobj to ensure when we return from vkQueueSubmit(), the syncobj contains no stale fence. Currently in the Anv, the submission thread is in charge of putting the new fence in the syncobj and also picks up the wait fence directly from the syncobj. This means we can't reset the syncobj from vkQueueSubmit(). The solution to this has been pointed by Bas & Jason : In vkQueueSubmit(), clone the wait syncobj fence into a new temporary syncobj that will be destroy after submission and use this temporary syncobj as a wait fence for i915. This allows us to reset the original syncobj in vkQueueSubmit(). For this to work with wait_before_signal behavior, we also need to do a wait-on-materialize on binary semaphores from vkQueueSubmit(). Otherwise the application thread calling vkQueueSubmit() could race the submission thread and pick up the wrong fence when cloing. v2: Use copy semantic for clone_syncobj_dma_fence() (Jason) Do the cloning prior to adding the syncobj to anv_queue_submit so that if the cloning fails don't have an invalid syncobj in anv_queue_submit (Jason) v3: Fix another syncobj leak (Jason) v4: Fix invalid argument order (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4945 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11474>	2021-11-11 20:59:32 +00:00
Caio Oliveira	a4eecc543e	gtest: Fix output of array ASSERT/EXPECT macros Fixes: `015383d1d7` ("gtest: Add mesa-gtest-extras.h with array ASSERT/EXPECT macros") Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13748>	2021-11-11 09:53:09 -08:00
Danylo Piliaiev	deb23612f7	vulkan/util: Handle depth-only formats in vk_att_ref_stencil_layout From VUID-VkAttachmentReference2-attachment-04755: "If attachment is not VK_ATTACHMENT_UNUSED, and the format of the referenced attachment is a depth/stencil format which includes both depth and stencil aspects, and layout is VK_IMAGE_LAYOUT_DEPTH_ATTACHMENT_OPTIMAL or VK_IMAGE_LAYOUT_DEPTH_READ_ONLY_OPTIMAL, the pNext chain must include a VkAttachmentReferenceStencilLayout structure." We did not check that there even could be a stencil layout before fetching it. Fixes a few tests from: dEQP-VK.image.depth_stencil_descriptor.depth_read_only_optimal.* Fixes: `979ea394e5` "vulkan/util: Move helper functions for depth/stencil images to vk_iamge" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13740>	2021-11-11 17:17:16 +00:00
Emma Anholt	a68a0c9e1c	mesa/st: Disable NV_copy_depth_to_color on non-doubles-capable HW. The previous doubles check (https://gitlab.freedesktop.org/mesa/mesa/-/issues/3459) checked that you didn't have full doubles emulation turned on, but we also need to check that you have doubles at all (emulated or not) or non-GL4 drivers will fail. Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13743>	2021-11-11 16:38:58 +00:00
Alejandro Piñeiro	3f3820a3a5	v3d: remove static v3d_start_binning v3dx(start_binning) is just a call to that method, so let's just use it directly. Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13754>	2021-11-11 14:04:22 +01:00
Alejandro Piñeiro	2a65db2458	v3d: remove unused include Reviewed-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13754>	2021-11-11 14:04:16 +01:00
Andreas Baierl	ee41e1bbd2	lima: Fix drawing wide lines GLES2.0 spec allows parts of wide lines and points to be drawn even if their center is outside the viewport. Therefore 0x2000 in PLBU_CMD_PRIMITIVE_SETUP has to be set for points. This is already our default setting as it seems to have no negative effect when this bit is always set. Points work as expected but lines don't. It's hard to RE it, because the affected deqp tests also fail with the blob. To respect this behaviour for lines and solve another 2 tests, we need to do a workaround and temporarily extend the viewport by half of the line width. The scissor rectangle is still equal with the initial viewport. Reviewed-by: Vasily Khoruzhick <anarsoul@gmail.com> Signed-off-by: Andreas Baierl <ichgeh@imkreisrum.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12971>	2021-11-11 11:25:58 +00:00
Samuel Pitoiset	3e7bac80ce	ac/rgp: add support for dumping SPM data Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13704>	2021-11-11 10:05:49 +00:00
Samuel Pitoiset	e928f475cc	ac: add initial SPM support SPM is hardware feature that allows us to dump performance counters at a sampling interval to a buffer. It is used by RGP to report cache counters. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13704>	2021-11-11 10:05:49 +00:00
Neil Roberts	bdaf185889	v3d: Update prim_counts when prims generated query in flight without TF In order to implement GL_PRIMITIVES_GENERATED, v3d allocates a small resource and adds a command to the job to store the prim counts to it. However it was only doing this when TF was enabled which meant that if the query was used with a geometry shader but no TF then the query would always be zero. This patch makes the driver keep track of how many PRIMITIVES_GENERATED queries are in flight and then enable writing the prim count if its more than zero. Fix dEQP-GLES31.functional.geometry_shading.query.primitives_generated_* v2: Update CI expectations and references to fixed tests in commit log. v3: - Add comment that GL_PRIMITIVES_GENERATED query is included because OES_geometry_shader, but it is not part of OpenGL ES 3.1. (Iago) - Update Fixes to commit introducing geometry shaders. (Iago) Fixes: `a1b7c084` ("v3d: fix primitive queries for geometry shaders") Signed-off-by: Neil Roberts <nroberts@igalia.com> Signed-off-by: Jose Maria Casanova Crespo <jmcasanova@igalia.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13712>	2021-11-11 08:02:04 +00:00
Vinson Lee	4a38ed822a	virgl: Allocate qdws after virgl_init_context to avoid leak. Fix defect reported by Coverity Scan. Resource leak (RESOURCE_LEAK) leaked_storage: Variable qdws going out of scope leaks the storage it points to. Fixes: `9a7d6a110e` ("virgl/drm: explicit context initialization") Suggested-by: Gert Wollny <gert.wollny@collabora.com> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Gert Wollny <gert.wollny@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13609>	2021-11-11 01:51:35 +00:00
Vinson Lee	1ca90f8752	microsoft/spirv_to_dxil: Fix non-Windows build. ../src/microsoft/spirv_to_dxil/dxil_validation.cpp: In function ‘bool validate_dxil(dxil_spirv_object*)’: ../src/microsoft/spirv_to_dxil/dxil_validation.cpp:129:12: error: ‘stderr’ was not declared in this scope 129 \| fprintf(stderr, "DXIL validation only available in Windows.\n"); \| ^~~~~~ Fixes: `37c366e283` ("microsoft/spirv_to_dxil: Add DXIL validation to spirv2dxil") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13736>	2021-11-11 01:37:16 +00:00
Emma Anholt	07aaef5721	freedreno/a6xx: Inline remaining fd6_tex_const_0() call. Less indirection and fixups for figuring out what's going on. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	7230058e8a	freedreno/a6xx: Drop an unused tile_mode arg. I added this in `ebaeddcbb3` ("freedreno/a6xx: Rewrite the format table format/swap helpers.") but it had already become unused through some bugfixing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	a9057d45a4	freedreno/a6xx: Clean up sysmem fb read patching using fd6_view. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	c90220e449	freedreno/a6xx: Use fd6_view for non-buffer image descriptors, too. This deletes a whole lot of code, but there's a modest drawoverhead perf loss: drawoverhead 1-image change -6.48856% +/- 4.28269% (n=50) drawoverhead 8-image change -5.29195% +/- 2.62549% (n=90) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	533e486923	freedreno/a6xx: Switch to relying on fd6_view for our texture descriptors. Having checked the deltas between fdl6_view and what we did before, switch over to fdl6_view now. No statistically significant difference on no-hw drawoverhead 8-texture change (n=50) with the texture cache disabled from this and the previous commit. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	84377785a4	freedreno/a6xx: Create a fd6_view at sampler view update time. The goal is to share the same code as turnip for descriptor setup. This just calls it and cross-checks. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	5b3a6ff9f7	freedreno: Set layer_first on (2D) resource imports. Prevents getting a weird layer stride if you ask for it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	94e4cd4d83	freedreno/fdl6: Skip redundant setting of TILE_ALL for NV12. We already respect the tile_all flag above, and it should be set in tu. Fixes a mismatch between fdl6_view_init() and gallium. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	2e6810a06a	util/format: Add G8_B8R8_420_UNORM to match Vulkan. turnip was playing fast and loose with the name, using the R8_G8B8 format name but actually setting the descriptors up to read G8_B8R8 like Vulkan (sensibly) wants. This caused trouble when trying to make freedreno and turnip share code. By having both orderings as format names, we can share the descriptor code and also confuse readers less. Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Emma Anholt	271b6cb981	util: Rename PIPE_FORMAT_G8_B8_R8_420_UNORM. The only user, turnip, was actually treating it as this layout, matching vulkan's specification of how the planes map to RGB values. (Y=G means that Cb=B and Cr=R). Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13443>	2021-11-11 00:10:57 +00:00
Caio Oliveira	6eb86efe91	util/ra: Fix deserialization of register sets Set ra_class::regset and ra_class::index when deserializing. Fixes: `95d41a3525` ("ra: Use struct ra_class in the public API.") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13728>	2021-11-10 22:57:57 +00:00
Caio Oliveira	ac416ce07b	util/ra: Add simple test for register set serialization Marked as DISABLED since it currently fails. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13728>	2021-11-10 22:57:57 +00:00
Eric Engestrom	37b51d5140	docs: update calendar for 21.3.0-rc5 Add another release candidate as we're not ready for 21.3.0 final just yet. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13744>	2021-11-10 22:13:56 +00:00
Emma Anholt	8f5a0bd9b4	ci/bare-metal: Close serial and join serial threads before exit. This should fix the intermittent (~1/week) cheza failure where python complains that a thread tried to do stdio while the main thread has exited. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13462>	2021-11-10 20:36:57 +00:00
Emma Anholt	4cdbe3f2b9	ci/etnaviv: Add more texturing flakes. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13462>	2021-11-10 20:36:57 +00:00
Emma Anholt	13e23b54ba	ci/etnaviv: Mark the rest of uniform_api.random as flaky. Another 3 new flakes happened on my last run. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13462>	2021-11-10 20:36:57 +00:00
Jesse Natalie	5451bb03c7	mesa/main: Fix use of alloca() without #include "c99_alloca.h" Fixes: `c216f193` ("mesa: use alloca in search_resource_hash") Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13735>	2021-11-10 19:07:38 +00:00
Emma Anholt	549924d53e	freedreno: Fix constant-index assumptions in IBO loads. The encoder already sets up our IBO accesses as potentially nonuniform, so we just need to be careful to not try to force the IBO index into an immediate. Fixes assertion failures in piglit arb_shader_image_load_store-invalid (intermittent due to https://gitlab.freedesktop.org/mesa/piglit/-/merge_requests/597), which had some interesting actual failures hidden behind it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13601>	2021-11-10 17:48:59 +00:00
Emma Anholt	9e04f97d8e	freedreno: Fix the uniform/nonuniform handling for cat5 bindful modes. We can see from the dynamically_uniform (compiler doesn't know if you're uniform or not) vs uniform (compiler can see it's uniform) case in the blob which is which. Now that we have the right names, also use the nonunif flag for encoding the actual non-uniform mode (previously, we were always setting it always in a way that meant uniform). I verified this behavior back to a418 with samplers. The a3xx blob I have only does GLES3, so we don't have the opaque_type_indexing tests to see. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13601>	2021-11-10 17:48:59 +00:00
Lionel Landwerlin	46c37c8600	anv: don't forget to add scratch buffer to BO list We reference the scratch BO using a bindless index in the command streamer instructions, but we forgot to add them to the BO list. v2: Make use of pipeline reloc list (Jason) v3: Don't add NULL BOs to the reloc list (Lionel) v4: Don't add BOs twice to reloc list when dealing with addresses (Lionel) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `eeeea5cb87` ("anv: Add support for scratch on XeHP") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13544>	2021-11-10 17:22:15 +00:00
James Park	e0de7aa4d7	aco: Work around MSVC restrict in c99_compat.h Future LLVM header leads to __declspec(__restrict), which is invalid. Just undefine the restrict macro to keep __declspec(restrict). Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13505>	2021-11-10 16:37:51 +00:00
Niklas Haas	a697dde553	wsi/x11: support depth 30 visuals There's only really one format that makes sense here, since there's no sRGB equivalent for 10-bit packed formats. It's possible that this results in flipped colors, if 10-bit visuals exist that have the channels in the opposite order. (They don't seem to, on my end) Perhaps a more principled solution would also compare the exact r/g/b bitmasks. But for now, this works. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/3537 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9450>	2021-11-10 15:53:11 +00:00
Vinson Lee	794a9cf3d6	vulkan/wsi: Unlock before return on error path. Fix defect reported by Coverity Scan. Missing unlock (LOCK) missing_unlock: Returning without unlocking wsi->wait_mutex. Fixes: `4885e63a6d` ("vulkan/wsi: implement missing wsi_register_device_event") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13698>	2021-11-10 13:22:24 +00:00
Iago Toral Quiroga	3a95e25e84	v3dv,v3d: don't store swizzle pointer in shader/pipeline keys We had been storing pointers to a driver owned swizzle table rather than storing the actual swizzle value in various shader and pipeline keys on both GL and Vulkan drivers. This doesn't look very robust, particularly since we also compute sha1 hashes from these values and we may store these hashes to disk (for the disk cache). Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13738>	2021-11-10 11:24:26 +00:00
James Park	183e705a15	vulkan, radv: Support backslash in ICD paths Vulkan loader wants backslash for paths on Windows. Need to jump through hoops because Meson does not support backslashes in commands. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13461>	2021-11-10 09:48:41 +00:00
Samuel Pitoiset	379fab74d2	radv/sqtt: fix GPU hangs when capturing from the compute queue S_008D20_FINISH_DONE is a mask of queues and 1 means "wait on the gfx queue until the value is not 0" which can never happen when the driver captures from compute. Instead, use the full mask of possible queues. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13694>	2021-11-10 08:24:54 +01:00
Mike Blumenkrantz	4dfb5818ed	zink: update gfx pipeline shader module pointer even if the program is unchanged this is used for pipeline comparisons, so it has to always be accurate cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13727>	2021-11-10 01:15:39 +00:00
Mike Blumenkrantz	bfa81c1e8c	zink: be more consistent about applying module hash for gfx pipeline this was a little spaghetti-ish: the module hash was sometimes being applied during module update, sometimes in draw during program create, and then also it was removed when a shader unbind would cause the program to no longer be reachable now things are more consistent: * keep removing module hash when program becomes unreachable * only apply module hash in draw during updates there cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13727>	2021-11-10 01:15:39 +00:00
Mike Blumenkrantz	937a841b57	zink: ci updates these don't spend forever in llvmpipe optimization passes anymore Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13727>	2021-11-10 01:15:39 +00:00
Mike Blumenkrantz	2ac23b4d58	zink: always inline uniforms when running on a cpu driver the overhead from creating new inlined shader variants is likely to be less than the time required to fully optimize and run those variants, so just inline 100% of the time to cut down shader runs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13727>	2021-11-10 01:15:39 +00:00
Mike Blumenkrantz	a8d90c8ed5	zink: implement cs uniform inlining this implements shader variants for compute Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13727>	2021-11-10 01:15:39 +00:00
Mike Blumenkrantz	06f2054cb5	zink: radv ci updates for 1dshadow stuff Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13583>	2021-11-09 23:59:04 +00:00
Mike Blumenkrantz	64e0ca15d6	zink: add 1DShadow sampler handling for drivers (radv) that don't support it some drivers won't create zs textures in any shape but 2D. this can be handled instead by using 2D textures and then performing shader rewrites to convert shadow samplers for 1D and 1DArray types to 2D/array Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13583>	2021-11-09 23:59:04 +00:00
Ryan Houdek	0ae1231879	util/xmlconfig: Allow DT_UNKNOWN files Some filesystems don't fill in d_type in dirent. Resulting in DT_UNKNOWN. Pass this entry through to the next step and use stat on the full filepath to determine if it is a file. sshfs is known to not fill d_type. This resolves an issue where driconf living on an sshfs path wasn't working. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13697>	2021-11-09 23:25:32 +00:00
Jason Ekstrand	e614789588	anv: Also disallow CCS_E for multi-LOD images Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/4616 Fixes: `e3101c96bb` ("anv/image: Disable multi-layer CCS_E on TGL+") Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13680>	2021-11-09 23:05:53 +00:00
Mike Blumenkrantz	62983f276b	zink: add another compiler pass to convert 64bit vertex attribs gallium always provides uint types, so rewrite the shader to load a 64bit attrib and then cast back to whatever it was before Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13566>	2021-11-09 21:51:06 +00:00
Mike Blumenkrantz	39bdb00d77	zink: simplify 64bit vertex attrib lowering this was a cool myfirstcompilerpass.exe but there's easier ways to do things like this Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13566>	2021-11-09 21:51:06 +00:00
Mike Blumenkrantz	854fd242fa	zink: declare int/float size caps inline with type usage this is much more accurate than trying to use shader info Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13566>	2021-11-09 21:51:05 +00:00
Bas Nieuwenhuizen	17aa2be4c9	ci: Add RADV to Android CI. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	aad80e47d8	util: Add support for clang::fallthrough. Looks like the __attribute__ version doesn't work for C++ in the Android build. Only found now because we don't enable -Wimplicit-fallthrough by default project wide for C++. Only ACO enables it. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	966c171d88	amd/addrlib: Ignore self-assign warnings. There was a preference to not fix addrlib to make syncing with upstream easier. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	22673a980f	meson: Check arguments before adding. -static-libstdc++ doesn't exist on the Android NDK, casuing all later has_argument calls to return false even though the compiler supports that argument. Fixes: `3aee462781` "meson: add windows compiler checks and libraries" Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	5db098c98b	aco: Remove useless sub-expr. ../src/amd/compiler/aco_instruction_selection.cpp:11915:83: error: expression result unused [-Werror,-Wunused-value] bld.vop2(aco_opcode::v_lshrrev_b32, fetch_index_def, div_info, instance_id).instr; Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	bbd091d1fa	radv: Always use linker script when possible. Also without LLVM. Useful for Android with statically linked libelf. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	66713d33fe	radv: Remove android build warning. Shadowed variable. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	7b06b6288c	amd/addrlib: Use alternative ADDR_C_ASSERT definition. Copied from mesa util/macros.h Avoids unused-local-typedef warnings. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	d14cc308f9	ci: Add libelf to the Android image. Needed for RADV. The mirror situation is kinda messy since the library is not maintained and the original website is offline. Put a mirror in that seemed to be used by some non-fdo CIs already, but if reliability is still a concern we can discuss more mirrors. There is an alternative implementation that is maintained in elfutils, but that doesn't build on Android: 1) Doesn't build with clang (resolved in git, so next release probably) 2) Needs argp_parse with is a glibc specific feature. There is a version of elfutils in AOSP but instead of fixing upstream they just made an Android.bp that avoids building most stuff, which isn't really usable here. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Bas Nieuwenhuizen	1b945a695a	ci: Bump libdrm for the android image. Seems I bumped the tag previously but not the script. Let us do better this time. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13164>	2021-11-09 20:51:14 +00:00
Jesse Natalie	fde36d7992	d3d12: Don't wait for GPU reads to do CPU reads Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13669>	2021-11-09 18:31:19 +00:00
Jesse Natalie	8ea1e58f0e	d3d12: Don't wait for all batches when synchronizing a resource Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13669>	2021-11-09 18:31:19 +00:00
Samuel Pitoiset	5bb72ff750	zink: update the CI lists for RADV Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13726>	2021-11-09 16:41:13 +00:00
Hyunjun Ko	979ea394e5	vulkan/util: Move helper functions for depth/stencil images to vk_iamge Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12785>	2021-11-09 16:12:04 +00:00
Hyunjun Ko	2a0253b9b5	radv: Fix to honor the spec to get stencil layout. Fixes: `3ef89b245e` ("radv: fix separate depth/stencil layout in render pass") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12785>	2021-11-09 16:12:04 +00:00
Hyunjun Ko	00bea38242	anv: Fix to honor the spec to get stencil layout. Fixes: `28207669d0` ("anv: Fix stencil layout in render passes") Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12785>	2021-11-09 16:12:04 +00:00
Samuel Pitoiset	1f36f6b83f	radv/winsys: use same IBs padding as the kernel Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13703>	2021-11-09 11:05:32 +00:00
Samuel Pitoiset	1ee85e8bab	ac/rgp: add support for clock calibration Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13709>	2021-11-09 11:20:12 +01:00
Samuel Pitoiset	aebf04ab3f	ac/rgp: add support for queue event timings Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13709>	2021-11-09 11:20:10 +01:00
Samuel Pitoiset	e04101c34e	radv: only emit PGM_LO for the vertex prolog Shaders are allocated in the 32-bit address space. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13551>	2021-11-09 10:13:38 +00:00
Samuel Pitoiset	824ce4ef40	ac/rgp: fix alignment of code object records to follow the RGP spec Should be aligned to 4 bytes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13711>	2021-11-09 09:42:20 +00:00
Samuel Pitoiset	ca7c748f45	radv: do not expose buffer features for depth/stencil formats The Vulkan spec got clarified recently and it's invalid (hw can support it though). Fixes new CTS dEQP-VK.api.buffer.invalid_buffer_features.*. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13701>	2021-11-09 09:13:38 +00:00
Samuel Pitoiset	891e6f009b	radv/sqtt: stop calling radv_cs_add_buffer() for the thread trace BO It's resident, so global. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13717>	2021-11-09 08:22:11 +00:00
Samuel Pitoiset	ed70230df6	radv/sqtt: reserve a VMID for better profiling To avoid capturing other processes work. PAL always requests a VMID when capturing with SQTT too. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5051 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13695>	2021-11-09 08:35:36 +01:00
Dave Airlie	995f38838f	meson: allow building with vulkan beta extensions enabled. This is just a precursor to anyone enabling beta stuff later. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13685>	2021-11-09 04:33:06 +00:00
Dave Airlie	40157bc2b0	vulkan: add new image types undef beta define to switch statements. This fixes some warnings when building with beta extensions enabled. Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13685>	2021-11-09 04:33:06 +00:00
Dave Airlie	3e9e186ca1	vulkan/include: import the video codec headers. I'd like to allow mesa builds with beta headers enabled, this requires importing these. v2: add video headers to khronos update Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13685>	2021-11-09 04:33:06 +00:00
Jesse Natalie	df92a13a27	util/libsync: Fix timeout handling if poll() wakes up early Check how long poll waited, and use a smaller timeout on the next iteration through the loop. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12268>	2021-11-09 04:05:55 +00:00
Jesse Natalie	1ab906d17f	d3d12: Handle non-infinite wait timeouts > 49.7 days as infinite Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12268>	2021-11-09 04:05:55 +00:00
Jesse Natalie	accd8326c5	d3d12: Fix Linux fence wait return value zero is for success, nonzero is failure. Fixes: `0b60d6a2` ("d3d12: Support Linux eventfds for fences") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12268>	2021-11-09 04:05:55 +00:00
Hyunjun Ko	5d0712b185	turnip: expose VK_KHR_driver_properties Now that we have a conformance version to advertise, we can expose the extension. Signed-off-by: Hyunjun Ko <zzoon@igalia.com> Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6251>	2021-11-09 03:43:54 +00:00
Emma Anholt	1e850f23b1	turnip: Claim 1.2.7.1 CTS conformance. I submitted a conformance package for A618 today, so let's stop doing all this warning about non-conformance. Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6251>	2021-11-09 03:43:54 +00:00
Jason Ekstrand	8817377ce2	anv: Add an anv_bo_is_pinned helper If we ever want to stop depending on the EXEC_OBJECT_PINNED to detect when something is pinned (like for VM_BIND), having a helper will reduce the code churn. This also gives us the opportunity to make it compile away to true/false when we can figure it out just based on compile-time GFX_VERx10. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	2a75855049	anv: Stop checking for HAS_EXEC_FENCE Starting with `3b363d5b55` ("anv: Assume syncobj support"), we assume syncobj support and no longer use the execbuf sync_file API directly so there's no point in checking for it. For the one physical device check this deletes, we can assume has_exec_fence is always true because every kernel with syncobj support also has sync_file. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	2eb9057de0	anv: Add a use_relocations physical device bit This is more the bit of information we want to be carrying around long-term. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	cbcb10320f	anv: Add a anv_use_relocations helper and use it Soft-pin is but one possible mechanism for pinning buffers. We're working on another called VM_BIND. Most of the time, the real question we're asking isn't "are we using soft-pin?" but rather "are we using relocations?" because it's relocations, and not soft-pin, that cause us all the extra pain we have to write code to handle. This commit flips the majority of those checks around. The new helper is currently just the exact inverse of the old use_softpin helper. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	c15a8a43d9	anv: Int64 atomics don't need to depend on softpin Ever since `04ccfeae98` ("anv: Require softpin on Gen8+"), softpin has been a hard requirement on BDW+ so there's no reason for SKL+ code to have a relocation path. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	1936ceec58	anv: Always set bindless surface base on SKL+ Ever since `04ccfeae98` ("anv: Require softpin on Gen8+"), softpin has been a hard requirement on BDW+ so there's no reason for SKL+ code to have a relocation path. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	13fe43714c	anv: Add helpers in anv_allocator for mapping BOs Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	90ac06e502	anv: Fix FlushMappedMemoryRanges for odd mmap offsets When the client calls vkMapMemory(), we have to align the requested offset down to the nearest page or else the map will fail. On platforms where we have DRM_IOCTL_I915_GEM_MMAP_OFFSET, we always map the whole buffer. In either case, the original map may start before the requested offset and we need to take that into account when we clflush. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	f9b69e43a5	anv: Add a couple more checks in MapMemory Reviwed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	0967584549	anv: Add get/set_tiling helpers These are only required WSI cases and Android but still better to have them in a central place. Reviwed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	135cac5c9c	anv: Rename anv_bo::index to exec_obj_index This is more descriptive of its very specific purpose. Reviwed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	1394e415b7	anv/allocator: Use anv_device_release_bo in anv_block_pool_finish This is left-over from the days where we didn't use BO pointers and had them embedded and we didn't have a nice allocation API. Now that block pools get their memory via anv_device_alloc_bo(), they really should be using anv_device_release_bo() to tear it down. This is equivalent in this case because anv_device_release_bo() does the following: 1. Decrements refcount. The pool holds the only reference so it will proceed onto the clean-up steps 2. Unmaps it, if needed. This is the same as the tear-down code today except anv_device_release_bo() will avoid doing an unmap if it's been created via userptr. The current code probably unmaps the userptr which is wrong but pretty harmless since it happens on a tear-down path. 3. Unmaps the CCS range from the AUX-TT, if any. There is none, so this is a no-op. 4. Frees the VA range if it's not pinned and doesn't have a fixed VA assignment. These BOs always either have a fixed VA range (softpin) or aren't pinned (relocations). 5. Closes the GEM handle. Same as the current code. In short, anything created using the BO API should probably be destroyed that way. We were getting away with hand-rolling it because this is a simple case. Why did we do it this way? It dates back to before the more formlized BO cache and BO API in ANV. Reviewed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	0b2b9b49af	anv: Pull aperture size from devinfo Reviwed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Jason Ekstrand	2254146d0a	anv/allocator: Add a couple of helpers The anv_bo_finish helper is explicitly supposed to be capable of tearing down partially completed BOs as long as the flag and metadata parameters are set correctly. Reviwed-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13610>	2021-11-09 02:48:24 +00:00
Enrico Galli	13ee05f2b4	ci/windows: Add validation tests for spriv_to_dxil Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13655>	2021-11-09 01:32:47 +00:00
Enrico Galli	37c366e283	microsoft/spirv_to_dxil: Add DXIL validation to spirv2dxil Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13655>	2021-11-09 01:32:47 +00:00
Jesse Natalie	e7502c5404	d3d12: Fully init primconvert config Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	c151e9d087	d3d12: Hook up threaded context Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	2c90fa19a8	d3d12: Pass explicit context to pre/post draw surface blits Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	cd41ed53b2	d3d12: Use thread safe slab allocators in transfer_map handling Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	17a46e2cf9	d3d12: Inherit from threaded_transfer Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	e9a1e1c21e	d3d12: Resources inherit from threaded_resource Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	a463aa0099	d3d12: Inherit from threaded_query Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Jesse Natalie	00016b4251	u_threaded_context: Support including from C++ Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olák <marek.olsak@amd.com> Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13670>	2021-11-09 00:44:52 +00:00
Connor Abbott	38f0b36f1a	ir3/spill: Initial implementation of rematerialization This only handles moves from immedates/constants. The next step would be to rematerialize ALU instructions whose sources are available. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13650>	2021-11-08 23:51:37 +00:00
Connor Abbott	db566904ba	ir3/spill: Mark root as non-spillable after inserting We have to mark the root as non-spillable in case the interval is the child of some other interval, but we can't know whether it's the child of some other interval until it's been inserted. Move the setting of cant_spill below the insertion. This prevents us from using a bogus parent value. Fixes: `613eaac7b5` ("ir3: Initial support for spilling non-shared registers") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13650>	2021-11-08 23:51:37 +00:00
Jordan Justen	7eb13fc2f2	anv,blorp,iris: Set MOCS for COMPUTE_WALKER post-sync operation We don't current enable post sync operations, but it is probably better to set them to "internal" MOCS than to remove the non-zero checking for this genxml field. Reworks: * Fix COMPUTE_WALKER in cmd_buffer_trace_rays (s-b Jason) Fixes: `7b78b2fcac` ("intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13624>	2021-11-08 23:29:51 +00:00
Jordan Justen	a298ad26c1	intel/genxml/125: Update COMPUTE_WALKER POSTSYNC_DATA struct Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13624>	2021-11-08 23:29:51 +00:00
Jason Ekstrand	419b02c90c	anv,iris: Advertise a max 3D workgroup size of 1024^3 On GFX version 12.5+ with COMPUTE_WALKER, this is the limit based on the size of the HW packet. On older HW, we can technically go a bit bigger but there's not much point. Technically, some hardware can support a scalar workgroup size up to 2048 but most apps don't go any bigger than 1024. As discussed on the merge request page, the current limit assumes SIMD32, but it is unclear if we want to encourage applications to use SIMD32 if it may lead to additional register spilling in shader programs. Many applications have likely tuned for a limit of 1024 based on the OpenGL minimum limit, so it might not gain much by advertising more than 1024. Reworks: * Jordan: Use MIN2 and limit total invocations as well. * Jordan: Add second paragraph to commit message based on merge request discussion. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13538>	2021-11-08 23:07:42 +00:00
Mike Blumenkrantz	8626949f07	zink: flatten out draw templates a bit having this be super granular was a neat idea, but really I don't care even a little bit about a driver that's weirdly implementing only dynamic vertex input or only dynamic state2 this massively cuts down the combinatorics and provides a more accurate gauge of driver feature levels, since this is the general level of support that they're likely to have Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13715>	2021-11-08 21:49:40 +00:00
Marek Olšák	3d80d6b696	radeonsi: enable nir_group_loads for better performance The best case I have is one viewperf subtest getting +9% performance. 56979 shaders in 34726 tests Totals: SGPRS: 2667522 -> 2669178 (0.06 %) VGPRS: 1543608 -> 1553472 (0.64 %) Spilled SGPRs: 4090 -> 4100 (0.24 %) Spilled VGPRs: 1600 -> 1791 (11.94 %) Private memory VGPRs: 256 -> 256 (0.00 %) Scratch size: 1872 -> 2076 (10.90 %) dwords per thread Code Size: 59443980 -> 59479804 (0.06 %) bytes Max Waves: 867280 -> 865634 (-0.19 %) Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> v2: No change in pixels but the hash changed. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13604>	2021-11-08 21:20:11 +00:00
Marek Olšák	33b4eb149e	nir: add new SSA instruction scheduler grouping loads into indirection groups Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13604>	2021-11-08 21:20:11 +00:00
Mike Blumenkrantz	acddf83c95	zink: update radv ci passes Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13716>	2021-11-08 20:02:26 +00:00
Gert Wollny	63c4c559cb	virgl: obtain supported number of shader sampler views from host Modern games may use more than 16 sampler views, so get what the host actually supports, and default to 16 on old hosts that don't pass the value. Since the possible maximal value of PIPE_MAX_SHADER_SAMPLER_VIEWS doesn't fit into an uint32_t remove the binding flags, they were only used for releasing the sampler views, and this can be achieved differently. v2: Fix compilation error Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: John Bates <jbates@chromium.org> (v1) Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13646>	2021-11-08 19:34:30 +00:00
Caio Oliveira	55e1dc8b64	pan/bi: Drop unused test helpers With the tests moved to use gtest, some helpers are not needed anymore. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:02 +00:00
Caio Oliveira	baa405f02a	pan/bi: Use gtest for test-constant-fold Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:02 +00:00
Caio Oliveira	4554cd03f0	pan/bi: Use gtest for test-optimizer Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:01 +00:00
Caio Oliveira	06b3b53768	pan/bi: Use gtest for test-pack-formats Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:01 +00:00
Caio Oliveira	71cbb9c4f0	pan/bi: Use gtest for test-packing Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:01 +00:00
Caio Oliveira	fecb299b81	pan/bi: Use gtest for test-scheduler-predicates Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:01 +00:00
Caio Oliveira	3a9210c93d	pan/bi: Make some headers compilable with C++ This will allow have tests that use gtest (which is C++) for. Note the reordering of designated initializers in structs is because C++ requires them to follow the order that they were defined in the struct. The table `bifrost_reg_ctrl_lut` is not used by the tests at the moment, so bailed out trying to replace the C syntax designated initializer for arrays (that doesn't have an equivalent in C++), and simply #ifdef it out when including from C++ code. Also, use not_mod instead of not. not is a keyword in C++ and can't be used. Luckily, the naming is not determined by the XML, so we can freely rename. [Alyssa: squash in not_mod] Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13684>	2021-11-08 19:02:01 +00:00
Eric Engestrom	bee2c9c081	meson: automatically define `HAVE_{some}_PLATFORM` Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13278>	2021-11-08 18:35:28 +00:00
Eric Engestrom	5cc9c30aef	meson: always define `HAVE_{X11,XCB}_PLATFORM` when it's enabled Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13278>	2021-11-08 18:35:28 +00:00
Eric Engestrom	448dd106da	meson: drop impossible `if no platform` branch We've already ensured a few lines above that there is at least `surfaceless` in the list. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13278>	2021-11-08 18:35:28 +00:00
Eric Engestrom	2351c0aded	meson: move `egl_native_platform` definition inside the `with_egl` block Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13278>	2021-11-08 18:35:28 +00:00
Eric Engestrom	9ad375bdcd	meson: drop duplicate addition of surfaceless & drm to the list of platforms This is already done on lines 475-480, resulting in them appearing twice in the summary. Fixes: `47946855f1` ("meson: allow egl_native_platform to be specified") Signed-off-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13278>	2021-11-08 18:35:28 +00:00
Eric Engestrom	09bb4dbe60	release-calendar: fix date for next 21.3 rc Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13713>	2021-11-08 18:30:50 +00:00
Samuel Pitoiset	984091531e	radv: remove unused parameter in radv_emit_subpass_barrier() It got introduced by "radv: optimize subpass barrier flushes for imageless framebuffers" but never used in the final version. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13702>	2021-11-08 18:10:49 +00:00
Caio Oliveira	75161e6f3d	util: Change blob_test to use macro from mesa-gtest-extras.h Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13672>	2021-11-08 17:42:18 +00:00
Caio Oliveira	015383d1d7	gtest: Add mesa-gtest-extras.h with array ASSERT/EXPECT macros There are no similar macros in gtest. They do recommend pulling another library (gmock) and use that, but having our own let us control the output more precisely. Extracted the code from a similar existing macro in blob_test.cpp, making changes to the output. A check like ASSERT_U8_ARRAY_EQUAL(expected_u8, result_u8, 32); that fails will output ``` Expected 32 values to be equal but found 1 that differ: expected_u8 values are: [000] 29 44 00 00 44 60 cb 80 93 65 07 c0 08 00 40 29 [016] 03 81 00 00 5c a0 21 87 b0 31 00 00 00 00 00 60 result_u8 values are: [000] 29 44 00 00 44 60 cb 80 93 65 07 c0 08 00 40 29 [016] 03 81 00 00 5c a0 21 87 af 31 00 00 00 00 00 60 ``` and a check like ASSERT_U64_ARRAY_EQUAL(expected_u64, result_u64, 4); ``` Expected 4 values to be equal but found 1 that differ: expected_u64 values are: [000] 80cb604400004429 29400008c0076593 [002] 8721a05c00008103 60000000000031b0 result_u64 values are: [000] 80cb604400004429 29400008c0076593 [002] 8721a05c00008103 60000000000031af ``` Note the asterisk indicating wrong values. The new header for extra macros is in src/gtest/include/ so it doesn't get in the way when updating the real gtest headers that are in src/gtest/include/gtest/. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13672>	2021-11-08 17:42:18 +00:00
Pierre-Eric Pelloux-Prayer	5358d8a110	radeonsi/sqtt: reserve a vmid when sqtt is enabled Based on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13695 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13696>	2021-11-08 17:16:11 +00:00
Pierre-Eric Pelloux-Prayer	e26dd92957	radeonsi/sqtt: fix FINISH_DONE / BUSY usage They're using more than a single bit so use the proper mask. Based on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13694 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13696>	2021-11-08 17:16:11 +00:00
Pierre-Eric Pelloux-Prayer	3de072aaec	radeonsi/sqtt: fix shader stage values shader_stages_mask and others expect MESA_SHADER_* based values, not PIPE_SHADER_*... Without this the fragment shader wouldn't appear in the "Pipelines" pane of RGP. Fixes: `c276bde34a` ("radeonsi/sqtt: export shader code to RGP") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13696>	2021-11-08 17:16:11 +00:00
Lionel Landwerlin	c572adceaa	intel/dev: also test crocus & i915 pci-ids Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12981>	2021-11-08 16:48:06 +00:00
Lionel Landwerlin	361b3fee3c	intel: move away from booleans to identify platforms v2: Drop changes around GFX_VERx10 == 75 (Luis) v3: Replace (GFX_VERx10 < 75 && devinfo->platform != INTEL_PLATFORM_BYT) by (devinfo->platform == INTEL_PLATFORM_IVB) Replace (devinfo->ver >= 5 \|\| devinfo->platform == INTEL_PLATFORM_G4X) by (devinfo->verx10 >= 45) Replace (devinfo->platform != INTEL_PLATFORM_G4X) by (devinfo->verx10 != 45) v4: Fix crocus typo v5: Rebase v6: Add GFX3, ILK & I965 platforms (Jordan) Move ifdef to code expressions (Jordan) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12981>	2021-11-08 16:48:06 +00:00
Lionel Landwerlin	3b1a5b8f2b	intel: remove 2 preproduction pci-id for ADLS Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `d399c3e861` ("intel/dev: Add device info for ADL-S") Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13476>	2021-11-08 16:09:55 +00:00
Filip Gawin	f32dcb6fe1	nir: assert that variables in optimize_atomic are initialized If you gonna view context of function parse_atomic_op, then you gonna know that index for array (data_src) can be unitialized. Imho this approach is cleaner than doing stuff inside parse_atomic_op. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12995>	2021-11-08 15:10:07 +00:00
Mike Blumenkrantz	fbd61d2b02	zink: set new point/line caps Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	8bbebc7e9f	st/mesa: use new point and line CAPs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	78337728d1	radeonsi: set correct point and line limits Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	cf9afc7b0c	gallium: add missing point and line CAPs The returned values are the same as the GL frontend. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	b80dca86c3	gallium: rename PIPE_CAPF_MAX_POINT_WIDTH -> MAX_POINT_SIZE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13676>	2021-11-08 14:37:49 +00:00
Marek Olšák	10ee261c38	driconf: disallow 10-bit pbuffers for viewperf2020/maya due to X errors Cc: 21.2 21.3 <mesa-stable@lists.freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13472>	2021-11-08 14:00:45 +00:00
Daniel Stone	f8ea4c9e1d	Revert "CI: Disable Windows jobs" They're fixed now, just in time. This reverts commit `7b44e7d7bb`. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13708>	2021-11-08 13:08:48 +00:00
Daniel Stone	7b44e7d7bb	CI: Disable Windows jobs The runner is broken, perhaps after an upgrade. Signed-off-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13707>	2021-11-08 12:29:14 +00:00
Mike Blumenkrantz	c4d904101c	aux/trace: add pipe_context::render_condition_mem Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13582>	2021-11-05 14:37:25 -04:00
Mike Blumenkrantz	f579401099	aux/trace: fix vertex state tracing Fixes: `e8cad57aa7` ("gallium/trace: add pipe_vertex_state support") Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13582>	2021-11-05 14:37:25 -04:00
Mike Blumenkrantz	810305fbed	aux/trace: trace pipe_screen::is_format_supported better storage_sample_count is important Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13582>	2021-11-05 14:37:25 -04:00
Mike Blumenkrantz	d2f3aba5f0	aux/trace: support pipe_context::get_query_result_resource Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13582>	2021-11-05 14:37:24 -04:00
Mike Blumenkrantz	58ba18474b	aux/trace: fix PIPE_QUERY_PIPELINE_STATISTICS_SINGLE tracing don't just crash, dump! Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13582>	2021-11-05 14:36:54 -04:00
Kostiantyn Lazukin	78b613db23	util/u_trace: Replace Flag with IntEnum to support python3.5 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5586 Fixes: `cefaa73909` Signed-off-by: Kostiantyn Lazukin <kostiantyn.lazukin@globallogic.com> Reviewed-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13643>	2021-11-05 11:54:30 +00:00
Lionel Landwerlin	349bfb7275	intel/devinfo: fix wrong offset computation A bit difficult to find what commit introduced the issue because of all the renaming, but it was my bug :) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Lionel Landwerlin	718c97d525	intel/devinfo: use compatible type for ARRAY_SIZE Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Lionel Landwerlin	67619d8153	intel/perf: fix perf equation subslice mask generation for gfx12+ v2: Fix comment change (Marcin) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Lionel Landwerlin	a543a94404	intel/dev: fix subslice/eu total computations with some fused configurations When a device has its first slice/subslice fused off, we can't use the number of slices/subslices to iterate the mask array. v2: Fix spelling (Marcin) Use size_t for iterator (Marcin) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reported-by: Matt Roper <matthew.d.roper@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5601 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Lionel Landwerlin	e10c641f00	intel/dev: reuse internal functions to set mask Rather than having 2 paths to set the slice/subslice/eu masks, reuse the other internal functions. This simplifies finding bugs within this code : * If we have i915 query topology support, update_from_topology() is called. * If we don't have query topology support but we have getparam for slice/subslice/EU, we generate a topology data and call update_from_topology() * If we have no kernel support to query any kind of topology, we generate the values return by the kernel for slice/subslice/EU and call update_from_masks() which in turns calls update_from_topology() v2: Fixup typo (Adam) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Lionel Landwerlin	d7c6a90c26	intel/dev: don't forget to set max_eu_per_subslice in generated topology Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Lionel Landwerlin	d1db5d562a	intel/dev: fix HSW GT3 number of subslices in slice1 Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: <mesa-stable@lists.freedesktop.org> Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/10015>	2021-11-05 10:22:18 +00:00
Rhys Perry	12294026d5	nir/algebraic: optimize Cyberpunk 2077's open-coded bitfieldReverse() fossil-db (Sienna Cichlid): Totals from 9 (0.01% of 128647) affected shaders: CodeSize: 29900 -> 28640 (-4.21%) Instrs: 5677 -> 5443 (-4.12%) Latency: 96561 -> 95025 (-1.59%) Copies: 571 -> 544 (-4.73%) Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13673>	2021-11-05 09:31:04 +00:00
Pierre-Eric Pelloux-Prayer	9b8bc712b2	mesa: remove NEW_COPY_TEX_STATE Since _NEW_PIXEL isn't handled in _mesa_update_state anymore, we can replace NEW_COPY_TEX_STATE by _NEW_BUFFERS. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13596>	2021-11-05 10:03:04 +01:00
Pierre-Eric Pelloux-Prayer	1ee3fbd703	mesa: always call _mesa_update_pixel `10c75ae4` moved handling of this state to the functions that depend on ctx->_ImageTransferState. So we can't depend on _NEW_PIXEL being set to call this function, since it'll be always clear earlier by _mesa_update_state_locked. Example sequence that would trigger the issue: glPixelTransferi(...) glClear(...) glTexSubImage2D(...) <-- won't use the new value set by glPixelTransferi because glClear caused _NEW_PIXEL to be cleared. _NEW_PIXEL itself is kept because st_update_pixel_transfer depends on it. Fixes: `10c75ae4` ("mesa: move _mesa_update_pixel out of _mesa_update_state") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5273 Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13596>	2021-11-05 10:02:59 +01:00
Samuel Pitoiset	6b002d2549	Revert "radv: only enable VK_EXT_display_control for vrcompositor (SteamVR)" It's fixed now. This reverts commit `db7ad0c170`. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13682>	2021-11-05 08:27:54 +01:00
orbea	0a6f079afe	build: add sha1_h for lp_texture.c ../mesa-9999/src/gallium/drivers/llvmpipe/lp_texture.c:55:10: fatal error: git_sha1.h: No such file or directory Fixes: `1608a815e3` ("llvmpipe: add support for EXT_memory_object(_fd)") Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Signed-off-by: orbea <orbea@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13665>	2021-11-05 05:54:20 +00:00
Jordan Justen	6ffdcc335e	iris: Use mi_builder in iris_load_indirect_location() For example, this allows us to take advantage of command-streamer based register offsets in mi_builder. Ref: `06cf838cbd` ("intel/mi_builder: Support gen11 command-streamer based register offsets") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13652>	2021-11-04 21:23:21 -07:00
Mike Blumenkrantz	833c0394e0	Revert "gallium/u_blitter: work around broken sample shading in llvmpipe and zink" This reverts commit `8b287c3f92`. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13679>	2021-11-05 02:36:32 +00:00
Mike Blumenkrantz	60a8d68285	gallivm: handle TGSI SampleId sysval Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13679>	2021-11-05 02:36:32 +00:00
Mike Blumenkrantz	64cee33984	lavapipe: add some asserts for descriptor dynamic offsets ensure that this explodes if there aren't enough offsets Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13677>	2021-11-05 02:21:01 +00:00
Mike Blumenkrantz	8c37cd8860	zink: rework cached fbfetch descriptor fallback this ended up being a little trickier than I thought; lazy descriptors don't use dynamic ubo types for the push set, which means drivers that (correctly) assert dynamic offset existence explode because the descriptor template will never work with the push set the better, though slightly more annoying, option here is to use the lazy manager's faster descriptor allocation and lesser complexity to quickly grab a push set, then tweak the existing cached codepath slightly in order to update a raw vkdescriptorset Fixes: `417477f60e` ("zink: always use lazy (non-push) updating for fbfetch descriptors") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13677>	2021-11-05 02:21:01 +00:00
Jesse Natalie	2d1f5e3dcb	d3d12: Don't accumulate timestamp queries If an app re-issues a timestamp query a lot, but doesn't ever ask for the results, we could end up running off the end of our query heap. But we don't actually need to advance/accumulate, so just use a single entry in the heap. Reviewed By: Bill Kristiansen <billkris@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12920>	2021-11-05 00:44:15 +00:00
Emma Anholt	34739cb6e2	freedreno/ir3: Fix off-by-one in prefetch safety assert. This looks like just a typo, we allow up to == 0xf in the lowering pass. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13668>	2021-11-04 22:49:29 +00:00
Emma Anholt	b0f2b0e980	freedreno/a5xx: Clean up a little bit of blitter array pitch setup. We have a nice helper function for determining an array pitch. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13668>	2021-11-04 22:49:29 +00:00
Emma Anholt	b26e0cdf44	freedreno/a5xx: Try to fix drawing to z/s miplevel/layer offsets. Terrifyingly, no testcases are fixed by this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13668>	2021-11-04 22:49:29 +00:00
Emma Anholt	99f5b7ba1e	freedreno/a5xx: Remove bogus assertion about BO size. The slice->size0 temp is being used as both the array stride (incorrectly) and as the size of the slice (for this assert). This assert doesn't seem to be in the right place to me, if you want to check that offset+slice size is < bo size, you could just do that at the end of layout setup. This caused troubles when fixing the temp to be the actual array stride for filling out the HW state, since then rendering to nonzero levels would think that the rendering overflowed the BO when it doesn't. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13668>	2021-11-04 22:49:29 +00:00
Emma Anholt	03d8677bca	freedreno/a6xx: Try to fix drawing to z/s miplevel/layer offsets. Terrifyingly, no testcases are fixed by this. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13668>	2021-11-04 22:49:29 +00:00
Emma Anholt	35f56ad856	freedreno/a5xx: Diff reduction in fd5_layout to fd6_layout. This should be exactly equivalent code, except for the is_3d "level <= 1" which doesn't bring over `6c19d37331` ("freedreno/a6xx: fix 3d tex layout") due to it failing our unit tests where we compare to the blob's behavior. The layer_stride setup is pulling in what freedreno_resource.c was doing after the layout setup, so we match fd6 and so that it could potentially be checked in unit testing. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13668>	2021-11-04 22:49:29 +00:00
Caio Oliveira	8fc6a11f0e	intel/blorp: Add option to emit packets that disable Mesh If a driver doesn't support Mesh, don't emit anything. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13660>	2021-11-04 14:41:06 -07:00
Caio Oliveira	ecba8178bd	intel/dev: Add an intel_device_info::has_mesh_shading bit Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13660>	2021-11-04 14:41:06 -07:00
Marcin Ślusarz	bba26939b1	intel/decoder: Dump Task/Mesh shaders Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13657>	2021-11-04 21:01:13 +00:00
Caio Oliveira	3567d47f3e	intel/genxml: Inline the BODY structs into the instructions Follows the convention used in other instructions. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13657>	2021-11-04 21:01:13 +00:00
Caio Oliveira	3fe2e862b5	intel/genxml: Add Mesh Shading structures Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Acked-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13657>	2021-11-04 21:01:13 +00:00
Jesse Natalie	b34fed64fa	u_prim_restart: Fix index scanning with start offset Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13681>	2021-11-04 20:39:25 +00:00
Mike Blumenkrantz	bc345281ab	aux/primconvert: handle singular incomplete restarts if no restart indices are found, this draw must be discarded to avoid crashing later on Fixes: `583070748c` ("util/primconvert: handle rewriting of prim-restart draws with unsupported primtype") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13630>	2021-11-04 20:12:32 +00:00
Emma Anholt	1e869e3fb4	freedreno/a5xx+: Fix missing LA formats. GL_ARB_texture_buffer_object uses these formats, and we expose it. Since we didn't have the formats in the table, we we were using bad HW texture/color formats for them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13666>	2021-11-04 19:07:54 +00:00
Emma Anholt	0e4fcda7e0	freedreno/a6xx: Don't try to generate mipmaps for SNORM with our blitter. Since we're casting to unorm, the linear filtering will give bad results. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13666>	2021-11-04 19:07:54 +00:00
Jason Ekstrand	953a4ca6fe	intel: Add has_bit6_swizzle to devinfo There's no good reason to have this rather complex check in three drivers. Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13636>	2021-11-04 18:51:04 +00:00
Marek Olšák	a0dc303b45	vbo: utilize structure padding to optimize indirection cold->prims[0].begin Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13589>	2021-11-04 18:24:15 +00:00
Marek Olšák	3f997bccc6	radeonsi: increase tc_max_cpu_storage_size Viewperf benefits. The number is only slightly above the size we need. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13603>	2021-11-04 17:36:26 +00:00
Marek Olšák	74adf22a0a	radeonsi: fix a typo preventing a fast depth-stencil clear Fixes: `9defe8aca9` - radeonsi: implement fast Z/S clears using clear_buffer on HTILE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13603>	2021-11-04 17:36:26 +00:00
Marek Olšák	c0f723ce2b	radeonsi: allow and finish TC-compatible MSAA HTILE This improves perf for Catia by 4%. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13603>	2021-11-04 17:36:26 +00:00
Marek Olšák	3baeaac64b	radeonsi: rename stencil_cleared_level_mask -> stencil_cleared_level_mask_once Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13603>	2021-11-04 17:36:26 +00:00
Marek Olšák	b1b491cdbb	radeonsi: add a faster clear path for glClearTexImage Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13603>	2021-11-04 17:36:26 +00:00
Marek Olšák	5d3aea49b8	radeonsi: fix 2 issues with depth_cleared_level_mask - Unset depth_cleared_level_mask for non-clear blits. Set the flag after the clear, so that we don't have to check blitter_running. - Set depth_cleared_level_mask only when we set depth_clear_value. Fixes: `ff8a930cf7` - radeonsi: add _once suffix to depth_cleared_level_mask Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13603>	2021-11-04 17:36:26 +00:00
Tapani Pälli	4885e63a6d	vulkan/wsi: implement missing wsi_register_device_event These changes implement vkRegisterDeviceEventEXT and detection of monitor hotplug. Wsi launches a thread that listens to udev events and signals the appropriate device fences when hotplug hapens. v2: use wsi fences instead of syncobj api (Jason Ekstrand) v3: refactor + cleanups, create thread on demand (Samuel Pitoiset) v4: bring back syncobj support from initial version for radv v5: make libudev dependency optional, check for poll errors (Simon Ser) v6: change matching mechanism to use udev device node instead of path v7: remove the matching mechanism v8: fix a race with thread creation + use single mutex + other cleanups (Jason Ekstrand) Fixes: dEQP-VK.wsi.display_control.register_device_event Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12305>	2021-11-04 16:57:29 +00:00
Tapani Pälli	9f6764953b	anv: setup syncobj fd via wsi_device_setup_syncobj_fd Patch moves initialization of variable so that we have fd when calling wsi initialization. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12305>	2021-11-04 16:57:29 +00:00
Tapani Pälli	01fb24d50e	radv: setup syncobj fd via wsi_device_setup_syncobj_fd Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12305>	2021-11-04 16:57:29 +00:00
Tapani Pälli	73f21ea2e1	vulkan/wsi: provide api for drivers to setup syncobj fd Drivers that import sync_fd to the wsi fences can use this to set file descriptor for syncobj related calls. This fixes permission errors when registering display/device events and importing sync_fd from driver side. Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12305>	2021-11-04 16:57:29 +00:00
Mike Blumenkrantz	92215d8da8	zink: add khr46 to ci this blocks out all the very long tests and marks failures as needed to improve the coverage of ci Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13656>	2021-11-04 16:42:04 +00:00
Mike Blumenkrantz	f5f2426ffd	zink: remove lazy ci job the push descriptor coverage for lavapipe should be okay in ci now, and that was the point of adding this job Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13656>	2021-11-04 16:42:04 +00:00
Joshua Ashton	7d64f0dd16	nvc0: Fix uninitialized width/height/depth warning. This can happen if view->resource is false. Fixes a warning in GCC 9+ that's been bugging me for a very long time when building Mesa. Signed-off-by: Joshua Ashton <joshua@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12955>	2021-11-04 15:31:09 +00:00
Marek Olšák	8b287c3f92	gallium/u_blitter: work around broken sample shading in llvmpipe and zink Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13602>	2021-11-04 15:06:09 +00:00
Marek Olšák	eb34716c1f	gallium/u_blitter: do MSAA copies in 1 pass using sample shading Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13602>	2021-11-04 15:06:09 +00:00
Marek Olšák	6d483fed85	gallium/u_blitter: disable sample shading for all blits Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13602>	2021-11-04 15:06:09 +00:00
Marek Olšák	7ce3f8e639	gallium/util: fix util_can_blit_via_copy_region with unbound render condition It returned false when a render condition was not bound, but it should have returned true. The bool stuff is random and incomplete, but that's life. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13602>	2021-11-04 15:06:09 +00:00
Mike Blumenkrantz	5d1b81d8ac	zink: clamp PIPE_SHADER_CAP_MAX_INPUTS for xfb vertex shader stages that can produce xfb must have their input size clamped to the compiler define MAX_VARYING to successfully be able to export an xfb output for each input fixes KHR-GL46.geometry_shader.limits.max_input_components cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13632>	2021-11-04 14:51:10 +00:00
Mike Blumenkrantz	46e167028d	zink: do a better job conserving locations for packed xfb outputs if an entire vec4 is exported to xfb, mark it as an explicit xfb buffer whenever possible to avoid blowing out the location limit Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13632>	2021-11-04 14:51:10 +00:00
Pierre-Eric Pelloux-Prayer	ff981a7e79	drirc: add options for BETA CAE Ansa application. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13364>	2021-11-04 14:16:55 +00:00
Pierre-Eric Pelloux-Prayer	f5dc334b6d	drirc: add mesa_extension_override option This allows specific per-application override. The existing MESA_EXTENSION_OVERRIDE env variable is kept. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13364>	2021-11-04 14:16:55 +00:00
Pierre-Eric Pelloux-Prayer	50c983402e	mesa/init: replace call_once with manual implementation This will be useful to add parameters to one_time_init(). _MTX_INITIALIZER_NP and Windows don't play nice together, so I had to keep a call_once() to initialize the mutex. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13364>	2021-11-04 14:16:55 +00:00
Pierre-Eric Pelloux-Prayer	596597f4a4	mesa: don't use dummy_true for some MESA extensions Otherwise we can't use MESA_EXTENSION_OVERRIDE, or rather: disabling one extension using dummy_true will disable all others. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13364>	2021-11-04 14:16:55 +00:00
Pierre-Eric Pelloux-Prayer	8a03e25977	mesa: print a warning when an extension can't be disabled Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13364>	2021-11-04 14:16:55 +00:00
Iago Toral Quiroga	aa5a0e1dad	broadcom/compiler: copy packing when converting add to mul Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13675>	2021-11-04 13:57:39 +00:00
Timur Kristóf	b59614619b	radv: Use MESA_VULKAN_SHADER_STAGES to make room for mesh/task. Signed-off-by: Timur Kristóf <timur.kristof@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13440>	2021-11-04 13:32:07 +00:00
Pierre-Eric Pelloux-Prayer	cd6e9ad36a	llvmpipe: add missing NIR alu-op handling nir_op_bcsel implemented based on ac_nir_to_llvm.c emit_bcsel function. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13246>	2021-11-04 13:59:00 +01:00
Pierre-Eric Pelloux-Prayer	a21c0f531e	mesa: enable force_direct_glx_context for DiscoveryStudio2020 This app uses Qt4.8 which uses an indirect context to save a GL scene to a pixmap. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13246>	2021-11-04 13:59:00 +01:00
Pierre-Eric Pelloux-Prayer	fc3ef76eec	glx/drirc: add a force_direct_glx_context option Some applications may request an indirect context but this feature is disabled by default on Xorg and thus context creation will fail. This commit adds a drirc setting to force the creation of direct glx context, regardless of what the app is requesting. Reviewed-by: Adam Jackson <ajax@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13246>	2021-11-04 13:59:00 +01:00
Pierre-Eric Pelloux-Prayer	9b09655a58	vbo/dlist: free copied.buffer if no vertices were copied Other parts of the code asserts that copied.buffer is NULL if there are no vertices to copy. Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13246>	2021-11-04 13:59:00 +01:00
Marek Olšák	118f2952b6	driconf: set vblank_mode=0 for viewperf2020 It doesn't disable vsync. Reported internally. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13612>	2021-11-04 11:04:35 +00:00
Pierre-Eric Pelloux-Prayer	dbf602a6b3	ac/surface: don't validate DCC settings if DCC isn't possible Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13550>	2021-11-04 09:38:46 +01:00
Pierre-Eric Pelloux-Prayer	bc6d22b920	radeonsi: fix ps_uses_fbfetch value si_update_ps_colorbuf0_slot used blitter_running as a way to detect recursive calls. Unfortunately this catch too many cases; for instance a backtrace like: #0 si_update_ps_colorbuf0_slot #1 si_set_framebuffer_state #2 do_blits [...] #5 si_blit #6 si_copy_region_with_blit Would end-up not updating ps_uses_fbfetch; so if the new fb_state is something like: cbufs = {0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0, 0x0}, zsbuf = 0x55b8987545e0} We can have ps_uses_fbfetch=true but cbufs[0] = NULL, which causes a crash later in si_ps_key_update_framebuffer. This commit fixes intermittent crashes in KHR-GL46.stencil_texturing.functional. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13550>	2021-11-04 09:38:42 +01:00
Pierre-Eric Pelloux-Prayer	d86d602ed0	radeonsi/sdma: fix bogus assert src can use dcc even for non sdma v5 variants because si_decompress_dcc is called in si_sdma_copy_image. Fixes: `46c95047bd` ("radeonsi: implement si_sdma_copy_image for gfx7+") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13550>	2021-11-04 09:38:41 +01:00
Pierre-Eric Pelloux-Prayer	84d4bda8e5	ac/surface: use a less strict condition in is_dcc_supported_by_L2 While Mesa chooses to always use independent_128B_blocks, other drivers can make different choices. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13550>	2021-11-04 09:38:27 +01:00
Pierre-Eric Pelloux-Prayer	dc56301f78	radeonsi: treat nir_intrinsic_load_constant as a VMEM operation This is used by variable indexing of constant arrays, to build code like this: s_add_u32 s6, s6, const_data@rel32@lo+4 s_addc_u32 s7, s7, const_data@rel32@hi+12 [...] global_load_dword v4, v4, s[6:7 Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5118 Fixes: `8288882965` ("radeonsi: set MEM_ORDERED optimally") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13550>	2021-11-04 09:38:20 +01:00
Joshua Ashton	af8fa2644e	radv: Split off cmd_buffer variant of descriptor set template updates Assumes cmd_buffer != NULL for this path to eliminate the checks in the generic code. Benchmarks: I made a microbenchmark based on Bas' vulkan microbench suite. https://gitlab.freedesktop.org/bnieuwenhuizen/vulkan_microbench/-/merge_requests/1 In this benchmark, this improves vkDescriptorTemplateUpdate performance consistently by 36%. ------------------------------------------------------------------- Benchmark Time CPU Iterations ------------------------------------------------------------------- DescriptorTemplateUpdate 81.2 ns 81.2 ns 8573169 -> ------------------------------------------------------------------- Benchmark Time CPU Iterations ------------------------------------------------------------------- DescriptorTemplateUpdate 52.9 ns 52.9 ns 13306065 Signed-off-by: Joshua Ashton <joshua@froggi.es> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13342>	2021-11-04 05:50:54 +00:00
Joshua Ashton	4865cb6514	radv: Split off cmd_buffer variant of descriptor set updates Assumes cmd_buffer != NULL for this path to eliminate the checks in the generic code. Signed-off-by: Joshua Ashton <joshua@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13342>	2021-11-04 05:50:48 +00:00
Joshua Ashton	7b0826db56	radv: Always inline descriptor writes Improves performance, see next commit for benchmarks. Signed-off-by: Joshua Ashton <joshua@froggi.es> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13342>	2021-11-04 05:50:47 +00:00
Emma Anholt	0913ac33a9	freedreno/a618: Mark a flaky test that triggers hangcheck. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13659>	2021-11-04 03:47:54 +00:00
Emma Anholt	d1801d43f8	freedreno/a5xx: Use the defined names for 2D_BLIT_CNTL regs. We have definitions for them above, no need to be UNKNOWN about it. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13659>	2021-11-04 03:47:54 +00:00
Emma Anholt	f0f5b8d47c	freedreno/a6xx: Fix partial z/s clears with sysmem. We have to set 8c01 to say "leave these channels alone" when clearing/storing just Z or S of z24s8. Fixes the bypass path for KHR-GLES3.packed_depth_stencil.verify_read_pixels.depth24_stencil8. Cc: mesa-stable Fixes: #5592 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13659>	2021-11-04 03:47:54 +00:00
Mike Blumenkrantz	aa5ca7fc3c	features: add dynamic render for lavapipe Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13627>	2021-11-04 03:22:09 +00:00
Mike Blumenkrantz	8a6160a354	lavapipe: VK_KHR_dynamic_rendering this is a conformant implementation Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13627>	2021-11-04 03:22:09 +00:00
Mike Blumenkrantz	dd71cc8947	lavapipe: fix cmd queuing for dynamic render this sucks, but it should be sorted quickly in #5440 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13627>	2021-11-04 03:22:09 +00:00
Mike Blumenkrantz	fbd5ded5e0	vk: update headers for 1.2.197 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13627>	2021-11-04 03:22:08 +00:00
Matt Turner	cc29b94041	freedreno/ir3: Use immediate for flat.b's src1 According to Jonathan Marek: Only one immediate can be decoded in a cat2 instruction (if both srcs are immediates, they will use the value of the either the first or second one, I don't remember which) - using 2 immediates in a cat2 instruction is only "correct" if they are both equal. The (i,j) in the second src of flat.b is not unused, but behaves as 0 for any (small) integer because it is a float src. The hack I suggested is to set the second src equal to (immediate) first src, which seems to work. This allows us to remove a couple of mov instructions or a bit of extra constfile usage. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13558>	2021-11-04 02:59:28 +00:00
Matt Turner	2ab0cf2b54	freedreno/ir3: Use flat.b to load flat varyings on a6xx The flat.b/bary.f cat2 instruction should be faster than an ldlv cat6 instruction, even with a couple of additional moves (which will be removed in the next patch). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13558>	2021-11-04 02:59:28 +00:00
Matt Turner	2ee1b5a526	freedreno/ir3: Add infrastructure for flat.b Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13558>	2021-11-04 02:59:28 +00:00
Matt Turner	a150e31910	ir3: Add support for (dis)assembling flat.b flat.b is a variant of the bary.f instruction that does not perform interpolation of the varying input. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13558>	2021-11-04 02:59:28 +00:00
Mike Blumenkrantz	417477f60e	zink: always use lazy (non-push) updating for fbfetch descriptors fbfetch descriptors are uncacheable due to having mixed descriptor types in the same set, so this needs to always use lazy updating to avoid exploding the cache and crashing cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13654>	2021-11-04 02:41:09 +00:00
Mike Blumenkrantz	2c54ad8f3d	zink: set fbfetch state on lazy batch data when enabling it this avoids creating new descriptor pools on every update cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13654>	2021-11-04 02:41:09 +00:00
Mike Blumenkrantz	841bea2c9f	anv: disable debug logging spam nobody wants to see this by default, so disable it until #5404 is resolved to make it more manageable Reviewed-by: Ivan Briano <ivan.briano@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13628>	2021-11-04 02:24:43 +00:00
Mike Blumenkrantz	4af7842ede	mesa/st: lower psiz for shader precompile if the driver is requesting that psiz always be set, precompile the first variant with psiz since that's most likely to be what's eventually used Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13588>	2021-11-04 01:54:07 +00:00
Mike Blumenkrantz	8be35803a5	mesa/st: rework psiz lowering the context flag is immutable, and it's set by drivers that always require the shader to export a psiz value for rasterization "always" is all cases except GL_VERTEX_PROGRAM_POINT_SIZE being enabled, in which case it should be assumed that the shader already writes it, and no changes should be made thus the shader key value can be changed to reflect that, when set, it requires the shader to export a psiz value if it doesn't already Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13588>	2021-11-04 01:54:07 +00:00
Eric Engestrom	97c3b658b3	docs: update calendar for 21.3.0-rc4 Add another release candidate as we're not ready for 21.3.0 final just yet. Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13667>	2021-11-04 00:38:53 +00:00
Marek Olšák	cdc288993e	st/mesa: don't update vertex elements when GL doesn't change them We rely on mesa/main to tell us whether to update vertex elements. This decreases overhead for obvious reasons. The select/feedback code path doesn't use this, which is why you see unconditional "ALL" in a few codepaths. This sequence of GL calls doesn't update vertex elements if only the pointer and stride vary: glVertexPointer() glDrawElements() glVertexPointer() glDrawElements() glVertexPointer() glDrawElements() Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Marek Olšák	69ee132b86	cso: add missing parameters into cso_set_vertex_buffers they will be used later Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Marek Olšák	29877c1dce	mesa: add NewVertexBuffers/NewVertexElements flags to indicate state changes This splits the per-VAO NewArrays flag into NewVertexBuffers and NewVertexElements, and adds a global NewVertexElements flag to be used by drivers. This allows gallium to skip updating vertex elements when only vertex buffers need to be updated. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Marek Olšák	ae625da7ef	mesa: change gl_vertex_array_object::NewArrays to bool the individual bits are never used Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Marek Olšák	d24539b152	st/mesa: use POPCNT in st_update_array if the CPU supports it The st_update_array overhead decreases from 8.28% to 7.67% for a viewperf subtest. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Marek Olšák	5b8b34825f	st/mesa: change st_atom_array.c to cpp Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Marek Olšák	81d35c8d48	util: add a util_bitcount variant that selects POPCNT through C++ template arg Moved from radeonsi. st/mesa will use it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13512>	2021-11-03 23:22:31 +00:00
Filip Gawin	e1c640c3a4	r300: stub derivatives on r300 and r400 hardware There are three approaches for problem: - use dummy shader (current solution) - use software rendering - stub First option always gonna give bad results, second one gonna be slideshow (r300/r400 at best are running with old 2 cores cpu), third one can sometimes give graphical gliches. IMHO third one is least annoying option. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13642>	2021-11-03 21:56:19 +00:00
Emma Anholt	14fca01b32	freedreno: Fix layered rendering to just Z/S and not color. We would try to take the gmem path which can't do layered rendering. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13653>	2021-11-03 21:13:45 +00:00
Mike Blumenkrantz	7c8fee6049	build: add sha1_h to llvmpipe build cc: mesa-stable fixes #5588 Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13658>	2021-11-03 20:50:24 +00:00
Caio Oliveira	1fb41193fe	.mailmap: Simplify my name Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13651>	2021-11-03 20:48:43 +00:00
Mike Blumenkrantz	3137ff4709	zink: add queue locking sparse binds have to be processed synchronously with cmdbuf recording to avoid resource object desync in the vk driver, which means they have to be done in the driver thread instead of the flush thread. this necessitates adding locking for the queue since there is now a case when submissions occur in a different thread fixes illegal multithread usage in KHR-GL46.CommonBugs.CommonBug_SparseBuffersWithCopyOps cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13597>	2021-11-03 20:34:25 +00:00
Mike Blumenkrantz	786167b88c	zink: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY fixes #5557 Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13556>	2021-11-03 20:03:55 +00:00
Mike Blumenkrantz	8297d243fb	gallium: add PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY vulkan requires that vertex attribute access be aligned to the size of a component for the attribute, but GL has no such requirements the existing alignment caps are unnecessarily restrictive for applying this limitation, so this cap now pre-calculates the masks for elements and vertex buffers in vbuf to enable rewriting misaligned buffers Acked-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13556>	2021-11-03 20:03:55 +00:00
Emma Anholt	c356f3cfce	freedreno/a6xx: Use the fdl buffer view setup for img/ssbo descriptors. The single-plane descriptor emit helper doesn't strictly need the UBWC reloc, since imageBuffer can't be UBWC, but it means the function is ready to be used for non-buffer image descriptors later. no-hw drawoverhead 1-imageBuffer change throughput 1.95457% +/- 1.44325% (n=127). Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13635>	2021-11-03 19:38:48 +00:00
Emma Anholt	3050e20283	freedreno/fdl6: Add support for texture swizzles of A/L/I/LA/RGBx. To convert freedreno over, we need to support these formats where we remap R or RG formats to GL compat ones, or RGBA to RGBx. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13635>	2021-11-03 19:38:48 +00:00
Emma Anholt	669caded51	turnip: Remove buffer-view cross-check code. Now that I've tested storage.*buffer, I'm confident I've moved the buffer views correctly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13635>	2021-11-03 19:38:48 +00:00
Emma Anholt	ef1fb25787	turnip: Use the new shared buffer-view descriptor creation function. This cross-checks that our descriptors match as I move the code. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13635>	2021-11-03 19:38:48 +00:00
Emma Anholt	aa3074e5be	freedreno/fdl6: Add an interface for setting up buffer descriptors. Buffers don't need all the layout stuff that image views do, so it's easier to have a separate interface for generating them. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13635>	2021-11-03 19:38:48 +00:00
Emma Anholt	7b578c1249	freedreno/a6xx: Emit a null descriptor for unoccupied IBO slots. Fixes a crash in some desktop GL testcases in piglit. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13635>	2021-11-03 19:38:48 +00:00
Emma Anholt	29093bc42d	freedreno: Fix gmem invalidating the depth or stencil of packed d/s. The gmem store stores both depth and stencil for z24s8. So, if we're doing a write (clear or draw) to one or the other of the channels, we need the other one restored as well. Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13649>	2021-11-03 18:56:23 +00:00
Caio Oliveira	858424bd2e	intel/compiler: Use gl_shader_stage_uses_workgroup() helpers Instead of checking for MESA_SHADER_COMPUTE (and KERNEL). Where appropriate, also use gl_shader_stage_is_compute(). This allows most of the workgroup-related lowering to be applied to Task and Mesh shaders. These will be added later and "inherit" from cs_prog_data structure. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13629>	2021-11-03 11:09:48 -07:00
Caio Oliveira	c4355d3f24	intel/compiler: Make brw_nir_populate_wm_prog_data() static Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13629>	2021-11-03 11:09:48 -07:00
Vadym Shovkoplias	732cfa525f	anv: Include viewport size in scissor rectangle Prevent drawing outside the viewport when viewport size is smaller than framebuffer size. v2: (Jason Ekstrand) - re-emit scissor on viewport change - do the same calculations for other platforms Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5515 Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13594>	2021-11-03 17:48:10 +00:00
Caio Marcelo de Oliveira Filho	71022a53e4	anv: Process FS last when compiling graphics pipeline Enum values for MESA_SHADER_TASK and MESA_SHADER_MESH are larger than MESA_SHADER_FRAGMENT, so can't rely on the them for ordering anymore. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13637>	2021-11-03 17:28:25 +00:00
Caio Marcelo de Oliveira Filho	a0109a3754	anv: Make shaders array in anv_graphics_pipeline fit Task/Mesh We could decouple the locations in the array from the gl_shader_stage enum values, but for now this is convenient. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13637>	2021-11-03 17:28:25 +00:00
Caio Marcelo de Oliveira Filho	eb430aa9e7	anv: Get rid of "may be used initialized" warning in anv_QueueSubmit2KHR The code is correct, but compiler can't see it. Initialize the value to NULL and assert on it if the function succeeds. It both helps the compiler and make the code slightly more robust. ``` ../src/intel/vulkan/anv_queue.c: In function ‘anv_QueueSubmit2KHR’: ../src/intel/vulkan/anv_queue.c:932:16: warning: ‘impl’ may be used uninitialized in this function [-Wmaybe-uninitialized] 932 \| result = anv_queue_submit_add_timeline_wait(queue, submit, \| ^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 933 \| &impl->timeline, \| ~~~~~~~~~~~~~~~~ 934 \| value); \| ~~~~~~ ../src/intel/vulkan/anv_queue.c:899:31: note: ‘impl’ was declared here 899 \| struct anv_semaphore_impl *impl; \| ^~~~ ``` Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13638>	2021-11-03 17:14:36 +00:00
Danylo Piliaiev	3afdc3ab2c	freedreno/computerator: Support A660 gpu Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13640>	2021-11-03 16:32:19 +00:00
Danylo Piliaiev	79fcd63bd6	tu: fix rast state allocation size on a6xx gen4 A few regs were added without changing the size of draw state. Fixes: `4e05338d99` ("turnip: Rast updates for a6xx gen4") Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13644>	2021-11-03 16:09:23 +00:00
Mike Blumenkrantz	675519f1d0	zink: reject all storage multisampling if the feature is unsupported this also enables removing a stupid conditional cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13631>	2021-11-03 14:14:44 +00:00
Mike Blumenkrantz	aacdc6eb44	zink: add SpvCapabilityStorageImageMultisample for multisampled storage images cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13631>	2021-11-03 14:14:44 +00:00
Iago Toral Quiroga	a794bdf953	broadcom/compiler: check that sig packing is valid when pipelining ldvary Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13641>	2021-11-03 10:49:06 +00:00
Pierre-Eric Pelloux-Prayer	c6e2c802c4	glsl/nir: mark samplers inside a block as bindless Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13416>	2021-11-03 10:22:41 +01:00
Pierre-Eric Pelloux-Prayer	e3a51a408f	mesa: don't reset SamplersValidated if nothing changed This could prevent error detection, if a uniform change sets SamplersValidated to true without calling _mesa_update_shader_textures_used. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13416>	2021-11-03 10:22:41 +01:00
Samuel Pitoiset	e15e3a8e86	radv: optimize subpass barrier flushes for imageless framebuffers The driver should always know the attachments at this point. This should reduce the number of L2 cache flushes for imageless framebuffers. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13291>	2021-11-03 08:26:43 +01:00
Vinson Lee	f13d486ee7	intel/compiler: Initialize SIMDSelectionTest member error. Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) uninit_member: Non-static class member error is not initialized in this constructor nor in any functions that it calls. Fixes: `7558340ebb` ("intel/compiler: Add helpers to select SIMD for compute shaders") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13608>	2021-11-03 04:22:35 +00:00
Vinson Lee	d863e62b32	intel/compiler: Change selected_simd return type to int. brw_simd_select return type is int. Fix defect reported by Coverity Scan. Unsigned compared against 0 (NO_EFFECT) unsigned_compare: This less-than-zero comparison of an unsigned value is never true. selected_simd < 0U. Fixes: `7dda0cf2b8` ("intel/compiler: Use SIMD selection helpers for CS") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13606>	2021-11-03 01:48:46 +00:00
Mike Blumenkrantz	ac2af149f1	zink: stop double printing validation messages VVL already prints its messages using configurable settings. there's no reason for zink to unconditionally repeat them immediately after cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13633>	2021-11-03 01:22:56 +00:00
Emma Anholt	4e28962800	ci: Uprev VK-GL-CTS to 1.2.7.2, and pull in piglit while I'm here. The VK-GL-CTS fixes some issues for freedreno, and almost all of LVP's xfails. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13622>	2021-11-02 20:29:31 +00:00
Jesse Natalie	8d3a3e7a00	microsoft/compiler: Use textures for SRVs After running the (renamed) dxil_nir_split_typed_samplers pass, the shader will have either: * Textures, which map to D3D SRVs * Bare samplers, which map to D3D bare samplers * Images, which map to D3D UAVs There shouldn't be any remaining samplers with type information Reviewed-by: Enrico Galli <enrico.galli@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13390>	2021-11-02 11:02:22 -07:00
Jesse Natalie	ffd4157b1c	util/hash_table: Clear special 0/1 entries for u64 hash table too Fixes: `e532a47f` ("util/hash_table: do not leak u64 struct key") Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13390>	2021-11-02 11:02:18 -07:00
Rhys Perry	c047fc9de3	docs: update radv extensions in features.txt Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13626>	2021-11-02 12:36:54 +00:00
Iago Toral Quiroga	6b9bd3f038	broadcom/compiler: make opt passes set current block Typically, optimization passes go through all the blocks in a shader and make adjustments on the fly, so we always want them to update the current block or the current block pointer will become outdated. Also, we don't need to keep track of the previous current block pointer to restore it, since optimization passes run after we have completed conversion to VIR, and therefore, anything that comes after that should always set the current block before emitting code. Fixes debug assert crashes when running shader-db: vir.c:1888: try_opt_ldunif: Assertion `found \|\| &c->cur_block->instructions == c->cursor.link' failed Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13625>	2021-11-02 11:17:01 +00:00
Tomeu Vizoso	27cb4166b5	freedreno/ci: Test Turnip on Adreno 618 Collabora has added 7 new lazor Chromebooks which have Adreno 618 GPUs. Run half of the test suite (plus variants) in pre-merge. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13441>	2021-11-02 10:51:54 +00:00
Tomeu Vizoso	83a0bb007f	ci: Let manual LAVA jobs have a longer timeout than others So far only LAVA jobs make use of it, but I guess baremetal could be extended to have these timeouts as well. Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13441>	2021-11-02 10:51:54 +00:00
Tomeu Vizoso	dedc149307	ci: Add support for lazor Chromebooks Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13441>	2021-11-02 10:51:54 +00:00
Gert Wollny	634e2353a0	virgl: Add driconf tweak to force-enable reading back R8_SRGB textures In the menu of CS:GO R8_SRGB textures are uploaded and read back, and since R8_SRGB can't be read back on GLES, because it is not a rendertarget format and glGetTexImage and siblings don't exists, we can't default to enabling reading back this format. This leads to an emulation of the glGetTexImage calls issued by CS:GO, and this slows down the menus a lot (below 1 fps on Intel XE hosts). So add this driconf tweak and enable it for CS:GO to work around the issue. It can be done safely, because in this case we actually can use the data that is stored on the host in the backing IOV. This tweak lets the CS:GO menu run at around 60 FPS when run with virgl on a Intel XE host when it would run with less than 1 FPS without the tweak. Signed-off-by: Gert Wollny <gert.wollny@collabora.com> Reviewed-by: John Bates <jbates@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13572>	2021-11-02 10:22:52 +00:00
Paulo Zanoni	1311eddd52	iris: fix off-by-one error when clearing stale syncobjs This shouldn't fix any real world bugs, except it will now clear more stale syncobjs than it was previously doing, and actually do what the comment says it does. I could not find a real workload where this change would be relevant, although I didn't try too much. I wrote my own little egl program to test this. I spotted this while reading the code when investigating a Piglit failure [0]. It turns out this part the code was not relvant for the failure. [0]: ext_external_objects-vk-image-display-overwrite Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13536>	2021-11-02 08:31:54 +00:00
Samuel Pitoiset	db7ad0c170	radv: only enable VK_EXT_display_control for vrcompositor (SteamVR) One CTS test is still failing and the fix isn't upstream yet. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13623>	2021-11-02 08:08:33 +00:00
Ella Stanforth	3c86292321	v3dv: Implement VK_KHR_create_renderpass2 Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13575>	2021-11-02 07:43:31 +00:00
Guilherme Gallo	1813c70943	panfrost/ci: update piglit tests expectations on G52 Because we aren't running this test on pre-merge, the expectations have gone out of sync with the code. Sync them again. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13089>	2021-11-02 07:08:21 +01:00
Guilherme Gallo	8d96cf4eaa	ci/freedreno: Add maxcpus=2 to the kernel cmdline on a530 It seems that the cpufreq support in a530 is not rock solid yet, as the device intermittently reboots during the boot process. Passing `maxcpus=2` to the kernel avoids this issue. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13089>	2021-11-02 07:06:46 +01:00
Guilherme Gallo	38c62646d0	iris/ci: Fix traces for amly and deqp list for whl These are manual jobs, so the expected results got out of date. Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13089>	2021-11-02 07:06:07 +01:00
Guilherme Gallo	7fea3c6f14	ci: Update linux kernel to v5.15 * Update Kconfig for x86_64 and ARM64. Follow the dependency tree of the kernel modules to make sure that the intended configurations are being set. Check scripts/merge_config.sh output as well to see if there is a requested Kconfig not being considered. For a630 devices: * Use kernel version with a6xx workaround for frequency scaling * Enable CONFIG_QCOM_LMH targeting a630 slowness on new kernel ---- Out of tree patches used ---- For a360 device: * Revert a commit which remove slpi_region from msm8996: 8b0031f8bda2 ("Revert "arm64: msm8996: fix memory region overlap"") Signed-off-by: Guilherme Gallo <guilherme.gallo@collabora.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13089>	2021-11-02 06:59:27 +01:00
Qiang Yu	4098607870	xmlconfig_test: add unit test for executable_regexp Reviewed-by: Emma Anholt <emma@anholt.net> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13304>	2021-11-02 02:21:00 +00:00
Qiang Yu	c7f481c615	drirc: add Mari application workaround To allow using non-constant sampler array index. This prevent Mari render fail for any scene. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13304>	2021-11-02 02:21:00 +00:00
Qiang Yu	9a24de9aa7	driconf: add executable_regexp application attribute For matching executable with variable names like Mari which append version string to its executable like: Mari4.5v2, Mari4.7v4. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Signed-off-by: Qiang Yu <yuq825@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13304>	2021-11-02 02:21:00 +00:00
Vinson Lee	308bd1f00c	zink: Remove duplicate variable unsized. Fix defect reported by Coverity Scan. Evaluation order violation (EVALUATION_ORDER) write_write_typo: In unsized = unsized = glsl_array_type(glsl_uintN_t_type(bit_size), 0U, bit_size / 8U), unsized is written twice with the same value. Fixes: `f79a25653b` ("zink: move all shader bo/sharedmem access to compiler passes") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13607>	2021-11-01 18:59:45 -07:00
Dave Airlie	a8725ec3dc	vulkan/wsi: set correct bits for host allocations/exports for images. Lavapipe was hitting asserts in this area due to incorrect bits being specified. Set the handle type depending on the sw flag, and set a correct handle type for the memory host ptrs. v2: add image export struct to image creation (Jason) Fixes: `895d3399f7` ("lavapipe: add support for KHR_external_memory_fd") Reviewed-by: Roland Scheidegger <sroland@vmware.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13615>	2021-11-02 10:27:04 +10:00
Bas Nieuwenhuizen	d66514aacc	radv: Disable coherent L2 optimization on cards with noncoherent L2. With high likelihood we are forgetting to set the noncoherent bits somewhere but I don't have the HW to debug. To avoid user pain disable this optimization on these GPUs. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5505 Fixes: `fd8210f27e` ("radv: Try to do a better job of dealing with L2 coherent images.") Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13620>	2021-11-01 23:48:43 +00:00
Lorenz Brun	65afcddbf1	frontends/va: Return error in vaRenderPicture if decoder is NULL This fixes a crash if a data slice is submitted before the decoder is initialized. A well-behaved application shouldn't encounter this but returning an error is still better than crashing the entire process and the rest of the code is similarly defensive. Signed-off-by: Lorenz Brun <lorenz@brun.one> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11792>	2021-11-01 23:23:17 +00:00
Vadym Shovkoplias	2dbb66997e	intel/fs: Fix a cmod prop bug when cmod is set to inst that doesn't support it Fixes dEQP-VK.reconvergence.nesting tests. There are cases when cmod is set to an instruction that cannot have conditional modifier. E.g. following: find_live_channel(32) vgrf166:UD, NoMask cmp.z.f0.0(32) null:D, vgrf166+0.0<0>:D, 0d is optimized to: find_live_channel.z.f0.0(32) vgrf166:UD, NoMask v2: - Add unit test to check cmod is not set to 'find_live_channel' (Matt Turner) - Update flag_subreg when conditonal_mod is updated (Ian Romanick) Signed-off-by: Vadym Shovkoplias <vadym.shovkoplias@globallogic.com> Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5431 Fixes: `32b7ba66b0` ("intel/compiler: fix cmod propagation optimisations") Reviewed-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13268>	2021-11-01 21:08:12 +00:00
Emma Anholt	085e838959	i915g: Improve the explanation for the 1D Y swizzle. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13133>	2021-11-01 20:56:22 +00:00
Emma Anholt	fa4fd67f78	i915g: Make sure we consider negates/swizzles on bias/shadow coords. Caught by imirkin while debugging #4986. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13133>	2021-11-01 20:56:22 +00:00
Emma Anholt	ebe5626de6	i915g: Check for negate/swizzle on TGSI_OPCODE_KILL_IF's src.yzw. Caught by imirkin while debugging #4986. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13133>	2021-11-01 20:56:22 +00:00
Emma Anholt	ba48b27a11	etnaviv: Switch to the NIR compiler by default. This was the conclusion for the next action in https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12889, and I wanted to get moving on it as part of !8044. I made the change as mechanical as possible to ease review. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13535>	2021-11-01 20:47:58 +00:00
Samuel Pitoiset	9b80f4d5f2	radv: rename radv_shader_variant to radv_shader Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13548>	2021-11-01 20:04:45 +00:00
Samuel Pitoiset	eeb034f2cc	docs: document RADV_THREAD_TRACE_* envvars Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13477>	2021-11-01 17:32:50 +00:00
Samuel Pitoiset	3839f3a486	radv: stop reporting SQTT/RGP support as experimental It can be considered stable these days. This also now reports the buffer size and if instruction timing is enabled/disabled. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13477>	2021-11-01 17:32:50 +00:00
Samuel Pitoiset	74e625f057	radv: enable SQTT instruction timing by default It seems stable enough to turn it on by default. This replaces RADV_THREAD_TRACE_PIPELINE by RADV_THREAD_TRACE_INSTRUCTION_TIMING which is enabled by default. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13477>	2021-11-01 17:32:50 +00:00
Samuel Pitoiset	7fd414ec84	radv: remove useless checks about GFX7 for SQTT It's only enabled on GFX8+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13477>	2021-11-01 17:32:50 +00:00
Samuel Pitoiset	b35fd1bff6	radv: move freeing the trigger SQTT file at a better place Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13477>	2021-11-01 17:32:50 +00:00
Christian Gmeiner	c5a1df095c	ci/etnaviv: add manual piglit testing Initial work by Christian, polishing by @anholt. Takes ~21 minutes, and flakes may not be fully classified yet. Signed-off-by: Christian Gmeiner <christian.gmeiner@gmail.com> Reviewed-by: Emma Anholt <emma@anholt.net> Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13600>	2021-11-01 09:18:23 -07:00
Emma Anholt	d2440b27e5	ci/etnaviv: Add some more deqp flakes I've seen in recent runs. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13600>	2021-11-01 09:18:23 -07:00
Emma Anholt	7355694e11	ci/etnaviv: Fix the dependency for the build artifacts. This is an armhf board, not arm64. Fixes 404s from trying to pull the artifacts early. Reviewed-by: Christian Gmeiner <christian.gmeiner@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13600>	2021-11-01 09:18:23 -07:00
Jason Ekstrand	79f57f6893	lavapipe: Don't wrap errors returned from vk_device_init in vk_error vk_device_init already calls vk_error so this is redundant. Also, it makes vk_error grumpy to see a VK_ERROR_FEATURE_NOT_PRESENT on an instance rather than a physical device. Fixes: `47adb11143` ("lavapipe: Switch to the new vk_error helpers") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13619>	2021-11-01 14:30:32 +10:00
Mike Blumenkrantz	73af67883d	zink: force float dest types on some alu results these aren't exact matches in spirv, so set the expected result type to float where necessary cc: mesa-stable fixes #5567 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13562>	2021-11-01 01:15:20 +00:00
Mike Blumenkrantz	c73f5a0082	zink: add more int/float types to cast switching in ntv these come from opcode results, which are not always 32bit cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13562>	2021-11-01 01:15:20 +00:00
Mike Blumenkrantz	69501ff458	zink: explicitly enable VK_EXT_shader_subgroup_ballot this is needed when not creating 1.2 contexts cc: mesa-stable ref #5567 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13562>	2021-11-01 01:15:20 +00:00
Mike Blumenkrantz	ccfe36fffa	zink: clamp max buffer sizes to smallest buffer heap size the max driver limit for these is irrelevant if there isn't enough memory to allocate a buffer of that size KHR-GL46.texture_buffer.texture_buffer_max_size cc: mesa-stable fixes #5568 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13584>	2021-11-01 01:01:33 +00:00
Mike Blumenkrantz	fd2b47281f	zink: error when trying to allocate a bo larger than heap size this is illegal and would fail anyway cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13584>	2021-11-01 01:01:33 +00:00
Mike Blumenkrantz	aa5e544644	zink: don't clamp 2D_ARRAY surfaces to 2D another thing that used to be needed but now isn't cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13586>	2021-11-01 00:49:24 +00:00
Mike Blumenkrantz	8d2280f533	zink: don't clamp cube array surfacess to cubes this was probably necessary for some other reason that has since been fixed, and instead now just creates validation spam cc: mesa-stable fixes #5566 Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13586>	2021-11-01 00:49:24 +00:00
Mike Blumenkrantz	6ab915960c	zink: be more spec-compliant for unnormalizedCoordinates samplers the spec prohibits using most stuff with these, but also they probably are just texelfetch anyway so it doesn't matter Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13587>	2021-11-01 00:33:19 +00:00
Dave Airlie	75dc302340	lavapipe: drop EXT_acquire_xlib_display This has a requirement on the display extensions. VK-GL-CTS: dEQP-VK.info.instance_extensions Fixes: `1d574d4860` ("lavapipe: remove display extension support") Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13616>	2021-11-01 09:46:24 +10:00
Rob Clark	7e998783db	freedreno/ir3: xfb fix for duplicate outputs We can't rely on regid to be unique, shaders can have multiple varyings with the same output value. Normally shader linking deduplicates these, but we still need to handle the case for xfb. So use slot instead as the unique identifier. Fixes KHR-GLES31.core.gpu_shader5.fma_precision_* Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13605>	2021-10-31 16:30:13 +00:00
Rob Clark	f6f760a98d	freedreno/ir3/print: Show end's outidxs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13605>	2021-10-31 16:30:13 +00:00
Mike Blumenkrantz	6239adebbc	zink: flag renderpass change when toggling fbfetch ensure the input attachment gets updated fixes running KHR-GL46.blend_equation_advanced.blend_all.GL_MULTIPLY_KHR_all_qualifier after KHR-GL46.blend_equation_advanced.BlendEquationSeparate cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13598>	2021-10-31 14:56:51 +00:00
Jordan Justen	2d041d5f1e	Revert "iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13" Round and round we go :) In the "drm/i915/adlp/fb: Remove CCS FB stride restrictions" series, https://lists.freedesktop.org/archives/intel-gfx/2021-October/281768.html, it now appears that kernel can allow these modifiers to work with adl-p. This reverts commit `d4174f5f05`. Fixes: `d4174f5f05` ("iris: Disable I915_FORMAT_MOD_Y_TILED_GEN12* on adl-p/display 13") Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13565>	2021-10-31 01:24:43 -07:00
Connor Abbott	68a62226e4	ir3: Don't emit barriers for make_available/make_visible When looking at the output of some CTS tests, I realized that the barriers vtn currently inserts to emulate coherent memory accesses were being turned into fences, even though we never needed to do anything special for coherent accesses before so presumably accesses are already cache-coherent by default. Ignore make_visible/make_available semantics to get us back to parity with the old path. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13599>	2021-10-29 23:38:10 +00:00
Jason Ekstrand	4108fda426	vulkan: Move all the common object code to runtime/ Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	4af0c038cf	vulkan: Move trampoline code-gen to its own file This way we sepaprate the raw tables from the code that actively depends on vk_device and friends. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	d53f20ff4d	vulkan: Break entrypoint parsing into its own file Instead of having a bunch of stuff depend on vk_dispatch_table_gen to get the list of entrypoints, pull that into its own vk_entrypoints file. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	8f9e0c2816	vulkan/dispatch_table: EntrypointBase doesn't need to derive from object We use python3 now and everything derives from object automatically Suggested-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	3fd7cc9d42	vulkan: Drop unnecessary [en]coding comments from python generators Suggested-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	a208b849f3	vulkan: Rework mako error handling in python generators Suggested-by: Dylan Baker <dylan@pnwbakers.com> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	006633c043	lavapipe: Use vk_instance_get_proc_addr_unchecked for WSI It exists precisely to handle this case without the driver looking up trampolines itself. This is nearly identical to what ANV does. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	5ccba1576d	v3dv: Use vk_instance_get_proc_addr_unchecked for WSI It exists precisely to handle this case without the driver looking up trampolines itself. This is nearly identical to what ANV does. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	837a142f2d	vulkan/vk_extensions_gen: Stop including vk_object.h All we need from it is vulkan_common.h for VkExtensionProperties. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Jason Ekstrand	741262ee01	vulkan/vk_extensions_gen: Drop support for extra includes No one is using this anymore. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13156>	2021-10-29 23:12:32 +00:00
Vinson Lee	20620af1e4	clover: Add constructor for image_rd_argument. Fix defect reported by Coverity Scan. Uninitialized pointer field (UNINIT_CTOR) member_not_init_in_gen_ctor: The compiler-generated constructor for this class does not initialize st. Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13500>	2021-10-29 22:57:46 +00:00
Mike Blumenkrantz	e8f18385e0	zink: inject LOD for sampler version of OpImageQuerySize this is required by spec cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13585>	2021-10-29 19:05:07 +00:00
Mike Blumenkrantz	87fbb0eab0	zink: be more permissive for injecting LOD into texture() instructions there's other variants of implicit lod sampling, and none of them are valid outside fragment stage Fixes: `3ad06b6949` ("zink: always use explicit lod for texture() when legal in non-fragment stages") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13585>	2021-10-29 19:05:07 +00:00
Connor Abbott	8d0b508d91	ir3: Emit barriers for images again This was accidentally broken with the nir_var_image work. Fixes: `e87dbfd3` ("ir3: Check for nir_var_mem_image in shared_barrier handling") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13593>	2021-10-29 16:33:04 +00:00
Marek Olšák	8bfa146b80	radeonsi: print the border color error message only once Cc: 21.2 21.3 <mesa-stable@lists.freedesktop.org> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13590>	2021-10-29 12:33:55 +00:00
Marek Olšák	a4e0b02f85	mesa: skip strlen when hashing strings for ProgramResourceHash Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	7fe3cc94eb	mesa: add separate hash tables for each GLSL resource type This removes the malloc for the key, the associated string pointer indirection, and hopefully the hash table has fewer elements than the global one. This decrease overhead of glGetUniformLocation. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	1b1425d45c	mesa: handle hash collisions in program resource lookups (e.g. uniforms) We computed the hash of the name and used it as a key, and then the table computed the hash of the hash. Do it properly: Pass the name as a string into the hash table and let the table handle everything. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	aad903c3f5	mesa: preparse [ and [0] in gl_resource_name and use it in shader_query.cpp strrchr is very expensive here. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	4b67055fef	mesa: rename locals in _mesa_program_resource_find_name for clarity Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	22d51f3c92	mesa: precompute strlen in gl_resource_name::length and use it Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	81ad6a8b64	mesa: don't compute the same strlen up to 3x in _mesa_program_resource_find_name Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:20 -04:00
Marek Olšák	dea558cbd2	glsl: add gl_resource_name to precompute "name" properties later This just adds the structure with a name and its update function. strlen and others will be added in the following commits. The idea is to parse and analyze the name in advance to make glGetUniformLocation faster. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 07:19:18 -04:00
Marek Olšák	c216f1931d	mesa: use alloca in search_resource_hash Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13507>	2021-10-29 05:09:55 -04:00
Samuel Pitoiset	1776d741c5	zink: add CI lists and deqp-suite configuration for RADV This is used by our local CI (ie. vk-cts-image) which is a separate project outside of Mesa. We use it for testing RADV since a while. The CI lists have been created against Navi2x (Sienna Cichlid). Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13573>	2021-10-29 08:28:02 +00:00
Marek Olšák	76892c4e46	vbo: restructure vbo_save_vertex_list to get more cache hits - Move more stuff into the cold structure. - Reorder fields for better packing. - Flatten the gallium and merged nested structures. Since we have tens of thousands of these, decreasing the size improves performance by 13%. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13506>	2021-10-29 07:33:50 +00:00
Marek Olšák	3835205a0e	vbo: use int16_t for vbo_save_vertex_list::gallium::private_refcount We never use more than 16 bits. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13506>	2021-10-29 07:33:50 +00:00
Marek Olšák	97caca6b47	vbo: return a GL error earlier in vbo_save_playback_vertex_list_gallium Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13506>	2021-10-29 07:33:50 +00:00
Marek Olšák	fa2c39df0f	mesa: remove PADDING_64BIT by adding the dlist header into vbo_save_vertex_list Now we can put useful data where the padding was. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13506>	2021-10-29 07:33:50 +00:00
Marek Olšák	05605d7f53	mesa: remove display list OPCODE_NOP This decreases overhead because there are fewer nodes to parse. There are 2 changes done here: - If the next node offset is (offset % 8) == 4, pad the last node instead of inserting NOP. This makes sure that the node offset is aligned. - The vertex list node will add 4 bytes to the header to make the payload aligned, so the payload will be at &n[2]. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13506>	2021-10-29 07:33:50 +00:00
Marek Olšák	4a26f57103	mesa: fix locking when destroying/overwriting/adding display lists We need to hold the lock when calling destroy_list and doing _mesa_HashInsertLocked in EndList. So move the locking out of destroy_list. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13506>	2021-10-29 07:33:50 +00:00
Marek Olšák	c494cfb1dd	radeonsi: don't invoke si_decompress_depth if textures are not dirty at binding This eliminates the overhead of invoking si_decompress_depth. The complication here is that we need to update needs_depth_decompress_mask every time we update dirty_level_mask. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13492>	2021-10-29 07:14:33 +00:00
Marek Olšák	0e54ac7a3c	winsys/amdgpu: optimize looping inefficiencies in add_bo_fence_dependencies Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:22 +00:00
Marek Olšák	c4ba003e2f	winsys/amdgpu: move BO fence array updates to the CS thread We always wait for num_active_ioctls == 0 before we use the fence, so we can just add fences to BOs in the CS thread. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:22 +00:00
Marek Olšák	67de09acbd	winsys/amdgpu: don't use ip_instance and ring fields of fence and IB structures They are always 0. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:22 +00:00
Marek Olšák	f6d072b2f0	winsys/amdgpu: increase the BO hash list size This decreases overhead inside amdgpu_cs_add_buffer by 40% for viewperf2020/catia. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:22 +00:00
Marek Olšák	a5118bc97d	winsys/amdgpu: don't clear RADEON_USAGE_SYNCHRONIZED for last_added_bo_usage It was breaking the early return path. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:22 +00:00
Marek Olšák	107bc76882	winsys/amdgpu: remove an amdgpu_cs dereference from amdgpu_cs_add_buffer Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	8bb0c09f9e	winsys/amdgpu: simplify parameter passing and derefs in cs_add_buffer Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	61bd8ec043	gallium/radeon: merge BO read/write usage flags with priority flags Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	90ff5ef5c0	gallium/radeon: remove unused RADEON_DEPENDENCY_START_FENCE Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	b5cf0d118c	gallium/radeon: remove/merge some BO priorities and remove holes The upper bits will be used by RADEON_USAGE_* Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	f815009036	gallium/radeon: change the BO priority definitions to bits This is for the next microoptimization. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13478>	2021-10-29 06:54:21 +00:00
Marek Olšák	a0f05a5b20	radeonsi: remove unused parameters in si_emit_draw_packets This is a leftover from GS fast launch and compute-based culling. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13539>	2021-10-29 06:33:29 +00:00
Marek Olšák	98f696c972	radeonsi: enable shader culling for indirect draws It was mistakenly disabled, decreasing performance a lot. Only valid for Mesa 21.3. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Cc: 21.3 <mesa-stable@lists.freedesktop.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13539>	2021-10-29 06:33:29 +00:00
Greg V	98dbd01a96	util: make util_get_process_exec_path work on FreeBSD w/o procfs sysctl is the correct way of getting the current executable's path. procfs is not mounted by default. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1598>	2021-10-29 06:06:05 +00:00
Greg V	1d9eda1b57	util: __getProgramName: remove check for ancient FreeBSD versions, simplify ifdefs FreeBSD 5.0 was released in 2003. We really do not need to check that we're on >= 4.4. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/1598>	2021-10-29 06:06:05 +00:00
Boyuan Zhang	ed5d7987dc	radeon/vcn: combine session init func Combine the session init function for h.264 and hevc to reduce redundancy. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13511>	2021-10-28 23:44:15 +00:00
Boyuan Zhang	ced5a54c13	radeon/vcn: combine encode params func Combine the encode params function for h.264 and hevc to reduce redundancy. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13511>	2021-10-28 23:44:15 +00:00
Boyuan Zhang	49fff27d46	radeon/vcn: remove redundancy for vcn2 enc Remove redundancy functions for vcn2 encode. Re-using the vcn1 quality params function as a result. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13511>	2021-10-28 23:44:15 +00:00
Boyuan Zhang	4abc6d64e7	radeon/vcn: update vcn2 enc interface Add missing parameters according to vcn 2 encode interface. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13511>	2021-10-28 23:44:14 +00:00
Boyuan Zhang	299097d17b	radeon/vcn: update vcn1 enc interface Update vcn 1 encode interface, upgrade interface minor version from 2 to 9, and add necessary parameters accordingly. Signed-off-by: Boyuan Zhang <boyuan.zhang@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13511>	2021-10-28 23:44:14 +00:00
Emma Anholt	8fb850651c	ci: Enable testing radeonsi's libva using libva-util unit tests. We've noticed issues with these tests when uprevving Mesa in Chrome OS. This CI catches some existing failures, and some debug-build assertion failures as well. To do this, uprev deqp-runner for its new gtest-runner command. This runner is not as efficient as I would hope, due to some expensive code in gtest. I've reported the issue to gtest and it should be easily fixable, but for now it at least means we get to use the same baseline/skip/flake handling we have from deqp and piglit runners. I also fixed build-libdrm for our rootfses to not throw away libdrm's share directory, which was causing a bunch of test-time spam from radeon's libdrm when trying to look up its marketing name tables (not that big of a deal for deqp-runner, but really noisy for piglit and libva-utils which make gallium screens approximatly per-test). Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13419>	2021-10-28 23:17:19 +00:00
Connor Abbott	e6ae0e9b95	freedreno/a6xx: Emit GRAS_LRZ_MRT_BUF_INFO_0 Analogous to the previous commit, this fixes the case where turnip sets this reg to a media (yuv) format and then a gallium job is run next. Fixes: `9c895e13` ("tu: Emit GRAS_LRZ_MRT_BUF_INFO_0") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13578>	2021-10-28 22:19:09 +00:00
Connor Abbott	98c1448509	tu: Always write GRAS_LRZ_MRT_BUF_INFO_0 This fixes flakes in dEQP-VK.pipeline.stencil.nocolor.format.* when run after ycbcr tests. Apparently LRZ needs to know if there's a media format enabled even if there are no color attachments, so we need to write something here. Presumably any "normal" format would work but 0 seems like a good neutral choice. Fixes: `9c895e13` ("tu: Emit GRAS_LRZ_MRT_BUF_INFO_0") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13578>	2021-10-28 22:19:09 +00:00
Kenneth Graunke	2f58a63b2f	intel/genxml: Add XY_BLOCK_COPY_BLT on Tigerlake and later. This is a new blitter command introduced on Tigerlake and expanded substantially on XeHP. XY_BLOCK_COPY_BLT is actually fast, unlike the legacy blitter commands. iris will use this in the future, and anv hopefully could use it for a transfer queue someday as well. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13520>	2021-10-28 14:17:32 -07:00
Kenneth Graunke	9163500aa1	intel/genxml: Allow MI_FLUSH_DW on the blitter Pretty sure this is how you flush the blitter. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13520>	2021-10-28 14:17:32 -07:00
Kenneth Graunke	d9ffdfc16d	intel/genxml: Include blitter commands in gen*_pack.h We're going to want to use the blitter again on newer hardware, which means we need to be able to use genxml to emit those commands. Acked-by: Sagar Ghuge <sagar.ghuge@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13520>	2021-10-28 14:17:29 -07:00
Kenneth Graunke	7b78b2fcac	intel/genxml: Assert that all MOCS fields are non-zero on Gfx7+ Let's try and catch performance problems before we have to do large painful amounts of analysis to detect a missed field. Acked-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	ebe2a2b5f6	intel/genxml: Add an field option for nonzero="true" This asserts that the value supplied is non-zero. Useful for things like MOCS fields on modern platforms where we really want to avoid setting it to 0 (uncached). mbz types cannot be flagged as nonzero. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	e6ebf5add7	i965: Set MOCS for Bindless Surface/Sampler State base addresses We don't use bindless surface or sampler states today, and are unlikely to ever implement that in i965, but we can set a MOCS value regardless to avoid asserts in upcoming patches that assert MOCS isn't zero. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	148ea65ee1	i965: Port STATE_BASE_ADDRESS to genxml and fix bugs This largely copies crocus's code for this (but with Gfx9+ handling). This version also fixes missing MOCS settings on several platforms, which we hadn't noticed were missing. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	0a64007676	i965: Fix MOCS for BLORP buffer copies We were passing a MOCS of 0, which is uncached. Yikes. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	75e86afb50	i965: Set MOCS for 3DSTATE_INDEX_BUFFER on Gfx6/7 as well. For some reason we were only setting this on Gfx8+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	ab44a54646	i965: Set MOCS for 3DSTATE_SO_BUFFERS on Gfx7.x too For some reason we were only setting this on Gfx8+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	eaaa3c7e04	i965: Set MOCS on NULL stream output buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is disabled stream output targets, MOCS shouldn't matter, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for these null stream output buffers; we can just assume a MOCS for generic internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	59f53b07c4	i965: Set MOCS for push constant buffers on Haswell and Gfx9+ We set MOCS on Ivybridge/Baytrail, but not Haswell, and not Skylake and later. We shoud set it everywhere. While we're at it, we also set it for null constant buffers, so that we aren't programming a 0 MOCS, to allow us to add some safeguards against that. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	d0e356b333	i965: Set default MOCS for NULL depth/stencil/HiZ buffers isl now uses info->mocs regardless of whether there's any actual depth/stencil/HiZ buffers involved, so pass it a legitimate one, rather than zero. When we have entirely NULL surfaces, we just default to the MOCS value for an internal buffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:56 +00:00
Kenneth Graunke	f0a5f10b6c	i965: Use ISL for MOCS rather than open coding it everywhere The ISL MOCS infrastructure didn't exist when we wrote the i965 code, but now that it does, we ought to use it, deleting a complicated mess that was replicated all throughout the codebase. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	d9decdb2c4	crocus: Fix MOCS for buffer copies. We were passing a MOCS of 0, which is uncached. Yikes. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	dc29e9dbb3	crocus: Set MOCS for 3DSTATE_SO_BUFFERS on Gfx7.x too For some reason we were only setting this on Gfx8+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	72ffcd1965	crocus: Set MOCS for push constant buffers where possible We apparently were not setting MOCS for 3DSTATE_CONSTANT_XS at all. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	798cc4be1b	crocus: Set default MOCS for NULL depth/stencil/HiZ buffers isl now uses info->mocs regardless of whether there's any actual depth/stencil/HiZ buffers involved, so pass it a legitimate one, rather than zero. When we have entirely NULL surfaces, we just default to the MOCS value for an internal buffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	737b3fae73	crocus: Set MOCS on NULL stream output buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is disabled stream output targets, MOCS shouldn't matter, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for these null stream output buffers; we can just assume a MOCS for generic internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	de99b5502b	crocus: Set MOCS for index buffers on Gen6+ For some reason we were only setting them on Gen8+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	be0d22a9ce	crocus: Tidy the ifdefs for emitting STATE_BASE_ADDRESS This reorganizes the code so that we set fields in a tidy order: 1. Set the base addresses 2. Set either buffer sizes (Gfx8) or upper bound values (Gfx4-7) (These are logically the same thing, but expressed differently.) 3. Set MOCS (Gfx6+) I find this easier to follow. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	51f843ad60	crocus: Set MOCS for most state base addresses on pre-Gen8 We were only setting MOCS for dynamic state, surface state, instruction, and indirect base addresses on Gen8+. We should set them on Gen6+. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	dfe455b6df	anv: Set MOCS on NULL stream output buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is disabled stream output targets, MOCS shouldn't matter, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for these null stream output buffers; we can just assume a MOCS for generic internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	818b5d2b9e	anv: Set MOCS on NULL vertex buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is 3DSTATE_VERTEX_BUFFERS where we set NullVertexBuffer. It shouldn't matter here, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for null vertex buffers. We can assume an internal buffer and request isl's vertex buffer MOCS. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	7ae9f516f0	anv: Set MOCS in 3DSTATE_CONSTANT_XS even if there isn't a buffer. This avoids MOCS != 0 assertions in later patches. iris also does this, and we do it for the 3DSTATE_CONSTANT_ALL packet path as well. It's a bit pointless, but it should hopefully be harmless also. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	158ad07b3e	anv: Set MOCS for 3DSTATE_CONSTANT_XS on Gfx7.x as well We were only setting this on Gfx9+. It's MBZ on Gfx8, but it exists on Gfx7.x and doesn't have those restrictions there; we should set it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	b4310ef8ee	anv: Set default MOCS for NULL depth/stencil/HiZ buffers isl now uses info->mocs regardless of whether there's any actual depth/stencil/HiZ buffers involved, so pass it a legitimate one, rather than zero. When we have entirely NULL surfaces, we just default to isl's MOCS value for an internal depth buffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	d8cb76211c	iris: Fix MOCS for buffer copies We were passing a MOCS of 0, which is uncached. Yikes. Fixes: `c5b22441f1` ("iris: Fix buffer -> buffer copy_region") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	256d48eb8c	iris: Set MOCS on NULL stream output buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is disabled stream output targets, MOCS shouldn't matter, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for these null stream output buffers; we can just assume a MOCS for generic internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	d8e1d0fecc	iris: Set MOCS on NULL vertex buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is 3DSTATE_VERTEX_BUFFERS where we set NullVertexBuffer. It shouldn't matter here, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for null vertex buffers. We can assume an internal buffer and request isl's vertex buffer MOCS. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	369cd9ae28	iris: Set MOCS on 3DSTATE_CONSTANT_ALL packets that disable all buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we missed setting a non-zero MOCS was in 3DSTATE_CONSTANT_ALL packets which fully disable all constant buffers. (If any constant buffer was present, we would set an actual MOCS value.) MOCS really shouldn't matter here, as there are no actual constant buffers to be cached. That said, it should be harmless to do so, and we can just assume a generic MOCS for internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	0544afd2df	iris: Set MOCS on 3DSTATE_CONSTANT_XS on Gfx9+ We were leaving this blank due to a Broadwell restriction, causing our constant buffers to be uncached. We later fixed this for Gfx12+, but left Gfx9-11 without a fix. We should specify one. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	8336054024	iris: Set default MOCS for NULL depth/stencil/HiZ buffers isl now uses info->mocs regardless of whether there's any actual depth/stencil/HiZ buffers involved, so pass it a legitimate one, rather than zero. When we have entirely NULL surfaces, we just default to isl's MOCS value for an internal depth buffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	0a5e225779	iris: Set Bindless Sampler State MOCS We don't use bindless sampler states today, but when we do, we'll want them to have proper MOCS values. This also avoids asserts in upcoming patches which enforce that MOCS isn't zero. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	a6690dc1ee	iris: Drop unnecessary parenthesis Trivial. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	fc5168b01d	blorp: Use a non-zero MOCS for disabled constant buffers We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is disabled constant buffers, where MOCS shouldn't matter, as there's no actual buffer to be cached. That said, it should be harmless to set MOCS for these null constant buffers; we can just assume a generic MOCS for internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	28a503c1a1	blorp: Fill in MOCS for null depth/stencil/HiZ buffers. isl now uses info->mocs regardless of whether there's any actual depth/stencil/HiZ buffers involved, so pass it a legitimate one, rather than zero. We just assume a generic internal MOCS when we have entirely NULL surfaces. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	edce4649b8	blorp: Fill in MOCS even for SURFTYPE_NULL surfaces. We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is SURFTYPE_NULL surfaces, where MOCS really shouldn't matter, as there's no actual surface to be cached. That said, it should be harmless to set MOCS for these null surfaces; we can just assume a generic MOCS for internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	c27fcb1d3b	isl: Fill in MOCS for NULL depth, stencil, and HiZ buffers. We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is SURFTYPE_NULL depth, stencil, and HiZ buffers, where MOCS really shouldn't matter, as there's no actual surface to be cached. That said, it should be harmless to set MOCS for these null surfaces. We now set the one provided in info->mocs regardless of whether any buffers actually exist or not. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	801ecb6f12	isl: Fill in MOCS even for SURFTYPE_NULL surfaces. We'd like to add safeguards against accidental use of MOCS 0 (uncached), which can have large performance implications. One case where we use MOCS of 0 is SURFTYPE_NULL surfaces, where MOCS really shouldn't matter, as there's no actual surface to be cached. That said, it should be harmless to set MOCS for these null surfaces; we can just assume a generic MOCS for internal buffers. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	78d2605e57	intel/genxml: Change 3DSTATE_CONSTANT_XS::MOCS to be MBZ on Gfx8. The Broadwell PRM says: "Constant Buffer Object Control State must always be programmed to zero." This patch changes the MOCS field in gen8.xml to be "mbz" type, so that it's impossible to set it to a non-zero value. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	956effb88a	intel/genxml: Drop "Hierarchical Depth Buffer MOCS" field This is redundant with the existing "MOCS" field. We don't need both. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	e69d395cd1	intel/genxml: Add an "mbz" data type There are some fields which Must Be Zero, and we don't want to allow setting them from the template struct, but we do want them in the XML to allow them to be decoded properly, and for documentation purposes. This adds a new "mbz" type, much like "mbo", except it doesn't set anything in the struct. We also update the decoder to handle it. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Kenneth Graunke	58dc7f6ea6	intel/genxml: Fix Indirect Object Access Upper Bound on Gfx4 We had this field mislabeled as "Instruction Access Upper Bound", but instruction state base address doesn't exist until Gfx5. This is supposed to be the upper bound for indirect object base address, matching the G45 copy. Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13480>	2021-10-28 19:45:55 +00:00
Pierre-Eric Pelloux-Prayer	cf76247f38	drirc: enable do_dce_before_clip_cull_analysis for ANSA Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12897>	2021-10-28 18:01:04 +00:00
Pierre-Eric Pelloux-Prayer	95ded68984	glsl/drirc: add an option for gl_ClipVertex / gl_CullDistance checks The GLSL spec says it's an error if a shader statically writes to these 2 variables. Until this commit, Mesa refused to link a shader if it had an unused function writing to one of these variables while another (used) function wrote to the other. This commit adds an option to perform dead function elimination after the intra-stage linking step but before performing these checks. Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12897>	2021-10-28 18:01:04 +00:00
Dylan Baker	c0fc76a172	docs: update calendar and link releases notes for 21.2.5 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13579>	2021-10-28 17:46:35 +00:00
Dylan Baker	c8bf9100cd	docs: add sha256 sums for 21.2.5 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13579>	2021-10-28 17:46:35 +00:00
Dylan Baker	f07a614995	docs: add release notes for 21.2.5 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13579>	2021-10-28 17:46:35 +00:00
Filip Gawin	021ec93273	r300: improve precission of linear interpolation Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13554>	2021-10-28 15:23:47 +00:00
Danylo Piliaiev	aa264ded94	ir3/ra: Check register file upper bound when updating preferred_reg Otherwise we could get invalid reg in get_reg() Would fix many dEQP-VK.ssbo.phys.layout.* Fixes: `0ffcb19b9d` "ir3: Rewrite register allocation" Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13546>	2021-10-28 14:08:43 +00:00
shanshengwang	4f4164d62a	radeon/vce: Limiting max supported refernce frames to 1 for h264 encoding VCE currently restricted max_supported reference frames to 1 Signed-off-by: shanshengwang <shansheng.wang@amd.com> Suggested-by: Suresh Guttula <suresh.guttula@amd.com> Acked-by: Boyuan Zhang <Boyuan.Zhang@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13543>	2021-10-28 13:56:24 +00:00
Samuel Pitoiset	d8e4546707	ac/nir: remove bogus assertion about the position for culling It's undefined to not export a position but some applications rely on that. The position is always initialized to 0,0,0,1 everywhere else if not exported. Fixes KHR-GL46.shader_image_load_store.multiple-uniforms with Zink. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13470>	2021-10-28 10:44:20 +00:00
Lionel Landwerlin	d3b3daa06b	intel/pps: reuse timestamp_frequency from intel_device_info Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13571>	2021-10-28 13:16:56 +03:00
Lionel Landwerlin	43d5b55bc1	intel/pps: provide accurate min sampling period Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13571>	2021-10-28 13:16:56 +03:00
Lionel Landwerlin	3dda80fcf6	intel/dev: printout timestamp period Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13571>	2021-10-28 13:16:56 +03:00
Lionel Landwerlin	127863ddd3	docs: put a list of commands to setup perfetto Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13571>	2021-10-28 13:16:56 +03:00
Lionel Landwerlin	bc0a702c52	pps: add an intel config file It was useful to set a colleague up on perfetto. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13571>	2021-10-28 13:16:56 +03:00
Lionel Landwerlin	32b28f2cfa	pps: remove counter_ids fields Those appear not to be recognized anymore by perfetto. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Antonio Caggiano <antonio.caggiano@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13571>	2021-10-28 13:16:56 +03:00
Jordan Justen	64157c706e	intel/dev/test: Assert (verx10 / 10) == ver Signed-off-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13568>	2021-10-28 09:19:07 +00:00
Rhys Perry	11602d2d36	aco: use std::vector and IDSet in RA validator Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Daniel Schürmann <daniel@schuermann.dev> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13541>	2021-10-28 08:55:35 +00:00
Iago Toral Quiroga	b42f4b8809	broadcom/compiler: padding fixes to QPU assembly dumps When there are dst/src modifiers it is pretty common that instructions take too much space and lead to alignment issues that make code a lot harder to read, so align the MUL and SIG columns a bit wider to avoid this: Before: 0x380021828003faa8 fmax rf2, rf42.abs, rf40.abs; nop 0x3800f186c503f0f0 fcmp.pushc -, rf3, rf48; nop 0x380c038b85b83282 fmax rf11, rf10, rf2; mov.ifa rf14, rf46 0x3800219ab503f359 and rf26, rf13, rf25; nop 0x3820f186c503f2f0 fcmp.pushc -, rf11, rf48; nop ; thrsw 0x382c013fb5b8368e and rf63, rf26, rf14; mov.ifa rf4, rf46; thrsw 0x38002185b503ffc4 and rf5, rf63, rf4 ; nop 0x38002186b503f141 and rf6, rf5, rf1 ; nop 0x382031873503f186 vfpack tlb, rf6, rf6; nop ; thrsw 0x380031873503f18f vfpack tlb, rf6, rf15; nop 0x38003186bb03f000 nop ; nop After: 0x380021828003faa8 fmax rf2, rf42.abs, rf40.abs ; nop 0x3800f186c503f0f0 fcmp.pushc -, rf3, rf48 ; nop 0x380c038b85b83282 fmax rf11, rf10, rf2 ; mov.ifa rf14, rf46 0x3800219ab503f359 and rf26, rf13, rf25 ; nop 0x3820f186c503f2f0 fcmp.pushc -, rf11, rf48 ; nop ; thrsw 0x382c013fb5b8368e and rf63, rf26, rf14 ; mov.ifa rf4, rf46 ; thrsw 0x38002185b503ffc4 and rf5, rf63, rf4 ; nop 0x38002186b503f141 and rf6, rf5, rf1 ; nop 0x382031873503f186 vfpack tlb, rf6, rf6 ; nop ; thrsw 0x380031873503f18f vfpack tlb, rf6, rf15 ; nop 0x38003186bb03f000 nop ; nop Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13545>	2021-10-28 08:12:14 +00:00
Mike Blumenkrantz	3ad06b6949	zink: always use explicit lod for texture() when legal in non-fragment stages implicit lod is something else entirely fixes #5566 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13563>	2021-10-28 02:32:23 +00:00
Mike Blumenkrantz	4d9fc17ae8	zink: set aspectMask for renderpass2 VkAttachmentReference2 structs this is otherwise just garbage fixes #5569 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13561>	2021-10-28 02:16:15 +00:00
Mike Blumenkrantz	c4a513d978	zink: use align64 for allocation sizes avoid 32bit sint overflows fixes #5568 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13560>	2021-10-28 02:01:43 +00:00
Mike Blumenkrantz	2e9e113b7f	zink: cache bo SpvId array types this cuts down on a truckload of useless new validation spam Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13559>	2021-10-28 01:48:17 +00:00
Yiwei Zhang	65abd1d4ae	venus: implement vn_buffer_cache_entries_create 1. advertise high hit rate cache combinations, and we should limit the caches to those only require device memory pool alloc 2. use size = 1 to ask for buffer memory requirements so that we do a sanity check on our assumption of returned size and alignment. For implementations don't meet our assumption, continue without cache. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	af505ff3c8	venus: implement vn_buffer_cache_get_memory_requirements Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	a74f2495ca	venus: implement vn_buffer_get_max_buffer_size This change estimates the max_buffer_size with quick sort. Try to avoid some traffic upon device creation time, but not worth adding a buffer simple create api to avoid the extra requirement query traffic since this is temporary. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	13f2e50aee	venus: add buffer cache init and usage flows 1. struct vn_buffer_cache_entry for buffer memory requirements 2. struct vn_buffer_cache for all buffer related cached info 3. implement vn_buffer_cache_init 4. implement vn_buffer_cache_fini 5. empty vn_buffer_get_max_buffer_size 6. empty vn_buffer_cache_entries_create 7. implement vn_buffer_cache_entries_destroy 8. empty vn_buffer_cache_get_memory_requirements Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	fc237f80c6	venus: add struct vn_image_memory_requirements This aligns with vn_buffer_memory_requirements and can potentially simplify future image memory requirements cache init. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	b108e096d1	venus: add struct vn_buffer_memory_requirements This will simplify later buffer cache api. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	927dea7c34	venus: refactor the ahb buffer mem_type_bits query api Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	34b7d820e2	venus: refactor to add vn_buffer_init Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	df93a8a6dd	venus: refactor to add vn_device_init Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Yiwei Zhang	2a79dfb724	venus: release queues on device creation failure Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13428>	2021-10-28 00:23:14 +00:00
Mike Blumenkrantz	2de6beaa12	zink: add better handling for CUBE_COMPATIBLE bit this check was illegal because the usage bits weren't yet populated, so add another check after usage bits are determined to figure out if CUBE_COMPATIBLE can be applied additionally, checking sample counts was never needed since the spec prohibits CUBE_COMPATIBLE use with multisampling zink DEBUG: ERR: 'Validation Error: [ VUID-vkGetPhysicalDeviceImageFormatProperties-usage-requiredbitmask ] Object 0: VK_NULL_HANDLE, type = VK_OBJECT_TYPE_DEVICE; \| MessageID = 0x991b3105 \| vkGetPhysicalDeviceImageFormatProperties: value of usage must not be 0. The Vulkan spec states: usage must not be 0 (https://www.khronos.org/registry/vulkan/specs/1.2-extensions/html/vkspec.html#VUID-vkGetPhysicalDeviceImageFormatProperties-usage-requiredbitmask)' Fixes: `71494c4874` ("zink: only mark resources as cube-compatible if supported") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12580>	2021-10-28 00:11:24 +00:00
Yiwei Zhang	5a2513a515	venus: assign valid memoryTypeIndex of exportable ahb memory for image The current AHB spec leaves the input memoryTypeIndex undefined when allocating exportable AHB memory backing an external image. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Reviewed-by: Chia-I Wu <olvaffe@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13537>	2021-10-28 00:02:54 +00:00
Bas Nieuwenhuizen	2c43fd4c41	amd/rgp: Use VGH clocks for RGP workaround. Hear that it matters for RGP. This is the most likely scenario where we would hit this workaround, given the tooling for profiling on the deck will set profile_peak as workaround for hangs. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13534>	2021-10-27 21:51:59 +00:00
Emma Anholt	bfbc41a9fa	ci/piglit-runner: Merge piglit-driver-.txt files into driver-.txt. The test names are definitely unique (deqp has specific prefixes, piglit uses '@' as a separator instead of '.'), so we can just have a single file regardless of test type. Merges the two groups of xfails together so you can't mix up which file to edit (I certainly have), and so that we don't need to introduce yet another set of files when we add gtest for libva. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13517>	2021-10-27 20:54:11 +00:00
Emma Anholt	38dff02bfb	ci/deqp-runner: Rename the deqp-drivername-.txt files to drivername-.txt We have two testsuites with the same format for fails/flakes/skips files, and test names that are definitely unique. As I'm about to add a third testsuite (gtest for libva-utils), so let's have just one file each for fails/flakes/skips instead of one per type of testsuite. This starts the move with just the bulk rename of deqp. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Acked-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Acked-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13517>	2021-10-27 20:54:11 +00:00
Paulo Zanoni	3c49916d51	iris: destroy our mutexes a little later While there seems to be no bug with the state things are today, I was recently doing some debugging and put an iris_bo_wait() before a bo_close() in iris_bufmgr_destroy(), which caused an issue since the bo_deps_lock mutex had already been destroyed. Since there are quite a few things we do with the bufmgr after destroying the mutexes, I figured we should probably postpone mutex destruction in order to be a little safer against future code modifications like the one I just did. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13494>	2021-10-27 20:08:51 +00:00
Eric Engestrom	e68616e5e6	docs: update calendar for 21.3.0-rc3 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13557>	2021-10-27 20:45:46 +01:00
Yiwei Zhang	9fa702f28c	venus: refactor private descriptor_set helpers to be private Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Ryan Neph <ryanneph@google.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13522>	2021-10-27 18:33:11 +00:00
Sagar Ghuge	565d65baaf	anv: Enable CCS for storage image formats v2: (Jason Ekstrand) - Restructure if condition. - Add early return. v3: (Felix) - Don't set aux_supported to false for storage image on XeHPG. v4: (Nanley) - Check image view format against fmt_list. - Add helper anv_get_isl_format_with_usage. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3606>	2021-10-27 10:59:56 -07:00
Sagar Ghuge	64ea7e5e33	anv: Pass correct aux usage while filling out surface state While filling out surface state, pass correct aux usage for storage images as we support compression on XeHPG. v2: (Jason Ekstrand) - Move assertion down a bit - Use general layout aux usage Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/3606>	2021-10-27 10:59:56 -07:00
Mike Blumenkrantz	f79a25653b	zink: move all shader bo/sharedmem access to compiler passes this moves more code to nir passes, which makes it easier to debug and also allows deleting some much-more-difficult-to-read ntv code Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	c18413b877	zink: add more glsl base types to get_glsl_basetype() Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	6be3c0f82d	zink: move all 64-32bit shader store rewriting to nir pass this also enables natural 64bit stores on drivers that support it Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	8a98e6fb97	zink: move shared intrinsic offset adjustments to compiler passes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	6d31f4b7b0	zink: move ssbo store offset adjustment to compiler passes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	150d6ee97e	zink: move all 64-32bit shader load rewriting to nir pass this also enables natural 64bit loads on drivers that support it Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	3a1ecd1e8c	zink: run lower_io_to_scalar before rewriting bo access this is happening in ntv anyway, so move it to the compiler here Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	ee22cd619f	zink: move bo load offset adjustment to compiler passes Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	14f7eb9d4c	zink: run optimize_nir() only once during compile running all the optimizer passes repeatedly is stupid Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13484>	2021-10-27 17:13:26 +00:00
Mike Blumenkrantz	16f838576c	nir/lower_io_to_scalar: add support for bo and shared io Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13485>	2021-10-27 16:46:01 +00:00
Emma Anholt	60cb471805	ci/radeonsi: Use a deqp-runner suite suite for stoney. This should make it easier to tune the runtime, and enable KHR-GL* tests in the future. (Not done currently because something in KHR-GL* causes oomkiller). This drops the redundant FDO_CI_CONCURRENT settings, since the default on these boards is 4 anyway. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13504>	2021-10-27 09:19:34 -07:00
Thomas Wagner	4856586ac6	util: use anonymous file for memory fd creation The original implementation in os_memory_fd.c always uses memfds. Replace this by using the already existing os_create_anonymous_file in order to support older systems or systems without memfd. Fixes: `1166ee9caf` ("gallium: add utility and interface for memory fd allocations") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13331>	2021-10-27 15:26:52 +00:00
Rhys Perry	49d290bcf7	radv: don't use a separate cache entry for GS copy shaders This seems simpler and probably faster. This also fixes a warning for these CTS tests: dEQP-VK.pipeline.creation_feedback.graphics_tests.vertex_stage_geometry_stage_delayed_destroy_fragment_stage_delayed_destroy dEQP-VK.pipeline.creation_feedback.graphics_tests.vertex_stage_geometry_stage_fragment_stage because we no longer set found_in_application_cache=false for pipelines with NGG GS. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13528>	2021-10-27 13:34:56 +00:00
Samuel Pitoiset	704340f0f6	radv: fix invalid wait_dst_stage_mask type Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13354>	2021-10-27 07:24:35 +00:00
Jason Ekstrand	0bbb32ece4	glsl/nir/linker: Also remove image variables If we don't, then the array shrinker may shrink them to an array of zero images which can cause GLSL serialization to blow up but only the next time the GLSL shader is loaded from the disk cache. Fixes: `b8ee37472d` ("glsl: Use nir_var_mem_image for images") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5520 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13412>	2021-10-27 01:45:13 -05:00
Iago Toral Quiroga	0a277fabce	broadcom/compiler: fix condition encoding bug When both AC and MC are set, AC is encoded in bits 0..1 not 0..3. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13527>	2021-10-27 06:03:12 +00:00
Iago Toral Quiroga	3fbd6662b7	broadcom/compiler: rework simultaneous peripheral access checks This was not quite correct in that our checks for the allowed cases were not checking that there were no other peripheral access other than the ones allowed. For example, we allowed wrtmuc signal and TMU write other than TMUC, and we also allowed TMU read and VPM read/write. But we cannot allow wrtmuc with TMU write other than TMUC and at the same time a VPM write for example, so we can't just check if we have a combination of allowed peripherals, we still need to check that those are the only ones in use by the combined instructions. Another example is that even if we allow a TMU write (other than TMUC) with a wrtmuc signal, the resulting instruction must still have just one TMU write other than TMUC, but we were allowing the merge if one instruction signaled wrtmuc and the other wrote to tmu other than tmuc without testing if the combined result would have 2 tmu writes. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13527>	2021-10-27 06:03:12 +00:00
Manuel Stoeckl	2c61d89d36	gbm: add GBM_FORMAT_GR1616 and RG1616 Only GR1616 has a corresponding DRI format. Signed-off-by: Manuel Stoeckl <code@mstoeckl.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13501>	2021-10-27 02:53:05 +00:00
Manuel Stoeckl	759eaf517a	gbm: add missing R16 case in gbm_bo_get_bpp Signed-off-by: Manuel Stoeckl <code@mstoeckl.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13523>	2021-10-27 02:28:34 +00:00
Bas Nieuwenhuizen	1fe375e7cf	radv: Add bufferDeviceAddressMultiDevice support. We don't support multiple devices so this is a nop. However, Baldurs Gate 3 enables this and with the new more complete checks this causes device creation to fail. Fixes: `2e5718c957` ("vulkan: provide common functions to check device features") Gitlab: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5509 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13482>	2021-10-27 01:47:58 +00:00
Marek Olšák	e1619b268a	glthread: add a trivial thread-safe way to skip display list execution There are apps that never put state changes into display lists. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13403>	2021-10-27 01:24:03 +00:00
Marek Olšák	c14d755f3d	glthread: add an option to make glCheckFramebufferStatus a no-op Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13403>	2021-10-27 01:24:03 +00:00
Marek Olšák	f4348ef60d	glthread: don't sync for glIsEnabled with a few enums viewperf benefits Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13403>	2021-10-27 01:24:03 +00:00
Marek Olšák	6b370cbe28	glthread: don't execute display lists if they have no effect Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13403>	2021-10-27 01:24:03 +00:00
Mike Blumenkrantz	b0c40bc905	nir/lower_samplers_as_deref: rewrite more image intrinsics "I think we want to lower them." -Jason "And I do know how the pass works" Ekstrand fixes #5540 cc: mesa-stable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13489>	2021-10-27 00:02:22 +00:00
Mike Blumenkrantz	c9ce151ff9	zink: more accurately update samplemask for fs shader keys the fs samplemask needs to be updated on framebuffer rebind and on fs bind to ensure that the key gets updated in time for the pipeline change fixes #5559 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13531>	2021-10-26 23:48:11 +00:00
Mike Blumenkrantz	8899f6a198	zink: fix gl_SampleMaskIn spirv generation the uint[1] -> uint dance is only relevant on the first load, so move the variable type shuffling inside the create block to avoid breaking successive loads fixes #5543 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13488>	2021-10-26 23:34:23 +00:00
Dave Airlie	f42a4f6451	radv: fence->user_ptr and ctx->fence_map are now totally unused. Garbage collect them. Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13525>	2021-10-26 22:22:22 +00:00
Alyssa Rosenzweig	861a35b3bc	mesa: Require MRT support for GL3/ES3 OpenGL 3.0 requires the driver can draw to 8 simultaneous render targets. Similarly, OpenGL ES 3.0 requires the driver can draw to 4 simultaneous render targets. Fix the version computation logic to take this into account. On Mali T720, we support ~all features of OpenGL ES 3.1 except we only support a single render target. Mali T720 should advertise OpenGL 2.1 and OpenGL ES 2.0 only. With the previous logic, it incorrectly advertised OpenGL ES 3.1. v2: Lie about the minimum for GL 3.0 to make freedreno a3xx happy. Add Emma's reviewed-by. v3: Update the Mali T720 CI expectations. There are tests that pass on GLES3 but not GLES2. Unclear if these are dEQP bugs or Mesa bugs, lima hits the same issues. Add them to the known fails Note to mesa-stable maintainers: this downgrades the OpenGL version advertised on Mali T720. As such, this patch should apply to the unreleased 21.3 (Eric) but should NOT be backported to any released Mesa versions (21.2 or older should NOT have this patch). This is a bit of a compromise; Emma agreed with this plan on IRC. Reported-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Emma Anholt <emma@anholt.net> [v2] Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> [v2] Cc: 21.3 mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13455>	2021-10-26 21:53:43 +00:00
Michael Tang	7e26ea84da	microsoft/compiler: Use memcpy instead of a union to write dxil_features Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13496>	2021-10-26 20:27:56 +00:00
Alyssa Rosenzweig	d8b1afdc85	nir/lower_blend: Use correct clamp for SNORM nir_lower_blend was written against the OpenGL ES 3.2 specification, which does not support blending SNORM render targets. The ES spec says that non-floating point buffers get clamped to [0, 1] before blending. The story is not so simple: SNORM buffers are blendable in OpenGL and must clamped to [-1, 1] rather than [0, 1]. Handle this case. NIR does have the fsat_signed_mali instruction to clamp to [-1, 1], but it is only implemented in Panfrost, and this pass is in common code. Open code it instead. Panfrost optimizes the open coded version, so this is good enough. Fixes SNORM subtests of Piglit arb_texture_view-rendering-formats. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13499>	2021-10-26 19:16:36 +00:00
Alyssa Rosenzweig	3b146fb466	panvk: Pass through alpha_zero_nop/one_store flags When constructing the Internal Blend Descriptor for fixed-function blending, set the alpha_zero_nop/one_store flags based on the given equation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13508>	2021-10-26 19:01:47 +00:00
Alyssa Rosenzweig	77928e45eb	panfrost: Pass through alpha_zero_nop/one_store Compute whether these flags can be set when constructing the blend CSO and pass them in the appropriate place in the Internal Blend Descriptor. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13508>	2021-10-26 19:01:47 +00:00
Alyssa Rosenzweig	169aa9f177	panfrost: Test alpha_zero_nop/one_store predicates For each blend mode in our blending unit tests, add whether we can set the alpha_zero_nop and alpha_one_store flags and check against the predicates. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13508>	2021-10-26 19:01:47 +00:00
Alyssa Rosenzweig	c6b2c1069b	panfrost: Add alpha_zero_nop/one_store predicate Some Mali GPUs can avoid storing to the tilebuffer if src alpha = 0, and can replace blending with a store if src alpha = 1. This saves power in the common case of alpha blending. Add predicates to check if these optimizations are valid for a given blend equation. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Acked-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13508>	2021-10-26 19:01:46 +00:00
Alyssa Rosenzweig	87b68e77cc	panfrost: Rename depth bias fields Make it clear that the distinction is the facingness of the primitives the depth bias applies to. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13509>	2021-10-26 18:45:08 +00:00
Sagar Ghuge	29762ea897	iris: Drop hint if primitive id is required or not Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13474>	2021-10-26 18:22:15 +00:00
Sagar Ghuge	1ee043e662	anv: Drop hint if primitive id is required or not Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13474>	2021-10-26 18:22:14 +00:00
Sagar Ghuge	3f33222426	intel/compiler: Track primitive id in domain/evaluation shader Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Suggeted-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13474>	2021-10-26 18:22:14 +00:00
Sagar Ghuge	2b86cf2850	intel/genxml: Add new Primitive ID Not Required bit field to 3DSTATE_DS Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13474>	2021-10-26 18:22:14 +00:00
Mike Blumenkrantz	90228a80ea	zink: don't add dynamic vertex pipeline states if no attribs are used adding the states requires that vertex attribs be bound, but it's illegal to bind 0 attribs cc: mesa-stable fixes #5558 Reviewed-by: Erik Faye-Lund <erik.faye-lund@collabora.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13519>	2021-10-26 18:07:25 +00:00
Caio Marcelo de Oliveira Filho	3072e6e0da	intel/compiler: Don't use SIMD larger than needed for workgroup Unless we are combining multiple workgroups in the same HW thread, there's no advantage of using SIMD16 when SIMD8 already fits the entire workgroup. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13249>	2021-10-26 17:49:09 +00:00
Caio Marcelo de Oliveira Filho	4e7b71e00c	intel/compiler: Use SIMD selection helpers for variable workgroup size Variable workgroup size works by compiling as much SIMD variants as possible and then selecting the right one during dispatch (when the actual workgroup size is passed to us). Instead of replicating the logic in a separate function, reuse the same logic for regular SIMD selection. And move function for that together with the remaining simd selection functions. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13249>	2021-10-26 17:49:09 +00:00
Caio Marcelo de Oliveira Filho	7dda0cf2b8	intel/compiler: Use SIMD selection helpers for CS Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13249>	2021-10-26 17:49:09 +00:00
Caio Marcelo de Oliveira Filho	7558340ebb	intel/compiler: Add helpers to select SIMD for compute shaders Clean up the logic and move it to functions that work with prog_data attributes to select the right SIMD. This shouldn't change any behavior compared to the original. Having it extracted will allow reuse by Task/Mesh and make it easier to write tests. Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13249>	2021-10-26 17:49:09 +00:00
Mike Blumenkrantz	c13da98929	zink: stop exporting PIPE_SHADER_CAP_FP16_DERIVATIVES spirv doesn't support this fixes #5561 cc: mesa-stable Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13530>	2021-10-26 17:33:47 +00:00
Michael Tang	3094524621	microsoft/spirv_to_dxil: turn sysvals into input varyings Fixes: `b47090c5b3` ("spirv: Always declare FragCoord as a sysval") Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13276>	2021-10-26 16:18:09 +00:00
Lionel Landwerlin	a6031cd9bd	anv: fix push constant lowering with bindless shaders Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `9fa1cdfe7f` ("intel/rt: Implement push constants as global memory reads") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13529>	2021-10-26 15:41:43 +00:00
Mike Blumenkrantz	f16961f222	zink: add notes about binding points which aren't counted in util funcs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13490>	2021-10-26 15:07:22 +00:00
Mike Blumenkrantz	0a6f5ec942	zink: don't check rebind count outside of buffer/image rebind function zink_resource_has_binds() only checks descriptor binds, and this doesn't include streamout or fb bindings, so call these functions from the specific rebind points to ensure those cases are also checked fixes #5541 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13490>	2021-10-26 15:07:22 +00:00
Mike Blumenkrantz	1a68f2eb8f	zink: only reset zink_resource::so_valid on buffer rebind otherwise this is going to randomly modify some image properties cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13490>	2021-10-26 15:07:22 +00:00
Mike Blumenkrantz	dabe477b4f	zink: don't break early when applying fb clears a resource can be bound to multiple fb attachments, each with its own clear, so ensure that all of these are applied fixes #5542 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13491>	2021-10-26 14:52:54 +00:00
Mike Blumenkrantz	2a91e83b7f	zink: detect prim type more accurately for tess/gs lines u_reduced_prim() can't determine the output primitive when vs isn't the last vertex stage, so store this from the appropriate shader info and use it when it's available fixes #5547 cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13487>	2021-10-26 14:40:30 +00:00
Mike Blumenkrantz	3a894d2b2f	zink: split out descriptor pool sizing into separate struct this migrates existing uses of zink_descriptor_layout_key to use zink_descriptor_pool_key instead, which ends up being more helpful overall since it puts all the pool-related data into a single struct this also has a(n incredibly small) benefit of removing VkDescriptorPoolSize[6] from every program data struct and deduplicating it into the descriptor pool key set Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13483>	2021-10-26 14:27:01 +00:00
Mike Blumenkrantz	66a0d8204f	zink: reduce hashed region of zink_descriptor_layout_key only the first 3 members of VkDescriptorSetLayoutBinding are useful here Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13483>	2021-10-26 14:27:01 +00:00
Mike Blumenkrantz	103e93cbe6	zink: eliminate a hole in zink_descriptor_layout_key Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13483>	2021-10-26 14:27:01 +00:00
Mike Blumenkrantz	3acab7a24c	zink: rename zink_descriptor_layout_key::num_descriptors -> num_bindings this was always misnamed and thus misleading Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13483>	2021-10-26 14:27:01 +00:00
Jose Fonseca	fb78c2de21	d3d10umd: Update for set_sampler_views take_ownership parameter. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11973>	2021-10-26 14:05:33 +00:00
Jose Fonseca	e69a82f988	d3d10umd: Fix MSVC build. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11973>	2021-10-26 14:05:33 +00:00
Jose Fonseca	cf464affb4	d3d10umd: Update for transfer interface changes. Update for `eb74f97769`. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11973>	2021-10-26 14:05:33 +00:00
Jose Fonseca	b3c1090a5f	d3d10umd: Rename Dxgi.h to DxgiFns.h. To avoid clashing with the WinSDK's dxgi.h header, which we'll need later. Reviewed-by: Roland Scheidegger <sroland@vmware.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11973>	2021-10-26 14:05:33 +00:00
Lionel Landwerlin	d944136f36	vulkan/wsi/wayland: don't expose surface formats not fully supported Depending on whether an application creates a swapchain with VK_COMPOSITE_ALPHA_PRE_MULTIPLIED_BIT_KHR or not, we might use 2 different formats with the compositor. This change makes sure that we support all the underlying formats before exposing the corresponding VkFormat to the application. v2: Don't forget get_formats2() (Ivan) v3: Replace formats with availability boolean (Simon) Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Fixes: `151b65b211` ("vulkan/wsi/wayland: generalize modifier handling") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5522 Reviewed-by: Ivan Briano <ivan.briano@intel.com> (v2) Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13453>	2021-10-26 16:12:06 +03:00
Derek Foreman	1c4f1c2290	panfrost: support PIPE_RESOURCE_PARAM_NPLANES query While panfrost doesn't support multi-planar formats directly, we might still import multi-planar BOs through GBM (for example, when doing direct scanout in weston). We can support PIPE_RESOURCE_PARAM_NPLANES by simply counting the planes we have. Signed-off-by: Derek Foreman <derek.foreman@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13039>	2021-10-26 12:19:11 +00:00
Derek Foreman	f96ad5d71c	panfrost: Support planar formats for scanout While panfrost doesn't directly support planar formats, we can get here from GBM. This happens, for example, when weston is trying to use a BO for direct scanout. Walk the list of planes to find the right one. Signed-off-by: Derek Foreman <derek.foreman@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Simon Ser <contact@emersion.fr> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13039>	2021-10-26 12:19:10 +00:00
Samuel Pitoiset	1cd43ff030	radv: lower the viewport index to zero when the VGT stage doesn't export it This is allowed and the fragment shader should read zero. No fossils db changes. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13222>	2021-10-26 08:37:12 +02:00
Ilia Mirkin	fad6a80635	meson: build freedreno tools when other parts of freedreno not enabled Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13510>	2021-10-26 00:33:31 +00:00
Ilia Mirkin	6c61494771	freedreno: support lua54 This is the default version in gentoo, and it apparently uses the lua54 variant rather than lua. Signed-off-by: Ilia Mirkin <imirkin@alum.mit.edu> Reviewed-by: Emma Anholt <emma@anholt.net> Acked-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13510>	2021-10-26 00:33:31 +00:00
Rob Clark	d2a7afe34d	freedreno/drm: Move suballoc_bo to device Having it in msm_pipe isn't saving any locking. But it does mean that cleanup_fences() can drop the last pipe reference, which in turn drops the last suballoc_bo reference, which can cause recursion back into the bo cache. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5562 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13521>	2021-10-26 00:12:02 +00:00
Rob Clark	2c6fb9780c	freedreno/drm: Add some asserts Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13521>	2021-10-26 00:12:02 +00:00
Marek Olšák	00a1eda61b	mesa: add a no_error path to _mesa_handle_bind_buffer_gen It only helps when it's inlined, which is true for glBindBuffer. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13513>	2021-10-25 19:53:17 +00:00
Marek Olšák	87b9a9667a	mesa: remove redundant flagging USAGE_ARRAY_BUFFER We do that in gl*Pointer, glBindVertexBuffer, etc. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13513>	2021-10-25 19:53:17 +00:00
Marek Olšák	2f059b861e	mesa: move setting USAGE_PIXEL_PACK_BUFFER out of BindBuffer to reduce overhead Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13513>	2021-10-25 19:53:17 +00:00
Marek Olšák	d3a134bbd7	mesa: remove USAGE_ELEMENT_ARRAY_BUFFER because it's unused and adding overhead Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13513>	2021-10-25 19:53:17 +00:00
Boris Brezillon	1813bb5917	vulkan: Fix entrypoint generation when compiling for x86 with MSVC When compiling for x86 with MSVC, Vulkan API entry points follow the __stdcall convention (VKAPI_CALL maps to __stdcall), which uses the following name mangling: _<function_name>@<arguments_size> Fix the vk_entrypoint_stub()/alternatename definitions accordingly. Fixes: `6d44b21d4f` ("vulkan: Fix weak symbol emulation when compiling with MSVC") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13516>	2021-10-25 18:31:22 +00:00
Danylo Piliaiev	b7c7abded7	nir/serialize: Make more space for intrinsic_op allowing 1024 ops We are close to the limit of 512 intrinsics, make more space to be able to support up to 1024 intrinsics. Take one bit from packed_const_indices, they shouldn't suffer in a common case. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13456>	2021-10-25 16:17:09 +00:00
Samuel Pitoiset	dc74285d32	aco: only load streamout buffers if streamout is enabled The streamout_config SGPR is used to determine if streamout is enabled. This fixes a GPU hang with various transform feedback tests: - dEQP-GLES3.functional.transform_feedback.* - KHR-GL46.transform_feedback.api_errors_test - KHR-GL46.draw_indirect.basic-draw*-xfbPaused - KHR-GL46.geometry_shader.api.draw_calls_while_tf_is_paused Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13514>	2021-10-25 13:43:10 +00:00
Samuel Pitoiset	db82d90451	radv: report error messages when the driver can't be initialized Not only with RADV_DEBUG=startup. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13343>	2021-10-25 14:13:46 +02:00
Samuel Pitoiset	4765edb4e0	radv: fix build errors with Android Fixes: `49c3a88fad` ("radv: implement VK_KHR_format_feature_flags2") Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5518 Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13450>	2021-10-25 06:45:47 +00:00
Samuel Pitoiset	ae2881c0f5	radv: remove old RADV_TRACE_FILE warning It has been replaced by RADV_DEBUG=hang a year ago. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13452>	2021-10-25 08:08:52 +02:00
Alyssa Rosenzweig	c2d522b07f	panfrost: Remove duplicated #if Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13208>	2021-10-23 02:21:13 +00:00
Alyssa Rosenzweig	baaa88cf9e	panfrost: Remove ancient TODO This was implemented ages ago. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13208>	2021-10-23 02:21:13 +00:00
Alyssa Rosenzweig	2526f6f229	panfrost: Enable AFBC on v7 The bugs blocking this have been resolved, so flip on AFBC again and get moar fps. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13205>	2021-10-22 21:55:32 -04:00
Alyssa Rosenzweig	789601a189	panfrost: Decompress for incompatible AFBC formats AFBC is keyed to the format. Depending on the hardware, we'll get an Invalid Data Fault or a GPU timeout if we attempt to sample from an AFBC-compressed RGBA8 texture as R32F (for example). Fixes Piglit ./bin/arb_texture_view-rendering-formats_gles3 with AFBC. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13205>	2021-10-22 21:55:32 -04:00
Alyssa Rosenzweig	93c9123c31	panfrost: Add internal afbc_formats We need to know the internal (physical) formats used for AFBC of a given logical format, in order to check format compatibility and determine if we need to decompress AFBC for conformance. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13205>	2021-10-22 21:55:32 -04:00
Alyssa Rosenzweig	342ed4909f	panfrost: Workaround ISSUE_TSIX_2033 According to mali_kbase, all Bifrost and Valhall GPUs are affected by issue TSIX_2033. This hardware bug breaks the INTERSECT frame shader mode when forcing clean_tile_writes. What does that mean? The hardware considers a tile "clean" if it has been cleared but not drawn to. Setting clean_tile_write forces the hardware to write back such "clean" tiles to main memory. Bifrost hardware supports frame shaders, which insert a rectangle into every tile according to a configured rule. Frame shaders are used in Panfrost to implement tile reloads (i.e. LOAD_OP_LOAD). Two modes are relevant to the current discussion: ALWAYS, which always inserts a frame shader, and INTERSECT, which tries to only insert where there is geometry. Normally, we use INTERSECT for tile reloads as it is more efficient than ALWAYS-- it allows us to skip reloads of tiles that are discarded and never written back to memory. From a software perspective, Panfrost's current logic is correct: if we clear, we set clean_tile_writes, else we use an INTERSECT frame shader. There is no software interaction between the two. Unfortunately, there is a hardware interaction. The hardware forces clean_tile_writes in certain circumstances when AFBC is used. Ordinarily, this is a hardware implementation detail and invisible to software. Unfortunately, this implicit clean tile write is enough to trigger the hardware bug when using INTERSECT. As such, we need to detect this case and use ALWAYS instead of INTERSECT for correct results. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13205>	2021-10-22 21:55:32 -04:00
Alyssa Rosenzweig	e0335ad888	panfrost: Fix gl_FragColor lowering The gl_FragColor lowering in the fragment shader depends on the number of render targets, which can change every set_framebuffer_state. set_framebuffer_state thus needs to force a rebind of the fragment shader. Fixes a regression in Piglit fbo-drawbuffers-none gl_FragColor -auto -fbo from enabling AFBC on Mali G52. Fixes: `28ac4d1e00` ("panfrost: Call nir_lower_fragcolor based on key") Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13498>	2021-10-23 00:02:53 +00:00
Alyssa Rosenzweig	be5456e116	panfrost: Remove unused MIDGARD_NO_AFBC quirk This is not Midgard-specific and is handled outside of the quirks infrastructure now. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13497>	2021-10-22 19:33:38 -04:00
Alyssa Rosenzweig	68a7fafe2a	panfrost,panvk: Use dev->has_afbc instead of quirks This uses the new property for AFBC we've added. The AFBC quirk is applied only to v4, and we only set dev->has_afbc on v5+ so this is not a regression. It now respects the hardware-specific AFBC disable. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13497>	2021-10-22 19:33:38 -04:00
Alyssa Rosenzweig	3e168b97cc	panfrost: Detect implementations support AFBC AFBC is an optional feature on Bifrost. If it is missing, a bit will be set in the poorly named AFBC_FEATURES register. Check this so we can act appropriately. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13497>	2021-10-22 19:33:38 -04:00
Emma Anholt	ebe9494b61	turnip: Drop the assertion about the temporary bit in sync fd imports. Khronos's conclusion was that you only need the bit when you want temporary and there's a choice between temporary and permanent. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13473>	2021-10-22 21:44:44 +00:00
Emma Anholt	8ccf672fa3	gallium/u_blitter: Read MSAA z/s from sampler's .x instead of .y or .z. u_format defines depth formats as having depth in .x, mesa/st samples for depth or stencil in .x (not making use of any other channels). util_make_fs_blit_zs() looks for depth or stencil in .x. The MSAA path was the exception looking for it in .z or .y, which was causing drivers to need to splat their values out to the other channels. This should be better on hardware that can emit shorter messages for sampling just the first channels. Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13446>	2021-10-22 18:31:20 +00:00
Paulo Zanoni	785dd68599	iris: also dump bo's imported and exported flags My original patch also aligned the columns for better printing, but Ken's recent suballocation series incorporated those changes. My original patch was also printing the EXEC_OBJECT_ASYNC flag, but this is not possible anymore as we don't have the validation list here. But it's fine since EXEC_OBJECT_ASYNC is conditional on iris_bo_is_external(), which is true for either exported or imported. Example output: BO list (length 13): [ 0]: 8 ( 8) command buffer @ 0xfffffffefffdd000 (system 65536B) 2 refs [ 1]: 1 ( 1) workaround @ 0xfffffffefffff000 (system 4096B) 3 refs [ 2]: 14 ( 14) dynamic state @ 0x00000002fffef000 (system 65536B) 2 refs write [ 3]: 0 ( 10) miptree @ 0xfffffffeffc00000 (system 471104B) 4 refs write [ 4]: 15 ( 15) shader kernels @ 0x00000000ffff7000 (system 16384B) 2 refs [ 5]: 0 ( 13) buffer @ 0xfffffffefe700000 (system 1048576B) 2 refs [ 6]: 4 ( 4) surface state @ 0x00000001fffef000 (system 65536B) 2 refs [ 7]: 3 ( 3) binder @ 0x0000000100000000 (system 65536B) 2 refs [ 8]: 18 ( 18) miptree @ 0xfffffffeffebd000 (system 524288B) 2 refs write exported [ 9]: 11 ( 11) buffer @ 0xfffffffefe800000 (system 20971520B) 2 refs [10]: 0 ( 13) buffer @ 0xfffffffefe600000 (system 1048576B) 2 refs [11]: 12 ( 12) shader kernels @ 0x00000000ffffb000 (system 16384B) 2 refs [12]: 5 ( 5) buffer @ 0xfffffffeffffe000 (system 4096B) 2 refs write Reviewed-by: Marcin Ślusarz <marcin.slusarz@intel.com> Signed-off-by: Paulo Zanoni <paulo.r.zanoni@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12654>	2021-10-22 16:54:42 +00:00
Marek Olšák	520300ad22	st/mesa: don't crash when draw indirect buffer has no storage Fixes: `22f6624ed3` - gallium: separate indirect stuff from pipe_draw_info Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13471>	2021-10-22 16:25:28 +00:00
Matt Turner	2822b1345c	tu: Expose required VK_FORMAT_FEATURE bits for planar YUV formats Specifically this enables these VK_FORMAT_FEATURE bits: VK_FORMAT_FEATURE_TRANSFER_SRC_BIT VK_FORMAT_FEATURE_TRANSFER_DST_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_YCBCR_CONVERSION_LINEAR_FILTER_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_FILTER_MINMAX_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_FILTER_LINEAR_BIT VK_FORMAT_FEATURE_SAMPLED_IMAGE_BIT VK_FORMAT_FEATURE_MIDPOINT_CHROMA_SAMPLES_BIT VK_FORMAT_FEATURE_COSITED_CHROMA_SAMPLES_BIT Fixes the following tests: dEQP-VK.api.info.format_properties.g8_b8_r8_3plane_420_unorm dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm dEQP-VK.api.info.image_format_properties.2d.optimal.g8_b8_r8_3plane_420_unorm dEQP-VK.api.info.image_format_properties.2d.optimal.g8_b8r8_2plane_420_unorm Additionally allows 339 tests in dEQP-VK.ycbcr.* to go from Skip to Pass. [ Connor: Fake support for 3-plane formats, fixup modifiers path ] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:31 +00:00
Jonathan Marek	330a8cfa07	turnip: enable UBWC for NV12 Use the special format for accessing the Y plane of UBWC-enabled NV12, and enable UBWC for NV12. Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:30 +00:00
Connor Abbott	9c895e133b	tu: Emit GRAS_LRZ_MRT_BUF_INFO_0 The blob seems to always emit this, even though it seems to only be used when rendering to the special planar formats (which we only do in the blit path). Based on the LRZ prefix it might used in other cases though. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:30 +00:00
Connor Abbott	c135c2cdb7	freedreno/a6xx: Rename GRAS_2D_BLIT_INFO It's not actually used for 2d blits, it's supposed to mirror RB_MRT_BUF_INFO[0].COLOR_FORMAT and seems to be used only when rendering to the special planar formats, although the blob name seems to suggest it's connected to LRZ. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:30 +00:00
Jonathan Marek	8ea6f17fdf	freedreno/layout: Fix the UBWC block size for the Y plane Reviewed-by: Matt Turner <mattst88@gmail.com> Signed-off-by: Jonathan Marek <jonathan@marek.ca> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:30 +00:00
Matt Turner	e0a74c7cad	util/format: Add PIPE_FORMAT_Y8_UNORM as an "other" layout format freedreno has a different layout for tiled Y8 planes from normal R8 textures, so we need to be able to talk about them separately. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/6792>	2021-10-22 11:25:30 +00:00
Iago Toral Quiroga	ceaf56920c	v3dv: refactor TFU jobs We had an implementation for image copies and another for buffer to image copies. Refactor the code so we have a single implementation of this. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13481>	2021-10-22 11:05:33 +00:00
Tapani Pälli	1465ec8cf3	iris: clear bos_written when resetting a batch This fixes dEQP-EGL.functional.sharing.gles2.multithread.* tests that are hitting: "iris: Failed to submit batchbuffer: Invalid argument" error. v2: clear on reset rather than clear 'on-the-fly' (Kenneth Graunke) Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5537 Fixes: `e4c3d3efc7` ("iris: Defer construction of the validation (exec_object2) list") Signed-off-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13464>	2021-10-22 09:37:50 +00:00
Samuel Pitoiset	b6a69dbb40	radv: re-emit prolog inputs when the nontrivial divisors state changed If the application first uses nontrivial divisors, the driver emits the vertex shader VA to the upload BO rather than directly via the user SGPRs locations. But, if the vertex input dynamic state changes, the driver might select a different VS prolog that no longer needs nontrivial divisors. In this case, the driver needs to re-emit the prolog inputs because otherwise the VS prolog will jump to the PC that is emitted via the user SGPR locations, and the previous one was somewhere in the upload BO... This fixes a GPU hang with Bioshock and Zink. Fixes: `d9c7a17542` ("radv: enable VK_EXT_vertex_input_dynamic_state") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13377>	2021-10-22 09:47:50 +02:00
Samuel Pitoiset	8ec6824335	radv,aco: decouple shader_info/options from radv_shader_args Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13287>	2021-10-22 07:10:40 +00:00
Kenneth Graunke	1429feaf29	crocus: Replace devinfo->ver[x10] checks with GFX_VER[x10] These files are compiled per-generation, so we can just use the #define instead of the actual field dereference to allow the compiler to dead code eliminate whole paths. Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13475>	2021-10-22 06:19:15 +00:00
Iago Toral Quiroga	1561d0126a	broadcom/compiler: fix assert that current instruction must be in current block This was not considering the possibility that the driver has called nir_before_block() or nir_after_block() to update the cursor, in which case the cursor link points to the instruction list header and not to an actual instruction. Fixes incorrect debug-assert crash in: dEQP-VK.graphicsfuzz.cov-increment-vector-component-with-matrix-copy Fixes: `265515fa62` ("broadcom/compiler: check instruction belongs to current block") Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13467>	2021-10-22 05:39:05 +00:00
Kenneth Graunke	e79e1ca304	intel: Drop Tigerlake revision 0 workarounds Tigerlake revision 0 is an early stepping that should not be used in production anywhere, so this code was only used for hardware bringup. We can drop the unnecessary workarounds. This also keeps them from triggering on early steppings of other Gfx12 parts. Reviewed-by: Francisco Jerez <currojerez@riseup.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13266>	2021-10-21 16:53:43 -07:00
Marek Olšák	6ef192bddf	mesa: discard draws with count=0 to decrease overhead Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13445>	2021-10-21 21:16:13 +00:00
Nanley Chery	7daff157bb	iris: Refactor the assignment to possible_usages * Make the outer if-ladder dependent on the has_* variables. * Make the possible_usages assignments happen at the same nesting level. * Move the combined HIZ/MCS assert closer to relevant if-else blocks. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11960>	2021-10-21 20:21:26 +00:00
Nanley Chery	114f87c1c7	iris: Set DISABLE_AUX_BIT for AUX_USAGE_NONE modifiers This avoids unnecessary surface padding on TGL+. Also, drop some of the logic to handle modifiers in iris_resource_configure_aux as the bit now causes it to be handled implicitly. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11960>	2021-10-21 20:21:26 +00:00
Nanley Chery	b9d8793646	iris: Disable the MC_CCS modifier with norbc We generally try to disable CCS whenever the norbc debug flag is set. Also, this enables simplifying iris_resource_configure_aux later on. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11960>	2021-10-21 20:21:26 +00:00
Nanley Chery	b71264e465	iris: Convert some mod_info checks to asserts Depth and multisample images aren't supported with modifiers. So, instead of checking for the absence of modifiers before adding HiZ or MCS, simply assert that they aren't present at a more convenient time. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11960>	2021-10-21 20:21:26 +00:00
Rob Clark	138be96301	freedreno/ir3: Fix validation of subgroup macros They don't need to enforce that src types are all the same. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	e68d918ffb	freedreno/ir3: Get req_local_mem from pipe_compute_state mesa/st initializes req_local_mem to shader->info.shared_size. But for clover the shared size doesn't come from the shader. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	f58438320c	freedreno/ir3: Add ihadd/uhadd Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	f5ce806ed7	freedreno/ir3: Add wide load/store lowering Lower load/store for vectors wider than 4. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	7a7ac8cd40	freedreno/ir3: Fix reg size validation 8b types also live in half-regs Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	8a6934dfe8	freedreno/ir3: Fix load/store_global_ir3 type Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	81eefe0090	freedreno/ir3: 8bit fixes Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	f7b2d613c5	freedreno/ir3: 16b bools A create_immed_for_instr() type thing could be useful to make the immed type match other src(s) for instruction.. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	9a2562a545	freedreno/ir3: Deal with zero-source instructions Needed by the next patch, which starts treating bools as 16bit exposing a bug that was previously accidentially hidden for instructions like ELECT_MACRO. Needed for next patch. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	b6e11225a2	freedreno: Fix set_global_binding The gallium interface is a bit awkward, but pointer sizes are actually 64b despite what the API suggests. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	0a35ba5c43	freedreno/ir3: Move lower_idiv_options Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	e544a9db16	freedreno/ir3: Add support for load_kernel_input Used for function arguments to compute kernels (ie. OpenCL). Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	e10c76d277	freedreno/ir3: implement load_work_dim intrinsic Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	3bd265a393	freedreno/ir3: vec8+vec16 support Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	f5bbf77be8	freedreno: implement set_compute_state() This interface should really go away, but for now just implement it to directly update constant state (ie. what clover should be doing instead) Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	1e9f27f37f	freedreno/ir3: Handle MESA_SHADER_KERNEL Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	83a1bca952	freedreno: Skip built-in shaders for clover Avoids assert: ../src/compiler/glsl_types.cpp:1134: static const glsl_type glsl_type::get_array_instance(const glsl_type , unsigned int, unsigned int): Assertion `glsl_type_users > 0' failed. caused by us trying to compile built-in shaders (ie. clear, gmem<->mem, etc) before clover has initialized glsl_types. But we don't need these shaders for compute-only contexts. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	dff8a0c4cb	isaspec: inherite parent's bitset gpu gen requirements Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	d77b9fb518	isaspec: Fix gpu_id for default_options We forgot to set this. It starts to matter in the next patch, otherwise pre-pass to detect branch targets (needed for backwards jumps/branches) will not work. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	c4e7541b9d	freedreno/ir3: use stg.a/ldg.a only if offset is reg or doesn't fit Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	064c806d23	freedreno/ir3: Add load/store_global lowering Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	d85eb9268a	freedreno/ir3: set proper dst size for {store,load}_{global,shared}_ir3 We want to pass 64b variables. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	1ef43a0be7	freedreno/ir3: disallow immediate addr/offset for ldg/ldg.a Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	f45b7c58c4	freedreno/ir3: Lower 64b phis Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	bee9212efb	ir3/freedreno: add 64b undef lowering Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Rob Clark	2d65e6f56d	freedreno/ir3: 64b intrinsic lowering Both for OpenCL and VK_KHR_buffer_device_address Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Danylo Piliaiev	1eee1fda11	nir/lower_amul: do not lower 64bit amul to imul24 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13300>	2021-10-21 18:59:57 +00:00
Sagar Ghuge	b83c9b21a6	intel/compiler: Set correct cache policy for A64 byte scattered read This doesn't impact any performance since the previous typo value matches the current cache control value. Signed-off-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13458>	2021-10-21 17:32:23 +00:00
Marek Olšák	272af39be1	amd/addrlib: cosmetic addrlib update Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13459>	2021-10-21 16:26:06 +00:00
Marek Olšák	69a1b02b68	amd/addrlib: change how the license is formatted to match internal tree It's the same MIT license. Acked-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13459>	2021-10-21 16:26:06 +00:00
Sajeesh Sidharthan	cf0bc4fb55	frontends/va/av1: handle multiple slice params Multiple slice params in a single vaRenderPicture function call is not handled. This patch will fix overwriting slice params when multiple slice params received in one buffer. Change-Id: I880df5bc35dfbd64382a178074482548882ee4af Signed-off-by: Sajeesh Sidharthan <sajeesh.sidharthan@amd.com> Reviewed-by: Leo Liu <leo.liu@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13463>	2021-10-21 16:12:30 +00:00
Samuel Pitoiset	996e81fb70	aco: fix loading 64-bit inputs with fragment shaders Fixes a bunch of 64-bit IO tests with piglit and Zink. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13454>	2021-10-21 12:50:55 +02:00
Iago Toral Quiroga	75bd37dc6a	broadcom/compiler: disallow tsy barrier in thrsw delay slots A TSY barrier becomes effective at the point of the next thread switch, so if we have one coming after a previous thread switch we need to be careful not to emit it in its delay slots, or we would be effectively moving the barrier earlier than intended. Fixes simulator assert crash in: dEQP-VK.graphicsfuzz.two-for-loops-with-barrier-function Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13468>	2021-10-21 12:40:00 +02:00
Emma Anholt	9202e8cbaf	turnip: Make copy_format() and tu6_plane_format() return pipe_format [ Connor: Keep the argument to copy_format() a VkFormat, fold in plane_format() conversion. ] Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13379>	2021-10-21 08:46:31 +00:00
Emma Anholt	68f8bbb37e	util: Move freedreno's snorm-to-unorm to util/, adding remaining cases. I want it in turnip too. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13379>	2021-10-21 08:46:31 +00:00
Emma Anholt	cbdc8e09bf	turnip: Switch format_to_ifmt() to take a pipe_format. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13379>	2021-10-21 08:46:31 +00:00
Emma Anholt	e4e8db0132	turnip: Switch tu6_format_color() to a pipe_format. To handle Y8 specially, we want a PIPE_FORMAT instead of VK_FORMAT. There are some redundant vk-to-pipe conversions, but they're going to go away shortly. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13379>	2021-10-21 08:46:31 +00:00
Emma Anholt	3b68fc0c6a	turnip: Switch tu6_format_texture() to a pipe_format. To handle Y8 specially, we want a PIPE_FORMAT instead of VK_FORMAT. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13379>	2021-10-21 08:46:31 +00:00
Connor Abbott	cfabdbd7d3	tu/clear_blit: Move around copy_format()/tu6_plane_format() We want these functions to take a Vulkan format and return a pipe_format, but tu6_plane_format() was getting redundantly called on the result of copy_format() and copy_format() was also getting called twice with image to image copies. Pull these functions further up the call chain so that they're only called once. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13379>	2021-10-21 08:46:31 +00:00
Iago Toral Quiroga	acb83e1b13	v3dv: enable Vulkan 1.1 Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13465>	2021-10-21 10:12:38 +02:00
Emma Anholt	bd81a23620	ci/piglit-runner: Fix funny indentation of the piglit-runner command. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	440f207a1f	ci/deqp-runner: Move more non-suite logic under the non-suite 'if'. Changing these variables won't do anything for you otherwise. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com>. Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	92748e40ef	ci/deqp-runner: Don't start GPU hang detection for making junit results. It's just CPU-side post-processing, not running tests. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	61ca900b69	ci/deqp-runner: Drop LD_LIBRARY_PATH=/usr/local for libkms workaround. deqp hasn't been linking against that in quite some time. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	899174c210	ci/deqp-runner: Move remaining asan runs to --env LD_PRELOAD= This should improve their reliability and speed a little by getting deqp-runner off of asan. This removes the last jobs setting TEST_LD_PRELOAD, so remove passing that variable around from other scripts. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	37c690ad1a	ci/deqp-runner: Drop silly CSV env vars. One was unused, the other was used once. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	b978688df6	ci/deqp-runner: Use new deqp-runner's built-in renderer/version checks. This is prettier in the log files, less shell code, and for non-suite mode adds checking that the driver has the right git sha1. Also, no need for suites to have a DEQP_VER to say which dEQP we should run for the renderer check. The version checks can help us make sure that GL version exposed doesn't accidentally regress, and the ".*git" checks that we're using a git version of Mesa rather than something that snuck in through distro packages. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	9ddfd297e0	ci/deqp-runner: Simplify the --jobs argument setup. We can use the general "how parallel should we go on this runner?" env var and save a bunch of massaging env var names. Fixes how PIGLIT_PARALLEL looked like it was useful but actually wasn't passed through to HW runners. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Emma Anholt	59f3a8e6b4	ci/deqp-runner: Drop SUMMARY_LIMIT env var. Nobody uses it any more, and you could just put it in DEQP_OPTIONS. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-by: Tomeu Vizoso <tomeu.vizoso@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13372>	2021-10-21 07:34:19 +00:00
Vinson Lee	670fd8123b	radv: Fix memory leak on error path. Fix defect reported by Coverity Scan. Resource leak (RESOURCE_LEAK) leaked_storage: Variable prolog going out of scope leaks the storage it points to Fixes: `80841196b2` ("radv: implement dynamic vertex input state using vertex shader prologs") Suggested-by: Rhys Perry <pendingchaos02@gmail.com> Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13402>	2021-10-21 07:14:40 +00:00
Samuel Pitoiset	b797ecac7a	ac/rgp: remove useless code related to GFX6-7 RGP only supports GFX8+. RADV doesn't allow SQTT on < GFX8 and RadeonSI only allows it on GFX9+. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13451>	2021-10-21 06:44:50 +00:00
Samuel Pitoiset	8304392c35	radv: add an assertion to prevent GPU hangs when VRS isn't supported Just hit this case with a buggy CTS test. Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13361>	2021-10-21 08:07:54 +02:00
Caio Marcelo de Oliveira Filho	9a32a7fdfe	util: Move test sources to tests/ directory Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	abf2af64ac	util: Convert sparse array multithread test to use gtest Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	213c9e944c	util: Convert roundeven_test to use gtest Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	83449f61ba	util: Convert rb_tree_test to gtest Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	0d36ea7d58	util: Convert mesa-sha1_test to use gtest Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	89eebca057	util: Convert blob_test to use gtest Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	d4c536d3d9	util: Convert u_atomic_test to use gtest Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	1d78a31bec	util: Move tests in single file directories to tests/ Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Caio Marcelo de Oliveira Filho	2209f5794d	util: Consolidate existing gtests in a single binary Acked-by: Emma Anholt <emma@anholt.net> Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13425>	2021-10-20 21:40:31 -07:00
Eric Engestrom	60768f4029	docs: update calendar for 21.3.0-rc2 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13460>	2021-10-20 20:51:28 +01:00
Caio Marcelo de Oliveira Filho	662fbc0120	nir: Use a single binary for gtests Less artifacts and less time running linker. The load_store_vectorizer test is still split since we need to update gitlab-ci scripts to skip certain tests in certain builds. Added a TODO with the concrete suggestion. Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13414>	2021-10-20 18:26:31 +00:00
Caio Marcelo de Oliveira Filho	8cb7d6f81b	spirv: Use a single binary for gtests Less artifacts and less time running linker. Also set the guideline for future tests to not create new binaries for extra gtests. Reviewed-by: Dylan Baker <dylan@pnwbakers.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13415>	2021-10-20 17:55:36 +00:00
Jason Ekstrand	39f2594531	anv: Implement VK_EXT_global_priority_query Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/11250>	2021-10-20 15:51:59 +00:00
Connor Abbott	e7599f09a1	ir3: Use stp/ldp base offset for {load,store}_scratch When we have a series of loads/stores we were creating a constant for each one, which isn't great. Furthermore, because the nir pass puts the offset constant at the top of the shader, it resulted in extra register pressure and spilling when that happened inside a loop. Fix this by using the base/offset form of stp and ldp. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13307>	2021-10-20 15:19:15 +00:00
Connor Abbott	7deb0d296d	ir3/cse: Support mov instructions This doesn't affect shader-db at all, but it will help clean up the mov's emitted in the next commit when there are multiple ldp/stp. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13307>	2021-10-20 15:19:15 +00:00
Rhys Perry	cd3f0683cd	aco: simplify emit_stream_output() Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13438>	2021-10-20 15:00:23 +01:00
Alejandro Piñeiro	d50be41f8f	broadcom/compiler: remove unused macro and function definition Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13444>	2021-10-20 10:08:27 +00:00
Rhys Perry	9bc0fc89c8	aco: disable mul(cndmask(0, 1, b), a) optimization sometimes This optimization doesn't work for SDWA or DPP multiplications and we can't do it if denormal flushing is required because v_cndmask_b32 doesn't do that and we can't do it if we can't assume operands are finite because 0.0 * inf is NaN, not 0. Signed-off-by: Rhys Perry <pendingchaos02@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13434>	2021-10-20 07:35:52 +00:00
Mike Blumenkrantz	86b3d8c66c	zink: rescue surfaces/bufferviews for cache hits during deletion this is a wild race condition, but it's possible for these to get their final unref, enter their destructor, and then get a cache hit while waiting on the lock to remove themselves from the cache in such a scenario, a second, normal check of the refcount will suffice, as the increment is atomic, and the value will otherwise be zero fixes crashes in basemark cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13442>	2021-10-19 19:43:19 +00:00
Emma Anholt	80d5e40fd1	freedreno/afuc: Disable the disassembler on 32-bit builds. There's an mmap(2 << 32), which armhf can't handle. Fixes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5514 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13421>	2021-10-19 18:55:07 +00:00
Mykhailo Skorokhodov	5afce85f2b	Revert "iris: add tile cache flush to iris_copy_region" This reverts commit `27534a49cf` Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12979>	2021-10-19 18:22:31 +00:00
Mykhailo Skorokhodov	0523607ebb	iris: Add missed tile flush flag Without adding `PIPE_CONTROL_TILE_CACHE_FLUSH` into `iris_emit_pipe_control` gen12+ (UHD 750 in my case) has issues with textures. Related-to: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5029 Fixes: c85ea824('iris: reduce redundant tile cache flushes') Signed-off-by: Mykhailo Skorokhodov <mykhailo.skorokhodov@globallogic.com> Reviewed-by: Felix DeGrood <felix.j.degrood@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12979>	2021-10-19 18:22:31 +00:00
Mike Blumenkrantz	8633ce06af	zink: stop leaking descriptor pool references this never really mattered before, but now these need to actually get freed Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Mike Blumenkrantz	7fb8e0b9fb	zink: don't clear descriptor pool cache on context destroy the final ref on pools is owned by their program struct(s), and liveshader cache can trigger shader deletion after a context is destroyed, so attempting to prune pools here may end up deleting them before the last ref is actually removed Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Mike Blumenkrantz	6a852e4e06	zink: always invalidate descriptor sets on pool free this used to be bad and only for debugging, but now it's good and useful for saving memory Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Mike Blumenkrantz	94fc6b0875	zink: unref descriptor pools in hybrid mode when they explode these will no longer be used, so unref them so they can be deleted to free up some vram in the driver Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Mike Blumenkrantz	d065294434	zink: remove descriptor pools from hash table on deletion ensure these aren't just sitting around stale Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Mike Blumenkrantz	6d93729881	zink: fix descriptor interface param for program_deinit this should match the _init() method and take the context Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Mike Blumenkrantz	7a9e0e4fc0	zink: use ctx params for program ref/destroy functions these are context objects, so destroy with the context Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13430>	2021-10-19 18:07:00 +00:00
Marek Olšák	d29e507adc	radeonsi: don't set inline_uniforms for viewperf because it's enabled by default Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:30 +00:00
Marek Olšák	c5f39acb33	winsys/amdgpu: set max_ib_size and max_check_space_size later in cs_check_space If there is enough CS space, don't update them. This eliminates some overhead. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:30 +00:00
Marek Olšák	6129db68bf	winsys/amdgpu: remove force_chaining parameter from cs_check_space it's always false Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:30 +00:00
Marek Olšák	9d852a4695	radeonsi: properly destroy buffers on failure Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	0d2dc06761	radeonsi: don't sync before clear_buffer and copy_buffer if the buffer is idle Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	d4cf4b3cee	radeonsi: don't update bind_history for internal buffer clears and copies Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	61ebdcfc29	radeonsi: don't sync PS or CS before (clear\|copy)_buffer based on bind history Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	4bc8c2590e	radeonsi: rebind a buffer only in shader stages where it's been bound Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	13b1424e96	radeonsi: change bind_history to track usage in each shader stage Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	9f2a97e9df	radeonsi: add an option to use CPU storage uploads for threaded context It's only enabled for viewperf for now. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	745ea99484	radeonsi: add SI_MAX_VRAM_MAP_SIZE definition Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13400>	2021-10-19 16:53:29 +00:00
Marek Olšák	03186773a6	mesa: fix crashes in the no_error path of glUniform Fixes: `bd2662bfa1` - mesa: add KHR_no_error support to glUniform*() functions Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13417>	2021-10-19 16:24:31 +00:00
Rob Clark	5948ff4826	freedreno/computerator: Fix mergedregs This was getting set after ir3_shader_assemble, which was too late. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13426>	2021-10-19 16:04:42 +00:00
Rob Clark	22a203aa4c	freedreno/isa: Fix ldg/stg "halfness" Whether the load dst or store src is a half reg is determined by the type field, similar to cat5. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13426>	2021-10-19 16:04:42 +00:00
Rob Clark	834e8066c1	freedreno/ir3/tests: Add some 8/16b ldg/stg tests Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13426>	2021-10-19 16:04:42 +00:00
Rob Clark	8657e201d0	freedreno/ir3/tests: Don't skip encode test if decode fails Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13426>	2021-10-19 16:04:42 +00:00
Samuel Pitoiset	572a902566	aco: fix emitting stream outputs when the first component isn't zero Fixes a bunch of XFB piglit tests with Zink. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13437>	2021-10-19 15:19:42 +00:00
Samuel Pitoiset	e3cbb0eb6a	aco: fix invalid IR generated for b2f64 when the dest is a VGPR Fixes few 64-bit piglit tests with Zink. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13435>	2021-10-19 14:47:32 +00:00
Marek Olšák	3df9d8ed80	gallium/u_threaded: implement pipelined partial buffer uploads using CPU storage This removes resource_copy_region for BufferSubData. Drivers have to opt in to use this. See the comment in the header file. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13399>	2021-10-19 13:12:37 +00:00
Marek Olšák	cc2f3a0168	gallium,vbo: add PIPE_BIND_VERTEX_STATE for display lists Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13399>	2021-10-19 13:12:37 +00:00
Marek Olšák	5ee2965283	ac/llvm: accept primitives whose face culling determinant is Inf or NaN Based on https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13299/diffs Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13380>	2021-10-19 12:49:06 +00:00
Marek Olšák	efaab0ec50	ac/llvm: add helper ac_build_is_inf_or_nan Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13380>	2021-10-19 12:49:06 +00:00
Marek Olšák	5e8f76b713	ac/llvm: use fmac instead of mul+sub in face culling Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13380>	2021-10-19 12:49:05 +00:00
Samuel Pitoiset	19c91a120d	radv: do not remove PSIZ for streamout shaders It might still be read later from the streamout buffer. Fixes a regression with ext_transform_feedback-builtin-varyings gl_PointSize and Zink. Fixes: `92e1981a80` ("radv: Remove PSIZ output when it isn't needed.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13413>	2021-10-19 12:29:40 +00:00
Jan Beich	60b7c3a0f4	meson: disable -Werror=thread-safety on FreeBSD Annotated <pthread.h> exposes too many errors in Mesa that are non-trivial to fix and keep working without FreeBSD CI. Fixes: `0d5fe24c9b` ("macros: Add thread-safety annotation macros") Reviewed-by: Eric Engestrom <eric@engestrom.ch> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/9168>	2021-10-19 06:59:32 +00:00
Dave Airlie	37d6ce4ebb	llvmpipe: swizzle image stores for CL BGRA OpenCL requires image stores to BGRA to work, so add the swizzle code here. Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13404>	2021-10-19 03:11:08 +00:00
Mike Blumenkrantz	86b24fba05	zink: align pipe_resource and sampler_view allocations to cachelines this eliminates false sharing for atomics Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13431>	2021-10-19 02:40:37 +00:00
Mike Blumenkrantz	89ed9ed400	zink: don't ralloc zink_resource structs the hash tables can just have their own contexts and be freed explicitly Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13431>	2021-10-19 02:40:37 +00:00
Mike Blumenkrantz	4cad3da409	lavapipe: clamp attachment clear rect sizes this is a spec violation, but crashing isn't cool, so do a little clamping Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	67dd6b9f0f	lavapipe: pull layer count from render state during resolve vk_framebuffer may be null Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	f25a98318b	lavapipe: remove lvp_subpass::has_color_att unused Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	f7e9500dc2	lavapipe: simplify some attachment derefs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	d914df72ab	lavapipe: store subpass directly to rendering_state this is fewer array derefs Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	a0c81efcf4	lavapipe: remove last VK_ATTACHMENT_UNUSED check these are all just null checks now Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	58e18a2be9	lavapipe: remove lvp_subpass_attachment and use lvp_render_pass_attachment refs this is one fewer indirect reference that allows removing a bunch of stuff from renderpass creation now also unused subpass attachments are just NULL pointers, making detection simpler Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	5e3bc8b18d	lavapipe: remove lvp_subpass::max_sample_count Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	5e3d84e30a	lavapipe: add attachment index to lvp_render_pass_attachment Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	5cf568ce09	lavapipe: remove lvp_subpass_attachment::layout unused Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	eaa82252f9	lavapipe: remove lvp_subpass_attachment::in_render_loop this isn't used Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	afd8820d66	lavapipe: use framebuffer attachment_count member instead of renderpass according to spec, these must be equal Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	dd70ff3b8c	lavapipe: remove some unused struct members Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Mike Blumenkrantz	d103d5bb5d	lavapipe: stop reading renderpass during pipeline creation this is unnecessary and is going to be annoying in the future Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13349>	2021-10-19 01:18:24 +00:00
Dave Airlie	cae1ef0a11	clover: use max shader sampler view/images queries for clover. This is probably sane than my last answer to this question Reviewed-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13398>	2021-10-19 01:03:58 +00:00
Mike Blumenkrantz	dfd0f5dbfd	zink: move last of lazy descriptor state updating back to lazy-only code hybrid mode is controlled by the caching manager, so state tracking is irrelevant Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13350>	2021-10-19 00:51:50 +00:00
Mike Blumenkrantz	140d3ea8c6	zink: add an early return for zink_descriptors_update_lazy_masked() no point in generating pools/sets that won't be used here Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13350>	2021-10-19 00:51:50 +00:00
Mike Blumenkrantz	7c840f5103	zink: move push descriptor updating into lazy-only codepath this was a bit confusing to read, and I originally left it in the hybrid path to enable fallbacks to push descriptors in hybrid mode. the problem with that idea is that it's impossible: the constant buffer set is the one set that will never, ever trigger a fallback, so leaving it there just leaves room for error and confusion Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13350>	2021-10-19 00:51:50 +00:00
Mike Blumenkrantz	b140d58b1f	zink: don't update lazy descriptor states in hybrid mode I'm not 100% sure how, but this breaks tomb raider Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13350>	2021-10-19 00:51:50 +00:00
Mike Blumenkrantz	75e51138b1	zink: assert compute descriptor key is valid before hashing it Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13350>	2021-10-19 00:51:50 +00:00
Mike Blumenkrantz	497ce3c38a	zink: clear descriptor refs on buffer replacement the bo here can only ever be destroyed before it gets reused, so prune it from the descriptor cache immediately Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13350>	2021-10-19 00:51:50 +00:00
Mike Blumenkrantz	e66558985a	zink: fully zero surface creation struct gotta get those holes for caching cc: mesa-stable Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13410>	2021-10-19 00:33:50 +00:00
Mike Blumenkrantz	a2789fde0c	zink: add a read barrier for indirect dispatch using the draw stage here doesn't make much sense, but that's what the spec says, so let's git er done fixes dEQP-GL45.functional.compute.indirect_dispatch* on radv Cc: mesa-stable Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13381>	2021-10-19 00:17:00 +00:00
Mike Blumenkrantz	11dd9e4ee4	zink: use static array for detecting VK_TIME_DOMAIN_DEVICE_EXT there's only a few possible values for this, so just use a static array to avoid leaking Fixes: `039078fe97` ("zink: slight refactor of load_device_extensions()") Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13360>	2021-10-19 00:02:39 +00:00
Neha Bhende	061610a7dd	st: Fix comments in commit `be6d584de4` This patch is adding comments as suggested by Ilia in MR Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13423>	2021-10-18 21:30:03 +00:00
Karol Herbst	753f595e3d	clover/api: fix clGetMemObjectInfo for images Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13424>	2021-10-19 06:55:38 +10:00
Karol Herbst	b4f9b15dd0	clover/formats: pass in cl_mem_flags for better format checking This allows us to advertise more formats depending on how the image is getting used inside a kernel. Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13424>	2021-10-19 06:55:34 +10:00
Karol Herbst	f09e6c1c5f	clover/format: Full rework on how we declare supported images. While at it also remove CL_LUMINANCE and CL_INTENSITY, which are optional but also quite broken. Also advertize all formats we can already support and make the list easier to read. Also adds support for newer formats. v2: fixup packing for non-8 bits (airlied) Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13424>	2021-10-19 06:55:22 +10:00
Neha Bhende	be6d584de4	st: Fix 64-bit vertex attrib index for TGSI path Patch `77c2b022a0` removed lowering of 64-bit vertex attribs to 32bits. This has thrown TGSI translation off the guard for 64bit attrib. This lead to fail/crash of 1000+ piglit tests. This patch basically fixes 64 bit attrib index for TGSI shader by adding placeholder for second part of a double attribute. It fixes all regressed piglit tests. A big help from Charmaine to fix this regression Reviewed-by: Charmaine Lee <charmainel@vmware.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Reviewed-by: Ilia Mirkin <imirkin@alum.mit.edu> Fixes: `77c2b022a0` ("st/mesa: remove lowering of 64-bit vertex attribs to 32 bits") Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13363>	2021-10-18 19:02:55 +00:00
Marek Olšák	e65d6f45d2	radeonsi: reorder and don't print patch level DRM version in the renderer string Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13392>	2021-10-18 18:37:09 +00:00
Marek Olšák	f9d7db0262	ac,radeonsi: print a lowercase codename in the renderer string to make it stand out less Reviewed-by: Zoltán Böszörményi <zboszor@gmail.com> Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13392>	2021-10-18 18:37:09 +00:00
Marek Olšák	cbcdcd42fc	radeonsi: enable shader culling on Navi1x consumer SKUs as well Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13393>	2021-10-18 18:08:59 +00:00
Marek Olšák	8cf802e8ef	radeonsi: replace the GS prolog with a monolithic shader variant It only exists because of the hw bug and is used very rarely. Let's simplify it. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13393>	2021-10-18 18:08:59 +00:00
Marek Olšák	62798d2c1f	radeonsi: don't pass NULL into si_get_nir_shader so that we always have the shader key there Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13393>	2021-10-18 18:08:59 +00:00
Danylo Piliaiev	3350957f3c	drirc: Apply vk_dont_care_as_load workaround to Forsaken Remastered Game has one renderpass which loads attachment with DONT_CARE, does nothing, and writes it back. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5437 Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13367>	2021-10-18 17:26:41 +00:00
Danylo Piliaiev	ebca227db1	turnip: implement vk_dont_care_as_load workaround Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13367>	2021-10-18 17:26:41 +00:00
Danylo Piliaiev	3d69800a0b	driconf: add vk_dont_care_as_load workaround option It's easy to make a mistake of using VK_ATTACHMENT_LOAD_OP_DONT_CARE when LOAD_OP_LOAD should be used since all desktop GPUs do the same thing (nothing) for both of them. However tiler GPUs do actually care about them. Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Reviewed-by: Rob Clark <robdclark@chromium.org> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13367>	2021-10-18 17:26:41 +00:00
Danylo Piliaiev	fd31989ecb	turnip: add support for dirconf Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13367>	2021-10-18 17:26:41 +00:00
Samuel Pitoiset	5b797bd485	radv: fix OpImageQuerySamples with non-zero descriptor set The descriptor set was always 0 because it wasn't gathered by the shader info pass. This fixes CPU crashes with arb_shader_texture_image_samples-builtin-image and Zink. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13411>	2021-10-18 16:47:50 +00:00
Connor Abbott	de568c3b2c	tu/clear_blit: Stop creating a franken-image for staging blits Extricate the last use of tu_image_*. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:39 +00:00
Connor Abbott	9803c1aa10	tu: Remove cross-check scaffolding Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:39 +00:00
Connor Abbott	d785aea530	tu: Switch clear/blit to fdl6_view and cross-check This will help us create staging resources with a Y8 format and avoids calling into the Vulkan-level entrypoints which will have to be changed to use vk_image and vk_image_view. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:39 +00:00
Connor Abbott	1874e12f19	tu: Use fdl6_view in tu_image_view and cross-check Because some of the fields aren't filled out when a format doesn't support rendering, we temporarily clear the structure so that we can compare. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:39 +00:00
Connor Abbott	5509132a80	freedreno/fdl: Add fdl6_view This is mostly based on tu_image_view. The notable difference is that we don't handle choosing the correct plane out of multiple planes when indicated by the aspect, which means that there is no equivalent of VK_IMAGE_ASPECT_PLANE_1 etc. This is expected to be done in the driver, and note that freedreno gallium handles this very differently anyway. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:38 +00:00
Connor Abbott	464b9d6bf1	freedreno/fdl: Add mip_level to fdl_layout We need this when calculating the descriptors in the image view. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:38 +00:00
Connor Abbott	7bcccd1f08	freedreno/fdl: Constify fdl6_get_ubwc_blockwidth() Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:38 +00:00
Connor Abbott	7a90aa8d2e	vk/format, v3dv: Add a vulkan -> pipe swizzle helper And use it to replace the one in v3dv. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13359>	2021-10-18 16:00:38 +00:00
Pierre-Eric Pelloux-Prayer	7a2e40df5e	Revert "gallium: add a is_dri_blit_image bool to pipe_blit_info" This reverts commit `22a1b7c5b3`. The only use was radeonsi and it switched to PIPE_BIND_DRI_PRIME instead. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 17:16:53 +02:00
Pierre-Eric Pelloux-Prayer	ec2eff8f38	radeonsi: use PIPE_BIND_DRI_PRIME instead of is_dri_blit_image Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 17:16:53 +02:00
Pierre-Eric Pelloux-Prayer	1863b761a6	radeonsi/gfx10.3: enable SDMA for DRI_PRIME copies Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 17:16:53 +02:00
Pierre-Eric Pelloux-Prayer	8791e831b1	winsys/amdgpu: add uncached flag to the imported DRI_PRIME buffer There's no point in caching this linear buffer since we're only ever writing to it. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 17:16:53 +02:00
Pierre-Eric Pelloux-Prayer	a905072521	radeon_winsys.h: add a parameter to buffer_from_handle Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 17:16:53 +02:00
Pierre-Eric Pelloux-Prayer	e9c3dbd046	gallium/dri: let the driver know if the imported image is DRI_PRIME buffer Use createImageFromFds2 together with __DRI_IMAGE_PRIME_LINEAR_BUFFER, so the driver's resource_from_handle hook will be aware that this specific image is the linear buffer. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 16:24:00 +02:00
Pierre-Eric Pelloux-Prayer	7a5de84249	gallium/dri: add createImageFromFds2 Same as createImageFromFds but with added flags so the caller can give the driver more context. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 16:23:58 +02:00
Pierre-Eric Pelloux-Prayer	48551a1807	gallium/dri: replace bool with flag parameter This will allow to pass more information. Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13362>	2021-10-18 16:23:56 +02:00
Witold Baryluk	ae525da0e4	zink: Fully initialize VkBufferViewCreateInfo for hashing Makes hashing achieve higher hit rate, and valgrind happier. Reviewed-by: Hoe Hao Cheng <haochengho12907@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13371>	2021-10-18 13:53:26 +00:00
Juan A. Suarez Romero	4878e35159	v3dv/ci: update expected results Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13407>	2021-10-18 11:48:30 +00:00
Pierre-Eric Pelloux-Prayer	234c69f600	radeonsi: use viewport offset in quant_mode determination Instead of only using the viewport extent. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5344 Cc: mesa-stable Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13382>	2021-10-18 11:15:54 +00:00
Vinson Lee	9eb010ee1e	anv: Fix assertion. Fix defect reported by Coverity Scan. Assign instead of compare (PW.ASSIGN_WHERE_COMPARE_MEANT) assign_where_compare_meant: use of "=" where "==" may have been intended Fixes: `35315c68a5` ("anv: Use the common wrapper for GetPhysicalDeviceFormatProperties") Signed-off-by: Vinson Lee <vlee@freedesktop.org> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13395>	2021-10-18 09:04:47 +00:00
Samuel Pitoiset	61be0bd34b	radv: fix removing PSIZ when it's not emitted by the last VGT stage This dereferences a NULL pointer and crash many tests with Zink. Fixes: `92e1981a80` ("radv: Remove PSIZ output when it isn't needed.") Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Timur Kristóf <timur.kristof@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13378>	2021-10-18 08:54:53 +02:00
Aaron Watry	91ff83b6c8	clover/image: add dimension property With that we can fix CL_IMAGE_HEIGHT and CL_IMAGE_DEPTH. v2 (Karol Herbst): split up commit (Serge Martin): convert to virtual method Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:28:20 +10:00
Edward O'Callaghan	786987c447	clover: Implement CL_MEM_OBJECT_IMAGE1D_ARRAY v2: Consider surface height as valid when unused by using 1. Fixup width boundary checking. v3 (Karol): Pull in changes from later commits Fix validation Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:24:15 +10:00
Edward O'Callaghan	3200669c2b	clover: Implement CL_MEM_OBJECT_IMAGE1D_BUFFER v2: Consider surface height as valid when unused by using 1. Fixup width boundary checking. v3 (Karol): Pull in changes from later commits v4:(airlied): use max_buffer_size as the limit (Fixes CTS test) Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:23:41 +10:00
Edward O'Callaghan	0ec5e50d8a	clover: Implement CL_MEM_OBJECT_IMAGE2D_ARRAY v2: Ensure we pass in row_pitch state as well. v3 (Karol): Pull in changes from later commits Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:23:00 +10:00
Aaron Watry	0abfbb76ff	clover: implement CL_IMAGE_BUFFER We will also need it to implement image1Dbuffer_t v2 (Karol Herbst): extracted from other commit Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:22:41 +10:00
Edward O'Callaghan	3298ee546e	clover/images: Add array_size to implement CL_IMAGE_ARRAY_SIZE This will be needed to implement array immages. v2 (Karol Herbst): Extracted from other commit Fix clEnqueueMapImage for arrays Add some basic support for image arrays Signed-off-by: Edward O'Callaghan <funfunctor@folklore1984.net> Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:22:21 +10:00
Karol Herbst	029f22e430	clover/image: add templated basic_image class to simplify image subclassing Signed-off-by: Karol Herbst <kherbst@redhat.com> Reviewed-by: Dave Airlie <airlied@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:22:01 +10:00
Karol Herbst	f6ecd284e5	spirv: Don't add 0.5 to array indicies for OpImageSampleExplicitLod This fixes CLs 1.2 1Darray and 2Darray images. Fixes: `589d918a4f` ("spirv: Add 0.5 to integer coordinates for OpImageSampleExplicitLod") Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Signed-off-by: Karol Herbst <kherbst@redhat.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13401>	2021-10-18 12:21:52 +10:00
Juan A. Suarez Romero	ab2cfeba48	vc4/ci: update expected results Signed-off-by: Juan A. Suarez Romero <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13397>	2021-10-17 23:41:02 +02:00
Dave Airlie	17a565e0cf	llvmpipe: fix userptr for texture resources. This is needed for CL image hostptr support, but it's possible it could hit these paths from GL/Vulkan Fixes: `9a57dceeb7` ("llvmpipe: add support for user memory pointers") Reviewed-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13375>	2021-10-18 06:30:07 +10:00
Alyssa Rosenzweig	d31ca63527	panfrost: Don't allow rendering/texturing 48-bit Matches freedreno. Fixes crashes in Piglit arb_texture_view. Signed-off-by: Alyssa Rosenzweig <alyssa@collabora.com> Cc: mesa-stable Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13394>	2021-10-16 23:49:13 +00:00
Derek Foreman	28d12716e8	egl/wayland: Properly clear stale buffers on resize The following chain of events results in an incorrectly sized buffer persisting beyond its useful lifetime, and causing visual artifacts. buffer is attached at size A window is resized to size B rendering takes place for size B window is resized back to size A swapbuffers with damage is called In this scenario, update_buffers fails to recognize that the surface it's about to commit is a different size than it has rendered. The attached_width and attached_height are set incorrectly, and periodic flickering is observed. Instead, we set a boolean flag at time of resize and use this at the time we latch the window dimensions as surface dimensions to decide whether to discard stale buffers. Signed-off-by: Derek Foreman <derek.foreman@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13270>	2021-10-16 19:13:58 +00:00
Marek Olšák	885f9b3b75	radeonsi: don't memcmp inlined uniform values if uniform inlining is disabled This uses a C++ template to compute the memcmp size at compile time, which is important for getting inlined memcmp. There are 4 different key sizes now: GE with inlined uniforms: 68 bytes GE without inlined uniforms: 52 bytes PS with inlined uniforms: 28 bytes PS without inlined uniforms: 12 bytes Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13285>	2021-10-16 10:41:51 +00:00
Marek Olšák	8c5a32b5fe	radeonsi: split si_shader_key into ps and ge parts to minimize memcmp overhead ps is for the pixel shader, while ge is for VS, TCS, TES, and GS. si_shader_key: 68 bytes si_shader_key_ge: 68 bytes si_shader_key_ps: 28 bytes The only notable change is that si_shader_select_with_key is changed to a C++ template. Other changes are trivial. Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13285>	2021-10-16 10:41:51 +00:00
Marek Olšák	385c9e1caf	radeonsi: si_state_shaders.c -> cpp We'll add some templates here. Why is `extern "C"` not needed for exported functions? Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13285>	2021-10-16 10:41:51 +00:00
Marek Olšák	8a42ea69a6	gallium/util: add some extern "C" guards Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13285>	2021-10-16 10:41:51 +00:00
Jason Ekstrand	b62b2fa4b9	compiler/types: Add a wrap_in_arrays helper This has been copied+pasted 3 times now. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	5818d47ae6	spirv: Use texture types for sampled images Instead of using gsamplerND types for sampled images, use the new gtextureND types for sampled images and reserve gsamplerND for combined image+samplers. Combined image+sampler bindings still get a gsamplerND type. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	99cda38c81	clover/nir: Don't remove texture variables Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	3c398139e1	lavapipe: Allow for texture types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	b8a0bf2343	nir/deref: Also optimize samplerND -> textureND casts Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	2ab5546a96	nir: Allow texture types Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	3ace6b968b	compiler/types: Add a texture type This is separate from images and samplers. It's a texture (not a storage image) without a sampler. We also add C-visible helpers to convert between sampler and image types. Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	7558c9cb07	compiler/types: Unify the guts of get_sampler/image_count Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Jason Ekstrand	175f33e88f	compiler/types: Combine image and sampler type serialization Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13389>	2021-10-16 05:49:34 +00:00
Yiwei Zhang	2d58e31f10	dri_interface: remove gl header Only gl typedefs are used. So just remove the header and update the types to the underlying types. Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13387>	2021-10-16 04:48:40 +00:00
Yiwei Zhang	e19d9046db	dri_interface: remove obsolete interfaces Below are removed: __DRI_FRAME_TRACKING __DRI_TEX_OFFSET __DRI_GET_DRAWABLE_INFO Signed-off-by: Yiwei Zhang <zzyiwei@chromium.org> Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Emma Anholt <emma@anholt.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13387>	2021-10-16 04:48:39 +00:00
Jason Ekstrand	d343aef942	nir/serialize: Pack deref modes better With nir_var_image, we've now run out of bits in our packed blob for deref instructions. We could revert to an unpacked blob or we could be a bit more clever about how we encode deref modes and pack them into 5 bits. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13386>	2021-10-16 03:47:10 +00:00
Jason Ekstrand	9272a952c9	nir: Re-arrange the variable modes Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13386>	2021-10-16 03:47:10 +00:00
Jason Ekstrand	956199e870	nir: s/nir_var_mem_image/nir_var_image/g We typically use nir_var_mem_* for stuff that has an explicit byte-based memory layout. Images are opaque. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13386>	2021-10-16 03:47:10 +00:00
Dylan Baker	e73096bd6d	meson: use gtest protocol for gtest based tests when possible With the `gtest` protocol meson will add some extra arguments to the test to generate better junit results, which may be useful. This protocol is only available in meson 0.55.0+, so keep using the default `exitcode` protocol for meson older than that. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/8484>	2021-10-16 03:22:24 +00:00
Enrico Galli	aac47c4b24	microsoft/compiler: Shadow tex instructions always use shadow samplers Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13321>	2021-10-16 00:12:04 +00:00
Mike Blumenkrantz	fe2674dd52	aux/pb: more correctly check number of reclaims the increment needs to happen before the comparison here Fixes: `3d6c8829f5` ("aux/pb: add a tolerance for reclaim failure") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13388>	2021-10-15 23:36:48 +00:00
Jason Ekstrand	58f605e4d4	nir: Drop our attempt at typed-based image mode validation This is broken for bindless images declared as local variables. It turns out nir_variable::data::bindless is only used for uniforms and we already assume anything in nir_var_function_temp or similar is bindless. We could try to make a tricky assert but now that we have everything else passing but now that we've got everyone converted the extra validation probably isn't necessary. Acked-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13384>	2021-10-15 22:35:59 +00:00
Marcin Ślusarz	d05f7b4a2c	intel: fix INTEL_DEBUG environment variable on 32-bit systems INTEL_DEBUG is defined (since `4015e1876a`) as: #define INTEL_DEBUG __builtin_expect(intel_debug, 0) which unfortunately chops off upper 32 bits from intel_debug on platforms where sizeof(long) != sizeof(uint64_t) because __builtin_expect is defined only for the long type. Fix this by changing the definition of INTEL_DEBUG to be function-like macro with "flags" argument. New definition returns 0 or 1 when any of the flags match. Most of the changes in this commit were generated using: for c in `git grep INTEL_DEBUG \| grep "&" \| grep -v i915 \| awk -F: '{print $1}' \| sort \| uniq`; do perl -pi -e "s/INTEL_DEBUG & ([A-Z0-9a-z_]+)/INTEL_DBG(\1)/" $c perl -pi -e "s/INTEL_DEBUG & ($[A-Z0-9_ \|]+$)/INTEL_DBG\1/" $c done but it didn't handle all cases and required minor cleanups (like removal of round brackets which were not needed anymore). Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13334>	2021-10-15 19:55:14 +00:00
Mike Blumenkrantz	182237e1e8	virgl: remove unused pipebuffer include Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13385>	2021-10-15 19:03:46 +00:00
Mike Blumenkrantz	3d6c8829f5	aux/pb: add a tolerance for reclaim failure originally, a slab attempts to reclaim a single bo. there are two outcomes to this which can occur: * the bo is reclaimed * the bo is not reclaimed if the bo is reclaimed, great. if the bo is not reclaimed, it remains at the head of the list until it can be reclaimed. this means that any bo with a "long" work queue which makes it into a slab will effectively kill the entire slab. in a benchmarking scenario, this can occur in rapid succession, and every slab will get 1-2 suballocations before it reaches a bo that blocks long enough for a new slab to be needed. the inevitable result of this scenario is that all memory is depleted almost instantly, all because pb assumes that if the first bo in the reclaim list isn't ready, none of them can be ready for drivers like radeonsi, this happens to be a fine assumption for drivers like zink, this is entirely not workable and explodes the gpu Cc: mesa-stable Reviewed-by: Witold Baryluk <witold.baryluk@gmail.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Tested-by: Witold Baryluk <witold.baryluk@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13345>	2021-10-15 17:46:51 +00:00
Caio Marcelo de Oliveira Filho	29177c7cee	intel/compiler: Build all tests in a single binary With gtest is possible to filter execution and run only a specific test suite or individual test, so there's no particular reason here to generate multiple binaries for the tests of a single module. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13303>	2021-10-15 10:06:51 -07:00
Caio Marcelo de Oliveira Filho	35b6990706	intel/compiler: Rename vec4 test fixtures Include vec4 in their names to avoid same names as the fs counterparts. This will allow compiling all the tests together in the future. Reviewed-by: Ian Romanick <ian.d.romanick@intel.com> Acked-by: Matt Turner <mattst88@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13303>	2021-10-15 10:06:51 -07:00
Rob Clark	0480595d03	freedreno/isa: Add immed reg accessors This way we can assert that a src that we expect to be an immediate actually is. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	e08d152d68	isaspec: Add bitfield size assertions Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	4166635bd1	isaspec: Do not emit duplicate field encodes If an <override> overrides the definition of a field, don't emit encoding for both the override's definition and the fallback. (See "SAMP" in #cat5-src3). It is harmless currently, because (in this case) it will just re-encode the low bits of "SAMP". But when we start asserting on that the field being encoded fits in the allowed number of bits, the re-encoding of the fallback field definition will start triggering asserts. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	e01759e6f3	isaspec: Fix derived field width The low/high bit positions should be integers. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	5b6e5db5d0	freedreno/ir3: Don't lower s2en if samp/tex is too large We only have four bits to encode an immediate samp/tex. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	bfd8b7c930	freedreno/ir3/tests: Add additional disasm test vectors Add branch with negative offset, and a couple others to trigger issues I found while adding pack_field() overflow asserts. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	c0ecfeb023	freedreno/ir3/tests: Fix indentation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	8b0550f09f	freedreno/isa: Fixes for validation Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Rob Clark	9516d8ce98	freedreno/ir3+isa: Cleanup bindless cat5 samp/tex encoding Don't let the way they are encoded at the isa level leak thru to the ir3 level. Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13353>	2021-10-15 15:52:33 +00:00
Jason Ekstrand	d43f89f17a	ir3: Images are always nir_var_mem_image Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jesse Natalie	9601556079	microsoft/clc: Images use nir_var_mem_image The only big change is that lower_vars_to_explicit no longer assigns a driver_location for images. That means that the storage for the format/order loads is no longer implicitly "allocated" in the middle of the kernel args. Instead, manually add the storage for that to the end of the input args buffer. Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	26d603da07	nir/gl_nir_lower_images: Require nir_var_mem_image Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	e6cce80976	intel/fs: Stop emitting TGM fences for nir_var_mem_ssbo Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	8ab40f517f	aco: Split var_mem_image barrier handling from global/ssbo Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	4c5a88d735	nir: Validate image variable modes We can also significantly simplify the foreach_image_variable helper. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	97a7c0ab1b	st/pbo: Use nir_var_mem_image for images Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Rhys Perry	458b4d2095	radv: Use nir_var_mem_image in meta shaders Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	9f51fda92c	ttn: Use nir_var_mem_image Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	b8ee37472d	glsl: Use nir_var_mem_image for images We don't use it for bindless images because the uniforms in that case just contain a bindless handle and aren't an actual image. Bound images, on the other hand, go in the nir_var_mem_image class. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Caio Marcelo de Oliveira Filho	cfdc7ee066	spirv: Use nir_var_mem_image Use the new nir_var_mem_image mode for images that are not known to be used with a sampler (i.e. storage images). Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	219ac26ea3	spirv: Assert that OpTypeForwardPointer only points to structs Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	e87dbfd3e8	ir3: Check for nir_var_mem_image in shared_barrier handling Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	ae58894ee7	zink: Images can live in nir_var_mem_image now Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	d68bedbb45	clover: Use nir_foreach_image_variable for images This splits image and sampler handling into two separate loops. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	aefa22ddb5	clover: Insert dummy uniform variables for images Instead of making images have a well-defined size, insert a dummy variable of the appropriate type which we can use for the parameter block layout. This will work much better when we switch over to nir_var_mem_image. Acked-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	6818811fc4	nir/lower_readonly_images_to_tex: Also rewrite variable modes Storage images will start using nir_var_mem_image but sampled images still use nir_var_uniform. If we're going to rewrite types, we need to rewrite the modes as well. Otherwise, nir_validate will get grumpy and drivers might get confused. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	225caf537a	llvmpipe: Support image variables living in nir_var_mem_image Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	d84fd86af1	ntt: Separate image and sampler handling Use nir_foreach_image_variable for images so we survive the coming refactor where they get their own mode. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	12b3ffe400	st/nir: Assign uniform locations to nir_var_mem_image vars Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	b1385f3c87	nir/gl_nir_lower_images: Support nir_var_mem_image Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	7bdae87b93	nir/gl_nir_lower_samplers_as_deref: Support nir_var_mem_image Contrary to the name of the pass, it also handles storage images so we need to support nir_var_mem_image. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	c0d8dc13e0	glsl/nir_linker: nir_var_mem_image is also a GL uniform Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	94b9f25883	aco: Add support for nir_var_mem_image Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:56 +00:00
Jason Ekstrand	cd49706cb1	amd/llvm/nir: Add support for nir_var_mem_image Reviewed-by: Rhys Perry <pendingchaos02@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Caio Marcelo de Oliveira Filho	26582db077	anv: Use nir_foreach_image_variable Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Jason Ekstrand	ff39916ce7	i965/uniforms: Handle images as a separate pass Instead of walking all uniforms and handling images as a special case, walk "normal" uniforms first and images as a second pass. This lets us use nir_foreach_image_variable which will survive the upcoming refactor. While we're at it, use nir_foreach_image_variable in brw_nir_lower_gl_images too. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Caio Marcelo de Oliveira Filho	2d7065ef04	intel/fs: Consider nir_var_mem_image for TGM fences Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Jason Ekstrand	2a53c33fbe	nir: Add a nir_foreach_image_variable() iterator Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Caio Marcelo de Oliveira Filho	de3705edb0	nir: Add nir_var_mem_image Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Caio Marcelo de Oliveira Filho	872750bb96	nir/schedule: Handle nir_intrisic_scoped_barrier Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/4743>	2021-10-15 14:58:55 +00:00
Rob Clark	73d6e153eb	freedreno: Fix for large epilogues Apparently Rocket League overflows the fixed size epilogue. Switch it to be growable. Closes: #5493 Signed-off-by: Rob Clark <robdclark@chromium.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13365>	2021-10-15 14:30:51 +00:00
Ella-0	9ee060b614	v3dv: enable VK_KHR_swapchain_mutable_format Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13297>	2021-10-15 08:36:36 +00:00
Samuel Pitoiset	aac4e1f822	aco: do not return an empty string when disassembly is not supported Fixes dEQP-VK.pipeline.executable_properties.* on GFX6-7 when clrxdisasm isn't found. Other generations are also affected if RADV is built without LLVM. Cc: 21.3 mesa-stable Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Tony Wasserka <tony.wasserka@gmx.de> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13333>	2021-10-15 09:48:29 +02:00
Marcin Ślusarz	5387522bd0	iris: fix scratch address patching for TESS_EVAL stage Scratch patching code in iris_upload_dirty_render_state (see MERGE_SCRATCH_ADDR calls) assumes that in all shader stages derived_data field stores 3DSTATE_XS packet first. This is not true for TESS_EVAL (DS), so we end up patching 3DSTATE_TE instead of 3DSTATE_DS leading to DWordLength becoming 11 instead of 9 (9 == 3DSTATE_DS.DWordLength, 2 == 3DSTATE_TE.DWordLength, and 9\|2 == 11), and hardware hanging on the next instruction. Fix this by reversing the order of packets for TESS_EVAL stage. Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5499 Fixes: `4256f7ed58` ("iris: Fill out scratch base address dynamically") Signed-off-by: Marcin Ślusarz <marcin.slusarz@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13358>	2021-10-15 07:07:51 +00:00
Dave Airlie	7681500ead	crocus: Delete the MI_COPY_MEM_MEM resource_copy_region implementation. (ported from iris - airlied) The MI_COPY_MEM_MEM version of resource_copy_region has known bugs: It's failing to set valid_buffer_range correctly It's missing iris_emit_buffer_barrier_for() for the source/destination, so there may be missing flushes. There are some bad interactions with the tile cache and VF using L3. Even with those fixed, if you expand the "no more than 16 bytes" restriction to allow copies up to 1024 bytes, then it starts failing Piglit tests on Icelake. We could probably fix this. However, I had originally only measured a 0.689096% +/- 0.473968% (n=4) speedup in Shadow of Mordor's OpenGL port, which is already fairly small, especially before adding missing flushes. Further, some of that likely came from not switching between render and compute...which we'll soon be able to avoid thanks to BLOCS. Folks were also worried that MI_COPY_MEM_MEM can't be pipelined, and that stalling the command streamer may actually slow things down, especially as the GPUs become more powerful. We aren't really sure about this, but it's another concern. So, let's just get rid of this optimization. It seemed like a good idea at the time, but it's just causing issues for very little gain. Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13374>	2021-10-15 16:47:24 +10:00
Maniraj D	796c9ab3fd	egl: set TSD as NULL after deinit When eglReleaseThread() is called from application's destructor (API with __attribute__((destructor))), it crashes due to invalid memory access. In this case, _egl_TLS is freed in the flow of _eglAtExit() as below but _egl_TLS is not set to NULL. _eglDestroyThreadInfo _eglFiniTSD _eglAtExit _run_exit_handlers exit Later when the eglReleaseThread is called from application's destructor, it ends-up accessing the freed _egl_TLS pointer. eglReleaseThread -> in libEGL_mesa eglReleaseThread -> in libEGL(glvnd) destructor() -> App's destructor To resolve the invalid access, setting the _egl_TLS pointer as NULL after freeing it. Reviewed-by: Eric Engestrom <eric@engestrom.ch> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Cc: mesa-stable Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5466 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13302>	2021-10-15 06:22:13 +00:00
Ella-0	835b98e101	v3dv: implement VK_EXT_host_query_reset Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13319>	2021-10-15 05:36:42 +00:00
Jason Ekstrand	393fda2d34	i965: Emit a NULL surface for buffer textures with no buffer This is a preexisting bug but it was uncovered by `231653ea35` ("intel/isl: Add a max_buffer_size limit to isl_device") which added an assert(num_elements > 0) for typed buffers. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13351>	2021-10-15 03:50:58 +00:00
Witold Baryluk	4d777631b5	zink: Do not access just freed zink_batch_state Cc: mesa-stable Reviewed-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13370>	2021-10-14 23:02:38 +00:00
Clayton Craft	b2ef7e6d6b	anv: don't advertise vk conformance on GPUs that aren't conformant This sets the conformance version to 0.0.0.0 for GPUs that have incomplete support for vulkan, so that it's easier to check if vulkan is fully supported by a GPU at runtime for applications/libraries. $ vulkaninfo\|grep conf MESA-INTEL: warning: Ivy Bridge Vulkan support is incomplete conformanceVersion = 0.0.0.0 Signed-off-by: Clayton Craft <clayton@craftyguy.net> Reviewed-by: Ivan Briano <ivan.briano@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13275>	2021-10-14 22:16:30 +00:00
Dylan Baker	2d47f3640f	docs: update calendar and link releases notes for 21.2.4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13369>	2021-10-14 20:50:11 +00:00
Dylan Baker	57755cad55	docs: add sha256 sum for 21.2.4 release Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13369>	2021-10-14 20:50:11 +00:00
Dylan Baker	8236a7741d	docs: add release notes for 21.2.4 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13369>	2021-10-14 20:50:11 +00:00
Eric Engestrom	70df31f5e0	docs: update calendar for 21.3.0-rc1 Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13368>	2021-10-14 20:51:08 +01:00
Danylo Piliaiev	1c0eb7aa78	ir3/freedreno: account for component in build_tessfactor_base The burden was put on the caller, which caused: - Reading of tess levels back in TCS not accounting for component - Reading patch outputs in TES account for component twice Fixes vkd3d tests: - test_tessellation_read_tesslevel - test_tessellation_primitive_id - test_line_tessellation_dxbc Signed-off-by: Danylo Piliaiev <dpiliaiev@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13338>	2021-10-14 17:35:24 +00:00
Emma Anholt	f839b9599f	loader: Avoid enumerating drm devices just to get an fd's PCI ID. Cuts 1/3 of the runtime of the VA-API unit tests (which do a separate pipe-loader init per test) on radeonsi on my system by not faffing around in sysfs so much. Reviewed-by: Adam Jackson <ajax@redhat.com> Reviewed-by: Pierre-Eric Pelloux-Prayer <pierre-eric.pelloux-prayer@amd.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13324>	2021-10-14 17:09:55 +00:00
Jason Ekstrand	b79e978ae4	vulkan/wsi/win32: Delete the wrapper entrypoints Reviewed-by: Boris Brezillon <boris.brezillon@collabora.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13352>	2021-10-14 15:44:51 +00:00
Mike Blumenkrantz	f769f34680	nir/print: print bindless info as applicable this is useful to know Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13204>	2021-10-14 15:11:38 +00:00
Jason Ekstrand	116e23e385	vulkan/log: Don't assert on non-client-visible objects We already have code to deal with non-client-visible objects but we were asserting if it didn't fall into one of the clearly mappable error cases. However, we didn't have a mapping for VK_ERROR_NOT_PERMITTED which can happen during object creation. Let's just be sloppy and drop the assert. Worst case, the client gets an error with no object. Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13341>	2021-10-14 14:23:45 +00:00
Jason Ekstrand	071437d29d	vulkan/log: Tweak our handling of a couple error enums VK_ERROR_INITIALIZATION_FAILED can happen as part of device creation and isn't really an instance error in that case. VK_ERROR_EXTENSION_NOT_PRESENT, on the other hand, is always an instance thing and we should handle it as such. Fixes: `0cad3beb2a` ("vulkan/log: Add common vk_error and vk_errorf helpers") Reviewed-by: Tapani Pälli <tapani.palli@intel.com> Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13341>	2021-10-14 14:23:45 +00:00
Boris Brezillon	fd46749234	vulkan: Set unused entrypoints to vk_entrypoint_stub when compiling with MSVC If we don't do that we hit the assert(entry[i] != NULL) added by commit `6d44b21d4f` ("vulkan: Fix weak symbol emulation when compiling with MSVC"). Fixes: `6d44b21d4f` ("vulkan: Fix weak symbol emulation when compiling with MSVC") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13355>	2021-10-14 13:56:38 +00:00
Bas Nieuwenhuizen	b4aa5a3fdd	radv: Fix modifier property query. radv_get_modifier_flags read the format properties, doesn't write any. Setting the central format properties based on the drm format properties doesn't make any sense. Fixes: `5dee0d9da9` "radv: switch to VK_FORMAT_FEATURE_2_XXX/VkFormatProperties3KHR" Closes: https://gitlab.freedesktop.org/mesa/mesa/-/issues/5498 Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Tested-By: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13357>	2021-10-14 13:29:42 +00:00
Iago Toral Quiroga	8e6f5aab33	v3dv: fix TLB buffer to image copy path for 3D images Another instance of not taking the Z offset from the right place. We had not seen this one until now because we typically use the TFU path, where we also fixed this same issue in commit `df1d08533c`. Reviewed-by: Alejandro Piñeiro <apinheiro@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13356>	2021-10-14 11:29:26 +02:00
Boris Brezillon	6d44b21d4f	vulkan: Fix weak symbol emulation when compiling with MSVC Mapping unimplemented entrypoints to a global function pointer variable initialized to NULL is a bit cumbersome, and actually led to a bug in the vk_xxx_dispatch_table_from_entrypoints() template: the !override case didn't have the right check on the source table entries. Instead of fixing that case, let's simplify the logic by creating a stub function and making the alternatename pragma point to this stub. This way we get rid of all those uneeded xxx_Null symbols/variables and simplify the tests in vk_xxxx_dispatch_table_from_entrypoints(). Cc: mesa-stable Fixes: `98c622a96e` ("vulkan: Update dispatch table gen for Windows") Signed-off-by: Boris Brezillon <boris.brezillon@collabora.com> Reviewed-by: Jesse Natalie <jenatali@microsoft.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13348>	2021-10-14 09:09:23 +02:00
Ian Romanick	ae99ea6f4d	nir/loop_unroll: Always unroll loops that iterate at most once Two carchase compute shaders (shader-db) and two Fallout 4 fragment shaders (fossil-db) were helped. Based on the NIR of the shaders, all four had structures like for (i = 0; i < 1; i++) { ... for (...) { ... } } All HSW+ platforms had similar results. (Ice Lake shown) total loops in shared programs: 6033 -> 6031 (-0.03%) loops in affected programs: 4 -> 2 (-50.00%) helped: 2 HURT: 0 All Intel platforms had similar results. (Ice Lake shown) Instructions in all programs: 143692018 -> 143692006 (-0.0%) SENDs in all programs: 6947154 -> 6947154 (+0.0%) Loops in all programs: 38285 -> 38283 (-0.0%) Cycles in all programs: 8434822225 -> 8434476815 (-0.0%) Spills in all programs: 191665 -> 191665 (+0.0%) Fills in all programs: 298822 -> 298822 (+0.0%) In the presense of loop unrolling like this, the change in cycles is not accurate. v2: Rearrange the logic in the if-condition to read a little better. Suggested by Tim. Closes: #5089 Reviewed-by: Timothy Arceri <tarceri@itsqueeze.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13323>	2021-10-13 20:11:13 -07:00
Dave Airlie	c4323dc846	brw/nir: remove unused function prototypes. These got moved into common code a good while ago. Reviewed-by: Caio Marcelo de Oliveira Filho <caio.oliveira@intel.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13328>	2021-10-13 22:52:59 +00:00
Anuj Phogat	a98ece61e9	anv: Enable tessellation redistribution This patch adds Tessellation Distribution on top of Geometry Distribution. Using recommended values based on performance studies across a range of workloads. Rework: - Add comment for new packet bits (Sagar) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Anuj Phogat	20c0ca75f5	iris: Enable tessellation redistribution This patch adds Tessellation Distribution on top of Geometry Distribution. Using recommended values based on performance studies across a range of workloads. Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Anuj Phogat	867e2e0716	anv: Enable geometry distribution Using recommended values based on performance studies across a range of workloads. Rework: * Always enable geometry distribution * Set ListCutIndexEnable if primitive restart is enabled * Set distribution mode based on TEEnable * Add comment explaining the 3DSTATE_VFG bits (Sagar) v2: - Emit 3DSTATE_VFG dynamically based on primitive restart (Ken) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Anuj Phogat	efa27572a1	iris: Enable geometry distribution Using recommended values based on performance studies across a range of workloads. Rework: * Always enable geometry distribution * Set ListCutIndexEnable if primitive restart is enabled * Set distribution mode based on TEEnable v2: - Flag missing IRIS_DIRTY_VFG bit (Ken) Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Anuj Phogat	1d224e7f14	genxml/gen125: Update 3DSTATE_TE fields Signed-off-by: Anuj Phogat <anuj.phogat@gmail.com> Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Jordan Justen	9a7e54b87f	intel/genxml: Update genxml to support tessellation/geometry distribution Rework: - Fix 3DSTATE_VFG opcode (Lionel) - Fix distribution mode values (Sagar) - Update 3DSTATE_VFG fields (Anuj) Reviewed-by: Sagar Ghuge <sagar.ghuge@intel.com> Reviewed-by: Kenneth Graunke <kenneth@whitecape.org> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/12091>	2021-10-13 22:36:54 +00:00
Emma Anholt	3eadb03db7	ci/lvp: Skip some slow tests under ASan. depending on the runner's load, we might see timeouts. The subgroupbroadcast one has hit us a couple times this week. Acked-by: Mike Blumenkrantz <michael.blumenkrantz@gmail.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13346>	2021-10-13 21:41:07 +00:00
Alejandro Piñeiro	ec51c8774d	v3d/clif: add support for dumping GS shader state The basic vertex+fragment shader state uses the packet GL_SHADER_STATE, but when geometry shader are involved, the packet used is GL_SHADER_STATE_INCLUDING_GS. Without this commit any program using a geometry shader would dump their shader state (and their shader state record and attribues) as binaries. Reviewed-by: Juan A. Suarez <jasuarez@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13269>	2021-10-13 21:23:10 +00:00
Alejandro Piñeiro	19894bec1f	v3dv/pipeline: don't clone the nir shader at pipeline_state_create_binning At that point we didn't call all the v3dv lowerings. So the reference nir shader used to call the v3d compiler could be different. Note that at that point the nir shader is only available for internal shaders (like gs multiview). This specifically affected multiview tests that wrote gl_PointSize, as the nir shader for the geometry shader were wrongly exposing per_vertex_point_size as false, as we were basing our check on the nir_shader_info, and that was gathered calling nir_shader_gather_info at pipeline_lower_nir. Reviewed-by: Iago Toral Quiroga <itoral@igalia.com> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13325>	2021-10-13 21:01:03 +00:00
Eric Engestrom	c7c484d7f4	VERSION: bump to 22.0 I mistakenly bumped it from 21.3 to 21.4, but there is no 4 (: Signed-off-by: Eric Engestrom <eric@engestrom.ch> Part-of: <https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/13347>	2021-10-13 20:52:19 +01:00
Eric Engestrom	7d9950e924	docs: reset new_features.txt Signed-off-by: Eric Engestrom <eric@engestrom.ch>	2021-10-13 20:29:27 +01:00
Eric Engestrom	91009cbaa8	VERSION: bump to 21.4 Signed-off-by: Eric Engestrom <eric@engestrom.ch>	2021-10-13 20:28:29 +01:00

4076 changed files with 357609 additions and 534817 deletions

6

.gitattributes vendored Normal file

View File

@@ -0,0 +1,6 @@
 *.csv eol=crlf
 * text=auto
 *.jpg binary
 *.png binary
 *.gif binary
 *.ico binary

1116

.gitlab-ci.yml

View File

File diff suppressed because it is too large Load Diff

11

.gitlab-ci/deqp-all-skips.txt → .gitlab-ci/all-skips.txt

View File

@@ -6,5 +6,12 @@
 # reliable to be run in parallel with other tests due to CPU-side timing.
 dEQP-GLES[0-9]*.functional.flush_finish.*
 # https://gitlab.freedesktop.org/mesa/mesa/-/issues/4575
 dEQP-VK.wsi.display.get_display_plane_capabilities
 # piglit: WGL is Windows-only
 wgl@.*
 # These are sensitive to CPU timing, and would need to be run in isolation
 # on the system rather than in parallel with other tests.
 glx@glx_arb_sync_control@timing.*
 # This test is not built with waffle, while we do build tests with waffle
 spec@!opengl 1.1@windowoverlap

									
										2

.gitlab-ci/bare-metal/arm64_a630_egl.sh
									
												View File
												
				@@ -10,7 +10,7 @@ EXIT=0

				# Run reset tests without parallelism:

				if ! env \

				  DEQP_RESULTS_DIR=results/reset \

				  DEQP_PARALLEL=1 \

				  FDO_CI_CONCURRENT=1 \

				  DEQP_CASELIST_FILTER='.*reset.*' \

				  /install/deqp-runner.sh; then

				    EXIT=1

									
										17

.gitlab-ci/bare-metal/cisco-2960-poe-off.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,17 @@

				#!/bin/bash

				if [ -z "$BM_POE_INTERFACE" ]; then

				    echo "Must supply the PoE Interface to power down"

				    exit 1

				fi

				if [ -z "$BM_POE_ADDRESS" ]; then

				    echo "Must supply the PoE Switch host"

				    exit 1

				fi

				SNMP_KEY="1.3.6.1.4.1.9.9.402.1.2.1.1.1.$BM_POE_INTERFACE"

				SNMP_ON="i 1"

				SNMP_OFF="i 4"

				snmpset -v2c -r 3 -t 30 -cmesaci $BM_POE_ADDRESS $SNMP_KEY $SNMP_OFF

									
										21

.gitlab-ci/bare-metal/cisco-2960-poe-on.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,21 @@

				#!/bin/bash

				if [ -z "$BM_POE_INTERFACE" ]; then

				    echo "Must supply the PoE Interface to power up"

				    exit 1

				fi

				if [ -z "$BM_POE_ADDRESS" ]; then

				    echo "Must supply the PoE Switch host"

				    exit 1

				fi

				set -ex

				SNMP_KEY="1.3.6.1.4.1.9.9.402.1.2.1.1.1.$BM_POE_INTERFACE"

				SNMP_ON="i 1"

				SNMP_OFF="i 4"

				snmpset -v2c -r 3 -t 10 -cmesaci $BM_POE_ADDRESS $SNMP_KEY $SNMP_OFF

				sleep 3s

				snmpset -v2c -r 3 -t 10 -cmesaci $BM_POE_ADDRESS $SNMP_KEY $SNMP_ON

									
										14

.gitlab-ci/bare-metal/cros_servo_run.py
									
												View File
												
				@@ -50,12 +50,18 @@ class CrosServoRun:

				            target=self.iter_feed_queue, daemon=True, args=(self.cpu_ser.lines(),))

				        self.iter_feed_cpu.start()

				    def close(self):

				        self.ec_ser.close()

				        self.cpu_ser.close()

				        self.iter_feed_ec.join()

				        self.iter_feed_cpu.join()

				    # Feed lines from our serial queues into the merged queue, marking when our

				    # input is done.

				    def iter_feed_queue(self, it):

				        for i in it:

				            self.serial_queue.put(i)

				        self.serial_queue.put(sentinel)

				        self.serial_queue.put(self.sentinel)

				    # Return the next line from the queue, counting how many threads have

				    # terminated and joining when done

				@@ -150,6 +156,10 @@ class CrosServoRun:

				                    "Detected spontaneous reboot, restarting run...")

				                return 2

				            if re.search("arm-smmu 5040000.iommu: TLB sync timed out -- SMMU may be deadlocked", line):

				                self.print_error("Detected cheza MMU fail, restarting run...")

				                return 2

				            result = re.search("hwci: mesa: (\S*)", line)

				            if result:

				                if result.group(1) == "pass":

				@@ -179,6 +189,8 @@ def main():

				    # power down the CPU on the device

				    servo.ec_write("power off\n")

				    servo.close()

				    sys.exit(retval)

									
										22

.gitlab-ci/bare-metal/fastboot_run.py
									
												View File
												
				@@ -36,6 +36,9 @@ class FastbootRun:

				        self.ser = SerialBuffer(args.dev, "results/serial-output.txt", "R SERIAL> ", timeout=600)

				        self.fastboot="fastboot boot -s {ser} artifacts/fastboot.img".format(ser=args.fbserial)

				    def close(self):

				        self.ser.close()

				    def print_error(self, message):

				        RED = '\033[0;31m'

				        NO_COLOR = '\033[0m'

				@@ -67,7 +70,13 @@ class FastbootRun:

				        if self.logged_system(self.fastboot) != 0:

				            return 1

				        print_more_lines = -1

				        for line in self.ser.lines():

				            if print_more_lines == 0:

				                return 2

				            if print_more_lines > 0:

				                print_more_lines -= 1

				            if re.search("---. end Kernel panic", line):

				                return 1

				@@ -89,6 +98,18 @@ class FastbootRun:

				                    "Detected network device failure, restarting run...")

				                return 2

				            # A3xx recovery doesn't quite work. Sometimes the GPU will get

				            # wedged and recovery will fail (because power can't be reset?)

				            # This assumes that the jobs are sufficiently well-tested that GPU

				            # hangs aren't always triggered, so just try again. But print some

				            # more lines first so that we get better information on the cause

				            # of the hang. Once a hang happens, it's pretty chatty.

				            if "[drm:adreno_recover] *ERROR* gpu hw init failed: -22" in line:

				                self.print_error(

				                    "Detected GPU hang, restarting run...")

				                if print_more_lines == -1:

				                    print_more_lines = 30

				            result = re.search("hwci: mesa: (\S*)", line)

				            if result:

				                if result.group(1) == "pass":

				@@ -111,6 +132,7 @@ def main():

				    while True:

				        retval = fastboot.run()

				        fastboot.close()

				        if retval != 2:

				            break

									
										34

.gitlab-ci/bare-metal/poe-powered.sh
									
												View File
												
				@@ -20,18 +20,6 @@ if [ -z "$BM_POE_ADDRESS" ]; then

				  exit 1

				fi

				if [ -z "$BM_POE_USERNAME" ]; then

				  echo "Must set BM_POE_USERNAME in your gitlab-runner config.toml [[runners]] environment"

				  echo "This is the PoE switch username."

				  exit 1

				fi

				if [ -z "$BM_POE_PASSWORD" ]; then

				  echo "Must set BM_POE_PASSWORD in your gitlab-runner config.toml [[runners]] environment"

				  echo "This is the PoE switch password."

				  exit 1

				fi

				if [ -z "$BM_POE_INTERFACE" ]; then

				  echo "Must set BM_POE_INTERFACE in your gitlab-runner config.toml [[runners]] environment"

				  echo "This is the PoE switch interface where the device is connected."

				@@ -107,11 +95,25 @@ fi

				# Install kernel modules (it could be either in /lib/modules or

				# /usr/lib/modules, but we want to install in the latter)

				[ -d $BM_BOOTFS/usr/lib/modules ] && rsync -a --delete $BM_BOOTFS/usr/lib/modules/ /nfs/usr/lib/modules/

				[ -d $BM_BOOTFS/lib/modules ] && rsync -a --delete $BM_BOOTFS/lib/modules/ /nfs/usr/lib/modules/

				[ -d $BM_BOOTFS/usr/lib/modules ] && rsync -a $BM_BOOTFS/usr/lib/modules/ /nfs/usr/lib/modules/

				[ -d $BM_BOOTFS/lib/modules ] && rsync -a $BM_BOOTFS/lib/modules/ /nfs/lib/modules/

				# Install kernel image + bootloader files

				rsync -a --delete $BM_BOOTFS/boot/ /tftp/

				rsync -aL --delete $BM_BOOTFS/boot/ /tftp/

				# Set up the pxelinux config for Jetson Nano

				mkdir -p /tftp/pxelinux.cfg

				cat <<EOF >/tftp/pxelinux.cfg/default-arm-tegra210-p3450-0000

				PROMPT 0

				TIMEOUT 30

				DEFAULT primary

				MENU TITLE jetson nano boot options

				LABEL primary

				      MENU LABEL CI kernel on TFTP

				      LINUX Image

				      FDT tegra210-p3450-0000.dtb

				      APPEND \${cbootargs} $BM_CMDLINE

				EOF

				# Create the rootfs in the NFS directory

				mkdir -p /nfs/results

				@@ -123,7 +125,7 @@ echo "$BM_CMDLINE" > /tftp/cmdline.txt

				printf "$BM_BOOTCONFIG" >> /tftp/config.txt

				set +e

				ATTEMPTS=2

				ATTEMPTS=10

				while [ $((ATTEMPTS--)) -gt 0 ]; do

				  python3 $BM/poe_run.py \

				          --dev="$BM_SERIAL" \

									
										4

.gitlab-ci/bare-metal/poe_run.py
									
												View File
												
				@@ -66,6 +66,10 @@ class PoERun:

				                self.print_error("Memory overflow in the binner; GPU hang")

				                return 1

				            if re.search("nouveau 57000000.gpu: bus: MMIO read of 00000000 FAULT at 137000", line):

				                self.print_error("nouveau jetson boot bug, retrying.")

				                return 2

				            result = re.search("hwci: mesa: (\S*)", line)

				            if result:

				                if result.group(1) == "pass":

									
										8

.gitlab-ci/bare-metal/rootfs-setup.sh
									
												View File
												
				@@ -8,15 +8,21 @@ mkdir -p $rootfs_dst/results

				cp $BM/bm-init.sh $rootfs_dst/init

				cp $CI_COMMON/init*.sh $rootfs_dst/

				# Make JWT token available as file in the bare-metal storage to enable access

				# to MinIO

				cp "${CI_JOB_JWT_FILE}" "${rootfs_dst}${CI_JOB_JWT_FILE}"

				cp $CI_COMMON/capture-devcoredump.sh $rootfs_dst/

				cp $CI_COMMON/intel-gpu-freq.sh $rootfs_dst/

				set +x

				# Pass through relevant env vars from the gitlab job to the baremetal init script

				"$CI_COMMON"/generate-env.sh > $rootfs_dst/set-job-env-vars.sh

				chmod +x $rootfs_dst/set-job-env-vars.sh

				echo "Variables passed through:"

				cat $rootfs_dst/set-job-env-vars.sh

				echo "export CI_JOB_JWT=${CI_JOB_JWT@Q}" >> $rootfs_dst/set-job-env-vars.sh

				set -x

				# Add the Mesa drivers we built, and make a consistent symlink to them.

									
										27

.gitlab-ci/bare-metal/serial_buffer.py
									
												View File
												
				@@ -28,7 +28,6 @@ import serial

				import threading

				import time

				class SerialBuffer:

				    def __init__(self, dev, filename, prefix, timeout = None):

				        self.filename = filename

				@@ -36,15 +35,17 @@ class SerialBuffer:

				        if dev:

				            self.f = open(filename, "wb+")

				            self.serial = serial.Serial(dev, 115200, timeout=timeout if timeout else 10)

				            self.serial = serial.Serial(dev, 115200, timeout=timeout)

				        else:

				            self.f = open(filename, "rb")

				            self.serial = None

				        self.byte_queue = queue.Queue()

				        self.line_queue = queue.Queue()

				        self.prefix = prefix

				        self.timeout = timeout

				        self.sentinel = object()

				        self.closing = False

				        if self.dev:

				            self.read_thread = threading.Thread(

				@@ -58,24 +59,31 @@ class SerialBuffer:

				            target=self.serial_lines_thread_loop, daemon=True)

				        self.lines_thread.start()

				    def close(self):

				        self.closing = True

				        if self.serial:

				            self.serial.cancel_read()

				        self.read_thread.join()

				        self.lines_thread.join()

				        if self.serial:

				            self.serial.close()

				    # Thread that just reads the bytes from the serial device to try to keep from

				    # buffer overflowing it. If nothing is received in 1 minute, it finalizes.

				    def serial_read_thread_loop(self):

				        greet = "Serial thread reading from %s\n" % self.dev

				        self.byte_queue.put(greet.encode())

				        while True:

				        while not self.closing:

				            try:

				                b = self.serial.read()

				                if len(b) > 0:

				                    self.byte_queue.put(b)

				                elif self.timeout:

				                    self.byte_queue.put(self.sentinel)

				                if len(b) == 0:

				                    break

				                self.byte_queue.put(b)

				            except Exception as err:

				                print(self.prefix + str(err))

				                self.byte_queue.put(self.sentinel)

				                break

				        self.byte_queue.put(self.sentinel)

				    # Thread that just reads the bytes from the file of serial output that some

				    # other process is appending to.

				@@ -83,12 +91,13 @@ class SerialBuffer:

				        greet = "Serial thread reading from %s\n" % self.filename

				        self.byte_queue.put(greet.encode())

				        while True:

				        while not self.closing:

				            line = self.f.readline()

				            if line:

				                self.byte_queue.put(line)

				            else:

				                time.sleep(0.1)

				        self.byte_queue.put(self.sentinel)

				    # Thread that processes the stream of bytes to 1) log to stdout, 2) log to

				    # file, 3) add to the queue of lines to be read by program logic

									
										525

.gitlab-ci/build/gitlab-ci.yml
									
										Normal file
									
												View File
												
				@@ -0,0 +1,525 @@

				# Shared between windows and Linux

				.build-common:

				  extends: .ci-run-policy

				  # Cancel job if a newer commit is pushed to the same branch

				  interruptible: true

				  artifacts:

				    name: "mesa_${CI_JOB_NAME}"

				    when: always

				    paths:

				      - _build/meson-logs/*.txt

				      - _build/meson-logs/strace

				      - shader-db

				# Just Linux

				.build-linux:

				  extends: .build-common

				  variables:

				    CCACHE_COMPILERCHECK: "content"

				    CCACHE_COMPRESS: "true"

				    CCACHE_DIR: /cache/mesa/ccache

				  # Use ccache transparently, and print stats before/after

				  before_script:

				    - !reference [default, before_script]

				    - export PATH="/usr/lib/ccache:$PATH"

				    - export CCACHE_BASEDIR="$PWD"

				    - echo -e "\e[0Ksection_start:$(date +%s):ccache_before[collapsed=true]\r\e[0Kccache stats before build"

				    - ccache --show-stats

				    - echo -e "\e[0Ksection_end:$(date +%s):ccache_before\r\e[0K"

				  after_script:

				    - echo -e "\e[0Ksection_start:$(date +%s):ccache_after[collapsed=true]\r\e[0Kccache stats after build"

				    - ccache --show-stats

				    - echo -e "\e[0Ksection_end:$(date +%s):ccache_after\r\e[0K"

				    - !reference [default, after_script]

				.build-windows:

				  extends: .build-common

				  tags:

				    - windows

				    - docker

				    - "1809"

				    - mesa

				  cache:

				    key: ${CI_JOB_NAME}

				    paths:

				      - subprojects/packagecache

				.meson-build:

				  extends:

				    - .build-linux

				    - .use-debian/x86_build

				  stage: build-x86_64

				  variables:

				    LLVM_VERSION: 11

				  script:

				    - .gitlab-ci/meson/build.sh

				debian-testing:

				  extends:

				    - .meson-build

				    - .ci-deqp-artifacts

				  variables:

				    UNWIND: "enabled"

				    DRI_LOADERS: >

				      -D glx=dri

				      -D gbm=enabled

				      -D egl=enabled

				      -D platforms=x11

				    GALLIUM_ST: >

				      -D dri3=enabled

				      -D gallium-va=enabled

				    GALLIUM_DRIVERS: "swrast,virgl,radeonsi,zink,crocus,iris,i915"

				    VULKAN_DRIVERS: "swrast,amd,intel"

				    BUILDTYPE: "debugoptimized"

				    EXTRA_OPTION: >

				      -D valgrind=false

				    MINIO_ARTIFACT_NAME: mesa-amd64

				  script:

				    - .gitlab-ci/lava/lava-pytest.sh

				    - .gitlab-ci/meson/build.sh

				    - .gitlab-ci/prepare-artifacts.sh

				  artifacts:

				    reports:

				      junit: artifacts/ci_scripts_report.xml

				debian-testing-asan:

				  extends:

				    - debian-testing

				  variables:

				    C_ARGS: >

				      -Wno-error=stringop-truncation

				    EXTRA_OPTION: >

				      -D b_sanitize=address

				      -D valgrind=false

				      -D tools=dlclose-skip

				    MINIO_ARTIFACT_NAME: ""

				    ARTIFACTS_DEBUG_SYMBOLS: 1

				debian-testing-msan:

				  extends:

				    - debian-clang

				  variables:

				    # l_undef is incompatible with msan

				    EXTRA_OPTION:

				      -D b_sanitize=memory

				      -D b_lundef=false

				    MINIO_ARTIFACT_NAME: ""

				    ARTIFACTS_DEBUG_SYMBOLS: 1

				    # Don't run all the tests yet:

				    # GLSL has some issues in sexpression reading.

				    # gtest has issues in its test initialization.

				    MESON_TEST_ARGS: "--suite glcpp --suite gallium  --suite format"

				    # Freedreno dropped because freedreno tools fail at msan.

				    GALLIUM_DRIVERS: "iris,nouveau,kmsro,r300,r600,swrast,svga,v3d,vc4,virgl,etnaviv,panfrost,lima,zink,radeonsi,tegra,d3d12,crocus"

				    VULKAN_DRIVERS: intel,amd,broadcom,virtio-experimental

				debian-clover-testing:

				  extends:

				    - .meson-build

				    - .ci-deqp-artifacts

				  variables:

				    UNWIND: "enabled"

				    DRI_LOADERS: >

				      -D glx=disabled

				      -D egl=disabled

				      -D gbm=disabled

				    GALLIUM_ST: >

				      -D gallium-opencl=icd

				      -D opencl-spirv=true

				    GALLIUM_DRIVERS: "swrast"

				    BUILDTYPE: "debugoptimized"

				    EXTRA_OPTION: >

				      -D valgrind=false

				  script:

				    - .gitlab-ci/meson/build.sh

				    - .gitlab-ci/prepare-artifacts.sh

				debian-gallium:

				  extends: .meson-build

				  variables:

				    UNWIND: "enabled"

				    DRI_LOADERS: >

				      -D glx=dri

				      -D gbm=enabled

				      -D egl=enabled

				      -D platforms=x11,wayland

				    GALLIUM_ST: >

				      -D dri3=enabled

				      -D gallium-extra-hud=true

				      -D gallium-vdpau=enabled

				      -D gallium-xvmc=enabled

				      -D gallium-omx=bellagio

				      -D gallium-va=enabled

				      -D gallium-xa=enabled

				      -D gallium-nine=true

				      -D gallium-opencl=disabled

				    GALLIUM_DRIVERS: "iris,nouveau,kmsro,r300,r600,freedreno,swrast,svga,v3d,vc4,virgl,etnaviv,panfrost,lima,zink,d3d12,asahi,crocus"

				    VULKAN_DRIVERS: swrast

				    EXTRA_OPTION: >

				      -D osmesa=true

				      -D tools=drm-shim,etnaviv,freedreno,glsl,intel,intel-ui,nir,nouveau,xvmc,lima,panfrost,asahi

				  script:

				    - .gitlab-ci/meson/build.sh

				    - .gitlab-ci/run-shader-db.sh

				# Test a release build with -Werror so new warnings don't sneak in.

				debian-release:

				  extends: .meson-build

				  variables:

				    UNWIND: "enabled"

				    DRI_LOADERS: >

				      -D glx=dri

				      -D gbm=enabled

				      -D egl=enabled

				      -D platforms=x11,wayland

				    GALLIUM_ST: >

				      -D dri3=enabled

				      -D gallium-extra-hud=true

				      -D gallium-vdpau=enabled

				      -D gallium-xvmc=disabled

				      -D gallium-omx=disabled

				      -D gallium-va=enabled

				      -D gallium-xa=enabled

				      -D gallium-nine=false

				      -D gallium-opencl=disabled

				      -D llvm=enabled

				    GALLIUM_DRIVERS: "i915,iris,nouveau,kmsro,freedreno,r300,svga,swrast,v3d,vc4,virgl,etnaviv,panfrost,lima,zink,d3d12,crocus"

				    VULKAN_DRIVERS: "amd,imagination-experimental"

				    BUILDTYPE: "release"

				    EXTRA_OPTION: >

				      -D osmesa=true

				      -D tools=all

				      -D intel-clc=enabled

				      -D imagination-srv=true

				  script:

				    - .gitlab-ci/meson/build.sh

				fedora-release:

				  extends:

				    - .meson-build

				    - .use-fedora/x86_build

				  variables:

				    BUILDTYPE: "release"

				    C_ARGS: >

				      -Wno-error=array-bounds

				      -Wno-error=maybe-uninitialized

				      -Wno-error=stringop-overread

				      -Wno-error=uninitialized

				    CPP_ARGS: >

				      -Wno-error=array-bounds

				    DRI_LOADERS: >

				      -D glx=dri

				      -D gbm=enabled

				      -D egl=enabled

				      -D glvnd=true

				      -D platforms=x11,wayland

				    EXTRA_OPTION: >

				      -D osmesa=true

				      -D selinux=true

				      -D tools=drm-shim,etnaviv,freedreno,glsl,intel,nir,nouveau,lima,panfrost,imagination

				      -D intel-clc=enabled

				      -D imagination-srv=true

				    GALLIUM_DRIVERS: "crocus,etnaviv,freedreno,iris,kmsro,lima,nouveau,panfrost,r300,r600,radeonsi,svga,swrast,tegra,v3d,vc4,virgl,zink"

				    GALLIUM_ST: >

				      -D dri3=enabled

				      -D gallium-extra-hud=true

				      -D gallium-vdpau=enabled

				      -D gallium-xvmc=disabled

				      -D gallium-omx=disabled

				      -D gallium-va=enabled

				      -D gallium-xa=enabled

				      -D gallium-nine=false

				      -D gallium-opencl=icd

				      -D gles1=disabled

				      -D gles2=enabled

				      -D llvm=enabled

				      -D microsoft-clc=disabled

				      -D shared-llvm=enabled

				      -D vulkan-device-select-layer=true

				    LLVM_VERSION: ""

				    UNWIND: "disabled"

				    VULKAN_DRIVERS: "amd,broadcom,freedreno,intel,imagination-experimental"

				  script:

				    - .gitlab-ci/meson/build.sh

				debian-android:

				  extends:

				    - .meson-cross

				    - .use-debian/android_build

				  variables:

				    UNWIND: "disabled"

				    C_ARGS: >

				      -Wno-error=asm-operand-widths

				      -Wno-error=constant-conversion

				      -Wno-error=enum-conversion

				      -Wno-error=initializer-overrides

				      -Wno-error=missing-braces

				      -Wno-error=sometimes-uninitialized

				      -Wno-error=unused-function

				    CPP_ARGS: >

				      -Wno-error=deprecated-declarations

				    DRI_LOADERS: >

				      -D glx=disabled

				      -D gbm=disabled

				      -D egl=enabled

				      -D platforms=android

				    EXTRA_OPTION: >

				      -D android-stub=true

				      -D llvm=disabled

				      -D platform-sdk-version=29

				      -D valgrind=false

				    GALLIUM_ST: >

				      -D dri3=disabled

				      -D gallium-vdpau=disabled

				      -D gallium-xvmc=disabled

				      -D gallium-omx=disabled

				      -D gallium-va=disabled

				      -D gallium-xa=disabled

				      -D gallium-nine=false

				      -D gallium-opencl=disabled

				    LLVM_VERSION: ""

				    PKG_CONFIG_LIBDIR: "/disable/non/android/system/pc/files"

				  script:

				    - PKG_CONFIG_PATH=/usr/local/lib/aarch64-linux-android/pkgconfig/:/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/sysroot/usr/lib/aarch64-linux-android/pkgconfig/ CROSS=aarch64-linux-android GALLIUM_DRIVERS=etnaviv,freedreno,lima,panfrost,vc4,v3d VULKAN_DRIVERS=freedreno,broadcom,virtio-experimental .gitlab-ci/meson/build.sh

				    # x86_64 build:

				    # Can't do Intel because gen_decoder.c currently requires libexpat, which

				    # is not a dependency that AOSP wants to accept.  Can't do Radeon Gallium

				    # drivers because they requires LLVM, which we don't have an Android build

				    # of.

				    - PKG_CONFIG_PATH=/usr/local/lib/x86_64-linux-android/pkgconfig/:/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/sysroot/usr/lib/x86_64-linux-android/pkgconfig/ CROSS=x86_64-linux-android GALLIUM_DRIVERS=iris VULKAN_DRIVERS=amd,intel .gitlab-ci/meson/build.sh

				.meson-cross:

				  extends:

				    - .meson-build

				  stage: build-misc

				  variables:

				    UNWIND: "disabled"

				    DRI_LOADERS: >

				      -D glx=dri

				      -D gbm=enabled

				      -D egl=enabled

				      -D platforms=x11

				      -D osmesa=false

				    GALLIUM_ST: >

				      -D dri3=enabled

				      -D gallium-vdpau=disabled

				      -D gallium-xvmc=disabled

				      -D gallium-omx=disabled

				      -D gallium-va=disabled

				      -D gallium-xa=disabled

				      -D gallium-nine=false

				.meson-arm:

				  extends:

				    - .meson-cross

				    - .use-debian/arm_build

				  needs:

				    - debian/arm_build

				  variables:

				    VULKAN_DRIVERS: freedreno,broadcom

				    GALLIUM_DRIVERS: "etnaviv,freedreno,kmsro,lima,nouveau,panfrost,swrast,tegra,v3d,vc4"

				    BUILDTYPE: "debugoptimized"

				  tags:

				    - aarch64

				debian-armhf:

				  extends:

				    - .meson-arm

				    - .ci-deqp-artifacts

				  variables:

				    CROSS: armhf

				    EXTRA_OPTION: >

				      -D llvm=disabled

				      -D valgrind=false

				    MINIO_ARTIFACT_NAME: mesa-armhf

				  script:

				    - .gitlab-ci/meson/build.sh

				    - .gitlab-ci/prepare-artifacts.sh

				debian-arm64:

				  extends:

				    - .meson-arm

				    - .ci-deqp-artifacts

				  variables:

				    VULKAN_DRIVERS: "freedreno,broadcom,panfrost,imagination-experimental"

				    EXTRA_OPTION: >

				      -D llvm=disabled

				      -D valgrind=false

				      -D imagination-srv=true

				    MINIO_ARTIFACT_NAME: mesa-arm64

				  script:

				    - .gitlab-ci/meson/build.sh

				    - .gitlab-ci/prepare-artifacts.sh

				debian-arm64-asan:

				  extends:

				    - debian-arm64

				  variables:

				    C_ARGS: >

				      -Wno-error=stringop-truncation

				    EXTRA_OPTION: >

				      -D llvm=disabled

				      -D b_sanitize=address

				      -D valgrind=false

				      -D tools=dlclose-skip

				    ARTIFACTS_DEBUG_SYMBOLS: 1

				    MINIO_ARTIFACT_NAME: mesa-arm64-asan

				    MESON_TEST_ARGS: "--no-suite mesa:compiler"

				debian-arm64-build-test:

				  extends:

				    - .meson-arm

				    - .ci-deqp-artifacts

				  variables:

				    VULKAN_DRIVERS: "amd"

				    EXTRA_OPTION: >

				      -Dtools=panfrost,imagination

				  script:

				    - .gitlab-ci/meson/build.sh

				debian-clang:

				  extends: .meson-build

				  variables:

				    UNWIND: "enabled"

				    C_ARGS: >

				      -Wno-error=constant-conversion

				      -Wno-error=enum-conversion

				      -Wno-error=implicit-const-int-float-conversion

				      -Wno-error=initializer-overrides

				      -Wno-error=sometimes-uninitialized

				      -Wno-error=unused-function

				    CPP_ARGS: >

				      -Wno-error=c99-designator

				      -Wno-error=deprecated-declarations

				      -Wno-error=implicit-const-int-float-conversion

				      -Wno-error=missing-braces

				      -Wno-error=overloaded-virtual

				      -Wno-error=tautological-constant-out-of-range-compare

				      -Wno-error=unused-const-variable

				      -Wno-error=unused-private-field

				    DRI_LOADERS: >

				      -D glvnd=true

				    GALLIUM_DRIVERS: "iris,nouveau,kmsro,r300,r600,freedreno,swrast,svga,v3d,vc4,virgl,etnaviv,panfrost,lima,zink,radeonsi,tegra,d3d12,crocus,i915,asahi"

				    VULKAN_DRIVERS: intel,amd,freedreno,broadcom,virtio-experimental,swrast,panfrost,imagination-experimental

				    EXTRA_OPTIONS:

				      -D imagination-srv=true

				    CC: clang

				    CXX: clang++

				windows-vs2019:

				  extends:

				    - .build-windows

				    - .use-windows_build_vs2019

				    - .windows-build-rules

				  stage: build-misc

				  script:

				    - . .\.gitlab-ci\windows\mesa_build.ps1

				  artifacts:

				    paths:

				      - _build/meson-logs/*.txt

				      - _install/

				debian-clover:

				  extends: .meson-build

				  variables:

				    UNWIND: "enabled"

				    DRI_LOADERS: >

				      -D glx=disabled

				      -D egl=disabled

				      -D gbm=disabled

				    GALLIUM_DRIVERS: "r600,radeonsi"

				    GALLIUM_ST: >

				      -D dri3=disabled

				      -D gallium-vdpau=disabled

				      -D gallium-xvmc=disabled

				      -D gallium-omx=disabled

				      -D gallium-va=disabled

				      -D gallium-xa=disabled

				      -D gallium-nine=false

				      -D gallium-opencl=icd

				    EXTRA_OPTION: >

				      -D valgrind=false

				  script:

				    - LLVM_VERSION=9 GALLIUM_DRIVERS=r600,swrast .gitlab-ci/meson/build.sh

				    - .gitlab-ci/meson/build.sh

				debian-vulkan:

				  extends: .meson-build

				  variables:

				    UNWIND: "disabled"

				    DRI_LOADERS: >

				      -D glx=disabled

				      -D gbm=disabled

				      -D egl=disabled

				      -D platforms=x11,wayland

				      -D osmesa=false

				    GALLIUM_ST: >

				      -D dri3=enabled

				      -D gallium-vdpau=disabled

				      -D gallium-xvmc=disabled

				      -D gallium-omx=disabled

				      -D gallium-va=disabled

				      -D gallium-xa=disabled

				      -D gallium-nine=false

				      -D gallium-opencl=disabled

				      -D b_sanitize=undefined

				      -D c_args=-fno-sanitize-recover=all

				      -D cpp_args=-fno-sanitize-recover=all

				    UBSAN_OPTIONS: "print_stacktrace=1"

				    VULKAN_DRIVERS: intel,amd,freedreno,broadcom,virtio-experimental,imagination-experimental

				    EXTRA_OPTION: >

				      -D vulkan-layers=device-select,overlay

				      -D build-aco-tests=true

				      -D intel-clc=enabled

				      -D imagination-srv=true

				debian-i386:

				  extends:

				    - .meson-cross

				    - .use-debian/i386_build

				  variables:

				    CROSS: i386

				    VULKAN_DRIVERS: intel,amd,swrast,virtio-experimental

				    GALLIUM_DRIVERS: "iris,nouveau,r300,r600,radeonsi,swrast,virgl,zink,crocus"

				    EXTRA_OPTION: >

				      -D vulkan-layers=device-select,overlay

				debian-s390x:

				  extends:

				    - debian-ppc64el

				    - .use-debian/s390x_build

				    - .s390x-rules

				  tags:

				    - kvm

				  variables:

				    CROSS: s390x

				    GALLIUM_DRIVERS: "swrast,zink"

				    # The lp_test_blend test times out with LLVM 11

				    LLVM_VERSION: 9

				    VULKAN_DRIVERS: "swrast"

				debian-ppc64el:

				  extends:

				    - .meson-cross

				    - .use-debian/ppc64el_build

				    - .ppc64el-rules

				  variables:

				    CROSS: ppc64el

				    GALLIUM_DRIVERS: "nouveau,radeonsi,swrast,virgl,zink"

				    VULKAN_DRIVERS: "amd,swrast"

				debian-mingw32-x86_64:

				  extends: .meson-build

				  stage: build-misc

				  variables:

				    UNWIND: "disabled"

				    C_ARGS: >

				      -Wno-error=format

				      -Wno-error=format-extra-args

				    CPP_ARGS: $C_ARGS

				    GALLIUM_DRIVERS: "swrast"

				    EXTRA_OPTION: >

				      -Dllvm=disabled

				      -Dzlib=disabled

				      -Dosmesa=true

				      --cross-file=.gitlab-ci/x86_64-w64-mingw32

									
										47

.gitlab-ci/common/generate-env.sh
									
												View File
												
				@@ -5,8 +5,11 @@ for var in \

				    BASE_SYSTEM_FORK_HOST_PREFIX \

				    BASE_SYSTEM_MAINLINE_HOST_PREFIX \

				    CI_COMMIT_BRANCH \

				    CI_COMMIT_REF_NAME \

				    CI_COMMIT_TITLE \

				    CI_JOB_ID \

				    CI_JOB_JWT_FILE \

				    CI_JOB_NAME \

				    CI_JOB_URL \

				    CI_MERGE_REQUEST_SOURCE_BRANCH_NAME \

				    CI_MERGE_REQUEST_TITLE \

				@@ -14,22 +17,26 @@ for var in \

				    CI_NODE_TOTAL \

				    CI_PAGES_DOMAIN \

				    CI_PIPELINE_ID \

				    CI_PIPELINE_URL \

				    CI_PROJECT_DIR \

				    CI_PROJECT_NAME \

				    CI_PROJECT_PATH \

				    CI_PROJECT_ROOT_NAMESPACE \

				    CI_RUNNER_DESCRIPTION \

				    CI_SERVER_URL \

				    CROSVM_GALLIUM_DRIVER \

				    CROSVM_GPU_ARGS \

				    DEQP_BIN_DIR \

				    DEQP_CASELIST_FILTER \

				    DEQP_CASELIST_INV_FILTER \

				    DEQP_CONFIG \

				    DEQP_EXPECTED_RENDERER \

				    DEQP_FRACTION \

				    DEQP_HEIGHT \

				    DEQP_PARALLEL \

				    DEQP_RESULTS_DIR \

				    DEQP_RUNNER_OPTIONS \

				    DEQP_SUITE \

				    DEQP_TEMP_DIR \

				    DEQP_VARIANT \

				    DEQP_VER \

				    DEQP_WIDTH \

				@@ -41,44 +48,68 @@ for var in \

				    FDO_UPSTREAM_REPO \

				    FD_MESA_DEBUG \

				    FLAKES_CHANNEL \

				    FREEDRENO_HANGCHECK_MS \

				    GALLIUM_DRIVER \

				    GALLIVM_PERF \

				    GPU_VERSION \

				    GTEST \

				    GTEST_FAILS \

				    GTEST_FRACTION \

				    GTEST_RESULTS_DIR \

				    GTEST_RUNNER_OPTIONS \

				    GTEST_SKIPS \

				    HWCI_FREQ_MAX \

				    HWCI_KERNEL_MODULES \

				    HWCI_KVM \

				    HWCI_START_XORG \

				    HWCI_TEST_SCRIPT \

				    IR3_SHADER_DEBUG \

				    JOB_ARTIFACTS_BASE \

				    JOB_RESULTS_PATH \

				    JOB_ROOTFS_OVERLAY_PATH \

				    KERNEL_IMAGE_BASE_URL \

				    KERNEL_IMAGE_NAME \

				    LD_LIBRARY_PATH \

				    LP_NUM_THREADS \

				    MESA_BASE_TAG \

				    MESA_BUILD_PATH \

				    MESA_GL_VERSION_OVERRIDE \

				    MESA_GLSL_VERSION_OVERRIDE \

				    MESA_DEBUG \

				    MESA_GLES_VERSION_OVERRIDE \

				    MESA_GLSL_VERSION_OVERRIDE \

				    MESA_GL_VERSION_OVERRIDE \

				    MESA_IMAGE \

				    MESA_IMAGE_PATH \

				    MESA_IMAGE_TAG \

				    MESA_TEMPLATES_COMMIT \

				    MESA_VK_IGNORE_CONFORMANCE_WARNING \

				    MESA_SPIRV_LOG_LEVEL \

				    MINIO_HOST \

				    NIR_VALIDATE \

				    MINIO_RESULTS_UPLOAD \

				    NIR_DEBUG \

				    PAN_I_WANT_A_BROKEN_VULKAN_DRIVER \

				    PAN_MESA_DEBUG \

				    PIGLIT_FRACTION \

				    PIGLIT_JUNIT_RESULTS \

				    PIGLIT_NO_WINDOW \

				    PIGLIT_OPTIONS \

				    PIGLIT_PLATFORM \

				    PIGLIT_PROFILES \

				    PIGLIT_REPLAY_ARTIFACTS_BASE_URL \

				    PIGLIT_REPLAY_SUBCOMMAND \

				    PIGLIT_REPLAY_DESCRIPTION_FILE \

				    PIGLIT_REPLAY_DEVICE_NAME \

				    PIGLIT_REPLAY_EXTRA_ARGS \

				    PIGLIT_REPLAY_LOOP_TIMES \

				    PIGLIT_REPLAY_REFERENCE_IMAGES_BASE \

				    PIGLIT_REPLAY_UPLOAD_TO_MINIO \

				    PIGLIT_REPLAY_SUBCOMMAND \

				    PIGLIT_RESULTS \

				    PIGLIT_TESTS \

				    PIPELINE_ARTIFACTS_BASE \

				    TEST_LD_PRELOAD \

				    SKQP_ASSETS_DIR \

				    SKQP_BACKENDS \

				    TU_DEBUG \

				    VIRGL_HOST_API \

				    VK_CPU \

				    VK_DRIVER \

				    VK_ICD_FILENAMES \

				    ; do

				  if [ -n "${!var+x}" ]; then

				    echo "export $var=${!var@Q}"

1

.gitlab-ci/common/init-stage1.sh

View File

@@ -9,6 +9,7 @@ cd /
 mount -t proc none /proc
 mount -t sysfs none /sys
 mount -t debugfs none /sys/kernel/debug
 mount -t devtmpfs none /dev || echo possibly already mounted
 mkdir -p /dev/pts
 mount -t devpts devpts /dev/pts

									
										54

.gitlab-ci/common/init-stage2.sh
									
												View File
												
				@@ -8,7 +8,31 @@

				set -ex

				# Set up any devices required by the jobs

				[ -z "$HWCI_KERNEL_MODULES" ] || (echo -n $HWCI_KERNEL_MODULES | xargs -d, -n1 /usr/sbin/modprobe)

				[ -z "$HWCI_KERNEL_MODULES" ] || {

				    echo -n $HWCI_KERNEL_MODULES | xargs -d, -n1 /usr/sbin/modprobe

				}

				#

				# Load the KVM module specific to the detected CPU virtualization extensions:

				# - vmx for Intel VT

				# - svm for AMD-V

				#

				# Additionally, download the kernel image to boot the VM via HWCI_TEST_SCRIPT.

				#

				if [ "$HWCI_KVM" = "true" ]; then

				    unset KVM_KERNEL_MODULE

				    grep -qs '\bvmx\b' /proc/cpuinfo && KVM_KERNEL_MODULE=kvm_intel || {

				        grep -qs '\bsvm\b' /proc/cpuinfo && KVM_KERNEL_MODULE=kvm_amd

				    }

				    [ -z "${KVM_KERNEL_MODULE}" ] && \

				        echo "WARNING: Failed to detect CPU virtualization extensions" || \

				        modprobe ${KVM_KERNEL_MODULE}

				    mkdir -p /lava-files

				    wget -S --progress=dot:giga -O /lava-files/${KERNEL_IMAGE_NAME} \

				        "${KERNEL_IMAGE_BASE_URL}/${KERNEL_IMAGE_NAME}"

				fi

				# Fix prefix confusion: the build installs to $CI_PROJECT_DIR, but we expect

				# it in /install

				@@ -36,6 +60,16 @@ if [ "$HWCI_FREQ_MAX" = "true" ]; then

				  # Disable GPU runtime power management

				  GPU_AUTOSUSPEND=`find /sys/devices -name autosuspend_delay_ms | grep gpu | head -1`

				  test -z "$GPU_AUTOSUSPEND" || echo -1 > $GPU_AUTOSUSPEND || true

				  # Lock Intel GPU frequency to 70% of the maximum allowed by hardware

				  # and enable throttling detection & reporting.

				  ./intel-gpu-freq.sh -s 70% -g all -d

				fi

				# Increase freedreno hangcheck timer because it's right at the edge of the

				# spilling tests timing out (and some traces, too)

				if [ -n "$FREEDRENO_HANGCHECK_MS" ]; then

				    echo $FREEDRENO_HANGCHECK_MS | tee -a /sys/kernel/debug/dri/128/hangcheck_period_ms

				fi

				# Start a little daemon to capture the first devcoredump we encounter.  (They

				@@ -61,18 +95,18 @@ if [ -n "$HWCI_START_XORG" ]; then

				  export DISPLAY=:0

				fi

				RESULT=fail

				if sh $HWCI_TEST_SCRIPT; then

				  RESULT=pass

				  rm -rf results/trace/$PIGLIT_REPLAY_DEVICE_NAME

				fi

				sh -c "$HWCI_TEST_SCRIPT" && RESULT=pass || RESULT=fail

				# Let's make sure the results are always stored in current working directory

				mv -f ${CI_PROJECT_DIR}/results ./ 2>/dev/null || true

				[ "${RESULT}" = "fail" ] || rm -rf results/trace/$PIGLIT_REPLAY_DEVICE_NAME

				# upload artifacts

				MINIO=$(cat /proc/cmdline | tr ' ' '\n' | grep minio_results | cut -d '=' -f 2 || true)

				if [ -n "$MINIO" ]; then

				if [ -n "$MINIO_RESULTS_UPLOAD" ]; then

				  tar -czf results.tar.gz results/;

				  ci-fairy minio login "$CI_JOB_JWT";

				  ci-fairy minio cp results.tar.gz minio://"$MINIO"/results.tar.gz;

				  ci-fairy minio login --token-file "${CI_JOB_JWT_FILE}";

				  ci-fairy minio cp results.tar.gz minio://"$MINIO_RESULTS_UPLOAD"/results.tar.gz;

				fi

				echo "hwci: mesa: $RESULT"

									
										567

.gitlab-ci/common/intel-gpu-freq.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,567 @@

				#!/bin/sh

				#

				# The Intel i915 GPU driver allows to change the minimum, maximum and boost

				# frequencies in steps of 50 MHz via /sys/class/drm/card<n>/<freq_info>,

				# where <n> is the DRM card index and <freq_info> one of the following:

				#

				# - gt_max_freq_mhz (enforced maximum freq)

				# - gt_min_freq_mhz (enforced minimum freq)

				# - gt_boost_freq_mhz (enforced boost freq)

				#

				# The hardware capabilities can be accessed via:

				#

				# - gt_RP0_freq_mhz (supported maximum freq)

				# - gt_RPn_freq_mhz (supported minimum freq)

				# - gt_RP1_freq_mhz (most efficient freq)

				#

				# The current frequency can be read from:

				# - gt_act_freq_mhz (the actual GPU freq)

				# - gt_cur_freq_mhz (the last requested freq)

				#

				# Copyright (C) 2022 Collabora Ltd.

				# Author: Cristian Ciocaltea <cristian.ciocaltea@collabora.com>

				#

				# SPDX-License-Identifier: MIT

				#

				#

				# Constants

				#

				DRM_FREQ_SYSFS_PATTERN="/sys/class/drm/card%d/gt_%s_freq_mhz"

				ENF_FREQ_INFO="max min boost"

				CAP_FREQ_INFO="RP0 RPn RP1"

				ACT_FREQ_INFO="act cur"

				THROTT_DETECT_SLEEP_SEC=2

				THROTT_DETECT_PID_FILE_PATH=/tmp/thrott-detect.pid

				#

				# Global variables.

				#

				unset INTEL_DRM_CARD_INDEX

				unset GET_ACT_FREQ GET_ENF_FREQ GET_CAP_FREQ

				unset SET_MIN_FREQ SET_MAX_FREQ

				unset MONITOR_FREQ

				unset DETECT_THROTT

				unset DRY_RUN

				#

				# Simple printf based stderr logger.

				#

				log() {

				    local msg_type=$1

				    shift

				    printf "%s: %s: " "${msg_type}" "${0##*/}" >&2

				    printf "$@" >&2

				    printf "\n" >&2

				}

				#

				# Helper to print sysfs path for the given card index and freq info.

				#

				# arg1: Frequency info sysfs name, one of *_FREQ_INFO constants above

				# arg2: Video card index, defaults to INTEL_DRM_CARD_INDEX

				#

				print_freq_sysfs_path() {

				    printf ${DRM_FREQ_SYSFS_PATTERN} "${2:-${INTEL_DRM_CARD_INDEX}}" "$1"

				}

				#

				# Helper to set INTEL_DRM_CARD_INDEX for the first identified Intel video card.

				#

				identify_intel_gpu() {

				    local i=0 vendor path

				    while [ ${i} -lt 16 ]; do

				        [ -c "/dev/dri/card$i" ] || {

				            i=$((i + 1))

				            continue

				        }

				        path=$(print_freq_sysfs_path "" ${i})

				        path=${path%/*}/device/vendor

				        [ -r "${path}" ] && read vendor < "${path}" && \

				            [ "${vendor}" = "0x8086" ] && INTEL_DRM_CARD_INDEX=$i && return 0

				        i=$((i + 1))

				    done

				    return 1

				}

				#

				# Read the specified freq info from sysfs.

				#

				# arg1: Flag (y/n) to also enable printing the freq info.

				# arg2...: Frequency info sysfs name(s), see *_FREQ_INFO constants above

				# return: Global variable(s) FREQ_${arg} containing the requested information

				#

				read_freq_info() {

				    local var val path print=0 ret=0

				    [ "$1" = "y" ] && print=1

				    shift

				    while [ $# -gt 0 ]; do

				        var=FREQ_$1

				        path=$(print_freq_sysfs_path "$1")

				        [ -r ${path} ] && read ${var} < ${path} || {

				            log ERROR "Failed to read freq info from: %s" "${path}"

				            ret=1

				            continue

				        }

				        [ -n "${var}" ] || {

				            log ERROR "Got empty freq info from: %s" "${path}"

				            ret=1

				            continue

				        }

				        [ ${print} -eq 1 ] && {

				            eval val=\$${var}

				            printf "%6s: %4s MHz\n" "$1" "${val}"

				        }

				        shift

				    done

				    return ${ret}

				}

				#

				# Display requested info.

				#

				print_freq_info() {

				    local req_freq

				    [ -n "${GET_CAP_FREQ}" ] && {

				        printf "* Hardware capabilities\n"

				        read_freq_info y ${CAP_FREQ_INFO}

				        printf "\n"

				    }

				    [ -n "${GET_ENF_FREQ}" ] && {

				        printf "* Enforcements\n"

				        read_freq_info y ${ENF_FREQ_INFO}

				        printf "\n"

				    }

				    [ -n "${GET_ACT_FREQ}" ] && {

				        printf "* Actual\n"

				        read_freq_info y ${ACT_FREQ_INFO}

				        printf "\n"

				    }

				}

				#

				# Helper to print frequency value as requested by user via '-s, --set' option.

				# arg1: user requested freq value

				#

				compute_freq_set() {

				    local val

				    case "$1" in

				    +)

				        val=${FREQ_RP0}

				        ;;

				    -)

				        val=${FREQ_RPn}

				        ;;

				    *%)

				        val=$((${1%?} * ${FREQ_RP0} / 100))

				        # Adjust freq to comply with 50 MHz increments

				        val=$((val / 50 * 50))

				        ;;

				    *[!0-9]*)

				        log ERROR "Cannot set freq to invalid value: %s" "$1"

				        return 1

				        ;;

				    "")

				        log ERROR "Cannot set freq to unspecified value"

				        return 1

				        ;;

				    *)

				        # Adjust freq to comply with 50 MHz increments

				        val=$(($1 / 50 * 50))

				        ;;

				    esac

				    printf "%s" "${val}"

				}

				#

				# Helper for set_freq().

				#

				set_freq_max() {

				    log INFO "Setting GPU max freq to %s MHz" "${SET_MAX_FREQ}"

				    read_freq_info n min || return $?

				    [ ${SET_MAX_FREQ} -gt ${FREQ_RP0} ] && {

				        log ERROR "Cannot set GPU max freq (%s) to be greater than hw max freq (%s)" \

				            "${SET_MAX_FREQ}" "${FREQ_RP0}"

				        return 1

				    }

				    [ ${SET_MAX_FREQ} -lt ${FREQ_RPn} ] && {

				        log ERROR "Cannot set GPU max freq (%s) to be less than hw min freq (%s)" \

				            "${SET_MIN_FREQ}" "${FREQ_RPn}"

				        return 1

				    }

				    [ ${SET_MAX_FREQ} -lt ${FREQ_min} ] && {

				        log ERROR "Cannot set GPU max freq (%s) to be less than min freq (%s)" \

				            "${SET_MAX_FREQ}" "${FREQ_min}"

				        return 1

				    }

				    [ -z "${DRY_RUN}" ] || return 0

				    printf "%s" ${SET_MAX_FREQ} | tee $(print_freq_sysfs_path max) \

				        $(print_freq_sysfs_path boost) > /dev/null

				    [ $? -eq 0 ] || {

				        log ERROR "Failed to set GPU max frequency"

				        return 1

				    }

				}

				#

				# Helper for set_freq().

				#

				set_freq_min() {

				    log INFO "Setting GPU min freq to %s MHz" "${SET_MIN_FREQ}"

				    read_freq_info n max || return $?

				    [ ${SET_MIN_FREQ} -gt ${FREQ_max} ] && {

				        log ERROR "Cannot set GPU min freq (%s) to be greater than max freq (%s)" \

				            "${SET_MIN_FREQ}" "${FREQ_max}"

				        return 1

				    }

				    [ ${SET_MIN_FREQ} -lt ${FREQ_RPn} ] && {

				        log ERROR "Cannot set GPU min freq (%s) to be less than hw min freq (%s)" \

				            "${SET_MIN_FREQ}" "${FREQ_RPn}"

				        return 1

				    }

				    [ -z "${DRY_RUN}" ] || return 0

				    printf "%s" ${SET_MIN_FREQ} > $(print_freq_sysfs_path min)

				    [ $? -eq 0 ] || {

				        log ERROR "Failed to set GPU min frequency"

				        return 1

				    }

				}

				#

				# Set min or max or both GPU frequencies to the user indicated values.

				#

				set_freq() {

				    # Get hw max & min frequencies

				    read_freq_info n RP0 RPn || return $?

				    [ -z "${SET_MAX_FREQ}" ] || {

				        SET_MAX_FREQ=$(compute_freq_set "${SET_MAX_FREQ}")

				        [ -z "${SET_MAX_FREQ}" ] && return 1

				    }

				    [ -z "${SET_MIN_FREQ}" ] || {

				        SET_MIN_FREQ=$(compute_freq_set "${SET_MIN_FREQ}")

				        [ -z "${SET_MIN_FREQ}" ] && return 1

				    }

				    #

				    # Ensure correct operation order, to avoid setting min freq

				    # to a value which is larger than max freq.

				    #

				    # E.g.:

				    #   crt_min=crt_max=600; new_min=new_max=700

				    #   > operation order: max=700; min=700

				    #

				    #   crt_min=crt_max=600; new_min=new_max=500

				    #   > operation order: min=500; max=500

				    #

				    if [ -n "${SET_MAX_FREQ}" ] && [ -n "${SET_MIN_FREQ}" ]; then

				        [ ${SET_MAX_FREQ} -lt ${SET_MIN_FREQ} ] && {

				            log ERROR "Cannot set GPU max freq to be less than min freq"

				            return 1

				        }

				        read_freq_info n min || return $?

				        if [ ${SET_MAX_FREQ} -lt ${FREQ_min} ]; then

				            set_freq_min || return $?

				            set_freq_max

				        else

				            set_freq_max || return $?

				            set_freq_min

				        fi

				    elif [ -n "${SET_MAX_FREQ}" ]; then

				        set_freq_max

				    elif [ -n "${SET_MIN_FREQ}" ]; then

				        set_freq_min

				    else

				        log "Unexpected call to set_freq()"

				        return 1

				    fi

				}

				#

				# Helper for detect_throttling().

				#

				get_thrott_detect_pid() {

				    [ -e ${THROTT_DETECT_PID_FILE_PATH} ] || return 0

				    local pid

				    read pid < ${THROTT_DETECT_PID_FILE_PATH} || {

				        log ERROR "Failed to read pid from: %s" "${THROTT_DETECT_PID_FILE_PATH}"

				        return 1

				    }

				    local proc_path=/proc/${pid:-invalid}/cmdline

				    [ -r ${proc_path} ] && grep -qs "${0##*/}" ${proc_path} && {

				        printf "%s" "${pid}"

				        return 0

				    }

				    # Remove orphaned PID file

				    rm -rf ${THROTT_DETECT_PID_FILE_PATH}

				    return 1

				}

				#

				# Control detection and reporting of GPU throttling events.

				# arg1: start - run throttle detector in background

				#       stop - stop throttle detector process, if any

				#       status - verify if throttle detector is running

				#

				detect_throttling() {

				    local pid

				    pid=$(get_thrott_detect_pid)

				    case "$1" in

				    status)

				        printf "Throttling detector is "

				        [ -z "${pid}" ] && printf "not running\n" && return 0

				        printf "running (pid=%s)\n" ${pid}

				        ;;

				    stop)

				        [ -z "${pid}" ] && return 0

				        log INFO "Stopping throttling detector (pid=%s)" "${pid}"

				        kill ${pid}; sleep 1; kill -0 ${pid} 2>/dev/null && kill -9 ${pid}

				        rm -rf ${THROTT_DETECT_PID_FILE_PATH}

				        ;;

				    start)

				        [ -n "${pid}" ] && {

				            log WARN "Throttling detector is already running (pid=%s)" ${pid}

				            return 0

				        }

				        (

				            read_freq_info n RPn || exit $?

				            while true; do

				                sleep ${THROTT_DETECT_SLEEP_SEC}

				                read_freq_info n act min cur || exit $?

				                #

				                # The throttling seems to occur when act freq goes below min.

				                # However, it's necessary to exclude the idle states, where

				                # act freq normally reaches RPn and cur goes below min.

				                #

				                [ ${FREQ_act} -lt ${FREQ_min} ] && \

				                [ ${FREQ_act} -gt ${FREQ_RPn} ] && \

				                [ ${FREQ_cur} -ge ${FREQ_min} ] && \

				                    printf "GPU throttling detected: act=%s min=%s cur=%s RPn=%s\n" \

				                    ${FREQ_act} ${FREQ_min} ${FREQ_cur} ${FREQ_RPn}

				            done

				        ) &

				        pid=$!

				        log INFO "Started GPU throttling detector (pid=%s)" ${pid}

				        printf "%s\n" ${pid} > ${THROTT_DETECT_PID_FILE_PATH} || \

				            log WARN "Failed to write throttle detector PID file"

				        ;;

				    esac

				}

				#

				# Show help message.

				#

				print_usage() {

				    cat <<EOF

				Usage: ${0##*/} [OPTION]...

				A script to manage Intel GPU frequencies. Can be used for debugging performance

				problems or trying to obtain a stable frequency while benchmarking.

				Note Intel GPUs only accept specific frequencies, usually multiples of 50 MHz.

				Options:

				  -g, --get [act|enf|cap|all]

				                        Get frequency information: active (default), enforced,

				                        hardware capabilities or all of them.

				  -s, --set [{min|max}=]{FREQUENCY[%]|+|-}

				                        Set min or max frequency to the given value (MHz).

				                        Append '%' to interpret FREQUENCY as % of hw max.

				                        Use '+' or '-' to set frequency to hardware max or min.

				                        Omit min/max prefix to set both frequencies.

				  -r, --reset           Reset frequencies to hardware defaults.

				  -m, --monitor [act|enf|cap|all]

				                        Monitor the indicated frequencies via 'watch' utility.

				                        See '-g, --get' option for more details.

				  -d|--detect-thrott [start|stop|status]

				                        Start (default operation) the throttling detector

				                        as a background process. Use 'stop' or 'status' to

				                        terminate the detector process or verify its status.

				  --dry-run             See what the script will do without applying any

				                        frequency changes.

				  -h, --help            Display this help text and exit.

				EOF

				}

				#

				# Parse user input for '-g, --get' option.

				# Returns 0 if a value has been provided, otherwise 1.

				#

				parse_option_get() {

				    local ret=0

				    case "$1" in

				    act) GET_ACT_FREQ=1;;

				    enf) GET_ENF_FREQ=1;;

				    cap) GET_CAP_FREQ=1;;

				    all) GET_ACT_FREQ=1; GET_ENF_FREQ=1; GET_CAP_FREQ=1;;

				    -*|"")

				        # No value provided, using default.

				        GET_ACT_FREQ=1

				        ret=1

				        ;;

				    *)

				        print_usage

				        exit 1

				        ;;

				    esac

				    return ${ret}

				}

				#

				# Validate user input for '-s, --set' option.

				#

				validate_option_set() {

				    case "$1" in

				    +|-|[0-9]%|[0-9][0-9]%)

				        return 0

				        ;;

				    *[!0-9]*|"")

				        print_usage

				        exit 1

				        ;;

				    esac

				}

				#

				# Parse script arguments.

				#

				[ $# -eq 0 ] && { print_usage; exit 1; }

				while [ $# -gt 0 ]; do

				    case "$1" in

				    -g|--get)

				        parse_option_get "$2" && shift

				        ;;

				    -s|--set)

				        shift

				        case "$1" in

				        min=*)

				            SET_MIN_FREQ=${1#min=}

				            validate_option_set "${SET_MIN_FREQ}"

				            ;;

				        max=*)

				            SET_MAX_FREQ=${1#max=}

				            validate_option_set "${SET_MAX_FREQ}"

				            ;;

				        *)

				            SET_MIN_FREQ=$1

				            validate_option_set "${SET_MIN_FREQ}"

				            SET_MAX_FREQ=${SET_MIN_FREQ}

				            ;;

				        esac

				        ;;

				    -r|--reset)

				        RESET_FREQ=1

				        SET_MIN_FREQ="-"

				        SET_MAX_FREQ="+"

				        ;;

				    -m|--monitor)

				        MONITOR_FREQ=act

				        parse_option_get "$2" && MONITOR_FREQ=$2 && shift

				        ;;

				    -d|--detect-thrott)

				        DETECT_THROTT=start

				        case "$2" in

				        start|stop|status)

				            DETECT_THROTT=$2

				            shift

				            ;;

				        esac

				        ;;

				    --dry-run)

				        DRY_RUN=1

				        ;;

				    -h|--help)

				        print_usage

				        exit 0

				        ;;

				    *)

				        print_usage

				        exit 1

				        ;;

				    esac

				    shift

				done

				#

				# Main

				#

				RET=0

				identify_intel_gpu || {

				    log INFO "No Intel GPU detected"

				    exit 0

				}

				[ -n "${SET_MIN_FREQ}${SET_MAX_FREQ}" ] && { set_freq || RET=$?; }

				print_freq_info

				[ -n "${DETECT_THROTT}" ] && detect_throttling ${DETECT_THROTT}

				[ -n "${MONITOR_FREQ}" ] && {

				    log INFO "Entering frequency monitoring mode"

				    sleep 2

				    exec watch -d -n 1 "$0" -g "${MONITOR_FREQ}"

				}

				exit ${RET}

31

.gitlab-ci/container/arm64.config

View File

@@ -14,9 +14,9 @@ CONFIG_DRM_ROCKCHIP=y
 CONFIG_DRM_PANFROST=y
 CONFIG_DRM_LIMA=y
 CONFIG_DRM_PANEL_SIMPLE=y
 CONFIG_DRM_PANEL_EDP=y
 CONFIG_DRM_MSM=y
 CONFIG_DRM_I2C_ADV7511=y
 CONFIG_DRM_I2C_ADV7533=y
 CONFIG_PWM_CROS_EC=y
 CONFIG_BACKLIGHT_PWM=y
@@ -32,6 +32,11 @@ CONFIG_TYPEC=y
 CONFIG_TYPEC_TCPM=y
 # MSM platform bits
 # For CONFIG_QCOM_LMH
 CONFIG_OF=y
 CONFIG_QCOM_COMMAND_DB=y
 CONFIG_QCOM_RPMHPD=y
 CONFIG_QCOM_RPMPD=y
 CONFIG_SDM_GPUCC_845=y
@@ -45,9 +50,11 @@ CONFIG_I2C_QCOM_GENI=y
 CONFIG_SPI_QCOM_GENI=y
 CONFIG_PHY_QCOM_QUSB2=y
 CONFIG_PHY_QCOM_QMP=y
 CONFIG_QCOM_LLCC=y
 CONFIG_QCOM_SPMI_TEMP_ALARM=y
 CONFIG_QCOM_CLK_APCC_MSM8996=y
 CONFIG_QCOM_LLCC=y
 CONFIG_QCOM_LMH=y
 CONFIG_QCOM_SPMI_TEMP_ALARM=y
 CONFIG_QCOM_WDT=y
 CONFIG_POWER_RESET_QCOM_PON=y
 CONFIG_RTC_DRV_PM8XXX=y
 CONFIG_INTERCONNECT=y
@@ -56,8 +63,9 @@ CONFIG_INTERCONNECT_QCOM_SDM845=y
 CONFIG_INTERCONNECT_QCOM_MSM8916=y
 CONFIG_INTERCONNECT_QCOM_OSM_L3=y
 CONFIG_INTERCONNECT_QCOM_SC7180=y
 CONFIG_QCOM_WDT=y
 CONFIG_CRYPTO_DEV_QCOM_RNG=y
 CONFIG_SC_DISPCC_7180=y
 CONFIG_SC_GPUCC_7180=y
 # db410c ethernet
 CONFIG_USB_RTL8152=y
@@ -142,8 +150,23 @@ CONFIG_DRM_MESON=y
 CONFIG_DRM_MEDIATEK=y
 CONFIG_PWM_MEDIATEK=y
 CONFIG_DRM_MEDIATEK_HDMI=y
 CONFIG_GNSS=y
 CONFIG_GNSS_MTK_SERIAL=y
 CONFIG_HW_RANDOM=y
 CONFIG_HW_RANDOM_MTK=y
 CONFIG_MTK_DEVAPC=y
 CONFIG_PWM_MTK_DISP=y
 CONFIG_MTK_CMDQ=y
 # For nouveau.  Note that DRM must be a module so that it's loaded after NFS is up to provide the firmware.
 CONFIG_ARCH_TEGRA=y
 CONFIG_DRM_NOUVEAU=m
 CONFIG_DRM_TEGRA=m
 CONFIG_R8169=y
 CONFIG_STAGING=y
 CONFIG_DRM_TEGRA_STAGING=y
 CONFIG_TEGRA_HOST1X=y
 CONFIG_ARM_TEGRA_DEVFREQ=y
 CONFIG_TEGRA_SOCTHERM=y
 CONFIG_DRM_TEGRA_DEBUG=y
 CONFIG_PWM_TEGRA=y

									
										5

.gitlab-ci/container/baremetal_build.sh
									
												View File
												
				@@ -25,7 +25,10 @@ if [[ $arch == "arm64" ]]; then

				    wget ${ARTIFACTS_URL}/Image.gz

				    wget ${ARTIFACTS_URL}/cheza-kernel

				    DEVICE_TREES="apq8016-sbc.dtb apq8096-db820c.dtb"

				    DEVICE_TREES=""

				    DEVICE_TREES="$DEVICE_TREES apq8016-sbc.dtb"

				    DEVICE_TREES="$DEVICE_TREES apq8096-db820c.dtb"

				    DEVICE_TREES="$DEVICE_TREES tegra210-p3450-0000.dtb"

				    for DTB in $DEVICE_TREES; do

				        wget ${ARTIFACTS_URL}/$DTB

									
										56

.gitlab-ci/container/build-crosvm.sh
									
												View File
												
				@@ -2,47 +2,25 @@

				set -ex

				# Pull down repositories that crosvm depends on to cros checkout-like locations.

				CROS_ROOT=/

				THIRD_PARTY_ROOT=$CROS_ROOT/third_party

				mkdir -p $THIRD_PARTY_ROOT

				AOSP_EXTERNAL_ROOT=$CROS_ROOT/aosp/external

				mkdir -p $AOSP_EXTERNAL_ROOT

				PLATFORM2_ROOT=/platform2

				SCRIPT_DIR="$(pwd)"

				PLATFORM2_COMMIT=72e56e66ccf3d2ea48f5686bd1f772379c43628b

				git clone --single-branch --no-checkout https://chromium.googlesource.com/chromiumos/platform2 $PLATFORM2_ROOT

				pushd $PLATFORM2_ROOT

				git checkout $PLATFORM2_COMMIT

				popd

				# minijail does not exist in upstream linux distros.

				MINIJAIL_COMMIT=debdf5de5a0ae3b667bee2f8fb1f755b0b3f5a6c

				git clone --single-branch --no-checkout https://android.googlesource.com/platform/external/minijail $AOSP_EXTERNAL_ROOT/minijail

				pushd $AOSP_EXTERNAL_ROOT/minijail

				git checkout $MINIJAIL_COMMIT

				make

				cp libminijail.so /usr/lib/x86_64-linux-gnu/

				popd

				# Pull the cras library for audio access.

				ADHD_COMMIT=a1e0869b95c845c4fe6234a7b92fdfa6acc1e809

				git clone --single-branch --no-checkout https://chromium.googlesource.com/chromiumos/third_party/adhd $THIRD_PARTY_ROOT/adhd

				pushd $THIRD_PARTY_ROOT/adhd

				git checkout $ADHD_COMMIT

				popd

				# Pull vHost (dataplane for virtio backend drivers)

				VHOST_COMMIT=3091854e27242d09453004b011f701fa29c0b8e8

				git clone --single-branch --no-checkout https://chromium.googlesource.com/chromiumos/third_party/rust-vmm/vhost $THIRD_PARTY_ROOT/rust-vmm/vhost

				pushd $THIRD_PARTY_ROOT/rust-vmm/vhost

				git checkout $VHOST_COMMIT

				popd

				CROSVM_VERSION=e42a43d880b0364b55559dbeade3af174f929001

				git clone --single-branch --no-checkout https://chromium.googlesource.com/chromiumos/platform/crosvm /platform/crosvm

				CROSVM_VERSION=c7cd0e0114c8363b884ba56d8e12adee718dcc93

				git clone --single-branch -b main --no-checkout https://chromium.googlesource.com/chromiumos/platform/crosvm /platform/crosvm

				pushd /platform/crosvm

				git checkout "$CROSVM_VERSION"

				git submodule update --init

				# Apply all crosvm patches for Mesa CI

				cat "$SCRIPT_DIR"/.gitlab-ci/container/build-crosvm_*.patch |

				    patch -p1

				VIRGLRENDERER_VERSION=0564c9a0c2f584e004a7d4864aee3b8ec9692105

				rm -rf third_party/virglrenderer

				git clone --single-branch -b master --no-checkout https://gitlab.freedesktop.org/virgl/virglrenderer.git third_party/virglrenderer

				pushd third_party/virglrenderer

				git checkout "$VIRGLRENDERER_VERSION"

				meson build/ $EXTRA_MESON_ARGS

				ninja -C build install

				popd

				RUSTFLAGS='-L native=/usr/local/lib' cargo install \

				  bindgen \

				@@ -60,4 +38,4 @@ RUSTFLAGS='-L native=/usr/local/lib' cargo install \

				popd

				rm -rf $PLATFORM2_ROOT $AOSP_EXTERNAL_ROOT/minijail $THIRD_PARTY_ROOT/adhd $THIRD_PARTY_ROOT/rust-vmm /platform/crosvm

				rm -rf /platform/crosvm

									
										43

.gitlab-ci/container/build-crosvm_no-syslog.patch
									
										Normal file
									
												View File
												
				@@ -0,0 +1,43 @@

				From 3c57ec558bccc67fd53363c23deea20646be5c47 Mon Sep 17 00:00:00 2001

				From: Tomeu Vizoso <tomeu.vizoso@collabora.com>

				Date: Wed, 17 Nov 2021 10:18:04 +0100

				Subject: [PATCH] Hack syslog out

				It's causing stability problems when running several Crosvm instances in

				parallel.

				Signed-off-by: Tomeu Vizoso <tomeu.vizoso@collabora.com>

				---

				 base/src/unix/linux/syslog.rs       | 2 +-

				 common/sys_util/src/linux/syslog.rs | 2 +-

				 2 files changed, 2 insertions(+), 2 deletions(-)

				diff --git a/base/src/unix/linux/syslog.rs b/base/src/unix/linux/syslog.rs

				index 05972a3a..f0db3781 100644

				--- a/base/src/unix/linux/syslog.rs

				+++ b/base/src/unix/linux/syslog.rs

				@@ -35,7 +35,7 @@ pub struct PlatformSyslog {

				 impl Syslog for PlatformSyslog {

				     fn new() -> Result<Self, Error> {

				         Ok(Self {

				-            socket: Some(openlog_and_get_socket()?),

				+            socket: None,

				         })

				     }

				diff --git a/common/sys_util/src/linux/syslog.rs b/common/sys_util/src/linux/syslog.rs

				index 05972a3a..f0db3781 100644

				--- a/common/sys_util/src/linux/syslog.rs

				+++ b/common/sys_util/src/linux/syslog.rs

				@@ -35,7 +35,7 @@ pub struct PlatformSyslog {

				 impl Syslog for PlatformSyslog {

				     fn new() -> Result<Self, Error> {

				         Ok(Self {

				-            socket: Some(openlog_and_get_socket()?),

				+            socket: None,

				         })

				     }

				-- 

				2.25.1

									
										27

.gitlab-ci/container/build-deqp-runner.sh
									
												View File
												
				@@ -1,9 +1,24 @@

				#!/bin/bash

				#!/bin/sh

				set -ex

				cargo install --locked deqp-runner \

				  -j ${FDO_CI_CONCURRENT:-4} \

				  --version 0.9.0 \

				  --root /usr/local \

				  $EXTRA_CARGO_ARGS

				if [ -n "${DEQP_RUNNER_GIT_TAG}${DEQP_RUNNER_GIT_REV}" ]; then

				    # Build and install from source

				    DEQP_RUNNER_CARGO_ARGS="--git ${DEQP_RUNNER_GIT_URL:-https://gitlab.freedesktop.org/anholt/deqp-runner.git}"

				    if [ -n "${DEQP_RUNNER_GIT_TAG}" ]; then

				        DEQP_RUNNER_CARGO_ARGS="--tag ${DEQP_RUNNER_GIT_TAG} ${DEQP_RUNNER_CARGO_ARGS}"

				    else

				        DEQP_RUNNER_CARGO_ARGS="--rev ${DEQP_RUNNER_GIT_REV} ${DEQP_RUNNER_CARGO_ARGS}"

				    fi

				    DEQP_RUNNER_CARGO_ARGS="${DEQP_RUNNER_CARGO_ARGS} ${EXTRA_CARGO_ARGS}"

				else

				    # Install from package registry

				    DEQP_RUNNER_CARGO_ARGS="--version 0.13.1 ${EXTRA_CARGO_ARGS} -- deqp-runner"

				fi

				cargo install --locked  \

				    -j ${FDO_CI_CONCURRENT:-4} \

				    --root /usr/local \

				    ${DEQP_RUNNER_CARGO_ARGS}

									
										10

.gitlab-ci/container/build-deqp.sh
									
												View File
												
				@@ -6,11 +6,15 @@ git config --global user.email "mesa@example.com"

				git config --global user.name "Mesa CI"

				git clone \

				    https://github.com/KhronosGroup/VK-GL-CTS.git \

				    -b vulkan-cts-1.2.7.1 \

				    -b vulkan-cts-1.3.1.1 \

				    --depth 1 \

				    /VK-GL-CTS

				pushd /VK-GL-CTS

				# Cherry-pick fix for zlib dependency

				git fetch origin main

				git cherry-pick -x ec1804831b654ac55bd2a7a5dd27a556afe05030

				# --insecure is due to SSL cert failures hitting sourceforge for zlib and

				# libpng (sigh).  The archives get their checksums checked anyway, and git

				# always goes through ssh or https.

				@@ -68,7 +72,11 @@ cp /deqp/executor/testlog-to-* /deqp/executor.save

				rm -rf /deqp/executor

				mv /deqp/executor.save /deqp/executor

				# Remove other mustpass files, since we saved off the ones we wanted to conventient locations above.

				rm -rf /deqp/external/openglcts/modules/gl_cts/data/mustpass

				rm -rf /deqp/external/vulkancts/modules/vulkan/vk-master*

				rm -rf /deqp/external/vulkancts/modules/vulkan/vk-default

				rm -rf /deqp/external/openglcts/modules/cts-runner

				rm -rf /deqp/modules/internal

				rm -rf /deqp/execserver

2

.gitlab-ci/container/build-fossilize.sh

View File

@@ -4,7 +4,7 @@ set -ex
 git clone https://github.com/ValveSoftware/Fossilize.git
 cd Fossilize
 git checkout 72088685d90bc814d14aad5505354ffa8a642789
 git checkout 16fba1b8b5d9310126bb02323d7bae3227338461
 git submodule update --init
 mkdir build
 cd build

									
										2

.gitlab-ci/container/build-kernel.sh
									
												View File
												
				@@ -28,7 +28,7 @@ if [[ -n ${DEVICE_TREES} ]]; then

				    cp ${DEVICE_TREES} /lava-files/.

				fi

				if [[ ${DEBIAN_ARCH} = "amd64" ]]; then

				if [[ ${DEBIAN_ARCH} = "amd64" || ${DEBIAN_ARCH} = "arm64" ]]; then

				    make modules

				    INSTALL_MOD_PATH=/lava-files/rootfs-${DEBIAN_ARCH}/ make modules_install

				fi

									
										2

.gitlab-ci/container/build-libdrm.sh
									
												View File
												
				@@ -2,7 +2,7 @@

				set -ex

				export LIBDRM_VERSION=libdrm-2.4.107

				export LIBDRM_VERSION=libdrm-2.4.110

				wget https://dri.freedesktop.org/libdrm/$LIBDRM_VERSION.tar.xz

				tar -xvf $LIBDRM_VERSION.tar.xz && rm $LIBDRM_VERSION.tar.xz

									
										2

.gitlab-ci/container/build-piglit.sh
									
												View File
												
				@@ -4,7 +4,7 @@ set -ex

				git clone https://gitlab.freedesktop.org/mesa/piglit.git --single-branch --no-checkout /piglit

				pushd /piglit

				git checkout 7d7dd2688c214e1b3c00f37226500cbec4a58efb

				git checkout 445711587d461539a4d8f9d35a7fe996a86d3c8d

				patch -p1 <$OLDPWD/.gitlab-ci/piglit/disable-vs_in.diff

				cmake -S . -B . -G Ninja -DCMAKE_BUILD_TYPE=Release $PIGLIT_OPTS $EXTRA_CMAKE_ARGS

				ninja $PIGLIT_BUILD_TARGETS

									
										97

.gitlab-ci/container/build-skqp.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,97 @@

				#!/bin/bash

				#

				# Copyright (C) 2022 Collabora Limited

				# Author: Guilherme Gallo <guilherme.gallo@collabora.com>

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,

				# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE

				# SOFTWARE.

				create_gn_args() {

				    # gn can be configured to cross-compile skia and its tools

				    # It is important to set the target_cpu to guarantee the intended target

				    # machine

				    cp "${BASE_ARGS_GN_FILE}" "${SKQP_OUT_DIR}"/args.gn

				    echo "target_cpu = \"${SKQP_ARCH}\"" >> "${SKQP_OUT_DIR}"/args.gn

				}

				download_skia_source() {

				    if [ -z ${SKIA_DIR+x} ]

				    then

				        return 1

				    fi

				    # Skia cloned from https://android.googlesource.com/platform/external/skqp

				    # has all needed assets tracked on git-fs

				    SKQP_REPO=https://android.googlesource.com/platform/external/skqp

				    SKQP_BRANCH=android-cts-10.0_r11

				    git clone --branch "${SKQP_BRANCH}" --depth 1 "${SKQP_REPO}" "${SKIA_DIR}"

				}

				set -ex

				SCRIPT_DIR=$(realpath "$(dirname "$0")")

				SKQP_PATCH_DIR="${SCRIPT_DIR}"

				BASE_ARGS_GN_FILE="${SCRIPT_DIR}/build-skqp_base.gn"

				SKQP_ARCH=${SKQP_ARCH:-x64}

				SKIA_DIR=${SKIA_DIR:-$(mktemp -d)}

				SKQP_OUT_DIR=${SKIA_DIR}/out/${SKQP_ARCH}

				SKQP_INSTALL_DIR=/skqp

				SKQP_ASSETS_DIR="${SKQP_INSTALL_DIR}/assets"

				SKQP_BINARIES=(skqp)

				download_skia_source

				pushd "${SKIA_DIR}"

				# Apply all skqp patches for Mesa CI

				cat "${SKQP_PATCH_DIR}"/build-skqp_*.patch |

				    patch -p1

				# Fetch some needed build tools needed to build skia/skqp.

				# Basically, it clones repositories with commits SHAs from ${SKIA_DIR}/DEPS

				# directory.

				python tools/git-sync-deps

				mkdir -p "${SKQP_OUT_DIR}"

				mkdir -p "${SKQP_INSTALL_DIR}"

				create_gn_args

				# Build and install skqp binaries

				bin/gn gen "${SKQP_OUT_DIR}"

				for BINARY in "${SKQP_BINARIES[@]}"

				do

				    /usr/bin/ninja -C "${SKQP_OUT_DIR}" "${BINARY}"

				    # Strip binary, since gn is not stripping it even when `is_debug == false`

				    ${STRIP_CMD:-strip} "${SKQP_OUT_DIR}/${BINARY}"

				    install -m 0755 "${SKQP_OUT_DIR}/${BINARY}" "${SKQP_INSTALL_DIR}"

				done

				# Move assets to the target directory, which will reside in rootfs.

				mv platform_tools/android/apps/skqp/src/main/assets/ "${SKQP_ASSETS_DIR}"

				popd

				rm -Rf "${SKIA_DIR}"

				set +ex

									
										13

.gitlab-ci/container/build-skqp_BUILD.gn.patch
									
										Normal file
									
												View File
												
				@@ -0,0 +1,13 @@

				diff --git a/BUILD.gn b/BUILD.gn

				index d2b1407..7b60c90 100644

				--- a/BUILD.gn

				+++ b/BUILD.gn

				@@ -144,7 +144,7 @@ config("skia_public") {

				 # Skia internal APIs, used by Skia itself and a few test tools.

				 config("skia_private") {

				-  visibility = [ ":*" ]

				+  visibility = [ "*" ]

				   include_dirs = [

				     "include/private",

47

.gitlab-ci/container/build-skqp_base.gn Normal file

View File

@@ -0,0 +1,47 @@
 cc = "clang"
 cxx = "clang++"
 extra_cflags = [ "-DSK_ENABLE_DUMP_GPU", "-DSK_BUILD_FOR_SKQP" ]
 extra_cflags_cc = [
         "-Wno-error",
         # skqp build process produces a lot of compilation warnings, silencing
         # most of them to remove clutter and avoid the CI job log to exceed the
         # maximum size
         # GCC flags
         "-Wno-redundant-move",
         "-Wno-suggest-override",
         "-Wno-class-memaccess",
         "-Wno-deprecated-copy",
         "-Wno-uninitialized",
         # Clang flags
         "-Wno-macro-redefined",
         "-Wno-anon-enum-enum-conversion",
         "-Wno-suggest-destructor-override",
         "-Wno-return-std-move-in-c++11",
         "-Wno-extra-semi-stmt",
     ]
 cc_wrapper = "ccache"
 is_debug = false
 skia_enable_fontmgr_android = false
 skia_enable_fontmgr_empty = true
 skia_enable_pdf = false
 skia_enable_skottie = false
 skia_skqp_global_error_tolerance = 8
 skia_tools_require_resources = true
 skia_use_dng_sdk = false
 skia_use_expat = true
 skia_use_icu = false
 skia_use_libheif = false
 skia_use_lua = false
 skia_use_piex = false
 skia_use_vulkan = true
 target_os = "linux"

									
										68

.gitlab-ci/container/build-skqp_fetch_gn.patch
									
										Normal file
									
												View File
												
				@@ -0,0 +1,68 @@

				diff --git a/bin/fetch-gn b/bin/fetch-gn

				index d5e94a2..59c4591 100755

				--- a/bin/fetch-gn

				+++ b/bin/fetch-gn

				@@ -5,39 +5,44 @@

				 # Use of this source code is governed by a BSD-style license that can be

				 # found in the LICENSE file.

				-import hashlib

				 import os

				+import platform

				 import shutil

				 import stat

				 import sys

				-import urllib2

				+import tempfile

				+import zipfile

				+

				+if sys.version_info[0] < 3:

				+  from urllib2 import urlopen

				+else:

				+  from urllib.request import urlopen

				 os.chdir(os.path.join(os.path.dirname(__file__), os.pardir))

				-dst = 'bin/gn.exe' if 'win32' in sys.platform else 'bin/gn'

				+gnzip = os.path.join(tempfile.mkdtemp(), 'gn.zip')

				+with open(gnzip, 'wb') as f:

				+  OS  = {'darwin': 'mac', 'linux': 'linux', 'linux2': 'linux', 'win32': 'windows'}[sys.platform]

				+  cpu = {'amd64': 'amd64', 'arm64': 'arm64', 'x86_64': 'amd64', 'aarch64': 'arm64'}[platform.machine().lower()]

				-sha1 = '2f27ff0b6118e5886df976da5effa6003d19d1ce' if 'linux'  in sys.platform else \

				-       '9be792dd9010ce303a9c3a497a67bcc5ac8c7666' if 'darwin' in sys.platform else \

				-       'eb69be2d984b4df60a8c21f598135991f0ad1742'  # Windows

				+  rev = 'd62642c920e6a0d1756316d225a90fd6faa9e21e'

				+  url = 'https://chrome-infra-packages.appspot.com/dl/gn/gn/{}-{}/+/git_revision:{}'.format(

				+          OS,cpu,rev)

				+  f.write(urlopen(url).read())

				-def sha1_of_file(path):

				-  h = hashlib.sha1()

				-  if os.path.isfile(path):

				-    with open(path, 'rb') as f:

				-      h.update(f.read())

				-  return h.hexdigest()

				+gn = 'gn.exe' if 'win32' in sys.platform else 'gn'

				+with zipfile.ZipFile(gnzip, 'r') as f:

				+  f.extract(gn, 'bin')

				-if sha1_of_file(dst) != sha1:

				-  with open(dst, 'wb') as f:

				-    f.write(urllib2.urlopen('https://chromium-gn.storage-download.googleapis.com/' + sha1).read())

				+gn = os.path.join('bin', gn)

				-  os.chmod(dst, stat.S_IRUSR | stat.S_IWUSR | stat.S_IXUSR |

				-                stat.S_IRGRP                | stat.S_IXGRP |

				-                stat.S_IROTH                | stat.S_IXOTH )

				+os.chmod(gn, stat.S_IRUSR | stat.S_IWUSR | stat.S_IXUSR |

				+             stat.S_IRGRP                | stat.S_IXGRP |

				+             stat.S_IROTH                | stat.S_IXOTH )

				 # We'll also copy to a path that depot_tools' GN wrapper will expect to find the binary.

				 copy_path = 'buildtools/linux64/gn' if 'linux'  in sys.platform else \

				             'buildtools/mac/gn'     if 'darwin' in sys.platform else \

				             'buildtools/win/gn.exe'

				 if os.path.isdir(os.path.dirname(copy_path)):

				-  shutil.copy(dst, copy_path)

				+  shutil.copy(gn, copy_path)

									
										142

.gitlab-ci/container/build-skqp_git-sync-deps.patch
									
										Normal file
									
												View File
												
				@@ -0,0 +1,142 @@

				Patch based from diff with skia repository from commit

				013397884c73959dc07cb0a26ee742b1cdfbda8a

				Adds support for Python3, but removes the constraint of only SHA based refs in

				DEPS

				diff --git a/tools/git-sync-deps b/tools/git-sync-deps

				index c7379c0b5c..f63d4d9ccf 100755

				--- a/tools/git-sync-deps

				+++ b/tools/git-sync-deps

				@@ -43,7 +43,7 @@ def git_executable():

				       A string suitable for passing to subprocess functions, or None.

				   """

				   envgit = os.environ.get('GIT_EXECUTABLE')

				-  searchlist = ['git']

				+  searchlist = ['git', 'git.bat']

				   if envgit:

				     searchlist.insert(0, envgit)

				   with open(os.devnull, 'w') as devnull:

				@@ -94,21 +94,25 @@ def is_git_toplevel(git, directory):

				   try:

				     toplevel = subprocess.check_output(

				       [git, 'rev-parse', '--show-toplevel'], cwd=directory).strip()

				-    return os.path.realpath(directory) == os.path.realpath(toplevel)

				+    return os.path.realpath(directory) == os.path.realpath(toplevel.decode())

				   except subprocess.CalledProcessError:

				     return False

				-def status(directory, checkoutable):

				-  def truncate(s, length):

				+def status(directory, commithash, change):

				+  def truncate_beginning(s, length):

				+    return s if len(s) <= length else '...' + s[-(length-3):]

				+  def truncate_end(s, length):

				     return s if len(s) <= length else s[:(length - 3)] + '...'

				+

				   dlen = 36

				-  directory = truncate(directory, dlen)

				-  checkoutable = truncate(checkoutable, 40)

				-  sys.stdout.write('%-*s @ %s\n' % (dlen, directory, checkoutable))

				+  directory = truncate_beginning(directory, dlen)

				+  commithash = truncate_end(commithash, 40)

				+  symbol = '>' if change else '@'

				+  sys.stdout.write('%-*s %s %s\n' % (dlen, directory, symbol, commithash))

				-def git_checkout_to_directory(git, repo, checkoutable, directory, verbose):

				+def git_checkout_to_directory(git, repo, commithash, directory, verbose):

				   """Checkout (and clone if needed) a Git repository.

				   Args:

				@@ -117,8 +121,7 @@ def git_checkout_to_directory(git, repo, checkoutable, directory, verbose):

				     repo (string) the location of the repository, suitable

				          for passing to `git clone`.

				-    checkoutable (string) a tag, branch, or commit, suitable for

				-                 passing to `git checkout`

				+    commithash (string) a commit, suitable for passing to `git checkout`

				     directory (string) the path into which the repository

				               should be checked out.

				@@ -129,7 +132,12 @@ def git_checkout_to_directory(git, repo, checkoutable, directory, verbose):

				   """

				   if not os.path.isdir(directory):

				     subprocess.check_call(

				-      [git, 'clone', '--quiet', repo, directory])

				+      [git, 'clone', '--quiet', '--no-checkout', repo, directory])

				+    subprocess.check_call([git, 'checkout', '--quiet', commithash],

				+                          cwd=directory)

				+    if verbose:

				+      status(directory, commithash, True)

				+    return

				   if not is_git_toplevel(git, directory):

				     # if the directory exists, but isn't a git repo, you will modify

				@@ -145,11 +153,11 @@ def git_checkout_to_directory(git, repo, checkoutable, directory, verbose):

				   with open(os.devnull, 'w') as devnull:

				     # If this fails, we will fetch before trying again.  Don't spam user

				     # with error infomation.

				-    if 0 == subprocess.call([git, 'checkout', '--quiet', checkoutable],

				+    if 0 == subprocess.call([git, 'checkout', '--quiet', commithash],

				                             cwd=directory, stderr=devnull):

				       # if this succeeds, skip slow `git fetch`.

				       if verbose:

				-        status(directory, checkoutable)  # Success.

				+        status(directory, commithash, False)  # Success.

				       return

				   # If the repo has changed, always force use of the correct repo.

				@@ -159,18 +167,24 @@ def git_checkout_to_directory(git, repo, checkoutable, directory, verbose):

				   subprocess.check_call([git, 'fetch', '--quiet'], cwd=directory)

				-  subprocess.check_call([git, 'checkout', '--quiet', checkoutable], cwd=directory)

				+  subprocess.check_call([git, 'checkout', '--quiet', commithash], cwd=directory)

				   if verbose:

				-    status(directory, checkoutable)  # Success.

				+    status(directory, commithash, True)  # Success.

				 def parse_file_to_dict(path):

				   dictionary = {}

				-  execfile(path, dictionary)

				+  with open(path) as f:

				+    exec('def Var(x): return vars[x]\n' + f.read(), dictionary)

				   return dictionary

				+def is_sha1_sum(s):

				+  """SHA1 sums are 160 bits, encoded as lowercase hexadecimal."""

				+  return len(s) == 40 and all(c in '0123456789abcdef' for c in s)

				+

				+

				 def git_sync_deps(deps_file_path, command_line_os_requests, verbose):

				   """Grab dependencies, with optional platform support.

				@@ -204,19 +218,19 @@ def git_sync_deps(deps_file_path, command_line_os_requests, verbose):

				         raise Exception('%r is parent of %r' % (other_dir, directory))

				   list_of_arg_lists = []

				   for directory in sorted(dependencies):

				-    if not isinstance(dependencies[directory], basestring):

				+    if not isinstance(dependencies[directory], str):

				       if verbose:

				-        print 'Skipping "%s".' % directory

				+        sys.stdout.write( 'Skipping "%s".\n' % directory)

				       continue

				     if '@' in dependencies[directory]:

				-      repo, checkoutable = dependencies[directory].split('@', 1)

				+      repo, commithash = dependencies[directory].split('@', 1)

				     else:

				-      raise Exception("please specify commit or tag")

				+      raise Exception("please specify commit")

				     relative_directory = os.path.join(deps_file_directory, directory)

				     list_of_arg_lists.append(

				-      (git, repo, checkoutable, relative_directory, verbose))

				+      (git, repo, commithash, relative_directory, verbose))

				   multithread(git_checkout_to_directory, list_of_arg_lists)

									
										13

.gitlab-ci/container/build-skqp_is_clang.py.patch
									
										Normal file
									
												View File
												
				@@ -0,0 +1,13 @@

				diff --git a/gn/BUILDCONFIG.gn b/gn/BUILDCONFIG.gn

				index 454334a..1797594 100644

				--- a/gn/BUILDCONFIG.gn

				+++ b/gn/BUILDCONFIG.gn

				@@ -80,7 +80,7 @@ if (current_cpu == "") {

				 is_clang = is_android || is_ios || is_mac ||

				            (cc == "clang" && cxx == "clang++") || clang_win != ""

				 if (!is_clang && !is_win) {

				-  is_clang = exec_script("gn/is_clang.py",

				+  is_clang = exec_script("//gn/is_clang.py",

				                          [

				                            cc,

				                            cxx,

									
										17

.gitlab-ci/container/build-va-tools.sh
									
										Normal file
									
												View File
												
				@@ -0,0 +1,17 @@

				#!/bin/bash

				set -ex

				git config --global user.email "mesa@example.com"

				git config --global user.name "Mesa CI"

				git clone \

				    https://github.com/intel/libva-utils.git \

				    -b 2.13.0 \

				    --depth 1 \

				    /va-utils

				pushd /va-utils

				meson build -D tests=true  -Dprefix=/va $EXTRA_MESON_ARGS

				ninja -C build install

				popd

				rm -rf /va-utils

									
										20

.gitlab-ci/container/build-virglrenderer.sh
									
												View File
											
				@@ -1,20 +0,0 @@

				#!/bin/bash

				set -ex

				mkdir -p /epoxy

				pushd /epoxy

				wget -qO- https://github.com/anholt/libepoxy/releases/download/1.5.8/libepoxy-1.5.8.tar.xz | tar -xJ --strip-components=1

				meson build/ $EXTRA_MESON_ARGS

				ninja -C build install

				popd

				rm -rf /epoxy

				VIRGLRENDERER_VERSION=f2ab66c6c00065b2944f4cd9d965ee455c535271

				git clone https://gitlab.freedesktop.org/virgl/virglrenderer.git --single-branch --no-checkout /virglrenderer

				pushd /virglrenderer

				git checkout "$VIRGLRENDERER_VERSION"

				meson build/ $EXTRA_MESON_ARGS

				ninja -C build install

				popd

				rm -rf /virglrenderer

									
										4

.gitlab-ci/container/build-vkd3d-proton.sh
									
												View File
												
				@@ -2,8 +2,8 @@

				set -ex

				VKD3D_PROTON_VERSION="2.3.1"

				VKD3D_PROTON_COMMIT="3ed3526332f53d7d35cf1b685fa8096b01f26ff0"

				VKD3D_PROTON_VERSION="2.6"

				VKD3D_PROTON_COMMIT="3e5aab6fb3e18f81a71b339be4cb5cdf55140980"

				VKD3D_PROTON_DST_DIR="/vkd3d-proton-tests"

				VKD3D_PROTON_SRC_DIR="/vkd3d-proton-src"

									
										22

.gitlab-ci/container/build-wayland.sh
									
										Normal file
									
												View File
												
				@@ -0,0 +1,22 @@

				#!/bin/bash

				set -ex

				export LIBWAYLAND_VERSION="1.18.0"

				export WAYLAND_PROTOCOLS_VERSION="1.24"

				git clone https://gitlab.freedesktop.org/wayland/wayland

				cd wayland

				git checkout "$LIBWAYLAND_VERSION"

				meson -Ddocumentation=false -Ddtd_validation=false -Dlibraries=true _build

				ninja -C _build install

				cd ..

				rm -rf wayland

				git clone https://gitlab.freedesktop.org/wayland/wayland-protocols

				cd wayland-protocols

				git checkout "$WAYLAND_PROTOCOLS_VERSION"

				meson _build

				ninja -C _build install

				cd ..

				rm -rf wayland-protocols

									
										2

.gitlab-ci/container/container_pre_build.sh
									
												View File
												
				@@ -10,7 +10,7 @@ fi

				export CCACHE_COMPILERCHECK=content

				export CCACHE_COMPRESS=true

				export CCACHE_DIR=/cache/mesa/ccache

				export CCACHE_DIR=/cache/$CI_PROJECT_NAME/ccache

				export PATH=$CCACHE_PATH:$PATH

				# CMake ignores $PATH, so we have to force CC/GCC to the ccache versions.

									
										2

.gitlab-ci/container/create-android-cross-file.sh
									
												View File
												
				@@ -17,7 +17,7 @@ cat >$cross_file <<EOF

				[binaries]

				ar = '$ndk/toolchains/llvm/prebuilt/linux-x86_64/bin/$arch-ar'

				c = ['ccache', '$ndk/toolchains/llvm/prebuilt/linux-x86_64/bin/${arch2}29-clang', '-fno-exceptions', '-fno-unwind-tables', '-fno-asynchronous-unwind-tables']

				cpp = ['ccache', '$ndk/toolchains/llvm/prebuilt/linux-x86_64/bin/${arch2}29-clang++', '-fno-exceptions', '-fno-unwind-tables', '-fno-asynchronous-unwind-tables', '-static-libstdc++']

				cpp = ['ccache', '$ndk/toolchains/llvm/prebuilt/linux-x86_64/bin/${arch2}29-clang++', '-fno-exceptions', '-fno-unwind-tables', '-fno-asynchronous-unwind-tables']

				c_ld = 'lld'

				cpp_ld = 'lld'

				strip = '$ndk/toolchains/llvm/prebuilt/linux-x86_64/bin/$arch-strip'

									
										34

.gitlab-ci/container/create-rootfs.sh
									
												View File
												
				@@ -3,11 +3,26 @@

				set -ex

				if [ $DEBIAN_ARCH = arm64 ]; then

				    ARCH_PACKAGES="firmware-qcom-media"

				    ARCH_PACKAGES="firmware-qcom-media

				                   firmware-linux-nonfree

				                   libfontconfig1

				                   libgl1

				                   libglu1-mesa

				                   libvulkan-dev

				    "

				elif [ $DEBIAN_ARCH = amd64 ]; then

				    ARCH_PACKAGES="firmware-amd-graphics

				                   inetutils-syslogd

				                   iptables

				                   libcap2

				                   libelf1

				                   libfdt1

				                   libllvm11

				                   libva2

				                   libva-drm2

				                   socat

				                   spirv-tools

				                   sysvinit-core

				                  "

				fi

				@@ -21,6 +36,8 @@ INSTALL_CI_FAIRY_PACKAGES="git

				apt-get -y install --no-install-recommends \

				    $ARCH_PACKAGES \

				    $INSTALL_CI_FAIRY_PACKAGES \

				    $EXTRA_LOCAL_PACKAGES \

				    bash \

				    ca-certificates \

				    firmware-realtek \

				    initramfs-tools \

				@@ -64,12 +81,11 @@ apt-get -y install --no-install-recommends \

				    waffle-utils \

				    wget \

				    xinit \

				    xserver-xorg-core \

				    xz-utils

				    xserver-xorg-core

				# Needed for ci-fairy, this revision is able to upload files to

				# MinIO and doesn't depend on git

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@0f1abc24c043e63894085a6bd12f14263e8b29eb

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@34f4ade99434043f88e164933f570301fd18b125

				apt-get purge -y \

				        $INSTALL_CI_FAIRY_PACKAGES

				@@ -88,10 +104,6 @@ chmod +x  /init

				# Strip the image to a small minimal system without removing the debian

				# toolchain.

				# xz compress firmware so it doesn't waste RAM at runtime on ramdisk systems

				find /lib/firmware -type f -print0 | \

				    xargs -0r -P4 -n4 xz -T1 -C crc32

				# Copy timezone file and remove tzdata package

				rm -rf /etc/localtime

				cp /usr/share/zoneinfo/Etc/UTC /etc/localtime

				@@ -144,6 +156,8 @@ rm -rf usr/sbin/update-usbids

				rm -rf var/lib/usbutils/usb.ids

				rm -rf usr/share/misc/usb.ids

				rm -rf /root/.pip

				#######################################################################

				# Crush into a minimal production image to be deployed via some type of image

				# updating system.

				@@ -158,9 +172,7 @@ UNNEEDED_PACKAGES="apt libapt-pkg6.0 "\

				"insserv "\

				"udev "\

				"init-system-helpers "\

				"bash "\

				"cpio "\

				"xz-utils "\

				"passwd "\

				"libsemanage1 libsemanage-common "\

				"libsepol1 "\

				@@ -205,7 +217,7 @@ rm -rf var/* opt srv share

				# ca-certificates are in /etc drop the source

				rm -rf usr/share/ca-certificates

				# No bash, no need for completions

				# No need for completions

				rm -rf usr/share/bash-completion

				# No zsh, no need for comletions

									
										50

.gitlab-ci/container/debian/android_build.sh
									
												View File
												
				@@ -3,6 +3,7 @@

				set -ex

				EPHEMERAL="\

				         autoconf \

				         rdfind \

				         unzip \

				         "

				@@ -29,7 +30,7 @@ sh .gitlab-ci/container/create-android-cross-file.sh /$ndk arm-linux-androideabi

				# Not using build-libdrm.sh because we don't want its cleanup after building

				# each arch.  Fetch and extract now.

				export LIBDRM_VERSION=libdrm-2.4.102

				export LIBDRM_VERSION=libdrm-2.4.110

				wget https://dri.freedesktop.org/libdrm/$LIBDRM_VERSION.tar.xz

				tar -xf $LIBDRM_VERSION.tar.xz && rm $LIBDRM_VERSION.tar.xz

				@@ -50,11 +51,56 @@ for arch in \

				          -Detnaviv=false \

				          -Dfreedreno=false \

				          -Dintel=false \

				          -Dcairo-tests=false

				          -Dcairo-tests=false \

				          -Dvalgrind=false

				    ninja -C build-$arch install

				    cd ..

				done

				rm -rf $LIBDRM_VERSION

				export LIBELF_VERSION=libelf-0.8.13

				wget https://fossies.org/linux/misc/old/$LIBELF_VERSION.tar.gz

				# Not 100% sure who runs the mirror above so be extra careful

				if ! echo "4136d7b4c04df68b686570afa26988ac ${LIBELF_VERSION}.tar.gz" | md5sum -c -; then

				    echo "Checksum failed"

				    exit 1

				fi

				tar -xf ${LIBELF_VERSION}.tar.gz

				cd $LIBELF_VERSION

				# Work around a bug in the original configure not enabling __LIBELF64.

				autoreconf

				for arch in \

				        x86_64-linux-android \

				        i686-linux-android \

				        aarch64-linux-android \

				        arm-linux-androideabi ; do

				    ccarch=${arch}

				    if [ "${arch}" ==  'arm-linux-androideabi' ]

				    then

				       ccarch=armv7a-linux-androideabi

				    fi

				    export CC=/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/bin/${arch}-ar

				    export CC=/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/bin/${ccarch}29-clang

				    export CXX=/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/bin/${ccarch}29-clang++

				    export LD=/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/bin/${arch}-ld

				    export RANLIB=/android-ndk-r21d/toolchains/llvm/prebuilt/linux-x86_64/bin/${arch}-ranlib

				    # The configure script doesn't know about android, but doesn't really use the host anyway it

				    # seems

				    ./configure --host=x86_64-linux-gnu  --disable-nls --disable-shared \

				                --libdir=/usr/local/lib/${arch}

				    make install

				    make distclean

				done

				cd ..

				rm -rf $LIBELF_VERSION

				apt-get purge -y $EPHEMERAL

									
										3

.gitlab-ci/container/debian/arm_build.sh
									
												View File
												
				@@ -9,6 +9,7 @@ echo 'deb https://deb.debian.org/debian buster main' >/etc/apt/sources.list.d/bu

				apt-get update

				apt-get -y install \

					${EXTRA_LOCAL_PACKAGES} \

					abootimg \

					autoconf \

					automake \

				@@ -57,7 +58,7 @@ apt-get -y install \

				apt-get install -y --no-remove -t buster \

				        android-sdk-ext4-utils

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@6f5af7e5574509726c79109e3c147cee95e81366

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@34f4ade99434043f88e164933f570301fd18b125

				arch=armhf

				. .gitlab-ci/container/cross_build.sh

									
										6

.gitlab-ci/container/debian/arm_test.sh
									
												View File
												
				@@ -31,3 +31,9 @@ arch=armhf . .gitlab-ci/container/baremetal_build.sh

				# This firmware file from Debian bullseye causes hangs

				wget https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/plain/qcom/a530_pfp.fw?id=d5f9eea5a251d43412b07f5295d03e97b89ac4a5 \

				     -O /rootfs-arm64/lib/firmware/qcom/a530_pfp.fw

				mkdir -p /baremetal-files/jetson-nano/boot/

				ln -s \

				    /baremetal-files/Image \

				    /baremetal-files/tegra210-p3450-0000.dtb \

				    /baremetal-files/jetson-nano/boot/

									
										3

.gitlab-ci/container/debian/x86_build-base.sh
									
												View File
												
				@@ -63,7 +63,6 @@ apt-get install -y --no-remove \

				        python3-requests \

				        qemu-user \

				        valgrind \

				        wayland-protocols \

				        wget \

				        wine64 \

				        x11proto-dri2-dev \

				@@ -73,7 +72,7 @@ apt-get install -y --no-remove \

				        zlib1g-dev

				# Needed for ci-fairy, this revision is able to upload files to MinIO

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@6f5af7e5574509726c79109e3c147cee95e81366

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@34f4ade99434043f88e164933f570301fd18b125

				############### Uninstall ephemeral packages

									
										15

.gitlab-ci/container/debian/x86_build.sh
									
												View File
												
				@@ -11,8 +11,6 @@ STABLE_EPHEMERAL=" \

				      automake \

				      autotools-dev \

				      bzip2 \

				      cmake \

				      libgbm-dev \

				      libtool \

				      python3-pip \

				      "

				@@ -23,10 +21,13 @@ apt-get update

				apt-get install -y --no-remove \

				      $STABLE_EPHEMERAL \

				      check \

				      clang \

				      cmake \

				      libasan6 \

				      libarchive-dev \

				      libclang-cpp11-dev \

				      libgbm-dev \

				      libglvnd-dev \

				      libllvmspirvlib-dev \

				      liblua5.3-dev \

				@@ -43,6 +44,8 @@ apt-get install -y --no-remove \

				      llvm-11-dev \

				      llvm-9-dev \

				      ocl-icd-opencl-dev \

				      python3-freezegun \

				      python3-pytest \

				      procps \

				      spirv-tools \

				      strace \

				@@ -67,10 +70,8 @@ chmod +x /usr/local/bin/x86_64-w64-mingw32-pkg-config

				# dependencies where we want a specific version

				export              XORG_RELEASES=https://xorg.freedesktop.org/releases/individual

				export           WAYLAND_RELEASES=https://wayland.freedesktop.org/releases

				export         XORGMACROS_VERSION=util-macros-1.19.0

				export         LIBWAYLAND_VERSION=wayland-1.18.0

				wget $XORG_RELEASES/util/$XORGMACROS_VERSION.tar.bz2

				tar -xvf $XORGMACROS_VERSION.tar.bz2 && rm $XORGMACROS_VERSION.tar.bz2

				@@ -79,11 +80,7 @@ rm -rf $XORGMACROS_VERSION

				. .gitlab-ci/container/build-libdrm.sh

				wget $WAYLAND_RELEASES/$LIBWAYLAND_VERSION.tar.xz

				tar -xvf $LIBWAYLAND_VERSION.tar.xz && rm $LIBWAYLAND_VERSION.tar.xz

				cd $LIBWAYLAND_VERSION; ./configure --enable-libraries --without-host-scanner --disable-documentation --disable-dtd-validation; make install; cd ..

				rm -rf $LIBWAYLAND_VERSION

				. .gitlab-ci/container/build-wayland.sh

				pushd /usr/local

				git clone https://gitlab.freedesktop.org/mesa/shader-db.git --depth 1

									
										2

.gitlab-ci/container/debian/x86_test-base.sh
									
												View File
												
				@@ -59,7 +59,7 @@ apt-get install -y --no-install-recommends \

				# Needed for ci-fairy, this revision is able to upload files to MinIO

				# and doesn't depend on git

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@0f1abc24c043e63894085a6bd12f14263e8b29eb

				pip3 install git+http://gitlab.freedesktop.org/freedesktop/ci-templates@34f4ade99434043f88e164933f570301fd18b125

				############### Build dEQP runner

				. .gitlab-ci/container/build-deqp-runner.sh

									
										39

.gitlab-ci/container/debian/x86_test-gl.sh
									
												View File
												
				@@ -22,6 +22,7 @@ STABLE_EPHEMERAL=" \

				      libcap-dev \

				      libclang-cpp11-dev \

				      libelf-dev \

				      libexpat1-dev \

				      libfdt-dev \

				      libgbm-dev \

				      libgles2-mesa-dev \

				@@ -31,7 +32,6 @@ STABLE_EPHEMERAL=" \

				      libudev-dev \

				      libvulkan-dev \

				      libwaffle-dev \

				      libwayland-dev \

				      libx11-xcb-dev \

				      libxcb-dri2-0-dev \

				      libxext-dev \

				@@ -45,20 +45,18 @@ STABLE_EPHEMERAL=" \

				      patch \

				      pkg-config \

				      python3-distutils \

				      wayland-protocols \

				      wget \

				      xz-utils \

				      "

				apt-get install -y --no-remove \

				      $STABLE_EPHEMERAL \

				      clinfo \

				      inetutils-syslogd \

				      iptables \

				      libclang-common-11-dev \

				      libclang-cpp11 \

				      libcap2 \

				      libegl1 \

				      libepoxy-dev \

				      libfdt1 \

				      libllvmspirvlib11 \

				      libxcb-shm0 \

				@@ -66,12 +64,29 @@ apt-get install -y --no-remove \

				      python3-lxml \

				      python3-renderdoc \

				      python3-simplejson \

				      socat \

				      spirv-tools \

				      sysvinit-core

				      sysvinit-core \

				      wget

				. .gitlab-ci/container/container_pre_build.sh

				############### Build libdrm

				. .gitlab-ci/container/build-libdrm.sh

				############### Build Wayland

				. .gitlab-ci/container/build-wayland.sh

				############### Build Crosvm

				. .gitlab-ci/container/build-rust.sh

				. .gitlab-ci/container/build-crosvm.sh

				rm -rf /root/.cargo

				rm -rf /root/.rustup

				############### Build kernel

				export DEFCONFIG="arch/x86/configs/x86_64_defconfig"

				@@ -82,28 +97,14 @@ export DEBIAN_ARCH=amd64

				mkdir -p /lava-files/

				. .gitlab-ci/container/build-kernel.sh

				############### Build libdrm

				. .gitlab-ci/container/build-libdrm.sh

				############### Build libclc

				. .gitlab-ci/container/build-libclc.sh

				############### Build virglrenderer

				. .gitlab-ci/container/build-virglrenderer.sh

				############### Build piglit

				PIGLIT_OPTS="-DPIGLIT_BUILD_CL_TESTS=ON -DPIGLIT_BUILD_DMA_BUF_TESTS=ON" . .gitlab-ci/container/build-piglit.sh

				############### Build Crosvm

				. .gitlab-ci/container/build-rust.sh

				. .gitlab-ci/container/build-crosvm.sh

				rm -rf /root/.cargo

				############### Build dEQP GL

				DEQP_TARGET=surfaceless . .gitlab-ci/container/build-deqp.sh

									
										6

.gitlab-ci/container/debian/x86_test-vk.sh
									
												View File
												
				@@ -13,6 +13,7 @@ STABLE_EPHEMERAL=" \

				      g++-mingw-w64-i686-posix \

				      g++-mingw-w64-x86-64-posix \

				      glslang-tools \

				      libexpat1-dev \

				      libgbm-dev \

				      libgles2-mesa-dev \

				      liblz4-dev \

				@@ -20,7 +21,6 @@ STABLE_EPHEMERAL=" \

				      libudev-dev \

				      libvulkan-dev \

				      libwaffle-dev \

				      libwayland-dev \

				      libx11-xcb-dev \

				      libxcb-ewmh-dev \

				      libxcb-keysyms1-dev \

				@@ -124,6 +124,10 @@ wine \

				. .gitlab-ci/container/build-libdrm.sh

				############### Build Wayland

				. .gitlab-ci/container/build-wayland.sh

				############### Build parallel-deqp-runner's hang-detection tool

				. .gitlab-ci/container/build-hang-detection.sh

									
										14

.gitlab-ci/container/fedora/x86_build.sh
									
												View File
												
				@@ -27,6 +27,7 @@ dnf install -y --setopt=install_weak_deps=False \

				    gettext \

				    kernel-headers \

				    llvm-devel \

				    clang-devel \

				    meson \

				    "pkgconfig(dri2proto)" \

				    "pkgconfig(expat)" \

				@@ -40,9 +41,6 @@ dnf install -y --setopt=install_weak_deps=False \

				    "pkgconfig(pciaccess)" \

				    "pkgconfig(vdpau)" \

				    "pkgconfig(vulkan)" \

				    "pkgconfig(wayland-egl-backend)" \

				    "pkgconfig(wayland-protocols)" \

				    "pkgconfig(wayland-scanner)" \

				    "pkgconfig(x11)" \

				    "pkgconfig(x11-xcb)" \

				    "pkgconfig(xcb)" \

				@@ -66,6 +64,8 @@ dnf install -y --setopt=install_weak_deps=False \

				    python3-devel \

				    python3-mako \

				    vulkan-headers \

				    spirv-tools-devel \

				    spirv-llvm-translator-devel \

				    $EPHEMERAL

				@@ -74,10 +74,8 @@ dnf install -y --setopt=install_weak_deps=False \

				# dependencies where we want a specific version

				export              XORG_RELEASES=https://xorg.freedesktop.org/releases/individual

				export           WAYLAND_RELEASES=https://wayland.freedesktop.org/releases

				export         XORGMACROS_VERSION=util-macros-1.19.0

				export         LIBWAYLAND_VERSION=wayland-1.18.0

				wget $XORG_RELEASES/util/$XORGMACROS_VERSION.tar.bz2

				tar -xvf $XORGMACROS_VERSION.tar.bz2 && rm $XORGMACROS_VERSION.tar.bz2

				@@ -86,11 +84,7 @@ rm -rf $XORGMACROS_VERSION

				. .gitlab-ci/container/build-libdrm.sh

				wget $WAYLAND_RELEASES/$LIBWAYLAND_VERSION.tar.xz

				tar -xvf $LIBWAYLAND_VERSION.tar.xz && rm $LIBWAYLAND_VERSION.tar.xz

				cd $LIBWAYLAND_VERSION; ./configure --enable-libraries --without-host-scanner --disable-documentation --disable-dtd-validation; make install; cd ..

				rm -rf $LIBWAYLAND_VERSION

				. .gitlab-ci/container/build-wayland.sh

				pushd /usr/local

				git clone https://gitlab.freedesktop.org/mesa/shader-db.git --depth 1

									
										414

.gitlab-ci/container/gitlab-ci.yml
									
										Normal file
									
												View File
												
				@@ -0,0 +1,414 @@

				# Docker image tag helper templates

				.incorporate-templates-commit:

				  variables:

				    FDO_DISTRIBUTION_TAG: "${MESA_IMAGE_TAG}--${MESA_TEMPLATES_COMMIT}"

				.incorporate-base-tag+templates-commit:

				  variables:

				    FDO_BASE_IMAGE: "${CI_REGISTRY_IMAGE}/${MESA_BASE_IMAGE}:${MESA_BASE_TAG}--${MESA_TEMPLATES_COMMIT}"

				    FDO_DISTRIBUTION_TAG: "${MESA_IMAGE_TAG}--${MESA_BASE_TAG}--${MESA_TEMPLATES_COMMIT}"

				.set-image:

				  extends:

				    - .incorporate-templates-commit

				  variables:

				    MESA_IMAGE: "$CI_REGISTRY_IMAGE/${MESA_IMAGE_PATH}:${FDO_DISTRIBUTION_TAG}"

				  image: "$MESA_IMAGE"

				.set-image-base-tag:

				  extends:

				    - .set-image

				    - .incorporate-base-tag+templates-commit

				  variables:

				    MESA_IMAGE: "$CI_REGISTRY_IMAGE/${MESA_IMAGE_PATH}:${FDO_DISTRIBUTION_TAG}"

				# Build the CI docker images.

				#

				# MESA_IMAGE_TAG is the tag of the docker image used by later stage jobs. If the

				# image doesn't exist yet, the container stage job generates it.

				#

				# In order to generate a new image, one should generally change the tag.

				# While removing the image from the registry would also work, that's not

				# recommended except for ephemeral images during development: Replacing

				# an image after a significant amount of time might pull in newer

				# versions of gcc/clang or other packages, which might break the build

				# with older commits using the same tag.

				#

				# After merging a change resulting in generating a new image to the

				# main repository, it's recommended to remove the image from the source

				# repository's container registry, so that the image from the main

				# repository's registry will be used there as well.

				.container:

				  stage: container

				  extends:

				    - .container-rules

				    - .incorporate-templates-commit

				  variables:

				    FDO_DISTRIBUTION_VERSION: bullseye-slim

				    FDO_REPO_SUFFIX: $CI_JOB_NAME

				    FDO_DISTRIBUTION_EXEC: 'env FDO_CI_CONCURRENT=${FDO_CI_CONCURRENT} bash .gitlab-ci/container/${CI_JOB_NAME}.sh'

				    # no need to pull the whole repo to build the container image

				    GIT_STRATEGY: none

				.use-base-image:

				  extends:

				    - .container

				    - .incorporate-base-tag+templates-commit

				    # Don't want the .container rules

				    - .ci-run-policy

				# Debian 11 based x86 build image base

				debian/x86_build-base:

				  extends:

				    - .fdo.container-build@debian

				    - .container

				  variables:

				    MESA_IMAGE_TAG: &debian-x86_build-base ${DEBIAN_BASE_TAG}

				.use-debian/x86_build-base:

				  extends:

				    - .fdo.container-build@debian

				    - .use-base-image

				  variables:

				    MESA_BASE_IMAGE: ${DEBIAN_X86_BUILD_BASE_IMAGE}

				    MESA_BASE_TAG: *debian-x86_build-base

				    MESA_ARTIFACTS_BASE_TAG: *debian-x86_build-base

				  needs:

				    - debian/x86_build-base

				# Debian 11 based x86 main build image

				debian/x86_build:

				  extends:

				    - .use-debian/x86_build-base

				  variables:

				    MESA_IMAGE_TAG: &debian-x86_build ${DEBIAN_BUILD_TAG}

				.use-debian/x86_build:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_build-base

				    MESA_IMAGE_PATH: ${DEBIAN_X86_BUILD_IMAGE_PATH}

				    MESA_IMAGE_TAG: *debian-x86_build

				  needs:

				    - debian/x86_build

				# Debian 11 based i386 cross-build image

				debian/i386_build:

				  extends:

				    - .use-debian/x86_build-base

				  variables:

				    MESA_IMAGE_TAG: &debian-i386_build ${DEBIAN_BUILD_TAG}

				.use-debian/i386_build:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_build-base

				    MESA_IMAGE_PATH: "debian/i386_build"

				    MESA_IMAGE_TAG: *debian-i386_build

				  needs:

				    - debian/i386_build

				# Debian 11 based ppc64el cross-build image

				debian/ppc64el_build:

				  extends:

				    - .use-debian/x86_build-base

				  variables:

				    MESA_IMAGE_TAG: &debian-ppc64el_build ${DEBIAN_BUILD_TAG}

				.use-debian/ppc64el_build:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_build-base

				    MESA_IMAGE_PATH: "debian/ppc64el_build"

				    MESA_IMAGE_TAG: *debian-ppc64el_build

				  needs:

				    - debian/ppc64el_build

				# Debian 11 based s390x cross-build image

				debian/s390x_build:

				  extends:

				    - .use-debian/x86_build-base

				  variables:

				    MESA_IMAGE_TAG: &debian-s390x_build ${DEBIAN_BUILD_TAG}

				.use-debian/s390x_build:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_build-base

				    MESA_IMAGE_PATH: "debian/s390x_build"

				    MESA_IMAGE_TAG: *debian-s390x_build

				  needs:

				    - debian/s390x_build

				# Android NDK cross-build image

				debian/android_build:

				  extends:

				    - .use-debian/x86_build-base

				  variables:

				    MESA_IMAGE_TAG: &debian-android_build ${DEBIAN_BUILD_TAG}

				.use-debian/android_build:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_build-base

				    MESA_IMAGE_PATH: "debian/android_build"

				    MESA_IMAGE_TAG: *debian-android_build

				  needs:

				    - debian/android_build

				# Debian 11 based x86 test image base

				debian/x86_test-base:

				  extends: debian/x86_build-base

				  variables:

				    MESA_IMAGE_TAG: &debian-x86_test-base ${DEBIAN_BASE_TAG}

				.use-debian/x86_test-base:

				  extends:

				    - .fdo.container-build@debian

				    - .use-base-image

				  variables:

				    MESA_BASE_IMAGE: ${DEBIAN_X86_TEST_BASE_IMAGE}

				    MESA_BASE_TAG: *debian-x86_test-base

				  needs:

				    - debian/x86_test-base

				# Debian 11 based x86 test image for GL

				debian/x86_test-gl:

				  extends: .use-debian/x86_test-base

				  variables:

				    FDO_DISTRIBUTION_EXEC: 'env KERNEL_URL=${KERNEL_URL} FDO_CI_CONCURRENT=${FDO_CI_CONCURRENT} bash .gitlab-ci/container/${CI_JOB_NAME}.sh'

				    KERNEL_URL: &kernel-rootfs-url "https://gitlab.freedesktop.org/gfx-ci/linux/-/archive/v5.16-for-mesa-ci-991fec6622591/linux-v5.16-for-mesa-ci-991fec6622591.tar.bz2"

				    MESA_IMAGE_TAG: &debian-x86_test-gl ${DEBIAN_X86_TEST_GL_TAG}

				.use-debian/x86_test-gl:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_test-base

				    MESA_IMAGE_PATH: ${DEBIAN_X86_TEST_IMAGE_PATH}

				    MESA_IMAGE_TAG: *debian-x86_test-gl

				  needs:

				    - debian/x86_test-gl

				# Debian 11 based x86 test image for VK

				debian/x86_test-vk:

				  extends: .use-debian/x86_test-base

				  variables:

				    MESA_IMAGE_TAG: &debian-x86_test-vk ${DEBIAN_X86_TEST_VK_TAG}

				.use-debian/x86_test-vk:

				  extends:

				    - .set-image-base-tag

				  variables:

				    MESA_BASE_TAG: *debian-x86_test-base

				    MESA_IMAGE_PATH: "debian/x86_test-vk"

				    MESA_IMAGE_TAG: *debian-x86_test-vk

				  needs:

				    - debian/x86_test-vk

				# Debian 11 based ARM build image

				debian/arm_build:

				  extends:

				    - .fdo.container-build@debian

				    - .container

				  tags:

				    - aarch64

				  variables:

				    MESA_IMAGE_TAG: &debian-arm_build ${DEBIAN_BASE_TAG}

				.use-debian/arm_build:

				  extends:

				    - .set-image

				  variables:

				    MESA_IMAGE_PATH: "debian/arm_build"

				    MESA_IMAGE_TAG: *debian-arm_build

				    MESA_ARTIFACTS_TAG: *debian-arm_build

				  needs:

				    - debian/arm_build

				# Fedora 34 based x86 build image

				fedora/x86_build:

				  extends:

				    - .fdo.container-build@fedora

				    - .container

				  variables:

				    FDO_DISTRIBUTION_VERSION: 34

				    MESA_IMAGE_TAG: &fedora-x86_build ${FEDORA_X86_BUILD_TAG}

				.use-fedora/x86_build:

				  extends:

				    - .set-image

				  variables:

				    MESA_IMAGE_PATH: "fedora/x86_build"

				    MESA_IMAGE_TAG: *fedora-x86_build

				  needs:

				    - fedora/x86_build

				.kernel+rootfs:

				  extends:

				    - .ci-run-policy

				  stage: container

				  variables:

				    GIT_STRATEGY: fetch

				    KERNEL_URL: *kernel-rootfs-url

				    MESA_ROOTFS_TAG: &kernel-rootfs ${KERNEL_ROOTFS_TAG}

				    DISTRIBUTION_TAG: &distribution-tag-arm "${MESA_ROOTFS_TAG}--${MESA_ARTIFACTS_TAG}--${MESA_TEMPLATES_COMMIT}"

				  script:

				    - .gitlab-ci/container/lava_build.sh

				kernel+rootfs_amd64:

				  extends:

				    - .use-debian/x86_build-base

				    - .kernel+rootfs

				  image: "$FDO_BASE_IMAGE"

				  variables:

				    DEBIAN_ARCH: "amd64"

				    DISTRIBUTION_TAG: &distribution-tag-amd64 "${MESA_ROOTFS_TAG}--${MESA_ARTIFACTS_BASE_TAG}--${MESA_TEMPLATES_COMMIT}"

				kernel+rootfs_arm64:

				  extends:

				    - .use-debian/arm_build

				    - .kernel+rootfs

				  tags:

				    - aarch64

				  variables:

				    DEBIAN_ARCH: "arm64"

				kernel+rootfs_armhf:

				  extends:

				    - kernel+rootfs_arm64

				  variables:

				    DEBIAN_ARCH: "armhf"

				# Cannot use anchors defined here from included files, so use extends: instead

				.use-kernel+rootfs-arm:

				  variables:

				    DISTRIBUTION_TAG: *distribution-tag-arm

				    MESA_ROOTFS_TAG: *kernel-rootfs

				.use-kernel+rootfs-amd64:

				  variables:

				    DISTRIBUTION_TAG: *distribution-tag-amd64

				    MESA_ROOTFS_TAG: *kernel-rootfs

				# x86 image with ARM64 & armhf kernel & rootfs for baremetal testing

				debian/arm_test:

				  extends:

				    - .fdo.container-build@debian

				    - .container

				    # Don't want the .container rules

				    - .ci-run-policy

				  needs:

				    - kernel+rootfs_arm64

				    - kernel+rootfs_armhf

				  variables:

				    FDO_DISTRIBUTION_EXEC: 'env ARTIFACTS_PREFIX=https://${MINIO_HOST}/mesa-lava ARTIFACTS_SUFFIX=${MESA_ROOTFS_TAG}--${MESA_ARM_BUILD_TAG}--${MESA_TEMPLATES_COMMIT} CI_PROJECT_PATH=${CI_PROJECT_PATH} FDO_CI_CONCURRENT=${FDO_CI_CONCURRENT} FDO_UPSTREAM_REPO=${FDO_UPSTREAM_REPO} bash .gitlab-ci/container/${CI_JOB_NAME}.sh'

				    FDO_DISTRIBUTION_TAG: "${MESA_IMAGE_TAG}--${MESA_ROOTFS_TAG}--${MESA_ARM_BUILD_TAG}--${MESA_TEMPLATES_COMMIT}"

				    MESA_ARM_BUILD_TAG: *debian-arm_build

				    MESA_IMAGE_TAG: &debian-arm_test ${DEBIAN_BASE_TAG}

				    MESA_ROOTFS_TAG: *kernel-rootfs

				.use-debian/arm_test:

				  image: "$CI_REGISTRY_IMAGE/${MESA_IMAGE_PATH}:${MESA_IMAGE_TAG}--${MESA_ROOTFS_TAG}--${MESA_ARM_BUILD_TAG}--${MESA_TEMPLATES_COMMIT}"

				  variables:

				    MESA_ARM_BUILD_TAG: *debian-arm_build

				    MESA_IMAGE_PATH: "debian/arm_test"

				    MESA_IMAGE_TAG: *debian-arm_test

				    MESA_ROOTFS_TAG: *kernel-rootfs

				  needs:

				    - debian/arm_test

				# Native Windows docker builds

				#

				# Unlike the above Linux-based builds - including MinGW builds which

				# cross-compile for Windows - which use the freedesktop ci-templates, we

				# cannot use the same scheme here. As Windows lacks support for

				# Docker-in-Docker, and Podman does not run natively on Windows, we have

				# to open-code much of the same ourselves.

				#

				# This is achieved by first running in a native Windows shell instance

				# (host PowerShell) in the container stage to build and push the image,

				# then in the build stage by executing inside Docker.

				.windows-docker-vs2019:

				  variables:

				    MESA_IMAGE: "$CI_REGISTRY_IMAGE/${MESA_IMAGE_PATH}:${MESA_IMAGE_TAG}"

				    MESA_UPSTREAM_IMAGE: "$CI_REGISTRY/$FDO_UPSTREAM_REPO/$MESA_IMAGE_PATH:${MESA_IMAGE_TAG}"

				.windows_container_build:

				  inherit:

				    default: false

				  extends:

				    - .container

				    - .windows-docker-vs2019

				  variables:

				    GIT_STRATEGY: fetch # we do actually need the full repository though

				    MESA_BASE_IMAGE: None

				  tags:

				    - windows

				    - shell

				    - "1809"

				    - mesa

				  script:

				    - .\.gitlab-ci\windows\mesa_container.ps1 $CI_REGISTRY $CI_REGISTRY_USER $CI_REGISTRY_PASSWORD $MESA_IMAGE $MESA_UPSTREAM_IMAGE ${DOCKERFILE} ${MESA_BASE_IMAGE}

				windows_build_vs2019:

				  inherit:

				    default: false

				  extends:

				    - .windows_container_build

				  variables:

				    MESA_IMAGE_PATH: &windows_build_image_path ${WINDOWS_X64_BUILD_PATH}

				    MESA_IMAGE_TAG: &windows_build_image_tag ${WINDOWS_X64_BUILD_TAG}

				    DOCKERFILE: Dockerfile_build

				  timeout: 2h 30m # LLVM takes ages

				windows_test_vs2019:

				  inherit:

				    default: false

				  extends:

				    - .windows_container_build

				    # Don't want the .container rules

				    - .ci-run-policy

				  variables:

				    MESA_IMAGE_PATH: &windows_test_image_path ${WINDOWS_X64_TEST_PATH}

				    MESA_IMAGE_TAG: &windows_test_image_tag ${WINDOWS_X64_BUILD_TAG}--${WINDOWS_X64_TEST_TAG}

				    DOCKERFILE: Dockerfile_test

				    # Right now this only needs the VS install to get DXIL.dll. Maybe see about decoupling this at some point

				    MESA_BASE_IMAGE_PATH: *windows_build_image_path

				    MESA_BASE_IMAGE_TAG: *windows_build_image_tag

				    MESA_BASE_IMAGE: "$CI_REGISTRY_IMAGE/${MESA_BASE_IMAGE_PATH}:${MESA_BASE_IMAGE_TAG}"

				  script:

				    - .\.gitlab-ci\windows\mesa_container.ps1 $CI_REGISTRY $CI_REGISTRY_USER $CI_REGISTRY_PASSWORD $MESA_IMAGE $MESA_UPSTREAM_IMAGE Dockerfile_test ${MESA_BASE_IMAGE}

				  needs:

				    - windows_build_vs2019

				.use-windows_build_vs2019:

				  inherit:

				    default: false

				  extends: .windows-docker-vs2019

				  image: "$MESA_IMAGE"

				  variables:

				    MESA_IMAGE_PATH: *windows_build_image_path

				    MESA_IMAGE_TAG: *windows_build_image_tag

				  needs:

				    - windows_build_vs2019

				.use-windows_test_vs2019:

				  inherit:

				    default: false

				  extends: .windows-docker-vs2019

				  image: "$MESA_IMAGE"

				  variables:

				    MESA_IMAGE_PATH: *windows_test_image_path

				    MESA_IMAGE_TAG: *windows_test_image_tag

									
										52

.gitlab-ci/container/lava_build.sh
									
												View File
												
				@@ -34,6 +34,8 @@ if [[ "$DEBIAN_ARCH" = "arm64" ]]; then

				    DEVICE_TREES+=" arch/arm64/boot/dts/qcom/apq8096-db820c.dtb"

				    DEVICE_TREES+=" arch/arm64/boot/dts/amlogic/meson-g12b-a311d-khadas-vim3.dtb"

				    DEVICE_TREES+=" arch/arm64/boot/dts/mediatek/mt8183-kukui-jacuzzi-juniper-sku16.dtb"

				    DEVICE_TREES+=" arch/arm64/boot/dts/nvidia/tegra210-p3450-0000.dtb"

				    DEVICE_TREES+=" arch/arm64/boot/dts/qcom/sc7180-trogdor-lazor-limozeen-nots.dtb"

				    KERNEL_IMAGE_NAME="Image"

				elif [[ "$DEBIAN_ARCH" = "armhf" ]]; then

				    GCC_ARCH="arm-linux-gnueabihf"

				@@ -50,6 +52,7 @@ else

				    DEFCONFIG="arch/x86/configs/x86_64_defconfig"

				    DEVICE_TREES=""

				    KERNEL_IMAGE_NAME="bzImage"

				    ARCH_PACKAGES="libasound2-dev libcap-dev libfdt-dev libva-dev wayland-protocols"

				fi

				# Determine if we're in a cross build.

				@@ -71,16 +74,23 @@ fi

				apt-get update

				apt-get install -y --no-remove \

				                   ${ARCH_PACKAGES} \

				                   automake \

				                   bc \

				                   clang \

				                   cmake \

				                   debootstrap \

				                   git \

				                   glslang-tools \

				                   libdrm-dev \

				                   libegl1-mesa-dev \

				                   libxext-dev \

				                   libfontconfig-dev \

				                   libgbm-dev \

				                   libgl-dev \

				                   libgles2-mesa-dev \

				                   libglu1-mesa-dev \

				                   libglx-dev \

				                   libpng-dev \

				                   libssl-dev \

				                   libudev-dev \

				@@ -90,11 +100,14 @@ apt-get install -y --no-remove \

				                   libx11-xcb-dev \

				                   libxcb-dri2-0-dev \

				                   libxkbcommon-dev \

				                   ninja-build \

				                   patch \

				                   python-is-python3 \

				                   python3-distutils \

				                   python3-mako \

				                   python3-numpy \

				                   python3-serial \

				                   unzip \

				                   wget

				@@ -116,7 +129,7 @@ fi

				############### Building

				STRIP_CMD="${GCC_ARCH}-strip"

				mkdir -p /lava-files/rootfs-${DEBIAN_ARCH}

				mkdir -p /lava-files/rootfs-${DEBIAN_ARCH}/usr/lib/$GCC_ARCH

				############### Build apitrace

				@@ -129,8 +142,7 @@ rm -rf /apitrace

				############### Build dEQP runner

				. .gitlab-ci/container/build-deqp-runner.sh

				mkdir -p /lava-files/rootfs-${DEBIAN_ARCH}/usr/bin

				mv /usr/local/bin/deqp-runner /lava-files/rootfs-${DEBIAN_ARCH}/usr/bin/.

				mv /usr/local/bin/piglit-runner /lava-files/rootfs-${DEBIAN_ARCH}/usr/bin/.

				mv /usr/local/bin/*-runner /lava-files/rootfs-${DEBIAN_ARCH}/usr/bin/.

				############### Build dEQP

				@@ -139,20 +151,49 @@ DEQP_TARGET=surfaceless . .gitlab-ci/container/build-deqp.sh

				mv /deqp /lava-files/rootfs-${DEBIAN_ARCH}/.

				############### Build SKQP

				if [[ "$DEBIAN_ARCH" = "arm64" ]]; then

				    SKQP_ARCH="arm64" . .gitlab-ci/container/build-skqp.sh

				    mv /skqp /lava-files/rootfs-${DEBIAN_ARCH}/.

				fi

				############### Build piglit

				PIGLIT_OPTS="-DPIGLIT_BUILD_DMA_BUF_TESTS=ON" . .gitlab-ci/container/build-piglit.sh

				mv /piglit /lava-files/rootfs-${DEBIAN_ARCH}/.

				############### Build libva tests

				if [[ "$DEBIAN_ARCH" = "amd64" ]]; then

				    . .gitlab-ci/container/build-va-tools.sh

				    mv /va/bin/* /lava-files/rootfs-${DEBIAN_ARCH}/usr/bin/

				fi

				############### Build Crosvm

				if [[ ${DEBIAN_ARCH} = "amd64" ]]; then

				    . .gitlab-ci/container/build-crosvm.sh

				    mv /usr/local/bin/crosvm /lava-files/rootfs-${DEBIAN_ARCH}/usr/bin/

				    mv /usr/local/lib/$GCC_ARCH/libvirglrenderer.* /lava-files/rootfs-${DEBIAN_ARCH}/usr/lib/$GCC_ARCH/

				fi

				############### Build libdrm

				EXTRA_MESON_ARGS+=" -D prefix=/libdrm"

				. .gitlab-ci/container/build-libdrm.sh

				############### Build local stuff for use by igt and kernel testing, which

				############### will reuse most of our container build process from a specific

				############### hash of the Mesa tree.

				if [[ -e ".gitlab-ci/local/build-rootfs.sh" ]]; then

				    . .gitlab-ci/local/build-rootfs.sh

				fi

				############### Build kernel

				. .gitlab-ci/container/build-kernel.sh

				############### Delete rust, since the tests won't be compiling anything.

				rm -rf /root/.cargo

				rm -rf /root/.rustup

				############### Create rootfs

				set +e

				@@ -177,8 +218,9 @@ rm /lava-files/rootfs-${DEBIAN_ARCH}/create-rootfs.sh

				# Dependencies pulled during the creation of the rootfs may overwrite

				# the built libdrm. Hence, we add it after the rootfs has been already

				# created.

				mkdir -p /lava-files/rootfs-${DEBIAN_ARCH}/usr/lib/$GCC_ARCH

				find /libdrm/ -name lib\*\.so\* | xargs cp -t /lava-files/rootfs-${DEBIAN_ARCH}/usr/lib/$GCC_ARCH/.

				mkdir -p /lava-files/rootfs-${DEBIAN_ARCH}/libdrm/

				cp -Rp /libdrm/share /lava-files/rootfs-${DEBIAN_ARCH}/libdrm/share

				rm -rf /libdrm

				@@ -196,7 +238,7 @@ popd

				. .gitlab-ci/container/container_post_build.sh

				############### Upload the files!

				ci-fairy minio login $CI_JOB_JWT

				ci-fairy minio login --token-file "${CI_JOB_JWT_FILE}"

				FILES_TO_UPLOAD="lava-rootfs.tgz \

				                 $KERNEL_IMAGE_NAME"

12

.gitlab-ci/container/x86_64.config

View File

@@ -1,6 +1,11 @@
 CONFIG_LOCALVERSION_AUTO=y
 CONFIG_DEBUG_KERNEL=y
 CONFIG_PWM=y
 CONFIG_PM_DEVFREQ=y
 CONFIG_OF=y
 CONFIG_CROS_EC=y
 # abootimg with a 'dummy' rootfs fails with root=/dev/nfs
 CONFIG_BLK_DEV_INITRD=n
@@ -57,7 +62,7 @@ CONFIG_X86_AMD_FREQ_SENSITIVITY=y
 CONFIG_PINCTRL=y
 CONFIG_PINCTRL_AMD=y
 CONFIG_DRM_AMDGPU=m
 CONFIG_DRM_AMDGPU_SI=m
 CONFIG_DRM_AMDGPU_SI=y
 CONFIG_DRM_AMDGPU_USERPTR=y
 CONFIG_DRM_AMD_ACP=n
 CONFIG_ACPI_WMI=y
@@ -67,9 +72,11 @@ CONFIG_PARPORT_PC=y
 CONFIG_PARPORT_SERIAL=y
 CONFIG_SERIAL_8250_DW=y
 CONFIG_CHROME_PLATFORMS=y
 CONFIG_KVM_AMD=m
 #options for Intel devices
 CONFIG_MFD_INTEL_LPSS_PCI=y
 CONFIG_KVM_INTEL=m
 #options for KVM guests
 CONFIG_FUSE_FS=y
@@ -93,3 +100,6 @@ CONFIG_CRYPTO_DEV_VIRTIO=y
 CONFIG_HW_RANDOM_VIRTIO=y
 CONFIG_BLK_MQ_VIRTIO=y
 CONFIG_TUN=y
 CONFIG_VSOCKETS=y
 CONFIG_VIRTIO_VSOCKETS=y
 CONFIG_VHOST_VSOCK=m

1

.gitlab-ci/cross-xfail-s390x

View File

@@ -1,2 +1 @@
 lp_test_arit
 lp_test_format

									
										39

.gitlab-ci/crosvm-init.sh
									
												View File
												
				@@ -1,27 +1,42 @@

				#!/bin/sh

				set -ex

				set -e

				VSOCK_STDOUT=$1

				VSOCK_STDERR=$2

				VSOCK_TEMP_DIR=$3

				mount -t proc none /proc

				mount -t sysfs none /sys

				mount -t devtmpfs none /dev || echo possibly already mounted

				mkdir -p /dev/pts

				mount -t devpts devpts /dev/pts

				mount -t tmpfs tmpfs /tmp

				. /crosvm-env.sh

				. ${VSOCK_TEMP_DIR}/crosvm-env.sh

				# / is ro

				export PIGLIT_REPLAY_EXTRA_ARGS="$PIGLIT_REPLAY_EXTRA_ARGS --db-path /tmp/replayer-db"

				# .gitlab-ci.yml script variable is using relative paths to install directory,

				# so change to that dir before running `crosvm-script`

				cd "${CI_PROJECT_DIR}"

				if sh $CROSVM_TEST_SCRIPT; then

				    touch /results/success

				fi

				# The exception is the dEQP binary, as it needs to run from its own directory

				[ -z "${DEQP_BIN_DIR}" ] || cd "${DEQP_BIN_DIR}"

				sleep 5   # Leave some time to get the last output flushed out

				# Use a FIFO to collect relevant error messages

				STDERR_FIFO=/tmp/crosvm-stderr.fifo

				mkfifo -m 600 ${STDERR_FIFO}

				dmesg --level crit,err,warn -w > ${STDERR_FIFO} &

				DMESG_PID=$!

				# Transfer the errors and crosvm-script output via a pair of virtio-vsocks

				socat -d -u pipe:${STDERR_FIFO} vsock-listen:${VSOCK_STDERR} &

				socat -d -U vsock-listen:${VSOCK_STDOUT} \

				    system:"stdbuf -eL sh ${VSOCK_TEMP_DIR}/crosvm-script.sh 2> ${STDERR_FIFO}; echo \$? > ${VSOCK_TEMP_DIR}/exit_code",nofork

				kill ${DMESG_PID}

				wait

				sync

				poweroff -d -n -f || true

				sleep 10   # Just in case init would exit before the kernel shuts down the VM

				exit 1

				sleep 1   # Just in case init would exit before the kernel shuts down the VM

									
										138

.gitlab-ci/crosvm-runner.sh
									
												View File
												
				@@ -1,49 +1,125 @@

				#!/bin/sh

				set -x

				set -e

				ln -sf $CI_PROJECT_DIR/install /install

				#

				# Helper to generate CIDs for virtio-vsock based communication with processes

				# running inside crosvm guests.

				#

				# A CID is a 32-bit Context Identifier to be assigned to a crosvm instance

				# and must be unique across the host system. For this purpose, let's take

				# the least significant 25 bits from CI_JOB_ID as a base and generate a 7-bit

				# prefix number to handle up to 128 concurrent crosvm instances per job runner.

				#

				# As a result, the following variables are set:

				#  - VSOCK_CID: the crosvm unique CID to be passed as a run argument

				#

				#  - VSOCK_STDOUT, VSOCK_STDERR: the port numbers the guest should accept

				#    vsock connections on in order to transfer output messages

				#

				#  - VSOCK_TEMP_DIR: the temporary directory path used to pass additional

				#    context data towards the guest

				#

				set_vsock_context() {

				    [ -n "${CI_JOB_ID}" ] || {

				        echo "Missing or unset CI_JOB_ID env variable" >&2

				        exit 1

				    }

				export LD_LIBRARY_PATH=$CI_PROJECT_DIR/install/lib/

				export EGL_PLATFORM=surfaceless

				    local dir_prefix="/tmp-vsock."

				    local cid_prefix=0

				    unset VSOCK_TEMP_DIR

				export -p > /crosvm-env.sh

				export GALLIUM_DRIVER="$CROSVM_GALLIUM_DRIVER"

				export GALLIVM_PERF="nopt"

				export LIBGL_ALWAYS_SOFTWARE="true"

				    while [ ${cid_prefix} -lt 128 ]; do

				        VSOCK_TEMP_DIR=${dir_prefix}${cid_prefix}

				        mkdir "${VSOCK_TEMP_DIR}" >/dev/null 2>&1 && break || unset VSOCK_TEMP_DIR

				        cid_prefix=$((cid_prefix + 1))

				    done

				CROSVM_KERNEL_ARGS="root=my_root rw rootfstype=virtiofs loglevel=3 init=$CI_PROJECT_DIR/install/crosvm-init.sh ip=192.168.30.2::192.168.30.1:255.255.255.0:crosvm:eth0"

				    [ -n "${VSOCK_TEMP_DIR}" ] || return 1

				# Temporary results dir because from the guest we cannot write to /

				mkdir -p /results

				mount -t tmpfs tmpfs /results

				    VSOCK_CID=$(((CI_JOB_ID & 0x1ffffff) | ((cid_prefix & 0x7f) << 25)))

				    VSOCK_STDOUT=5001

				    VSOCK_STDERR=5002

				mkdir -p /piglit/.gitlab-ci/piglit

				mount -t tmpfs tmpfs /piglit/.gitlab-ci/piglit

				    return 0

				}

				# The dEQP binary needs to run from the directory it's in

				if [ -n "${1##*.sh}" ] && [ -z "${1##*"deqp"*}" ]; then

				    DEQP_BIN_DIR=$(dirname "$1")

				    export DEQP_BIN_DIR

				fi

				set_vsock_context || { echo "Could not generate crosvm vsock CID" >&2; exit 1; }

				# Ensure cleanup on script exit

				trap 'exit ${exit_code}' INT TERM

				trap 'exit_code=$?; [ -z "${CROSVM_PID}${SOCAT_PIDS}" ] || kill ${CROSVM_PID} ${SOCAT_PIDS} >/dev/null 2>&1 || true; rm -rf ${VSOCK_TEMP_DIR}' EXIT

				# Securely pass the current variables to the crosvm environment

				echo "Variables passed through:"

				SCRIPT_DIR=$(readlink -en "${0%/*}")

				${SCRIPT_DIR}/common/generate-env.sh | tee ${VSOCK_TEMP_DIR}/crosvm-env.sh

				# Set the crosvm-script as the arguments of the current script

				echo "$@" > ${VSOCK_TEMP_DIR}/crosvm-script.sh

				# Setup networking

				/usr/sbin/iptables-legacy -w -t nat -A POSTROUTING -o eth0 -j MASQUERADE

				echo 1 > /proc/sys/net/ipv4/ip_forward

				# Start background processes to receive output from guest

				socat -u vsock-connect:${VSOCK_CID}:${VSOCK_STDERR},retry=200,interval=0.1 stderr &

				SOCAT_PIDS=$!

				socat -u vsock-connect:${VSOCK_CID}:${VSOCK_STDOUT},retry=200,interval=0.1 stdout &

				SOCAT_PIDS="${SOCAT_PIDS} $!"

				# Prepare to start crosvm

				unset DISPLAY

				unset XDG_RUNTIME_DIR

				/usr/sbin/iptables-legacy  -t nat -A POSTROUTING -o eth0 -j MASQUERADE

				echo 1 > /proc/sys/net/ipv4/ip_forward

				CROSVM_KERN_ARGS="quiet console=null root=my_root rw rootfstype=virtiofs ip=192.168.30.2::192.168.30.1:255.255.255.0:crosvm:eth0"

				CROSVM_KERN_ARGS="${CROSVM_KERN_ARGS} init=${SCRIPT_DIR}/crosvm-init.sh -- ${VSOCK_STDOUT} ${VSOCK_STDERR} ${VSOCK_TEMP_DIR}"

				# Crosvm wants this

				syslogd > /dev/null

				[ "${CROSVM_GALLIUM_DRIVER}" = "llvmpipe" ] && \

				    CROSVM_LIBGL_ALWAYS_SOFTWARE=true || CROSVM_LIBGL_ALWAYS_SOFTWARE=false

				# We aren't testing LLVMPipe here, so we don't need to validate NIR on the host

				export NIR_VALIDATE=0

				set +e -x

				# We aren't testing the host driver here, so we don't need to validate NIR on the host

				NIR_DEBUG="novalidate" \

				LIBGL_ALWAYS_SOFTWARE=${CROSVM_LIBGL_ALWAYS_SOFTWARE} \

				GALLIUM_DRIVER=${CROSVM_GALLIUM_DRIVER} \

				crosvm run \

				  --gpu "$CROSVM_GPU_ARGS" \

				  -m 4096 \

				  -c $((FDO_CI_CONCURRENT > 1 ? FDO_CI_CONCURRENT - 1 : 1)) \

				  --disable-sandbox \

				  --shared-dir /:my_root:type=fs:writeback=true:timeout=60:cache=always \

				  --host_ip=192.168.30.1 --netmask=255.255.255.0 --mac "AA:BB:CC:00:00:12" \

				  -p "$CROSVM_KERNEL_ARGS" \

				  /lava-files/bzImage

				    --gpu "${CROSVM_GPU_ARGS}" -m 4096 -c 2 --disable-sandbox \

				    --shared-dir /:my_root:type=fs:writeback=true:timeout=60:cache=always \

				    --host_ip "192.168.30.1" --netmask "255.255.255.0" --mac "AA:BB:CC:00:00:12" \

				    --cid ${VSOCK_CID} -p "${CROSVM_KERN_ARGS}" \

				    /lava-files/${KERNEL_IMAGE_NAME:-bzImage} > ${VSOCK_TEMP_DIR}/crosvm 2>&1 &

				mkdir -p $CI_PROJECT_DIR/results

				mv /results/* $CI_PROJECT_DIR/results/.

				# Wait for crosvm process to terminate

				CROSVM_PID=$!

				wait ${CROSVM_PID}

				CROSVM_RET=$?

				unset CROSVM_PID

				test -f $CI_PROJECT_DIR/results/success

				[ ${CROSVM_RET} -eq 0 ] && {

				    # socat background processes terminate gracefully on remote peers exit

				    wait

				    unset SOCAT_PIDS

				    # The actual return code is the crosvm guest script's exit code

				    CROSVM_RET=$(cat ${VSOCK_TEMP_DIR}/exit_code 2>/dev/null)

				    # Force error when the guest script's exit code is not available

				    CROSVM_RET=${CROSVM_RET:-1}

				}

				# Show crosvm output on error to help with debugging

				[ ${CROSVM_RET} -eq 0 ] || {

				    set +x

				    echo "Dumping crosvm output.." >&2

				    cat ${VSOCK_TEMP_DIR}/crosvm >&2

				    set -x

				}

				exit ${CROSVM_RET}

									
										222

.gitlab-ci/deqp-runner.sh
									
												View File
												
				@@ -1,31 +1,14 @@

				#!/bin/sh

				echo -e "\e[0Ksection_start:$(date +%s):test_setup[collapsed=true]\r\e[0Kpreparing test setup"

				set -ex

				DEQP_WIDTH=${DEQP_WIDTH:-256}

				DEQP_HEIGHT=${DEQP_HEIGHT:-256}

				DEQP_CONFIG=${DEQP_CONFIG:-rgba8888d24s8ms0}

				DEQP_VARIANT=${DEQP_VARIANT:-master}

				DEQP_OPTIONS="$DEQP_OPTIONS --deqp-surface-width=$DEQP_WIDTH --deqp-surface-height=$DEQP_HEIGHT"

				DEQP_OPTIONS="$DEQP_OPTIONS --deqp-surface-type=${DEQP_SURFACE_TYPE:-pbuffer}"

				DEQP_OPTIONS="$DEQP_OPTIONS --deqp-gl-config-name=$DEQP_CONFIG"

				DEQP_OPTIONS="$DEQP_OPTIONS --deqp-visibility=hidden"

				if [ -z "$DEQP_VER" ]; then

				   echo 'DEQP_VER must be set to something like "gles2", "gles31-khr" or "vk" for the test run'

				   exit 1

				fi

				if [ "$DEQP_VER" = "vk" ]; then

				   if [ -z "$VK_DRIVER" ]; then

				      echo 'VK_DRIVER must be to something like "radeon" or "intel" for the test run'

				      exit 1

				   fi

				fi

				# Needed so configuration files can contain paths to files in /install

				ln -sf $CI_PROJECT_DIR/install /install

				if [ -z "$GPU_VERSION" ]; then

				   echo 'GPU_VERSION must be set to something like "llvmpipe" or "freedreno-a630" (the name used in .gitlab-ci/deqp-gpu-version-*.txt)'

				   echo 'GPU_VERSION must be set to something like "llvmpipe" or "freedreno-a630" (the name used in .gitlab-ci/gpu-version-*.txt)'

				   exit 1

				fi

				@@ -36,35 +19,57 @@ export LD_LIBRARY_PATH=`pwd`/install/lib/

				export EGL_PLATFORM=surfaceless

				export VK_ICD_FILENAMES=`pwd`/install/share/vulkan/icd.d/"$VK_DRIVER"_icd.${VK_CPU:-`uname -m`}.json

				# the runner was failing to look for libkms in /usr/local/lib for some reason

				# I never figured out.

				export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/usr/local/lib

				RESULTS=`pwd`/${DEQP_RESULTS_DIR:-results}

				mkdir -p $RESULTS

				# Ensure Mesa Shader Cache resides on tmpfs.

				SHADER_CACHE_HOME=${XDG_CACHE_HOME:-${HOME}/.cache}

				SHADER_CACHE_DIR=${MESA_SHADER_CACHE_DIR:-${SHADER_CACHE_HOME}/mesa_shader_cache}

				findmnt -n tmpfs ${SHADER_CACHE_HOME} || findmnt -n tmpfs ${SHADER_CACHE_DIR} || {

				    mkdir -p ${SHADER_CACHE_DIR}

				    mount -t tmpfs -o nosuid,nodev,size=2G,mode=1755 tmpfs ${SHADER_CACHE_DIR}

				}

				HANG_DETECTION_CMD=""

				# Generate test case list file.

				if [ "$DEQP_VER" = "vk" ]; then

				   MUSTPASS=/deqp/mustpass/vk-$DEQP_VARIANT.txt

				   DEQP=/deqp/external/vulkancts/modules/vulkan/deqp-vk

				   HANG_DETECTION_CMD="/parallel-deqp-runner/build/bin/hang-detection"

				elif [ "$DEQP_VER" = "gles2" -o "$DEQP_VER" = "gles3" -o "$DEQP_VER" = "gles31" -o "$DEQP_VER" = "egl" ]; then

				   MUSTPASS=/deqp/mustpass/$DEQP_VER-$DEQP_VARIANT.txt

				   DEQP=/deqp/modules/$DEQP_VER/deqp-$DEQP_VER

				   SUITE=dEQP

				elif [ "$DEQP_VER" = "gles2-khr" -o "$DEQP_VER" = "gles3-khr" -o "$DEQP_VER" = "gles31-khr" -o "$DEQP_VER" = "gles32-khr" ]; then

				   MUSTPASS=/deqp/mustpass/$DEQP_VER-$DEQP_VARIANT.txt

				   DEQP=/deqp/external/openglcts/modules/glcts

				   SUITE=dEQP

				else

				   MUSTPASS=/deqp/mustpass/$DEQP_VER-$DEQP_VARIANT.txt

				   DEQP=/deqp/external/openglcts/modules/glcts

				   SUITE=KHR

				fi

				if [ -z "$DEQP_SUITE" ]; then

				    if [ -z "$DEQP_VER" ]; then

				        echo 'DEQP_SUITE must be set to the name of your deqp-gpu_version.toml, or DEQP_VER must be set to something like "gles2", "gles31-khr" or "vk" for the test run'

				        exit 1

				    fi

				    DEQP_WIDTH=${DEQP_WIDTH:-256}

				    DEQP_HEIGHT=${DEQP_HEIGHT:-256}

				    DEQP_CONFIG=${DEQP_CONFIG:-rgba8888d24s8ms0}

				    DEQP_VARIANT=${DEQP_VARIANT:-master}

				    DEQP_OPTIONS="$DEQP_OPTIONS --deqp-surface-width=$DEQP_WIDTH --deqp-surface-height=$DEQP_HEIGHT"

				    DEQP_OPTIONS="$DEQP_OPTIONS --deqp-surface-type=${DEQP_SURFACE_TYPE:-pbuffer}"

				    DEQP_OPTIONS="$DEQP_OPTIONS --deqp-gl-config-name=$DEQP_CONFIG"

				    DEQP_OPTIONS="$DEQP_OPTIONS --deqp-visibility=hidden"

				    if [ "$DEQP_VER" = "vk" -a -z "$VK_DRIVER" ]; then

				        echo 'VK_DRIVER must be to something like "radeon" or "intel" for the test run'

				        exit 1

				    fi

				    # Generate test case list file.

				    if [ "$DEQP_VER" = "vk" ]; then

				       MUSTPASS=/deqp/mustpass/vk-$DEQP_VARIANT.txt

				       DEQP=/deqp/external/vulkancts/modules/vulkan/deqp-vk

				       HANG_DETECTION_CMD="/parallel-deqp-runner/build/bin/hang-detection"

				    elif [ "$DEQP_VER" = "gles2" -o "$DEQP_VER" = "gles3" -o "$DEQP_VER" = "gles31" -o "$DEQP_VER" = "egl" ]; then

				       MUSTPASS=/deqp/mustpass/$DEQP_VER-$DEQP_VARIANT.txt

				       DEQP=/deqp/modules/$DEQP_VER/deqp-$DEQP_VER

				    elif [ "$DEQP_VER" = "gles2-khr" -o "$DEQP_VER" = "gles3-khr" -o "$DEQP_VER" = "gles31-khr" -o "$DEQP_VER" = "gles32-khr" ]; then

				       MUSTPASS=/deqp/mustpass/$DEQP_VER-$DEQP_VARIANT.txt

				       DEQP=/deqp/external/openglcts/modules/glcts

				    else

				       MUSTPASS=/deqp/mustpass/$DEQP_VER-$DEQP_VARIANT.txt

				       DEQP=/deqp/external/openglcts/modules/glcts

				    fi

				    cp $MUSTPASS /tmp/case-list.txt

				    # If the caselist is too long to run in a reasonable amount of time, let the job

				@@ -94,80 +99,30 @@ if [ -z "$DEQP_SUITE" ]; then

				    fi

				fi

				if [ -e "$INSTALL/deqp-$GPU_VERSION-fails.txt" ]; then

				    DEQP_RUNNER_OPTIONS="$DEQP_RUNNER_OPTIONS --baseline $INSTALL/deqp-$GPU_VERSION-fails.txt"

				if [ -e "$INSTALL/$GPU_VERSION-fails.txt" ]; then

				    DEQP_RUNNER_OPTIONS="$DEQP_RUNNER_OPTIONS --baseline $INSTALL/$GPU_VERSION-fails.txt"

				fi

				# Default to an empty known flakes file if it doesn't exist.

				touch $INSTALL/deqp-$GPU_VERSION-flakes.txt

				touch $INSTALL/$GPU_VERSION-flakes.txt

				if [ -n "$VK_DRIVER" ] && [ -e "$INSTALL/deqp-$VK_DRIVER-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/deqp-$VK_DRIVER-skips.txt"

				if [ -n "$VK_DRIVER" ] && [ -e "$INSTALL/$VK_DRIVER-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/$VK_DRIVER-skips.txt"

				fi

				if [ -n "$GALLIUM_DRIVER" ] && [ -e "$INSTALL/deqp-$GALLIUM_DRIVER-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/deqp-$GALLIUM_DRIVER-skips.txt"

				if [ -n "$GALLIUM_DRIVER" ] && [ -e "$INSTALL/$GALLIUM_DRIVER-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/$GALLIUM_DRIVER-skips.txt"

				fi

				if [ -n "$DRIVER_NAME" ] && [ -e "$INSTALL/deqp-$DRIVER_NAME-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/deqp-$DRIVER_NAME-skips.txt"

				if [ -n "$DRIVER_NAME" ] && [ -e "$INSTALL/$DRIVER_NAME-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/$DRIVER_NAME-skips.txt"

				fi

				if [ -e "$INSTALL/deqp-$GPU_VERSION-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/deqp-$GPU_VERSION-skips.txt"

				if [ -e "$INSTALL/$GPU_VERSION-skips.txt" ]; then

				    DEQP_SKIPS="$DEQP_SKIPS $INSTALL/$GPU_VERSION-skips.txt"

				fi

				set +e

				if [ -n "$DEQP_PARALLEL" ]; then

				   JOB="--jobs $DEQP_PARALLEL"

				elif [ -n "$FDO_CI_CONCURRENT" ]; then

				   JOB="--jobs $FDO_CI_CONCURRENT"

				else

				   JOB="--jobs 4"

				fi

				parse_renderer() {

				    RENDERER=`grep -A1 TestCaseResult.\*info.renderer $RESULTS/deqp-info.qpa | grep '<Text' | sed 's|.*<Text>||g' | sed 's|</Text>||g'`

				    VERSION=`grep -A1 TestCaseResult.\*info.version $RESULTS/deqp-info.qpa | grep '<Text' | sed 's|.*<Text>||g' | sed 's|</Text>||g'`

				    echo "Renderer: $RENDERER"

				    echo "Version: $VERSION "

				    if ! echo $RENDERER | grep -q $DEQP_EXPECTED_RENDERER; then

				        echo "Expected GL_RENDERER $DEQP_EXPECTED_RENDERER"

				        exit 1

				    fi

				}

				check_renderer() {

				    if echo $DEQP_VER | grep -q egl; then

				        return

				    fi

				    echo "Capturing renderer info for GLES driver sanity checks"

				    # If you're having trouble loading your driver, uncommenting this may help

				    # debug.

				    # export EGL_LOG_LEVEL=debug

				    VERSION=`echo $DEQP_VER | cut -d '-' -f1 | tr '[a-z]' '[A-Z]'`

				    export LD_PRELOAD=$TEST_LD_PRELOAD

				    $DEQP $DEQP_OPTIONS --deqp-case=$SUITE-$VERSION.info.\* --deqp-log-filename=$RESULTS/deqp-info.qpa

				    export LD_PRELOAD=

				    parse_renderer

				}

				check_vk_device_name() {

				    echo "Capturing device info for VK driver sanity checks"

				    export LD_PRELOAD=$TEST_LD_PRELOAD

				    $DEQP $DEQP_OPTIONS --deqp-case=dEQP-VK.info.device --deqp-log-filename=$RESULTS/deqp-info.qpa

				    export LD_PRELOAD=

				    DEVICENAME=`grep deviceName $RESULTS/deqp-info.qpa | sed 's|deviceName: ||g'`

				    echo "deviceName: $DEVICENAME"

				    if ! echo $DEVICENAME | grep -q "$DEQP_EXPECTED_RENDERER"; then

				        echo "Expected deviceName $DEQP_EXPECTED_RENDERER"

				        exit 1

				    fi

				}

				report_load() {

				    echo "System load: $(cut -d' ' -f1-3 < /proc/loadavg)"

				    echo "# of CPU cores: $(cat /proc/cpuinfo | grep processor | wc -l)"

				@@ -190,34 +145,37 @@ if [ "$GALLIUM_DRIVER" = "virpipe" ]; then

				    fi

				    GALLIUM_DRIVER=llvmpipe \

				    GALLIVM_PERF="nopt" \

				    virgl_test_server $VTEST_ARGS >$RESULTS/vtest-log.txt 2>&1 &

				    sleep 1

				fi

				if [ $DEQP_VER = vk ]; then

				    quiet check_vk_device_name

				else

				    quiet check_renderer

				if [ -z "$DEQP_SUITE" ]; then

				    if [ -n "$DEQP_EXPECTED_RENDERER" ]; then

				        export DEQP_RUNNER_OPTIONS="$DEQP_RUNNER_OPTIONS --renderer-check "$DEQP_EXPECTED_RENDERER""

				    fi

				    if [ $DEQP_VER != vk -a $DEQP_VER != egl ]; then

				        export DEQP_RUNNER_OPTIONS="$DEQP_RUNNER_OPTIONS --version-check `cat $INSTALL/VERSION | sed 's/[() ]/./g'`"

				    fi

				fi

				RESULTS_CSV=$RESULTS/results.csv

				FAILURES_CSV=$RESULTS/failures.csv

				set +x

				echo -e "\e[0Ksection_end:$(date +%s):test_setup\r\e[0K"

				export LD_PRELOAD=$TEST_LD_PRELOAD

				echo -e "\e[0Ksection_start:$(date +%s):deqp[collapsed=false]\r\e[0Kdeqp-runner"

				set -x

				set +e

				if [ -z "$DEQP_SUITE" ]; then

				    deqp-runner \

				        run \

				        --deqp $DEQP \

				        --output $RESULTS \

				        --caselist /tmp/case-list.txt \

				        --skips $INSTALL/deqp-all-skips.txt $DEQP_SKIPS \

				        --flakes $INSTALL/deqp-$GPU_VERSION-flakes.txt \

				        --skips $INSTALL/all-skips.txt $DEQP_SKIPS \

				        --flakes $INSTALL/$GPU_VERSION-flakes.txt \

				        --testlog-to-xml /deqp/executor/testlog-to-xml \

				        $JOB \

				        $SUMMARY_LIMIT \

				        --jobs ${FDO_CI_CONCURRENT:-4} \

					$DEQP_RUNNER_OPTIONS \

				        -- \

				        $DEQP_OPTIONS

				@@ -226,20 +184,24 @@ else

				        suite \

				        --suite $INSTALL/deqp-$DEQP_SUITE.toml \

				        --output $RESULTS \

				        --skips $INSTALL/deqp-all-skips.txt $DEQP_SKIPS \

				        --flakes $INSTALL/deqp-$GPU_VERSION-flakes.txt \

				        --skips $INSTALL/all-skips.txt $DEQP_SKIPS \

				        --flakes $INSTALL/$GPU_VERSION-flakes.txt \

				        --testlog-to-xml /deqp/executor/testlog-to-xml \

				        --fraction-start $CI_NODE_INDEX \

				        --fraction $CI_NODE_TOTAL \

				        $JOB \

				        $SUMMARY_LIMIT \

				        --fraction `expr $CI_NODE_TOTAL \* ${DEQP_FRACTION:-1}` \

				        --jobs ${FDO_CI_CONCURRENT:-4} \

					$DEQP_RUNNER_OPTIONS

				fi

				DEQP_EXITCODE=$?

				export LD_PRELOAD=

				quiet report_load

				set +x

				echo -e "\e[0Ksection_end:$(date +%s):deqp\r\e[0K"

				report_load

				echo -e "\e[0Ksection_start:$(date +%s):test_post_process[collapsed=true]\r\e[0Kpost-processing test results"

				set -x

				# Remove all but the first 50 individual XML files uploaded as artifacts, to

				# save fd.o space when you break everything.

				@@ -253,8 +215,8 @@ find $RESULTS -name \*.xml \

				    -exec cp /deqp/testlog.css /deqp/testlog.xsl "$RESULTS/" ";" \

				    -quit

				$HANG_DETECTION_CMD deqp-runner junit \

				   --testsuite $DEQP_VER \

				deqp-runner junit \

				   --testsuite dEQP \

				   --results $RESULTS/failures.csv \

				   --output $RESULTS/junit.xml \

				   --limit 50 \

				@@ -265,8 +227,8 @@ if [ -n "$FLAKES_CHANNEL" ]; then

				  python3 $INSTALL/report-flakes.py \

				         --host irc.oftc.net \

				         --port 6667 \

				         --results $RESULTS_CSV \

				         --known-flakes $INSTALL/deqp-$GPU_VERSION-flakes.txt \

				         --results $RESULTS/results.csv \

				         --known-flakes $INSTALL/$GPU_VERSION-flakes.txt \

				         --channel "$FLAKES_CHANNEL" \

				         --runner "$CI_RUNNER_DESCRIPTION" \

				         --job "$CI_JOB_ID" \

				@@ -275,4 +237,6 @@ if [ -n "$FLAKES_CHANNEL" ]; then

				         --branch-title "${CI_MERGE_REQUEST_TITLE:-$CI_COMMIT_TITLE}"

				fi

				echo -e "\e[0Ksection_end:$(date +%s):test_post_process\r\e[0K"

				exit $DEQP_EXITCODE

									
										8

.gitlab-ci/download-git-cache.sh
									
												View File
												
				@@ -5,7 +5,7 @@ set -o xtrace

				# if we run this script outside of gitlab-ci for testing, ensure

				# we got meaningful variables

				CI_PROJECT_DIR=${CI_PROJECT_DIR:-$(mktemp -d)/mesa}

				CI_PROJECT_DIR=${CI_PROJECT_DIR:-$(mktemp -d)/$CI_PROJECT_NAME}

				if [[ -e $CI_PROJECT_DIR/.git ]]

				then

				@@ -16,8 +16,8 @@ fi

				TMP_DIR=$(mktemp -d)

				echo "Downloading archived master..."

				/usr/bin/wget -O $TMP_DIR/mesa.tar.gz \

				              https://${MINIO_HOST}/git-cache/${FDO_UPSTREAM_REPO}/mesa.tar.gz

				/usr/bin/wget -O $TMP_DIR/$CI_PROJECT_NAME.tar.gz \

				              https://${MINIO_HOST}/git-cache/${FDO_UPSTREAM_REPO}/$CI_PROJECT_NAME.tar.gz

				# check wget error code

				if [[ $? -ne 0 ]]

				@@ -31,6 +31,6 @@ set -e

				rm -rf "$CI_PROJECT_DIR"

				echo "Extracting tarball into '$CI_PROJECT_DIR'..."

				mkdir -p "$CI_PROJECT_DIR"

				tar xzf "$TMP_DIR/mesa.tar.gz" -C "$CI_PROJECT_DIR"

				tar xzf "$TMP_DIR/$CI_PROJECT_NAME.tar.gz" -C "$CI_PROJECT_DIR"

				rm -rf "$TMP_DIR"

				chmod a+w "$CI_PROJECT_DIR"

									
										70

.gitlab-ci/gtest-runner.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,70 @@

				#!/bin/sh

				set -ex

				INSTALL=`pwd`/install

				# Set up the driver environment.

				export LD_LIBRARY_PATH=`pwd`/install/lib/

				export LIBVA_DRIVERS_PATH=`pwd`/install/lib/dri/

				# libva spams driver open info by default, and that happens per testcase.

				export LIBVA_MESSAGING_LEVEL=1

				if [ -e "$INSTALL/$GPU_VERSION-fails.txt" ]; then

				    GTEST_RUNNER_OPTIONS="$GTEST_RUNNER_OPTIONS --baseline $INSTALL/$GPU_VERSION-fails.txt"

				fi

				# Default to an empty known flakes file if it doesn't exist.

				touch $INSTALL/$GPU_VERSION-flakes.txt

				if [ -n "$GALLIUM_DRIVER" ] && [ -e "$INSTALL/$GALLIUM_DRIVER-skips.txt" ]; then

				    GTEST_SKIPS="$GTEST_SKIPS --skips $INSTALL/$GALLIUM_DRIVER-skips.txt"

				fi

				if [ -n "$DRIVER_NAME" ] && [ -e "$INSTALL/$DRIVER_NAME-skips.txt" ]; then

				    GTEST_SKIPS="$GTEST_SKIPS --skips $INSTALL/$DRIVER_NAME-skips.txt"

				fi

				if [ -e "$INSTALL/$GPU_VERSION-skips.txt" ]; then

				    GTEST_SKIPS="$GTEST_SKIPS --skips $INSTALL/$GPU_VERSION-skips.txt"

				fi

				set +e

				gtest-runner \

				    run \

				    --gtest $GTEST \

				    --output ${GTEST_RESULTS_DIR:-results} \

				    --jobs ${FDO_CI_CONCURRENT:-4} \

				    $GTEST_SKIPS \

				    --flakes $INSTALL/$GPU_VERSION-flakes.txt \

				    --fraction-start ${CI_NODE_INDEX:-1} \

				    --fraction $((${CI_NODE_TOTAL:-1} * ${GTEST_FRACTION:-1})) \

				    --env "LD_PRELOAD=$TEST_LD_PRELOAD" \

				    $GTEST_RUNNER_OPTIONS

				GTEST_EXITCODE=$?

				deqp-runner junit \

				   --testsuite gtest \

				   --results $RESULTS/failures.csv \

				   --output $RESULTS/junit.xml \

				   --limit 50 \

				   --template "See https://$CI_PROJECT_ROOT_NAMESPACE.pages.freedesktop.org/-/$CI_PROJECT_NAME/-/jobs/$CI_JOB_ID/artifacts/results/{{testcase}}.xml"

				# Report the flakes to the IRC channel for monitoring (if configured):

				if [ -n "$FLAKES_CHANNEL" ]; then

				  python3 $INSTALL/report-flakes.py \

				         --host irc.oftc.net \

				         --port 6667 \

				         --results $RESULTS/results.csv \

				         --known-flakes $INSTALL/$GPU_VERSION-flakes.txt \

				         --channel "$FLAKES_CHANNEL" \

				         --runner "$CI_RUNNER_DESCRIPTION" \

				         --job "$CI_JOB_ID" \

				         --url "$CI_JOB_URL" \

				         --branch "${CI_MERGE_REQUEST_SOURCE_BRANCH_NAME:-$CI_COMMIT_BRANCH}" \

				         --branch-title "${CI_MERGE_REQUEST_TITLE:-$CI_COMMIT_TITLE}"

				fi

				exit $GTEST_EXITCODE

									
										21

.gitlab-ci/image-tags.yml
									
										Normal file
									
												View File
												
				@@ -0,0 +1,21 @@

				variables:

				   DEBIAN_X86_BUILD_BASE_IMAGE: "debian/x86_build-base"

				   DEBIAN_BASE_TAG: "2022-02-21-libdrm"

				   DEBIAN_X86_BUILD_IMAGE_PATH: "debian/x86_build"

				   DEBIAN_BUILD_TAG: "2022-02-21-libdrm"

				   DEBIAN_X86_TEST_BASE_IMAGE: "debian/x86_test-base"

				   DEBIAN_X86_TEST_IMAGE_PATH: "debian/x86_test-gl"

				   DEBIAN_X86_TEST_GL_TAG: "2022-04-07-virgl-crosvm"

				   DEBIAN_X86_TEST_VK_TAG: "2022-04-05-deqp-runner"

				   FEDORA_X86_BUILD_TAG: "2022-03-18-spirv-tools-5"

				   KERNEL_ROOTFS_TAG: "2022-04-07-prefix-skqp"

				   WINDOWS_X64_BUILD_PATH: "windows/x64_build"

				   WINDOWS_X64_BUILD_TAG: "2022-20-02-base_split"

				   WINDOWS_X64_TEST_PATH: "windows/x64_test"

				   WINDOWS_X64_TEST_TAG: "2022-04-13-dozen_ci"

									
										11

.gitlab-ci/lava/lava-gitlab-ci.yml
									
												View File
												
				@@ -5,7 +5,7 @@

				  interruptible: true

				  variables:

				    GIT_STRATEGY: none # testing doesn't build anything from source

				    DEQP_PARALLEL: 6 # should be replaced by per-machine definitions

				    FDO_CI_CONCURRENT: 6 # should be replaced by per-machine definitions

				    DEQP_VER: gles2

				    # proxy used to cache data locally

				    FDO_HTTP_CACHE_URI: "http://caching-proxy/cache/?uri="

				@@ -14,20 +14,23 @@

				    BASE_SYSTEM_MAINLINE_HOST_PATH: "${BASE_SYSTEM_HOST_PREFIX}/${FDO_UPSTREAM_REPO}/${DISTRIBUTION_TAG}/${ARCH}"

				    BASE_SYSTEM_FORK_HOST_PATH: "${BASE_SYSTEM_HOST_PREFIX}/${CI_PROJECT_PATH}/${DISTRIBUTION_TAG}/${ARCH}"

				    # per-job build artifacts

				    MESA_BUILD_PATH: "${PIPELINE_ARTIFACTS_BASE}/mesa-${ARCH}.tar.gz"

				    BUILD_PATH: "${PIPELINE_ARTIFACTS_BASE}/${CI_PROJECT_NAME}-${ARCH}.tar.gz"

				    JOB_ROOTFS_OVERLAY_PATH: "${JOB_ARTIFACTS_BASE}/job-rootfs-overlay.tar.gz"

				    JOB_RESULTS_PATH: "${JOB_ARTIFACTS_BASE}/results.tar.gz"

				    MINIO_RESULTS_UPLOAD: "${JOB_ARTIFACTS_BASE}"

				    PIGLIT_NO_WINDOW: 1

				    VISIBILITY_GROUP: "Collabora+fdo"

				  script:

				    - ./artifacts/lava/lava-submit.sh

				  artifacts:

				    name: "mesa_${CI_JOB_NAME}"

				    name: "${CI_PROJECT_NAME}_${CI_JOB_NAME}"

				    when: always

				    paths:

				      - results/

				    exclude:

				      - results/*.shader_cache

				  tags:

				    - $RUNNER_TAG

				  after_script:

				    - wget -q "https://${JOB_RESULTS_PATH}" -O- | tar -xz

				@@ -85,7 +88,7 @@

				.lava-traces-base:

				  variables:

				    HWCI_TEST_SCRIPT: "/install/piglit/run.sh"

				    HWCI_TEST_SCRIPT: "/install/piglit/piglit-traces.sh"

				  artifacts:

				    reports:

				      junit: results/junit.xml

									
										34

.gitlab-ci/lava/lava-pytest.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,34 @@

				#!/bin/sh

				#

				# Copyright (C) 2022 Collabora Limited

				# Author: Guilherme Gallo <guilherme.gallo@collabora.com>

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,

				# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE

				# SOFTWARE.

				# This script runs unit/integration tests related with LAVA CI tools

				set -ex

				TEST_DIR=${CI_PROJECT_DIR}/.gitlab-ci/tests

				PYTHONPATH="${TEST_DIR}:${PYTHONPATH}" python3 -m \

				    pytest "${TEST_DIR}" \

				            -W ignore::DeprecationWarning \

				            --junitxml=artifacts/ci_scripts_report.xml

									
										20

.gitlab-ci/lava/lava-submit.sh
									
												View File
												
				@@ -14,15 +14,16 @@ fi

				rm -rf results

				mkdir -p results/job-rootfs-overlay/

				# LAVA always uploads to MinIO when necessary as we don't have direct upload

				# from the DUT

				export PIGLIT_REPLAY_UPLOAD_TO_MINIO=1

				cp artifacts/ci-common/capture-devcoredump.sh results/job-rootfs-overlay/

				cp artifacts/ci-common/init-*.sh results/job-rootfs-overlay/

				artifacts/ci-common/generate-env.sh > results/job-rootfs-overlay/set-job-env-vars.sh

				cp artifacts/ci-common/intel-gpu-freq.sh results/job-rootfs-overlay/

				# Prepare env vars for upload.

				KERNEL_IMAGE_BASE_URL="https://${BASE_SYSTEM_HOST_PATH}" \

					artifacts/ci-common/generate-env.sh > results/job-rootfs-overlay/set-job-env-vars.sh

				tar zcf job-rootfs-overlay.tar.gz -C results/job-rootfs-overlay/ .

				ci-fairy minio login "${CI_JOB_JWT}"

				ci-fairy minio login --token-file "${CI_JOB_JWT_FILE}"

				ci-fairy minio cp job-rootfs-overlay.tar.gz "minio://${JOB_ROOTFS_OVERLAY_PATH}"

				touch results/lava.log

				@@ -30,15 +31,16 @@ tail -f results/lava.log &

				artifacts/lava/lava_job_submitter.py \

					--dump-yaml \

					--pipeline-info "$CI_JOB_NAME: $CI_PIPELINE_URL on $CI_COMMIT_REF_NAME ${CI_NODE_INDEX}/${CI_NODE_TOTAL}" \

					--base-system-url-prefix "https://${BASE_SYSTEM_HOST_PATH}" \

					--mesa-build-url "${FDO_HTTP_CACHE_URI:-}https://${MESA_BUILD_PATH}" \

					--rootfs-url-prefix "https://${BASE_SYSTEM_HOST_PATH}" \

					--kernel-url-prefix "https://${BASE_SYSTEM_HOST_PATH}" \

					--build-url "${FDO_HTTP_CACHE_URI:-}https://${BUILD_PATH}" \

					--job-rootfs-overlay-url "${FDO_HTTP_CACHE_URI:-}https://${JOB_ROOTFS_OVERLAY_PATH}" \

					--job-artifacts-base ${JOB_ARTIFACTS_BASE} \

					--job-timeout ${JOB_TIMEOUT:-30} \

					--first-stage-init artifacts/ci-common/init-stage1.sh \

					--ci-project-dir ${CI_PROJECT_DIR} \

					--device-type ${DEVICE_TYPE} \

					--dtb ${DTB} \

					--jwt "${CI_JOB_JWT}" \

					--jwt-file "${CI_JOB_JWT_FILE}" \

					--kernel-image-name ${KERNEL_IMAGE_NAME} \

					--kernel-image-type "${KERNEL_IMAGE_TYPE}" \

					--boot-method ${BOOT_METHOD} \

									
										136

.gitlab-ci/lava/lava_job_submitter.py
									
												View File
												
				@@ -25,31 +25,33 @@

				"""Send a job to LAVA, track it and collect log back"""

				import argparse

				import lavacli

				import os

				import pathlib

				import sys

				import time

				import traceback

				import urllib.parse

				import xmlrpc

				import yaml

				from datetime import datetime, timedelta

				from os import getenv

				import lavacli

				import yaml

				from lavacli.utils import loader

				# Timeout in minutes to decide if the device from the dispatched LAVA job has

				# Timeout in seconds to decide if the device from the dispatched LAVA job has

				# hung or not due to the lack of new log output.

				DEVICE_HANGING_TIMEOUT_MIN = 5

				DEVICE_HANGING_TIMEOUT_SEC = int(getenv("LAVA_DEVICE_HANGING_TIMEOUT_SEC",  5*60))

				# How many seconds the script should wait before try a new polling iteration to

				# check if the dispatched LAVA job is running or waiting in the job queue.

				WAIT_FOR_DEVICE_POLLING_TIME_SEC = 10

				WAIT_FOR_DEVICE_POLLING_TIME_SEC = int(getenv("LAVA_WAIT_FOR_DEVICE_POLLING_TIME_SEC", 10))

				# How many seconds to wait between log output LAVA RPC calls.

				LOG_POLLING_TIME_SEC = 5

				LOG_POLLING_TIME_SEC = int(getenv("LAVA_LOG_POLLING_TIME_SEC", 5))

				# How many retries should be made when a timeout happen.

				NUMBER_OF_RETRIES_TIMEOUT_DETECTION = 2

				NUMBER_OF_RETRIES_TIMEOUT_DETECTION = int(getenv("LAVA_NUMBER_OF_RETRIES_TIMEOUT_DETECTION", 2))

				def print_log(msg):

				@@ -59,6 +61,11 @@ def fatal_err(msg):

				    print_log(msg)

				    sys.exit(1)

				def hide_sensitive_data(yaml_data, hide_tag="HIDEME"):

				    return "".join(line for line in yaml_data.splitlines(True) if hide_tag not in line)

				def generate_lava_yaml(args):

				    # General metadata and permissions, plus also inexplicably kernel arguments

				    values = {

				@@ -67,11 +74,11 @@ def generate_lava_yaml(args):

				        'visibility': { 'group': [ args.visibility_group ] },

				        'priority': 75,

				        'context': {

				            'extra_nfsroot_args': ' init=/init rootwait minio_results={}'.format(args.job_artifacts_base)

				            'extra_nfsroot_args': ' init=/init rootwait usbcore.quirks=0bda:8153:k'

				        },

				        'timeouts': {

				            'job': {

				                'minutes': 30

				                'minutes': args.job_timeout

				            }

				        },

				    }

				@@ -86,10 +93,10 @@ def generate_lava_yaml(args):

				      'to': 'tftp',

				      'os': 'oe',

				      'kernel': {

				        'url': '{}/{}'.format(args.base_system_url_prefix, args.kernel_image_name),

				        'url': '{}/{}'.format(args.kernel_url_prefix, args.kernel_image_name),

				      },

				      'nfsrootfs': {

				        'url': '{}/lava-rootfs.tgz'.format(args.base_system_url_prefix),

				        'url': '{}/lava-rootfs.tgz'.format(args.rootfs_url_prefix),

				        'compression': 'gz',

				      }

				    }

				@@ -97,7 +104,7 @@ def generate_lava_yaml(args):

				        deploy['kernel']['type'] = args.kernel_image_type

				    if args.dtb:

				        deploy['dtb'] = {

				          'url': '{}/{}.dtb'.format(args.base_system_url_prefix, args.dtb)

				          'url': '{}/{}.dtb'.format(args.kernel_url_prefix, args.dtb)

				        }

				    # always boot over NFS

				@@ -111,7 +118,7 @@ def generate_lava_yaml(args):

				    # skeleton test definition: only declaring each job as a single 'test'

				    # since LAVA's test parsing is not useful to us

				    test = {

				      'timeout': { 'minutes': 30 },

				      'timeout': { 'minutes': args.job_timeout },

				      'failure_retry': 1,

				      'definitions': [ {

				        'name': 'mesa',

				@@ -140,15 +147,22 @@ def generate_lava_yaml(args):

				    #   - fetch and unpack per-job environment from lava-submit.sh

				    #   - exec .gitlab-ci/common/init-stage2.sh 

				    init_lines = []

				    with open(args.first_stage_init, 'r') as init_sh:

				      init_lines += [ x.rstrip() for x in init_sh if not x.startswith('#') and x.rstrip() ]

				    with open(args.jwt_file) as jwt_file:

				        init_lines += [

				            "set +x",

				            f'echo -n "{jwt_file.read()}" > "{args.jwt_file}"  # HIDEME',

				            "set -x",

				        ]

				    init_lines += [

				      'mkdir -p {}'.format(args.ci_project_dir),

				      'wget -S --progress=dot:giga -O- {} | tar -xz -C {}'.format(args.mesa_build_url, args.ci_project_dir),

				      'wget -S --progress=dot:giga -O- {} | tar -xz -C {}'.format(args.build_url, args.ci_project_dir),

				      'wget -S --progress=dot:giga -O- {} | tar -xz -C /'.format(args.job_rootfs_overlay_url),

				      'set +x',

				      'export CI_JOB_JWT="{}"'.format(args.jwt),

				      'set -x',

				      f'echo "export CI_JOB_JWT_FILE={args.jwt_file}" >> /set-job-env-vars.sh',

				      'exec /init-stage2.sh',

				    ]

				    test['definitions'][0]['repository']['run']['steps'] = init_lines

				@@ -192,7 +206,6 @@ def _call_proxy(fn, *args):

				                fatal_err("A protocol error occurred (Err {} {})".format(err.errcode, err.errmsg))

				            else:

				                time.sleep(15)

				                pass

				        except xmlrpc.client.Fault as err:

				            traceback.print_exc()

				            fatal_err("FATAL: Fault: {} (code: {})".format(err.faultString, err.faultCode))

				@@ -203,8 +216,8 @@ def get_job_results(proxy, job_id, test_suite, test_case):

				    results_yaml = _call_proxy(proxy.results.get_testjob_results_yaml, job_id)

				    results = yaml.load(results_yaml, Loader=loader(False))

				    for res in results:

				        metadata = res['metadata']

				        if not 'result' in metadata or metadata['result'] != 'fail':

				        metadata = res["metadata"]

				        if "result" not in metadata or metadata["result"] != "fail":

				            continue

				        if 'error_type' in metadata and metadata['error_type'] == "Infrastructure":

				            print_log("LAVA job {} failed with Infrastructure Error. Retry.".format(job_id))

				@@ -241,8 +254,7 @@ def follow_job_execution(proxy, job_id):

				    last_time_logs = datetime.now()

				    while not finished:

				        (finished, data) = _call_proxy(proxy.scheduler.jobs.logs, job_id, line_count)

				        logs = yaml.load(str(data), Loader=loader(False))

				        if logs:

				        if logs := yaml.load(str(data), Loader=loader(False)):

				            # Reset the timeout

				            last_time_logs = datetime.now()

				            for line in logs:

				@@ -251,7 +263,7 @@ def follow_job_execution(proxy, job_id):

				            line_count += len(logs)

				        else:

				            time_limit = timedelta(minutes=DEVICE_HANGING_TIMEOUT_MIN)

				            time_limit = timedelta(seconds=DEVICE_HANGING_TIMEOUT_SEC)

				            if datetime.now() - last_time_logs > time_limit:

				                print_log("LAVA job {} doesn't advance (machine got hung?). Retry.".format(job_id))

				                return False

				@@ -279,23 +291,7 @@ def submit_job(proxy, job_file):

				    return _call_proxy(proxy.scheduler.jobs.submit, job_file)

				def main(args):

				    proxy = setup_lava_proxy()

				    yaml_file = generate_lava_yaml(args)

				    if args.dump_yaml:

				        censored_args = args

				        censored_args.jwt = "jwt-hidden"

				        print(generate_lava_yaml(censored_args))

				    if args.validate_only:

				        ret = validate_job(proxy, yaml_file)

				        if not ret:

				            fatal_err("Error in LAVA job definition")

				        print("LAVA job definition validated successfully")

				        return

				def retriable_follow_job(proxy, yaml_file):

				    retry_count = NUMBER_OF_RETRIES_TIMEOUT_DETECTION

				    while retry_count >= 0:

				@@ -315,23 +311,46 @@ def main(args):

				        show_job_data(proxy, job_id)

				        if get_job_results(proxy,  job_id, "0_mesa", "mesa") == True:

				             break

				        if get_job_results(proxy, job_id, "0_mesa", "mesa") == True:

				            break

				    else:

				        # The script attempted all the retries. The job seemed to fail.

				        return False

				    return True

				if __name__ == '__main__':

				    # given that we proxy from DUT -> LAVA dispatcher -> LAVA primary -> us ->

				    # GitLab runner -> GitLab primary -> user, safe to say we don't need any

				    # more buffering

				    sys.stdout.reconfigure(line_buffering=True)

				    sys.stderr.reconfigure(line_buffering=True)

				def main(args):

				    proxy = setup_lava_proxy()

				    yaml_file = generate_lava_yaml(args)

				    if args.dump_yaml:

				        print(hide_sensitive_data(generate_lava_yaml(args)))

				    if args.validate_only:

				        ret = validate_job(proxy, yaml_file)

				        if not ret:

				            fatal_err("Error in LAVA job definition")

				        print("LAVA job definition validated successfully")

				        return

				    if not retriable_follow_job(proxy, yaml_file):

				        fatal_err(

				            "Job failed after it exceeded the number of"

				            f"{NUMBER_OF_RETRIES_TIMEOUT_DETECTION} retries."

				        )

				def create_parser():

				    parser = argparse.ArgumentParser("LAVA job submitter")

				    parser.add_argument("--pipeline-info")

				    parser.add_argument("--base-system-url-prefix")

				    parser.add_argument("--mesa-build-url")

				    parser.add_argument("--rootfs-url-prefix")

				    parser.add_argument("--kernel-url-prefix")

				    parser.add_argument("--build-url")

				    parser.add_argument("--job-rootfs-overlay-url")

				    parser.add_argument("--job-artifacts-base")

				    parser.add_argument("--job-timeout", type=int)

				    parser.add_argument("--first-stage-init")

				    parser.add_argument("--ci-project-dir")

				    parser.add_argument("--device-type")

				@@ -340,11 +359,22 @@ if __name__ == '__main__':

				    parser.add_argument("--kernel-image-type", nargs='?', default="")

				    parser.add_argument("--boot-method")

				    parser.add_argument("--lava-tags", nargs='?', default="")

				    parser.add_argument("--jwt")

				    parser.add_argument("--jwt-file", type=pathlib.Path)

				    parser.add_argument("--validate-only", action='store_true')

				    parser.add_argument("--dump-yaml", action='store_true')

				    parser.add_argument("--visibility-group")

				    return parser

				if __name__ == "__main__":

				    # given that we proxy from DUT -> LAVA dispatcher -> LAVA primary -> us ->

				    # GitLab runner -> GitLab primary -> user, safe to say we don't need any

				    # more buffering

				    sys.stdout.reconfigure(line_buffering=True)

				    sys.stderr.reconfigure(line_buffering=True)

				    parser = create_parser()

				    parser.set_defaults(func=main)

				    args = parser.parse_args()

				    args.func(args)

									
										1

.gitlab-ci/meson/build.sh
									
												View File
												
				@@ -68,7 +68,6 @@ meson _build --native-file=native.file \

				      -D cpp_args="$(echo -n $CPP_ARGS)" \

				      -D libunwind=${UNWIND} \

				      ${DRI_LOADERS} \

				      -D dri-drivers=${DRI_DRIVERS:-[]} \

				      ${GALLIUM_ST} \

				      -D gallium-drivers=${GALLIUM_DRIVERS:-[]} \

				      -D vulkan-drivers=${VULKAN_DRIVERS:-[]} \

6

.gitlab-ci/piglit/piglit-all-skips.txt

View File

@@ -1,6 +0,0 @@
 # WGL is Windows-only
 wgl@.*
 # These are sensitive to CPU timing, and would need to be run in isolation
 # on the system rather than in parallel with other tests.
 glx@glx_arb_sync_control@timing.*

									
										89

.gitlab-ci/piglit/piglit-runner.sh
									
												View File
												
				@@ -3,7 +3,7 @@

				set -ex

				if [ -z "$GPU_VERSION" ]; then

				   echo 'GPU_VERSION must be set to something like "llvmpipe" or "freedreno-a630" (the name used in your ci/piglit-gpu-version-*.txt)'

				   echo 'GPU_VERSION must be set to something like "llvmpipe" or "freedreno-a630" (the name used in your ci/gpu-version-*.txt)'

				   exit 1

				fi

				@@ -17,6 +17,31 @@ export VK_ICD_FILENAMES=`pwd`/install/share/vulkan/icd.d/"$VK_DRIVER"_icd.${VK_C

				RESULTS=`pwd`/${PIGLIT_RESULTS_DIR:-results}

				mkdir -p $RESULTS

				# Ensure Mesa Shader Cache resides on tmpfs.

				SHADER_CACHE_HOME=${XDG_CACHE_HOME:-${HOME}/.cache}

				SHADER_CACHE_DIR=${MESA_SHADER_CACHE_DIR:-${SHADER_CACHE_HOME}/mesa_shader_cache}

				findmnt -n tmpfs ${SHADER_CACHE_HOME} || findmnt -n tmpfs ${SHADER_CACHE_DIR} || {

				    mkdir -p ${SHADER_CACHE_DIR}

				    mount -t tmpfs -o nosuid,nodev,size=2G,mode=1755 tmpfs ${SHADER_CACHE_DIR}

				}

				if [ "$GALLIUM_DRIVER" = "virpipe" ]; then

				    # deqp is to use virpipe, and virgl_test_server llvmpipe

				    export GALLIUM_DRIVER="$GALLIUM_DRIVER"

				    VTEST_ARGS="--use-egl-surfaceless"

				    if [ "$VIRGL_HOST_API" = "GLES" ]; then

				        VTEST_ARGS="$VTEST_ARGS --use-gles"

				    fi

				    GALLIUM_DRIVER=llvmpipe \

				    GALLIVM_PERF="nopt" \

				    virgl_test_server $VTEST_ARGS >$RESULTS/vtest-log.txt 2>&1 &

				    sleep 1

				fi

				if [ -n "$PIGLIT_FRACTION" -o -n "$CI_NODE_INDEX" ]; then

				   FRACTION=`expr ${PIGLIT_FRACTION:-1} \* ${CI_NODE_TOTAL:-1}`

				PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --fraction $FRACTION"

				@@ -28,59 +53,45 @@ if [ -n "$CI_NODE_INDEX" ]; then

				   PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --fraction-start ${CI_NODE_INDEX}"

				fi

				if [ -e "$INSTALL/piglit-$GPU_VERSION-fails.txt" ]; then

				    PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --baseline $INSTALL/piglit-$GPU_VERSION-fails.txt"

				if [ -e "$INSTALL/$GPU_VERSION-fails.txt" ]; then

				    PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --baseline $INSTALL/$GPU_VERSION-fails.txt"

				fi

				# Default to an empty known flakes file if it doesn't exist.

				touch $INSTALL/piglit-$GPU_VERSION-flakes.txt

				touch $INSTALL/$GPU_VERSION-flakes.txt

				if [ -n "$VK_DRIVER" ] && [ -e "$INSTALL/piglit-$VK_DRIVER-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/piglit-$VK_DRIVER-skips.txt"

				if [ -n "$VK_DRIVER" ] && [ -e "$INSTALL/$VK_DRIVER-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/$VK_DRIVER-skips.txt"

				fi

				if [ -n "$GALLIUM_DRIVER" ] && [ -e "$INSTALL/piglit-$GALLIUM_DRIVER-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/piglit-$GALLIUM_DRIVER-skips.txt"

				if [ -n "$GALLIUM_DRIVER" ] && [ -e "$INSTALL/$GALLIUM_DRIVER-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/$GALLIUM_DRIVER-skips.txt"

				fi

				if [ -n "$DRIVER_NAME" ] && [ -e "$INSTALL/piglit-$DRIVER_NAME-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/piglit-$DRIVER_NAME-skips.txt"

				if [ -n "$DRIVER_NAME" ] && [ -e "$INSTALL/$DRIVER_NAME-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/$DRIVER_NAME-skips.txt"

				fi

				if [ -e "$INSTALL/piglit-$GPU_VERSION-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/piglit-$GPU_VERSION-skips.txt"

				if [ -e "$INSTALL/$GPU_VERSION-skips.txt" ]; then

				    PIGLIT_SKIPS="$PIGLIT_SKIPS $INSTALL/$GPU_VERSION-skips.txt"

				fi

				set +e

				if [ -n "$PIGLIT_PARALLEL" ]; then

				   PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --jobs $PIGLIT_PARALLEL"

				elif [ -n "$FDO_CI_CONCURRENT" ]; then

				   PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --jobs $FDO_CI_CONCURRENT"

				else

				   PIGLIT_RUNNER_OPTIONS="$PIGLIT_RUNNER_OPTIONS --jobs 4"

				fi

				RESULTS_CSV=$RESULTS/results.csv

				FAILURES_CSV=$RESULTS/failures.csv

				export LD_PRELOAD=$TEST_LD_PRELOAD

				    piglit-runner \

				        run \

				        --piglit-folder /piglit \

				        --output $RESULTS \

				        --skips $INSTALL/piglit/piglit-all-skips.txt $PIGLIT_SKIPS \

				        --flakes $INSTALL/piglit-$GPU_VERSION-flakes.txt \

				        --profile $PIGLIT_PROFILES \

				        --process-isolation \

					$PIGLIT_RUNNER_OPTIONS \

				        -v -v

				piglit-runner \

				    run \

				    --piglit-folder /piglit \

				    --output $RESULTS \

				    --jobs ${FDO_CI_CONCURRENT:-4} \

				    --skips $INSTALL/all-skips.txt $PIGLIT_SKIPS \

				    --flakes $INSTALL/$GPU_VERSION-flakes.txt \

				    --profile $PIGLIT_PROFILES \

				    --process-isolation \

				    $PIGLIT_RUNNER_OPTIONS \

				    -v -v

				PIGLIT_EXITCODE=$?

				export LD_PRELOAD=

				deqp-runner junit \

				   --testsuite $PIGLIT_PROFILES \

				   --results $RESULTS/failures.csv \

				@@ -93,8 +104,8 @@ if [ -n "$FLAKES_CHANNEL" ]; then

				  python3 $INSTALL/report-flakes.py \

				         --host irc.oftc.net \

				         --port 6667 \

				         --results $RESULTS_CSV \

				         --known-flakes $INSTALL/piglit-$GPU_VERSION-flakes.txt \

				         --results $RESULTS/results.csv \

				         --known-flakes $INSTALL/$GPU_VERSION-flakes.txt \

				         --channel "$FLAKES_CHANNEL" \

				         --runner "$CI_RUNNER_DESCRIPTION" \

				         --job "$CI_JOB_ID" \

									
										102

.gitlab-ci/piglit/run.sh → .gitlab-ci/piglit/piglit-traces.sh
									
												View File
												
				@@ -40,19 +40,17 @@ if [ "$VK_DRIVER" ]; then

				    # Set the Vulkan driver to use.

				    export VK_ICD_FILENAMES="$INSTALL/share/vulkan/icd.d/${VK_DRIVER}_icd.x86_64.json"

				    if [ "x$PIGLIT_PROFILES" = "xreplay" ]; then

				        # Set environment for Wine.

				        export WINEDEBUG="-all"

				        export WINEPREFIX="/dxvk-wine64"

				        export WINEESYNC=1

				    # Set environment for Wine.

				    export WINEDEBUG="-all"

				    export WINEPREFIX="/dxvk-wine64"

				    export WINEESYNC=1

				        # Set environment for DXVK.

				        export DXVK_LOG_LEVEL="none"

				        export DXVK_STATE_CACHE=0

				    # Set environment for DXVK.

				    export DXVK_LOG_LEVEL="none"

				    export DXVK_STATE_CACHE=0

				        # Set environment for gfxreconstruct executables.

				        export PATH="/gfxreconstruct/build/bin:$PATH"

				    fi

				    # Set environment for gfxreconstruct executables.

				    export PATH="/gfxreconstruct/build/bin:$PATH"

				    SANITY_MESA_VERSION_CMD="vulkaninfo"

				@@ -77,14 +75,12 @@ else

				    ### GL/ES ###

				    if [ "x$PIGLIT_PROFILES" = "xreplay" ]; then

				        # Set environment for apitrace executable.

				        export PATH="/apitrace/build:$PATH"

				    # Set environment for apitrace executable.

				    export PATH="/apitrace/build:$PATH"

				        # Our rootfs may not have "less", which apitrace uses during

				        # apitrace dump

				        export PAGER=cat

				    fi

				    # Our rootfs may not have "less", which apitrace uses during

				    # apitrace dump

				    export PAGER=cat

				    SANITY_MESA_VERSION_CMD="wflinfo"

				@@ -107,7 +103,6 @@ else

				            LD_LIBRARY_PATH="$__LD_LIBRARY_PATH" \

				            GALLIUM_DRIVER=llvmpipe \

				            GALLIVM_PERF="nopt" \

				            VTEST_USE_EGL_SURFACELESS=1 \

				            VTEST_USE_GLES=1 \

				            virgl_test_server >"$RESULTS"/vtest-log.txt 2>&1 &

				@@ -133,13 +128,6 @@ fi

				# If the job is parallel at the  gitlab job level, will take the corresponding

				# fraction of the caselist.

				if [ -n "$CI_NODE_INDEX" ]; then

				    if [ "$PIGLIT_PROFILES" != "${PIGLIT_PROFILES% *}" ]; then

				        FAILURE_MESSAGE=$(printf "%s" "Can't parallelize piglit with multiple profiles")

				        quiet print_red printf "%s\n" "$FAILURE_MESSAGE"

				        exit 1

				    fi

				    USE_CASELIST=1

				fi

				@@ -176,7 +164,7 @@ cd /piglit

				if [ -n "$USE_CASELIST" ]; then

				    PIGLIT_TESTS=$(printf "%s" "$PIGLIT_TESTS")

				    PIGLIT_GENTESTS="./piglit print-cmd $PIGLIT_TESTS $PIGLIT_PROFILES --format \"{name}\" > /tmp/case-list.txt"

				    PIGLIT_GENTESTS="./piglit print-cmd $PIGLIT_TESTS replay --format \"{name}\" > /tmp/case-list.txt"

				    RUN_GENTESTS="export LD_LIBRARY_PATH=$__LD_LIBRARY_PATH; $PIGLIT_GENTESTS"

				    eval $RUN_GENTESTS

				@@ -190,7 +178,7 @@ PIGLIT_OPTIONS=$(printf "%s" "$PIGLIT_OPTIONS")

				PIGLIT_TESTS=$(printf "%s" "$PIGLIT_TESTS")

				PIGLIT_CMD="./piglit run --timeout 300 -j${FDO_CI_CONCURRENT:-4} $PIGLIT_OPTIONS $PIGLIT_TESTS $PIGLIT_PROFILES "$(/usr/bin/printf "%q" "$RESULTS")

				PIGLIT_CMD="./piglit run --timeout 300 -j${FDO_CI_CONCURRENT:-4} $PIGLIT_OPTIONS $PIGLIT_TESTS replay "$(/usr/bin/printf "%q" "$RESULTS")

				RUN_CMD="export LD_LIBRARY_PATH=$__LD_LIBRARY_PATH; $SANITY_MESA_VERSION_CMD && $HANG_DETECTION_CMD $PIGLIT_CMD"

				@@ -198,12 +186,14 @@ if [ "$RUN_CMD_WRAPPER" ]; then

				    RUN_CMD="set +e; $RUN_CMD_WRAPPER "$(/usr/bin/printf "%q" "$RUN_CMD")"; set -e"

				fi

				FAILURE_MESSAGE=$(printf "%s" "Unexpected change in results:")

				ci-fairy minio login $MINIO_ARGS --token-file "${CI_JOB_JWT_FILE}"

				if [ "x$PIGLIT_PROFILES" = "xreplay" ] \

				       && [ ${PIGLIT_REPLAY_UPLOAD_TO_MINIO:-0} -eq 1 ]; then

				    ci-fairy minio login $MINIO_ARGS $CI_JOB_JWT

				fi

				# The replayer doesn't do any size or checksum verification for the traces in

				# the replayer db, so if we had to restart the system due to intermittent device

				# errors (or tried to cache replayer-db between runs, which would be nice to

				# have), you could get a corrupted local trace that would spuriously fail the

				# run.

				rm -rf replayer-db

				eval $RUN_CMD

				@@ -213,12 +203,9 @@ fi

				ARTIFACTS_BASE_URL="https://${CI_PROJECT_ROOT_NAMESPACE}.${CI_PAGES_DOMAIN}/-/${CI_PROJECT_NAME}/-/jobs/${CI_JOB_ID}/artifacts"

				if [ ${PIGLIT_JUNIT_RESULTS:-0} -eq 1 ]; then

				    ./piglit summary aggregate "$RESULTS" -o junit.xml

				    FAILURE_MESSAGE=$(printf "${FAILURE_MESSAGE}\n%s" "Check the JUnit report for failures at: ${ARTIFACTS_BASE_URL}/results/junit.xml")

				fi

				./piglit summary aggregate "$RESULTS" -o junit.xml

				PIGLIT_RESULTS="${PIGLIT_RESULTS:-$PIGLIT_PROFILES}"

				PIGLIT_RESULTS="${PIGLIT_RESULTS:-replay}"

				RESULTSFILE="$RESULTS/$PIGLIT_RESULTS.txt"

				mkdir -p .gitlab-ci/piglit

				./piglit summary console "$RESULTS"/results.json.bz2 \

				@@ -227,49 +214,28 @@ mkdir -p .gitlab-ci/piglit

				    | sed '/^summary:/Q' \

				    > $RESULTSFILE

				if [ "x$PIGLIT_PROFILES" = "xreplay" ] \

				       && [ ${PIGLIT_REPLAY_UPLOAD_TO_MINIO:-0} -eq 1 ]; then

				__PREFIX="trace/$PIGLIT_REPLAY_DEVICE_NAME"

				__MINIO_PATH="$PIGLIT_REPLAY_ARTIFACTS_BASE_URL"

				__MINIO_TRACES_PREFIX="traces"

				    __PREFIX="trace/$PIGLIT_REPLAY_DEVICE_NAME"

				    __MINIO_PATH="$PIGLIT_REPLAY_ARTIFACTS_BASE_URL"

				    __MINIO_TRACES_PREFIX="traces"

				    if [ "x$PIGLIT_REPLAY_SUBCOMMAND" != "xprofile" ]; then

				        quiet replay_minio_upload_images

				    fi

				if [ "x$PIGLIT_REPLAY_SUBCOMMAND" != "xprofile" ]; then

				    quiet replay_minio_upload_images

				fi

				if [ -n "$USE_CASELIST" ]; then

				    # Just filter the expected results based on the tests that were actually

				    # executed, and switch to the version with no summary

				    cat ".gitlab-ci/piglit/$PIGLIT_RESULTS.txt.orig" | sed '/^summary:/Q' | rev \

				        | cut -f2- -d: | rev | sed "s/$/:/g" > /tmp/executed.txt

				    grep -F -f /tmp/executed.txt "$INSTALL/$PIGLIT_RESULTS.txt" \

				       > ".gitlab-ci/piglit/$PIGLIT_RESULTS.txt.baseline" || true

				elif [ -f "$INSTALL/$PIGLIT_RESULTS.txt" ]; then

				    cp "$INSTALL/$PIGLIT_RESULTS.txt" \

				       ".gitlab-ci/piglit/$PIGLIT_RESULTS.txt.baseline"

				else

				    touch ".gitlab-ci/piglit/$PIGLIT_RESULTS.txt.baseline"

				fi

				if diff -q ".gitlab-ci/piglit/$PIGLIT_RESULTS.txt.baseline" $RESULTSFILE; then

				if [ ! -s $RESULTSFILE ]; then

				    exit 0

				fi

				./piglit summary html --exclude-details=pass \

				"$RESULTS"/summary "$RESULTS"/results.json.bz2

				if [ "x$PIGLIT_PROFILES" = "xreplay" ]; then

				find "$RESULTS"/summary -type f -name "*.html" -print0 \

				        | xargs -0 sed -i 's%<img src="file://'"${RESULTS}"'.*-\([0-9a-f]*\)\.png%<img src="https://'"${JOB_ARTIFACTS_BASE}"'/traces/\1.png%g'

				find "$RESULTS"/summary -type f -name "*.html" -print0 \

				        | xargs -0 sed -i 's%<img src="file://%<img src="https://'"${PIGLIT_REPLAY_REFERENCE_IMAGES_BASE}"'/%g'

				fi

				FAILURE_MESSAGE=$(printf "${FAILURE_MESSAGE}\n%s" "Check the HTML summary for problems at: ${ARTIFACTS_BASE_URL}/results/summary/problems.html")

				quiet print_red printf "%s\n" "$FAILURE_MESSAGE"

				quiet diff --color=always -u ".gitlab-ci/piglit/$PIGLIT_RESULTS.txt.baseline" $RESULTSFILE

				quiet print_red echo "Failures in traces:"

				cat $RESULTSFILE

				quiet print_red echo "Review the image changes and get the new checksums at: ${ARTIFACTS_BASE_URL}/results/summary/problems.html"

				exit 1

									
										8

.gitlab-ci/prepare-artifacts.sh
									
												View File
												
				@@ -31,12 +31,11 @@ cp -Rp .gitlab-ci/piglit install/

				cp -Rp .gitlab-ci/fossils.yml install/

				cp -Rp .gitlab-ci/fossils install/

				cp -Rp .gitlab-ci/fossilize-runner.sh install/

				cp -Rp .gitlab-ci/deqp-runner.sh install/

				cp -Rp .gitlab-ci/crosvm-runner.sh install/

				cp -Rp .gitlab-ci/crosvm-init.sh install/

				cp -Rp .gitlab-ci/deqp-*.txt install/

				cp -Rp .gitlab-ci/*.txt install/

				cp -Rp .gitlab-ci/report-flakes.py install/

				cp -Rp .gitlab-ci/vkd3d-proton install/

				cp -Rp .gitlab-ci/*-runner.sh install/

				find . -path \*/ci/\*.txt \

				    -o -path \*/ci/\*.toml \

				    -o -path \*/ci/\*traces\*.yml \

				@@ -48,11 +47,12 @@ mkdir -p artifacts/

				tar -cf artifacts/install.tar install

				cp -Rp .gitlab-ci/common artifacts/ci-common

				cp -Rp .gitlab-ci/lava artifacts/

				cp -Rp .gitlab-ci/valve artifacts/

				if [ -n "$MINIO_ARTIFACT_NAME" ]; then

				    # Pass needed files to the test stage

				    MINIO_ARTIFACT_NAME="$MINIO_ARTIFACT_NAME.tar.gz"

				    gzip -c artifacts/install.tar > ${MINIO_ARTIFACT_NAME}

				    ci-fairy minio login $CI_JOB_JWT

				    ci-fairy minio login --token-file "${CI_JOB_JWT_FILE}"

				    ci-fairy minio cp ${MINIO_ARTIFACT_NAME} minio://${PIPELINE_ARTIFACTS_BASE}/${MINIO_ARTIFACT_NAME}

				fi

									
										153

.gitlab-ci/skqp-runner.sh
									
										Executable file
									
												View File
												
				@@ -0,0 +1,153 @@

				#!/bin/sh

				#

				# Copyright (C) 2022 Collabora Limited

				# Author: Guilherme Gallo <guilherme.gallo@collabora.com>

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,

				# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE

				# SOFTWARE.

				copy_tests_files() (

				    # Copy either unit test or render test files from a specific driver given by

				    # GPU VERSION variable.

				    # If there is no test file at the expected location, this function will

				    # return error_code 1

				    SKQP_BACKEND="${1}"

				    SKQP_FILE_PREFIX="${INSTALL}/${GPU_VERSION}-skqp"

				    if echo "${SKQP_BACKEND}" | grep -qE 'vk|gl(es)?'

				    then

				        SKQP_RENDER_TESTS_FILE="${SKQP_FILE_PREFIX}-${SKQP_BACKEND}_rendertests.txt"

				        [ -f "${SKQP_RENDER_TESTS_FILE}" ] || return 1

				        cp "${SKQP_RENDER_TESTS_FILE}" "${SKQP_ASSETS_DIR}"/skqp/rendertests.txt

				        return 0

				    fi

				    # The unittests.txt path is hardcoded inside assets directory,

				    # that is why it needs to be a special case.

				    if echo "${SKQP_BACKEND}" | grep -qE "unitTest"

				    then

				        SKQP_UNIT_TESTS_FILE="${SKQP_FILE_PREFIX}_unittests.txt"

				        [ -f "${SKQP_UNIT_TESTS_FILE}" ] || return 1

				        cp "${SKQP_UNIT_TESTS_FILE}" "${SKQP_ASSETS_DIR}"/skqp/unittests.txt

				    fi

				)

				test_vk_backend() {

				    if echo "${SKQP_BACKENDS}" | grep -qE 'vk'

				    then

				        if [ -n "$VK_DRIVER" ]; then

				            return 0

				        fi

				        echo "VK_DRIVER environment variable is missing."

				        VK_DRIVERS=$(ls "$INSTALL"/share/vulkan/icd.d/ | cut -f 1 -d '_')

				        if [ -n "${VK_DRIVERS}" ]

				        then

				            echo "Please set VK_DRIVER to the correct driver from the list:"

				            echo "${VK_DRIVERS}"

				        fi

				        echo "No Vulkan tests will be executed, but it was requested in SKQP_BACKENDS variable. Exiting."

				        exit 2

				    fi

				    # Vulkan environment is not configured, but it was not requested by the job

				    return 1

				}

				setup_backends() {

				    if test_vk_backend

				    then

				        export VK_ICD_FILENAMES="$INSTALL"/share/vulkan/icd.d/"$VK_DRIVER"_icd."${VK_CPU:-$(uname -m)}".json

				    fi

				}

				set -ex

				# Needed so configuration files can contain paths to files in /install

				ln -sf "$CI_PROJECT_DIR"/install /install

				INSTALL=${PWD}/install

				if [ -z "$GPU_VERSION" ]; then

				    echo 'GPU_VERSION must be set to something like "llvmpipe" or

				"freedreno-a630" (it will serve as a component to find the path for files

				residing in src/**/ci/*.txt)'

				    exit 1

				fi

				LD_LIBRARY_PATH=$INSTALL:$LD_LIBRARY_PATH

				setup_backends

				SKQP_ASSETS_DIR=/skqp/assets

				SKQP_RESULTS_DIR="${SKQP_RESULTS_DIR:-$PWD/results}"

				mkdir -p "${SKQP_ASSETS_DIR}"/skqp

				SKQP_EXITCODE=0

				for SKQP_BACKEND in ${SKQP_BACKENDS}

				do

				    set -e

				    if !  copy_tests_files "${SKQP_BACKEND}"

				    then

				        echo "No override test file found for ${SKQP_BACKEND}. Using the default one."

				    fi

				    set +e

				    SKQP_BACKEND_RESULTS_DIR="${SKQP_RESULTS_DIR}"/"${SKQP_BACKEND}"

				    mkdir -p "${SKQP_BACKEND_RESULTS_DIR}"

				    /skqp/skqp "${SKQP_ASSETS_DIR}" "${SKQP_BACKEND_RESULTS_DIR}" "${SKQP_BACKEND}_"

				    BACKEND_EXITCODE=$?

				    if [ ! $BACKEND_EXITCODE -eq 0 ]

				    then

				        echo "skqp failed on ${SKQP_BACKEND} tests with ${BACKEND_EXITCODE} exit code."

				    fi

				    # Propagate error codes to leverage the final job result

				    SKQP_EXITCODE=$(( SKQP_EXITCODE | BACKEND_EXITCODE ))

				done

				set +x

				# Unit tests produce empty HTML reports, guide the user to check the TXT file.

				if echo "${SKQP_BACKENDS}" | grep -qE "unitTest"

				then

				    # Remove the empty HTML report to avoid confusion

				    rm -f "${SKQP_RESULTS_DIR}"/unitTest/report.html

				    echo "See skqp unit test results at:"

				    echo "https://$CI_PROJECT_ROOT_NAMESPACE.pages.freedesktop.org/-/$CI_PROJECT_NAME/-/jobs/$CI_JOB_ID/artifacts/${SKQP_RESULTS_DIR}/unitTest/unit_tests.txt"

				fi

				REPORT_FILES=$(mktemp)

				find "${SKQP_RESULTS_DIR}"/**/report.html -type f > "${REPORT_FILES}"

				while read -r REPORT

				do

				    BACKEND_NAME=$(echo "${REPORT}" | sed  's@.*/\([^/]*\)/report.html@\1@')

				    echo "See skqp ${BACKEND_NAME} render tests report at:"

				    echo "https://$CI_PROJECT_ROOT_NAMESPACE.pages.freedesktop.org/-/$CI_PROJECT_NAME/-/jobs/$CI_JOB_ID/artifacts/${REPORT}"

				done < "${REPORT_FILES}"

				# If there is no report available, tell the user that something is wrong.

				if [ ! -s "${REPORT_FILES}" ]

				then

				    echo "No skqp report available. Probably some fatal error has occured during the skqp execution."

				fi

				exit $SKQP_EXITCODE

									
										119

.gitlab-ci/test-source-dep.yml
									
												View File
												
				@@ -18,6 +18,7 @@

				      - .gitlab-ci/**/*

				      - include/**/*

				      - meson.build

				      - .gitattributes

				      - src/*

				      - src/compiler/**/*

				      - src/drm-shim/**/*

				@@ -30,10 +31,6 @@

				      - src/loader/**/*

				      - src/mapi/**/*

				      - src/mesa/*

				      - src/mesa/drivers/*

				      - src/mesa/drivers/common/**/*

				      - src/mesa/drivers/dri/*

				      - src/mesa/drivers/dri/common/**/*

				      - src/mesa/main/**/*

				      - src/mesa/math/**/*

				      - src/mesa/program/**/*

				@@ -41,11 +38,10 @@

				      - src/mesa/state_tracker/**/*

				      - src/mesa/swrast/**/*

				      - src/mesa/swrast_setup/**/*

				      - src/mesa/tnl/**/*

				      - src/mesa/tnl_dd/**/*

				      - src/mesa/vbo/**/*

				      - src/mesa/x86/**/*

				      - src/mesa/x86-64/**/*

				      - src/tool/**/*

				      - src/util/**/*

				.vulkan-rules:

				@@ -132,6 +128,7 @@

				      - .gitlab-ci.yml

				      - .gitlab-ci/**/*

				      - meson.build

				      - .gitattributes

				      - include/**/*

				      - src/compiler/**/*

				      - src/include/**/*

				@@ -153,6 +150,8 @@

				  rules:

				    - if: '$FD_FARM == "offline"'

				      when: never

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - *ignore_scheduled_pipelines

				    - changes:

				        *mesa_core_file_list

				@@ -180,9 +179,11 @@

				  rules:

				    - if: '$FD_FARM == "offline"'

				      when: never

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    # If the triggerer has access to the restricted traces and if it is pre-merge

				    - if: '($GITLAB_USER_LOGIN !~ "/^(robclark|anholt|flto|cwabbott0|Danil|tomeu)$/") &&

				           ($GITLAB_USER_LOGIN != "marge-bot" || $CI_MERGE_REQUEST_SOURCE_BRANCH_NAME != $CI_COMMIT_REF_NAME)'

				           ($GITLAB_USER_LOGIN != "marge-bot" || $CI_COMMIT_BRANCH)'

				      when: never

				    - *ignore_scheduled_pipelines

				    - changes:

				@@ -206,9 +207,11 @@

				  rules:

				    - if: '$FD_FARM == "offline"'

				      when: never

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - *ignore_scheduled_pipelines

				    # Run only on pre-merge pipelines from Marge

				    - if: '$GITLAB_USER_LOGIN != "marge-bot" || $CI_MERGE_REQUEST_SOURCE_BRANCH_NAME != $CI_COMMIT_REF_NAME'

				    - if: '$GITLAB_USER_LOGIN != "marge-bot" || $CI_COMMIT_BRANCH'

				      when: never

				    - changes:

				        *mesa_core_file_list

				@@ -224,10 +227,30 @@

				      when: manual

				    - when: never

				.nouveau-rules:

				  stage: nouveau

				  rules:

				    - *ignore_scheduled_pipelines

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				    - changes:

				        *gallium_core_file_list

				      when: on_success

				    - changes:

				      - src/nouveau/**/*

				      - src/gallium/drivers/nouveau/**/*

				      - src/gallium/winsys/kmsro/**/*

				      - src/gallium/winsys/nouveau/**/*

				      when: on_success

				    - when: never

				.panfrost-midgard-rules:

				  stage: arm

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				@@ -254,6 +277,8 @@

				  stage: arm

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				@@ -349,6 +374,8 @@

				  stage: amd

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				@@ -377,10 +404,37 @@

				      when: on_success

				    - when: never

				# Unfortunately YAML doesn't let us concatenate arrays, so we have to do the

				# rules duplication manually

				.virgl-lava-rules-performance:

				  stage: layered-backends

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    # Run only on pre-merge pipelines from Marge

				    - if: '$GITLAB_USER_LOGIN != "marge-bot" || $CI_COMMIT_BRANCH'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: manual

				    - changes:

				        *gallium_core_file_list

				      when: manual

				    - changes:

				        *llvmpipe_file_list

				      when: manual

				    - changes:

				        *virgl_file_list

				      when: manual

				    - when: never

				.radeonsi-rules:

				  stage: amd

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				@@ -398,6 +452,27 @@

				      when: on_success

				    - when: never

				.radeonsi-vaapi-rules:

				  stage: amd

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				    - changes:

				        *gallium_core_file_list

				      when: on_success

				    - changes:

				        *radeonsi_file_list

				      when: on_success

				    - changes: &radeon_vcn_file_list

				      - src/gallium/frontends/va/**/*

				      - src/gallium/drivers/radeon/**/*

				      when: on_success

				    - when: never

				.i915g-rules:

				  stage: intel

				  rules:

				@@ -415,10 +490,29 @@

				      when: on_success

				    - when: never

				.crocus-rules:

				  stage: intel

				  rules:

				    - *ignore_scheduled_pipelines

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				    - changes:

				        *gallium_core_file_list

				      when: on_success

				    - changes:

				      - src/gallium/drivers/crocus/**/*

				      - src/gallium/winsys/crocus/**/*

				      - src/intel/**/*

				      when: on_success

				    - when: never

				.iris-rules:

				  stage: intel

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				@@ -438,8 +532,10 @@

				  stage: intel

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    # Run only on pre-merge pipelines from Marge

				    - if: '$GITLAB_USER_LOGIN != "marge-bot" || $CI_MERGE_REQUEST_SOURCE_BRANCH_NAME != $CI_COMMIT_REF_NAME'

				    - if: '$GITLAB_USER_LOGIN != "marge-bot" || $CI_COMMIT_BRANCH'

				      when: never

				    - changes:

				        *mesa_core_file_list

				@@ -456,6 +552,8 @@

				  stage: intel

				  rules:

				    - *ignore_scheduled_pipelines

				    - if: '$COLLABORA_FARM == "offline" && $RUNNER_TAG =~ /^mesa-ci-x86-64-lava-/'

				      when: never

				    - changes:

				        *mesa_core_file_list

				      when: on_success

				@@ -496,6 +594,9 @@

				    - changes:

				        *gallium_core_file_list

				      when: on_success

				    - changes:

				        *softpipe_file_list

				      when: on_success

				    - changes:

				        *lavapipe_file_list

				      when: on_success

									
										315

.gitlab-ci/test/gitlab-ci.yml
									
										Normal file
									
												View File
												
				@@ -0,0 +1,315 @@

				.test:

				  extends:

				    - .ci-run-policy

				  # Cancel job if a newer commit is pushed to the same branch

				  interruptible: true

				  variables:

				    GIT_STRATEGY: none # testing doesn't build anything from source

				  before_script:

				    - !reference [default, before_script]

				    # Note: Build dir (and thus install) may be dirty due to GIT_STRATEGY

				    - rm -rf install

				    - tar -xf artifacts/install.tar

				    - echo -e "\e[0Ksection_start:$(date +%s):ldd_section[collapsed=true]\r\e[0KChecking ldd on driver build"

				    - LD_LIBRARY_PATH=install/lib find install/lib -name "*.so" -print -exec ldd {} \;

				    - echo -e "\e[0Ksection_end:$(date +%s):ldd_section\r\e[0K"

				  artifacts:

				    when: always

				    name: "mesa_${CI_JOB_NAME}"

				    paths:

				      - results/

				.test-gl:

				  extends:

				    - .test

				    - .use-debian/x86_test-gl

				  needs:

				    - debian/x86_test-gl

				    - debian-testing

				.test-vk:

				  extends:

				    - .test

				    - .use-debian/x86_test-vk

				  needs:

				    - debian-testing

				    - debian/x86_test-vk

				.test-cl:

				  extends:

				    - .test

				    - .use-debian/x86_test-gl

				  needs:

				    - debian/x86_test-gl

				    - debian-clover-testing

				.vkd3d-proton-test:

				  artifacts:

				    when: on_failure

				    name: "mesa_${CI_JOB_NAME}"

				    paths:

				      - results/vkd3d-proton.log

				  script:

				    - ./install/vkd3d-proton/run.sh

				.piglit-test:

				  artifacts:

				    name: "mesa_${CI_JOB_NAME}"

				    paths:

				      - results

				    reports:

				      junit: results/junit.xml

				  variables:

				    PIGLIT_NO_WINDOW: 1

				    HWCI_TEST_SCRIPT: "/install/piglit/piglit-runner.sh"

				  script:

				    - install/piglit/piglit-runner.sh

				.piglit-traces-test:

				  extends:

				    - .piglit-test

				  cache:

				    key: ${CI_JOB_NAME}

				    paths:

				      - replayer-db/

				  artifacts:

				    when: on_failure

				    name: "mesa_${CI_JOB_NAME}"

				    reports:

				      junit: results/junit.xml

				    paths:

				      - results/summary/

				      - results/*.txt

				  variables:

				    PIGLIT_REPLAY_EXTRA_ARGS:  --keep-image --db-path ${CI_PROJECT_DIR}/replayer-db/ --minio_host=minio-packet.freedesktop.org --minio_bucket=mesa-tracie-public --role-session-name=${CI_PROJECT_PATH}:${CI_JOB_ID} --jwt-file=${CI_JOB_JWT_FILE}

				  script:

				    - install/piglit/piglit-traces.sh

				.deqp-test:

				  script:

				    - ./install/deqp-runner.sh

				  artifacts:

				    exclude:

				      - results/*.shader_cache

				    reports:

				      junit: results/junit.xml

				.deqp-test-vk:

				  extends:

				    - .deqp-test

				  variables:

				    DEQP_VER: vk

				.fossilize-test:

				  script:

				    - ./install/fossilize-runner.sh

				  artifacts:

				    when: on_failure

				    name: "mesa_${CI_JOB_NAME}"

				    paths:

				      - results/

				.baremetal-test:

				  extends:

				    - .ci-run-policy

				    - .test

				  # Cancel job if a newer commit is pushed to the same branch

				  interruptible: true

				  stage: test

				  before_script:

				    - !reference [default, before_script]

				    # Use this instead of gitlab's artifacts download because it hits packet.net

				    # instead of fd.o.  Set FDO_HTTP_CACHE_URI to an http cache for your test lab to

				    # improve it even more (see https://docs.mesa3d.org/ci/bare-metal.html for

				    # setup).

				    - wget ${FDO_HTTP_CACHE_URI:-}https://${PIPELINE_ARTIFACTS_BASE}/${MINIO_ARTIFACT_NAME}.tar.gz -S --progress=dot:giga -O- | tar -xz

				  artifacts:

				    when: always

				    name: "mesa_${CI_JOB_NAME}"

				    paths:

				      - results/

				      - serial*.txt

				    exclude:

				      - results/*.shader_cache

				    reports:

				      junit: results/junit.xml

				.baremetal-test-armhf:

				  extends:

				    - .baremetal-test

				  variables:

				    BM_ROOTFS: /rootfs-armhf

				    MINIO_ARTIFACT_NAME: mesa-armhf

				.baremetal-test-arm64:

				  extends:

				    - .baremetal-test

				  variables:

				    BM_ROOTFS: /rootfs-arm64

				    MINIO_ARTIFACT_NAME: mesa-arm64

				.baremetal-arm64-asan-test:

				  variables:

				    DEQP_RUNNER_OPTIONS: "--env LD_PRELOAD=libasan.so.6:/install/lib/libdlclose-skip.so"

				    MINIO_ARTIFACT_NAME: mesa-arm64-asan

				  needs:

				    - debian/arm_test

				    - job: debian-arm64-asan

				      artifacts: false

				.baremetal-deqp-test:

				  variables:

				    HWCI_TEST_SCRIPT: "/install/deqp-runner.sh"

				    FDO_CI_CONCURRENT: 0 # Default to number of CPUs

				.baremetal-skqp-test:

				  variables:

				    HWCI_START_XORG: 1

				    HWCI_TEST_SCRIPT: "/install/skqp-runner.sh"

				# For Valve's bare-metal testing farm jobs.

				.b2c-test:

				  # It would be nice to use ci-templates within Mesa CI for this job's

				  # image:, but the integration is not possible for the current

				  # use-case. Within this job, two containers are managed. 1) the

				  # gitlab runner container from which the job is submitted to the

				  # DUT, and 2) the test container (e.g. debian/x86_test-vk) within

				  # which the test cases will run on the DUT. Since ci-templates and

				  # the associated image setting macros in this file rely on variables

				  # like FDO_DISTRIBUTION_TAG for *the* image, there is no way to

				  # depend on more than one image per job. So, the job container is

				  # built as part of the CI in the boot2container project.

				  image: registry.freedesktop.org/mupuf/valve-infra/mesa-trigger:2022-03-03.2

				  extends:

				    # Only pull in what is needed to build up the MESA_IMAGE (which is

				    # called for clarity IMAGE_UNDER_TEST). This is in distinction to

				    # the image within which the job runs on the runner machines. The

				    # IMAGE_UNDER_TEST is deployed to the DUTs.

				    - .incorporate-base-tag+templates-commit

				  variables:

				    # No need by default to pull the whole repo

				    GIT_STRATEGY: none

				    # boot2container initrd configuration parameters.

				    B2C_KERNEL_URL: 'https://gitlab.freedesktop.org/mupuf/valve-infra/-/package_files/117/download'  # 5.16-for-mesa-ci

				    B2C_INITRAMFS_URL: 'https://gitlab.freedesktop.org/mupuf/boot2container/-/releases/v0.9.4/downloads/initramfs.linux_amd64.cpio.xz'

				    B2C_JOB_SUCCESS_REGEX: '\[.*\]: Execution is over, pipeline status: 0\r$'

				    B2C_JOB_WARN_REGEX: 'null'

				    B2C_LOG_LEVEL: 6

				    B2C_POWEROFF_DELAY: 15

				    B2C_SESSION_END_REGEX: '^.*It''s now safe to turn off your computer\r$'

				    B2C_SESSION_REBOOT_REGEX: 'GPU hang detected!'

				    B2C_TIMEOUT_BOOT_MINUTES: 240

				    B2C_TIMEOUT_BOOT_RETRIES: 2

				    B2C_TIMEOUT_FIRST_MINUTES: 5

				    B2C_TIMEOUT_FIRST_RETRIES: 3

				    B2C_TIMEOUT_MINUTES: 2

				    B2C_TIMEOUT_OVERALL_MINUTES: 240

				    B2C_TIMEOUT_RETRIES: 0

				    MESA_IMAGE_PATH: "debian/x86_test-vk"

				    IMAGE_UNDER_TEST: "$CI_REGISTRY_IMAGE/${MESA_IMAGE_PATH}:${FDO_DISTRIBUTION_TAG}"

				    INSTALL_TARBALL: "./artifacts/install.tar"

				    CI_VALVE_ARTIFACTS: "./artifacts/valve"

				    CI_COMMON_SCRIPTS: "./artifacts/ci-common"

				    GENERATE_ENV_SCRIPT: "${CI_COMMON_SCRIPTS}/generate-env.sh"

				    B2C_JOB_TEMPLATE: "${CI_VALVE_ARTIFACTS}/b2c.yml.jinja2.jinja2"

				    JOB_FOLDER: "job_folder"

				  before_script:

				    # We don't want the tarball unpacking of .test, but will take the JWT bits.

				    - !reference [default, before_script]

				    - |

				      set -x

				      # Useful as a hook point for runner admins. You may edit the

				      # config.toml for the Gitlab runner and use a bind-mount to

				      # populate the hook script with some executable commands. This

				      # allows quicker feedback than resubmitting pipelines and

				      # potentially having to wait for a debug build of Mesa to

				      # complete.

				      if [ -x /runner-before-script.sh ]; then

				         echo "Executing runner before-script hook..."

				         sh /runner-before-script.sh

				         if [ $? -ne 0 ]; then

				            echo "Runner hook failed, goodbye"

				            exit $?

				         fi

				      fi

				      [ -s "$INSTALL_TARBALL" ] || exit 1

				      [ -d "$CI_VALVE_ARTIFACTS" ] || exit 1

				      [ -d "$CI_COMMON_SCRIPTS" ] || exit 1

				      B2C_TEST_SCRIPT="bash -c 'source ./set-job-env-vars.sh ; ${B2C_TEST_SCRIPT}'"

				      # The Valve CI gateway receives jobs in a YAML format. Create a

				      # job description from the CI environment.

				      python3 "$CI_VALVE_ARTIFACTS"/generate_b2c.py \

				        --ci-job-id "${CI_JOB_ID}" \

				        --container-cmd "${B2C_TEST_SCRIPT}" \

				        --initramfs-url "${B2C_INITRAMFS_URL}" \

				        --job-success-regex "${B2C_JOB_SUCCESS_REGEX}" \

				        --job-warn-regex "${B2C_JOB_WARN_REGEX}" \

				        --kernel-url "${B2C_KERNEL_URL}" \

				        --log-level "${B2C_LOG_LEVEL}" \

				        --poweroff-delay "${B2C_POWEROFF_DELAY}" \

				        --session-end-regex "${B2C_SESSION_END_REGEX}" \

				        --session-reboot-regex "${B2C_SESSION_REBOOT_REGEX}" \

				        --tags "${CI_RUNNER_TAGS}" \

				        --template "${B2C_JOB_TEMPLATE}" \

				        --timeout-boot-minutes "${B2C_TIMEOUT_BOOT_MINUTES}" \

				        --timeout-boot-retries "${B2C_TIMEOUT_BOOT_RETRIES}" \

				        --timeout-first-minutes "${B2C_TIMEOUT_FIRST_MINUTES}" \

				        --timeout-first-retries "${B2C_TIMEOUT_FIRST_RETRIES}" \

				        --timeout-minutes "${B2C_TIMEOUT_MINUTES}" \

				        --timeout-overall-minutes "${B2C_TIMEOUT_OVERALL_MINUTES}" \

				        --timeout-retries "${B2C_TIMEOUT_RETRIES}" \

				        --job-volume-exclusions "${B2C_JOB_VOLUME_EXCLUSIONS}" \

				        --local-container "${IMAGE_UNDER_TEST}" \

				        ${B2C_EXTRA_VOLUME_ARGS} \

				        --working-dir "$CI_PROJECT_DIR"

				      cat b2c.yml.jinja2

				      rm -rf ${JOB_FOLDER} || true

				      mkdir -v ${JOB_FOLDER}

				      # Create a script to regenerate the CI environment when this job

				      # begins running on the remote DUT.

				      set +x

				      "$CI_COMMON_SCRIPTS"/generate-env.sh > ${JOB_FOLDER}/set-job-env-vars.sh

				      chmod +x ${JOB_FOLDER}/set-job-env-vars.sh

				      echo "Variables passed through:"

				      cat ${JOB_FOLDER}/set-job-env-vars.sh

				      echo "export CI_JOB_JWT=${CI_JOB_JWT}" >> ${JOB_FOLDER}/set-job-env-vars.sh

				      set -x

				      # Extract the Mesa distribution into the location expected by

				      # the Mesa CI deqp-runner scripts.

				      tar x -C ${JOB_FOLDER} -f $INSTALL_TARBALL

				  script: |

				      slugify () {

				          echo "$1" | sed -r s/[~\^]+//g | sed -r s/[^a-zA-Z0-9]+/-/g | sed -r s/^-+\|-+$//g | tr A-Z a-z

				      }

				      # Submit the job to Valve's CI gateway service with the CI

				      # provisioned job_folder.

				      env PYTHONUNBUFFERED=1 executorctl \

				          run -w b2c.yml.jinja2 -j $(slugify "$CI_JOB_NAME") -s ${JOB_FOLDER}

				      ls -l

				      # Anything our job places in results/ will be collected by the

				      # Gitlab coordinator for status presentation. results/junit.xml

				      # will be parsed by the UI for more detailed explanations of

				      # test execution.

				  needs:

				    - debian/x86_test-vk

				    - debian-testing

				  artifacts:

				    when: always

				    name: "mesa_${CI_JOB_NAME}"

				    paths:

				      - ${JOB_FOLDER}/results

				    reports:

				      junit: ${JOB_FOLDER}/results/junit.xml

0

src/amd/ci/deqp-radv-bonaire-aco-fails.txt → .gitlab-ci/tests/init.py

View File

									
										250

.gitlab-ci/tests/test_lava_job_submitter.py
									
										Normal file
									
												View File
												
				@@ -0,0 +1,250 @@

				#!/usr/bin/env python3

				#

				# Copyright (C) 2022 Collabora Limited

				# Author: Guilherme Gallo <guilherme.gallo@collabora.com>

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,

				# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE

				# SOFTWARE.

				import xmlrpc.client

				from contextlib import nullcontext as does_not_raise

				from datetime import datetime

				from itertools import repeat

				from typing import Tuple

				from unittest.mock import MagicMock, patch

				import pytest

				import yaml

				from freezegun import freeze_time

				from lava.lava_job_submitter import (

				    DEVICE_HANGING_TIMEOUT_SEC,

				    follow_job_execution,

				    hide_sensitive_data,

				    retriable_follow_job,

				)

				def jobs_logs_response(finished=False, msg=None) -> Tuple[bool, str]:

				    timed_msg = {"dt": str(datetime.now()), "msg": "New message"}

				    logs = [timed_msg] if msg is None else msg

				    return finished, yaml.safe_dump(logs)

				def result_get_testjob_results_response() -> str:

				    result = {"result": "test"}

				    results = [{"metadata": result}]

				    return yaml.safe_dump(results)

				def result_get_testcase_results_response() -> str:

				    result = {"result": "pass"}

				    test_cases = [result]

				    return yaml.safe_dump(test_cases)

				@pytest.fixture

				def mock_proxy():

				    def create_proxy_mock(**kwargs):

				        proxy_mock = MagicMock()

				        proxy_submit_mock = proxy_mock.scheduler.jobs.submit

				        proxy_submit_mock.return_value = "1234"

				        proxy_results_mock = proxy_mock.results.get_testjob_results_yaml

				        proxy_results_mock.return_value = result_get_testjob_results_response()

				        proxy_test_cases_mock = proxy_mock.results.get_testcase_results_yaml

				        proxy_test_cases_mock.return_value = result_get_testcase_results_response()

				        proxy_logs_mock = proxy_mock.scheduler.jobs.logs

				        proxy_logs_mock.return_value = jobs_logs_response()

				        for key, value in kwargs.items():

				            setattr(proxy_logs_mock, key, value)

				        return proxy_mock

				    yield create_proxy_mock

				@pytest.fixture

				def mock_proxy_waiting_time(mock_proxy):

				    def update_mock_proxy(frozen_time, **kwargs):

				        wait_time = kwargs.pop("wait_time", 0)

				        proxy_mock = mock_proxy(**kwargs)

				        proxy_job_state = proxy_mock.scheduler.job_state

				        proxy_job_state.return_value = {"job_state": "Running"}

				        proxy_job_state.side_effect = frozen_time.tick(wait_time)

				        return proxy_mock

				    return update_mock_proxy

				@pytest.fixture

				def mock_sleep():

				    """Mock time.sleep to make test faster"""

				    with patch("time.sleep", return_value=None):

				        yield

				@pytest.fixture

				def frozen_time(mock_sleep):

				    with freeze_time() as frozen_time:

				        yield frozen_time

				@pytest.mark.parametrize("exception", [RuntimeError, SystemError, KeyError])

				def test_submit_and_follow_respects_exceptions(mock_sleep, mock_proxy, exception):

				    with pytest.raises(exception):

				        follow_job_execution(mock_proxy(side_effect=exception), "")

				def generate_n_logs(n=1, tick_sec=1):

				    """Simulate a log partitionated in n components"""

				    with freeze_time(datetime.now()) as time_travel:

				        while True:

				            # Simulate a scenario where the target job is waiting for being started

				            for _ in range(n - 1):

				                time_travel.tick(tick_sec)

				                yield jobs_logs_response(finished=False, msg=[])

				            time_travel.tick(tick_sec)

				            yield jobs_logs_response(finished=True)

				NETWORK_EXCEPTION = xmlrpc.client.ProtocolError("", 0, "test", {})

				XMLRPC_FAULT = xmlrpc.client.Fault(0, "test")

				PROXY_SCENARIOS = {

				    "finish case": (generate_n_logs(1), does_not_raise(), True),

				    "works at last retry": (

				        generate_n_logs(n=3, tick_sec=DEVICE_HANGING_TIMEOUT_SEC + 1),

				        does_not_raise(),

				        True,

				    ),

				    "timed out more times than retry attempts": (

				        generate_n_logs(n=4, tick_sec=DEVICE_HANGING_TIMEOUT_SEC + 1),

				        does_not_raise(),

				        False,

				    ),

				    "long log case, no silence": (

				        generate_n_logs(n=1000, tick_sec=0),

				        does_not_raise(),

				        True,

				    ),

				    "very long silence": (

				        generate_n_logs(n=4, tick_sec=100000),

				        does_not_raise(),

				        False,

				    ),

				    # If a protocol error happens, _call_proxy will retry without affecting timeouts

				    "unstable connection, ProtocolError followed by final message": (

				        (NETWORK_EXCEPTION, jobs_logs_response(finished=True)),

				        does_not_raise(),

				        True,

				    ),

				    # After an arbitrary number of retries, _call_proxy should call sys.exit

				    "unreachable case, subsequent ProtocolErrors": (

				        repeat(NETWORK_EXCEPTION),

				        pytest.raises(SystemExit),

				        False,

				    ),

				    "XMLRPC Fault": ([XMLRPC_FAULT], pytest.raises(SystemExit, match="1"), False),

				}

				@patch("time.sleep", return_value=None)  # mock sleep to make test faster

				@pytest.mark.parametrize(

				    "side_effect, expectation, has_finished",

				    PROXY_SCENARIOS.values(),

				    ids=PROXY_SCENARIOS.keys(),

				)

				def test_retriable_follow_job(

				    mock_sleep, side_effect, expectation, has_finished, mock_proxy

				):

				    with expectation:

				        result = retriable_follow_job(mock_proxy(side_effect=side_effect), "")

				        assert has_finished == result

				WAIT_FOR_JOB_SCENARIOS = {

				    "one log run taking (sec):": (generate_n_logs(1), True),

				}

				@pytest.mark.parametrize("wait_time", (0, DEVICE_HANGING_TIMEOUT_SEC * 2))

				@pytest.mark.parametrize(

				    "side_effect, has_finished",

				    WAIT_FOR_JOB_SCENARIOS.values(),

				    ids=WAIT_FOR_JOB_SCENARIOS.keys(),

				)

				def test_simulate_a_long_wait_to_start_a_job(

				    frozen_time,

				    wait_time,

				    side_effect,

				    has_finished,

				    mock_proxy_waiting_time,

				):

				    start_time = datetime.now()

				    result = retriable_follow_job(

				        mock_proxy_waiting_time(

				            frozen_time, side_effect=side_effect, wait_time=wait_time

				        ),

				        "",

				    )

				    end_time = datetime.now()

				    delta_time = end_time - start_time

				    assert has_finished == result

				    assert delta_time.total_seconds() >= wait_time

				SENSITIVE_DATA_SCENARIOS = {

				    "no sensitive data tagged": (

				        ["bla  bla", "mytoken: asdkfjsde1341=="],

				        ["bla  bla", "mytoken: asdkfjsde1341=="],

				        "HIDEME",

				    ),

				    "sensitive data tagged": (

				        ["bla  bla", "mytoken: asdkfjsde1341== # HIDEME"],

				        ["bla  bla"],

				        "HIDEME",

				    ),

				    "sensitive data tagged with custom word": (

				        ["bla  bla", "mytoken: asdkfjsde1341== # DELETETHISLINE", "third line"],

				        ["bla  bla", "third line"],

				        "DELETETHISLINE",

				    ),

				}

				@pytest.mark.parametrize(

				    "input, expectation, tag",

				    SENSITIVE_DATA_SCENARIOS.values(),

				    ids=SENSITIVE_DATA_SCENARIOS.keys(),

				)

				def test_hide_sensitive_data(input, expectation, tag):

				    yaml_data = yaml.safe_dump(input)

				    yaml_result = hide_sensitive_data(yaml_data, tag)

				    result = yaml.safe_load(yaml_result)

				    assert result == expectation

63

.gitlab-ci/valve/b2c.yml.jinja2.jinja2 Normal file

View File

@@ -0,0 +1,63 @@
 version: 1
 # Rules to match for a machine to qualify
 target:
 {% if tags %}
 {% set b2ctags = tags.split(',') %}
   tags:
 {% for tag in b2ctags %}
     - '{{ tag | trim }}'
 {% endfor %}
 {% endif %}
 timeouts:
   first_console_activity:  # This limits the time it can take to receive the first console log
     minutes: {{ timeout_first_minutes }}
     retries: {{ timeout_first_retries }}
   console_activity:        # Reset every time we receive a message from the logs
     minutes: {{ timeout_minutes }}
     retries: {{ timeout_retries }}
   boot_cycle:
     minutes: {{ timeout_boot_minutes }}
     retries: {{ timeout_boot_retries }}
   overall:                 # Maximum time the job can take, not overrideable by the "continue" deployment
     minutes: {{ timeout_overall_minutes }}
     retries: 0
     # no retries possible here
 console_patterns:
     session_end:
         regex: >-
           {{ session_end_regex }}
     session_reboot:
         regex: >-
           {{ session_reboot_regex }}
     job_success:
         regex: >-
           {{ job_success_regex }}
 # Environment to deploy
 deployment:
   # Initial boot
   start:
     kernel:
       url: '{{ kernel_url }}'
       cmdline: >
         SALAD.machine_id={{ '{{' }} machine_id }}
         console={{ '{{' }} local_tty_device }},115200 earlyprintk=vga,keep
         loglevel={{ log_level }} amdgpu.gpu_recovery=0 no_hash_pointers
         b2c.container="-ti --tls-verify=false docker://{{ '{{' }} fdo_proxy_registry }}/mupuf/valve-infra/machine_registration:latest check"
         b2c.ntp_peer=10.42.0.1 b2c.pipefail b2c.cache_device=auto b2c.poweroff_delay={{ poweroff_delay }}
         b2c.minio="gateway,{{ '{{' }} minio_url }},{{ '{{' }} job_bucket_access_key }},{{ '{{' }} job_bucket_secret_key }}"
         b2c.volume="{{ '{{' }} job_bucket }}-results,mirror=gateway/{{ '{{' }} job_bucket }},pull_on=pipeline_start,push_on=changes,overwrite{% for excl in job_volume_exclusions %},exclude={{ excl }}{% endfor %},expiration=pipeline_end,preserve"
 {% for volume in volumes %}
         b2c.volume={{ volume }}
 {% endfor %}
         b2c.container="-v {{ '{{' }} job_bucket }}-results:{{ working_dir }} -w {{ working_dir }} {% for mount_volume in mount_volumes %} -v {{ mount_volume }}{% endfor %} --tls-verify=false docker://{{ local_container }} {{ container_cmd }}"
         {% if cmdline_extras is defined %}
         {{ cmdline_extras }}
         {% endif %}
     initramfs:
       url: '{{ initramfs_url }}'

									
										101

.gitlab-ci/valve/generate_b2c.py
									
										Executable file
									
												View File
												
				@@ -0,0 +1,101 @@

				#!/usr/bin/env python3

				# Copyright © 2022 Valve Corporation

				#

				# Permission is hereby granted, free of charge, to any person obtaining a

				# copy of this software and associated documentation files (the "Software"),

				# to deal in the Software without restriction, including without limitation

				# the rights to use, copy, modify, merge, publish, distribute, sublicense,

				# and/or sell copies of the Software, and to permit persons to whom the

				# Software is furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice (including the next

				# paragraph) shall be included in all copies or substantial portions of the

				# Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT.  IN NO EVENT SHALL

				# THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING

				# FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS

				# IN THE SOFTWARE.

				from jinja2 import Environment, FileSystemLoader

				from argparse import ArgumentParser

				from os import environ, path

				parser = ArgumentParser()

				parser.add_argument('--ci-job-id')

				parser.add_argument('--container-cmd')

				parser.add_argument('--initramfs-url')

				parser.add_argument('--job-success-regex')

				parser.add_argument('--job-warn-regex')

				parser.add_argument('--kernel-url')

				parser.add_argument('--log-level', type=int)

				parser.add_argument('--poweroff-delay', type=int)

				parser.add_argument('--session-end-regex')

				parser.add_argument('--session-reboot-regex')

				parser.add_argument('--tags', nargs='?', default='')

				parser.add_argument('--template', default='b2c.yml.jinja2.jinja2')

				parser.add_argument('--timeout-boot-minutes', type=int)

				parser.add_argument('--timeout-boot-retries', type=int)

				parser.add_argument('--timeout-first-minutes', type=int)

				parser.add_argument('--timeout-first-retries', type=int)

				parser.add_argument('--timeout-minutes', type=int)

				parser.add_argument('--timeout-overall-minutes', type=int)

				parser.add_argument('--timeout-retries', type=int)

				parser.add_argument('--job-volume-exclusions', nargs='?', default='')

				parser.add_argument('--volume', action='append')

				parser.add_argument('--mount-volume', action='append')

				parser.add_argument('--local-container', default=environ.get('B2C_LOCAL_CONTAINER', 'alpine:latest'))

				parser.add_argument('--working-dir')

				args = parser.parse_args()

				env = Environment(loader=FileSystemLoader(path.dirname(args.template)),

				                  trim_blocks=True, lstrip_blocks=True)

				template = env.get_template(path.basename(args.template))

				values = {}

				values['ci_job_id'] = args.ci_job_id

				values['container_cmd'] = args.container_cmd

				values['initramfs_url'] = args.initramfs_url

				values['job_success_regex'] = args.job_success_regex

				values['job_warn_regex'] = args.job_warn_regex

				values['kernel_url'] = args.kernel_url

				values['log_level'] = args.log_level

				values['poweroff_delay'] = args.poweroff_delay

				values['session_end_regex'] = args.session_end_regex

				values['session_reboot_regex'] = args.session_reboot_regex

				values['tags'] = args.tags

				values['template'] = args.template

				values['timeout_boot_minutes'] = args.timeout_boot_minutes

				values['timeout_boot_retries'] = args.timeout_boot_retries

				values['timeout_first_minutes'] = args.timeout_first_minutes

				values['timeout_first_retries'] = args.timeout_first_retries

				values['timeout_minutes'] = args.timeout_minutes

				values['timeout_overall_minutes'] = args.timeout_overall_minutes

				values['timeout_retries'] = args.timeout_retries

				if len(args.job_volume_exclusions) > 0:

				    exclusions = args.job_volume_exclusions.split(",")

				    values['job_volume_exclusions'] = [excl for excl in exclusions if len(excl) > 0]

				if args.volume is not None:

				    values['volumes'] = args.volume

				if args.mount_volume is not None:

				    values['mount_volumes'] = args.mount_volume

				values['working_dir'] = args.working_dir

				assert(len(args.local_container) > 0)

				values['local_container'] = args.local_container.replace(

				    # Use the gateway's pull-through registry cache to reduce load on fd.o.

				    'registry.freedesktop.org', '{{ fdo_proxy_registry }}'

				)

				if 'B2C_KERNEL_CMDLINE_EXTRAS' in environ:

				    values['cmdline_extras'] = environ['B2C_KERNEL_CMDLINE_EXTRAS']

				f = open(path.splitext(path.basename(args.template))[0], "w")

				f.write(template.render(values))

				f.close()

4

.gitlab-ci/windows/Dockerfile → .gitlab-ci/windows/Dockerfile_build

View File

@@ -9,5 +9,5 @@ ENV ErrorActionPreference='Stop'
 COPY mesa_deps_vs2019.ps1 C:\
 RUN C:\mesa_deps_vs2019.ps1
 COPY mesa_deps.ps1 C:\
 RUN C:\mesa_deps.ps1
 COPY mesa_deps_build.ps1 C:\
 RUN C:\mesa_deps_build.ps1

7

.gitlab-ci/windows/Dockerfile_test Normal file

View File

@@ -0,0 +1,7 @@
 # escape=`
 ARG base_image
 FROM ${base_image}
 COPY mesa_deps_test.ps1 C:\
 RUN C:\mesa_deps_test.ps1

									
										31

.gitlab-ci/windows/deqp_runner_run.ps1
									
										Normal file
									
												View File
												
				@@ -0,0 +1,31 @@

				$dxil_dll = cmd.exe /C "C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd64 -no_logo && where dxil.dll" 2>&1

				if ($dxil_dll -notmatch "dxil.dll$") {

				    Write-Output "Couldn't get path to dxil.dll"

				    exit 1

				}

				$env:Path = "$(Split-Path $dxil_dll);$env:Path"

				# VK_ICD_FILENAMES environment variable is not used when running with

				# elevated privileges. Add a key to the registry instead.

				$hkey_path = "HKLM:\SOFTWARE\Khronos\Vulkan\Drivers\"

				$hkey_name = Join-Path -Path $pwd -ChildPath "_install\share\vulkan\icd.d\dzn_icd.x86_64.json"

				New-Item -Path $hkey_path -force

				New-ItemProperty -Path $hkey_path -Name $hkey_name -Value 0 -PropertyType DWORD

				$results = New-Item -ItemType Directory results

				$deqp_options = @("--deqp-surface-width", 256, "--deqp-surface-height", 256, "--deqp-surface-type", "pbuffer", "--deqp-gl-config-name", "rgba8888d24s8ms0", "--deqp-visibility", "hidden")

				$deqp_module = "C:\deqp\external\vulkancts\modules\vulkan\deqp-vk.exe"

				$caselist = "C:\deqp\mustpass\vk-master.txt"

				$baseline = ".\_install\warp-fails.txt"

				$includes = @("-t", "dEQP-VK.api.*", "-t", "dEQP-VK.info.*", "-t", "dEQP-VK.draw.*", "-t", "dEQP-VK.query_pool.*", "-t", "dEQP-VK.memory.*")

				$env:DZN_DEBUG = "warp"

				deqp-runner run --deqp $($deqp_module) --output $($results) --caselist $($caselist) --baseline $($baseline) $($includes) --testlog-to-xml C:\deqp\executor\testlog-to-xml.exe --jobs 4 -- $($deqp_options)

				$deqpstatus = $?

				$template = "See https://$($env:CI_PROJECT_ROOT_NAMESPACE).pages.freedesktop.org/-/$($env:CI_PROJECT_NAME)/-/jobs/$($env:CI_JOB_ID)/artifacts/results/{{testcase}}.xml"

				deqp-runner junit --testsuite dEQP --results "$($results)/failures.csv" --output "$($results)/junit.xml" --limit 50 --template $template

				if (!$deqpstatus) {

				    Exit 1

				}

									
										49

.gitlab-ci/windows/mesa_build.ps1
									
												View File
												
				@@ -6,10 +6,43 @@ $env:PYTHONUTF8=1

				Get-Date

				Write-Host "Compiling Mesa"

				$builddir = New-Item -ItemType Directory -Name "_build"

				$installdir = New-Item -ItemType Directory -Name "_install"

				Push-Location $builddir.FullName

				cmd.exe /C "C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd64 && meson --default-library=shared -Dzlib:default_library=static --buildtype=release -Db_ndebug=false -Dc_std=c17 -Dcpp_std=vc++latest -Db_vscrt=mt --cmake-prefix-path=`"C:\llvm-10`" --pkg-config-path=`"C:\llvm-10\lib\pkgconfig;C:\llvm-10\share\pkgconfig;C:\spirv-tools\lib\pkgconfig`" --prefix=`"$installdir`" -Dllvm=enabled -Dshared-llvm=disabled -Dvulkan-drivers=swrast,amd -Dgallium-drivers=swrast,d3d12,zink -Dshared-glapi=enabled -Dgles2=enabled -Dmicrosoft-clc=enabled -Dstatic-libclc=all -Dspirv-to-dxil=true -Dbuild-tests=true -Dwerror=true -Dwarning_level=2 -Dzlib:warning_level=1 -Dlibelf:warning_level=1 && ninja -j32 install && meson test --num-processes 32"

				$builddir = New-Item -Force -ItemType Directory -Name "_build"

				$installdir = New-Item -Force -ItemType Directory -Name "_install"

				$builddir=$builddir.FullName

				$installdir=$installdir.FullName

				$sourcedir=$PWD

				Remove-Item -Recurse -Force $builddir

				Remove-Item -Recurse -Force $installdir

				New-Item -ItemType Directory -Path $builddir

				New-Item -ItemType Directory -Path $installdir

				Write-Output builddir:$builddir

				Write-Output installdir:$installdir

				Write-Output sourcedir:$sourcedir

				$installPath=& "C:\Program Files (x86)\Microsoft Visual Studio\Installer\vswhere.exe" -version 16.0  -property installationpath

				Write-Output "vswhere.exe installPath: $installPath"

				$installPath="C:\BuildTools"

				Write-Output "Final installPath: $installPath"

				Import-Module (Join-Path $installPath "Common7\Tools\Microsoft.VisualStudio.DevShell.dll")

				Enter-VsDevShell -VsInstallPath $installPath -SkipAutomaticLocation -DevCmdArguments '-arch=x64 -no_logo -host_arch=amd64'

				Push-Location $builddir

				meson --default-library=shared -Dzlib:default_library=static --buildtype=release -Db_ndebug=false `

				-Db_vscrt=mt --cmake-prefix-path="C:\llvm-10" `

				--pkg-config-path="C:\llvm-10\lib\pkgconfig;C:\llvm-10\share\pkgconfig;C:\spirv-tools\lib\pkgconfig" `

				--prefix="$installdir" `

				-Dllvm=enabled -Dshared-llvm=disabled `

				"-Dvulkan-drivers=swrast,amd,microsoft-experimental" "-Dgallium-drivers=swrast,d3d12,zink" `

				-Dshared-glapi=enabled -Dgles2=enabled -Dmicrosoft-clc=enabled -Dstatic-libclc=all -Dspirv-to-dxil=true `

				-Dbuild-tests=true -Dwerror=true -Dwarning_level=2 -Dzlib:warning_level=1 -Dlibelf:warning_level=1 `

				$sourcedir

				ninja install -j32

				meson test --num-processes 32

				$buildstatus = $?

				Pop-Location

				@@ -21,4 +54,10 @@ if (!$buildstatus) {

				}

				Copy-Item ".\.gitlab-ci\windows\piglit_run.ps1" -Destination $installdir

				Copy-Item ".\.gitlab-ci\windows\quick_gl.txt" -Destination $installdir

				Copy-Item ".\.gitlab-ci\windows\spirv2dxil_check.ps1" -Destination $installdir

				Copy-Item ".\.gitlab-ci\windows\spirv2dxil_run.ps1" -Destination $installdir

				Copy-Item ".\.gitlab-ci\windows\deqp_runner_run.ps1" -Destination $installdir

				Get-ChildItem -Recurse -Filter "ci" | Get-ChildItem -Filter "*.txt" | Copy-Item -Destination $installdir

									
										4

.gitlab-ci/windows/mesa_container.ps1
									
												View File
												
				@@ -6,6 +6,8 @@ $registry_username = $args[1]

				$registry_password = $args[2]

				$registry_user_image = $args[3]

				$registry_central_image = $args[4]

				$build_dockerfile = $args[5]

				$registry_base_image = $args[6]

				Set-Location -Path ".\.gitlab-ci\windows"

				@@ -39,7 +41,7 @@ if ($?) {

				}

				Write-Host "No image found at $registry_user_image or $registry_central_image; rebuilding"

				docker --config "windows-docker.conf" build --no-cache -t "$registry_user_image" .

				docker --config "windows-docker.conf" build --no-cache -t "$registry_user_image" -f "$build_dockerfile" --build-arg base_image="$registry_base_image" .

				if (!$?) {

				  Write-Host "Container build failed"

				  docker --config "windows-docker.conf" logout "$registry_uri"

									
										63

.gitlab-ci/windows/mesa_deps.ps1 → .gitlab-ci/windows/mesa_deps_build.ps1
									
												View File
												
				@@ -129,6 +129,8 @@ if (!$buildstatus) {

				  Exit 1

				}

				# See https://gitlab.freedesktop.org/mesa/mesa/-/issues/3855

				# Until that's resolved, we need the vulkan-runtime as a build dependency to be able to run any unit tests on GL

				Get-Date

				Write-Host "Downloading Vulkan-Runtime"

				Invoke-WebRequest -Uri 'https://sdk.lunarg.com/sdk/download/latest/windows/vulkan-runtime.exe' -OutFile 'C:\vulkan-runtime.exe' | Out-Null

				@@ -140,66 +142,5 @@ if (!$?) {

				}

				Remove-Item C:\vulkan-runtime.exe -Force

				Get-Date

				Write-Host "Downloading Freeglut"

				$freeglut_zip = 'freeglut-MSVC.zip'

				$freeglut_url = "https://www.transmissionzero.co.uk/files/software/development/GLUT/$freeglut_zip"

				For ($i = 0; $i -lt 5; $i++) {

				  Invoke-WebRequest -Uri $freeglut_url -OutFile $freeglut_zip

				  $freeglut_downloaded = $?

				  if ($freeglut_downloaded) {

				    Break

				  }

				}

				if (!$freeglut_downloaded) {

				  Write-Host "Failed to download Freeglut"

				  Exit 1

				}

				Get-Date

				Write-Host "Installing Freeglut"

				Expand-Archive $freeglut_zip -DestinationPath C:\

				if (!$?) {

				  Write-Host "Failed to install Freeglut"

				  Exit 1

				}

				Get-Date

				Write-Host "Downloading glext.h"

				New-Item -ItemType Directory -Path ".\glext" -Name "GL"

				$ProgressPreference = "SilentlyContinue"

				Invoke-WebRequest -Uri 'https://www.khronos.org/registry/OpenGL/api/GL/glext.h' -OutFile '.\glext\GL\glext.h' | Out-Null

				Get-Date

				Write-Host "Cloning Piglit"

				git clone --no-progress --single-branch --no-checkout https://gitlab.freedesktop.org/mesa/piglit.git 'C:\src\piglit'

				if (!$?) {

				  Write-Host "Failed to clone Piglit repository"

				  Exit 1

				}

				Push-Location -Path C:\src\piglit

				git checkout b0bbeb876a506e0ee689dd7e17cee374c8284058

				Pop-Location

				Get-Date

				$piglit_build = New-Item -ItemType Directory -Path "C:\src\piglit" -Name "build"

				Push-Location -Path $piglit_build.FullName

				Write-Host "Compiling Piglit"

				cmd.exe /C 'C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd64 && cmake .. -GNinja -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX="C:\Piglit" -DGLUT_INCLUDE_DIR=C:\freeglut\include -DGLUT_glut_LIBRARY_RELEASE=C:\freeglut\lib\x64\freeglut.lib -DGLEXT_INCLUDE_DIR=.\glext && ninja -j32'

				$buildstatus = $?

				ninja -j32 install | Out-Null

				$installstatus = $?

				Pop-Location

				Remove-Item -Recurse -Path $piglit_build

				if (!$buildstatus -Or !$installstatus) {

				  Write-Host "Failed to compile or install Piglit"

				  Exit 1

				}

				Copy-Item -Path C:\freeglut\bin\x64\freeglut.dll -Destination C:\Piglit\lib\piglit\bin\freeglut.dll

				Get-Date

				Write-Host "Complete"

									
										124

.gitlab-ci/windows/mesa_deps_test.ps1
									
										Normal file
									
												View File
												
				@@ -0,0 +1,124 @@

				Get-Date

				Write-Host "Downloading Freeglut"

				$freeglut_zip = 'freeglut-MSVC.zip'

				$freeglut_url = "https://www.transmissionzero.co.uk/files/software/development/GLUT/$freeglut_zip"

				For ($i = 0; $i -lt 5; $i++) {

				  Invoke-WebRequest -Uri $freeglut_url -OutFile $freeglut_zip

				  $freeglut_downloaded = $?

				  if ($freeglut_downloaded) {

				    Break

				  }

				}

				if (!$freeglut_downloaded) {

				  Write-Host "Failed to download Freeglut"

				  Exit 1

				}

				Get-Date

				Write-Host "Installing Freeglut"

				Expand-Archive $freeglut_zip -DestinationPath C:\

				if (!$?) {

				  Write-Host "Failed to install Freeglut"

				  Exit 1

				}

				Get-Date

				Write-Host "Downloading glext.h"

				New-Item -ItemType Directory -Path ".\glext" -Name "GL"

				$ProgressPreference = "SilentlyContinue"

				Invoke-WebRequest -Uri 'https://www.khronos.org/registry/OpenGL/api/GL/glext.h' -OutFile '.\glext\GL\glext.h' | Out-Null

				Get-Date

				Write-Host "Cloning Piglit"

				git clone --no-progress --single-branch --no-checkout https://gitlab.freedesktop.org/mesa/piglit.git 'C:\src\piglit'

				if (!$?) {

				  Write-Host "Failed to clone Piglit repository"

				  Exit 1

				}

				Push-Location -Path C:\src\piglit

				git checkout f7f2a6c2275cae023a27b6cc81be3dda8c99492d

				Pop-Location

				Get-Date

				$piglit_build = New-Item -ItemType Directory -Path "C:\src\piglit" -Name "build"

				Push-Location -Path $piglit_build.FullName

				Write-Host "Compiling Piglit"

				cmd.exe /C 'C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd64 && cmake .. -GNinja -DCMAKE_BUILD_TYPE=Release -DCMAKE_INSTALL_PREFIX="C:\Piglit" -DGLUT_INCLUDE_DIR=C:\freeglut\include -DGLUT_glut_LIBRARY_RELEASE=C:\freeglut\lib\x64\freeglut.lib -DGLEXT_INCLUDE_DIR=.\glext && ninja -j32'

				$buildstatus = $?

				ninja -j32 install | Out-Null

				$installstatus = $?

				Pop-Location

				Remove-Item -Recurse -Path $piglit_build

				if (!$buildstatus -Or !$installstatus) {

				  Write-Host "Failed to compile or install Piglit"

				  Exit 1

				}

				Copy-Item -Path C:\freeglut\bin\x64\freeglut.dll -Destination C:\Piglit\lib\piglit\bin\freeglut.dll

				Get-Date

				Write-Host "Cloning spirv-samples"

				git clone --no-progress --single-branch --no-checkout https://github.com/dneto0/spirv-samples.git  C:\spirv-samples\

				Push-Location -Path C:\spirv-samples\

				git checkout 7ac0ad5a7fe0ec884faba1dc2916028d0268eeef

				Pop-Location

				Get-Date

				Write-Host "Cloning Vulkan and GL Conformance Tests"

				$deqp_source = "C:\src\VK-GL-CTS\"

				git clone --no-progress --single-branch https://github.com/lfrb/VK-GL-CTS.git -b windows-flush $deqp_source

				if (!$?) {

				  Write-Host "Failed to clone deqp repository"

				  Exit 1

				}

				Push-Location -Path $deqp_source

				# --insecure is due to SSL cert failures hitting sourceforge for zlib and

				# libpng (sigh).  The archives get their checksums checked anyway, and git

				# always goes through ssh or https.

				py .\external\fetch_sources.py --insecure

				Pop-Location

				Get-Date

				$deqp_build = New-Item -ItemType Directory -Path "C:\deqp"

				Push-Location -Path $deqp_build.FullName

				Write-Host "Compiling deqp"

				cmd.exe /C "C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd64 && cmake -S $($deqp_source) -B . -GNinja -DCMAKE_BUILD_TYPE=Release -DDEQP_TARGET=default && ninja -j32"

				$buildstatus = $?

				Pop-Location

				if (!$buildstatus -Or !$installstatus) {

				  Write-Host "Failed to compile or install deqp"

				  Exit 1

				}

				# Copy test result templates

				Copy-Item -Path "$($deqp_source)\doc\testlog-stylesheet\testlog.css" -Destination $deqp_build

				Copy-Item -Path "$($deqp_source)\doc\testlog-stylesheet\testlog.xsl" -Destination $deqp_build

				# Copy Vulkan must-pass list

				$deqp_mustpass = New-Item -ItemType Directory -Path $deqp_build -Name "mustpass"

				$root_mustpass = Join-Path -Path $deqp_source -ChildPath "external\vulkancts\mustpass\master"

				$files = Get-Content "$($root_mustpass)\vk-default.txt"

				foreach($file in $files) {

				  Get-Content "$($root_mustpass)\$($file)" | Add-Content -Path "$($deqp_mustpass)\vk-master.txt"

				}

				Remove-Item -Force -Recurse $deqp_source

				Get-Date

				$url = 'https://static.rust-lang.org/rustup/dist/x86_64-pc-windows-msvc/rustup-init.exe';

				Write-Host ('Downloading {0} ...' -f $url);

				Invoke-WebRequest -Uri $url -OutFile 'rustup-init.exe';

				Write-Host "Installing rust toolchain"

				C:\rustup-init.exe -y;

				Remove-Item C:\rustup-init.exe;

				Get-Date

				Write-Host "Installing deqp-runner"

				$env:Path += ";$($env:USERPROFILE)\.cargo\bin"

				cargo install --git https://gitlab.freedesktop.org/anholt/deqp-runner.git

				Get-Date

				Write-Host "Complete"

									
										2

.gitlab-ci/windows/piglit_run.ps1
									
												View File
												
				@@ -9,7 +9,7 @@ cmd.exe /C "C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd6

				py -3 C:\Piglit\bin\piglit.py summary console .\results | Select -SkipLast 1 | Select-String -NotMatch -Pattern ': pass' | Set-Content -Path .\result.txt

				$reference = Get-Content ".\_install\$env:PIGLIT_PROFILE.txt"

				$reference = Get-Content ".\_install\$env:PIGLIT_RESULTS.txt"

				$result = Get-Content .\result.txt

				if (-Not ($reference -And $result)) {

				  Exit 1

									
										54

.gitlab-ci/windows/spirv2dxil_check.ps1
									
										Normal file
									
												View File
												
				@@ -0,0 +1,54 @@

				# Ensure that dxil.dll in on the %PATH%

				$dxil_dll = cmd.exe /C "C:\BuildTools\Common7\Tools\VsDevCmd.bat -host_arch=amd64 -arch=amd64 -no_logo && where dxil.dll" 2>&1

				if ($dxil_dll -notmatch "dxil.dll$") {

				    Write-Output "Couldn't get path to dxil.dll"

				    exit 1

				}

				$env:Path = "$(Split-Path $dxil_dll);$env:Path"

				$exec_mode_to_stage = @{ Fragment = "fragment"; Vertex = "vertex"; GLCompute = "compute" }

				$spvasm_files = (Get-ChildItem C:\spirv-samples\spvasm\*.spvasm) | Sort-Object Name

				foreach ($spvasm in $spvasm_files) {

				    $test_name = "Test:$($spvasm.Name):"

				    $spvfile = ($spvasm -replace '\.spvasm$', '.spv')

				    $content = Get-Content $spvasm

				    $spv_version = "1.0"

				    if ($content | Where-Object { $_ -match 'Version:\s(\d+\.\d+)' }) {

				        $spv_version = $Matches[1]

				    }

				    $as_output = C:\spirv-tools\bin\spirv-as.exe --target-env spv$spv_version --preserve-numeric-ids -o $spvfile $spvasm 2>&1 | % { if ($_ -is [System.Management.Automation.ErrorRecord]) { $_.Exception.Message } else { $_ } }  | Out-String

				    if ($LASTEXITCODE -ne 0) {

				        Write-Output "$test_name Skip: Unable to assemble shader"

				        Write-Output "$as_output`n"

				        continue

				    }

				    $entry_points = $content | Select-String -Pattern '^OpEntryPoint\s(\w+)[^"]+"(\w+)"' | Select-Object -ExpandProperty Matches -First 1

				    if ($entry_points.Count -eq 0) {

				        Write-Output "$test_name Skip"

				        Write-Output "No OpEntryPoint not found`n"

				        continue

				    }

				    foreach ($match in $entry_points) {

				        $exec_mode, $entry_point = $match.Groups[1].Value, $match.Groups[2].Value

				        $subtest = "$test_name$entry_point|${exec_mode}:"

				        $stage = $exec_mode_to_stage[$exec_mode]

				        if ($stage -eq '') {

				            Write-Output "$subtest Fail: Unknown shader type ($exec_mode)"

				            continue

				        }

				        $s2d_output = .\_install\bin\spirv2dxil.exe -v -e "$entry_point" -s "$stage" -o NUL $spvfile 2>&1 | ForEach-Object { if ($_ -is [System.Management.Automation.ErrorRecord]) { $_.Exception.Message } else { $_ } }  | Out-String

				        if ($LASTEXITCODE -eq 0) {

				            Write-Output "$subtest Pass"

				        }

				        else {

				            Write-Output "$subtest Fail"

				            $sanitized_output = $s2d_output -replace ', file .+, line \d+' -replace '    In file .+:\d+'

				            Write-Output "$sanitized_output`n"

				        }

				    }

				}

									
										16

.gitlab-ci/windows/spirv2dxil_run.ps1
									
										Normal file
									
												View File
												
				@@ -0,0 +1,16 @@

				. .\_install\spirv2dxil_check.ps1 2>&1 | Set-Content -Path .\spirv2dxil_results.txt

				$reference = Get-Content .\_install\spirv2dxil_reference.txt

				$result = Get-Content .\spirv2dxil_results.txt

				if (-Not ($reference -And $result)) {

				    Exit 1

				}

				$diff = Compare-Object -ReferenceObject $reference -DifferenceObject $result

				if (-Not $diff) {

				    Exit 0

				}

				Write-Host "Unexpected change in results:"

				Write-Output $diff | Format-Table -Property SideIndicator, InputObject -Wrap

				Exit 1

5

.mailmap

View File

@@ -107,6 +107,8 @@ Bruce Cherniak <bruce.cherniak@intel.com>
 Bruce Merry <bmerry@users.sourceforge.net> <bmerry@gmail.com>
 Caio Oliveira <caio.oliveira@intel.com>
 Carl-Philip Hänsch <cphaensch@googlemail.com>
 Carl-Philip Hänsch <cphaensch@googlemail.com> <s3734770@mail.zih.tu-dresden.de>
 Carl-Philip Hänsch <cphaensch@googlemail.com> <carli@carli-laptop.(none)>
@@ -295,7 +297,8 @@ Jan Vesely <jano.vesely@gmail.com> Jan Vesely <jan.vesely@rutgers.edu>
 Jan Zielinski <jan.zielinski@intel.com> jzielins <jan.zielinski@intel.com>
 Jason Ekstrand <jason@jlekstrand.net> <jason.ekstrand@intel.com>
 Jason Ekstrand <jason.ekstrand@collabora.com> <jason@jlekstrand.net>
 Jason Ekstrand <jason.ekstrand@collabora.com> <jason.ekstrand@intel.com>
 Jeremy Huddleston <jeremyhu@apple.com>
 Jeremy Huddleston <jeremyhu@apple.com> <jeremyhu@freedesktop.org>

1946

.pick_status.json

View File

File diff suppressed because it is too large Load Diff

25

CODEOWNERS

View File

@@ -98,6 +98,14 @@ meson.build @dbaker @eric
 /src/*/vulkan/*_wsi_display.c @keithp
 ######
 # CI #
 ######
 # Broadcom
 /src/broadcom/ci/ @jasuarez @chema
 ###########
 # Drivers #
 ###########
@@ -106,9 +114,19 @@ meson.build @dbaker @eric
 /src/asahi/ @alyssa
 /src/gallium/drivers/asahi/ @alyssa
 # Broadcom
 /src/broadcom/ @itoral @apinheiro
 /src/gallium/drivers/v3d/ @itoral @chema @jasuarez
 /src/gallium/drivers/vc4/ @itoral @chema @jasuarez
 # Freedreno
 /src/gallium/drivers/freedreno/ @robclark
 # Imagination
 /include/drm-uapi/pvr_drm.h @CreativeCylon @frankbinns @rajnesh-kanwal
 /src/imagination/ @CreativeCylon @frankbinns @rajnesh-kanwal
 /src/imagination/rogue/ @simon-perretta-img
 # Intel
 /include/drm-uapi/i915_drm.h @kwg @llandwerlin @jekstrand @idr
 /include/pci_ids/i*_pci_ids.h @kwg @llandwerlin @jekstrand @idr
@@ -116,8 +134,6 @@ meson.build @dbaker @eric
 /src/gallium/winsys/iris/ @kwg @llandwerlin @jekstrand @idr
 /src/gallium/drivers/iris/ @kwg @llandwerlin @jekstrand @idr
 /src/gallium/drivers/i915/ @anholt
 /src/mesa/drivers/dri/i965/ @kwg @llandwerlin @jekstrand @idr
 /doxygen/i965.doxy @kwg @llandwerlin @jekstrand @idr
 # Microsoft
 /src/microsoft/ @jenatali
@@ -128,11 +144,6 @@ meson.build @dbaker @eric
 /src/panfrost/vulkan/ @bbrezillon
 /src/gallium/drivers/panfrost/ @alyssa
 # SWR
 /src/gallium/drivers/swr/ @jzielins @krzysztof.raszkowski
 /docs/gallium/drivers/openswr.rst @jzielins @krzysztof.raszkowski
 /docs/gallium/drivers/openswr/ @jzielins @krzysztof.raszkowski
 # VMware
 /src/gallium/drivers/svga/ @brianp @charmainel
 /src/gallium/winsys/svga/ @thomash @drawat

2

VERSION

View File

@@ -1 +1 @@
 .3.0-rc2
 .1.0-devel

									
										13

android/Android.mk
									
												View File
												
				@@ -42,10 +42,13 @@ LOCAL_SHARED_LIBRARIES := libc libdl libdrm libm liblog libcutils libz libc++ li

				LOCAL_STATIC_LIBRARIES := libexpat libarect libelf

				LOCAL_HEADER_LIBRARIES := libnativebase_headers hwvulkan_headers libbacktrace_headers

				MESON_GEN_PKGCONFIGS := backtrace cutils expat hardware libdrm:$(LIBDRM_VERSION) nativewindow sync zlib:1.2.11 libelf

				LOCAL_CFLAGS += $(BOARD_MESA3D_CFLAGS)

				ifneq ($(filter swr swrast,$(BOARD_MESA3D_GALLIUM_DRIVERS) $(BOARD_MESA3D_VULKAN_DRIVERS)),)

				ifneq ($(filter swrast,$(BOARD_MESA3D_GALLIUM_DRIVERS) $(BOARD_MESA3D_VULKAN_DRIVERS)),)

				ifeq ($(BOARD_MESA3D_FORCE_SOFTPIPE),)

				MESON_GEN_LLVM_STUB := true

				endif

				endif

				ifneq ($(filter zink,$(BOARD_MESA3D_GALLIUM_DRIVERS)),)

				LOCAL_SHARED_LIBRARIES += libvulkan

				@@ -74,10 +77,14 @@ LOCAL_SHARED_LIBRARIES += libdrm_nouveau

				MESON_GEN_PKGCONFIGS += libdrm_nouveau:$(LIBDRM_VERSION)

				endif

				ifneq ($(filter d3d12,$(BOARD_MESA3D_GALLIUM_DRIVERS)),)

				LOCAL_HEADER_LIBRARIES += DirectX-Headers

				LOCAL_STATIC_LIBRARIES += DirectX-Guids

				MESON_GEN_PKGCONFIGS += DirectX-Headers

				endif

				ifneq ($(MESON_GEN_LLVM_STUB),)

				MESON_LLVM_VERSION := 12.0.0

				# Required for swr gallium target

				MESON_LLVM_IRBUILDER_PATH := external/llvm-project/llvm/include/llvm/IR/IRBuilder.h

				LOCAL_SHARED_LIBRARIES += libLLVM12

				endif

									
										10

android/mesa3d_cross.mk
									
												View File
												
				@@ -93,6 +93,7 @@ MESON_GEN_NINJA := \

					-Dvulkan-drivers=$(subst $(space),$(comma),$(subst radeon,amd,$(BOARD_MESA3D_VULKAN_DRIVERS)))   \

					-Dgbm=enabled                                                                \

					-Degl=enabled                                                                \

					-Dcpp_rtti=false                                                             \

				MESON_BUILD := PATH=/usr/bin:/bin:/sbin:$$PATH ninja -C $(MESON_OUT_DIR)/build

				@@ -128,7 +129,6 @@ $(MESON_GEN_FILES_TARGET): PRIVATE_C_INCLUDES := $(my_c_includes)

				$(MESON_GEN_FILES_TARGET): PRIVATE_IMPORTED_INCLUDES := $(imported_includes)

				$(MESON_GEN_FILES_TARGET): PRIVATE_LDFLAGS := $(my_ldflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_LDLIBS := $(my_ldlibs)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_GLOBAL_LDFLAGS := $(my_target_global_ldflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TIDY_CHECKS := $(my_tidy_checks)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TIDY_FLAGS := $(my_tidy_flags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_ARFLAGS := $(my_arflags)

				@@ -139,6 +139,11 @@ $(MESON_GEN_FILES_TARGET): PRIVATE_ALL_OBJECTS := $(strip $(all_objects))

				$(MESON_GEN_FILES_TARGET): PRIVATE_ARM_CFLAGS := $(normal_objects_cflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_GLOBAL_CFLAGS := $(my_target_global_cflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_GLOBAL_CONLYFLAGS := $(my_target_global_conlyflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_GLOBAL_CPPFLAGS := $(my_target_global_cppflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_GLOBAL_LDFLAGS := $(my_target_global_ldflags)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_LIBCRT_BUILTINS := $(my_target_libcrt_builtins)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_LIBATOMIC := $(my_target_libatomic)

				$(MESON_GEN_FILES_TARGET): PRIVATE_TARGET_CRTBEGIN_SO_O := $(my_target_crtbegin_so_o)

				@@ -252,8 +257,7 @@ ifneq ($(MESON_GEN_LLVM_STUB),)

					mkdir -p $(dir $@)/subprojects/llvm/

					echo -e "project('llvm', 'cpp', version : '$(MESON_LLVM_VERSION)')\n" \

						"dep_llvm = declare_dependency()\n"                           \

						"has_rtti = false\n"                                          \

						"irbuilder_h = files('$(AOSP_ABSOLUTE_PATH)/$(MESON_LLVM_IRBUILDER_PATH)')" > $(dir $@)/subprojects/llvm/meson.build

						"has_rtti = false\n" > $(dir $@)/subprojects/llvm/meson.build

				endif

					$(MESON_GEN_NINJA)

					$(MESON_BUILD)

									
										6

bin/gen_calendar_entries.py
									
												View File
												
				@@ -105,7 +105,7 @@ def release_candidate(args: RCArguments) -> None:

				    data = read_calendar()

				    with CALENDAR_CSV.open('w') as f:

				    with CALENDAR_CSV.open('w', newline='') as f:

				        writer = csv.writer(f)

				        writer.writerows(data)

				@@ -147,7 +147,7 @@ def final_release(args: FinalArguments) -> None:

				    data = read_calendar()

				    date = _calculate_next_release_date(not args.zero_released)

				    with CALENDAR_CSV.open('w') as f:

				    with CALENDAR_CSV.open('w', newline='') as f:

				        writer = csv.writer(f)

				        writer.writerows(data)

				@@ -199,7 +199,7 @@ def extend(args: ExtendArguments) -> None:

				    current = read_calendar()

				    with CALENDAR_CSV.open('w') as f:

				    with CALENDAR_CSV.open('w', newline='') as f:

				        writer = csv.writer(f)

				        with write_existing(writer, current) as row:

				            # Get rid of -rcX as well

									
										7

bin/khronos-update.py
									
												View File
												
				@@ -177,6 +177,13 @@ SOURCES = [

				            Source('include/vulkan/vulkan_xlib.h',              'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vulkan/vulkan_xlib.h'),

				            Source('include/vulkan/vulkan_xlib_xrandr.h',       'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vulkan/vulkan_xlib_xrandr.h'),

				            Source('include/vulkan/vk_android_native_buffer.h', 'https://android.googlesource.com/platform/frameworks/native/+/master/vulkan/include/vulkan/vk_android_native_buffer.h?format=TEXT'),

				            Source('include/vk_video/vulkan_video_codec_h264std.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codec_h264std.h'),

				            Source('include/vk_video/vulkan_video_codec_h264std_decode.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codec_h264std_decode.h'),

				            Source('include/vk_video/vulkan_video_codec_h264std_encode.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codec_h264std_encode.h'),

				            Source('include/vk_video/vulkan_video_codec_h265std.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codec_h265std.h'),

				            Source('include/vk_video/vulkan_video_codec_h265std_decode.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codec_h265std_decode.h'),

				            Source('include/vk_video/vulkan_video_codec_h265std_encode.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codec_h265std_encode.h'),

				            Source('include/vk_video/vulkan_video_codecs_common.h', 'https://github.com/KhronosGroup/Vulkan-Headers/raw/main/include/vk_video/vulkan_video_codecs_common.h'),

				            Source('include/vulkan/.editorconfig',              None),

				        ],

				    },

9

docs/_extra/_redirects Normal file

View File

@@ -0,0 +1,9 @@
 /drivers/vmware-guest.html /drivers/svga3d.html 301
 /gallium/drivers/freedreno.html /drivers/freedreno.html 301
 /gallium/drivers/freedreno/ir3-notes.html /drivers/freedreno/ir3-notes.html 301
 /gallium/drivers/llvmpipe.html /drivers/llvmpipe.html 301
 /gallium/drivers/zink.html /drivers/zink.html 301
 /llvmpipe.html /drivers/llvmpipe.html 301
 /postprocess.html /gallium/postprocess.html 301
 /versions.html /relnotes.html 301
 /vmware-guest.html /drivers/vmware-guest.html 301

									
										20

docs/_exts/redirects.py
									
												View File
												
				@@ -1,3 +1,23 @@

				# Copyright © 2020-2021 Collabora Ltd

				#

				# Permission is hereby granted, free of charge, to any person obtaining a copy

				# of this software and associated documentation files (the "Software"), to deal

				# in the Software without restriction, including without limitation the rights

				# to use, copy, modify, merge, publish, distribute, sublicense, and/or sell

				# copies of the Software, and to permit persons to whom the Software is

				# furnished to do so, subject to the following conditions:

				#

				# The above copyright notice and this permission notice shall be included in

				# all copies or substantial portions of the Software.

				#

				# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR

				# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,

				# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE

				# AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER

				# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,

				# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE

				# SOFTWARE.

				import os

				import pathlib

				from urllib.parse import urlparse

									
										74

docs/ci/bare-metal.rst
									
												View File
												
				@@ -3,7 +3,7 @@ Bare-metal CI

				The bare-metal scripts run on a system with gitlab-runner and Docker,

				connected to potentially multiple bare-metal boards that run tests of

				Mesa.  Currently only "fastboot" and "ChromeOS Servo" devices are

				Mesa.  Currently "fastboot", "ChromeOS Servo", and POE-powered devices are

				supported.

				In comparison with LAVA, this doesn't involve maintaining a separate

				@@ -25,11 +25,17 @@ should probably have the console on a serial connection, so that you

				can see bootloader progress.

				The boards need to be able to have a kernel/initramfs supplied by the

				gitlab-runner system, since the initramfs is what contains the Mesa

				testing payload.

				gitlab-runner system, since Mesa often needs to update the kernel either for new

				DRM functionality, or to fix kernel bugs.

				The boards should have networking, so that we can extract the dEQP .xml

				results to artifacts on GitLab.

				The boards must have networking, so that we can extract the dEQP .xml results to

				artifacts on GitLab, and so that we can download traces (too large for an

				initramfs) for trace replay testing.  Given that we need networking already, and

				our deqp/piglit/etc. payload is large, we use nfs from the x86 runner system

				rather than initramfs.

				See `src/freedreno/ci/gitlab-ci.yml` for an example of fastboot on DB410c and

				DB820c (freedreno-a306 and freereno-a530).

				Requirements (servo)

				--------------------

				@@ -68,6 +74,64 @@ call "servo"::

				  dhcp-option=tag:cheza1,option:root-path,/srv/nfs/cheza1

				  dhcp-option=tag:cheza2,option:root-path,/srv/nfs/cheza2

				See `src/freedreno/ci/gitlab-ci.yml` for an example of servo on cheza.  Note

				that other servo boards in CI are managed using LAVA.

				Requirements (POE)

				------------------

				For boards with 30W or less power consumption, POE can be used for the power

				control.  The parts list ends up looking something like (for example):

				- x86-64 gitlab-runner machine with a mid-range CPU, and 3+ GB of SSD storage

				  per board.  This can host at least 15 boards in our experience.

				- Cisco 2960S gigabit ethernet switch with POE. (Cisco 3750G, 3560G, or 2960G

				  were also recommended as reasonable-priced HW, but make sure the name ends in

				  G, X, or S)

				- POE splitters to power the boards (you can find ones that go to micro USB,

				  USBC, and 5V barrel jacks at least)

				- USB serial cables (Adafruit sells pretty reliable ones)

				- A large powered USB hub for all the serial cables

				- A pile of ethernet cables

				You'll talk to the Cisco for configuration using its USB port, which provides a

				serial terminal at 9600 baud.  You need to enable SNMP control, which we'll do

				using a "mesaci" community name that the gitlab runner can access as its

				authentication (no password) to configure.  To talk to the SNMP on the router,

				you need to put an ip address on the default vlan (vlan 1).

				Setting that up looks something like:

				.. code-block: console

				  Switch>

				  Password:

				  Switch#configure terminal

				  Switch(config)#interface Vlan 1

				  Switch(config-if)#ip address 10.42.0.2 255.255.0.0

				  Switch(config-if)#end

				  Switch(config)#snmp-server community mesaci RW

				  Switch(config)#end

				  Switch#copy running-config startup-config

				With that set up, you should be able to power on/off a port with something like:

				.. code-block: console

				  % snmpset -v2c -r 3 -t 30 -cmesaci 10.42.0.2 1.3.6.1.4.1.9.9.402.1.2.1.1.1.1 i 1

				  % snmpset -v2c -r 3 -t 30 -cmesaci 10.42.0.2 1.3.6.1.4.1.9.9.402.1.2.1.1.1.1 i 4

				Note that the "1.3.6..." SNMP OID changes between switches.  The last digit

				above is the interface id (port number).  You can probably find the right OID by

				google, that was easier than figuring it out from finding the switch's MIB

				database.  You can query the POE status from the switch serial using the `show

				power inline` command.

				Other than that, find the dnsmasq/tftp/nfs setup for your boards "servo" above.

				See `src/broadcom/ci/gitlab-ci.yml` and `src/nouveau/ci/gitlab-ci.yml` for an

				examples of POE for Raspberry Pi 3/4, and Jetson Nano.

				Setup

				-----

									
										24

docs/ci/index.rst
									
												View File
												
				@@ -242,3 +242,27 @@ directory.  You can hack on mesa and iterate testing the build with:

				.. code-block:: console

				    sudo docker run --rm -v `pwd`:/mesa $IMAGE ninja -C /mesa/_build

				Conformance Tests

				-----------------

				Some conformance tests require a special treatment to be maintained on Gitlab CI.

				This section lists their documentation pages.

				.. toctree::

				  :maxdepth: 1

				  skqp

				Updating Gitlab CI Linux Kernel

				-------------------------------

				Gitlab CI usually runs a bleeding-edge kernel. The following documentation has

				instructions on how to uprev Linux Kernel in the Gitlab Ci ecosystem.

				.. toctree::

				  :maxdepth: 1

				  kernel

									
										121

docs/ci/kernel.rst
									
										Normal file
									
												View File
												
				@@ -0,0 +1,121 @@

				Upreving Linux Kernel

				=====================

				Occasionally, the Gitlab CI needs a Linux Kernel update to enable new kernel

				features, device drivers, bug fixes etc to CI jobs.

				Kernel uprevs in Gitlab CI are relatively simple, but prone to lots of

				side-effects since many devices from different platforms are involved in the

				pipeline.

				Kernel repository

				-----------------

				The Linux Kernel used in the Gitlab CI is stored at the following repository:

				https://gitlab.freedesktop.org/gfx-ci/linux

				It is common that Mesa kernel brings some patches that were not merged on the

				Linux mainline, that is why Mesa has its own kernel version which should be used

				as the base for newer kernels.

				So, one should base the kernel uprev from the last tag used in the Mesa CI,

				please refer to `.gitlab-ci.yml` `KERNEL_URL` variable.

				Every tag has a standard naming: `vX.YZ-for-mesa-ci-<commit_short_SHA>`, which

				can be created via the command:

				:code:`git tag vX.YZ-for-mesa-ci-$(git rev-parse --short HEAD)`

				Building Kernel

				---------------

				When Mesa CI generates a new rootfs image, the Linux Kernel is built based on

				the script located at `.gitlab-ci/build-kernel.sh`.

				Updating Kconfigs

				^^^^^^^^^^^^^^^^^

				When a Kernel uprev happens, it is worth compiling and cross-compiling the

				Kernel locally, in order to update the Kconfigs accordingly.  Remember that the

				resulting Kconfig is a merge between *Mesa CI Kconfig* and *Linux tree

				defconfig* made via `merge_config.sh` script located at Linux Kernel tree.

				Kconfigs location

				"""""""""""""""""

				+------------+--------------------------------------------+-------------------------------------+

				| Platform   | Mesa CI Kconfig location                   | Linux tree defconfig                |

				+============+============================================+=====================================+

				| arm        | .gitlab-ci/container/arm.config            | arch/arm/configs/multi_v7_defconfig |

				+------------+--------------------------------------------+-------------------------------------+

				| arm64      | .gitlab-ci/container/arm64.config          | arch/arm64/configs/defconfig        |

				+------------+--------------------------------------------+-------------------------------------+

				| x86-64     | .gitlab-ci/container/x86_64.config         | arch/x86/configs/x86_64_defconfig   |

				+------------+--------------------------------------------+-------------------------------------+

				Updating image tags

				-------------------

				Every kernel uprev should update 3 image tags, located at two files.

				:code:`.gitlab-ci.yml` tag

				^^^^^^^^^^^^^^^^^^^^^^^^^^

				- **KERNEL_URL** for the location of the new kernel

				:code:`.gitlab-ci/image-tags.yml` tags

				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				- **KERNEL_ROOTFS_TAG** to rebuild rootfs with the new kernel

				- **DEBIAN_X86_TEST_GL_TAG** to ensure that the new rootfs is being used by the Gitlab x86 jobs

				Development routine

				-------------------

				1. Compile the newer kernel locally for each platform.

				2. Compile device trees for ARM platforms

				3. Update Kconfigs. Are new Kconfigs necessary? Is CONFIG_XYZ_BLA deprecated? Does the `merge_config.sh` override an important config?

				4. Push a new development branch to `Kernel repository`_ based on the latest kernel tag used in Gitlab CI

				5. Hack `build-kernel.sh` script to clone kernel from your development branch

				6. Update image tags. See `Updating image tags`_

				7. Run the entire CI pipeline, all the automatic jobs should be green. If some job is red or taking too long, you will need to investigate it and probably ask for help.

				When the Kernel uprev is stable

				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				1. Push a new tag to Mesa CI `Kernel repository`_

				2. Update KERNEL_URL `debian/x86_test-gl` job definition

				3. Open a merge request, if it is not opened yet

				Tips and Tricks

				---------------

				Compare pipelines

				^^^^^^^^^^^^^^^^^

				To have the most confidence that a kernel uprev does not break anything in Mesa,

				it is suggested that one runs the entire CI pipeline to check if the update affected the manual CI jobs.

				Step-by-step

				""""""""""""

				1. Create a local branch in the same git ref (should be the main branch) before branching to the kernel uprev kernel.

				2. Push this test branch

				3. Run the entire pipeline against the test branch, even the manual jobs

				4. Now do the same for the kernel uprev branch

				5. Compare the job results. If a CI job turned red on your uprev branch, it means that the kernel update broke the test. Otherwise, it should be fine.

				Bare-metal custom kernels

				^^^^^^^^^^^^^^^^^^^^^^^^^

				Some CI jobs have support to plug in a custom kernel by simply changing a variable.

				This is great, since rebuilding the kernel and rootfs may takes dozens of minutes.

				For example, freedreno jobs `gitlab.yml` manifest support a variable named

				`BM_KERNEL`. If one puts a gz-compressed kernel URL there, the job will use that

				kernel to boot the freedreno bare-metal devices. The same works for `BM_DTB` in

				the case of device tree binaries.

				Careful reading of the job logs

				^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

				Sometimes a job may turn to red for reasons unrelated to the kernel update, e.g.

				LAVA `tftp` timeout, problems with the freedesktop servers etc.

				So it is important to see the reason why the job turned red, and retry it if an

				infrastructure error has happened.

									
										101

docs/ci/skqp.rst
									
										Normal file
									
												View File
												
				@@ -0,0 +1,101 @@

				skqp

				====

				`skqp <https://skia.org/docs/dev/testing/skqp/>`_ stands for SKIA Quality

				Program conformance tests.  Basically, it has sets of rendering tests and unit

				tests to ensure that `SKIA <https://skia.org/>`_ is meeting its design specifications on a specific

				device.

				The rendering tests have support for GL, GLES and Vulkan backends and test some

				rendering scenarios.

				And the unit tests check the GPU behavior without rendering images.

				Tests 

				-----

				Render tests design

				^^^^^^^^^^^^^^^^^^^

				It is worth noting that `rendertests.txt` can bring some detail about each test

				expectation, so each test can have a max pixel error count, to tell skqp that it

				is OK to have at most that number of errors for that test. See also:

				https://github.com/google/skia/blob/main/tools/skqp/README_ALGORITHM.md

				.. _test-location:

				Location

				^^^^^^^^

				Each `rendertests.txt` and `unittest.txt` file must be located inside a specific

				subdirectory inside skqp assets directory.

				+--------------+--------------------------------------------+

				| Test type    | Location                                   |

				+==============+============================================+

				| Render tests |  `${SKQP_ASSETS_DIR}/skqp/rendertests.txt` |

				+--------------+--------------------------------------------+

				| Unit tests   |  `${SKQP_ASSETS_DIR}/skqp/unittests.txt`   |

				+--------------+--------------------------------------------+

				The `skqp-runner.sh` script will make the necessary modifications to separate

				`rendertests.txt` for each backend-driver combination. As long as the test files are located in the expected place:

				+--------------+----------------------------------------------------------------------------------------------+

				| Test type    | Location                                                                                     |

				+==============+==============================================================================================+

				| Render tests | `${MESA_REPOSITORY_DIR}/src/${GPU_DRIVER}/ci/${GPU_VERSION}-${SKQP_BACKEND}_rendertests.txt` |

				+--------------+----------------------------------------------------------------------------------------------+

				| Unit tests   | `${MESA_REPOSITORY_DIR}/src/${GPU_DRIVER}/ci/${GPU_VERSION}_unittests.txt`                   |

				+--------------+----------------------------------------------------------------------------------------------+

				Where `SKQP_BACKEND` can be:

				- gl: for GL backend

				- gles: for GLES backend

				- vk: for Vulkan backend

				Example file

				""""""""""""

				.. code-block:: console

				  src/freedreno/ci/freedreno-a630-skqp-gl_rendertests.txt

				- GPU_DRIVER: `freedreno`

				- GPU_VERSION: `freedreno-a630`

				- SKQP_BACKEND: `gl`

				.. _rendertests-design:

				skqp reports

				------------

				skqp generates reports after finishing its execution, they are located at the job

				artifacts results directory and are divided in subdirectories by rendering tests

				backends and unit

				tests. The job log has links to every generated report in order to facilitate

				the skqp debugging.

				Maintaining skqp on Mesa CI

				---------------------------

				skqp is built alongside with another binary, namely `list_gpu_unit_tests`, it is

				located in the same folder where `skqp` binary is.

				This binary will generate the expected `unittests.txt` for the target GPU, so

				ideally it should be executed on every skqp update and when a new device

				receives skqp CI jobs.

				1. Generate target unit tests for the current GPU with :code:`./list_gpu_unit_tests > unittests.txt`

				2. Run skqp job

				3. If there is a failing or crashing unit test, remove it from the corresponding `unittests.txt`

				4. If there is a crashing render test, remove it from the corresponding `rendertests.txt`

				5. If there is a failing render test, visually inspect the result from the HTML report

				    - If the render result is OK, update the max error count for that test

				    - Otherwise, or put `-1` in the same threshold, as seen in :ref:`rendertests-design`

				6. Remember to put the new tests files to the locations cited in :ref:`test-location`

									
										4

docs/codingstyle.rst
									
												View File
												
				@@ -128,5 +128,5 @@ Basic formatting guidelines

				   prefer the use of ``bool``, ``true``, and ``false`` over

				   ``GLboolean``, ``GL_TRUE``, and ``GL_FALSE``. In C code, this may

				   mean that ``#include <stdbool.h>`` needs to be added. The

				   ``try_emit_*`` methods in ``src/mesa/program/ir_to_mesa.cpp`` and

				   ``src/mesa/state_tracker/st_glsl_to_tgsi.cpp`` can serve as examples.

				   ``try_emit_*`` method ``src/mesa/state_tracker/st_glsl_to_tgsi.cpp``

				   can serve as an example.

Compare commits

6230 Commits mesa-21.3. ... 22.1-branc

6 .gitattributes vendored Normal file Unescape Escape View File

1116 .gitlab-ci.yml View File

11 .gitlab-ci/deqp-all-skips.txt → .gitlab-ci/all-skips.txt Unescape Escape View File

2 .gitlab-ci/bare-metal/arm64_a630_egl.sh Unescape Escape View File

17 .gitlab-ci/bare-metal/cisco-2960-poe-off.sh Executable file Unescape Escape View File

21 .gitlab-ci/bare-metal/cisco-2960-poe-on.sh Executable file Unescape Escape View File

14 .gitlab-ci/bare-metal/cros_servo_run.py Unescape Escape View File

22 .gitlab-ci/bare-metal/fastboot_run.py Unescape Escape View File

34 .gitlab-ci/bare-metal/poe-powered.sh Unescape Escape View File

4 .gitlab-ci/bare-metal/poe_run.py Unescape Escape View File

8 .gitlab-ci/bare-metal/rootfs-setup.sh Unescape Escape View File

27 .gitlab-ci/bare-metal/serial_buffer.py Unescape Escape View File

525 .gitlab-ci/build/gitlab-ci.yml Normal file Unescape Escape View File

47 .gitlab-ci/common/generate-env.sh Unescape Escape View File

1 .gitlab-ci/common/init-stage1.sh Unescape Escape View File

54 .gitlab-ci/common/init-stage2.sh Unescape Escape View File

567 .gitlab-ci/common/intel-gpu-freq.sh Executable file Unescape Escape View File

31 .gitlab-ci/container/arm64.config Unescape Escape View File

5 .gitlab-ci/container/baremetal_build.sh Unescape Escape View File

56 .gitlab-ci/container/build-crosvm.sh Unescape Escape View File

43 .gitlab-ci/container/build-crosvm_no-syslog.patch Normal file Unescape Escape View File

27 .gitlab-ci/container/build-deqp-runner.sh Unescape Escape View File

10 .gitlab-ci/container/build-deqp.sh Unescape Escape View File

2 .gitlab-ci/container/build-fossilize.sh Unescape Escape View File

2 .gitlab-ci/container/build-kernel.sh Unescape Escape View File

2 .gitlab-ci/container/build-libdrm.sh Unescape Escape View File

2 .gitlab-ci/container/build-piglit.sh Unescape Escape View File

97 .gitlab-ci/container/build-skqp.sh Executable file Unescape Escape View File

13 .gitlab-ci/container/build-skqp_BUILD.gn.patch Normal file Unescape Escape View File

47 .gitlab-ci/container/build-skqp_base.gn Normal file Unescape Escape View File

68 .gitlab-ci/container/build-skqp_fetch_gn.patch Normal file Unescape Escape View File

142 .gitlab-ci/container/build-skqp_git-sync-deps.patch Normal file Unescape Escape View File

13 .gitlab-ci/container/build-skqp_is_clang.py.patch Normal file Unescape Escape View File

17 .gitlab-ci/container/build-va-tools.sh Normal file Unescape Escape View File

20 .gitlab-ci/container/build-virglrenderer.sh Unescape Escape View File

4 .gitlab-ci/container/build-vkd3d-proton.sh Unescape Escape View File

22 .gitlab-ci/container/build-wayland.sh Normal file Unescape Escape View File

2 .gitlab-ci/container/container_pre_build.sh Unescape Escape View File

2 .gitlab-ci/container/create-android-cross-file.sh Unescape Escape View File

34 .gitlab-ci/container/create-rootfs.sh Unescape Escape View File

50 .gitlab-ci/container/debian/android_build.sh Unescape Escape View File

3 .gitlab-ci/container/debian/arm_build.sh Unescape Escape View File

6 .gitlab-ci/container/debian/arm_test.sh Unescape Escape View File

3 .gitlab-ci/container/debian/x86_build-base.sh Unescape Escape View File

15 .gitlab-ci/container/debian/x86_build.sh Unescape Escape View File

2 .gitlab-ci/container/debian/x86_test-base.sh Unescape Escape View File

39 .gitlab-ci/container/debian/x86_test-gl.sh Unescape Escape View File

6 .gitlab-ci/container/debian/x86_test-vk.sh Unescape Escape View File

14 .gitlab-ci/container/fedora/x86_build.sh Unescape Escape View File

414 .gitlab-ci/container/gitlab-ci.yml Normal file Unescape Escape View File

52 .gitlab-ci/container/lava_build.sh Unescape Escape View File

12 .gitlab-ci/container/x86_64.config Unescape Escape View File

1 .gitlab-ci/cross-xfail-s390x Unescape Escape View File

39 .gitlab-ci/crosvm-init.sh Unescape Escape View File

138 .gitlab-ci/crosvm-runner.sh Unescape Escape View File

222 .gitlab-ci/deqp-runner.sh Unescape Escape View File

8 .gitlab-ci/download-git-cache.sh Unescape Escape View File

70 .gitlab-ci/gtest-runner.sh Executable file Unescape Escape View File

21 .gitlab-ci/image-tags.yml Normal file Unescape Escape View File

11 .gitlab-ci/lava/lava-gitlab-ci.yml Unescape Escape View File

34 .gitlab-ci/lava/lava-pytest.sh Executable file Unescape Escape View File

20 .gitlab-ci/lava/lava-submit.sh Unescape Escape View File

136 .gitlab-ci/lava/lava_job_submitter.py Unescape Escape View File

1 .gitlab-ci/meson/build.sh Unescape Escape View File

6 .gitlab-ci/piglit/piglit-all-skips.txt Unescape Escape View File

89 .gitlab-ci/piglit/piglit-runner.sh Unescape Escape View File

102 .gitlab-ci/piglit/run.sh → .gitlab-ci/piglit/piglit-traces.sh Unescape Escape View File

8 .gitlab-ci/prepare-artifacts.sh Unescape Escape View File

153 .gitlab-ci/skqp-runner.sh Executable file Unescape Escape View File

119 .gitlab-ci/test-source-dep.yml Unescape Escape View File

315 .gitlab-ci/test/gitlab-ci.yml Normal file Unescape Escape View File

0 src/amd/ci/deqp-radv-bonaire-aco-fails.txt → .gitlab-ci/tests/__init__.py Unescape Escape View File

250 .gitlab-ci/tests/test_lava_job_submitter.py Normal file Unescape Escape View File

63 .gitlab-ci/valve/b2c.yml.jinja2.jinja2 Normal file Unescape Escape View File

101 .gitlab-ci/valve/generate_b2c.py Executable file Unescape Escape View File

4 .gitlab-ci/windows/Dockerfile → .gitlab-ci/windows/Dockerfile_build Unescape Escape View File

7 .gitlab-ci/windows/Dockerfile_test Normal file Unescape Escape View File

31 .gitlab-ci/windows/deqp_runner_run.ps1 Normal file Unescape Escape View File

6230 Commits

mesa-21.3. ... 22.1-branc

6

.gitattributes vendored Normal file

View File

1116

.gitlab-ci.yml

View File

11

.gitlab-ci/deqp-all-skips.txt → .gitlab-ci/all-skips.txt

View File

2

.gitlab-ci/bare-metal/arm64_a630_egl.sh

View File

17

.gitlab-ci/bare-metal/cisco-2960-poe-off.sh Executable file

View File

21

.gitlab-ci/bare-metal/cisco-2960-poe-on.sh Executable file

View File

14

.gitlab-ci/bare-metal/cros_servo_run.py

View File

22

.gitlab-ci/bare-metal/fastboot_run.py

View File

34

.gitlab-ci/bare-metal/poe-powered.sh

View File

4

.gitlab-ci/bare-metal/poe_run.py

View File

8

.gitlab-ci/bare-metal/rootfs-setup.sh

View File

27

.gitlab-ci/bare-metal/serial_buffer.py

View File

525

.gitlab-ci/build/gitlab-ci.yml Normal file

View File

47

.gitlab-ci/common/generate-env.sh

View File

1

.gitlab-ci/common/init-stage1.sh

View File

54

.gitlab-ci/common/init-stage2.sh

View File

567

.gitlab-ci/common/intel-gpu-freq.sh Executable file

View File

31

.gitlab-ci/container/arm64.config

View File

5

.gitlab-ci/container/baremetal_build.sh

View File

56

.gitlab-ci/container/build-crosvm.sh

View File

43

.gitlab-ci/container/build-crosvm_no-syslog.patch Normal file

View File

27

.gitlab-ci/container/build-deqp-runner.sh

View File

10

.gitlab-ci/container/build-deqp.sh

View File

2

.gitlab-ci/container/build-fossilize.sh

View File

2

.gitlab-ci/container/build-kernel.sh

View File

2

.gitlab-ci/container/build-libdrm.sh

View File

2

.gitlab-ci/container/build-piglit.sh

View File

97

.gitlab-ci/container/build-skqp.sh Executable file

View File

13

.gitlab-ci/container/build-skqp_BUILD.gn.patch Normal file

View File

47

.gitlab-ci/container/build-skqp_base.gn Normal file

View File

68

.gitlab-ci/container/build-skqp_fetch_gn.patch Normal file

View File

142

.gitlab-ci/container/build-skqp_git-sync-deps.patch Normal file

View File

13

.gitlab-ci/container/build-skqp_is_clang.py.patch Normal file

View File

17

.gitlab-ci/container/build-va-tools.sh Normal file

View File

20

.gitlab-ci/container/build-virglrenderer.sh

View File

4

.gitlab-ci/container/build-vkd3d-proton.sh

View File

22

.gitlab-ci/container/build-wayland.sh Normal file

View File

2

.gitlab-ci/container/container_pre_build.sh

View File

2

.gitlab-ci/container/create-android-cross-file.sh

View File

34

.gitlab-ci/container/create-rootfs.sh

View File

50

.gitlab-ci/container/debian/android_build.sh

View File

3

.gitlab-ci/container/debian/arm_build.sh

View File

6

.gitlab-ci/container/debian/arm_test.sh

View File

3

.gitlab-ci/container/debian/x86_build-base.sh

View File

15

.gitlab-ci/container/debian/x86_build.sh

View File

2

.gitlab-ci/container/debian/x86_test-base.sh

View File

39

.gitlab-ci/container/debian/x86_test-gl.sh

View File

6

.gitlab-ci/container/debian/x86_test-vk.sh

View File

14

.gitlab-ci/container/fedora/x86_build.sh

View File

414

.gitlab-ci/container/gitlab-ci.yml Normal file

View File

52

.gitlab-ci/container/lava_build.sh

View File

12

.gitlab-ci/container/x86_64.config

View File

1

.gitlab-ci/cross-xfail-s390x

View File

39

.gitlab-ci/crosvm-init.sh

View File

138

.gitlab-ci/crosvm-runner.sh

View File

222

.gitlab-ci/deqp-runner.sh

View File

8

.gitlab-ci/download-git-cache.sh

View File

70

.gitlab-ci/gtest-runner.sh Executable file

View File

21

.gitlab-ci/image-tags.yml Normal file

View File

11

.gitlab-ci/lava/lava-gitlab-ci.yml

View File

34

.gitlab-ci/lava/lava-pytest.sh Executable file

View File

20

.gitlab-ci/lava/lava-submit.sh

View File

136

.gitlab-ci/lava/lava_job_submitter.py

View File

1

.gitlab-ci/meson/build.sh

View File

6

.gitlab-ci/piglit/piglit-all-skips.txt

View File

89

.gitlab-ci/piglit/piglit-runner.sh

View File

102

.gitlab-ci/piglit/run.sh → .gitlab-ci/piglit/piglit-traces.sh

View File

8

.gitlab-ci/prepare-artifacts.sh

View File

153

.gitlab-ci/skqp-runner.sh Executable file

View File

119

.gitlab-ci/test-source-dep.yml

View File

315

.gitlab-ci/test/gitlab-ci.yml Normal file

View File

0

src/amd/ci/deqp-radv-bonaire-aco-fails.txt → .gitlab-ci/tests/init.py

View File

250

.gitlab-ci/tests/test_lava_job_submitter.py Normal file

View File

63

.gitlab-ci/valve/b2c.yml.jinja2.jinja2 Normal file

View File

101

.gitlab-ci/valve/generate_b2c.py Executable file

View File

4

.gitlab-ci/windows/Dockerfile → .gitlab-ci/windows/Dockerfile_build

View File

7

.gitlab-ci/windows/Dockerfile_test Normal file

View File

31

.gitlab-ci/windows/deqp_runner_run.ps1 Normal file

View File

49

.gitlab-ci/windows/mesa_build.ps1

View File