fran/mesa - mesa - GNLUG git store

fran/mesa

Author	SHA1	Message	Date
Bas Nieuwenhuizen	f46ba9ee35	ac/nir: Fix nir_texop_lod on GFX for 1D arrays. Fixes: `1bcb953e16` 'radv: handle GFX9 1D textures' Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `2c5b43c87f`)	2017-10-27 18:07:37 +03:00
Stefan Schake	2d51d41865	broadcom/vc4: Fix aliasing issue This was causing Android clang version 3.8.256229 to miscompile, presumably due to strict aliasing. Fixes: `14dc281c13` ("vc4: Enforce one-uniform-per-instruction after optimization.") (cherry picked from commit `e5fea0d621`)	2017-10-27 18:07:37 +03:00
Kenneth Graunke	cbc081b871	i965: Revert absolute mode for constant buffer pointers. The kernel doesn't initialize the value of the INSTPM or CS_DEBUG_MODE2 registers at context initialization time. Instead, they're inherited from whatever happened to be running on the GPU prior to first run of a new context. So, when we started setting these, other contexts in the system started inheriting our values. Since this controls whether 3DSTATE_CONSTANT_* takes a pointer or an offset, getting the wrong setting is fatal for almost any process which isn't expecting this. Unfortunately, VA-API and Beignet don't initialize this (nor does older Mesa), so they will die horribly if we start doing this. UXA and SNA don't use any push constants, so they are unaffected. Until we have some kind of solution to this problem, I'm going to revert this patch and abandon using the feature for now. It will lead to fewer pushed UBO ranges on Broadwell+, which may lead to lower performance, though I don't have any data on the impact. Cc: "17.3 17.2" <mesa-stable@lists.freedesktop.org> Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102774 (cherry picked from commit `013d331220`) [Andres Gomez: resolve trivial conflicts] Signed-off-by: Andres Gomez <agomez@igalia.com> Conflicts: src/mesa/drivers/dri/i965/brw_state_upload.c src/mesa/drivers/dri/i965/intel_screen.c	2017-10-27 18:07:24 +03:00
Michel Dänzer	546b4d455a	st/mesa: Initialize textures array in st_framebuffer_validate And just reference pipe_resources to it in the validate callbacks. Avoids pipe_resource leaks when st_framebuffer_validate ends up calling the validate callback multiple times, e.g. when a window is resized. v2: * Use generic stable tag instead of Fixes: tag, since the problem could already happen before the commit referenced in v1 (Thomas Hellstrom) * Use memset to initialize the array on the stack instead of allocating the array with os_calloc. Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Thomas Hellstrom <thellstrom@vmware.com> (cherry picked from commit `7561da367b`) Squashed with: st/osmesa: include u_inlines.h for pipe_resource_reference Fixes build failure due to unresolved symbol. Fixes: `7561da367b` "st/mesa: Initialize textures array in st_framebuffer_validate" Trivial. (cherry picked from commit `8c9e7c9638`)	2017-10-27 18:04:59 +03:00
Henri Verbeet	9cbf8c910e	vulkan/wsi: Free the event in x11_manage_fifo_queues(). Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Henri Verbeet <hverbeet@gmail.com> Fixes: `e73d136a02` ("vulkan/wsi/x11: Implement FIFO mode.") Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com (cherry picked from commit `3de87f7cd7`)	2017-10-27 18:04:59 +03:00
Dave Airlie	1eb4cbc934	radv/image: bump all the offset to uint64_t. So one of the CTS tests tries to allocate a 16384x1 2048 array texture. This overflows a bunch of calculations when we want it tiled as the heights goes to 128. addrlib returns us the correct size (16GB or so), but we mangle it in the htile calcs due to the 32-bit offset fields, then userspace gives us the reduced number and we try to allocate it on a heap and things blow up. We really need to give the app back the correct size for the image so we can blow up properly in memory allocation later. This should fix hangs in dEQP-VK.pipeline.render_to_image.core.1d_array.huge.width_layers.r8g8b8a8_unorm_d32_sfloat_s8_uint since Fixes: `ad3d98da9f` (radv: enable tc compatible htile for d32s8 also.) Now there's an open question if we should be enabling tc-compat htile at all for shallow textures like the above. This might cause some other wierd side effects in CTS even without the tc compat so: Cc: "17.2" <mesa-stable@lists.freedesktop.org> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Signed-off-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `35c66f3e40`) [Andres Gomez: resolve trivial conflicts] Signed-off-by: Andres Gomez <agomez@igalia.com> Conflicts: src/amd/vulkan/radv_private.h	2017-10-27 18:04:59 +03:00
Marek Olšák	fba44d91d0	Revert "mesa: fix texture updates for ATI_fragment_shader" This reverts commit `9d54025cd1`. It breaks KOTOR. Cc: 17.1 17.2 <mesa-stable@lists.freedesktop.org> (cherry picked from commit `5d071bf04b`)	2017-10-27 18:04:59 +03:00
Samuel Pitoiset	9cba4d491c	radv: add the draw count buffer to the list of buffers My guess is that the GPU is going to report VM faults if vkCmdDrawIndirectCountAMD() (and friends) are used. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> (cherry picked from commit `3e5f27faf3`)	2017-10-27 18:04:59 +03:00
Emil Velikov	facc851818	docs: add sha256 checksums for 17.2.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-19 13:28:13 +01:00
Emil Velikov	28dc4b64f2	docs: add release notes for 17.2.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com> mesa-17.2.3	2017-10-19 13:10:20 +01:00
Emil Velikov	ea38f4c33a	Update version to 17.2.3 Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-19 13:02:58 +01:00
Emil Velikov	23c08dabc3	eglmesaext: add forward declaration for struct wl_buffers The user does not need to know the specifics of the struct, as only a pointer to it is used. Just forward declare the struct making the header self-contained. v2: Remove deprecation warning text/bugzilla - patch does no help there. Cc: Greg V <greg@unrelenting.technology> Fixes: `5cddb1ce3c` ("wayland: Add an extension to create wl_buffers from EGLImages") Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> (v1) (cherry picked from commit `66ebdfbd44`)	2017-10-17 16:59:31 +01:00
Emil Velikov	dc9bd1dade	wayland-drm: use a copy of the wayland_drm_callbacks struct The callbacks may be called even when they are no longer valid. Say, the user is dlclose(ing) libEGL while the buffers are being destroyed. Cc: mesa-stable@lists.freedesktop.org Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Tested-by: Derek Foreman <derekf@osg.samsung.com> (cherry picked from commit `0cfd6f6cfc`)	2017-10-17 16:59:31 +01:00
Jason Ekstrand	d001ff1267	nir: Get rid of the variable on vote intrinsics This looks like a copy+paste error. They don't actually write into that variable as would be implied by putting the return there. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `3442c9fc3e`)	2017-10-17 16:59:31 +01:00
Jason Ekstrand	88a16c895b	nir/opcodes: Fix constant-folding of ufind_msb We didn't fold correctly in the case of 0x1 because we never let the loop counter hit 0. Switching it to bit >= 0 solves this problem. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Reviewed-by: Connor Abbott <cwabbott0@gmail.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `a0947921eb`)	2017-10-17 16:59:31 +01:00
Jason Ekstrand	b640bf38ca	glsl/blob: Return false from grow_to_fit if we've ever failed Otherwise we could have a failure followed by a smaller write that succeeds and get a corrupted blob. If we ever OOM, we should stop. v2 (Jason Ekstrand): - Initialize the new boolean member in create_blob Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `e03717efbd`)	2017-10-17 16:59:31 +01:00
Jason Ekstrand	4d1ae3283c	glsl/blob: Return false from ensure_can_read on overrun Otherwise, if you have a large read fail and then try to do a small read, the small read may succeed even though it's at the wrong offset. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Jordan Justen <jordan.l.justen@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `7118851374`)	2017-10-17 16:59:31 +01:00
Eric Engestrom	d56aa9fe43	scons: use python3-compatible print() These changes were generated using python's `2to3` tool. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102852 Reported-by: Alex Granni <liviuprodea@yahoo.com> Signed-off-by: Eric Engestrom <eric.engestrom@imgtec.com> Reviewed-by: Jose Fonseca <jfonseca@vmware.com> (cherry picked from commit `7d48219b3a`)	2017-10-17 16:59:31 +01:00
Bas Nieuwenhuizen	3b657e4ff5	radv: Only set the MTYPE flags on GFX9+. Older kernels fail the va_op with this flag set. If the kernel supports GFX9 usefully, it will also support this flag. Fixes: `e8d57802fe` "radv/gfx9: allocate events from uncached VA space" Reviewed-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `96f80c8d4d`)	2017-10-17 16:59:31 +01:00
Daniel Stone	99d3661bce	egl/wayland: Don't use dmabuf with no modifiers The dmabuf interface requires a valid modifier to be sent. If we don't explicitly get a modifier from the driver, we can't know what to send; it must be inferred from legacy side-channels (or assumed to linear, if none exists). If we have no modifier, then we can only have a single-plane format anyway, so fall back to the old wl_drm buffer import path. Fixes: `a65db0ad1c` ("st/dri: don't expose modifiers in EGL if the driver doesn't implement them") Fixes: `02cc359372` ("egl/wayland: Use linux-dmabuf interface for buffers") Signed-off-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reported-by: Andy Furniss <adf.lists@gmail.com> Cc: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `b65d6dafd6`)	2017-10-17 16:59:31 +01:00
Daniel Stone	0f6e89dfe0	egl/wayland: Check queryImage return for wl_buffer When creating a wl_buffer from a DRIImage, we extract all the DRIImage information via queryImage. Check whether or not it actually succeeds, either bailing out if the query was critical, or providing sensible fallbacks for information which was not available in older DRIImage versions. Fixes: `a65db0ad1c` ("st/dri: don't expose modifiers in EGL if the driver doesn't implement them") Fixes: `02cc359372` ("egl/wayland: Use linux-dmabuf interface for buffers") Signed-off-by: Daniel Stone <daniels@collabora.com> Reviewed-by: Emil Velikov <emil.velikov@collabora.com> Reported-by: Andy Furniss <adf.lists@gmail.com> Cc: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `6273d2f269`)	2017-10-17 16:59:31 +01:00
Emil Velikov	bee97ec32e	swr/rast: do not crash on NULL strings returned by getenv The current convenience function GetEnv feeds the results of getenv directly into std::string(). That is a bad idea, since the variable may be unset, thus we feed NULL into the C++ construct. The latter of which is not allowed and leads to a crash. v2: Better variable name, implicit char* -> std::string conversion (Eric) Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=101832 Fixes: `a25093de71` ("swr/rast: Implement JIT shader caching to disk") Cc: Tim Rowley <timothy.o.rowley@intel.com> Cc: Laurent Carlier <lordheavym@gmail.com> Cc: Bernhard Rosenkraenzer <bero@lindev.ch> [Emil Velikov: make an actual commit from the misc diff] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Reviewed-by: Eric Engestrom <eric.engestrom@imgtec.com> (v1) Reviewed-by: Laurent Carlier <lordheavym@gmail.com> (v1) (cherry picked from commit `21e271024d`)	2017-10-17 16:59:31 +01:00
Nicolai Hähnle	9f4b0a336c	radeonsi: clamp border colors for upgraded depth textures The hardware does this automatically for unorm formats, but we need to do it manually for unorm depth formats that have been upgraded to Z32_FLOAT. Fixes dEQP-GLES31.functional.texture.border_clamp.range_clamp.nearest_unorm_depth and others. Fixes: `d4d9ec55c5` ("radeonsi: implement TC-compatible HTILE") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> (cherry picked from commit `6eb9483912`)	2017-10-17 16:59:31 +01:00
Nicolai Hähnle	74a28d85de	radeonsi: clamp depth comparison value only for fixed point formats The hardware usually does this automatically. However, we upgrade depth to Z32_FLOAT to enable TC-compatible HTILE, which means the hardware no longer clamps the comparison value for us. The only way to tell in the shader whether a clamp is required seems to be to communicate an additional bit in the descriptor table. While VI has some unused bits in the resource descriptor, those bits have unfortunately all been used in gfx9. So we use an unused bit in the sampler state instead. Fixes dEQP-GLES3.functional.texture.shadow.2d.linear.equal_depth_component32f and many other tests in dEQP-GLES3.functional.texture.shadow.* Fixes: `d4d9ec55c5` ("radeonsi: implement TC-compatible HTILE") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> (cherry picked from commit `4c56e07029`) [Emil Velikov: handle lack of dirty_mask in original patch] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: src/gallium/drivers/radeonsi/si_descriptors.c	2017-10-17 16:59:31 +01:00
Nicolai Hähnle	f805a61e04	st/glsl_to_tgsi: fix a use-after-free in merge_two_dsts Found by address sanitizer. The loop here tries to be safe, but in doing so, it ends up doing exactly the wrong thing: the safe foreach is for when the loop variable (inst) could be deleted and nothing else. However, this particular can delete inst's successor, but not inst itself. Fixes: `8c6a0ebaad` ("st/mesa: add st fp64 support (v7.1)") Reviewed-by: Marek Olšák <marek.olsak@amd.com> Tested-by: Dieter Nützel <Dieter@nuetzel-hh.de> (cherry picked from commit `2703fa613b`)	2017-10-17 16:59:31 +01:00
Lionel Landwerlin	6957dfb0d8	anv: bo_cache: allow importing a BO larger than needed It's not a problem if a BO has been allocated larger than we need it to be. Bugzilla: https://bugs.freedesktop.org/show_bug.cgi?id=102940 Fixes: `818b857914` ("anv: Use the BO cache for DeviceMemory allocations") Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Daniel Stone <daniels@collabora.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `c0a4f56fb9`)	2017-10-17 16:59:31 +01:00
Nicolai Hähnle	410f4dbcb1	st/glsl_to_tgsi: fix indirect access to 64-bit integer Make sure we actually allocate two adjacent TGSI temporaries. The current code fails e.g. when an arithmetic operation has two operands with indirect accesses. I will send out a new piglit test (arb_gpu_shader_int64/execution/indirect-array-two-accesses.shader_test) Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `541208cf13`)	2017-10-17 16:59:31 +01:00
Ilia Mirkin	d22e779d6a	nv50,nvc0: fix push hint logic in presence of a start offset Previously buffer offsets were passed in explicitly as an offset, which had to be added to the resource address. Now they are passed in via an increased 'start' parameter. As a result, we were double-adding the start offset in this kind of situation. This condition was triggered by piglit's draw-elements test which has a requisite glMultiDrawElements in combination with a small enough number of vertices to go through the immediate push path. Fixes: `330d0607ed` ("gallium: remove pipe_index_buffer and set_index_buffer") Reported-by: Karol Herbst <karolherbst@gmail.com> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `b20bccbcac`)	2017-10-17 16:59:31 +01:00
Dave Airlie	41ec2af2a8	radv: lower ffma in nir. So it appears the Vulkan SPIR-V fma opcode can be equivalent to a mad operation, and the fma hw opcode on AMD hw is issued like a double opcode so is slower. Also the radeonsi stack does this. This appears to improve performance on a number of games from Feral, and thanks to Feral for noticing the problem. I'm reposting this one as Marek indicated he thinks this is what we should be doing on AMD hw. Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Marek Olšák <marek.olsak@amd.com> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `2c61594d84`) [Emil Velikov: use correct file radv_shader.c -> radv_pipeline.c] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: src/amd/vulkan/radv_shader.c	2017-10-17 16:59:31 +01:00
Alex Smith	0bd7be0142	radv: Add R16G16B16A16_SNORM fast clear support Signed-off-by: Alex Smith <asmith@feralinteractive.com> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `25d76fd658`)	2017-10-17 16:59:30 +01:00
Nicolai Hähnle	4bcadb533f	st/mesa: don't clobber glGetInternalformat* buffer for GL_NUM_SAMPLE_COUNTS Applications might pass in a buffer that is sized too large and rely on the extra space of the buffer not being overwritten. Fixes dEQP-GLES31.functional.state_query.internal_format.partial_query.num_sample_counts Cc: mesa-stable@lists.freedesktop.org Reviewed-by: Marek Olšák <marek.olsak@amd.com> (cherry picked from commit `9a8f13a33b`)	2017-10-17 16:59:30 +01:00
Ilia Mirkin	2eae2a6f0e	nv50/ir: fix 64-bit integer shifts TGSI was adjusted to always pass in 64-bit integers but nouveau was left with the old semantics. Update to the new thing. Fixes: `d10fbe5159` (st/glsl_to_tgsi: fix 64-bit integer bit shifts) Reported-by: Karol Herbst <karolherbst@gmail.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `ce6da2a026`)	2017-10-17 16:59:30 +01:00
Józef Kucia	077f925473	anv: Do not assert() on VK_ATTACHMENT_UNUSED Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Reviewed-by: Nanley Chery <nanley.g.chery@intel.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `91ba331ef4`)	2017-10-17 16:59:30 +01:00
Józef Kucia	2e92d16f9d	spirv: Fix SpvOpAtomicISub Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `e0acb630a5`)	2017-10-17 16:59:30 +01:00
Emil Velikov	83dcf9dc33	cherry-ignore: add "anv/wsi: Allocate enough memory for the entire image" Addresses bug introduced with a feature patch, which is not in branch. Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-17 16:59:28 +01:00
Lionel Landwerlin	406e7e0e17	anv/cmd_buffer: Reset state in cmd_buffer_destroy This ensures that everything gets cleaned up properly. In particular, it fixes a memory leak where we were leaking the push constants structs. Valgrind stats on dEQP-VK.pipeline.push_constant.graphics_pipeline.range_size_128 : Before: HEAP SUMMARY: in use at exit: 2,467,513 bytes in 1,305 blocks total heap usage: 697,853 allocs, 696,530 frees, 138,466,600 bytes allocated LEAK SUMMARY: definitely lost: 1,068 bytes in 11 blocks indirectly lost: 24,669 bytes in 412 blocks possibly lost: 0 bytes in 0 blocks still reachable: 2,441,776 bytes in 882 blocks suppressed: 0 bytes in 0 blocks After: HEAP SUMMARY: in use at exit: 2,467,381 bytes in 1,304 blocks total heap usage: 697,853 allocs, 696,531 frees, 138,466,600 bytes allocated LEAK SUMMARY: definitely lost: 936 bytes in 10 blocks indirectly lost: 24,669 bytes in 412 blocks possibly lost: 0 bytes in 0 blocks still reachable: 2,441,776 bytes in 882 blocks suppressed: 0 bytes in 0 blocks Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "17.2 17.1" <mesa-stable@lists.freedesktop.org> (cherry picked from commit `0763f814d7`)	2017-10-17 16:59:02 +01:00
Lionel Landwerlin	60466859fa	anv/cmd_buffer: fix push descriptors with set > 0 When writing to set > 0, we were just wrongly writing to set 0. This commit fixes this by lazily allocating each set as we write to them. We didn't go for having them directly into the command buffer as this would require an additional ~45Kb per command buffer. v2: Allocate push descriptors from system memory rather than in BO streams. (Lionel) Cc: "17.2 17.1" <mesa-stable@lists.freedesktop.org> Fixes: `9f60ed98e5` ("anv: add VK_KHR_push_descriptor support") Reported-by: Daniel Ribeiro Maciel <daniel.maciel@gmail.com> Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> (cherry picked from commit `d296dea54e`)	2017-10-17 16:59:02 +01:00
Marek Olšák	014b5a7209	glsl_to_tgsi: fix instruction order for bindless textures We emitted instructions loading the bindless handle after the memory instruction. Cc: 17.2 <mesa-stable@lists.freedesktop.org> Reviewed-by: Samuel Pitoiset <samuel.pitoiset@gmail.com> Reviewed-by: Nicolai Hähnle <nicolai.haehnle@amd.com> (cherry picked from commit `985338e2cb`)	2017-10-17 16:59:02 +01:00
Jason Ekstrand	f9c4f22f5a	intel/compiler: Don't propagate cmod into integer multiplies No shader-db change on Sky Lake. Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `7463d50580`)	2017-10-17 16:59:02 +01:00
Jason Ekstrand	4ae1a62b26	intel/compiler: Don't cmod propagate into a saturated operation Shader-db results on Sky Lake: total instructions in shared programs: 12954445 -> 12955125 (0.01%) instructions in affected programs: 141862 -> 142542 (0.48%) helped: 0 HURT: 626 Reviewed-by: Matt Turner <mattst88@gmail.com> Cc: mesa-stable@lists.freedesktop.org (cherry picked from commit `b91ecee04a`)	2017-10-17 16:59:02 +01:00
Ben Crocker	ea3ad52ad3	gallivm/ppc64le: allow environmental control of Altivec code generation In check_os_altivec_support(), allow control of Altivec (first PPC vector instruction set) code generation via a new environmental control, GALLIVM_ALTIVEC, which is expected to take on a value of 1 or 0. The default is to enable Altivec code generation. This environmental control of Altivec code generation is initially available only #ifdef DEBUG. Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Ben Crocker <bcrocker@redhat.com> Acked-by: Roland Scheidegger <sroland@vmware.com> (cherry picked from commit `1359af930e`)	2017-10-17 16:59:02 +01:00
Ben Crocker	3f5d2b768c	gallivm/ppc64le: adjust VSX code generation control. In lp_build_create_jit_compiler_for_module(), advance the minimum version of LLVM for VSX code generation to 4.0; this is the minimum revision at which several known VSX code generation bugs are fixed: https://llvm.org/bugs/show_bug.cgi?id=25503 (fixed in 3.8.1) https://llvm.org/bugs/show_bug.cgi?id=26775 (fixed in 3.8.1) https://llvm.org/bugs/show_bug.cgi?id=33531 (fixed in 4.0) An llc performance bug introduced in LLVM 4.0, https://llvm.org/bugs/show_bug.cgi?id=34647 is still pending as of LLVM 5.0, but only has a pronounced effect on one of the Piglit tests: ext_transform_feedback-max-varyings. All changes tested via Piglit. Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Ben Crocker <bcrocker@redhat.com> Acked-by: Roland Scheidegger <sroland@vmware.com> (cherry picked from commit `e93f056a4e`)	2017-10-17 16:59:02 +01:00
Ben Crocker	070d2dcfac	gallivm: allow additional llc options In init_native_targets, allow the passing of additional options to the LLC compiler via new GALLIVM_LLC_OPTIONS environmental control. This option is available only #ifdef DEBUG, initially. At top, add #include <llvm-c/Support.h> for LLVMParseCommandLineOptions() declaration. v2: Fix compile error with old llvm versions (sroland) Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Ben Crocker <bcrocker@redhat.com> Acked-by: Nicolai Hähnle <nicolai.haehnle@amd.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> (cherry picked from commit `5c75f0c8bb`)	2017-10-17 16:59:02 +01:00
Ben Crocker	1a8ccdc6e9	gallivm: fix typo in debug_printf message In gallivm_compile_module, fix a typo in the debug_printf("Invoke as \"llc ..." message. Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Ben Crocker <bcrocker@redhat.com> Reviewed-by: Roland Scheidegger <sroland@vmware.com> (cherry picked from commit `3a9feb4db8`)	2017-10-17 16:59:02 +01:00
Leo Liu	608bea62ca	st/va: don't re-allocate interlaced buffer with pakced format It caused corruption, when vlVaPutImage putting raw data to the fields v2: add RGB formats since it got uploaded here as well Cc: mesa-stable@lists.freedesktop.org Cc: Andy Furniss <adf.lists@gmail.com> Tested-by: Andy Furniss <adf.lists@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com> (cherry picked from commit `0fa950ecd3`)	2017-10-17 16:59:02 +01:00
Leo Liu	dc47b179ed	st/vdpau: don't re-allocate interlaced buffer with packed YUV format It caused corruption, when vlVdpVideoSurfacePutBitsYCbCr putting YUV to the fields Cc: mesa-stable@lists.freedesktop.org Cc: Andy Furniss <adf.lists@gmail.com> Tested-by: Andy Furniss <adf.lists@gmail.com> Reviewed-by: Christian König <christian.koenig@amd.com> (cherry picked from commit `327480d10f`)	2017-10-17 16:59:02 +01:00
Dave Airlie	1146591d4b	radv: emit fmuladd instead of fma to llvm. For Vulkan SPIR-V the spec states fma() Inherited from OpFMul followed by OpFAdd. Matt says the backend will do the right thing depending on the hardware being compiled for, if you use the fmuladd intrinsic. Using the Mad Max pts test, on high settings at 4K: CHP: 55->60 HGDD: 46->50 LM: 55->60 No change on Stronghold. Thanks to Feral for spending the time to track this down. Reviewed-by: Bas Nieuwenhuizen <bas@basnieuwenhuizen.nl> Cc: "17.2" <mesa-stable@lists.freedesktop.org> Signed-off-by: Dave Airlie <airlied@redhat.com> (cherry picked from commit `4e93d6baae`) [Emil Velikov: s/ac_to_float_type/to_float_type/] Signed-off-by: Emil Velikov <emil.velikov@collabora.com> Conflicts: src/amd/common/ac_nir_to_llvm.c	2017-10-17 16:59:02 +01:00
Emil Velikov	76c6bbca7c	cherry-ignore: add "anv: Remove unreachable cases from isl_format_for_size" The commit causes a number of regressions with dEQP Signed-off-by: Emil Velikov <emil.velikov@collabora.com>	2017-10-17 16:57:45 +01:00
Lionel Landwerlin	3315bb4f08	intel: compiler: vec4: add missing default 0 lod We set a similar default value for LOD in the fs backend for TXS/TXL. Without this we end up generating invalid MOV with a null src. Signed-off-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Cc: "17.2 17.1" <mesa-stable@lists.freedesktop.org> Reviewed-by: Matt Turner <mattst88@gmail.com> (cherry picked from commit `d3acc240d0`)	2017-10-17 16:53:25 +01:00
Józef Kucia	cb72969d8b	anv: Fix vkCmdFillBuffer() The vkCmdFillBuffer() command fills a buffer with an uint32_t value. Reviewed-by: Lionel Landwerlin <lionel.g.landwerlin@intel.com> Reviewed-by: Jason Ekstrand <jason@jlekstrand.net> Cc: "17.1 17.2" <mesa-stable@lists.freedesktop.org> (cherry picked from commit `15fdbf9c39`)	2017-10-11 17:44:36 +01:00

1 2 3 4 5 ...

94787 Commits